Commit History
address PR feedback
0c6f928
winglian
commited on
add streaming dataset support for pretraining datasets
eea2731
winglian
commited on
linting fix
1db46a9
winglian
commited on
more gpt-neox long ctx fixes
ab5cd28
winglian
commited on
fix bettertransformers save, force it to skip after saving correctly in callback
1a82082
winglian
commited on
more tweaks to do pre-training with bettertransformers
1210dc8
winglian
commited on
experimental expansion of ctx len
488a67d
winglian
commited on
add validation/warning for bettertransformers and torch version
71a43f8
winglian
commited on
use pythia-12b, neox-20b is flaky
3961902
winglian
commited on
add flash attn context for efficient training and attempt setting model to train mode:
8792199
winglian
commited on
add support for opimum bettertransformers
1edc30c
winglian
commited on
fix for local variable 'LlamaForCausalLM' referenced before assignment
14163c1
winglian
commited on
Merge pull request #181 from OpenAccess-AI-Collective/xpos-rope
41e4f6c
unverified
winglian
commited on
Merge branch 'main' into patch-1
79e2a6f
unverified
Angainor Development
commited on
Remove explicit definition of cfg.inference
c250898
unverified
Angainor Development
commited on
Merge pull request #180 from Glavin001/feat/stream-inference
215d775
unverified
winglian
commited on
formatting for linter
f36e227
unverified
winglian
commited on
add option to readme
5878bb1
winglian
commited on
add support to extend context with xpos rope
a03a7d7
winglian
commited on
Add streaming inference & fix stopping at EOS
fec6bcc
Glavin001
commited on
Merge pull request #179 from OpenAccess-AI-Collective/fix-max_seq_len
931e606
unverified
winglian
commited on
fix for max sequence len across different model types
7f09106
winglian
commited on
Merge pull request #178 from PocketDocLabs/main
6b50200
unverified
Nanobit
commited on
Update README.md to reflect current gradient checkpointing support
16f9e28
unverified
PocketDoc
commited on
Merge pull request #176 from NanoCode012/fix/peft-import
b9083a7
unverified
Nanobit
commited on
Fix backward compat for peft
aefb2fc
Nanobit
commited on
Merge pull request #169 from NanoCode012/feat/landmark
b5aa8d8
unverified
Nanobit
commited on
Merge pull request #171 from OpenAccess-AI-Collective/NanoCode012-falcon-lora-matrix
4d6490b
unverified
Nanobit
commited on
Fix falcon support lora
b242b69
unverified
Nanobit
commited on
Merge pull request #170 from OpenAccess-AI-Collective/NanoCode012-lambdalabs-fix
320beb2
unverified
Nanobit
commited on
Feed cfg.inference
bd3b537
unverified
Angainor Development
commited on
WIP: Rely on cfg.inference
813cfa4
unverified
Angainor Development
commited on
Improve lambda labs instruction
2e13cef
unverified
Nanobit
commited on
Fix grad checkpoint and outputs param
2a801b0
Nanobit
commited on
Fix patching via import instead of hijacking
e44c9e0
Nanobit
commited on
Feat: Add landmark attention
55b8542
Nanobit
commited on
Merge pull request #168 from bratao/main
febe902
unverified
winglian
commited on
Disable Wandb
f4df266
Bruno Cabral
commited on
Merge pull request #167 from NanoCode012/fix/redundant-save-eval-steps
281dc3d
unverified
Nanobit
commited on
Refactor out unmodified save_steps and eval_steps
2ef4634
Nanobit
commited on
Merge pull request #166 from NanoCode012/fix/seed
7eae903
unverified
Nanobit
commited on
Merge pull request #132 from utensil/falcon-7b-qlora
c8242de
unverified
Nanobit
commited on
Set to use cfg.seed or 42 for backward compat
2cfe9e9
Nanobit
commited on
Trim trailing whitespace
79a8f52
unverified
utensil
commited on
Merge pull request #164 from NanoCode012/fix/falcon-fsdp-validate
afaa0d2
unverified
Nanobit
commited on
Fix failing test
bfd27ba
Nanobit
commited on
Validate falcon with fsdp
babf0fd
Nanobit
commited on
Merge pull request #163 from NanoCode012/feat/matmul-tf32
81911d1
unverified
Nanobit
commited on