Commit History
Docs: add instructions to 1-click launching on public clouds (#862)
b33c1d5
unverified
zongheng
multipack len should use max, not min (#863)
0c2a630
unverified
winglian
adds llama and mistral dropout support (#858)
db8a8af
unverified
winglian
various bugfixes (#856)
1470650
unverified
winglian
chore(doc): Separate section on runpod (#860)
501b4d1
unverified
Nanobit
feat(doc): add more info on train_on_split (#855)
306fe19
unverified
Nanobit
include the suffix modified string in ascii art (#852)
614cff4
unverified
fpreiss
cleanup the old multipack dataloader (#841)
1a6309c
unverified
winglian
Pin optimum package (#838)
105d0b3
unverified
Bryan Thornbury
don't compile deepspeed or bitsandbytes from source (#837)
f544ab2
unverified
winglian
multipack w batch sampler (#795)
641e6f7
unverified
winglian
use temp_dir kwarg instead
6dc68a6
winglian
missing dunder-init
7de6a56
winglian
chore: lint
c74f045
winglian
make sure to cleanup tmp output_dir for e2e tests
0402d19
winglian
use accelerate logging for zero/main logging only
b2430ce
winglian
cleanup verbosity a bit
4c834bf
winglian
add deepspeed-kernels dependency for deepspeed>=0.12.0 (#827)
8056ecd
unverified
fpreiss
Feat: Added Gradio support (#812)
738a057
unverified
stillerman
update table for rwkv4 support, fix process count for dataset (#822)
cdc71f7
unverified
winglian
fix: pin autogptq (#818)
6459ac7
unverified
Nanobit
fix model parallel (#816)
964d858
unverified
winglian
fix(tokenizer): update log order after update (#806)
10388a8
unverified
Nanobit
feat(doc): add dummyoptim faq fix (#802)
9f7e8a9
unverified
Nanobit
fix(config): Set eos/bos to tokenizer if different (#801)
637ed09
unverified
Nanobit
refactor neft patch to be more re-usable similar to trl's impl (#796)
827ec3d
unverified
winglian
fix eval_steps to be a sane default (#797)
8b79ff0
unverified
winglian
Fix Deepspeed Zero3 Config (#791)
d3193be
unverified
Teknium
Add docker advanced instruction to README (#792)
2e71ff0
unverified
gordicaleksa
GitBook: No commit message
facc49f
unverified
chanvichetvong
Create preprocess CLI (#785)
e50ab07
unverified
casperhansen
Threaded MultipackDistributedDataloader with prefetched samples (#759)
05bd6f1
unverified
casperhansen
chore(readme): Improve documentation on conversation field (#782)
20aa4b5
unverified
Nanobit
chore: refactor truthy check and fix mypy (#780)
11d1d60
unverified
Nanobit
refactor setup trainer so we can add more hooks (#773)
6c81c61
unverified
winglian
disable eval table w sample packing in examples (#778)
9b43e7e
unverified
winglian
simplify by removing duplicate base_model_config (#772)
2d8def6
unverified
winglian
Fix: Warn when fullfinetune without adapter (#770)
44c9d01
unverified
Nanobit
convert exponential notation lr to floats (#771)
ca84cca
unverified
winglian
Hotfix for not saving correctly (#762)
32eeeb5
unverified
casperhansen
Fix: Cannot tokenize with bf16 and on cpu (#766)
afedc47
unverified
Nanobit
Fix: eval table conflict with eval_sample_packing (#769)
9923b72
unverified
Nanobit
remove lora fused packing test (#758)
21cf09b
unverified
winglian
Implement fused modules (#747)
15d3a65
unverified
add to docs (#703)
a21935f
unverified
winglian
chore: bump transformers to v4.34.1 to fix tokenizer issue (#745)
8966a6f
unverified
Nanobit
Fix DeepSpeed Zero 3 Saving (#709)
e4d1585
unverified
add a latest tag for regular axolotl image, cleanup extraneous print statement (#746)
70157cc
unverified
winglian