allow overriding of model_config parameters from the YML (#853) 1bc1186 unverified winglian commited on Nov 16, 2023
add e2e tests for checking functionality of resume from checkpoint (#865) b3a61e8 unverified winglian commited on Nov 16, 2023
lint fix that didn't get caught by linter (#866) 332984d unverified winglian commited on Nov 15, 2023
Update data.py for signature generation (#851) 48630f5 unverified MilesQLi winglian commited on Nov 15, 2023
Docs: add instructions to 1-click launching on public clouds (#862) b33c1d5 unverified zongheng commited on Nov 15, 2023
feat(doc): add more info on train_on_split (#855) 306fe19 unverified Nanobit commited on Nov 15, 2023
include the suffix modified string in ascii art (#852) 614cff4 unverified fpreiss commited on Nov 15, 2023
don't compile deepspeed or bitsandbytes from source (#837) f544ab2 unverified winglian commited on Nov 9, 2023
add deepspeed-kernels dependency for deepspeed>=0.12.0 (#827) 8056ecd unverified fpreiss commited on Nov 5, 2023
update table for rwkv4 support, fix process count for dataset (#822) cdc71f7 unverified winglian commited on Nov 5, 2023
fix(tokenizer): update log order after update (#806) 10388a8 unverified Nanobit commited on Oct 31, 2023
fix(config): Set eos/bos to tokenizer if different (#801) 637ed09 unverified Nanobit commited on Oct 29, 2023
refactor neft patch to be more re-usable similar to trl's impl (#796) 827ec3d unverified winglian commited on Oct 29, 2023
Update to adapt to sharegpt datasets with "assistant" rather than "gp… (#774) 0800885 unverified MilesQLi winglian commited on Oct 28, 2023
Add docker advanced instruction to README (#792) 2e71ff0 unverified gordicaleksa commited on Oct 27, 2023
Threaded MultipackDistributedDataloader with prefetched samples (#759) 05bd6f1 unverified casperhansen commited on Oct 26, 2023
chore(readme): Improve documentation on conversation field (#782) 20aa4b5 unverified Nanobit commited on Oct 24, 2023
refactor setup trainer so we can add more hooks (#773) 6c81c61 unverified winglian commited on Oct 23, 2023
disable eval table w sample packing in examples (#778) 9b43e7e unverified winglian commited on Oct 23, 2023
simplify by removing duplicate base_model_config (#772) 2d8def6 unverified winglian commited on Oct 23, 2023
Fix: Warn when fullfinetune without adapter (#770) 44c9d01 unverified Nanobit commited on Oct 22, 2023
convert exponential notation lr to floats (#771) ca84cca unverified winglian commited on Oct 22, 2023
Fix: eval table conflict with eval_sample_packing (#769) 9923b72 unverified Nanobit commited on Oct 22, 2023