Added advanced DDP args (#515) 396a7a7 unverified Jan Philipp Harries Jan Philipp Harries commited on Aug 31, 2023
Fix(doc): Clarify no amp to full yaml docs (#496) 48c5647 unverified Nanobit commited on Aug 29, 2023
pad_to_worst_case_seq_len boolean, for testing memory limits (#498) 8e197f6 unverified Birch-san tmm1 commited on Aug 28, 2023
ReLoRA implementation (with quantization) (#322) bde3c5a unverified chargoddard winglian commited on Aug 24, 2023
feat: add Metharme prompt strategy (#446) f474650 unverified TearGosling Nanobit commited on Aug 22, 2023
feat(docs): improve user customized prompts (#443) 04a42b6 unverified Nanobit commited on Aug 21, 2023
feat(doc): add pillow to lambda instructions (#445) 919f4ca unverified Nanobit commited on Aug 21, 2023
support user defined prompters, pretokenized datasets in config, local parquet, local arrow files (#348) d2e7f27 unverified winglian commited on Aug 20, 2023
use save_strategy from config if available (#434) b3f5e00 unverified winglian commited on Aug 19, 2023
flash attn pip install (#426) cf66547 unverified mhenrichsen Ubuntu mhenrichsen Mads Henrichsen winglian commited on Aug 18, 2023
Fix(docs): Remove gptq+lora and fix xformer compat list (#423) 3d1f203 unverified Nanobit commited on Aug 16, 2023
Merge pull request #413 from mhenrichsen/chore/update-deepseed-config f806e86 unverified mhenrichsen commited on Aug 15, 2023
Feat(doc): Add lr_quadratic_warmup to readme (#412) 2b990eb unverified Nanobit commited on Aug 15, 2023
Fix(config): Update handling of deepspeed config (#404) c01015f unverified Nanobit commited on Aug 15, 2023
add templates, CoC and contributing guide (#126) 31db0ec unverified lightningRalf winglian Nanobit commited on Aug 15, 2023
Attention mask and position id fixes for packing (#285) 2bb0b78 unverified winglian commited on Aug 12, 2023
Add wandb_entity to wandb options, update example configs, update README (#361) 7019509 unverified Morgan McGuire Morgan McGuire winglian commited on Aug 12, 2023
Merge pull request #279 from NanoCode012/feat/multi-gpu-readme 469c08c unverified winglian commited on Jul 16, 2023
Add example of dataset with configuration name to README 8bba642 chargoddard commited on Jul 15, 2023
Merge pull request #275 from NanoCode012/feat/safetensors 231031a unverified Nanobit commited on Jul 14, 2023