Fix(docs): Remove gptq+lora and fix xformer compat list (#423) 3d1f203 unverified Nanobit commited on Aug 16, 2023
Merge pull request #413 from mhenrichsen/chore/update-deepseed-config f806e86 unverified mhenrichsen commited on Aug 15, 2023
Feat(doc): Add lr_quadratic_warmup to readme (#412) 2b990eb unverified Nanobit commited on Aug 15, 2023
Fix(config): Update handling of deepspeed config (#404) c01015f unverified Nanobit commited on Aug 15, 2023
add templates, CoC and contributing guide (#126) 31db0ec unverified lightningRalf winglian Nanobit commited on Aug 15, 2023
Attention mask and position id fixes for packing (#285) 2bb0b78 unverified winglian commited on Aug 12, 2023
Add wandb_entity to wandb options, update example configs, update README (#361) 7019509 unverified Morgan McGuire Morgan McGuire winglian commited on Aug 12, 2023
Merge pull request #279 from NanoCode012/feat/multi-gpu-readme 469c08c unverified winglian commited on Jul 16, 2023
Add example of dataset with configuration name to README 8bba642 chargoddard commited on Jul 15, 2023
Merge pull request #275 from NanoCode012/feat/safetensors 231031a unverified Nanobit commited on Jul 14, 2023
Merge pull request #92 from OpenAccess-AI-Collective/flash-optimum 16bb627 unverified winglian commited on Jun 14, 2023
Update README.md to include a community showcase 5ff547d unverified PocketDoc commited on Jun 13, 2023