skip some flash attn patches unless explicitly enabled (#643) 895f0a0 unverified winglian commited on Sep 27, 2023
Added quotes to the pip install -e command to fix an incompatibility with shells that do glob expansion like zsh (#632) 5e5296a unverified Fernando Tarin Morales commited on Sep 25, 2023
Feat(data): Allow loading local csv and text (#594) 00dce35 unverified Nanobit commited on Sep 17, 2023
support custom field for completion from yml (#580) f7a2263 unverified winglian commited on Sep 15, 2023
refactor scripts/finetune.py into new cli modules (#550) 861ceca unverified winglian Nanobit commited on Sep 15, 2023
Add training callback to send predictions to WandB table (#521) 5b67ea9 unverified Glavin001 commited on Sep 13, 2023
document that packaging needs to be installed before flash-attn (#559) 9845c5e unverified winglian commited on Sep 12, 2023
ergonomic update to optimizer config doc (#548) 6d57f2f unverified The Objective Dad commited on Sep 11, 2023
update readme to point to direct link to runpod template, cleanup install instrucitons (#532) 34c0a86 unverified winglian commited on Sep 8, 2023
Fix(doc): Inform Windows users to use WSL/docker (#518) f51c9c5 unverified Nanobit commited on Sep 1, 2023
Added advanced DDP args (#515) 396a7a7 unverified Jan Philipp Harries Jan Philipp Harries commited on Aug 31, 2023
Fix(doc): Clarify no amp to full yaml docs (#496) 48c5647 unverified Nanobit commited on Aug 29, 2023
pad_to_worst_case_seq_len boolean, for testing memory limits (#498) 8e197f6 unverified Birch-san tmm1 commited on Aug 28, 2023
ReLoRA implementation (with quantization) (#322) bde3c5a unverified chargoddard winglian commited on Aug 24, 2023
feat: add Metharme prompt strategy (#446) f474650 unverified TearGosling Nanobit commited on Aug 22, 2023
feat(docs): improve user customized prompts (#443) 04a42b6 unverified Nanobit commited on Aug 21, 2023
feat(doc): add pillow to lambda instructions (#445) 919f4ca unverified Nanobit commited on Aug 21, 2023
support user defined prompters, pretokenized datasets in config, local parquet, local arrow files (#348) d2e7f27 unverified winglian commited on Aug 20, 2023
use save_strategy from config if available (#434) b3f5e00 unverified winglian commited on Aug 19, 2023
flash attn pip install (#426) cf66547 unverified mhenrichsen Ubuntu mhenrichsen Mads Henrichsen winglian commited on Aug 18, 2023
Fix(docs): Remove gptq+lora and fix xformer compat list (#423) 3d1f203 unverified Nanobit commited on Aug 16, 2023
Merge pull request #413 from mhenrichsen/chore/update-deepseed-config f806e86 unverified mhenrichsen commited on Aug 15, 2023
Feat(doc): Add lr_quadratic_warmup to readme (#412) 2b990eb unverified Nanobit commited on Aug 15, 2023
Fix(config): Update handling of deepspeed config (#404) c01015f unverified Nanobit commited on Aug 15, 2023
add templates, CoC and contributing guide (#126) 31db0ec unverified lightningRalf winglian Nanobit commited on Aug 15, 2023
Attention mask and position id fixes for packing (#285) 2bb0b78 unverified winglian commited on Aug 12, 2023
Add wandb_entity to wandb options, update example configs, update README (#361) 7019509 unverified Morgan McGuire Morgan McGuire winglian commited on Aug 12, 2023