Commit History
Feat(doc): Update eval_steps doc (#487)
ad8be43
Nanobit
ReLoRA implementation (with quantization) (#322)
bde3c5a
Fix(doc): Clarify config (#466)
55c23c7
Nanobit
feat: add Metharme prompt strategy (#446)
f474650
feat(docs): improve user customized prompts (#443)
04a42b6
Nanobit
feat(doc): add pillow to lambda instructions (#445)
919f4ca
Nanobit
support user defined prompters, pretokenized datasets in config, local parquet, local arrow files (#348)
d2e7f27
unverified
winglian
docs(readme): add `cd axolotl` (#440)
d21318d
philpax
use save_strategy from config if available (#434)
b3f5e00
winglian
flash attn pip install (#426)
cf66547
Fix(docs): Remove gptq+lora and fix xformer compat list (#423)
3d1f203
Nanobit
hopefully improve the README (#419)
2495909
winglian
Merge pull request #413 from mhenrichsen/chore/update-deepseed-config
f806e86
mhenrichsen
Feat(doc): Add lr_quadratic_warmup to readme (#412)
2b990eb
Nanobit
update path to align with fsdp example
bd8cab4
mhenrichsen
Fix(config): Update handling of deepspeed config (#404)
c01015f
Nanobit
Fix(docs): Update flash attn requirements (#409)
72fe3f8
Nanobit
update docs for tokenizer_legacy (#401)
47961fd
winglian
Feat(doc): Add how to save by epochs (#396)
be294fd
Nanobit
Feat(doc): Add max_steps to readme (#389)
41ecb45
Nanobit
Feat(config): Add hub_strategy (#386)
73a0b6e
Nanobit
Feat(doc): Improve sharegpt doc (#378)
729c299
Nanobit
Attention mask and position id fixes for packing (#285)
2bb0b78
winglian
Add wandb_entity to wandb options, update example configs, update README (#361)
7019509
Feat: Add rope scaling (#343)
b521206
Nanobit
Update README.md on pretraining_dataset (#360)
fae6ed8
Nanobit
Clarify pre-tokenize before multigpu (#359)
94d03c8
Nanobit
note pattern when using groups
b4d1d22
tmm1
update comment for group_by_length
9f99104
tmm1
python 3.10 and 3.11 both work fine, as does pytorch 2.1.0.dev
58d6659
tmm1
there is no configs folder
cc7e800
tmm1
update README for updated docker images (#328)
41a4d15
winglian
Merge pull request #306 from ethanhs/xgen
dcdec44
winglian
don't resize embeddings to multiples of 32x by default
1066751
winglian
Add XGen info to README and example config
3881143
ethanhs
Fix(readme): Improve wording for push model
165907f
Nanobit
fix(readme): remove accelerate config
b64f411
Nanobit
Merge pull request #279 from NanoCode012/feat/multi-gpu-readme
469c08c
winglian
Add dataset name to all yaml options in README
3cdd8e4
chargoddard
Feat(readme): improve docs on multi-gpu
cf5ae6b
Nanobit
Fix formatting mistake
46032a1
chargoddard
Add example of dataset with configuration name to README
8bba642
chargoddard
Merge pull request #275 from NanoCode012/feat/safetensors
231031a
Nanobit
Feat: Add save_safetensors
5491278
Nanobit
Feat(docs): Add model_revision arg
896c1ae
Nanobit
Fix for linter
41da98b
Nanobit
Fix local path loading and custom strategy type
9e64f42
Nanobit
Fix future deprecation push_to_hub_model_id
e79c8e6
Nanobit