qwerrwe / README.md

Commit History

skip some flash attn patches unless explicitly enabled (#643)
895f0a0
unverified

winglian commited on

use fastchat conversations template (#578)
e7d3e2d
unverified

winglian commited on

Feat: Add support for upstream FA2 (#626)
19a600a
unverified

Nanobit commited on

Added quotes to the pip install -e command to fix an incompatibility with shells that do glob expansion like zsh (#632)
5e5296a
unverified

Fernando Tarin Morales commited on

Feat(doc): Add eval_sample_packing to doc (#625)
67b9888
unverified

Nanobit commited on

update README w deepspeed info (#605)
c25ba79
unverified

winglian commited on

Feat(data): Allow loading local csv and text (#594)
00dce35
unverified

Nanobit commited on

Feat(doc): Add features to doc (#583)
3a2edc8
unverified

Nanobit commited on

support custom field for completion from yml (#580)
f7a2263
unverified

winglian commited on

update support matrix with btlm and phi (#579)
a5a625f
unverified

winglian commited on

refactor scripts/finetune.py into new cli modules (#550)
861ceca
unverified

winglian Nanobit commited on

let hf trainer handle torch compile (#516)
a4e1bb6
unverified

winglian tmm1 commited on

Add training callback to send predictions to WandB table (#521)
5b67ea9
unverified

Glavin001 commited on

document that packaging needs to be installed before flash-attn (#559)
9845c5e
unverified

winglian commited on

ergonomic update to optimizer config doc (#548)
6d57f2f
unverified

The Objective Dad commited on

update readme to point to direct link to runpod template, cleanup install instrucitons (#532)
34c0a86
unverified

winglian commited on

Adding NCCL Timeout Guide (#536)
5e2d8a4
unverified

The Objective Dad commited on

Fix(doc): Inform Windows users to use WSL/docker (#518)
f51c9c5
unverified

Nanobit commited on

Added advanced DDP args (#515)
396a7a7
unverified

Jan Philipp Harries Jan Philipp Harries commited on

support for datasets with multiple names (#480)
5ac3392
unverified

winglian commited on

Fix(doc): Clarify no amp to full yaml docs (#496)
48c5647
unverified

Nanobit commited on

pad_to_worst_case_seq_len boolean, for testing memory limits (#498)
8e197f6
unverified

Birch-san tmm1 commited on

Feat(doc): Update eval_steps doc (#487)
ad8be43
unverified

Nanobit commited on

ReLoRA implementation (with quantization) (#322)
bde3c5a
unverified

chargoddard winglian commited on

Fix(doc): Clarify config (#466)
55c23c7
unverified

Nanobit commited on

feat: add Metharme prompt strategy (#446)
f474650
unverified

TearGosling Nanobit commited on

feat(docs): improve user customized prompts (#443)
04a42b6
unverified

Nanobit commited on

feat(doc): add pillow to lambda instructions (#445)
919f4ca
unverified

Nanobit commited on

support user defined prompters, pretokenized datasets in config, local parquet, local arrow files (#348)
d2e7f27
unverified

winglian commited on

docs(readme): add `cd axolotl` (#440)
d21318d
unverified

philpax commited on

use save_strategy from config if available (#434)
b3f5e00
unverified

winglian commited on

flash attn pip install (#426)
cf66547
unverified

mhenrichsen Ubuntu mhenrichsen Mads Henrichsen winglian commited on

Fix(docs): Remove gptq+lora and fix xformer compat list (#423)
3d1f203
unverified

Nanobit commited on

hopefully improve the README (#419)
2495909
unverified

winglian commited on

Merge pull request #413 from mhenrichsen/chore/update-deepseed-config
f806e86
unverified

mhenrichsen commited on

Feat(doc): Add lr_quadratic_warmup to readme (#412)
2b990eb
unverified

Nanobit commited on

update path to align with fsdp example
bd8cab4

mhenrichsen commited on

Fix(config): Update handling of deepspeed config (#404)
c01015f
unverified

Nanobit commited on

Fix(docs): Update flash attn requirements (#409)
72fe3f8
unverified

Nanobit commited on

update docs for tokenizer_legacy (#401)
47961fd
unverified

winglian commited on

add templates, CoC and contributing guide (#126)
31db0ec
unverified

lightningRalf winglian Nanobit commited on

Feat(doc): Add how to save by epochs (#396)
be294fd
unverified

Nanobit commited on

Feat(doc): Add max_steps to readme (#389)
41ecb45
unverified

Nanobit commited on

Feat(config): Add hub_strategy (#386)
73a0b6e
unverified

Nanobit commited on

Feat(doc): Improve sharegpt doc (#378)
729c299
unverified

Nanobit commited on

Attention mask and position id fixes for packing (#285)
2bb0b78
unverified

winglian commited on

Add wandb_entity to wandb options, update example configs, update README (#361)
7019509
unverified

Morgan McGuire Morgan McGuire winglian commited on

Feat: Add rope scaling (#343)
b521206
unverified

Nanobit commited on

Update README.md on pretraining_dataset (#360)
fae6ed8
unverified

Nanobit commited on

Clarify pre-tokenize before multigpu (#359)
94d03c8
unverified

Nanobit commited on