qwerrwe / README.md

Commit History

flash attn pip install (#426)
cf66547
unverified

mhenrichsen Ubuntu Mads Henrichsen winglian committed on

Fix(docs): Remove gptq+lora and fix xformer compat list (#423)
3d1f203
unverified

Nanobit committed on

hopefully improve the README (#419)
2495909
unverified

winglian committed on

Merge pull request #413 from mhenrichsen/chore/update-deepseed-config
f806e86
unverified

mhenrichsen committed on

Feat(doc): Add lr_quadratic_warmup to readme (#412)
2b990eb
unverified

Nanobit committed on

update path to align with fsdp example
bd8cab4

mhenrichsen committed on

Fix(config): Update handling of deepspeed config (#404)
c01015f
unverified

Nanobit committed on

Fix(docs): Update flash attn requirements (#409)
72fe3f8
unverified

Nanobit committed on

update docs for tokenizer_legacy (#401)
47961fd
unverified

winglian committed on

add templates, CoC and contributing guide (#126)
31db0ec
unverified

lightningRalf winglian Nanobit committed on

Feat(doc): Add how to save by epochs (#396)
be294fd
unverified

Nanobit committed on

Feat(doc): Add max_steps to readme (#389)
41ecb45
unverified

Nanobit committed on

Feat(config): Add hub_strategy (#386)
73a0b6e
unverified

Nanobit committed on

Feat(doc): Improve sharegpt doc (#378)
729c299
unverified

Nanobit committed on

Attention mask and position id fixes for packing (#285)
2bb0b78
unverified

winglian committed on

Add wandb_entity to wandb options, update example configs, update README (#361)
7019509
unverified

Morgan McGuire winglian committed on

Feat: Add rope scaling (#343)
b521206
unverified

Nanobit committed on

Update README.md on pretraining_dataset (#360)
fae6ed8
unverified

Nanobit committed on

Clarify pre-tokenize before multigpu (#359)
94d03c8
unverified

Nanobit committed on

note pattern when using groups
b4d1d22

tmm1 committed on

update comment for group_by_length
9f99104

tmm1 committed on

python 3.10 and 3.11 both work fine, as does pytorch 2.1.0.dev
58d6659

tmm1 committed on

there is no configs folder
cc7e800

tmm1 committed on

update README for updated docker images (#328)
41a4d15
unverified

winglian committed on

Merge pull request #306 from ethanhs/xgen
dcdec44
unverified

winglian committed on

don't resize embeddings to multiples of 32x by default
1066751

winglian committed on

Add XGen info to README and example config
3881143

ethanhs committed on

Fix(readme): Improve wording for push model
165907f
unverified

Nanobit committed on

fix(readme): remove accelerate config
b64f411
unverified

Nanobit committed on

Merge pull request #279 from NanoCode012/feat/multi-gpu-readme
469c08c
unverified

winglian committed on

Add dataset name to all yaml options in README
3cdd8e4

chargoddard committed on

Feat(readme): improve docs on multi-gpu
cf5ae6b

Nanobit committed on

Fix formatting mistake
46032a1

chargoddard committed on

Add example of dataset with configuration name to README
8bba642

chargoddard committed on

Merge pull request #275 from NanoCode012/feat/safetensors
231031a
unverified

Nanobit committed on

Feat: Add save_safetensors
5491278

Nanobit committed on

Feat(docs): Add model_revision arg
896c1ae
unverified

Nanobit committed on

Fix for linter
41da98b
unverified

Nanobit committed on

Fix local path loading and custom strategy type
9e64f42
unverified

Nanobit committed on

Fix future deprecation push_to_hub_model_id
e79c8e6

Nanobit committed on

open orca support
78a1e1f

winglian committed on

Update README.md
c146880
unverified

Nanobit committed on

optionally define whether to use_fast tokenizer
47d601f

winglian committed on

add docs
c969f0a

winglian committed on

hint to what AMP means
d7635b7

winglian committed on

add float16 docs and tweak typehints
88e17ff

winglian committed on

Merge pull request #92 from OpenAccess-AI-Collective/flash-optimum
16bb627
unverified

winglian committed on

Fix sharegpt type
3513885
unverified

Nanobit committed on

Update README.md to include a community showcase
5ff547d
unverified

PocketDoc committed on