Commits · Dovakiins/qwerrwe

fix eval_steps to be a sane default (#797)

8b79ff0
unverified

winglian commited on Oct 28, 2023

Update to adapt to sharegpt datasets with "assistant" rather than "gp… (#774)

0800885
unverified

MilesQLi

winglian commited on Oct 28, 2023

Fix Deepspeed Zero3 Config (#791)

d3193be
unverified

Teknium commited on Oct 28, 2023

Add docker advanced instruction to README (#792)

2e71ff0
unverified

gordicaleksa commited on Oct 27, 2023

GitBook: No commit message

facc49f
unverified

chanvichetvong commited on Oct 26, 2023

Create preprocess CLI (#785)

e50ab07
unverified

casperhansen commited on Oct 26, 2023

Threaded MultipackDistributedDataloader with prefetched samples (#759)

05bd6f1
unverified

casperhansen commited on Oct 26, 2023

chore(readme): Improve documentation on conversation field (#782)

20aa4b5
unverified

Nanobit commited on Oct 24, 2023

chore: refactor truthy check and fix mypy (#780)

11d1d60
unverified

Nanobit commited on Oct 24, 2023

refactor setup trainer so we can add more hooks (#773)

6c81c61
unverified

winglian commited on Oct 23, 2023

disable eval table w sample packing in examples (#778)

9b43e7e
unverified

winglian commited on Oct 23, 2023

simplify by removing duplicate base_model_config (#772)

2d8def6
unverified

winglian commited on Oct 23, 2023

Fix: Warn when fullfinetune without adapter (#770)

44c9d01
unverified

Nanobit commited on Oct 22, 2023

convert exponential notation lr to floats (#771)

ca84cca
unverified

winglian commited on Oct 22, 2023

Hotfix for not saving correctly (#762)

32eeeb5
unverified

casperhansen commited on Oct 22, 2023

Fix: Cannot tokenize with bf16 and on cpu (#766)

afedc47
unverified

Nanobit commited on Oct 22, 2023

Fix: eval table conflict with eval_sample_packing (#769)

9923b72
unverified

Nanobit commited on Oct 22, 2023

remove lora fused packing test (#758)

21cf09b
unverified

winglian commited on Oct 22, 2023

Implement fused modules (#747)

15d3a65
unverified

casperhansen

winglian commited on Oct 21, 2023

add to docs (#703)

a21935f
unverified

winglian commited on Oct 20, 2023

chore: bump transformers to v4.34.1 to fix tokenizer issue (#745)

8966a6f
unverified

Nanobit commited on Oct 20, 2023

Fix DeepSpeed Zero 3 Saving (#709)

e4d1585
unverified

tokestermw

winglian commited on Oct 19, 2023

add a latest tag for regular axolotl image, cleanup extraneous print statement (#746)

70157cc
unverified

winglian commited on Oct 19, 2023

improve: Enhance code readability of prompt_tokenizers.py (#707)

3a99495
unverified

seungduk commited on Oct 19, 2023

Fix(model): Linear detected and added to target module with rope linear (#738)

440c3ab
unverified

Nanobit commited on Oct 19, 2023

catch ConnectionError when checking dataset from HuggingFace (#743)

992d57f
unverified

Napuh commited on Oct 19, 2023

badge (#739)

91a016f
unverified

mhenrichsen commited on Oct 18, 2023

Mistral: Sliding Window Attention with Flash Attention and Sample Packing (#732)

a045db0
unverified

casperhansen

winglian commited on Oct 16, 2023

Clarify custom format example (#729)

e1b214c
unverified

casperhansen commited on Oct 14, 2023