Commit History
chore(doc): clarify micro_batch_size (#1579) [skip ci]
1aeece6
unverified
Nanobit
commited on
PoSE context length ext (#1567)
5294653
unverified
winglian
commited on
Add ORPO example and e2e test (#1572)
98c25e1
unverified
tokestermw
commited on
make sure everything stays in the same dtype when using dpo + FSDP (#1559)
68601ec
unverified
winglian
commited on
Add support for Gemma chat template (#1530)
60f5ce0
unverified
wrap prepared_ds_path in str() to avoid TypeError in fsspec package (#1548)
7477a53
unverified
ORPO Trainer replacement (#1551)
7d1d22f
unverified
winglian
commited on
fix(yml): update llama-3 config (#1543) [skip ci]
0e8f340
unverified
Nanobit
commited on
fix(packages): lock datasets version (#1545)
59ef254
unverified
Nanobit
commited on
fix broken linting (#1541)
c10563c
unverified
winglian
commited on
Adding Llama-3 qlora (#1536)
37c037c
unverified
aaditya
commited on
llama-3 examples (#1537)
15f7910
unverified
winglian
commited on
feat(doc): Add example for pad_token (#1535)
d28ba2e
unverified
Nanobit
commited on
Create mixtral_22.yml (#1514) [skip ci]
0eadfc8
unverified
Atlas
commited on
Update Readme to include support for Mixtral8X22B (#1518) [skip ci]
bcaa923
unverified
Atlas
commited on
Update README.md (#1521) [skip ci]
7d9bafc
unverified
YTING
commited on
add docs around pre-processing (#1529)
e07dcb2
unverified
winglian
commited on
Unsloth gradient checkpointing offload (#1528)
6319da1
unverified
winglian
commited on
DBRX Model Support (#1462)
132eb74
unverified
winglian
commited on
use locale agnostic seperator to make large nums easier to read (#1503)
da9b1a3
unverified
winglian
commited on
WIP: Support table logging for mlflow, too (#1506)
057fa44
unverified
Correctly handle splits for datasets.arrow_dataset.Dataset objects (#1504)
8fa0785
unverified
Print versions (#1496)
4313b1a
unverified
winglian
commited on
Fix the wrong adapter in qwen2-moe-qlora example (#1501) [skip ci]
7f17eff
unverified
MaziyarPanahi
commited on
add field to sft dataset pydantic for completion support (#1497)
ff01c45
unverified
winglian
commited on
ignore issues with calculating # params when printing (#1493)
2fa65b9
unverified
winglian
commited on
Remove `validate_quantized_dora` (#1485)
9430b6e
unverified
xzuyn
commited on
drop empty token from beginning if tokenizer has no bos_token (in the case of qwen) (#1490)
934fc85
unverified
winglian
commited on
fix: reduce sample_packing warning (#1484)
bda48f0
unverified
Nanobit
commited on
feat: validate sample packing requires flash_attention (#1465)
bf4cd67
unverified
Nanobit
commited on
add support for cohere chat template (#1478)
05b0b7e
unverified
winglian
commited on
don't use deepspeed or fsdp when merging loras (#1479)
87ca3f9
unverified
winglian
commited on
refactor utils.data module for line count linter (#1476)
e0fcef4
unverified
winglian
commited on
Feat: update doc (#1475) [skip ci]
c2b64e4
unverified
Nanobit
commited on
fix toc
5760099
hamel
commited on
Pretrain multipack v2 (#1470)
5aa5097
unverified
winglian
commited on
Added pip install ninja to accelerate installation of flash-attn (#1461)
cae608f
unverified
melvinebenezer
commited on
fix pretraining_ on odd datasets (#1463)
586bd8d
unverified
monsoon-nlp
commited on
Reorganize Docs (#1468)
86b7d22
unverified
hamel
commited on
reduce verbosity of the special tokens (#1472)
0b10377
unverified
winglian
commited on
make sure to install causal_conv1d in docker (#1459)
89134f2
unverified
winglian
commited on
qwen2_moe support w multipack (#1455)
6086be8
unverified
winglian
commited on
Nightlies fix v4 (#1458) [skip ci]
4a92a3b
unverified
winglian
commited on
fix yaml parsing for workflow (#1457) [skip ci]
46a73e3
unverified
winglian
commited on
fix how nightly tag is generated (#1456) [skip ci]
da3415b
unverified
winglian
commited on
configure nightly docker builds (#1454) [skip ci]
8cb127a
unverified
winglian
commited on