Commit History
Print versions (#1496)
4313b1a
unverified
winglian
commited on
Fix the wrong adapter in qwen2-moe-qlora example (#1501) [skip ci]
7f17eff
unverified
MaziyarPanahi
commited on
add field to sft dataset pydantic for completion support (#1497)
ff01c45
unverified
winglian
commited on
ignore issues with calculating # params when printing (#1493)
2fa65b9
unverified
winglian
commited on
Remove `validate_quantized_dora` (#1485)
9430b6e
unverified
xzuyn
commited on
drop empty token from beginning if tokenizer has no bos_token (in the case of qwen) (#1490)
934fc85
unverified
winglian
commited on
fix: reduce sample_packing warning (#1484)
bda48f0
unverified
Nanobit
commited on
feat: validate sample packing requires flash_attention (#1465)
bf4cd67
unverified
Nanobit
commited on
add support for cohere chat template (#1478)
05b0b7e
unverified
winglian
commited on
don't use deepspeed or fsdp when merging loras (#1479)
87ca3f9
unverified
winglian
commited on
refactor utils.data module for line count linter (#1476)
e0fcef4
unverified
winglian
commited on
Feat: update doc (#1475) [skip ci]
c2b64e4
unverified
Nanobit
commited on
fix toc
5760099
hamel
commited on
Pretrain multipack v2 (#1470)
5aa5097
unverified
winglian
commited on
Added pip install ninja to accelerate installation of flash-attn (#1461)
cae608f
unverified
melvinebenezer
commited on
fix pretraining_ on odd datasets (#1463)
586bd8d
unverified
monsoon-nlp
commited on
Reorganize Docs (#1468)
86b7d22
unverified
hamel
commited on
reduce verbosity of the special tokens (#1472)
0b10377
unverified
winglian
commited on
make sure to install causal_conv1d in docker (#1459)
89134f2
unverified
winglian
commited on
qwen2_moe support w multipack (#1455)
6086be8
unverified
winglian
commited on
Nightlies fix v4 (#1458) [skip ci]
4a92a3b
unverified
winglian
commited on
fix yaml parsing for workflow (#1457) [skip ci]
46a73e3
unverified
winglian
commited on
fix how nightly tag is generated (#1456) [skip ci]
da3415b
unverified
winglian
commited on
configure nightly docker builds (#1454) [skip ci]
8cb127a
unverified
winglian
commited on
fix some of the edge cases for Jamba (#1452)
05b398a
unverified
winglian
commited on
Support loading datasets saved via save_to_disk (#1432)
e634118
unverified
fozziethebeat
commited on
Jamba (#1451)
02af082
unverified
winglian
commited on
fix layer_replication arg to peft (#1446)
4155e99
unverified
winglian
commited on
support layer replication for peft and fix rslora integration (#1445)
25afd35
unverified
winglian
commited on
fix for accelerate env var for auto bf16, add new base image and expand torch_cuda_arch_list support (#1413)
da265dd
unverified
winglian
commited on
Fix falcon tokenization step (#1441) [skip ci]
bcdc9b1
unverified
turn sample_packing on for training (#1438) [skip ci]
c19d060
unverified
satpalsr
commited on
make sure to capture non-null defaults from config validation (#1415)
601b77b
unverified
winglian
commited on
fix(dataset): normalize tokenizer config and change hash from tokenizer class to tokenizer path (#1298)
ff939d8
unverified
Nanobit
commited on
docs: update link to docs of advance topic in README.md (#1437)
324d59e
unverified
pphuc25
commited on
chore(config): refactor old mistral config (#1435)
f1ebaa0
unverified
Nanobit
commited on
Fix ORPO multi gpu (#1433)
34ba634
unverified
winglian
commited on
Update docs.yml
4e69aa4
unverified
hamel
commited on
Bootstrap Hosted Axolotl Docs w/Quarto (#1429)
629450c
unverified
hamel
commited on
strip out hacky qlora-fsdp workarounds now that qlora-fsdp fixes are upstreamed (#1428)
2a1589f
unverified
winglian
commited on
HF / FEAT: Optimize HF tags (#1425) [skip ci]
7d55607
unverified
fixes for dpo and orpo template loading (#1424)
7803f09
unverified
winglian
commited on
support galore once upstreamed into transformers (#1409)
dd449c5
unverified
winglian
commited on
Feat: Add sharegpt multirole (#1137)
40a88e8
unverified
Nanobit
commited on
fix(config): passing gradient_checkpoint_kwargs (#1412)
b1e3e1b
unverified
Nanobit
commited on