Commit History

multipack for gemma (#1313)
2752d5f
unverified

winglian commited on

Adding Google's gemma Model (#1312)
9e300ac
unverified

aaditya commited on

fix(readme): update inference md link (#1311) [skip ci]
3d2cd80
unverified

Nanobit commited on

Add instructions for playing with qlora model to colab example (#1290)
6ab69ec
unverified

Jared Palmer Nanobit JohanWork commited on

Allow load_best_model_at_end to be configured for early stopping on custom evaluation datasets (#1291)
3c00f40
unverified

David Meikle commited on

fix(examples): remove is_*_derived as it's parsed automatically (#1297)
a7a9a14
unverified

Nanobit commited on

Validation always happens on first step (#1300)
e2786cc
unverified

LeonardoEmili commited on

Add seq2seq eval benchmark callback (#1274)
5a5d474
unverified

LeonardoEmili commited on

Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273)
8430db2
unverified

jinwonkim93 commited on

allow the optimizer prune ratio for ReLoRA to be configurable (#1287)
4b997c3
unverified

winglian commited on

Add MPS support (#1264)
fac2d98
unverified

Maxime winglian commited on

don't use load and push together (#1284)
ea00dd0
unverified

winglian commited on

Update README.md (#1281)
b2a4cb4
unverified

hamel commited on

run the docker image builds and push on gh action gpu runners (#1218)
aaf54dc
unverified

winglian commited on

add support for https remote yamls (#1277)
9bca7db
unverified

hamel commited on

allow remote data paths (#1278)
91cf4ee
unverified

hamel commited on

copy edits (#1276)
1daecd1
unverified

winglian commited on

Add link to axolotl cloud image on latitude (#1275)
4a654b3
unverified

winglian commited on

simplify haldning for newer multipack patches so they can be added in a single place (#1270)
5698943
unverified

winglian commited on

contributor avatars (#1269)
411293b
unverified

winglian commited on

Fix bug preventing model_kwargs being injected (#1262)
73f1bda
unverified

Zac Brannelly commited on

lock pytorch (#1247) [skip ci]
1c7ed26
unverified

JohanWork commited on

Add more save strategies for DPO training. (#1255)
13eea21
unverified

Philip May commited on

Fix typo `bloat16` -> `bfloat16` (#1257)
1072f28
unverified

chiragjn commited on

Pretrain transforms (#1261)
c7cf381
unverified

winglian commited on

relora: magnitude pruning of the optimizer (#1245)
8c2e05a
unverified

winglian commited on

fix(model): apply gate fp32 only for mixtral (#1241)
2d65f47
unverified

Nanobit winglian commited on

add contact info for dedicated support for axolotl [skip ci] (#1243)
dfd1885
unverified

winglian commited on

support for true batches with multipack (#1230)
00568c1
unverified

winglian commited on

Peft deepspeed resume (#1227)
c67fb71
unverified

winglian commited on

Support for additional_special_tokens (#1221) [skip ci]
25e037f
unverified

DreamGenX winglian commited on

Update rlhf.md (#1237) [skip ci]
52c83d3
unverified

hamel commited on

add a helpful motd for cloud image (#1235) [skip ci]
d113331
unverified

winglian commited on

set torch version to what is installed during axolotl install (#1234)
8f2b591
unverified

winglian commited on

Fix and document test_datasets (#1228)
5787e1a
unverified

DreamGenX winglian commited on

Fix typo (#1231) [skip ci]
8608d80
unverified

xhedit commited on

Peft lotfq (#1222)
4cb7900
unverified

winglian commited on

FEAT: add tagging support to axolotl for DPOTrainer (#1209)
18f8119
unverified

Filippo Broggini winglian commited on

Update FUNDING.yml [skip ci]
afb5dd9
unverified

winglian commited on

Revert "run PR e2e docker CI tests in Modal" (#1220) [skip ci]
8da1633
unverified

winglian commited on

run PR e2e docker CI tests in Modal (#1217) [skip ci]
36d053f
unverified

winglian commited on

ADD: warning if hub_model_id ist set but not any save strategy (#1202)
af29d81
unverified

JohanWork winglian commited on

ensure the tests use the same version of torch as the latest base docker images (#1215) [skip ci]
1b18003
unverified

winglian commited on

Respect sliding_window=None (#1214)
62ca4a2
unverified

DreamGenX commited on

Update qlora.yml - remove `max_packed_sequence_len` (#1210) [skip ci]
5407ddd
unverified

7flash commited on

drop py39 docker images, add py311, upgrade pytorch to 2.1.2 (#1205)
74c72ca
unverified

winglian commited on

more checks and fixes for deepspeed and fsdp (#1208) [skip ci]
e923e62
unverified

winglian commited on

workaround for transformers bug requireing do_sample for saveing pretrained (#1206)
ba944e6
unverified

winglian commited on

make sure to register the base chatml template even if no system message is provided (#1207)
badda37
unverified

winglian commited on

Update deps 202401 (#1204) [skip ci]
a01b998
unverified

winglian commited on