Commit History

lora+ support (#1352)
decb66e
unverified

winglian committed on

add lion-pytorch optimizer (#1299) [skip ci]
1648279
unverified

Maxime winglian committed on

make mlflow optional (#1317)
5894f0e
unverified

winglian committed on

Allow load_best_model_at_end to be configured for early stopping on custom evaluation datasets (#1291)
3c00f40
unverified

David Meikle committed on

Add seq2seq eval benchmark callback (#1274)
5a5d474
unverified

LeonardoEmili committed on

Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273)
8430db2
unverified

jinwonkim93 committed on

allow the optimizer prune ratio for ReLoRA to be configurable (#1287)
4b997c3
unverified

winglian committed on

simplify handling for newer multipack patches so they can be added in a single place (#1270)
5698943
unverified

winglian committed on

Add more save strategies for DPO training. (#1255)
13eea21
unverified

Philip May committed on

relora: magnitude pruning of the optimizer (#1245)
8c2e05a
unverified

winglian committed on

support for true batches with multipack (#1230)
00568c1
unverified

winglian committed on

Fix and document test_datasets (#1228)
5787e1a
unverified

DreamGenX winglian committed on

FEAT: add tagging support to axolotl for DPOTrainer (#1209)
18f8119
unverified

Filippo Broggini winglian committed on

precompute dpo logprobs setting and fixes (#1199) [skip ci]
33e1170
unverified

winglian committed on

fix learning rate scheduler's warnings (#1135) [skip ci]
b4ac96a
unverified

ricdomolm winglian committed on

more dpo fixes for dataset loading and docs (#1185) [skip ci]
5bce45f
unverified

winglian committed on

DPO fixes v2 (#1174)
59a31fe
unverified

winglian committed on

Phi2 multipack (#1173)
814aee6
unverified

winglian committed on

Add mlflow callback for pushing config to mlflow artifacts (#1125)
b8e5603
unverified

JohanWork committed on

jupyter lab fixes (#1139) [skip ci]
eaaeefc
unverified

winglian committed on

Qwen2 (#1166)
f5a828a
unverified

winglian committed on

Multipack simplify for Mixtral (#1142)
6910e6a
unverified

winglian committed on

swap the data collator for evals if not using sample packing (#1076)
ead34c5
unverified

winglian committed on

paired kto support (#1069)
d7057cc
unverified

winglian committed on

Add: mlflow for experiment tracking (#1059) [skip ci]
090c24d
unverified

Johan Hansson winglian committed on

Cosine learning rate schedule - minimum learning rate (#1062)
04b978b
unverified

ricdomolm winglian committed on

Efficiently get the length of the tokenized docs (#1063)
81d3845
unverified

ricdomolm winglian committed on

Phi2 rewrite (#1058)
732851f
unverified

winglian committed on

streaming multipack for pretraining dataset (#959)
553c80f
unverified

jinwonkim93 jinwonkim93@github.com winglian committed on

feat: always push checkpoint to hub if set (#1049) [skip ci]
cbdbf9e
unverified

Nanobit committed on

RL/DPO (#935)
f243c21

winglian committed on

use recommended setting for use_reentrant w gradient checkpointing (#1021)
4d2e842
unverified

winglian committed on

remove landmark attn and xpos rope implementations (#1010)
70b46ca
unverified

winglian committed on

FEAT: add tagging support to axolotl (#1004)
db9094d
unverified

Younes Belkada winglian committed on

fix: add lr scheduler kwargs to Trainer (#972)
13e9381
unverified

Nanobit committed on

fix: switch to using the HuggingFace Transformers NEFT implementation (#941)
ef24342
unverified

dg-kalle committed on

support for mamba (#915)
40a6362
unverified

winglian committed on

Feat(wandb): Refactor to be more flexible (#767)
a1da39c
unverified

Nanobit committed on

feature: loss watchdog for terminating training runs that are failing (#899)
58ec8b1
unverified

user735 Karl-Johan Alm committed on

Feat: Add warmup_ratio (#893)
fb12895
unverified

Nanobit committed on

don't train if eval split is too small (#873)
797f3dd
unverified

winglian committed on

various bugfixes (#856)
1470650
unverified

winglian committed on

cleanup the old multipack dataloader (#841)
1a6309c
unverified

winglian committed on

multipack w batch sampler (#795)
641e6f7
unverified

winglian committed on

Threaded MultipackDistributedDataloader with prefetched samples (#759)
05bd6f1
unverified

casperhansen committed on

refactor setup trainer so we can add more hooks (#773)
6c81c61
unverified

winglian committed on