qwerrwe / src /axolotl /core /trainer_builder.py

Commit History

precompute dpo logprobs setting and fixes (#1199) [skip ci]
33e1170
unverified

winglian commited on

fix learning rate scheduler's warnings (#1135) [skip ci]
b4ac96a
unverified

ricdomolm winglian commited on

more dpo fixes for dataset loading and docs (#1185) [skip ci]
5bce45f
unverified

winglian commited on

DPO fixes v2 (#1174)
59a31fe
unverified

winglian commited on

Phi2 multipack (#1173)
814aee6
unverified

winglian commited on

Add mlflow callback for pushing config to mlflow artifacts (#1125)
b8e5603
unverified

JohanWork commited on

jupyter lab fixes (#1139) [skip ci]
eaaeefc
unverified

winglian commited on

Qwen2 (#1166)
f5a828a
unverified

winglian commited on

Multipack simplify for Mixtral (#1142)
6910e6a
unverified

winglian commited on

swap the data collator for evals if not using sample packing (#1076)
ead34c5
unverified

winglian commited on

paired kto support (#1069)
d7057cc
unverified

winglian commited on

Add: mlflow for experiment tracking (#1059) [skip ci]
090c24d
unverified

Johan Hansson winglian commited on

Cosine learning rate schedule - minimum learning rate (#1062)
04b978b
unverified

ricdomolm winglian commited on

Efficiently get the length of the tokenized docs (#1063)
81d3845
unverified

ricdomolm winglian commited on

Phi2 rewrite (#1058)
732851f
unverified

winglian commited on

streaming multipack for pretraining dataset (#959)
553c80f
unverified

jinwonkim93 jinwonkim93@github.com winglian commited on

feat: always push checkpoint to hub if set (#1049) [skip ci]
cbdbf9e
unverified

Nanobit commited on

RL/DPO (#935)
f243c21

winglian commited on

use recommended setting for use_reentrant w gradient checkpointing (#1021)
4d2e842
unverified

winglian commited on

remove landmark attn and xpos rope implementations (#1010)
70b46ca
unverified

winglian commited on

FEAT: add tagging support to axolotl (#1004)
db9094d
unverified

Younes Belkada winglian commited on

fix: add lr scheduler kwargs to Trainer (#972)
13e9381
unverified

Nanobit commited on

fix: switch to using the HuggingFace Transformers NEFT implementation (#941)
ef24342
unverified

dg-kalle commited on

support for mamba (#915)
40a6362
unverified

winglian commited on

Feat(wandb): Refactor to be more flexible (#767)
a1da39c
unverified

Nanobit commited on

feature: loss watchdog for terminating training runs that are failing (#899)
58ec8b1
unverified

user735 Karl-Johan Alm commited on

Feat: Add warmup_ratio (#893)
fb12895
unverified

Nanobit commited on

don't train if eval split is too small (#873)
797f3dd
unverified

winglian commited on

various bugfixes (#856)
1470650
unverified

winglian commited on

cleanup the old multipack dataloader (#841)
1a6309c
unverified

winglian commited on

multipack w batch sampler (#795)
641e6f7
unverified

winglian commited on

Threaded MultipackDistributedDataloader with prefetched samples (#759)
05bd6f1
unverified

casperhansen commited on

refactor setup trainer so we can add more hooks (#773)
6c81c61
unverified

winglian commited on