Commit History
29cf15a improve save callbacks (#1592) (winglian)
b9bb169 FIX: TRL trainer preprocessing step was running in one process (#1583) (Ali Mosavian)
5294653 PoSE context length ext (#1567) (winglian)
68601ec make sure everything stays in the same dtype when using dpo + FSDP (#1559) (winglian)
7d1d22f ORPO Trainer replacement (#1551) (winglian)
132eb74 DBRX Model Support (#1462) (winglian)
057fa44 WIP: Support table logging for mlflow, too (#1506)
934fc85 drop empty token from beginning if tokenizer has no bos_token (in the case of qwen) (#1490) (winglian)
34ba634 Fix ORPO multi gpu (#1433) (winglian)
2a1589f strip out hacky qlora-fsdp workarounds now that qlora-fsdp fixes are upstreamed (#1428) (winglian)
7d55607 HF / FEAT: Optimize HF tags (#1425) [skip ci]
dd449c5 support galore once upstreamed into transformers (#1409) (winglian)
b1e3e1b fix(config): passing gradient_checkpoint_kwargs (#1412) (Nanobit)
2ea70eb ORPO (#1419) (winglian)
9b6ee83 FDSP + QLoRA (#1378) (winglian)
decb66e lora+ support (#1352) (winglian)
1648279 add lion-pytorch optimizer (#1299) [skip ci]
5894f0e make mlflow optional (#1317) (winglian)
3c00f40 Allow load_best_model_at_end to be configured for early stopping on custom evaluation datasets (#1291) (David Meikle)
5a5d474 Add seq2seq eval benchmark callback (#1274) (LeonardoEmili)
8430db2 Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273) (jinwonkim93)
4b997c3 allow the optimizer prune ratio for ReLoRA to be configurable (#1287) (winglian)
5698943 simplify handling for newer multipack patches so they can be added in a single place (#1270) (winglian)
13eea21 Add more save strategies for DPO training. (#1255) (Philip May)
8c2e05a relora: magnitude pruning of the optimizer (#1245) (winglian)
00568c1 support for true batches with multipack (#1230) (winglian)
5787e1a Fix and document test_datasets (#1228)
18f8119 FEAT: add tagging support to axolotl for DPOTrainer (#1209)
33e1170 precompute dpo logprobs setting and fixes (#1199) [skip ci] (winglian)
5bce45f more dpo fixes for dataset loading and docs (#1185) [skip ci] (winglian)
59a31fe DPO fixes v2 (#1174) (winglian)
814aee6 Phi2 multipack (#1173) (winglian)
b8e5603 Add mlflow callback for pushing config to mlflow artifacts (#1125) (JohanWork)
eaaeefc jupyter lab fixes (#1139) [skip ci] (winglian)
f5a828a Qwen2 (#1166) (winglian)
6910e6a Multipack simplify for Mixtral (#1142) (winglian)
ead34c5 swap the data collator for evals if not using sample packing (#1076) (winglian)
d7057cc paired kto support (#1069) (winglian)
090c24d Add: mlflow for experiment tracking (#1059) [skip ci]
732851f Phi2 rewrite (#1058) (winglian)
553c80f streaming multipack for pretraining dataset (#959)
cbdbf9e feat: always push checkpoint to hub if set (#1049) [skip ci] (Nanobit)
f243c21 RL/DPO (#935) (winglian)
4d2e842 use recommended setting for use_reentrant w gradient checkpointing (#1021) (winglian)