Commits · Dovakiins/qwerrwe

fix(config): passing gradient_checkpoint_kwargs (#1412)

b1e3e1b
unverified

Nanobit commited on Mar 19

Update README.md (#1418)

e8c8ea6
unverified

jbl commited on Mar 18

Fix(readme): Improve README QuickStart info (#1408)

f083aed
unverified

Nanobit commited on Mar 16

Feat(readme): Add instructions for Google GPU VM instances (#1410)

868c339
unverified

Nanobit commited on Mar 16

Add QLoRA + FSDP Docs (#1403)

8b12468
unverified

hamel commited on Mar 14

JarvisLabs (#1372)

638c2da
unverified

winglian commited on Mar 7

add docs for `input_output` format (#1367) [skip ci]

ed70a08
unverified

hamel commited on Mar 6

Remove unsupported python version 3.9 from README (#1364) [skip ci]

3765747
unverified

Nicolas Rojas commited on Mar 6

fix for protected model_ namespace w pydantic (#1345)

6b3b271
unverified

winglian commited on Feb 28

Mps mistral lora (#1292) [skip ci]

0f6af36
unverified

Maxime

Nanobit

winglian commited on Feb 27

ADD: push checkpoints to mlflow artifact registry (#1295) [skip ci]

d756534
unverified

JohanWork

Nanobit

winglian commited on Feb 26

chore: update readme to be more clear (#1326) [skip ci]

c6b01e0
unverified

Nanobit commited on Feb 26

Pydantic 2.x cfg (#1239)

cc3cebf
unverified

winglian commited on Feb 26

fix(readme): Clarify doc for tokenizer_config (#1323) [skip ci]

2ed52bd
unverified

Nanobit commited on Feb 24

fix(readme): update inference md link (#1311) [skip ci]

3d2cd80
unverified

Nanobit commited on Feb 21

Add seq2seq eval benchmark callback (#1274)

5a5d474
unverified

LeonardoEmili commited on Feb 13

Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273)

8430db2
unverified

jinwonkim93 commited on Feb 13

allow the optimizer prune ratio for ReLoRA to be configurable (#1287)

4b997c3
unverified

winglian commited on Feb 12

Update README.md (#1281)

b2a4cb4
unverified

hamel commited on Feb 9

add support for https remote yamls (#1277)

9bca7db
unverified

hamel commited on Feb 9

allow remote data paths (#1278)

91cf4ee
unverified

hamel commited on Feb 8

copy edits (#1276)

1daecd1
unverified

winglian commited on Feb 8

Add link to axolotl cloud image on latitude (#1275)

4a654b3
unverified

winglian commited on Feb 8

contributor avatars (#1269)

411293b
unverified

winglian commited on Feb 7

add contact info for dedicated support for axolotl [skip ci] (#1243)

dfd1885
unverified

winglian commited on Feb 1

support for true batches with multipack (#1230)

00568c1
unverified

winglian commited on Feb 1

Fix and document test_datasets (#1228)

5787e1a
unverified

DreamGenX

winglian commited on Jan 31

Peft lotfq (#1222)

4cb7900
unverified

winglian commited on Jan 28

Feat/chatml add system message (#1117)

98b4762
unverified

mhenrichsen Mads Henrichsen

winglian commited on Jan 25

Mixtral fixes 20240124 (#1192) [skip ci]

54d2ac1
unverified

winglian commited on Jan 24

update docs [skip ci] (#1176)

b715cd5
unverified

winglian commited on Jan 23

Fine-Tuning Mistral-7b for Real-World Chatbot Applications Using Axolotl (Lora used) (#1155)

cc25039
unverified

Tilemachos Chatzipapas twenty8th

winglian commited on Jan 23

Update README.md (#1169) [skip ci]

9135b9e
unverified

Ayush Singh commited on Jan 23

set fp16 to false if bf16, update bf16: auto in example YAMLs (#1122) [skip ci]

782b6a4
unverified

winglian

Nanobit commited on Jan 22

Deprecate max packed sequence len (#1141)

2ce5c0d
unverified

winglian commited on Jan 20

feat(dataset): add config to keep processed dataset in memory (#1152)

3db5f2f
unverified

Nanobit commited on Jan 20

Fix link for Minotaur model (#1146) [skip-ci]

08b8ba0
unverified

jrc commited on Jan 18

Add shifted sparse attention (#973) [skip-ci]

1d70f24
unverified

jrc joecummings

winglian commited on Jan 18

Agnostic cloud gpu docker image and Jupyter lab (#1097)

ece0211
unverified

winglian commited on Jan 16

Add `layers_to_transform` for `lora_config` (#1118)

8487b97
unverified

xzuyn commited on Jan 16

fix(readme): clarify custom user prompt [no-ci] (#1124)

9cd27b2
unverified

Nanobit commited on Jan 16

Add link on README to Docker Debugging (#1107)

2dc4310
unverified

hamel

winglian commited on Jan 12

Update README.md (#1103)

b502392
unverified

hamel commited on Jan 12

Add Debugging Guide (#1089)

7512c3a
unverified

hamel

winglian commited on Jan 11

paired kto support (#1069)

d7057cc
unverified

winglian commited on Jan 9

Add: mlflow for experiment tracking (#1059) [skip ci]

090c24d
unverified

Johan Hansson

winglian commited on Jan 9

Cosine learning rate schedule - minimum learning rate (#1062)

04b978b
unverified

ricdomolm

winglian commited on Jan 9

Sponsors (#1065)

1496441
unverified

winglian commited on Jan 8

feature: better device mapping for large models (#918)

bdfefaf
unverified

dg-kalle Karl-Johan Alm

winglian commited on Jan 5

set default for merge (#1044)

63fb3eb
unverified

hamel commited on Jan 5

Commit History

fix(config): passing gradient_checkpoint_kwargs (#1412) b1e3e1b unverified

Update README.md (#1418) e8c8ea6 unverified

Fix(readme): Improve README QuickStart info (#1408) f083aed unverified

Feat(readme): Add instructions for Google GPU VM instances (#1410) 868c339 unverified

Add QLoRA + FSDP Docs (#1403) 8b12468 unverified

JarvisLabs (#1372) 638c2da unverified

add docs for `input_output` format (#1367) [skip ci] ed70a08 unverified

Remove unsupported python version 3.9 from README (#1364) [skip ci] 3765747 unverified

fix for protected model_ namespace w pydantic (#1345) 6b3b271 unverified

Mps mistral lora (#1292) [skip ci] 0f6af36 unverified

ADD: push checkpoints to mlflow artifact registry (#1295) [skip ci] d756534 unverified

chore: update readme to be more clear (#1326) [skip ci] c6b01e0 unverified

Pydantic 2.x cfg (#1239) cc3cebf unverified

fix(readme): Clarify doc for tokenizer_config (#1323) [skip ci] 2ed52bd unverified

fix(readme): update inference md link (#1311) [skip ci] 3d2cd80 unverified

Add seq2seq eval benchmark callback (#1274) 5a5d474 unverified

Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273) 8430db2 unverified

allow the optimizer prune ratio for ReLoRA to be configurable (#1287) 4b997c3 unverified

Update README.md (#1281) b2a4cb4 unverified

add support for https remote yamls (#1277) 9bca7db unverified

allow remote data paths (#1278) 91cf4ee unverified

copy edits (#1276) 1daecd1 unverified

Add link to axolotl cloud image on latitude (#1275) 4a654b3 unverified

contributor avatars (#1269) 411293b unverified

add contact info for dedicated support for axolotl [skip ci] (#1243) dfd1885 unverified

support for true batches with multipack (#1230) 00568c1 unverified

Fix and document test_datasets (#1228) 5787e1a unverified

Peft lotfq (#1222) 4cb7900 unverified

Feat/chatml add system message (#1117) 98b4762 unverified

Mixtral fixes 20240124 (#1192) [skip ci] 54d2ac1 unverified

update docs [skip ci] (#1176) b715cd5 unverified

Fine-Tuning Mistral-7b for Real-World Chatbot Applications Using Axolotl (Lora used) (#1155) cc25039 unverified

Update README.md (#1169) [skip ci] 9135b9e unverified

set fp16 to false if bf16, update bf16: auto in example YAMLs (#1122) [skip ci] 782b6a4 unverified

Deprecate max packed sequence len (#1141) 2ce5c0d unverified

feat(dataset): add config to keep processed dataset in memory (#1152) 3db5f2f unverified

Fix link for Minotaur model (#1146) [skip-ci] 08b8ba0 unverified

Add shifted sparse attention (#973) [skip-ci] 1d70f24 unverified

Agnostic cloud gpu docker image and Jupyter lab (#1097) ece0211 unverified

Add `layers_to_transform` for `lora_config` (#1118) 8487b97 unverified

fix(readme): clarify custom user prompt [no-ci] (#1124) 9cd27b2 unverified

Add link on README to Docker Debugging (#1107) 2dc4310 unverified

Update README.md (#1103) b502392 unverified

Add Debugging Guide (#1089) 7512c3a unverified

paired kto support (#1069) d7057cc unverified

Add: mlflow for experiment tracking (#1059) [skip ci] 090c24d unverified

Cosine learning rate schedule - minimum learning rate (#1062) 04b978b unverified

Sponsors (#1065) 1496441 unverified

feature: better device mapping for large models (#918) bdfefaf unverified

set default for merge (#1044) 63fb3eb unverified

fix(config): passing gradient_checkpoint_kwargs (#1412)

b1e3e1b
unverified

Update README.md (#1418)

e8c8ea6
unverified

Fix(readme): Improve README QuickStart info (#1408)

f083aed
unverified

Feat(readme): Add instructions for Google GPU VM instances (#1410)

868c339
unverified

Add QLoRA + FSDP Docs (#1403)

8b12468
unverified

JarvisLabs (#1372)

638c2da
unverified

add docs for `input_output` format (#1367) [skip ci]

ed70a08
unverified

Remove unsupported python version 3.9 from README (#1364) [skip ci]

3765747
unverified

fix for protected model_ namespace w pydantic (#1345)

6b3b271
unverified

Mps mistral lora (#1292) [skip ci]

0f6af36
unverified

ADD: push checkpoints to mlflow artifact registry (#1295) [skip ci]

d756534
unverified

chore: update readme to be more clear (#1326) [skip ci]

c6b01e0
unverified

Pydantic 2.x cfg (#1239)

cc3cebf
unverified

fix(readme): Clarify doc for tokenizer_config (#1323) [skip ci]

2ed52bd
unverified

fix(readme): update inference md link (#1311) [skip ci]

3d2cd80
unverified

Add seq2seq eval benchmark callback (#1274)

5a5d474
unverified

Scheduler implementation of Continual Pre-Training of Large Language Models: How to (re)warm your model? (#1273)

8430db2
unverified

allow the optimizer prune ratio for ReLoRA to be configurable (#1287)

4b997c3
unverified

Update README.md (#1281)

b2a4cb4
unverified

add support for https remote yamls (#1277)

9bca7db
unverified

allow remote data paths (#1278)

91cf4ee
unverified

copy edits (#1276)

1daecd1
unverified

Add link to axolotl cloud image on latitude (#1275)

4a654b3
unverified

contributor avatars (#1269)

411293b
unverified

add contact info for dedicated support for axolotl [skip ci] (#1243)

dfd1885
unverified

support for true batches with multipack (#1230)

00568c1
unverified

Fix and document test_datasets (#1228)

5787e1a
unverified

Peft lotfq (#1222)

4cb7900
unverified

Feat/chatml add system message (#1117)

98b4762
unverified

Mixtral fixes 20240124 (#1192) [skip ci]

54d2ac1
unverified

update docs [skip ci] (#1176)

b715cd5
unverified

Fine-Tuning Mistral-7b for Real-World Chatbot Applications Using Axolotl (Lora used) (#1155)

cc25039
unverified

Update README.md (#1169) [skip ci]

9135b9e
unverified

set fp16 to false if bf16, update bf16: auto in example YAMLs (#1122) [skip ci]

782b6a4
unverified

Deprecate max packed sequence len (#1141)

2ce5c0d
unverified

feat(dataset): add config to keep processed dataset in memory (#1152)

3db5f2f
unverified

Fix link for Minotaur model (#1146) [skip-ci]

08b8ba0
unverified

Add shifted sparse attention (#973) [skip-ci]

1d70f24
unverified

Agnostic cloud gpu docker image and Jupyter lab (#1097)

ece0211
unverified

Add `layers_to_transform` for `lora_config` (#1118)

8487b97
unverified

fix(readme): clarify custom user prompt [no-ci] (#1124)

9cd27b2
unverified

Add link on README to Docker Debugging (#1107)

2dc4310
unverified

Update README.md (#1103)

b502392
unverified

Add Debugging Guide (#1089)

7512c3a
unverified

paired kto support (#1069)

d7057cc
unverified

Add: mlflow for experiment tracking (#1059) [skip ci]

090c24d
unverified

Cosine learning rate schedule - minimum learning rate (#1062)

04b978b
unverified

Sponsors (#1065)

1496441
unverified

feature: better device mapping for large models (#918)

bdfefaf
unverified

set default for merge (#1044)

63fb3eb
unverified