Commits · Dovakiins/qwerrwe

Peft lotfq (#1222)

4cb7900
unverified

winglian commited on Jan 28

Mixtral fixes 20240124 (#1192) [skip ci]

54d2ac1
unverified

winglian commited on Jan 24

set fp16 to false if bf16, update bf16: auto in example YAMLs (#1122) [skip ci]

782b6a4
unverified

winglian

Nanobit commited on Jan 22

Add shifted sparse attention (#973) [skip-ci]

1d70f24
unverified

jrc joecummings

winglian commited on Jan 18

added tiny llama examples for lora and qlora (#1027)

c75f916
unverified

Tim Dolan commited on Jan 3

new evals_per_epoch and saves_per_epoch to make things cleaner (#944)

5f79b82
unverified

winglian commited on Dec 12, 2023

Feat(wandb): Refactor to be more flexible (#767)

a1da39c
unverified

Nanobit commited on Dec 4, 2023

various bugfixes (#856)

1470650
unverified

winglian commited on Nov 15, 2023

don't compile deepspeed or bitsandbytes from source (#837)

f544ab2
unverified

winglian commited on Nov 9, 2023

fix eval_steps to be a sane default (#797)

8b79ff0
unverified

winglian commited on Oct 28, 2023

simplify by removing duplicate base_model_config (#772)

2d8def6
unverified

winglian commited on Oct 23, 2023

Implement fused modules (#747)

15d3a65
unverified

casperhansen

winglian commited on Oct 21, 2023

prepared dataset caching, other misc fixes (#665)

e50a64e
unverified

winglian commited on Oct 3, 2023

eval_table isn't quite stable enough to be in default llama configs (#637)

d887ad8
unverified

winglian commited on Sep 26, 2023

default model changed

4fecbfe

mhenrichsen commited on Sep 24, 2023

support to disable exllama for gptq (#604)

faecff9
unverified

winglian commited on Sep 19, 2023

Add training callback to send predictions to WandB table (#521)

5b67ea9
unverified

Glavin001 commited on Sep 13, 2023

recommend padding when using sample packing (#531)

3437149
unverified

winglian commited on Sep 6, 2023

Add support for GPTQ using native transformers/peft (#468)

3355706
unverified

winglian commited on Sep 5, 2023

Add example Llama 2 ReLoRA config (#471)

fe4d6ba
unverified

chargoddard commited on Aug 27, 2023

don't use mask expansion for inference (#392)

1687be6
unverified

winglian commited on Aug 15, 2023

new llama-2 default settings (#370)

fdffef5
unverified

mhenrichsen Mads Henrichsen commited on Aug 14, 2023

Add wandb_entity to wandb options, update example configs, update README (#361)

7019509
unverified

Morgan McGuire Morgan McGuire

winglian commited on Aug 12, 2023

set group_by_length to false in examples

36fefcf

tmm1 commited on Aug 7, 2023

feat/llama-2 examples (#319)

dc71d88
unverified

mhenrichsen Mads Henrichsen commited on Aug 3, 2023

Spaces:

Dovakiins
/

qwerrwe

Build error

Commit History

Peft lotfq (#1222)

4cb7900
unverified

Mixtral fixes 20240124 (#1192) [skip ci]

54d2ac1
unverified

set fp16 to false if bf16, update bf16: auto in example YAMLs (#1122) [skip ci]

782b6a4
unverified

Add shifted sparse attention (#973) [skip-ci]

1d70f24
unverified

added tiny llama examples for lora and qlora (#1027)

c75f916
unverified

new evals_per_epoch and saves_per_epoch to make things cleaner (#944)

5f79b82
unverified

Feat(wandb): Refactor to be more flexible (#767)

a1da39c
unverified

various bugfixes (#856)

1470650
unverified

don't compile deepspeed or bitsandbytes from source (#837)

f544ab2
unverified

fix eval_steps to be a sane default (#797)

8b79ff0
unverified

simplify by removing duplicate base_model_config (#772)

2d8def6
unverified

Implement fused modules (#747)

15d3a65
unverified

prepared dataset caching, other misc fixes (#665)

e50a64e
unverified

eval_table isn't quite stable enough to be in default llama configs (#637)

d887ad8
unverified

default model changed

4fecbfe

support to disable exllama for gptq (#604)

faecff9
unverified

Add training callback to send predictions to WandB table (#521)

5b67ea9
unverified

recommend padding when using sample packing (#531)

3437149
unverified

Add support for GPTQ using native transformers/peft (#468)

3355706
unverified

Add example Llama 2 ReLoRA config (#471)

fe4d6ba
unverified

don't use mask expansion for inference (#392)

1687be6
unverified

new llama-2 default settings (#370)

fdffef5
unverified

Add wandb_entity to wandb options, update example configs, update README (#361)

7019509
unverified

set group_by_length to false in examples

36fefcf

feat/llama-2 examples (#319)

dc71d88
unverified

Commit History

Peft lotfq (#1222) 4cb7900 unverified

Mixtral fixes 20240124 (#1192) [skip ci] 54d2ac1 unverified

set fp16 to false if bf16, update bf16: auto in example YAMLs (#1122) [skip ci] 782b6a4 unverified

Add shifted sparse attention (#973) [skip-ci] 1d70f24 unverified

added tiny llama examples for lora and qlora (#1027) c75f916 unverified

new evals_per_epoch and saves_per_epoch to make things cleaner (#944) 5f79b82 unverified

Feat(wandb): Refactor to be more flexible (#767) a1da39c unverified

various bugfixes (#856) 1470650 unverified

don't compile deepspeed or bitsandbytes from source (#837) f544ab2 unverified

fix eval_steps to be a sane default (#797) 8b79ff0 unverified

simplify by removing duplicate base_model_config (#772) 2d8def6 unverified

Implement fused modules (#747) 15d3a65 unverified

prepared dataset caching, other misc fixes (#665) e50a64e unverified

eval_table isn't quite stable enough to be in default llama configs (#637) d887ad8 unverified

default model changed 4fecbfe

support to disable exllama for gptq (#604) faecff9 unverified

Add training callback to send predictions to WandB table (#521) 5b67ea9 unverified

recommend padding when using sample packing (#531) 3437149 unverified

Add support for GPTQ using native transformers/peft (#468) 3355706 unverified

Add example Llama 2 ReLoRA config (#471) fe4d6ba unverified

don't use mask expansion for inference (#392) 1687be6 unverified

new llama-2 default settings (#370) fdffef5 unverified

Add wandb_entity to wandb options, update example configs, update README (#361) 7019509 unverified

set group_by_length to false in examples 36fefcf

feat/llama-2 examples (#319) dc71d88 unverified

Peft lotfq (#1222)

4cb7900
unverified

Mixtral fixes 20240124 (#1192) [skip ci]

54d2ac1
unverified

set fp16 to false if bf16, update bf16: auto in example YAMLs (#1122) [skip ci]

782b6a4
unverified

Add shifted sparse attention (#973) [skip-ci]

1d70f24
unverified

added tiny llama examples for lora and qlora (#1027)

c75f916
unverified

new evals_per_epoch and saves_per_epoch to make things cleaner (#944)

5f79b82
unverified

Feat(wandb): Refactor to be more flexible (#767)

a1da39c
unverified

various bugfixes (#856)

1470650
unverified

don't compile deepspeed or bitsandbytes from source (#837)

f544ab2
unverified

fix eval_steps to be a sane default (#797)

8b79ff0
unverified

simplify by removing duplicate base_model_config (#772)

2d8def6
unverified

Implement fused modules (#747)

15d3a65
unverified

prepared dataset caching, other misc fixes (#665)

e50a64e
unverified

eval_table isn't quite stable enough to be in default llama configs (#637)

d887ad8
unverified

default model changed

4fecbfe

support to disable exllama for gptq (#604)

faecff9
unverified

Add training callback to send predictions to WandB table (#521)

5b67ea9
unverified

recommend padding when using sample packing (#531)

3437149
unverified

Add support for GPTQ using native transformers/peft (#468)

3355706
unverified

Add example Llama 2 ReLoRA config (#471)

fe4d6ba
unverified

don't use mask expansion for inference (#392)

1687be6
unverified

new llama-2 default settings (#370)

fdffef5
unverified

Add wandb_entity to wandb options, update example configs, update README (#361)

7019509
unverified

set group_by_length to false in examples

36fefcf

feat/llama-2 examples (#319)

dc71d88
unverified