Commits · Dovakiins/qwerrwe

various bugfixes (#856)

1470650
unverified

winglian commited on Nov 15, 2023

don't compile deepspeed or bitsandbytes from source (#837)

f544ab2
unverified

winglian commited on Nov 9, 2023

fix eval_steps to be a sane default (#797)

8b79ff0
unverified

winglian commited on Oct 28, 2023

simplify by removing duplicate base_model_config (#772)

2d8def6
unverified

winglian commited on Oct 23, 2023

Implement fused modules (#747)

15d3a65
unverified

casperhansen

winglian commited on Oct 21, 2023

prepared dataset caching, other misc fixes (#665)

e50a64e
unverified

winglian commited on Oct 3, 2023

eval_table isn't quite stable enough to be in default llama configs (#637)

d887ad8
unverified

winglian commited on Sep 26, 2023

default model changed

4fecbfe

mhenrichsen commited on Sep 24, 2023

support to disable exllama for gptq (#604)

faecff9
unverified

winglian commited on Sep 19, 2023

Add training callback to send predictions to WandB table (#521)

5b67ea9
unverified

Glavin001 commited on Sep 13, 2023

recommend padding when using sample packing (#531)

3437149
unverified

winglian commited on Sep 6, 2023

Add support for GPTQ using native transformers/peft (#468)

3355706
unverified

winglian commited on Sep 5, 2023

Add example Llama 2 ReLoRA config (#471)

fe4d6ba
unverified

chargoddard commited on Aug 27, 2023

don't use mask expansion for inference (#392)

1687be6
unverified

winglian commited on Aug 15, 2023

new llama-2 default settings (#370)

fdffef5
unverified

mhenrichsen Mads Henrichsen commited on Aug 14, 2023

Add wandb_entity to wandb options, update example configs, update README (#361)

7019509
unverified

Morgan McGuire Morgan McGuire

winglian commited on Aug 12, 2023

set group_by_length to false in examples

36fefcf

tmm1 commited on Aug 7, 2023

feat/llama-2 examples (#319)

dc71d88
unverified

mhenrichsen Mads Henrichsen commited on Aug 3, 2023

Spaces:

Dovakiins
/

qwerrwe

Build error

Commit History

various bugfixes (#856)

1470650
unverified

don't compile deepspeed or bitsandbytes from source (#837)

f544ab2
unverified

fix eval_steps to be a sane default (#797)

8b79ff0
unverified

simplify by removing duplicate base_model_config (#772)

2d8def6
unverified

Implement fused modules (#747)

15d3a65
unverified

prepared dataset caching, other misc fixes (#665)

e50a64e
unverified

eval_table isn't quite stable enough to be in default llama configs (#637)

d887ad8
unverified

default model changed

4fecbfe

support to disable exllama for gptq (#604)

faecff9
unverified

Add training callback to send predictions to WandB table (#521)

5b67ea9
unverified

recommend padding when using sample packing (#531)

3437149
unverified

Add support for GPTQ using native transformers/peft (#468)

3355706
unverified

Add example Llama 2 ReLoRA config (#471)

fe4d6ba
unverified

don't use mask expansion for inference (#392)

1687be6
unverified

new llama-2 default settings (#370)

fdffef5
unverified

Add wandb_entity to wandb options, update example configs, update README (#361)

7019509
unverified

set group_by_length to false in examples

36fefcf

feat/llama-2 examples (#319)

dc71d88
unverified

Commit History

various bugfixes (#856) 1470650 unverified

don't compile deepspeed or bitsandbytes from source (#837) f544ab2 unverified

fix eval_steps to be a sane default (#797) 8b79ff0 unverified

simplify by removing duplicate base_model_config (#772) 2d8def6 unverified

Implement fused modules (#747) 15d3a65 unverified

prepared dataset caching, other misc fixes (#665) e50a64e unverified

eval_table isn't quite stable enough to be in default llama configs (#637) d887ad8 unverified

default model changed 4fecbfe

support to disable exllama for gptq (#604) faecff9 unverified

Add training callback to send predictions to WandB table (#521) 5b67ea9 unverified

recommend padding when using sample packing (#531) 3437149 unverified

Add support for GPTQ using native transformers/peft (#468) 3355706 unverified

Add example Llama 2 ReLoRA config (#471) fe4d6ba unverified

don't use mask expansion for inference (#392) 1687be6 unverified

new llama-2 default settings (#370) fdffef5 unverified

Add wandb_entity to wandb options, update example configs, update README (#361) 7019509 unverified

set group_by_length to false in examples 36fefcf

feat/llama-2 examples (#319) dc71d88 unverified

various bugfixes (#856)

1470650
unverified

don't compile deepspeed or bitsandbytes from source (#837)

f544ab2
unverified

fix eval_steps to be a sane default (#797)

8b79ff0
unverified

simplify by removing duplicate base_model_config (#772)

2d8def6
unverified

Implement fused modules (#747)

15d3a65
unverified

prepared dataset caching, other misc fixes (#665)

e50a64e
unverified

eval_table isn't quite stable enough to be in default llama configs (#637)

d887ad8
unverified

default model changed

4fecbfe

support to disable exllama for gptq (#604)

faecff9
unverified

Add training callback to send predictions to WandB table (#521)

5b67ea9
unverified

recommend padding when using sample packing (#531)

3437149
unverified

Add support for GPTQ using native transformers/peft (#468)

3355706
unverified

Add example Llama 2 ReLoRA config (#471)

fe4d6ba
unverified

don't use mask expansion for inference (#392)

1687be6
unverified

new llama-2 default settings (#370)

fdffef5
unverified

Add wandb_entity to wandb options, update example configs, update README (#361)

7019509
unverified

set group_by_length to false in examples

36fefcf

feat/llama-2 examples (#319)

dc71d88
unverified