qwerrwe / configs / cerebras_1_3B_alpaca.yml

Commit History

4bit quantized support (wip)

77fca25

winglian committed on Apr 17, 2023

deepspeed doesn't work with flash-attn, and the GPU savings with flash-attn are better than the deepspeed headaches

d1aed4c

winglian committed on Apr 16, 2023

more logging, wandb fixes

05fffb5

winglian committed on Apr 15, 2023

improve prepared dataset loading, fix inference

b164725

winglian committed on Apr 15, 2023

config chooser, update readme instructions, device config, llama flash attention, debug out the labels, fix config key checks, other bugfixes

f2a2029

winglian committed on Apr 14, 2023