Spaces: Dovakiins/qwerrwe
d1aed4c
qwerrwe/configs
100 contributors
History: 10 commits
Latest commit by winglian (d1aed4c, almost 2 years ago): deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches
File | Size | Last commit message | Last modified
cerebras_1_3B_alpaca.yml | 906 Bytes | deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches | almost 2 years ago
llama_65B_alpaca.yml | 931 Bytes | deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches | almost 2 years ago
llama_7B_alpaca.yml | 929 Bytes | deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches | almost 2 years ago
pythia_1_2B_alpaca.yml | 974 Bytes | deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches | almost 2 years ago