Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Dovakiins
/
qwerrwe
like
0
Build error
App
Files
Files
Community
Fetching metadata from the HF Docker repository...
d1aed4c
qwerrwe
/
scripts
100 contributors
History:
16 commits
winglian
deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches
d1aed4c
almost 2 years ago
alpaca_json_to_jsonl.py
834 Bytes
black formatting
almost 2 years ago
finetune.py
16.6 kB
deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches
almost 2 years ago