Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Dovakiins
/
qwerrwe
like
0
Build error
App
Files
Files
Community
d1aed4c
qwerrwe
100 contributors
History:
18 commits
winglian
deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches
d1aed4c
over 1 year ago
configs
deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches
over 1 year ago
data
WIP for axolotl trainer
over 1 year ago
scripts
deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches
over 1 year ago
src
various bugfixes
over 1 year ago
.editorconfig
Safe
186 Bytes
WIP for axolotl trainer
over 1 year ago
.gitattributes
Safe
49 Bytes
make it work with pythia in the cloud
over 1 year ago
.gitignore
41 Bytes
WIP for axolotl trainer
over 1 year ago
README.md
1.81 kB
config chooser, update readme instructions, device config, llama flash attention, debug out the labels, fix config key checks, other bugfixes
over 1 year ago
ds_config.json
844 Bytes
deepspeed doesn't work with flash-attn, and the gpu savings w flash attn are better than the deepspeed headaches
over 1 year ago
pyproject.toml
Safe
90 Bytes
WIP for axolotl trainer
over 1 year ago
requirements.txt
209 Bytes
helpful info output
over 1 year ago
setup.cfg
560 Bytes
various bugfixes
over 1 year ago