Spaces: Dovakiins/qwerrwe
qwerrwe/examples/mpt-7b (revision c146880)
100 contributors
History: 4 commits
winglian: swap batch size for gradient accumulation steps to decouple from num gpu (c2a0792, over 1 year ago)
README.md | Safe | 89 Bytes | add support for trust_remote_code for mpt models | over 1 year ago
config.yml | Safe | 1.21 kB | swap batch size for gradient accumulation steps to decouple from num gpu | over 1 year ago