Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Spaces:
Dovakiins
/
qwerrwe
like
0
Build error
App
Files
Files
Community
8a20a7b
qwerrwe
/
examples
/
jamba
/
README.md
winglian
fix some of the edge cases for Jamba (#1452)
05b398a
unverified
8 months ago
preview
code
|
raw
Copy download link
history
blame
Safe
318 Bytes
Jamba
β qlora w/ deepspeed Zero-2 needs at least 2x GPUs and
35GiB VRAM per GPU w minimal context length
56GiB VRAM per GPU (w multipack enabled)
β qlora w/ deepspeed Zero-3 needs at least 2x GPUs and 67GiB VRAM (wtf?)
β qlora single-gpu, ~51GiB VRAM
β multipack
β FSDP
β 8-bit LoRA