This repository contains q4_0, q4_1, q5_0, and q5_1 quantizations of the model https://huggingface.co/CobraMamba/mamba-gpt-3b-v4, provided in both GGML (v3) and GGUF (v2) formats.

Format: GGUF
Model size: 3.43B params
Architecture: llama
Quantization: 4-bit, 5-bit
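
As a rough guide to choosing among the four quantizations, the sketch below estimates how large each file might be. It assumes the legacy ggml block layouts (32 weights per block, stored in 18, 20, 22, and 24 bytes for q4_0, q4_1, q5_0, and q5_1 respectively) and that all 3.43B parameters use the same type. Neither assumption is stated on this page, and real files usually differ somewhat because some tensors are kept at higher precision and the file carries metadata. The names `BLOCK_BYTES`, `bits_per_weight`, and `estimate_gib` are illustrative only.

```python
# Rough size estimate for the quantizations listed on this page.
# ASSUMPTION: legacy ggml block layouts, 32 weights per block.
BLOCK_WEIGHTS = 32
BLOCK_BYTES = {
    "q4_0": 18,  # fp16 scale + 16 bytes of 4-bit values
    "q4_1": 20,  # fp16 scale + fp16 min + 16 bytes of 4-bit values
    "q5_0": 22,  # fp16 scale + 4 bytes of high bits + 16 bytes of low bits
    "q5_1": 24,  # fp16 scale + fp16 min + 4 bytes high bits + 16 bytes low bits
}

N_PARAMS = 3.43e9  # parameter count shown on the model page


def bits_per_weight(qtype: str) -> float:
    """Average number of bits stored per weight for a quantization type."""
    return BLOCK_BYTES[qtype] * 8 / BLOCK_WEIGHTS


def estimate_gib(qtype: str, n_params: float = N_PARAMS) -> float:
    """Approximate tensor data size in GiB if every weight used `qtype`."""
    return n_params * bits_per_weight(qtype) / 8 / 2**30


if __name__ == "__main__":
    for qtype in BLOCK_BYTES:
        print(f"{qtype}: {bits_per_weight(qtype):.1f} bits/weight, "
              f"about {estimate_gib(qtype):.2f} GiB")
```

Under these assumptions, the 5-bit variants cost roughly one extra bit per weight compared with their 4-bit counterparts, and the `_1` variants add half a bit per weight for the stored per-block minimum.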
