This repository contains q4_0, q4_1, q5_0 and q5_1 quantizations of the model https://huggingface.co/CobraMamba/mamba-gpt-3b-v4, provided in both GGML (v3) and GGUF (v2) formats.
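Because both container formats are provided, it can be useful to confirm which one a downloaded file actually is before loading it. The sketch below is a minimal illustration, not part of this repository: it parses the fixed-size GGUF header (the 4-byte magic `GGUF`, a little-endian 32-bit version, then the tensor count and metadata key/value count). It assumes the counts are 64-bit from version 2 onward and 32-bit in version 1; the function name `read_gguf_header` is hypothetical.

```python
import struct

GGUF_MAGIC = b"GGUF"


def read_gguf_header(data: bytes) -> dict:
    """Parse the fixed-size GGUF header from the first bytes of a file.

    Hypothetical helper for illustration. Raises ValueError when the
    bytes do not start with the GGUF magic or are too short.
    """
    if len(data) < 8 or data[:4] != GGUF_MAGIC:
        raise ValueError("not a GGUF file (magic bytes do not match)")

    # The version is a little-endian unsigned 32-bit integer.
    (version,) = struct.unpack_from("<I", data, 4)

    # Assumption: version 1 stores 32-bit counts, later versions 64-bit.
    count_fmt = "<II" if version == 1 else "<QQ"
    if len(data) < 8 + struct.calcsize(count_fmt):
        raise ValueError("truncated GGUF header")

    tensor_count, kv_count = struct.unpack_from(count_fmt, data, 8)
    return {
        "version": version,
        "tensor_count": tensor_count,
        "metadata_kv_count": kv_count,
    }
```

To use it on a file, read the first 24 bytes in binary mode and pass them in, for example `read_gguf_header(open(path, "rb").read(24))`, where `path` is whichever GGUF file you downloaded. A GGML file is rejected with `ValueError` because its magic differs.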

Format: GGUF

Model size: 3.43B params

Architecture: llama

Quantization: 4-bit, 5-bit
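The 4-bit and 5-bit labels are nominal: each of these quantization types stores weights in blocks of 32 together with per-block scale data, so the effective bits per weight is somewhat higher. The sketch below estimates it from the block layouts (q4_0: 18 bytes, q4_1: 20 bytes, q5_0: 22 bytes, q5_1: 24 bytes per 32 weights) and derives a rough file size for the 3.43B parameter count. It assumes every parameter is quantized with the same type, which is typically not the case in practice, so treat the result as a ballpark rather than the actual size of the files here. The function names are hypothetical.

```python
# Weights per quantization block for the q4_x / q5_x types.
BLOCK_SIZE = 32

# Bytes occupied by one block of 32 weights, including scale data.
BLOCK_BYTES = {
    "q4_0": 18,  # fp16 scale + 16 bytes of 4-bit values
    "q4_1": 20,  # fp16 scale + fp16 minimum + 16 bytes of 4-bit values
    "q5_0": 22,  # fp16 scale + 4 bytes of high bits + 16 bytes of low bits
    "q5_1": 24,  # fp16 scale + fp16 minimum + 4 bytes high + 16 bytes low
}


def bits_per_weight(quant: str) -> float:
    """Effective bits per weight, including per-block overhead."""
    return BLOCK_BYTES[quant] * 8 / BLOCK_SIZE


def estimate_size_gb(n_params: float, quant: str) -> float:
    """Rough size in GB (10^9 bytes), assuming all weights use `quant`."""
    return n_params * bits_per_weight(quant) / 8 / 1e9


if __name__ == "__main__":
    for name in BLOCK_BYTES:
        bpw = bits_per_weight(name)
        size = estimate_size_gb(3.43e9, name)
        print(f"{name}: {bpw:.1f} bits/weight, about {size:.2f} GB")
```

The estimate ignores metadata and any tensors kept at higher precision, so real files may differ from it.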
