QuantFactory
/

Sparse-Llama-3.1-8B-2of4-GGUF

Text Generation

Inference Endpoints

Model card Files Files and versions Community

Sparse-Llama-3.1-8B-2of4-GGUF

1 contributor

History: 8 commits

aashish1904's picture

Upload Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf with huggingface_hub

53e452b verified about 2 months ago

.gitattributes

1.95 kB

Upload Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf with huggingface_hub about 2 months ago
README.md

6.06 kB

Upload README.md with huggingface_hub about 2 months ago
Sparse-Llama-3.1-8B-2of4.Q4_0.gguf

4.66 GB
LFS

Upload Sparse-Llama-3.1-8B-2of4.Q4_0.gguf with huggingface_hub about 2 months ago
Sparse-Llama-3.1-8B-2of4.Q4_1.gguf

5.13 GB
LFS

Upload Sparse-Llama-3.1-8B-2of4.Q4_1.gguf with huggingface_hub about 2 months ago
Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf

4.92 GB
LFS

Upload Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf with huggingface_hub about 2 months ago
Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf

5.73 GB
LFS

Upload Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf with huggingface_hub about 2 months ago
Sparse-Llama-3.1-8B-2of4.Q6_K.gguf

6.6 GB
LFS

Upload Sparse-Llama-3.1-8B-2of4.Q6_K.gguf with huggingface_hub about 2 months ago
Sparse-Llama-3.1-8B-2of4.Q8_0.gguf

8.54 GB
LFS

Upload Sparse-Llama-3.1-8B-2of4.Q8_0.gguf with huggingface_hub about 2 months ago