QuantFactory/Sparse-Llama-3.1-8B-2of4-GGUF
Tags: Text Generation, GGUF, vllm, sparsity, Inference Endpoints
arXiv: 2301.00774, 2310.06927
License: llama3.1
1 contributor
History: 17 commits
Latest commit: aashish1904, "Upload Sparse-Llama-3.1-8B-2of4.Q4_0_4_4.gguf with huggingface_hub", 1740907 (verified), 22 days ago
File                                    Size     Storage  Updated      Last commit
.gitattributes                          2.6 kB   -        22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q4_0_4_4.gguf with huggingface_hub
README.md                               6.06 kB  -        22 days ago  Upload README.md with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q2_K.gguf      3.18 GB  LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q2_K.gguf with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q3_K_L.gguf    4.32 GB  LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q3_K_L.gguf with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q3_K_M.gguf    4.02 GB  LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q3_K_M.gguf with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q3_K_S.gguf    3.66 GB  LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q3_K_S.gguf with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q4_0.gguf      4.66 GB  LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q4_0.gguf with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q4_0_4_4.gguf  4.66 GB  LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q4_0_4_4.gguf with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q4_1.gguf      5.13 GB  LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q4_1.gguf with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf    4.92 GB  LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q4_K_S.gguf    4.69 GB  LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q4_K_S.gguf with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q5_0.gguf      5.6 GB   LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q5_0.gguf with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q5_1.gguf      6.07 GB  LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q5_1.gguf with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf    5.73 GB  LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q5_K_S.gguf    5.6 GB   LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q5_K_S.gguf with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q6_K.gguf      6.6 GB   LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q6_K.gguf with huggingface_hub
Sparse-Llama-3.1-8B-2of4.Q8_0.gguf      8.54 GB  LFS      22 days ago  Upload Sparse-Llama-3.1-8B-2of4.Q8_0.gguf with huggingface_hub
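
The listing gives one GGUF file per quantization level, so a common first step is choosing a file that fits a size budget and fetching only that one. The sketch below is a minimal illustration, not part of the repository: the helper names (gguf_filename, largest_quant_under, download) are hypothetical, the sizes are copied from the listing, and the repo ID is assumed to be QuantFactory/Sparse-Llama-3.1-8B-2of4-GGUF as shown in the page header. Note that on-disk size is only a rough proxy for runtime memory use, which also depends on context length and the inference runtime.

```python
# Illustrative sketch: pick a quantized file by on-disk size, then download it.
# Helper names are hypothetical; sizes (GB) are copied from the file listing.

REPO_ID = "QuantFactory/Sparse-Llama-3.1-8B-2of4-GGUF"  # assumed from the page header

QUANT_SIZES_GB = {
    "Q2_K": 3.18,
    "Q3_K_S": 3.66,
    "Q3_K_M": 4.02,
    "Q3_K_L": 4.32,
    "Q4_0": 4.66,
    "Q4_0_4_4": 4.66,
    "Q4_K_S": 4.69,
    "Q4_K_M": 4.92,
    "Q4_1": 5.13,
    "Q5_0": 5.6,
    "Q5_K_S": 5.6,
    "Q5_K_M": 5.73,
    "Q5_1": 6.07,
    "Q6_K": 6.6,
    "Q8_0": 8.54,
}


def gguf_filename(quant: str) -> str:
    """Build the file name used in the listing for a quantization label."""
    if quant not in QUANT_SIZES_GB:
        raise KeyError(f"unknown quantization: {quant}")
    return f"Sparse-Llama-3.1-8B-2of4.{quant}.gguf"


def largest_quant_under(budget_gb: float):
    """Return the label of the largest file not exceeding budget_gb, or None."""
    fitting = [(q, size) for q, size in QUANT_SIZES_GB.items() if size <= budget_gb]
    if not fitting:
        return None
    # On equal sizes, max() keeps the first entry in dictionary order.
    return max(fitting, key=lambda item: item[1])[0]


def download(quant: str) -> str:
    """Download one file into the local Hugging Face cache and return its path.

    Requires the huggingface_hub package and network access.
    """
    from huggingface_hub import hf_hub_download  # imported lazily on purpose

    return hf_hub_download(repo_id=REPO_ID, filename=gguf_filename(quant))
```

For example, largest_quant_under(5.0) selects Q4_K_M (4.92 GB), and download("Q4_K_M") would then fetch Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf. Using size as the only criterion is a simplification; quality and speed trade-offs between quantization types are not captured by the listing.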