Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
QuantFactory
/
Sparse-Llama-3.1-8B-2of4-GGUF
like
4
Follow
Quant Factory
293
Text Generation
GGUF
vllm
sparsity
Inference Endpoints
arxiv:
2301.00774
arxiv:
2310.06927
License:
llama3.1
Model card
Files
Files and versions
Community
Deploy
Use this model
0a8ebd8
Sparse-Llama-3.1-8B-2of4-GGUF
1 contributor
History:
10 commits
aashish1904
Upload Sparse-Llama-3.1-8B-2of4.Q5_0.gguf with huggingface_hub
0a8ebd8
verified
27 days ago
.gitattributes
2.09 kB
Upload Sparse-Llama-3.1-8B-2of4.Q5_0.gguf with huggingface_hub
27 days ago
README.md
6.06 kB
Upload README.md with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q4_0.gguf
4.66 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q4_0.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q4_1.gguf
5.13 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q4_1.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf
4.92 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q5_0.gguf
5.6 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q5_0.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf
5.73 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q5_K_S.gguf
5.6 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q5_K_S.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q6_K.gguf
6.6 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q6_K.gguf with huggingface_hub
27 days ago
Sparse-Llama-3.1-8B-2of4.Q8_0.gguf
8.54 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q8_0.gguf with huggingface_hub
27 days ago