Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
QuantFactory
/
Sparse-Llama-3.1-8B-2of4-GGUF
like
4
Follow
Quant Factory
280
Text Generation
GGUF
vllm
sparsity
Inference Endpoints
arxiv:
2301.00774
arxiv:
2310.06927
License:
llama3.1
Model card
Files
Files and versions
Community
Deploy
Use this model
53e452b
Sparse-Llama-3.1-8B-2of4-GGUF
1 contributor
History:
8 commits
aashish1904
Upload Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf with huggingface_hub
53e452b
verified
22 days ago
.gitattributes
1.95 kB
Upload Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf with huggingface_hub
22 days ago
README.md
6.06 kB
Upload README.md with huggingface_hub
22 days ago
Sparse-Llama-3.1-8B-2of4.Q4_0.gguf
4.66 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q4_0.gguf with huggingface_hub
22 days ago
Sparse-Llama-3.1-8B-2of4.Q4_1.gguf
5.13 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q4_1.gguf with huggingface_hub
22 days ago
Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf
4.92 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf with huggingface_hub
22 days ago
Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf
5.73 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf with huggingface_hub
22 days ago
Sparse-Llama-3.1-8B-2of4.Q6_K.gguf
6.6 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q6_K.gguf with huggingface_hub
22 days ago
Sparse-Llama-3.1-8B-2of4.Q8_0.gguf
8.54 GB
LFS
Upload Sparse-Llama-3.1-8B-2of4.Q8_0.gguf with huggingface_hub
22 days ago