Text Generation
GGUF
vllm
sparsity
Inference Endpoints
Sparse-Llama-3.1-8B-2of4-GGUF / Sparse-Llama-3.1-8B-2of4.Q5_K_S.gguf

Commit History

Upload Sparse-Llama-3.1-8B-2of4.Q5_K_S.gguf with huggingface_hub
cbe03d4
verified

aashish1904 commited on