Text Generation
GGUF
vllm
sparsity
Inference Endpoints
aashish1904: Upload Sparse-Llama-3.1-8B-2of4.Q3_K_S.gguf with huggingface_hub (commit 1dbdb39, verified)