Text Generation
GGUF
vllm
sparsity
Inference Endpoints

Commit History

Upload Sparse-Llama-3.1-8B-2of4.Q2_K.gguf with huggingface_hub
98695b5
verified

aashish1904 commited on

Upload Sparse-Llama-3.1-8B-2of4.Q5_1.gguf with huggingface_hub
f830001
verified

aashish1904 commited on

Upload Sparse-Llama-3.1-8B-2of4.Q5_0.gguf with huggingface_hub
0a8ebd8
verified

aashish1904 commited on

Upload Sparse-Llama-3.1-8B-2of4.Q5_K_S.gguf with huggingface_hub
cbe03d4
verified

aashish1904 commited on

Upload Sparse-Llama-3.1-8B-2of4.Q5_K_M.gguf with huggingface_hub
53e452b
verified

aashish1904 commited on

Upload Sparse-Llama-3.1-8B-2of4.Q6_K.gguf with huggingface_hub
cb15065
verified

aashish1904 commited on

Upload Sparse-Llama-3.1-8B-2of4.Q8_0.gguf with huggingface_hub
0d7a372
verified

aashish1904 commited on

Upload Sparse-Llama-3.1-8B-2of4.Q4_0.gguf with huggingface_hub
604fb8f
verified

aashish1904 commited on

Upload Sparse-Llama-3.1-8B-2of4.Q4_1.gguf with huggingface_hub
3a3a6b4
verified

aashish1904 commited on

Upload Sparse-Llama-3.1-8B-2of4.Q4_K_M.gguf with huggingface_hub
ecf1278
verified

aashish1904 commited on

Upload README.md with huggingface_hub
6f2fa5b
verified

aashish1904 commited on

initial commit
f749b15
verified

aashish1904 commited on