Text Generation
GGUF
vllm
sparsity
Inference Endpoints
aashish1904 commited on
Commit
1740907
1 Parent(s): 65ad6fb

Upload Sparse-Llama-3.1-8B-2of4.Q4_0_4_4.gguf with huggingface_hub

Browse files
.gitattributes CHANGED
@@ -47,3 +47,4 @@ Sparse-Llama-3.1-8B-2of4.Q3_K_L.gguf filter=lfs diff=lfs merge=lfs -text
47
  Sparse-Llama-3.1-8B-2of4.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
48
  Sparse-Llama-3.1-8B-2of4.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
49
  Sparse-Llama-3.1-8B-2of4.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
 
 
47
  Sparse-Llama-3.1-8B-2of4.Q3_K_M.gguf filter=lfs diff=lfs merge=lfs -text
48
  Sparse-Llama-3.1-8B-2of4.Q3_K_S.gguf filter=lfs diff=lfs merge=lfs -text
49
  Sparse-Llama-3.1-8B-2of4.Q4_K_S.gguf filter=lfs diff=lfs merge=lfs -text
50
+ Sparse-Llama-3.1-8B-2of4.Q4_0_4_4.gguf filter=lfs diff=lfs merge=lfs -text
Sparse-Llama-3.1-8B-2of4.Q4_0_4_4.gguf ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c5861445432e5a14752ae18881ad021df26d18e475f0d88a653cfa9375379c5e
3
+ size 4661211904