qwp4w3hyb/Mistral-22B-v0.1-iMat-GGUF


qwp4w3hyb committed on Apr 12

Commit 8c1ce2a (1 parent: 2424488)

Improve README.md

Files changed (1): README.md +29 -0

@@ -1,3 +1,32 @@
 ---
+base_model: Vezora/Mistral-22B-v0.1
+tags:
+- moe
+- mistral
+- mixtral
+- merge
+- importance matrix
+- imatrix
+language:
+  - fr
+  - it
+  - de
+  - es
+  - en
+model-index:
+- name: Mistral-22B-v0.1-iMat-GGUF
+  results: []
 license: apache-2.0
 ---
+# Vezora/Mistral-22B-v0.1 GGUFs created with an importance matrix
+Source Model: [Vezora/Mistral-22B-v0.1](https://huggingface.co/Vezora/Mistral-22B-v0.1)
+Quantized with [llama.cpp](https://github.com/ggerganov/llama.cpp) commit [5dc9dd7152dedc6046b646855585bd070c91e8c8](https://github.com/ggerganov/llama.cpp/commit/5dc9dd7152dedc6046b646855585bd070c91e8c8) (master from 2024-04-09)
+The importance matrix (imatrix) was generated from the f16 GGUF with this command:
+`./imatrix -c 512 -m $out_path/$base_quant_name -f $llama_cpp_path/groups_merged.txt -o $out_path/imat-f16-gmerged.dat`
+The calibration dataset (`groups_merged.txt`) comes from [this llama.cpp discussion comment](https://github.com/ggerganov/llama.cpp/discussions/5263#discussioncomment-8395384).
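
The README command relies on three shell variables it never defines. The sketch below fills them with hypothetical values to show how the pieces might fit together. The directory layout, the f16 file name, the `IQ4_XS` quant type, the output file name, and the `run` dry-run helper are all assumptions for illustration, not taken from the commit. The second step, passing the matrix to llama.cpp's `quantize` tool via `--imatrix`, is also an assumption about how the published GGUFs were produced. By default the script only prints the commands.

```shell
# Hypothetical defaults; the README does not say what these variables held.
llama_cpp_path="${llama_cpp_path:-./llama.cpp}"
out_path="${out_path:-./out}"
base_quant_name="${base_quant_name:-Mistral-22B-v0.1-f16.gguf}"
quant_type="${quant_type:-IQ4_XS}"   # example type, not from the README

# Output name taken from the README command.
imat_file="$out_path/imat-f16-gmerged.dat"

# Dry-run helper: prints the command unless RUN=1 is set.
run() {
  if [ "${RUN:-0}" = "1" ]; then
    "$@"
  else
    printf '%s\n' "$*"
  fi
}

# Step 1: compute the importance matrix (flags as in the README).
run "$llama_cpp_path/imatrix" -c 512 \
  -m "$out_path/$base_quant_name" \
  -f "$llama_cpp_path/groups_merged.txt" \
  -o "$imat_file"

# Step 2 (assumed): quantize the f16 GGUF using that matrix.
run "$llama_cpp_path/quantize" --imatrix "$imat_file" \
  "$out_path/$base_quant_name" \
  "$out_path/Mistral-22B-v0.1-$quant_type.gguf" \
  "$quant_type"
```

Setting `RUN=1` executes the commands instead of printing them, which only makes sense once the binaries and the f16 GGUF exist at the assumed paths.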