---
language:
- ja
tags:
- mistral
- mixtral
- not-for-all-audiences
- nsfw
pipeline_tag: text-generation
---

# chatntq_chatvector-MoE-Antler_chatvector-2x7B-GGUF

This is a GGUF conversion of [Sdff-Ltba/chatntq_chatvector-MoE-Antler_chatvector-2x7B](https://huggingface.co/Sdff-Ltba/chatntq_chatvector-MoE-Antler_chatvector-2x7B).
It was quantized using an importance matrix (iMatrix).

## Quantization procedure

The following commands were run:
```
# Convert the HF model to a 16-bit GGUF file
python ./llama.cpp/convert.py ./chatntq_chatvector-MoE-Antler_chatvector-2x7B --outtype f16 --outfile ./gguf-model_f16.gguf
# Compute an importance matrix over 32 chunks of the WikiText training data
./llama.cpp/imatrix -m ./gguf-model_f16.gguf -f ./wiki.train.raw -o ./gguf-model_f16.imatrix --chunks 32
# Quantize to IQ3_XXS using the importance matrix
./llama.cpp/quantize --imatrix ./gguf-model_f16.imatrix ./gguf-model_f16.gguf ./chatntq_chatvector-MoE-Antler_chatvector-2x7B_iq3xxs.gguf iq3_xxs
```
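
As a minimal usage sketch (not part of the original card), the resulting file can be run with the `main` example binary from the same llama.cpp checkout; the prompt and token count below are placeholders:
```
# Hypothetical run: sample 128 tokens from the IQ3_XXS quantized model
./llama.cpp/main -m ./chatntq_chatvector-MoE-Antler_chatvector-2x7B_iq3xxs.gguf -p "こんにちは。" -n 128
```
IQ3_XXS is one of the smallest quantization types llama.cpp offers, which is presumably why the importance matrix step is used here: low-bit i-quants lose noticeably more quality without one.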