> This repo contains quantized large language model (LLM) weight files in GGUF format for [maywell/kiqu-70b](https://huggingface.co/maywell/kiqu-70b). The IQ quantized model files are calibrated with [20k_random_data.txt](https://github.com/ggerganov/llama.cpp/files/13970111/20k_random_data.txt).

| Quant Type | Size | BPW | Perplexity |
| ----------- | ------------: | -------------: | --------------------: |
| IQ1_S | 14.5 GB | 1.5625 – 1.69 | 16.5308 +/- 0.13137 |
| IQ2_XXS | 18.3 GB | 2.0625 – 2.12 | 12.1174 +/- 0.09202 |
| IQ2_XS | 20.3 GB | 2.3125 – 2.36 | 11.2679 +/- 0.08525 |
| IQ3_XXS | 27 GB | 3.0625 – 3.13 | 10.0546 +/- 0.07674 |
| Q2_K | 25.5 GB | 2.95 | 4.2965 +/- 0.02164 |
| Q4_0 | 38.9 GB | 4.51 | 3.7527 +/- 0.01835 |
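As a rough cross-check of the BPW column, bits per weight can be estimated from file size and parameter count. A minimal sketch (assuming decimal gigabytes and a nominal 70B parameter count; `bits_per_weight` is an illustrative helper, not part of llama.cpp):

```python
def bits_per_weight(file_size_gb: float, n_params_b: float = 70.0) -> float:
    """Estimate bits per weight as total file bits / parameter count.

    File sizes in the table are rounded and include non-weight metadata,
    and IQ quants mix per-tensor bit widths, so this only approximates
    the BPW ranges listed in the table.
    """
    return file_size_gb * 1e9 * 8 / (n_params_b * 1e9)

# File sizes taken from the table.
sizes_gb = {"IQ1_S": 14.5, "IQ2_XXS": 18.3, "IQ2_XS": 20.3,
            "IQ3_XXS": 27.0, "Q2_K": 25.5, "Q4_0": 38.9}
for name, gb in sizes_gb.items():
    print(f"{name}: ~{bits_per_weight(gb):.2f} BPW")
```

For example, the 14.5 GB IQ1_S file works out to roughly 1.66 BPW, which falls inside the 1.5625 – 1.69 range listed above.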

***

# **kiqu-70b** [(Arena Leaderboard)](https://huggingface.co/spaces/instructkr/ko-chatbot-arena-leaderboard)

<img src="./kiqu.webp" alt="kiqu-70B" width="390"/>