starble-dev
committed on
Commit 1119c19
1 Parent(s): 3ea3df4
Update README.md
README.md
CHANGED
@@ -22,7 +22,7 @@ library_name: transformers
 
 # Quants
 PPL = Perplexity, lower is better<br>
-Comparisons are done as
+Comparisons are done as QX_X Llama-3-8B against FP16 Llama-3-8B, recommended as a guideline and not as fact.
 | Quant Type | Note | Size |
 | ---- | ---- | ---- |
 | [Q2_K](https://huggingface.co/starble-dev/mini-magnum-12b-v1.1-GGUF/blob/main/Mini-Magnum-12B-v1.1-Q2_K.gguf) | +3.5199 ppl @ Llama-3-8B | 4.79 GB |
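
Reading the Note column: a minimal sketch, assuming each "+N ppl" figure is the quantized model's perplexity minus the FP16 baseline's perplexity, both measured on the same text. The function names and the per-token negative log-likelihood inputs are hypothetical illustrations, not taken from this repository or from any evaluation tool.

```python
import math


def perplexity(token_nlls):
    """Perplexity = exp(mean per-token negative log-likelihood, natural log)."""
    if not token_nlls:
        raise ValueError("need at least one token")
    return math.exp(sum(token_nlls) / len(token_nlls))


def ppl_delta(quant_nlls, fp16_nlls):
    """Perplexity increase of a quantized model over its FP16 baseline.

    Assumption: both lists hold per-token NLLs over the same evaluation text.
    A positive result means the quantized model is worse (lower is better).
    """
    return perplexity(quant_nlls) - perplexity(fp16_nlls)


if __name__ == "__main__":
    # Made-up NLL values, used only to show the call shape.
    fp16 = [1.9, 2.1, 2.0, 2.2]
    q2_k = [2.3, 2.5, 2.4, 2.6]
    print(f"+{ppl_delta(q2_k, fp16):.4f} ppl")
```

Because the figures in the table were measured on Llama-3-8B, they indicate the relative cost of each quant type and should not be read as exact values for this 12B model.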