Update README.md
Browse files
README.md
CHANGED
@@ -15,9 +15,31 @@ This method is applied across all compressions from IQ1 to Q8.
|
|
15 |
Even IQ1_S - the most compressed verison - works well, however IQ4/Q4 are suggested as minumums for quality.
|
16 |
Highest quality will be Q6/Q8.
|
17 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
18 |
This merge was an experiment to test already established Roleplay, Fiction and Story
|
19 |
generation of "Tiefighter" with a some of "Orca 2"'s qualities.
|
20 |
|
|
|
|
|
21 |
For Imatrix plus this was a test of high precision in specific areas of the model leading to a slightly larger compressed file.
|
22 |
In addition the Imatrix process itself used a larger "calibration" file than standard to further enhance quality.
|
23 |
|
|
|
15 |
Even IQ1_S - the most compressed verison - works well, however IQ4/Q4 are suggested as minumums for quality.
|
16 |
Highest quality will be Q6/Q8.
|
17 |
|
18 |
+
How big a difference is this merge?
|
19 |
+
|
20 |
+
Orginal Tiefighter IQ1_S (with imatrix enhancements) tested at a perplexity of:
|
21 |
+
PPL = 17.2589 +/- 0.12466*
|
22 |
+
|
23 |
+
Tiefighter Orca 2 IQ1_S (with imatrix enhancements) tested at a perplexity of:
|
24 |
+
PPL = 12.6985 +/- 0.09106*
|
25 |
+
|
26 |
+
Note that LOWER perplexity is better.
|
27 |
+
|
28 |
+
* Tested using llamacpp, perplexity.exe with wiki.raw.
|
29 |
+
|
30 |
+
In addition the Imatrix file used to "fix" the compressed files post compression resulted in
|
31 |
+
over 2 whole points lower perplexity at IQ1_S vs some of the other "Imatrix" files currently in use.
|
32 |
+
|
33 |
+
Orginal Tiefighter IQ1_S (with imatrix enhancements) tested with a different "Imatrix" repair file at a perplexity of:
|
34 |
+
PPL = 19.6355 +/- 0.14435
|
35 |
+
|
36 |
+
Likewise the merge itself affected perplexity too.
|
37 |
+
|
38 |
This merge was an experiment to test already established Roleplay, Fiction and Story
|
39 |
generation of "Tiefighter" with a some of "Orca 2"'s qualities.
|
40 |
|
41 |
+
Additional merge experiements are in progress.
|
42 |
+
|
43 |
For Imatrix plus this was a test of high precision in specific areas of the model leading to a slightly larger compressed file.
|
44 |
In addition the Imatrix process itself used a larger "calibration" file than standard to further enhance quality.
|
45 |
|