Commit 1680410 (verified) by DavidAU · 1 Parent(s): 64a3ed2

Update README.md

Files changed (1): README.md +22 -0
@@ -15,9 +15,31 @@ This method is applied across all compressions from IQ1 to Q8.
  Even IQ1_S - the most compressed version - works well; however, IQ4/Q4 are suggested as minimums for quality.
  Highest quality will be Q6/Q8.
 
+ How big a difference is this merge?
+
+ Original Tiefighter IQ1_S (with imatrix enhancements) tested at a perplexity of:
+ PPL = 17.2589 +/- 0.12466*
+
+ Tiefighter Orca 2 IQ1_S (with imatrix enhancements) tested at a perplexity of:
+ PPL = 12.6985 +/- 0.09106*
+
+ Note that LOWER perplexity is better.
+
+ * Tested using llama.cpp's perplexity.exe with wiki.raw.
+
+ In addition, the Imatrix file used to "fix" the compressed files post-compression resulted in
+ over 2 whole points lower perplexity at IQ1_S vs. some of the other "Imatrix" files currently in use.
+
+ Original Tiefighter IQ1_S (with imatrix enhancements) tested with a different "Imatrix" repair file at a perplexity of:
+ PPL = 19.6355 +/- 0.14435
+
+ Likewise, the merge itself affected perplexity.
+
  This merge was an experiment to test already established Roleplay, Fiction and Story
  generation of "Tiefighter" with some of "Orca 2"'s qualities.
 
+ Additional merge experiments are in progress.
+
  For Imatrix plus, this was a test of high precision in specific areas of the model, leading to a slightly larger compressed file.
  In addition, the Imatrix process itself used a larger "calibration" file than standard to further enhance quality.
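
The three IQ1_S perplexity figures quoted in the README text above can be compared with a little arithmetic. The sketch below is only an illustration: the constant names and the `ppl_change` helper are hypothetical (not part of llama.cpp or the README), and the numbers are copied verbatim from the text. It reports the absolute and relative perplexity drop for the merge and for the choice of imatrix file.

```python
# Perplexity values copied from the README text (all IQ1_S quants).
PPL_TIEFIGHTER = 17.2589                # original Tiefighter, author's imatrix file
PPL_TIEFIGHTER_ORCA2 = 12.6985          # Tiefighter Orca 2 merge, author's imatrix file
PPL_TIEFIGHTER_OTHER_IMATRIX = 19.6355  # original Tiefighter, a different imatrix file


def ppl_change(baseline: float, candidate: float) -> tuple[float, float]:
    """Return (absolute drop, relative drop) in perplexity.

    Positive values mean the candidate has LOWER (better) perplexity.
    """
    drop = baseline - candidate
    return drop, drop / baseline


# Effect of the merge, with the imatrix file held fixed.
merge_drop, merge_rel = ppl_change(PPL_TIEFIGHTER, PPL_TIEFIGHTER_ORCA2)

# Effect of the imatrix file, with the model (original Tiefighter) held fixed.
imatrix_drop, imatrix_rel = ppl_change(PPL_TIEFIGHTER_OTHER_IMATRIX, PPL_TIEFIGHTER)

print(f"merge:   {merge_drop:.4f} points lower ({merge_rel:.1%})")
print(f"imatrix: {imatrix_drop:.4f} points lower ({imatrix_rel:.1%})")
```

The imatrix comparison is where the "over 2 whole points" statement comes from; the merge accounts for the larger of the two drops. These are single measurements at one quant level, so they should be read as indicative rather than as a general ranking.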