File size: 1,142 Bytes
b5ab98b |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 |
---
license: apache-2.0
language:
- en
tags:
- story
- general usage
- roleplay
- creative
- rp
- fantasy
- story telling
- ultra high precision
---
<B>NEO CLASS Ultra Quant for : L3-8B-Stheno-v3.2</B>
Additional quants are uploading...
The NEO Class tech was created after countless investigations and over 120 lab experiments backed by
real world testing and qualitative results.
NEO Class results:
Better overall function, instruction following, output quality and stronger connections to ideas, concepts and the world in general.
In addition quants now operate above their "grade" so to speak :
IE: Q4 / IQ4 operate at Q5KM/Q6 levels.
Likewise for Q3/IQ3 operate at Q4KM/Q5 levels.
The examples below illustrate the least amount of improvement using some of the lowest quants.
Perplexity drop of 1191 points VS regular quant of IQ4XS.
(lower is better)
<B> Model Notes: </B>
Maximum context is 8k. Please see original model maker's page for details, and usage information for this model.
Special thanks to the model creators at SAO10K for making such a fantastic model:
[ https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2 ] |