DavidAU's picture
Create README.md
b5ab98b verified
|
raw
history blame
1.14 kB
metadata
license: apache-2.0
language:
  - en
tags:
  - story
  - general usage
  - roleplay
  - creative
  - rp
  - fantasy
  - story telling
  - ultra high precision

NEO CLASS Ultra Quant for : L3-8B-Stheno-v3.2

Additional quants are uploading...

The NEO Class tech was created after countless investigations and over 120 lab experiments backed by real world testing and qualitative results.

NEO Class results:

Better overall function, instruction following, output quality and stronger connections to ideas, concepts and the world in general.

In addition quants now operate above their "grade" so to speak :

IE: Q4 / IQ4 operate at Q5KM/Q6 levels.

Likewise for Q3/IQ3 operate at Q4KM/Q5 levels.

The examples below illustrate the least amount of improvement using some of the lowest quants.

Perplexity drop of 1191 points VS regular quant of IQ4XS.

(lower is better)

Model Notes:

Maximum context is 8k. Please see original model maker's page for details, and usage information for this model.

Special thanks to the model creators at SAO10K for making such a fantastic model:

[ https://huggingface.co/Sao10K/L3-8B-Stheno-v3.2 ]