GGUF / IQ / Imatrix for Cosmic-Citrus-9B
Why Importance Matrix?
Importance Matrix, at least based on my testing, has shown to improve the output and performance of "IQ"-type quantizations, where the compression becomes quite heavy. The Imatrix performs a calibration, using a provided dataset. Testing has shown that semi-randomized data can help perserve more important segments as the compression is applied.
Related discussions in Github: [1] [2]
The imatrix.txt file that I used contains general, semi-random data, with some custom kink.
Cosmic-Citrus-9B
Another attempt at merging Cerebrum, InfinityRP, LemonadeRP, and Laymonade, all already merged in my previous merges, now into a 9B containing TheSpice.
So far in my tests, it seems to follow my cards in intriguing way, using refined language, with more consideration of what the prompt is saying.
In fact, I'm quite positively surprised as the creativity surpassed my expectations. It's quickly becoming a favorite to use for me.
Merge Details
This is a merge of pre-trained language models created using mergekit.
Merge Method
This model was merged using the passthrough merge method.
Models Merged
The following models were included in the merge:
Configuration
The following YAML configuration was used to produce this model:
slices:
- sources:
- model: ABX-AI/Cerebral-Infinity-7B
layer_range: [0, 20]
- sources:
- model: ABX-AI/Spicy-Laymonade-7B
layer_range: [12, 32]
merge_method: passthrough
dtype: float16
- Downloads last month
- 17
3-bit
4-bit
5-bit
6-bit