GGUF / IQ / Imatrix for Cosmic-Citrus-9B

Why Importance Matrix?

An importance matrix (imatrix), at least in my testing, improves the output and performance of "IQ"-type quantizations, where the compression becomes quite heavy. The imatrix is computed in a calibration pass over a provided dataset. Testing has shown that semi-randomized data can help preserve the more important segments as the compression is applied.
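
To make the idea more concrete, here is a toy sketch in plain Python. It is not llama.cpp's actual implementation: the function names, the sample numbers, and the simple scale search are all assumptions chosen for illustration. It assumes that importance is the mean squared activation seen per weight column during calibration, and that the quantizer picks the scale that minimizes the importance-weighted rounding error.

```python
def importance_from_activations(activations):
    """Mean squared activation per column over the calibration rows (assumed definition)."""
    n = len(activations)
    return [sum(row[j] ** 2 for row in activations) / n
            for j in range(len(activations[0]))]

def quantize(weights, scale, levels):
    """Round each weight to the nearest multiple of `scale`, clamped to +/- levels steps."""
    return [max(-levels, min(levels, round(w / scale))) * scale for w in weights]

def weighted_error(weights, approx, importance):
    """Squared rounding error, with each weight's error scaled by its importance."""
    return sum(i * (w - a) ** 2 for w, a, i in zip(weights, approx, importance))

def best_scale(weights, importance, levels, steps=200):
    """Try a range of scales and keep the one with the lowest weighted error."""
    max_abs = max(abs(w) for w in weights)
    candidates = [max_abs / levels * (0.5 + k / steps) for k in range(steps + 1)]
    return min(candidates,
               key=lambda s: weighted_error(weights, quantize(weights, s, levels), importance))

# Hypothetical calibration activations: columns 1 and 4 are used much more heavily.
calibration = [
    [0.1, 2.0, 0.3, 0.2, 1.8, 0.4],
    [0.2, -2.4, 0.1, 0.3, -2.1, 0.5],
]
weights = [0.9, -0.05, 0.31, -0.62, 0.12, 0.47]
levels = 3  # a very coarse grid, standing in for heavy compression

importance = importance_from_activations(calibration)
uniform = [1.0] * len(weights)

plain_scale = best_scale(weights, uniform, levels)        # ignores importance
imatrix_scale = best_scale(weights, importance, levels)   # uses importance

plain_err = weighted_error(weights, quantize(weights, plain_scale, levels), importance)
imatrix_err = weighted_error(weights, quantize(weights, imatrix_scale, levels), importance)
print(f"weighted error without importance: {plain_err:.5f}")
print(f"weighted error with importance:    {imatrix_err:.5f}")
```

Because both scales are picked from the same candidate list, the importance-aware choice can never do worse on the weighted error, which is the intuition behind using calibration data for the heaviest quantizations.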

Related discussions on GitHub: [1] [2]

The imatrix.txt file that I used contains general, semi-random data, with some custom kink.

Cosmic-Citrus-9B

Another attempt at merging Cerebrum, InfinityRP, LemonadeRP, and Laymonade, all of which were already part of my previous merges, now into a 9B that contains TheSpice.

So far in my tests, it seems to follow my cards in an intriguing way, using refined language and giving more consideration to what the prompt is saying.

In fact, I'm quite positively surprised, as the creativity surpassed my expectations. It's quickly becoming one of my favorites to use.

Merge Details

This is a merge of pre-trained language models created using mergekit.

Merge Method

This model was merged using the passthrough merge method.

Models Merged

The following models were included in the merge: ABX-AI/Cerebral-Infinity-7B and ABX-AI/Spicy-Laymonade-7B.

Configuration

The following YAML configuration was used to produce this model:

slices:
  - sources:
      - model: ABX-AI/Cerebral-Infinity-7B
        layer_range: [0, 20]
  - sources:
      - model: ABX-AI/Spicy-Laymonade-7B
        layer_range: [12, 32]
merge_method: passthrough
dtype: float16
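
As a rough illustration of what this configuration does, the sketch below stacks the two slices in order, the way a passthrough merge copies layers without averaging them. It assumes that `layer_range` is half-open (start included, end excluded) and that each source model has 32 layers, which the upper bound of the second range suggests. The `stack_layers` helper is hypothetical and not part of mergekit.

```python
# Slices copied from the YAML configuration above.
slices = [
    ("ABX-AI/Cerebral-Infinity-7B", (0, 20)),
    ("ABX-AI/Spicy-Laymonade-7B", (12, 32)),
]

def stack_layers(slices):
    """Return the merged layer stack as (source model, source layer index) pairs."""
    stacked = []
    for model, (start, end) in slices:
        # Assumption: the range covers start..end-1, like Python's range().
        stacked.extend((model, index) for index in range(start, end))
    return stacked

merged = stack_layers(slices)
print(f"merged model has {len(merged)} layers")
for position in (0, 19, 20, 39):
    model, index = merged[position]
    print(f"merged layer {position:2d} <- {model} layer {index}")
```

Under these assumptions the result has 40 layers instead of 32, with source layers 12 to 19 represented twice (once from each model), which is consistent with the size growing from 7B to roughly 9B.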

Format: GGUF. Model size: 9B params. Architecture: llama. Quantizations: 3-bit, 4-bit, 5-bit, 6-bit.
