
GGUF quantizations for ChaoticNeutrals/Prima-LelantaclesV5-7b.

If you want any specific quantization to be added, feel free to ask.

All credits belong to the respective creators.

Base ⇢ GGUF (F16) ⇢ GGUF (Quants)

Quantized using llama.cpp release b2222. For the imatrix quantizations, the included reference file imatrix-Q8_0.dat was passed via --imatrix.
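As a rough sketch, the pipeline above (base model ⇢ F16 GGUF ⇢ quantized GGUFs) can be reproduced with llama.cpp's own tools at that release; the model paths and the Q4_K_M quant type below are illustrative placeholders, not a record of the exact commands used:

```shell
# Convert the HF-format base model to an F16 GGUF
# (at tag b2222 the converter script is convert.py)
python convert.py ./Prima-LelantaclesV5-7b \
  --outtype f16 \
  --outfile Prima-LelantaclesV5-7b-F16.gguf

# Quantize with the bundled importance matrix (example: Q4_K_M)
./quantize --imatrix imatrix-Q8_0.dat \
  Prima-LelantaclesV5-7b-F16.gguf \
  Prima-LelantaclesV5-7b-Q4_K_M.gguf Q4_K_M
```

The same invocation with a different final argument (Q3_K_M, Q5_K_M, Q6_K, …) produces the other bit-width variants.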

Original model information:


SillyTavern presets: https://huggingface.co/ChaoticNeutrals/Prima-LelantaclesV5-7b/tree/main/ST%20presets

This model was merged using the DARE TIES merge method, with Test157t/Prima-LelantaclesV4-7b-16k as the base model.

The following models were included in the merge:

- Test157t/Pasta-Lake-7b
- Test157t/Prima-LelantaclesV4-7b-16k

Configuration

The following YAML configuration was used to produce this model:

```yaml
merge_method: dare_ties
base_model: Test157t/Prima-LelantaclesV4-7b-16k
parameters:
  normalize: true
models:
  - model: Test157t/Pasta-Lake-7b
    parameters:
      weight: 1
  - model: Test157t/Prima-LelantaclesV4-7b-16k
    parameters:
      weight: 1
dtype: float16
```
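Assuming the standard mergekit tooling (the card does not name the tool, but this YAML matches mergekit's config format), a configuration like the above can be applied with its CLI; the config filename and output path are placeholders:

```shell
# Hypothetical reproduction of the merge with mergekit
pip install mergekit
mergekit-yaml config.yaml ./Prima-LelantaclesV5-7b --cuda
```

The merged model in the output directory can then be fed into the GGUF conversion step described earlier.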
Model size: 7.24B parameters (llama architecture). Quantizations are provided in 3-bit, 4-bit, 5-bit, and 6-bit variants.
