Edit model card

QuantFactory Banner

QuantFactory/NarraThinker12B-GGUF

This is quantized version of ClaudioItaly/NarraThinker12B created using llama.cpp

Original Model Card

merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the SLERP merge method.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: NeverSleep/Lumimaid-v0.2-12B
  - model: nbeerbower/mistral-nemo-gutenberg-12B-v4
merge_method: slerp
base_model: nbeerbower/mistral-nemo-gutenberg-12B-v4
dtype: bfloat16
parameters:
  t: [0.1, 0.1, 0.1, 0.1, 0.1, 0.1, 0.2, 0.2, 0.2, 0.3, 0.3, 0.4, 0.5, 0.6, 0.7, 0.8, 0.9, 0.9, 0.9, 0.9, 0.9, 0.9, 0.9]
  layers: [0, 5, 10, 15, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 85, 90, 95, 100, 105, 110]
tokenizer_merge_method: slerp
tokenizer_parameters:
  t: 0.2
Downloads last month
40
GGUF
Model size
12.2B params
Architecture
llama

2-bit

3-bit

4-bit

5-bit

6-bit

8-bit

Inference API
Unable to determine this model’s pipeline type. Check the docs .

Model tree for QuantFactory/NarraThinker12B-GGUF