Merge

This is a merge of pre-trained language models created using mergekit.

Merge Method

This model was merged using the Model Stock merge method using TheDrummer/Gemmasutra-9B-v1.1 as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: TheDrummer/Gemmasutra-9B-v1.1
  - model: Rombo-Org/Rombo-LLM-V2.7-gemma-2-9b
  - model: allura-org/G2-9B-Aletheia-v1
  - model: anthracite-org/magnum-v4-9b
  - model: nbeerbower/Gemma2-Gutenberg-Doppel-9B
  - model: DavidAU/Gemma-The-Writer-Mighty-Sword-9B
merge_method: model_stock
base_model: TheDrummer/Gemmasutra-9B-v1.1
parameters:
    normalize: true
dtype: bfloat16

Open LLM Leaderboard Evaluation Results

Detailed results can be found here

Metric Value
Avg. 21.58
IFEval (0-Shot) 15.82
BBH (3-Shot) 43.62
MATH Lvl 5 (4-Shot) 2.79
GPQA (0-shot) 13.76
MuSR (0-shot) 17.23
MMLU-PRO (5-shot) 36.24
Downloads last month
57
Safetensors
Model size
10.2B params
Tensor type
BF16
Β·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.

Model tree for Triangle104/Gemmadevi-Stock-10B

Spaces using Triangle104/Gemmadevi-Stock-10B 3

Collections including Triangle104/Gemmadevi-Stock-10B

Evaluation results