LlamaTronic-3.1-4B / README.md
bunnycore's picture
Upload folder using huggingface_hub
7407580 verified
|
raw
history blame
1.91 kB
metadata
base_model:
  - rasyosef/Llama-3.1-Minitron-4B-Chat
  - anthracite-org/magnum-v2-4b
  - Delta-Vector/Holland-4B-V1
  - nvidia/Llama-3.1-Minitron-4B-Width-Base
  - Magpie-Align/MagpieLM-4B-Chat-v0.1
  - bunnycore/LLama-3.1-4B-TitanFusion
library_name: transformers
tags:
  - mergekit
  - merge

merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the TIES merge method using nvidia/Llama-3.1-Minitron-4B-Width-Base as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

models:
  - model: anthracite-org/magnum-v2-4b
    parameters:
      weight: 1
      density: 1

  - model: Magpie-Align/MagpieLM-4B-Chat-v0.1
    parameters:
      weight: 1
      density: 1
      
  - model: rasyosef/Llama-3.1-Minitron-4B-Chat
    parameters:
      weight: 1
      density: 1

  - model: bunnycore/LLama-3.1-4B-TitanFusion
    parameters:
      weight: 1
      density: 1

  - model: Delta-Vector/Holland-4B-V1
    parameters:
      weight: 1
      density: 1

merge_method: ties
base_model: nvidia/Llama-3.1-Minitron-4B-Width-Base
parameters:
  density: 1
  normalize: true
  int8_mask: true
dtype: bfloat16