MD-Zephyria-42b [EXPERIMENTAL]

Model Information

Base Model: unsloth/Mistral-Small-Instruct-2409

Strategy: Mid Duplication

Total Layers: 55

Duplication Start: Layer 22 (40% of model)

Duplicated Layers: 27 (49.1% of model)

Unique Final Layers: 7 (12.7% of model)

Model Characteristics

  • Models down_proj and o_proj layers have been nulled and will require healing
  • Balances early feature extraction and later refinement
  • Even split between unique and duplicated sections
  • Good for general-purpose tasks with balanced low and high-level processing
  • May provide a good compromise for a wide range of applications

Configuration Visualization


[     Unique     ][     Duplicated     ][  Unique  ]
0 ------------- 21 22 ------------- 48 49 ------- 54
      40%              49.1%            10.9%
      
Downloads last month
3
Safetensors
Model size
42.1B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Model tree for TheSkullery/MD-Zephyria-42b

Finetuned
(9)
this model
Quantizations
2 models