Hermes-3-Llama-3.1.3-70B / mergekit_config.yml
Triangle104's picture
Upload folder using huggingface_hub
832c709 verified
raw
history blame contribute delete
285 Bytes
models:
- model: NousResearch/Hermes-3-Llama-3.1-70B
- model: unsloth/Llama-3.3-70B-Instruct
merge_method: slerp
base_model: unsloth/Llama-3.3-70B-Instruct
dtype: bfloat16
parameters:
t: [0, 0.5, 1, 0.5, 0] # V shaped curve: Llama for input & output, Hermes in the middle layers