JoPmt
/

mix-llama-3-8B-inst-line

Text Generation

NousResearch/Meta-Llama-3-8B-Instruct

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

mix-llama-3-8B-inst-line / mergekit_config.yml

JoPmt's picture

Upload folder using huggingface_hub

734c5b7 verified 7 months ago

583 Bytes


	dtype: bfloat16
	merge_method: linear
	slices:
	- sources:
	- layer_range: [0, 16] # Assuming the first half of the model is more general and can be reduced more
	model: NousResearch/Meta-Llama-3-8B-Instruct
	parameters:
	weight: 0.5 # Reduce the weight of the first half to make room for the second half
	- layer_range: [16, 32] # Assuming the second half of the model is more specialized and can be reduced less
	model: NousResearch/Meta-Llama-3-8B-Instruct
	parameters:
	weight: 0.5 # Maintain the weight of the second half