---
base_model:
- nbeerbower/Mistral-Nemo-Gutenberg-Doppel-12B-v2
- Fizzarolli/MN-12b-Sunrose
- anthracite-org/magnum-v4-12b
- mistralai/Mistral-Nemo-Instruct-2407
library_name: transformers
tags:
- mergekit
- merge
---
# Inferor 0.1

Another iteration of Inferor, this time built on a different base model.

### Recommended settings: [Infermatic/MN 12B Inferor v0.0 article](https://infermatic.ai/infermatic-mn-12b-inferor-v0-0/)

![image/png](https://cdn-uploads.huggingface.co/production/uploads/64be962a38953777feaabfc0/VflBXBEkNWGwfK_xVQQis.png)

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details

### Merge Method

This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method, with [mistralai/Mistral-Nemo-Instruct-2407](https://huggingface.co/mistralai/Mistral-Nemo-Instruct-2407) as the base model. A brief sketch of how Model Stock combines the weights appears at the end of this card.

### Models Merged

The following models were included in the merge:

* [nbeerbower/Mistral-Nemo-Gutenberg-Doppel-12B-v2](https://huggingface.co/nbeerbower/Mistral-Nemo-Gutenberg-Doppel-12B-v2)
* [Fizzarolli/MN-12b-Sunrose](https://huggingface.co/Fizzarolli/MN-12b-Sunrose)
* [anthracite-org/magnum-v4-12b](https://huggingface.co/anthracite-org/magnum-v4-12b)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
base_model: mistralai/Mistral-Nemo-Instruct-2407
dtype: bfloat16
merge_method: model_stock
slices:
- sources:
  - layer_range: [0, 40]
    model: Fizzarolli/MN-12b-Sunrose
  - layer_range: [0, 40]
    model: nbeerbower/Mistral-Nemo-Gutenberg-Doppel-12B-v2
  - layer_range: [0, 40]
    model: anthracite-org/magnum-v4-12b
  - layer_range: [0, 40]
    model: mistralai/Mistral-Nemo-Instruct-2407
```
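With [mergekit](https://github.com/cg123/mergekit) installed, the merge should be reproducible by saving the YAML above as `config.yaml` and running `mergekit-yaml config.yaml ./output-model-directory` (the `mergekit-yaml` entry point is the one current mergekit releases ship; check the mergekit README if it has changed).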
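### Model Stock, in brief

The sketch below illustrates the interpolation rule from the Model Stock paper applied to a single weight tensor: take the task vectors of the fine-tuned models relative to the base, estimate how aligned they are, and move from the base toward their average by a ratio derived from that alignment. This is a simplified illustration of the idea, not mergekit's actual implementation; the function name `model_stock_tensor` and the per-tensor treatment are assumptions made for the example.

```python
# Rough per-tensor sketch of the Model Stock rule (arXiv:2403.19522).
# NOT mergekit's implementation -- an illustration of the idea only.
import torch
import torch.nn.functional as F


def model_stock_tensor(base: torch.Tensor, finetuned: list[torch.Tensor]) -> torch.Tensor:
    k = len(finetuned)
    assert k >= 2, "Model Stock needs at least two fine-tuned models"

    # Task vectors: each fine-tuned weight relative to the base weight.
    deltas = [(w - base).flatten().float() for w in finetuned]

    # Average pairwise cosine similarity between the task vectors.
    cos_sum, pairs = 0.0, 0
    for i in range(k):
        for j in range(i + 1, k):
            cos_sum += F.cosine_similarity(deltas[i], deltas[j], dim=0).item()
            pairs += 1
    cos_theta = cos_sum / pairs

    # Interpolation ratio from the paper: t = k*cos(theta) / (1 + (k-1)*cos(theta)).
    t = k * cos_theta / (1 + (k - 1) * cos_theta)

    # Move from the base toward the average of the fine-tuned weights by t.
    w_avg = torch.stack(finetuned).mean(dim=0)
    return t * w_avg + (1 - t) * base
```

The intuition: the more the fine-tuned models agree (cosine near 1), the closer `t` gets to 1 and the merge trusts their average; the less they agree, the more weight stays on the base model.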
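## Usage

A minimal loading sketch with transformers. The repo id below is a placeholder assumption (this card does not state the final Hub path), and the chat template is inherited from the Mistral-Nemo-Instruct base tokenizer; substitute the actual repository name and adjust sampling settings per the recommended-settings article linked above.

```python
# Minimal sketch: load the merged model and generate from a chat prompt.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "Infermatic/MN-12B-Inferor-v0.1"  # hypothetical repo id -- replace with the real one
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)

messages = [{"role": "user", "content": "Write a short scene set at sunrise."}]
inputs = tokenizer.apply_chat_template(
    messages, add_generation_prompt=True, return_tensors="pt"
).to(model.device)

output = model.generate(inputs, max_new_tokens=256, do_sample=True, temperature=0.8)
# Decode only the newly generated tokens, skipping the prompt.
print(tokenizer.decode(output[0][inputs.shape[-1]:], skip_special_tokens=True))
```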