---
base_model:
- sthenno-com/miscii-14b-1225
- SicariusSicariiStuff/Impish_QWEN_14B-1M
- arcee-ai/Virtuoso-Small
- ToastyPigeon/Qwen2.5-14B-Instruct-1M-Unalign
- deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
- Qwen/Qwen2.5-14B-Instruct-1M
- sthenno/tempesthenno-nuslerp-0124
- sthenno/tempesthenno-0126-ckpt150
library_name: transformers
tags:
- mergekit
- merge
---
# merge

This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).

## Merge Details
### Merge Method

This model was merged using the [SCE](https://arxiv.org/abs/2408.07990) merge method using [sthenno-com/miscii-14b-1225](https://huggingface.co/sthenno-com/miscii-14b-1225) as a base.

### Models Merged

The following models were included in the merge:
* [SicariusSicariiStuff/Impish_QWEN_14B-1M](https://huggingface.co/SicariusSicariiStuff/Impish_QWEN_14B-1M)
* [arcee-ai/Virtuoso-Small](https://huggingface.co/arcee-ai/Virtuoso-Small)
* [ToastyPigeon/Qwen2.5-14B-Instruct-1M-Unalign](https://huggingface.co/ToastyPigeon/Qwen2.5-14B-Instruct-1M-Unalign)
* [deepseek-ai/DeepSeek-R1-Distill-Qwen-14B](https://huggingface.co/deepseek-ai/DeepSeek-R1-Distill-Qwen-14B)
* [Qwen/Qwen2.5-14B-Instruct-1M](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct-1M)
* [sthenno/tempesthenno-nuslerp-0124](https://huggingface.co/sthenno/tempesthenno-nuslerp-0124)
* [sthenno/tempesthenno-0126-ckpt150](https://huggingface.co/sthenno/tempesthenno-0126-ckpt150)

### Configuration

The following YAML configuration was used to produce this model:

```yaml
merge_method: sce
models:
  - model: sthenno/tempesthenno-nuslerp-0124
  - model: Qwen/Qwen2.5-14B-Instruct-1M
  - model: sthenno/tempesthenno-0126-ckpt150
  - model: arcee-ai/Virtuoso-Small
  - model: deepseek-ai/DeepSeek-R1-Distill-Qwen-14B
  - model: SicariusSicariiStuff/Impish_QWEN_14B-1M
  - model: ToastyPigeon/Qwen2.5-14B-Instruct-1M-Unalign
base_model: sthenno-com/miscii-14b-1225
parameters:
  select_topk: 1.0
dtype: bfloat16
normalize: true
```
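
### Illustrative sketch of SCE

To give a rough intuition for the merge method and the `select_topk` parameter in the configuration above, here is a toy, pure-Python sketch of the three SCE steps (select, calculate, erase) applied to flat lists of floats. It is a simplified illustration under stated assumptions, not mergekit's implementation: the function name `sce_merge_toy`, the population-variance ranking, the energy-based weights, and the per-element renormalization after erasing are all assumptions made for this example.

```python
from typing import List


def sce_merge_toy(
    base: List[float],
    models: List[List[float]],
    select_topk: float = 1.0,
) -> List[float]:
    """Toy SCE-style merge of flat parameter lists (hypothetical helper)."""
    n = len(base)
    # Task vectors: difference between each fine-tuned model and the base.
    deltas = [[m[i] - base[i] for i in range(n)] for m in models]

    # Select: keep only the top-k fraction of positions, ranked by the
    # variance of the deltas across models. With select_topk = 1.0
    # (as in the configuration above) every position is kept.
    if select_topk < 1.0:
        means = [sum(d[i] for d in deltas) / len(deltas) for i in range(n)]
        variances = [
            sum((d[i] - means[i]) ** 2 for d in deltas) / len(deltas)
            for i in range(n)
        ]
        k = max(1, int(n * select_topk))
        ranked = sorted(range(n), key=lambda i: variances[i], reverse=True)
        keep = set(ranked[:k])
        deltas = [[d[i] if i in keep else 0.0 for i in range(n)] for d in deltas]

    # Calculate: one weight per model, proportional to the sum of squares
    # of its (selected) delta.
    energies = [sum(x * x for x in d) for d in deltas]
    total = sum(energies)
    if total == 0.0:
        return list(base)
    weights = [e / total for e in energies]

    # Erase: at each position, drop deltas whose sign disagrees with the
    # sign of the summed delta, then average the survivors using the
    # weights renormalized over the surviving models.
    merged = []
    for i in range(n):
        sign = 1.0 if sum(d[i] for d in deltas) >= 0 else -1.0
        num = 0.0
        den = 0.0
        for w, d in zip(weights, deltas):
            if d[i] * sign > 0:
                num += w * d[i]
                den += w
        merged.append(base[i] + (num / den if den > 0 else 0.0))
    return merged


if __name__ == "__main__":
    # Two models agree on +1, one disagrees with -1: the minority is erased.
    print(sce_merge_toy([0.0], [[1.0], [1.0], [-1.0]]))
```

In this toy version, lowering `select_topk` below 1.0 zeroes out the positions where the models' deltas vary least before the weights are computed; at 1.0 the selection step is a no-op.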