UPD: this model series has been succeeded by EVA.
Made public again and kept for historical reasons.
There's not much point in these merges, since Celeste 70B 0.1 pretty much melded Celeste's and Magnum's datasets anyway.
To be continued, but on a different base, under a different name, and actually trained this time, without shortcuts.
MN-12B-Starcannon-v2
This is a merge of pre-trained language models created using mergekit. It turned out to be a bit more Magnum-esque, but it is still very creative, and the writing style is pretty nice, even if some slop words appear from time to time. It might be a good fit for people who want more variety than Magnum offers and more verbose prose than Celeste v1.9.
Dynamic FP8
Static GGUF (by Mradermacher)
EXL2 (by kingbri of RoyalLab)
Merge Details
Merge Method
This model was merged using the TIES merge method, with nothingiisreal/MN-12B-Celeste-V1.9 as the base.
Merge fodder
The following models were included in the merge:
- intervitens/mini-magnum-12b-v1.1
- nothingiisreal/MN-12B-Celeste-V1.9
Configuration
The following YAML configuration was used to produce this model:
models:
  - model: intervitens/mini-magnum-12b-v1.1
    parameters:
      density: 0.3
      weight: 0.5
  - model: nothingiisreal/MN-12B-Celeste-V1.9
    parameters:
      density: 0.7
      weight: 0.5
merge_method: ties
base_model: nothingiisreal/MN-12B-Celeste-V1.9
parameters:
  normalize: true
  int8_mask: true
dtype: bfloat16
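To give a feel for what `density`, `weight` and `normalize` do in a TIES merge, here is a minimal toy sketch in plain Python. It is not mergekit's implementation: the function names (`trim`, `ties_merge`) and the ten-element "weights" are made up for illustration, real merges operate per tensor on the actual checkpoints, and `int8_mask` and `dtype` are not modeled at all. The sketch follows the usual TIES steps: take each model's difference from the base, keep only the largest-magnitude fraction (`density`), elect a sign per parameter, and combine the entries that agree with that sign.

```python
def trim(delta, density):
    """Keep the `density` fraction of entries with the largest magnitude; zero the rest."""
    k = round(density * len(delta))
    order = sorted(range(len(delta)), key=lambda i: abs(delta[i]), reverse=True)
    keep = set(order[:k])
    return [d if i in keep else 0.0 for i, d in enumerate(delta)]


def sign(x):
    """Return -1, 0 or 1."""
    return (x > 0) - (x < 0)


def ties_merge(base, models, normalize=True):
    """Toy TIES merge. `models` is a list of (params, density, weight) tuples."""
    # Step 1: task vectors (model minus base), trimmed by density, scaled by weight.
    scaled = []
    for params, density, weight in models:
        delta = [p - b for p, b in zip(params, base)]
        scaled.append(([weight * d for d in trim(delta, density)], weight))

    merged = []
    for i, b in enumerate(base):
        # Step 2: elect a sign from the summed weighted deltas at this position.
        elected = sign(sum(vec[i] for vec, _ in scaled))
        # Step 3: keep only non-zero entries whose sign matches the elected one.
        agreeing = [(vec[i], w) for vec, w in scaled
                    if vec[i] != 0 and sign(vec[i]) == elected]
        total = sum(v for v, _ in agreeing)
        if normalize and agreeing:
            # Divide by the summed weights of the contributing models.
            total /= sum(w for _, w in agreeing)
        merged.append(b + total)
    return merged


# Hypothetical toy "weights" standing in for the two checkpoints.
celeste = [1.0, 2.0, 3.0, 4.0, 5.0, 6.0, 7.0, 8.0, 9.0, 10.0]
magnum = [1.5, -2.0, 3.25, 6.0, 5.0, 5.875, 8.0, 8.0, 8.5, 10.0625]

merged = ties_merge(
    base=celeste,
    models=[(magnum, 0.3, 0.5), (celeste, 0.7, 0.5)],  # density/weight from the config
    normalize=True,
)
print(merged)
# [1.0, -2.0, 3.0, 6.0, 5.0, 6.0, 8.0, 8.0, 9.0, 10.0]
```

In this toy version, Celeste's difference from the base is zero everywhere because it is the base, so the result is Celeste with the three largest Magnum differences (30% of ten entries) applied on top. With `normalize=True` the 0.5 weight cancels out for those entries; with `normalize=False` they would be applied at half strength. Whether the real merge behaves exactly this way depends on mergekit's internals, which this sketch only approximates.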