Svak commited on
Commit
dbc40a5
·
verified ·
1 Parent(s): f02b325

Upload README.md with huggingface_hub

Browse files
Files changed (1) hide show
  1. README.md +52 -0
README.md ADDED
@@ -0,0 +1,52 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ base_model:
3
+ - Sao10K/14B-Qwen2.5-Kunou-v1
4
+ - Qwen/Qwen2.5-14B-Instruct
5
+ - huihui-ai/Qwen2.5-14B-Instruct-abliterated-v2
6
+ - v000000/Qwen2.5-Lumen-14B
7
+ - v000000/Qwen2.5-14B-Gutenberg-1e-Delta
8
+ library_name: transformers
9
+ tags:
10
+ - mergekit
11
+ - merge
12
+
13
+ ---
14
+ # svakiemerge3
15
+
16
+ This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
17
+
18
+ ## Merge Details
19
+ ### Merge Method
20
+
21
+ This model was merged using the [Model Stock](https://arxiv.org/abs/2403.19522) merge method using [Qwen/Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct) as a base.
22
+
23
+ ### Models Merged
24
+
25
+ The following models were included in the merge:
26
+ * [Sao10K/14B-Qwen2.5-Kunou-v1](https://huggingface.co/Sao10K/14B-Qwen2.5-Kunou-v1)
27
+ * [huihui-ai/Qwen2.5-14B-Instruct-abliterated-v2](https://huggingface.co/huihui-ai/Qwen2.5-14B-Instruct-abliterated-v2)
28
+ * [v000000/Qwen2.5-Lumen-14B](https://huggingface.co/v000000/Qwen2.5-Lumen-14B)
29
+ * [v000000/Qwen2.5-14B-Gutenberg-1e-Delta](https://huggingface.co/v000000/Qwen2.5-14B-Gutenberg-1e-Delta)
30
+
31
+ ### Configuration
32
+
33
+ The following YAML configuration was used to produce this model:
34
+
35
+ ```yaml
36
+ name: Qwen2.5-14B-Infermablit
37
+ merge_method: model_stock
38
+ base_model: Qwen/Qwen2.5-14B-Instruct
39
+ tokenizer_source: base
40
+ parameters:
41
+ int8_mask: true
42
+ normalize: true
43
+ rescale: false
44
+ models:
45
+ - model: v000000/Qwen2.5-Lumen-14B
46
+ - model: v000000/Qwen2.5-14B-Gutenberg-1e-Delta
47
+ - model: Sao10K/14B-Qwen2.5-Kunou-v1
48
+ - model: huihui-ai/Qwen2.5-14B-Instruct-abliterated-v2
49
+ dtype: bfloat16
50
+ out_dtype: bfloat16
51
+
52
+ ```