Update README.md
Browse files
README.md
CHANGED
@@ -7,7 +7,8 @@ library_name: transformers
|
|
7 |
tags:
|
8 |
- mergekit
|
9 |
- merge
|
10 |
-
|
|
|
11 |
---
|
12 |
# merge
|
13 |
|
@@ -21,7 +22,7 @@ This model was merged using the SLERP merge method.
|
|
21 |
### Models Merged
|
22 |
|
23 |
The following models were included in the merge:
|
24 |
-
* [v000000/Qwen2.5-14B-Gutenberg-
|
25 |
* [Qwen/Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)
|
26 |
|
27 |
### Configuration
|
@@ -32,9 +33,9 @@ The following YAML configuration was used to produce this model:
|
|
32 |
models:
|
33 |
- model: Qwen/Qwen2.5-14B-Instruct
|
34 |
merge_method: slerp
|
35 |
-
base_model: v000000/Qwen2.5-14B-Gutenberg-
|
36 |
parameters:
|
37 |
t:
|
38 |
- value: [0, 0, 0.3, 0.4, 0.5, 0.6, 0.5, 0.4, 0.3, 0, 0]
|
39 |
dtype: bfloat16
|
40 |
-
```
|
|
|
7 |
tags:
|
8 |
- mergekit
|
9 |
- merge
|
10 |
+
- qwen2
|
11 |
+
- dpo
|
12 |
---
|
13 |
# merge
|
14 |
|
|
|
22 |
### Models Merged
|
23 |
|
24 |
The following models were included in the merge:
|
25 |
+
* [v000000/Qwen2.5-14B-Gutenberg-1e-Delta](https://huggingface.co/v000000/Qwen2.5-14B-Gutenberg-1e-Delta)
|
26 |
* [Qwen/Qwen2.5-14B-Instruct](https://huggingface.co/Qwen/Qwen2.5-14B-Instruct)
|
27 |
|
28 |
### Configuration
|
|
|
33 |
models:
|
34 |
- model: Qwen/Qwen2.5-14B-Instruct
|
35 |
merge_method: slerp
|
36 |
+
base_model: v000000/Qwen2.5-14B-Gutenberg-1e-Delta
|
37 |
parameters:
|
38 |
t:
|
39 |
- value: [0, 0, 0.3, 0.4, 0.5, 0.6, 0.5, 0.4, 0.3, 0, 0]
|
40 |
dtype: bfloat16
|
41 |
+
```
|