Commit c4f29a3 (parent: cef5451), committed by Locutusque: Create README.md
README.md (added)

---
license: apache-2.0
datasets:
- Locutusque/inst_mix_v2_top_100k
language:
- en
pipeline_tag: text-generation
---
# LocutusqueXFelladrin-TinyMistral248M-Instruct

This model was created by merging Locutusque/TinyMistral-248M-Instruct and Felladrin/TinyMistral-248M-SFT-v4 using mergekit. After the merge, the resulting model was fine-tuned on roughly 20,000 examples from the Locutusque/inst_mix_v2_top_100k dataset at a low learning rate to further normalize the weights. The following YAML config was used for the merge:

```yaml
models:
  - model: Felladrin/TinyMistral-248M-SFT-v4
    parameters:
      weight: 0.5
  - model: Locutusque/TinyMistral-248M-Instruct
    parameters:
      weight: 1.0
merge_method: linear
dtype: float16
```
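
To illustrate what a linear merge with these weights does, here is a minimal sketch in plain Python. It is not mergekit's implementation: the `linear_merge` helper, the toy parameter name `layer.weight`, and the three-element "tensors" are hypothetical, and the sketch assumes the weights are normalized to sum to 1 (which is understood to be mergekit's default for the linear method, so 0.5 and 1.0 would act as 1/3 and 2/3).

```python
def linear_merge(state_dicts, weights, normalize=True):
    """Weighted average of matching parameters across models.

    Illustrative helper only (not part of mergekit): each state dict maps
    a parameter name to a flat list of floats.
    """
    if len(state_dicts) != len(weights):
        raise ValueError("need exactly one weight per model")

    # With normalize=True the weights are rescaled to sum to 1
    # (assumed here to mirror the default behaviour of a linear merge).
    scale = sum(weights) if normalize else 1.0

    merged = {}
    for name in state_dicts[0]:
        tensors = [sd[name] for sd in state_dicts]
        merged[name] = [
            sum(w * t[i] for w, t in zip(weights, tensors)) / scale
            for i in range(len(tensors[0]))
        ]
    return merged


# Toy stand-ins for the two parent models (hypothetical values).
sft = {"layer.weight": [1.0, 2.0, 3.0]}       # weight 0.5 in the config
instruct = {"layer.weight": [4.0, 5.0, 6.0]}  # weight 1.0 in the config

merged = linear_merge([sft, instruct], [0.5, 1.0])
print(merged)  # {'layer.weight': [3.0, 4.0, 5.0]}
```

Under that normalization assumption, every merged parameter sits two thirds of the way from the SFT-v4 value toward the Instruct value, which is why the result leans toward Locutusque/TinyMistral-248M-Instruct.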

The resulting model combines the strengths of both parents: the coding and reasoning skills of Locutusque/TinyMistral-248M-Instruct, and the low hallucination rate and instruction-following ability of Felladrin/TinyMistral-248M-SFT-v4.

## Evaluation

Coming soon...