Locutusque commited on
Commit
c4f29a3
1 Parent(s): cef5451

Create README.md

Browse files
Files changed (1) hide show
  1. README.md +27 -0
README.md ADDED
@@ -0,0 +1,27 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - Locutusque/inst_mix_v2_top_100k
5
+ language:
6
+ - en
7
+ pipeline_tag: text-generation
8
+ ---
9
+ # LocutusqueXFelladrin-TinyMistral248M-Instruct
10
+ This model was created by merging Locutusque/TinyMistral-248M-Instruct and Felladrin/TinyMistral-248M-SFT-v4 using mergekit. After the two models were merged, the resulting model was further trained on ~20,000 examples on the Locutusque/inst_mix_v2_top_100k at a low learning rate to further normalize weights. The following is the YAML config used to merge:
11
+
12
+ ```yaml
13
+ models:
14
+ - model: Felladrin/TinyMistral-248M-SFT-v4
15
+ parameters:
16
+ weight: 0.5
17
+ - model: Locutusque/TinyMistral-248M-Instruct
18
+ parameters:
19
+ weight: 1.0
20
+ merge_method: linear
21
+ dtype: float16
22
+ ```
23
+
24
+ The resulting model combines the best of both worlds. With Locutusque/TinyMistral-248M-Instruct's coding capabilities and reasoning skills, and Felladrin/TinyMistral-248M-SFT-v4's low hallucination and instruction-following capabilities.
25
+
26
+ ## Evaluation
27
+ Coming soon...