Commit 14128e0 by Locutusque (parent: 11130c4): Update README.md
README.md CHANGED

@@ -29,7 +29,7 @@ inference:
   top_p: 0.34
   top_k: 30
   max_new_tokens: 250
-  repetition_penalty: 1.
+  repetition_penalty: 1.16
 ---
 # LocutusqueXFelladrin-TinyMistral248M-Instruct
 This model was created by merging Locutusque/TinyMistral-248M-Instruct and Felladrin/TinyMistral-248M-SFT-v4 using mergekit. After the two models were merged, the resulting model was further trained on ~20,000 examples from the Locutusque/inst_mix_v2_top_100k dataset at a low learning rate to further normalize weights. The following is the YAML config used to merge:
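The fix matters because a penalty of 1.0 is a no-op (the truncated `1.` parses as 1.0), while 1.16 actively discourages repeated tokens. A minimal sketch of the standard CTRL-style rescaling (the scheme used by Hugging Face transformers' `RepetitionPenaltyLogitsProcessor`); the function name and toy logits here are illustrative, not from the model card:

```python
def apply_repetition_penalty(logits, generated_ids, penalty):
    """Rescale logits of tokens that already appear in the output
    (CTRL-style repetition penalty): positive scores are divided by
    the penalty, negative scores are multiplied by it, so repeats
    become less likely either way."""
    out = list(logits)
    for tok in set(generated_ids):
        if out[tok] > 0:
            out[tok] /= penalty  # shrink a favorable score
        else:
            out[tok] *= penalty  # push an unfavorable score further down
    return out

logits = [2.0, -1.0, 0.5]
# penalty = 1.0 leaves the logits unchanged -> [2.0, -1.0, 0.5]
print(apply_repetition_penalty(logits, [0, 1], 1.0))
# penalty = 1.16 penalizes tokens 0 and 1, which were already generated
print(apply_repetition_penalty(logits, [0, 1], 1.16))
```

With `penalty=1.16`, token 0 drops from 2.0 to 2.0/1.16 and token 1 from -1.0 to -1.16, while token 2 (not yet generated) is untouched.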