matchaaaaa committed
Commit
fa4f53e
1 parent: d27118d

Update README.md

Files changed (1): README.md (+15 −1)
@@ -67,12 +67,26 @@ slices:
   - model: SanjiWatsuki/Kunoichi-7B
     layer_range: [8, 16]
 - sources:
-  - model: Mytho-Lemon-11B # my own merge of MythoMist-7B and LemonadeRP-4.5.3
+  - model: Mytho-Lemon-11B # my own merge (see below).
     layer_range: [8, 48]
 merge_method: passthrough
 dtype: bfloat16
+```
+
+And here's Mytho-Lemon-11B. Yep, named it backwards.
 
+```yaml
+slices:
+- sources:
+  - model: KatyTheCutie/LemonadeRP-4.5.3
+    layer_range: [0, 24]
+- sources:
+  - model: Gryphe/MythoMist-7B # manually added tokenizer files
+    layer_range: [8, 32]
+merge_method: passthrough
+dtype: bfloat16
 ```
+
 It's a lot better than v1 :skull:
 
 So, the idea was to start with Fimbulvetr-11B-v2, a super solid RP model that punches wayyy above its weight, especially in coherence, reasoning, and even spatial awareness. Keeping its layers intact is apparently somewhat unusual, but I wanted it closest to the input layers; I thought that would improve logic and open the door for more creativity later in the stack. I added Kunoichi next for its context and instruction-following skills, which worked very well in v1. Lastly, I used a frankenmerge of MythoMist and LemonadeRP for the last layers. These are pretty creative models with solid writing: MythoMist in theory gives the model flavor and verbosity, and LemonadeRP was recommended by a friend, and I thought it really complemented the rest of the mix quite nicely!
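
Putting the pieces together, the recipe is a two-stage passthrough stack: Mytho-Lemon-11B is built first, then stacked after the Fimbulvetr and Kunoichi slices. A minimal sketch of the full top-level config follows — note the diff hunk above starts at the Kunoichi slice, so the first slice here (the Fimbulvetr repo path and its layer_range) is my assumption for illustration, not the actual values from the card:

```yaml
# Sketch only: the visible hunk begins at the Kunoichi slice, so the
# Fimbulvetr entry below (repo path and layer_range) is assumed.
slices:
- sources:
  - model: Sao10K/Fimbulvetr-11B-v2  # repo path assumed
    layer_range: [0, 8]              # illustrative range only
- sources:
  - model: SanjiWatsuki/Kunoichi-7B
    layer_range: [8, 16]
- sources:
  - model: Mytho-Lemon-11B           # built from the second config above
    layer_range: [8, 48]
merge_method: passthrough
dtype: bfloat16
```

With passthrough, mergekit simply concatenates the listed layer slices in order, which is why Mytho-Lemon-11B's own config (LemonadeRP layers 0–24 stacked with MythoMist layers 8–32) yields a 48-layer, roughly 11B model from two 7B parents.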