schnapper79 commited on
Commit
9ec8a55
1 Parent(s): 41a960f

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -14
README.md CHANGED
@@ -1,26 +1,43 @@
1
  ---
 
 
 
2
  base_model: []
3
  library_name: transformers
4
  tags:
5
  - mergekit
6
- - merge
7
 
8
  ---
9
- # merge
10
 
11
- This is a merge of pre-trained language models created using [mergekit](https://github.com/cg123/mergekit).
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
12
 
13
  ## Merge Details
14
  ### Merge Method
15
 
16
- This model was merged using the della_linear merge method using /workspace/text-generation-webui/models/mistralai_Mistral-Large-Instruct-2407 as a base.
17
-
18
- ### Models Merged
19
 
20
- The following models were included in the merge:
21
- * /workspace/text-generation-webui/models/migtissera_Tess-3-Mistral-Large-2-123B
22
- * /workspace/text-generation-webui/models/FluffyKaeloky_Luminum-v0.1-123B
23
- * /workspace/text-generation-webui/models/anthracite-org_magnum-v2-123b
24
 
25
  ### Configuration
26
 
@@ -28,20 +45,20 @@ The following YAML configuration was used to produce this model:
28
 
29
  ```yaml
30
  models:
31
- - model: /workspace/text-generation-webui/models/anthracite-org_magnum-v2-123b
32
  parameters:
33
  weight: 0.19
34
  density: 0.5
35
- - model: /workspace/text-generation-webui/models/FluffyKaeloky_Luminum-v0.1-123B
36
  parameters:
37
  weight: 0.34
38
  density: 0.8
39
- - model: /workspace/text-generation-webui/models/migtissera_Tess-3-Mistral-Large-2-123B
40
  parameters:
41
  weight: 0.24
42
  density: 0.7
43
  merge_method: della_linear
44
- base_model: /workspace/text-generation-webui/models/mistralai_Mistral-Large-Instruct-2407
45
  parameters:
46
  epsilon: 0.05
47
  lambda: 1
 
1
  ---
2
+ license: other
3
+ license_name: mistral-ai-research-licence
4
+ license_link: https://mistral.ai/licenses/MRL-0.1.md
5
  base_model: []
6
  library_name: transformers
7
  tags:
8
  - mergekit
9
+ - lumikabra-123B
10
 
11
  ---
 
12
 
13
+ ## exl2 8.0 quant
14
+
15
+ # lumikabra-123B
16
+
17
+
18
+ <div style="width: auto; margin-left: auto; margin-right: auto; margin-bottom: 3cm">
19
+ <img src="https://huggingface.co/schnapper79/lumikabra-123B_v0.1/resolve/main/artspace-ai-1725345028689.png" alt="Lumikabra" style="width: 100%; min-width: 400px; display: block; margin: auto;">
20
+ </div>
21
+
22
+ This is lumikabra. It's based on [Mistral-Large-Instruct-2407 ](https://huggingface.co/mistralai/Mistral-Large-Instruct-2407), merged with Magnum-v2-123B, Luminum-v0.1-123B and Tess-3-Mistral-Large-2-123B.
23
+
24
+ I shamelessly took this idea from [FluffyKaeloky](FluffyKaeloky/Luminum-v0.1-123B). Like him, i always had my troubles with each of the current large mistral based models.
25
+ Either it gets repetitive, shows too many GPTisms, is too horny or too unhorny. RP and storytelling is always a matter of taste, and i found myself swiping too often for new answers or even fixing them when I missed a little spice or cleverness.
26
+
27
+ Luminum was a great improvement, mixing a lot of desired traits, but I still missed some spice, another sauce.
28
+ So i took Luminum, added magnum again and also Tess for knowledge and structure.
29
+
30
+ This is my very first merge, but I feel it has some potential. I was honestly a little surprised when my character was killed in RP just because it kinda fitted the story. So I guess, dark is a possible theme.
31
+
32
+
33
+
34
+
35
 
36
  ## Merge Details
37
  ### Merge Method
38
 
39
+ This model was merged using [mergekit](https://github.com/cg123/mergekit) with the della_linear merge method using mistralai_Mistral-Large-Instruct-2407 as a base.
 
 
40
 
 
 
 
 
41
 
42
  ### Configuration
43
 
 
45
 
46
  ```yaml
47
  models:
48
+ - model: anthracite-org_magnum-v2-123b
49
  parameters:
50
  weight: 0.19
51
  density: 0.5
52
+ - model: FluffyKaeloky_Luminum-v0.1-123B
53
  parameters:
54
  weight: 0.34
55
  density: 0.8
56
+ - model: migtissera_Tess-3-Mistral-Large-2-123B
57
  parameters:
58
  weight: 0.24
59
  density: 0.7
60
  merge_method: della_linear
61
+ base_model: mistralai_Mistral-Large-Instruct-2407
62
  parameters:
63
  epsilon: 0.05
64
  lambda: 1