aashish1904 committed on
Commit
18703c1
1 Parent(s): c47c4ba

Upload README.md with huggingface_hub

Files changed (1)
  1. README.md +209 -0
README.md ADDED
@@ -0,0 +1,209 @@
1
+
2
+ ---
3
+
4
+ language:
5
+ - en
6
+ - fr
7
+ - de
8
+ - es
9
+ - it
10
+ - pt
11
+ - ru
12
+ - zh
13
+ - ja
14
+ license: apache-2.0
15
+ tags:
16
+ - merge
17
+ datasets:
18
+ - Epiculous/SynthRP-Gens-v1.1-Filtered-n-Cleaned
19
+ - anthracite-org/stheno-filtered-v1.1
20
+ - PJMixers/hieunguyenminh_roleplay-deduped-ShareGPT
21
+ - Gryphe/Sonnet3.5-Charcard-Roleplay
22
+ - Epiculous/Synthstruct-Gens-v1.1-Filtered-n-Cleaned
23
+ - anthracite-org/kalo-opus-instruct-22k-no-refusal
24
+ - anthracite-org/nopm_claude_writing_fixed
25
+ - anthracite-org/kalo_opus_misc_240827
26
+ pipeline_tag: text-generation
27
+ model-index:
28
+ - name: Violet_Twilight-v0.2
29
+   results:
30
+   - task:
31
+       type: text-generation
32
+       name: Text Generation
33
+     dataset:
34
+       name: IFEval (0-Shot)
35
+       type: HuggingFaceH4/ifeval
36
+       args:
37
+         num_few_shot: 0
38
+     metrics:
39
+     - type: inst_level_strict_acc and prompt_level_strict_acc
40
+       value: 45.32
41
+       name: strict accuracy
42
+     source:
43
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Epiculous/Violet_Twilight-v0.2
44
+       name: Open LLM Leaderboard
45
+   - task:
46
+       type: text-generation
47
+       name: Text Generation
48
+     dataset:
49
+       name: BBH (3-Shot)
50
+       type: BBH
51
+       args:
52
+         num_few_shot: 3
53
+     metrics:
54
+     - type: acc_norm
55
+       value: 23.94
56
+       name: normalized accuracy
57
+     source:
58
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Epiculous/Violet_Twilight-v0.2
59
+       name: Open LLM Leaderboard
60
+   - task:
61
+       type: text-generation
62
+       name: Text Generation
63
+     dataset:
64
+       name: MATH Lvl 5 (4-Shot)
65
+       type: hendrycks/competition_math
66
+       args:
67
+         num_few_shot: 4
68
+     metrics:
69
+     - type: exact_match
70
+       value: 2.72
71
+       name: exact match
72
+     source:
73
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Epiculous/Violet_Twilight-v0.2
74
+       name: Open LLM Leaderboard
75
+   - task:
76
+       type: text-generation
77
+       name: Text Generation
78
+     dataset:
79
+       name: GPQA (0-shot)
80
+       type: Idavidrein/gpqa
81
+       args:
82
+         num_few_shot: 0
83
+     metrics:
84
+     - type: acc_norm
85
+       value: 2.13
86
+       name: acc_norm
87
+     source:
88
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Epiculous/Violet_Twilight-v0.2
89
+       name: Open LLM Leaderboard
90
+   - task:
91
+       type: text-generation
92
+       name: Text Generation
93
+     dataset:
94
+       name: MuSR (0-shot)
95
+       type: TAUR-Lab/MuSR
96
+       args:
97
+         num_few_shot: 0
98
+     metrics:
99
+     - type: acc_norm
100
+       value: 13.61
101
+       name: acc_norm
102
+     source:
103
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Epiculous/Violet_Twilight-v0.2
104
+       name: Open LLM Leaderboard
105
+   - task:
106
+       type: text-generation
107
+       name: Text Generation
108
+     dataset:
109
+       name: MMLU-PRO (5-shot)
110
+       type: TIGER-Lab/MMLU-Pro
111
+       config: main
112
+       split: test
113
+       args:
114
+         num_few_shot: 5
115
+     metrics:
116
+     - type: acc
117
+       value: 23.45
118
+       name: accuracy
119
+     source:
120
+       url: https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard?query=Epiculous/Violet_Twilight-v0.2
121
+       name: Open LLM Leaderboard
122
+
123
+ ---
124
+
125
+ [![QuantFactory Banner](https://lh7-rt.googleusercontent.com/docsz/AD_4nXeiuCm7c8lEwEJuRey9kiVZsRn2W-b4pWlu3-X534V3YmVuVc2ZL-NXg2RkzSOOS2JXGHutDuyyNAUtdJI65jGTo8jT9Y99tMi4H4MqL44Uc5QKG77B0d6-JfIkZHFaUA71-RtjyYZWVIhqsNZcx8-OMaA?key=xt3VSDoCbmTY7o-cwwOFwQ)](https://hf.co/QuantFactory)
126
+
127
+
128
+ # QuantFactory/Violet_Twilight-v0.2-GGUF
129
+ This is a quantized version of [Epiculous/Violet_Twilight-v0.2](https://huggingface.co/Epiculous/Violet_Twilight-v0.2) created using llama.cpp.
130
+
131
+ # Original Model Card
132
+
133
+
134
+ ![image/png](https://cdn-uploads.huggingface.co/production/uploads/64adfd277b5ff762771e4571/P962FQhRG4I8nbU_DJolY.png)
135
+
136
+ Now for something a bit different, Violet_Twilight-v0.2! This model is a SLERP merge of Azure_Dusk-v0.2 and Crimson_Dawn-v0.2!
137
+
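For readers unfamiliar with the term, SLERP (spherical linear interpolation) blends two vectors along the arc between them rather than along a straight line. The sketch below is a minimal pure-Python illustration on toy vectors, not the code that produced this merge; the function name `slerp` and the `eps` threshold are assumptions made for the example, and both inputs are assumed to be non-zero.

```python
import math


def slerp(t, v0, v1, eps=1e-8):
    """Spherically interpolate between two equal-length, non-zero vectors.

    t = 0 returns v0 and t = 1 returns v1 (up to floating-point error).
    Illustrative sketch only; real merge tools operate on whole weight tensors.
    """
    norm0 = math.sqrt(sum(x * x for x in v0))
    norm1 = math.sqrt(sum(x * x for x in v1))

    # Cosine of the angle between the two vectors, clamped for acos safety.
    dot = sum(a * b for a, b in zip(v0, v1)) / (norm0 * norm1)
    dot = max(-1.0, min(1.0, dot))

    theta = math.acos(dot)
    sin_theta = math.sin(theta)

    if abs(sin_theta) < eps:
        # Vectors are (anti)parallel: fall back to plain linear interpolation.
        return [(1 - t) * a + t * b for a, b in zip(v0, v1)]

    w0 = math.sin((1 - t) * theta) / sin_theta
    w1 = math.sin(t * theta) / sin_theta
    return [w0 * a + w1 * b for a, b in zip(v0, v1)]


if __name__ == "__main__":
    # Halfway between two perpendicular unit vectors stays on the unit circle.
    print(slerp(0.5, [1.0, 0.0], [0.0, 1.0]))
```

Unlike a plain average, the midpoint of two perpendicular unit vectors keeps unit length here, which is the usual motivation given for using SLERP when blending weights.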
138
+ # Quants!
139
+ <strong>full</strong> / [exl2](https://huggingface.co/Epiculous/Violet_Twilight-v0.2-exl2) / [gguf](https://huggingface.co/Epiculous/Violet_Twilight-v0.2-GGUF)
140
+
141
+ ## Prompting
142
+ The v0.2 models are trained on ChatML; the prompting structure goes a little something like this:
143
+
144
+ ```
145
+ <|im_start|>user
146
+ Hi there!<|im_end|>
147
+ <|im_start|>assistant
148
+ Nice to meet you!<|im_end|>
149
+ <|im_start|>user
150
+ Can I ask a question?<|im_end|>
151
+ <|im_start|>assistant
152
+ ```
153
+
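The layout above can be assembled programmatically. The following is a minimal sketch, assuming messages are plain dicts with `role` and `content` keys; the helper name `build_chatml_prompt` is hypothetical, and many frontends and tokenizers already ship their own ChatML template that should be preferred when available.

```python
def build_chatml_prompt(messages):
    """Render a list of {"role", "content"} dicts as a ChatML prompt string.

    The prompt ends with an open assistant turn so the model writes the reply.
    Hypothetical helper for illustration; it mirrors the example layout only.
    """
    parts = []
    for message in messages:
        parts.append(
            f"<|im_start|>{message['role']}\n{message['content']}<|im_end|>\n"
        )
    # Leave the final assistant turn open for generation.
    parts.append("<|im_start|>assistant\n")
    return "".join(parts)


if __name__ == "__main__":
    conversation = [
        {"role": "user", "content": "Hi there!"},
        {"role": "assistant", "content": "Nice to meet you!"},
        {"role": "user", "content": "Can I ask a question?"},
    ]
    print(build_chatml_prompt(conversation))
```

Stopping generation at `<|im_end|>` is the usual companion setting, since that token closes the assistant turn.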
154
+ ### Context and Instruct
155
+ Because the v0.2 models are trained on ChatML, please use the ChatML Context and Instruct templates.
156
+
157
+ ### Current Top Sampler Settings
158
+ [Spicy_Temp](https://files.catbox.moe/9npj0z.json) <br/>
159
+ [Violet_Twilight-Nitral-Special](https://files.catbox.moe/ot54u3.json) <br/>
160
+
161
+ ## Merging
162
+ The following config was used to merge Azure Dusk and Crimson Dawn:
163
+ ```yaml
164
+ slices:
165
+   - sources:
166
+       - model: Epiculous/Azure_Dusk-v0.2
167
+         layer_range: [0, 40]
168
+       - model: Epiculous/Crimson_Dawn-V0.2
169
+         layer_range: [0, 40]
170
+ merge_method: slerp
171
+ base_model: Epiculous/Azure_Dusk-v0.2
172
+ parameters:
173
+   t:
174
+     - filter: self_attn
175
+       value: [0, 0.5, 0.3, 0.7, 1]
176
+     - filter: mlp
177
+       value: [1, 0.5, 0.7, 0.3, 0]
178
+     - value: 0.5 # fallback for rest of tensors
179
+ dtype: bfloat16
180
+
181
+ ```
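To make the `t` lists in the config easier to read, here is a rough sketch of how such a list can be expanded into one value per layer. It assumes the list is treated as evenly spaced anchor points that are linearly interpolated across the 40 layers, which is how gradient-style values in mergekit are commonly described; the helper names `gradient_value` and `t_for` are hypothetical, and this is an illustration rather than mergekit's own code.

```python
NUM_LAYERS = 40  # from layer_range: [0, 40] in the config

SELF_ATTN_T = [0, 0.5, 0.3, 0.7, 1]
MLP_T = [1, 0.5, 0.7, 0.3, 0]
FALLBACK_T = 0.5  # "fallback for rest of tensors"


def gradient_value(anchors, layer_idx, num_layers=NUM_LAYERS):
    """Linearly interpolate evenly spaced anchor values at a given layer.

    Assumption: layer 0 maps to the first anchor, the last layer to the last.
    """
    position = layer_idx / (num_layers - 1)  # 0.0 at first layer, 1.0 at last
    scaled = position * (len(anchors) - 1)
    lower = int(scaled)
    upper = min(lower + 1, len(anchors) - 1)
    frac = scaled - lower
    return (1 - frac) * anchors[lower] + frac * anchors[upper]


def t_for(tensor_name, layer_idx):
    """Pick the interpolation weight for a tensor by substring filter."""
    if "self_attn" in tensor_name:
        return gradient_value(SELF_ATTN_T, layer_idx)
    if "mlp" in tensor_name:
        return gradient_value(MLP_T, layer_idx)
    return FALLBACK_T


if __name__ == "__main__":
    for layer in (0, 10, 20, 30, 39):
        attn = t_for("self_attn.q_proj.weight", layer)
        mlp = t_for("mlp.up_proj.weight", layer)
        print(f"layer {layer:2d}: self_attn t={attn:.3f}  mlp t={mlp:.3f}")
```

Under the usual convention that `t = 0` keeps the base model (here Azure_Dusk-v0.2) and `t = 1` takes the other model, the two lists mirror each other: where attention leans toward one parent, the MLP leans toward the other.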
182
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
183
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_Epiculous__Violet_Twilight-v0.2)
184
+
185
+ | Metric |Value|
186
+ |-------------------|----:|
187
+ |Avg. |18.53|
188
+ |IFEval (0-Shot) |45.32|
189
+ |BBH (3-Shot) |23.94|
190
+ |MATH Lvl 5 (4-Shot)| 2.72|
191
+ |GPQA (0-shot) | 2.13|
192
+ |MuSR (0-shot) |13.61|
193
+ |MMLU-PRO (5-shot) |23.45|
194
+
195
+
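As a quick check on the table, the "Avg." row matches the plain arithmetic mean of the six benchmark scores. The snippet below is a small sketch of that calculation; the dictionary simply restates the values from the table.

```python
# Scores copied from the table above.
scores = {
    "IFEval (0-Shot)": 45.32,
    "BBH (3-Shot)": 23.94,
    "MATH Lvl 5 (4-Shot)": 2.72,
    "GPQA (0-shot)": 2.13,
    "MuSR (0-shot)": 13.61,
    "MMLU-PRO (5-shot)": 23.45,
}

# Unweighted mean across the six benchmarks.
average = sum(scores.values()) / len(scores)

print(f"Avg. {average:.2f}")  # prints "Avg. 18.53"
```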