Commit
•
5c32987
1
Parent(s):
e649b5c
Update Readme.md with Open LLM LB scores (#2)
Browse files- Update Readme.md with Open LLM LB scores (4914592b0427b013ef6d5688d0ae5f64fc0cbe17)
Co-authored-by: Fodor <gaborfodor@users.noreply.huggingface.co>
README.md
CHANGED
@@ -185,12 +185,12 @@ LlamaForCausalLM(
|
|
185 |
### 🤗 Open LLM Leaderboard v2
|
186 |
| Benchmark | acc_n |
|
187 |
|:--------------|:--------:|
|
188 |
-
| Average |
|
189 |
-
| IFEval |
|
190 |
-
| BBH |
|
191 |
-
| MATH Lvl 5 |
|
192 |
-
| GPQA |
|
193 |
-
| MUSR | 10.
|
194 |
| MML-PRO | 19.1 |
|
195 |
|
196 |
### 🤗 Open LLM Leaderboard v1
|
@@ -202,7 +202,7 @@ LlamaForCausalLM(
|
|
202 |
| Hellaswag | 79.05 |
|
203 |
| MMLU | 55.61 |
|
204 |
| TruthfulQA | 46.84 |
|
205 |
-
| Winogrande | 75.93
|
206 |
| GSM8K | 51.18 |
|
207 |
|
208 |
### MT-Bench
|
|
|
185 |
### 🤗 Open LLM Leaderboard v2
|
186 |
| Benchmark | acc_n |
|
187 |
|:--------------|:--------:|
|
188 |
+
| Average | 16.21 |
|
189 |
+
| IFEval | 50.21 |
|
190 |
+
| BBH | 10.94 |
|
191 |
+
| MATH Lvl 5 | 2.11 |
|
192 |
+
| GPQA | 4.7 |
|
193 |
+
| MUSR | 10.2 |
|
194 |
| MML-PRO | 19.1 |
|
195 |
|
196 |
### 🤗 Open LLM Leaderboard v1
|
|
|
202 |
| Hellaswag | 79.05 |
|
203 |
| MMLU | 55.61 |
|
204 |
| TruthfulQA | 46.84 |
|
205 |
+
| Winogrande | 75.93 |
|
206 |
| GSM8K | 51.18 |
|
207 |
|
208 |
### MT-Bench
|