Pankaj Mathur committed
Commit dacfc88 · 1 Parent(s): 965ed2a
Update README.md

README.md CHANGED
@@ -23,14 +23,14 @@ I evaluated orca_mini_v2_13b on a wide range of tasks using [Language Model Eval
 Here are the results on metrics used by [HuggingFaceH4 Open LLM Leaderboard](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 
 
-
-
-|**Task**|**
-|*arc_challenge*|
-|*hellaswag*|
-|*mmlu*|
-|*truthfulqa_mc*|
-|*Total Average*|
+||||
+|:------:|:-------------:|:---------:|
+|**Task**|**Value**|**Stderr**|
+|*arc_challenge*|0.5478|0.0145|
+|*hellaswag*|0.7023|0.0040|
+|*mmlu*|0.4969|0.035|
+|*truthfulqa_mc*|0.44|0.0158|
+|*Total Average*|0.54675|0.0114|
 
 
 
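
As a quick sanity check on the added rows, the *Total Average* value can be reproduced as the unweighted mean of the four task scores. The sketch below assumes that is how the figure was derived; the commit itself does not say. The names `scores` and `total_average` are illustrative only and do not come from the README or the evaluation harness. The Stderr in that row is not recomputed, since the source does not state how it was aggregated.

```python
# Values copied from the "Value" column of the added table rows.
scores = {
    "arc_challenge": 0.5478,
    "hellaswag": 0.7023,
    "mmlu": 0.4969,
    "truthfulqa_mc": 0.44,
}


def total_average(values):
    """Unweighted arithmetic mean of the per-task scores (assumed aggregation)."""
    values = list(values)
    return sum(values) / len(values)


average = total_average(scores.values())
print(f"Total Average: {average:.5f}")
```

Under that assumption the result agrees with the 0.54675 reported in the table.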