qnguyen3 leaderboard-pr-bot commited on
Commit
40e603a
1 Parent(s): f81aab9

Adding Evaluation Results (#2)

Browse files

- Adding Evaluation Results (b164c6705acf986e0b721d9a5d624590a229cee6)


Co-authored-by: Open LLM Leaderboard PR Bot <leaderboard-pr-bot@users.noreply.huggingface.co>

Files changed (1) hide show
  1. README.md +18 -4
README.md CHANGED
@@ -1,11 +1,11 @@
1
  ---
2
- library_name: transformers
3
  license: other
4
- license_name: qwen-research
5
- license_link: https://huggingface.co/Qwen/Qwen2.5-3B-Instruct/blob/main/LICENSE
6
- base_model: Qwen/Qwen2.5-3B
7
  tags:
8
  - generated_from_trainer
 
 
 
9
  model-index:
10
  - name: outputs/gelato-3b
11
  results: []
@@ -34,3 +34,17 @@ GGUFs: https://huggingface.co/mradermacher/raspberry-3B-GGUF
34
  |MuSR (0-shot) |40.61|
35
  |MMLU-PRO (5-shot) |28.49|
36
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
  ---
 
2
  license: other
3
+ library_name: transformers
 
 
4
  tags:
5
  - generated_from_trainer
6
+ base_model: Qwen/Qwen2.5-3B
7
+ license_name: qwen-research
8
+ license_link: https://huggingface.co/Qwen/Qwen2.5-3B-Instruct/blob/main/LICENSE
9
  model-index:
10
  - name: outputs/gelato-3b
11
  results: []
 
34
  |MuSR (0-shot) |40.61|
35
  |MMLU-PRO (5-shot) |28.49|
36
 
37
+
38
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
39
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_arcee-ai__raspberry-3B)
40
+
41
+ | Metric |Value|
42
+ |-------------------|----:|
43
+ |Avg. |15.40|
44
+ |IFEval (0-Shot) |31.54|
45
+ |BBH (3-Shot) |19.53|
46
+ |MATH Lvl 5 (4-Shot)| 7.63|
47
+ |GPQA (0-shot) | 3.69|
48
+ |MuSR (0-shot) | 9.41|
49
+ |MMLU-PRO (5-shot) |20.60|
50
+