Files changed (1) hide show
  1. README.md +12 -14
README.md CHANGED
@@ -133,9 +133,19 @@ This model is an advanced iteration of the powerful `meta-llama/Llama-3.2-3B`, s
133
  All GGUF models are available here: [MaziyarPanahi/calme-3.1-llamaloi-3b-GGUF](https://huggingface.co/MaziyarPanahi/calme-3.1-llamaloi-3b-GGUF)
134
 
135
 
136
- # 🏆 [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 
 
 
 
 
 
 
 
 
 
 
137
 
138
- Leaderboard 2 coming soon!
139
 
140
 
141
  # Prompt Template
@@ -182,16 +192,4 @@ model = AutoModelForCausalLM.from_pretrained("MaziyarPanahi/calme-3.1-llamaloi-3
182
  # Ethical Considerations
183
 
184
  As with any large language model, users should be aware of potential biases and limitations. We recommend implementing appropriate safeguards and human oversight when deploying this model in production environments.
185
- # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
186
- Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__calme-3.1-llamaloi-3b)
187
-
188
- | Metric |Value|
189
- |-------------------|----:|
190
- |Avg. |24.01|
191
- |IFEval (0-Shot) |73.75|
192
- |BBH (3-Shot) |23.77|
193
- |MATH Lvl 5 (4-Shot)|16.77|
194
- |GPQA (0-shot) | 4.14|
195
- |MuSR (0-shot) | 1.11|
196
- |MMLU-PRO (5-shot) |24.50|
197
 
 
133
  All GGUF models are available here: [MaziyarPanahi/calme-3.1-llamaloi-3b-GGUF](https://huggingface.co/MaziyarPanahi/calme-3.1-llamaloi-3b-GGUF)
134
 
135
 
136
+ # 🏆 [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
137
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_MaziyarPanahi__calme-3.1-llamaloi-3b)
138
+
139
+ | Metric |Value|
140
+ |-------------------|----:|
141
+ |Avg. |24.01|
142
+ |IFEval (0-Shot) |73.75|
143
+ |BBH (3-Shot) |23.77|
144
+ |MATH Lvl 5 (4-Shot)|16.77|
145
+ |GPQA (0-shot) | 4.14|
146
+ |MuSR (0-shot) | 1.11|
147
+ |MMLU-PRO (5-shot) |24.50|
148
 
 
149
 
150
 
151
  # Prompt Template
 
192
  # Ethical Considerations
193
 
194
  As with any large language model, users should be aware of potential biases and limitations. We recommend implementing appropriate safeguards and human oversight when deploying this model in production environments.
 
 
 
 
 
 
 
 
 
 
 
 
195