MMLU is only 25.64, anything wrong?
I just check https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard and found the metric about this model is really bad, is there anything wrong about the score?
Hey @cloudyu -- I don't see this score or any model with this score average on the leaderboard. Can you specify the model name you are seeing?
Command R plus is not yet on the leaderboard -- it should be on the leaderboard shortly. We submitted it jointly with hugging face yesterday and my understanding is that it will be made public shortly.
I'm going to close this for now -- but feel free to re-open with additional details.
Now MMLU is 75.73 on the leaderboard; that's great.
Thanks @cloudyu ! Our full results on the Open LLM leaderboard is now public on https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard -- here is a quick comparison with a subset of other relevant models whose scores are publicly available on the leaderboard.
Hope this is helpful!