@fdaudens on Hugging Face: "Look at that 👀 Actual benchmarks have become too easy for recent models…"

Hugging Face

Join the conversation

Join the community of Machine Learners and AI enthusiasts.

Back to feed

fdaudens

posted an update Jun 26, 2024

Post

1907

Look at that 👀

Actual benchmarks have become too easy for recent models, much like grading high school students on middle school problems makes little sense. So the team worked on a new version of the Open LLM Leaderboard with new benchmarks.

Stellar work from @clefourrier @SaylorTwift and the team!

👉 Read the blog post: open-llm-leaderboard/blog
👉 Explore the leaderboard: open-llm-leaderboard/open_llm_leaderboard

dillfrescott

Jun 27, 2024

Can't wait to see deepseek coder v2 on there. I have a feeling it will score high. I love that model

In this post

fdaudens Florent Daudens
dillfrescott Cross