12.8k
Open LLM Leaderboard
🏆
Track, rank and evaluate open LLMs and chatbots
Track, rank and evaluate open LLMs and chatbots
Embedding Leaderboard
Display chatbot leaderboard and stats
Explore and analyze code evaluation data
VLMEvalKit Evaluation Results Collection
Explore and benchmark visual document retrieval models
Request evaluation of a speech recognition model
Display and filter leaderboard results for LLM judges