Chatbot Arena Leaderboard
Display chatbot leaderboard and stats
A collection of Leaderboards for LLMs ⚡️⚖️ 🤗
Display chatbot leaderboard and stats
Track, rank and evaluate open LLMs and chatbots
Run a Streamlit web app
View and submit LLM evaluations
Explore LLM performance across hardware
View and submit machine learning model evaluations
Display and explore model leaderboards and chat history
Embedding Leaderboard
Track, rank and evaluate open LLMs' CoT quality
View LLM Performance Leaderboard
Explore and analyze code evaluation data
Display and analyze PyTorch Image Models leaderboard
Browse and submit large language model evaluations
VLMEvalKit Eval Results in video understanding benchmark
A leaderboard for multimodal models
Compare Open LLM Leaderboard results
Explore and benchmark visual document retrieval models
Compare AI models by voting on responses
VLMEvalKit Evaluation Results Collection
Interact with multiple chatbots simultaneously
Official Leaderboard for OmniEval
Submit models for evaluation and view leaderboard
Blind vote on HF TTS models!
Teach, test, evaluate language models with MTEB Arena
Realtime Image/Video Gen AI Arena
Ranking of LLMs for agentic tasks
Request evaluation of a speech recognition model
A Leaderboard that demonstrates LMM reasoning capabilities
A leaderboard for LLMs powering smolagents