Open LLM Leaderboard 2
Track, rank and evaluate open LLMs and chatbots
VLMEvalKit Evaluation Results Collection
More advanced and challenging multi-task evaluation
VLMEvalKit Eval Results on video understanding benchmarks
Compare Open LLM Leaderboard results