Open LLM Leaderboard 2
Track, rank and evaluate open LLMs and chatbots
VLMEvalKit Evaluation Results Collection
Jailbreak the LLM and privacy guardrails
Track, rank and evaluate open Arabic LLMs and chatbots
Evaluate open LLMs in the languages of LATAM and Spain