The Open Medical-LLM Leaderboard: Benchmarking Large Language Models in Healthcare Apr 19, 2024 โข 131
The Hallucinations Leaderboard, an Open Effort to Measure Hallucinations in Large Language Models Jan 29, 2024 โข 19
Running on CPU Upgrade 68 68 Open Ita Llm Leaderboard ๐ Track, rank and evaluate open LLMs in the italian language!
Running on CPU Upgrade 50 50 Open CoT Leaderboard ๐ฅ Track, rank and evaluate open LLMs' CoT quality
Running on CPU Upgrade 12.4k 12.4k Open LLM Leaderboard ๐ Track, rank and evaluate open LLMs and chatbots