benchmark lmlmcat/cmmlu Updated Jul 13, 2023 • 14.5k • 73 nlp-waseda/JMMLU Updated Feb 27, 2024 • 247 • 10 HAERAE-HUB/KMMLU Viewer • Updated Mar 5, 2024 • 244k • 16.2k • 91 openai/openai_humaneval Viewer • Updated Jan 4, 2024 • 164 • 86.6k • 343
benchmark lmlmcat/cmmlu Updated Jul 13, 2023 • 14.5k • 73 nlp-waseda/JMMLU Updated Feb 27, 2024 • 247 • 10 HAERAE-HUB/KMMLU Viewer • Updated Mar 5, 2024 • 244k • 16.2k • 91 openai/openai_humaneval Viewer • Updated Jan 4, 2024 • 164 • 86.6k • 343