Multimodal benchmarks that test various aspects of LLMs, VLMs, LMMs
-
3π
Multimodal Clembench
-
81π
SEED-Bench Leaderboard
-
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI
Paper β’ 2311.16502 β’ Published β’ 35 -
MMMU-Pro: A More Robust Multi-discipline Multimodal Understanding Benchmark
Paper β’ 2409.02813 β’ Published β’ 28