Is Your LLM Secretly a World Model of the Internet? Model-Based Planning for Web Agents • Paper • 2411.06559 • Published Nov 10, 2024 • 16
Navigating the Digital World as Humans Do: Universal Visual Grounding for GUI Agents • Paper • 2410.05243 • Published Oct 7, 2024 • 19
Flatness-Aware Prompt Selection Improves Accuracy and Sample Efficiency • Paper • 2305.10713 • Published May 18, 2023
MMMU: A Massive Multi-discipline Multimodal Understanding and Reasoning Benchmark for Expert AGI • Paper • 2311.16502 • Published Nov 27, 2023 • 37
Multilingual Coreference Resolution in Multiparty Dialogue • Paper • 2208.01307 • Published Aug 2, 2022