VerlTool: Towards Holistic Agentic Reinforcement Learning with Tool Use Paper • 2509.01055 • Published 4 days ago • 57 • 4
StructEval: Benchmarking LLMs' Capabilities to Generate Structural Outputs Paper • 2505.20139 • Published May 26 • 18 • 1
QuickVideo: Real-Time Long Video Understanding with System Algorithm Co-Design Paper • 2505.16175 • Published May 22 • 42 • 3
MMDU: A Multi-Turn Multi-Image Dialog Understanding Benchmark and Instruction-Tuning Dataset for LVLMs Paper • 2406.11833 • Published Jun 17, 2024 • 64 • 6
Benchmarking Multi-Image Understanding in Vision and Language Models: Perception, Knowledge, Reasoning, and Multi-Hop Reasoning Paper • 2406.12742 • Published Jun 18, 2024 • 15 • 5