VisRL: Intention-Driven Visual Perception via Reinforced Reasoning Paper • 2503.07523 • Published Mar 10 • 1
Visual Document Understanding and Question Answering: A Multi-Agent Collaboration Framework with Test-Time Scaling Paper • 2508.03404 • Published Aug 5 • 4
SIFThinker: Spatially-Aware Image Focus for Visual Reasoning Paper • 2508.06259 • Published Aug 8 • 1
Visual Multi-Agent System: Mitigating Hallucination Snowballing via Visual Flow Paper • 2509.21789 • Published 28 days ago • 9
Think with 3D: Geometric Imagination Grounded Spatial Reasoning from Limited Views Paper • 2510.18632 • Published 2 days ago • 19
CodingTeachLLM: Empowering LLM's Coding Ability via AST Prior Knowledge Paper • 2403.15426 • Published Mar 13, 2024
DV-Matcher: Deformation-based Non-Rigid Point Cloud Matching Guided by Pre-trained Visual Features Paper • 2408.08568 • Published Aug 16, 2024
RLFR: Extending Reinforcement Learning for LLMs with Flow Environment Paper • 2510.10201 • Published 12 days ago • 35