Unleashing the Potential of Multimodal LLMs for Zero-Shot Spatio-Temporal Video Grounding • Paper 2509.15178 • Published Sep 18
Shape-for-Motion: Precise and Consistent Video Editing with 3D Proxy • Paper 2506.22432 • Published Jun 27