LLM-Grounder: Open-Vocabulary 3D Visual Grounding with Large Language Model as an Agent • Paper • 2309.12311 • Published Sep 21, 2023
APISR: Anime Production Inspired Real-World Anime Super-Resolution • Paper • 2403.01598 • Published Mar 3, 2024
This&That: Language-Gesture Controlled Video Generation for Robot Planning • Paper • 2407.05530 • Published Jul 8, 2024
VCISR: Blind Single Image Super-Resolution with Video Compression Synthetic Data • Paper • 2311.00996 • Published Nov 2, 2023
3D-GRAND: A Million-Scale Dataset for 3D-LLMs with Better Grounding and Less Hallucination • Paper • 2406.05132 • Published Jun 7, 2024
LU-NeRF: Scene and Pose Estimation by Synchronizing Local Unposed NeRFs • Paper • 2306.05410 • Published Jun 8, 2023