Omni-RGPT: Unifying Image and Video Region-level Understanding via Token Marks Paper • 2501.08326 • Published 2 days ago • 27
CamCo: Camera-Controllable 3D-Consistent Image-to-Video Generation Paper • 2406.02509 • Published Jun 4, 2024 • 9
Compositional Text-to-Image Generation with Dense Blob Representations Paper • 2405.08246 • Published May 14, 2024 • 15
AGG: Amortized Generative 3D Gaussians for Single Image to 3D Paper • 2401.04099 • Published Jan 8, 2024 • 9