Light-A-Video: Training-free Video Relighting via Progressive Light Fusion Paper • 2502.08590 • Published 19 days ago • 39
IDArb: Intrinsic Decomposition for Arbitrary Number of Input Views and Illuminations Paper • 2412.12083 • Published Dec 16, 2024 • 12
InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions Paper • 2412.09596 • Published Dec 12, 2024 • 94
GPT4Point: A Unified Framework for Point-Language Understanding and Generation Paper • 2312.02980 • Published Dec 5, 2023 • 10
Alpha-CLIP: A CLIP Model Focusing on Wherever You Want Paper • 2312.03818 • Published Dec 6, 2023 • 32
HyperDreamer: Hyper-Realistic 3D Content Generation and Editing from a Single Image Paper • 2312.04543 • Published Dec 7, 2023 • 22
Gemini vs GPT-4V: A Preliminary Comparison and Combination of Vision-Language Models Through Qualitative Cases Paper • 2312.15011 • Published Dec 22, 2023 • 17
GPT-4V(ision) is a Human-Aligned Evaluator for Text-to-3D Generation Paper • 2401.04092 • Published Jan 8, 2024 • 21
ComboVerse: Compositional 3D Assets Creation Using Spatially-Aware Diffusion Guidance Paper • 2403.12409 • Published Mar 19, 2024 • 10
Make-it-Real: Unleashing Large Multimodal Model's Ability for Painting 3D Objects with Realistic Materials Paper • 2404.16829 • Published Apr 25, 2024 • 5
Bootstrap3D: Improving 3D Content Creation with Synthetic Data Paper • 2406.00093 • Published May 31, 2024 • 1
MotionClone: Training-Free Motion Cloning for Controllable Video Generation Paper • 2406.05338 • Published Jun 8, 2024 • 40
OmniObject3D: Large-Vocabulary 3D Object Dataset for Realistic Perception, Reconstruction and Generation Paper • 2301.07525 • Published Jan 18, 2023
V3Det Challenge 2024 on Vast Vocabulary and Open Vocabulary Object Detection: Methods and Results Paper • 2406.11739 • Published Jun 17, 2024
Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side Images Paper • 2407.06191 • Published Jul 8, 2024 • 12