LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching
Abstract
Recent advances in text-to-3D generation mark a significant milestone for generative models, unlocking new possibilities for creating imaginative 3D assets across various real-world scenarios. Despite this promise, existing methods often fall short of rendering detailed, high-quality 3D models. The problem is especially prevalent because many methods build on Score Distillation Sampling (SDS). This paper identifies a notable deficiency in SDS: it provides inconsistent and low-quality update directions for the 3D model, which causes over-smoothing. To address this, we propose a novel approach called Interval Score Matching (ISM). ISM employs deterministic diffusing trajectories and uses interval-based score matching to counteract over-smoothing. Furthermore, we incorporate 3D Gaussian Splatting into our text-to-3D generation pipeline. Extensive experiments show that our model largely outperforms the state-of-the-art in quality and training efficiency.
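The abstract does not state the ISM objective, so the following is only a toy sketch of the contrast it describes, under explicit assumptions: that SDS compares the model's noise prediction against a freshly sampled random noise, while ISM compares predictions at two timesteps s = t - delta and t that lie on one deterministic (DDIM-style) trajectory. The names toy_eps, alpha_bar, ddim_step, sds_residual and ism_residual are hypothetical, and toy_eps is a stand-in for a pretrained text-to-image diffusion model, not a real one.

```python
import math
import random


def toy_eps(x, t, cond):
    """Hypothetical stand-in for a diffusion noise predictor eps(x, t, prompt).

    A real pipeline would call a pretrained text-to-image model here.
    """
    shift = 0.5 if cond else 0.0
    return [math.tanh(v) * t + shift for v in x]


def alpha_bar(t):
    """Toy cumulative noise schedule (assumed); equals 1 at t=0, decays for t in [0, 1)."""
    return math.cos(0.5 * math.pi * t) ** 2


def ddim_step(x_from, t_from, t_to, eps):
    """One deterministic DDIM-style move from t_from to t_to using predicted noise eps."""
    a_from, a_to = alpha_bar(t_from), alpha_bar(t_to)
    x0_hat = [(x - math.sqrt(1 - a_from) * e) / math.sqrt(a_from)
              for x, e in zip(x_from, eps)]
    return [math.sqrt(a_to) * x0 + math.sqrt(1 - a_to) * e
            for x0, e in zip(x0_hat, eps)]


def sds_residual(x0, t, rng):
    """SDS-style residual: conditional prediction minus the sampled random noise."""
    noise = [rng.gauss(0.0, 1.0) for _ in x0]
    a = alpha_bar(t)
    x_t = [math.sqrt(a) * x + math.sqrt(1 - a) * n for x, n in zip(x0, noise)]
    return [e - n for e, n in zip(toy_eps(x_t, t, True), noise)]


def ism_residual(x0, t, delta, n_inv_steps=3):
    """ISM-style residual (assumed form): eps(x_t, t, prompt) - eps(x_s, s, no prompt)."""
    s = t - delta
    x, cur = list(x0), 0.0
    # Deterministically invert x0 up to timestep s with unconditional predictions.
    for k in range(1, n_inv_steps + 1):
        nxt = s * k / n_inv_steps
        x = ddim_step(x, cur, nxt, toy_eps(x, cur, False))
        cur = nxt
    eps_s = toy_eps(x, s, False)
    # Extend the same trajectory by one interval, from s to t.
    x_t = ddim_step(x, s, t, eps_s)
    return [et - es for et, es in zip(toy_eps(x_t, t, True), eps_s)]


if __name__ == "__main__":
    x0 = [0.2, -0.4, 1.0]  # stands in for a rendered view of the 3D model
    print("SDS:", sds_residual(x0, 0.6, random.Random(0)))
    print("ISM:", ism_residual(x0, 0.6, 0.1))
```

In this sketch the SDS-style residual changes with every noise draw, whereas the ISM-style residual is a fixed function of the rendered view, the timestep and the interval, which is one way to read the abstract's claim about inconsistent update directions. In an actual text-to-3D pipeline either residual would be back-propagated through the renderer to the 3D parameters; that step is omitted here.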
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- PaintHuman: Towards High-fidelity Text-to-3D Human Texturing via Denoised Score Distillation (2023)
- Text-to-3D with Classifier Score Distillation (2023)
- Control3D: Towards Controllable Text-to-3D Generation (2023)
- HumanNorm: Learning Normal Diffusion Model for High-quality and Realistic 3D Human Generation (2023)
- IPDreamer: Appearance-Controllable 3D Object Generation with Image Prompts (2023)