Instant3D: Fast Text-to-3D with Sparse-View Generation and Large Reconstruction Model
Abstract
Text-to-3D with diffusion models have achieved remarkable progress in recent years. However, existing methods either rely on score distillation-based optimization which suffer from slow inference, low diversity and Janus problems, or are feed-forward methods that generate low quality results due to the scarcity of 3D training data. In this paper, we propose Instant3D, a novel method that generates high-quality and diverse 3D assets from text prompts in a feed-forward manner. We adopt a two-stage paradigm, which first generates a sparse set of four structured and consistent views from text in one shot with a fine-tuned 2D text-to-image diffusion model, and then directly regresses the NeRF from the generated images with a novel transformer-based sparse-view reconstructor. Through extensive experiments, we demonstrate that our method can generate high-quality, diverse and Janus-free 3D assets within 20 seconds, which is two order of magnitude faster than previous optimization-based methods that can take 1 to 10 hours. Our project webpage: https://jiahao.ai/instant3d/.
Community
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- DreamGaussian: Generative Gaussian Splatting for Efficient 3D Content Creation (2023)
- TOSS: High-quality Text-guided Novel View Synthesis from a Single Image (2023)
- SweetDreamer: Aligning Geometric Priors in 2D Diffusion for Consistent Text-to-3D (2023)
- HiFi-123: Towards High-fidelity One Image to 3D Content Generation (2023)
- Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models (2023)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
Can you generate a 3D model for fasteners.
Models citing this paper 0
No model linking this paper
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper