Core Concepts
DreamGaussian proposes an efficient 3D content generation framework that leverages generative Gaussian splatting and texture refinement to produce high-quality textured meshes in just a few minutes, significantly accelerating the optimization-based 2D lifting approach.
Summary
The paper introduces DreamGaussian, a novel 3D content generation framework that achieves efficiency and quality at the same time. The key insight is to adapt 3D Gaussian splatting, a differentiable 3D representation, to the generative setting, which converges faster than previous methods based on Neural Radiance Fields (NeRF).
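To make the representation concrete, here is a minimal sketch of what a set of 3D Gaussians might look like as data. In the standard 3D Gaussian splatting formulation each Gaussian carries a center, a per-axis scaling, a rotation quaternion, an opacity, and a color. The class name, field names, the sphere-shaped sampling region, and all default values below are illustrative assumptions, not taken from the paper's code.

```python
import random
from dataclasses import dataclass


@dataclass
class Gaussian3D:
    """One Gaussian primitive. Field names are hypothetical."""
    position: tuple   # (x, y, z) center
    scaling: tuple    # per-axis extent (sx, sy, sz)
    rotation: tuple   # unit quaternion (w, x, y, z)
    opacity: float    # in [0, 1]
    color: tuple      # RGB, each in [0, 1]


def random_gaussians(count, radius=0.5, seed=0):
    """Create `count` Gaussians at random positions inside a sphere.

    The sphere, the radius, and the initial scaling, opacity, and color
    are placeholder assumptions chosen only for illustration.
    """
    rng = random.Random(seed)
    gaussians = []
    while len(gaussians) < count:
        # Rejection sampling: draw from the bounding cube, keep points in the sphere.
        p = tuple(rng.uniform(-radius, radius) for _ in range(3))
        if sum(c * c for c in p) > radius * radius:
            continue
        gaussians.append(Gaussian3D(
            position=p,
            scaling=(0.05, 0.05, 0.05),
            rotation=(1.0, 0.0, 0.0, 0.0),  # identity rotation
            opacity=0.1,
            color=(0.5, 0.5, 0.5),
        ))
    return gaussians
```

In an actual optimizer every field would be a learnable tensor, so that gradients from the rendered image can flow back into positions, shapes, and colors.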
The framework consists of two stages:
Stage 1 - Generative Gaussian Splatting:
- The 3D Gaussians are initialized with random positions and progressively densified during optimization.
- Score Distillation Sampling (SDS) is used to optimize the 3D Gaussians, leveraging powerful 2D diffusion models as priors for both image-to-3D and text-to-3D tasks.
- The generated 3D Gaussians tend to be blurry due to the ambiguity in SDS supervision.
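The SDS update in Stage 1 can be illustrated with a toy example. The sketch below is not the paper's implementation: the "renderer" is the identity (the parameters are the image), the diffusion model is replaced by a hypothetical closed-form noise predictor that assumes a known clean target, and the weighting w(t) = 1 - alpha_bar is one assumed choice. It only shows the shape of the update: noise the render, predict the noise, and use the difference between predicted and sampled noise as the gradient.

```python
import math
import random


def toy_noise_predictor(x_t, alpha_bar, target):
    """Stand-in for a trained diffusion model (assumption).

    Returns the noise that would explain x_t if the clean image were `target`.
    """
    s, n = math.sqrt(alpha_bar), math.sqrt(1.0 - alpha_bar)
    return [(xt - s * tg) / n for xt, tg in zip(x_t, target)]


def sds_step(params, target, rng, lr=0.5):
    """One SDS-style update with an identity renderer (d render / d params = 1)."""
    alpha_bar = rng.uniform(0.02, 0.98)              # random noise level
    s, n = math.sqrt(alpha_bar), math.sqrt(1.0 - alpha_bar)
    eps = [rng.gauss(0.0, 1.0) for _ in params]      # sampled Gaussian noise
    x_t = [s * x + n * e for x, e in zip(params, eps)]  # noised render
    eps_pred = toy_noise_predictor(x_t, alpha_bar, target)
    w = 1.0 - alpha_bar                              # assumed weighting w(t)
    # SDS gradient: w(t) * (predicted noise - sampled noise)
    grad = [w * (ep - e) for ep, e in zip(eps_pred, eps)]
    return [x - lr * g for x, g in zip(params, grad)]


def optimize(target, steps=300, seed=0):
    """Run repeated SDS-style steps from a random starting image."""
    rng = random.Random(seed)
    params = [rng.random() for _ in target]
    for _ in range(steps):
        params = sds_step(params, target, rng)
    return params
```

With this toy predictor the parameters are pulled toward `target`. A real 2D diffusion prior gives a much noisier, less specific signal, which is consistent with the blurriness noted for the Stage 1 output.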
Stage 2 - Efficient Mesh Extraction and Texture Refinement:
- An efficient algorithm is proposed to extract a textured mesh from the 3D Gaussians, including a local density query and color back-projection.
- A UV-space texture refinement stage is introduced, using a multi-step denoising process with MSE loss to enhance the texture details, avoiding the artifacts caused by directly applying SDS loss.
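The local density query in Stage 2 can be sketched as follows. This is a simplified illustration, not the paper's algorithm: each Gaussian is assumed to be axis-aligned (diagonal covariance) rather than carrying a full rotation, and the grid resolution, extent, and threshold are hypothetical values. The density at a point is the opacity-weighted sum of the Gaussians evaluated there. Thresholding it on a grid gives an occupancy volume that a surface extractor such as marching cubes could turn into a mesh.

```python
import math


def local_density(point, gaussians):
    """Opacity-weighted sum of Gaussian densities at `point`.

    Each Gaussian is a (center, scale, opacity) triple with a per-axis
    scale, i.e. a diagonal covariance diag(scale^2) (an assumption).
    """
    total = 0.0
    for center, scale, opacity in gaussians:
        # Squared Mahalanobis distance under the diagonal covariance.
        m2 = sum(((p - c) / s) ** 2 for p, c, s in zip(point, center, scale))
        total += opacity * math.exp(-0.5 * m2)
    return total


def occupancy_grid(gaussians, resolution=8, extent=1.0, threshold=0.5):
    """Sample density on a regular grid over [-extent, extent]^3 and threshold it."""
    step = 2.0 * extent / (resolution - 1)
    coords = [-extent + i * step for i in range(resolution)]
    return [[[local_density((x, y, z), gaussians) > threshold
              for z in coords] for y in coords] for x in coords]
```

A brute-force loop over all Gaussians is used here for clarity. An efficient version would query only the Gaussians near each grid block, which is what makes the query "local".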
Extensive experiments demonstrate that DreamGaussian can produce high-quality textured meshes in just 2 minutes from a single-view image, achieving approximately 10 times acceleration compared to existing optimization-based methods while maintaining competitive generation quality.
Statistics
The headline figures reported are a generation time of about 2 minutes for a textured mesh from a single-view image, and roughly 10 times acceleration over existing optimization-based methods. No further quantitative metrics are covered in this summary.
Quotes
"DreamGaussian aims at accelerating the optimization process of both image- and text-to-3D tasks. We are able to generate a high quality textured mesh in several minutes."
"Compared to previous methods with the NeRF representation, which find difficulties in effectively pruning empty space, our generative Gaussian splatting significantly simplifies the optimization landscape."
"Extensive experiments demonstrate that DreamGaussian can produce high-quality textured meshes in just 2 minutes from a single-view image, achieving approximately 10 times acceleration compared to existing optimization-based methods while maintaining competitive generation quality."