The paper introduces WonderWorld, a novel framework for interactive 3D scene generation. The key challenge is fast generation: existing approaches are slow because they must progressively generate many views and optimize scene geometry representations.
To address this, the paper proposes the Fast LAyered Gaussian Surfels (FLAGS) representation and an algorithm to generate it from a single view. This allows for fast generation (less than 10 seconds per scene) by removing the need for progressive dense view generation and leveraging a geometry-based initialization that significantly reduces optimization time.
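To make the idea of a geometry-based initialization concrete, here is a minimal sketch of how surfel centers and sizes could be seeded from a single-view depth map, so that optimization starts near a plausible surface. It assumes a simple pinhole camera; the function name `init_surfels_from_depth` and the parameters `focal`, `cx`, and `cy` are hypothetical, and this is not the paper's implementation (which also uses layers and surface orientation).

```python
import numpy as np

def init_surfels_from_depth(depth, focal, cx, cy):
    """Back-project every pixel to a 3D point and pick a per-surfel size.

    Assumption: pinhole camera with a single focal length (in pixels)
    and principal point (cx, cy). Illustrative only.
    """
    h, w = depth.shape
    # Pixel coordinate grids: u runs along columns, v along rows.
    u, v = np.meshgrid(np.arange(w), np.arange(h))
    # Standard pinhole back-projection into camera space.
    x = (u - cx) / focal * depth
    y = (v - cy) / focal * depth
    centers = np.stack([x, y, depth], axis=-1).reshape(-1, 3)
    # Approximate footprint of one pixel at its depth, so that
    # neighbouring surfels roughly tile the visible surface.
    scales = (depth / focal).reshape(-1)
    return centers, scales

if __name__ == "__main__":
    depth = np.full((2, 2), 2.0)
    centers, scales = init_surfels_from_depth(depth, focal=2.0, cx=0.5, cy=0.5)
    print(centers.shape, scales)
```

Because every surfel starts at a back-projected pixel with a depth-dependent size, only a short refinement should be needed afterwards, which is the intuition behind the reduced optimization time.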
Another challenge is generating coherent geometry that allows all scenes to be connected. The paper introduces guided depth diffusion to improve the alignment between the geometry of newly generated scenes and existing scenes.
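The guidance idea can be illustrated with a deliberately simplified sketch: where the depth of the existing scene is known, the predicted depth is nudged toward it by one gradient step on a masked squared error. The function `guide_depth` and its `step` parameter are hypothetical, and the sketch omits the diffusion model entirely; in the paper, guidance is applied within the depth diffusion process rather than as a standalone post-hoc correction.

```python
import numpy as np

def guide_depth(pred_depth, known_depth, known_mask, step=0.5):
    """One guidance step pulling predicted depth toward known depth.

    Assumption: known_mask is 1 where existing geometry is visible
    and 0 elsewhere. Illustrative only, not the paper's algorithm.
    """
    # Gradient of 0.5 * sum(mask * (pred - known)^2) w.r.t. pred.
    residual = (pred_depth - known_depth) * known_mask
    # Move against the gradient; unmasked pixels are left unchanged.
    return pred_depth - step * residual

if __name__ == "__main__":
    pred = np.array([[1.0, 4.0]])
    known = np.array([[3.0, 0.0]])
    mask = np.array([[1.0, 0.0]])
    print(guide_depth(pred, known, mask))
```

Only pixels covered by the mask move toward the existing geometry, while newly generated regions keep their predicted depth.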
WonderWorld lets users interactively specify scene contents and layout and see the created scenes with low latency. This unlocks new possibilities for applications in virtual reality, gaming, and creative design, where users can quickly generate and explore diverse 3D scenes.
Key insights distilled from: Hong-Xing Yu... at arxiv.org, 09-11-2024
https://arxiv.org/pdf/2406.09394.pdf