Core Concepts
AnyHome enables the generation of detailed 3D indoor scenes from open-vocabulary text inputs, offering realism and customizability.
Abstract
The content introduces AnyHome, a framework for translating text into structured and textured indoor scenes. It focuses on generating diverse house-scale 3D environments with high realism. The process involves textual input modulation, structured geometry generation, and egocentric refinement. AnyHome stands out for its ability to create detailed geometries and textures that outperform existing methods quantitatively and qualitatively.
Introduction:
AnyHome aims to transform free-form textual narratives into realistic 3D indoor scenes.
The framework offers extensive editing capabilities at varying levels of granularity.
Previous research has struggled with creating robust 3D structures but AnyHome bridges this gap effectively.
Methodology:
AnyHome employs Large Language Models (LLMs) with designed templates for scene generation.
The process includes Score Distillation Sampling for geometry refinement and egocentric inpainting for texture addition.
Graph-based intermediate representations are used to describe the geometry structure.
Results:
AnyHome generates diverse scenes from open-vocabulary text inputs, showcasing versatility in design styles.
The framework supports comprehensive editing capabilities, allowing modifications at various levels.
Comparison with baselines shows superior performance in layout quality and content alignment.
Stats
"By prompting Large Language Models (LLMs) with designed templates, our approach converts provided textual narratives into amodal structured representations."
"These represen-tations guarantee consistent and realistic spatial layouts by directing the synthesis of a geometry mesh within defined constraints."
"AnyHome generates detailed geometries and textures that outperform existing methods in both quantitative and qualitative measures."
Quotes
"Imagine the possibilities if we could articulate our ideal living spaces in natural language and see them come to life."
"Our method surpasses these direct LLM-generated plans, especially with abstract prompts, by preserving room relationships and accommodating diverse shapes and sizes."