Centrala begrepp
A novel coarse-to-fine framework with hierarchical neural representation is proposed to efficiently generate high-quality stylized novel views from sparse input scenes.
Sammanfattning
The paper presents a novel coarse-to-fine framework for sparse-view 3D scene stylization. The key contributions are:
-
Coarse Geometry Generation:
- A simplified NeRF architecture with low-frequency positional encoding is used to capture the coarse scene geometry from sparse input views.
- This coarse representation provides a reasonable semantic content of the scene without high-frequency artifacts.
-
Fine Detail Stylization:
- A hierarchical encoding-based neural representation is introduced, which models the high-frequency geometric details as residual values using multi-resolution hash-based feature grids.
- The coarse geometric features are combined with the multi-resolution feature grids to assist the MLP in transferring high-frequency information while preserving the semantic content.
-
Content Annealing:
- A new optimization strategy with content strength annealing is designed to better balance the stylization effect and content preservation.
- The content loss weight is gradually decreased during training, allowing the model to focus on learning low-frequency details first and then shift to style matching.
The proposed framework effectively handles the stylization of sparse-view 3D scenes, outperforming state-of-the-art methods in both qualitative and quantitative evaluations.
Statistik
Our method can generate high-quality stylized novel views from sparse input scenes, outperforming state-of-the-art methods in terms of both consistency and stylization quality.
Citat
"We propose a coarse-to-fine framework for sparse-view 3D scene stylization, which enables efficient and high-quality stylized novel view generation."
"We introduce a hierarchical encoding-based scene representation to model a sparse-view scene from coarse to fine, where the coarse-level representation is first optimized to capture the coarse geometry of a scene from sparse inputs, and then the fine-level representation is directly optimized with the target style to generate the final stylized scene."
"We design a new optimization strategy with content annealing for fine 3D stylized scene generation. Our model can generate accurate semantic content in the early phase of stylization optimization, and later gradually synthesizes high-quality stylized textures that faithfully match the reference style."