Concepts de base
DaRePlane, a novel direction-aware representation based on the dual-tree complex wavelet transform (DTCWT), enhances dynamic scene reconstruction in both NeRF and Gaussian Splatting frameworks by overcoming limitations of traditional wavelet representations.
Résumé
DaRePlane: Direction-aware Representations for Dynamic Scene Reconstruction
This research paper introduces DaRePlane, a novel approach for reconstructing dynamic 3D scenes from 2D images, addressing limitations of existing methods like Neural Radiance Fields (NeRF) and Gaussian Splatting (GS).
The study aims to develop an efficient and robust method for high-fidelity dynamic scene reconstruction, overcoming the limitations of traditional wavelet-based representations in handling shift variance and direction ambiguity.
The researchers propose DaRePlane, a direction-aware representation inspired by the dual-tree complex wavelet transform (DTCWT). This approach captures scene dynamics from six distinct orientations, enhancing detail and motion capture. DaRePlane is integrated into both NeRF and GS frameworks. In NeRF, it computes features for each space-time point, feeding them to an MLP for color regression. In GS, it computes features for Gaussian points, followed by deformation prediction using a multi-head MLP. A trainable masking approach addresses redundancy in wavelet coefficients, optimizing storage efficiency. The method is evaluated on various datasets, including real-world and surgical dynamic scenes, using metrics like PSNR, DSSIM, and LPIPS.