The content discusses the limitations of existing methods in self-supervised depth estimation and proposes a new framework, DO3D, to address the challenges. It introduces a hybrid Transformer and CNN model for depth estimation and a motion estimation module with object-wise rigid and non-rigid motion prediction. The system aims to model 3D motion and geometry for accurate depth and motion estimation.
In un'altra lingua
dal contenuto originale
arxiv.org
Approfondimenti chiave tratti da
by Xiuzhe Wu,Xi... alle arxiv.org 03-12-2024
https://arxiv.org/pdf/2403.05895.pdfDomande più approfondite