Existing self-supervised depth estimation methods have limitations that the proposed framework, DO3D, is designed to address. DO3D combines a hybrid Transformer and CNN model for depth estimation with a motion estimation module that predicts object-wise rigid and non-rigid motion. By modeling 3D motion and geometry, the system aims to estimate both depth and motion accurately.
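To make "object-wise rigid and non-rigid motion" concrete, the following is a minimal sketch, not DO3D's actual implementation. It assumes each object's 3D points are moved by one rigid transform (a rotation and a translation) plus an optional per-point residual standing in for non-rigid deformation. The function name, its parameters, and the additive form of the residual are all hypothetical choices made for illustration.

```python
import numpy as np


def apply_object_motion(points, rotation, translation, residual=None):
    """Move one object's 3D points by a rigid transform plus an optional residual.

    All names here are hypothetical; this is an illustration, not the paper's code.

    points:      (N, 3) array of 3D points belonging to a single object.
    rotation:    (3, 3) rotation matrix of the object's rigid motion.
    translation: (3,) translation vector of the object's rigid motion.
    residual:    optional (N, 3) per-point offsets standing in for
                 non-rigid deformation (assumed to be additive).
    """
    points = np.asarray(points, dtype=float)
    rotation = np.asarray(rotation, dtype=float)
    translation = np.asarray(translation, dtype=float)

    # Rigid part: rotate every point, then shift by the shared translation.
    moved = points @ rotation.T + translation

    # Non-rigid part: add a separate offset to each point, if one is given.
    if residual is not None:
        moved = moved + np.asarray(residual, dtype=float)
    return moved


# Usage: rotate one point 90 degrees about the z-axis, then lift it by 1 along z.
rot_z_90 = np.array([[0.0, -1.0, 0.0],
                     [1.0, 0.0, 0.0],
                     [0.0, 0.0, 1.0]])
rigid_only = apply_object_motion([[1.0, 0.0, 0.0]], rot_z_90, [0.0, 0.0, 1.0])
with_residual = apply_object_motion(
    [[1.0, 0.0, 0.0]], rot_z_90, [0.0, 0.0, 1.0], residual=[[0.1, 0.0, 0.0]]
)
```

Keeping the rigid transform and the residual as separate terms mirrors the split described in the summary: a rigid object needs only the rotation and translation, while a deformable one also carries per-point offsets.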
Key ideas extracted from the paper by Xiuzhe Wu, Xi... at arxiv.org, 03-12-2024
https://arxiv.org/pdf/2403.05895.pdf