The content discusses the limitations of existing methods in self-supervised depth estimation and proposes a new framework, DO3D, to address the challenges. It introduces a hybrid Transformer and CNN model for depth estimation and a motion estimation module with object-wise rigid and non-rigid motion prediction. The system aims to model 3D motion and geometry for accurate depth and motion estimation.
Іншою мовою
із вихідного контенту
arxiv.org
Ключові висновки, отримані з
by Xiuzhe Wu,Xi... о arxiv.org 03-12-2024
https://arxiv.org/pdf/2403.05895.pdfГлибші Запити