The content discusses the limitations of existing methods in self-supervised depth estimation and proposes a new framework, DO3D, to address the challenges. It introduces a hybrid Transformer and CNN model for depth estimation and a motion estimation module with object-wise rigid and non-rigid motion prediction. The system aims to model 3D motion and geometry for accurate depth and motion estimation.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Xiuzhe Wu,Xi... lúc arxiv.org 03-12-2024
https://arxiv.org/pdf/2403.05895.pdfYêu cầu sâu hơn