核心概念
This paper presents a novel hybrid visual odometry framework that leverages pose-only supervision, offering a balanced solution between robustness and the need for extensive labeling. It introduces a self-supervised homographic pre-training method for enhancing optical flow learning from pose-only labels and a random patch-based salient point detection strategy for more accurate optical flow patch extraction.
摘要
The paper proposes a hybrid visual odometry (VO) framework that utilizes pose-only supervision to address the limitations of traditional geometry-based and deep learning-based VO methods. The key contributions are:
-
Self-supervised homographic pre-training: This pre-training phase empowers the network to refine its optical flow estimation capabilities and bolster feature representations from just one image, proving advantageous for the subsequent sparse optical flow-based VO tasks that depend exclusively on pose supervision.
-
Salient patch detection and refinement: A salient patch detection module identifies points with significant image features, retaining valuable patches while discarding unnecessary ones. A salient patch refining training step further enhances the network's cooperation with salient patches, improving accuracy and reliability, particularly in monotonous environments.
The experiments show that the pose-only supervised method achieves competitive results on standard datasets and greater robustness and generalization ability in extreme and unseen scenarios, even compared to dense optical flow-supervised state-of-the-art methods. The live experiment in a meeting room with significant illumination changes demonstrates the superior robustness and generalization of the proposed approach.
統計資料
The paper does not contain any explicit numerical data or statistics to support the key claims. The performance comparisons are presented in the form of tables showing the Absolute Trajectory Error (ATE) on various datasets.
引述
"To the best of our knowledge, we are the first to investigate the hybrid sparse optical flow-based Visual Odometry with pose-only supervision."
"We unveil a groundbreaking self-supervised homographic pre-training method for optical flow. This approach empowers the network to refine its optical flow estimation capabilities and bolster feature representations from just one image, proving advantageous for the subsequent sparse optical flow-based VO tasks that depend exclusively on pose supervision."
"A salient patch detection module and a salient patch refining step are introduced in the proposed system. The salient point detection module identifies those points with significant image features, striving to retain valuable patches while discarding unnecessary ones, and the salient patch refining training step enhances the network's cooperation with salient patches, thus improving accuracy and reliability, particularly in monotonous environments."