Główne pojęcia
HiPose establishes 3D-3D correspondences in a coarse-to-fine manner with a hierarchical binary surface encoding, enabling efficient and accurate 6DoF object pose estimation from a single RGB-D image without any time-consuming refinement.
Streszczenie
The paper presents HiPose, a novel method for 6DoF object pose estimation from a single RGB-D image. The key contributions are:
-
Hierarchical Binary Surface Encoding:
- The network predicts a binary code for each point in the input point cloud, representing a correspondence to a sub-surface on the object model.
- The binary code is split into two parts - the first m bits encode a coarse surface correspondence, while the remaining n bits are used for iterative fine-grained matching.
-
Hierarchical Correspondence Pruning:
- The coarse pose estimated from the initial m-bit correspondence is used to identify and remove outlier matches based on point-to-surface distance.
- The process is repeated for the finer n-bit correspondences, gradually improving the pose estimate and eliminating outliers.
-
RANSAC-free Pose Estimation:
- The hierarchical pruning approach eliminates the need for RANSAC, which is commonly used with the Kabsch algorithm for pose estimation.
- This makes the pose estimation process more stable and efficient compared to RANSAC-based methods.
Extensive experiments on the LM-O, YCB-V, and T-LESS datasets demonstrate that HiPose outperforms state-of-the-art methods in terms of accuracy while being significantly faster, as it does not require any time-consuming pose refinement.
Statystyki
The paper does not provide any specific numerical data or statistics in the main text. The results are presented in the form of performance metrics on benchmark datasets.