The paper introduces a novel CNN-based single-stage method called Dual-Path Hierarchical Relation Network (DHRNet) for multi-person pose estimation. The key highlights are:
DHRNet employs a dual-path interaction modeling module (DIM) that strategically organizes cross-instance and cross-joint interaction modeling modules in two complementary orders. This allows the model to extract instance-to-joint and joint-to-instance interactions concurrently, enriching the interaction information.
The dual-path design of DIM enables the model to leverage the complementarity between cross-instance and cross-joint interactions, which is crucial for accurate joint localization.
DHRNet outperforms state-of-the-art methods on challenging benchmarks like COCO, CrowdPose, and OCHuman datasets, demonstrating the effectiveness of the proposed approach.
Extensive ablation studies validate the importance of the dual-path interaction modeling and the adaptive feature fusion module in enhancing the model's performance.
Qualitative analysis showcases how DHRNet utilizes cross-instance and cross-joint correlations to locate human joints, especially in occluded and crowded scenarios.
Sang ngôn ngữ khác
từ nội dung nguồn
arxiv.org
Thông tin chi tiết chính được chắt lọc từ
by Yonghao Dang... lúc arxiv.org 04-23-2024
https://arxiv.org/pdf/2404.14025.pdfYêu cầu sâu hơn