The paper introduces a novel CNN-based single-stage method called Dual-Path Hierarchical Relation Network (DHRNet) for multi-person pose estimation. The key highlights are:
DHRNet employs a dual-path interaction modeling module (DIM) that strategically organizes cross-instance and cross-joint interaction modeling modules in two complementary orders. This allows the model to extract instance-to-joint and joint-to-instance interactions concurrently, enriching the interaction information.
The dual-path design of DIM enables the model to leverage the complementarity between cross-instance and cross-joint interactions, which is crucial for accurate joint localization.
DHRNet outperforms state-of-the-art methods on challenging benchmarks like COCO, CrowdPose, and OCHuman datasets, demonstrating the effectiveness of the proposed approach.
Extensive ablation studies validate the importance of the dual-path interaction modeling and the adaptive feature fusion module in enhancing the model's performance.
Qualitative analysis showcases how DHRNet utilizes cross-instance and cross-joint correlations to locate human joints, especially in occluded and crowded scenarios.
إلى لغة أخرى
من محتوى المصدر
arxiv.org
الرؤى الأساسية المستخلصة من
by Yonghao Dang... في arxiv.org 04-23-2024
https://arxiv.org/pdf/2404.14025.pdfاستفسارات أعمق