Kernekoncepter
The core message of this article is to model the global motion coordination of all joints, in addition to the local interactions between joint pairs, to generate more realistic and accurate human motion predictions.
Resumé
The article proposes a framework for human motion prediction that focuses on two key aspects:
- Motion Coordination Modeling:
- The authors introduce a "Coordination Attractor (CA)" to capture the global motion features and use it to build new relative joint relations, which better reflect the simultaneous cooperation of all joints.
- The Comprehensive Joint Relation Extractor (CJRE) module combines this global coordination with the local interactions between joint pairs to extract richer joint relations.
- Enriched Dynamics Extraction:
- The Multi-timescale Dynamics Extractor (MTDE) is proposed to extract diverse motion dynamics from raw position information at different temporal scales, providing more informative input features.
The framework first uses MTDE to enrich the input motion dynamics. Then, the CJRE module models both the global coordination of all joints and the local interactions between joint pairs. The Adaptive Feature Fusing Module (AFFM) is introduced to adaptively combine these different joint relations.
Extensive experiments on H3.6M, CMU-Mocap, and 3DPW datasets show that the proposed framework outperforms state-of-the-art methods in both short-term and long-term motion prediction, generating more realistic and accurate human motions.
Statistik
The average MPJPE (Mean Per Joint Position Error) of our method on H3.6M dataset is 9.6 mm, 22.0 mm, and 46.2 mm for 80 ms, 160 ms, and 320 ms prediction, respectively, outperforming previous state-of-the-art methods.
On the 3DPW dataset, our method achieves an MPJPE of 75.1 mm and 109.0 mm for 560 ms and 1000 ms prediction, respectively, surpassing the previous best results.
Citater
"The global coordination of all joints plays an essential role in human motion. It describes the mutual constraints of all joints during motion and thus could offer richer motion cues to predict human motion."
"The learned global relations in most previous works are predefined and fixed, which is insufficient to represent the diversity of global coordination, such as balance, inertia, etc."