The paper proposes a novel pipeline for 3D reconstruction of interacting multi-person in clothing from a single image. The key challenges arise from occlusion, where parts of the human body are not visible due to occlusion by others or self-occlusion.
To address this, the authors utilize two human priors: complete 3D geometry and surface contacts. For the geometry prior, an encoder-decoder network is used to regress the image of a person with missing body parts to a latent vector, which is then decoded to produce 3D features. An implicit network combines these features with a surface normal map to reconstruct a complete and detailed 3D human mesh.
For the contact prior, an image-space contact detector is developed that outputs a probability distribution of surface contacts between people in 3D. These priors are used to globally refine the body poses, enabling penetration-free and accurate reconstruction of interacting multi-person in clothing on the scene space.
The results demonstrate that the proposed method can produce complete, globally coherent, and physically plausible 3D reconstructions compared to existing methods.
To Another Language
from source content
arxiv.org
Key Insights Distilled From
by Junuk Cha,Ha... at arxiv.org 04-02-2024
https://arxiv.org/pdf/2401.06415.pdfDeeper Inquiries