Generating Diverse and Coordinated Holistic Co-Speech Motions for 3D Avatars
This paper presents ProbTalk, a unified probabilistic framework that jointly models facial expressions, hand gestures, and body poses to generate variable and coordinated holistic co-speech motions for 3D avatars.