The author proposes a novel method, Say Anything with Any Style (SAAS), for generating stylized talking head videos by extracting speaking styles in a discrete manner and utilizing a multi-task VQ-VAE model.
Discrete representation learning and dynamic-weight method enhance stylized talking head generation.