Unified Multi-Modal Diffusion Model for Simultaneous Generation of Diverse Data Types
The proposed MT-Diffusion model jointly models and generates multi-modal data, such as images and labels, within a single unified diffusion framework by integrating multi-task learning losses in a principled manner.
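
As a rough illustration of what combining per-modality losses in one diffusion objective can look like, the sketch below noises a flattened image and a one-hot label with the same DDPM-style forward step and then sums their noise-prediction errors using task weights. It is not the MT-Diffusion objective itself, whose exact form is not stated here. The function names (`forward_noise`, `multi_task_loss`), the weights, the noise level `alpha_bar_t`, and the zero-output stand-in for the denoising network are all assumptions made for illustration.

```python
import math
import random


def forward_noise(x0, alpha_bar_t, rng):
    """DDPM-style forward step: x_t = sqrt(a) * x0 + sqrt(1 - a) * eps."""
    eps = [rng.gauss(0.0, 1.0) for _ in x0]
    scale_x = math.sqrt(alpha_bar_t)
    scale_e = math.sqrt(1.0 - alpha_bar_t)
    x_t = [scale_x * x + scale_e * e for x, e in zip(x0, eps)]
    return x_t, eps


def multi_task_loss(eps_pred, eps_true, weights):
    """Weighted sum of per-modality mean-squared noise-prediction errors.

    Assumption: a plain weighted sum stands in for the multi-task objective.
    """
    per_task = {}
    for name, target in eps_true.items():
        pred = eps_pred[name]
        per_task[name] = sum((p - t) ** 2 for p, t in zip(pred, target)) / len(target)
    total = sum(weights[name] * loss for name, loss in per_task.items())
    return total, per_task


rng = random.Random(0)

# Toy data (hypothetical): a flattened 4x4 "image" and a one-hot label over 3 classes.
clean = {
    "image": [rng.uniform(-1.0, 1.0) for _ in range(16)],
    "label": [0.0, 1.0, 0.0],
}
alpha_bar_t = 0.5  # assumed cumulative noise level at some timestep t

# Noise both modalities at the same timestep.
noisy, eps_true = {}, {}
for name, x0 in clean.items():
    noisy[name], eps_true[name] = forward_noise(x0, alpha_bar_t, rng)

# Placeholder for a shared denoising network: it always predicts zero noise.
eps_pred = {name: [0.0] * len(eps) for name, eps in eps_true.items()}

weights = {"image": 1.0, "label": 0.5}  # assumed task weights
total, per_task = multi_task_loss(eps_pred, eps_true, weights)
print(per_task, total)
```

In a real model the placeholder prediction would come from a single network that takes both noisy modalities and the timestep as input, so that the shared parameters receive gradients from every task's loss term.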