Mask-ControlNet: Enhancing Text-to-Image Generation with Mask Prompts for Higher-Quality and Controllable Image Synthesis
Mask-ControlNet introduces an additional mask prompt to better model the relationship between foreground and background, enabling the diffusion model to generate higher-quality, more controllable images with higher fidelity to the reference image.
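
The summary does not say how the mask prompt is built or fed to the model. As a rough illustration only, the sketch below assumes a binary mask over a toy grayscale reference image and shows how such a mask separates foreground pixels from background pixels. The function name `split_by_mask` and the toy data are hypothetical and are not taken from the paper.

```python
# Minimal sketch (assumption): a binary mask marks foreground pixels (1)
# and background pixels (0) in a reference image. This is NOT the paper's
# implementation, only an illustration of the foreground/background split.

def split_by_mask(image, mask):
    """Split a 2D grayscale image (list of rows) into foreground and background.

    Pixels where the mask is 1 go to the foreground; the rest go to the
    background. Excluded pixels are set to 0 in each output.
    """
    foreground = [
        [px if m else 0 for px, m in zip(row, mask_row)]
        for row, mask_row in zip(image, mask)
    ]
    background = [
        [0 if m else px for px, m in zip(row, mask_row)]
        for row, mask_row in zip(image, mask)
    ]
    return foreground, background


# Toy 2x3 reference image and a hypothetical mask for it.
image = [[10, 20, 30],
         [40, 50, 60]]
mask = [[1, 0, 0],
        [1, 1, 0]]

fg, bg = split_by_mask(image, mask)
```

Because every pixel lands in exactly one of the two outputs, adding `fg` and `bg` element-wise recovers the original image. In a real pipeline the two parts could then serve as separate conditioning signals, though that wiring is outside what the summary states.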