Enhancing Text-to-Image Diffusion Models with Concept Matching and Attribute Concentration
The core message of this paper is that misalignment between text prompts and generated images in diffusion models stems from insufficient attention to certain text tokens. The paper addresses this with two components: an image-to-text concept matching mechanism and an attribute concentration module.
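To make "insufficient attention to certain text tokens" concrete, here is a minimal sketch of how one could measure the share of cross-attention each prompt token receives and flag the neglected ones. It is an illustration only, not the paper's method: the attention layout (`attn[i][j]` is the weight image position `i` places on token `j`), the helper names `token_attention_share` and `under_attended`, the `ratio` threshold, and the toy numbers are all assumptions.

```python
from typing import Dict, List, Sequence


def token_attention_share(
    attn: Sequence[Sequence[float]], tokens: List[str]
) -> Dict[str, float]:
    """Average attention each token receives across all image positions.

    Assumes each row of `attn` is softmax-normalized over the tokens,
    and that the tokens are unique (duplicates would overwrite each other).
    """
    n_pos = len(attn)
    return {
        tok: sum(row[j] for row in attn) / n_pos
        for j, tok in enumerate(tokens)
    }


def under_attended(shares: Dict[str, float], ratio: float = 0.5) -> List[str]:
    """Tokens whose share is below `ratio` times the uniform share.

    The threshold rule is hypothetical, chosen only for illustration.
    """
    uniform = 1.0 / len(shares)
    return [tok for tok, share in shares.items() if share < ratio * uniform]


# Toy prompt with three content tokens and four image positions.
tokens = ["cat", "red", "hat"]
attn = [
    [0.7, 0.1, 0.2],
    [0.6, 0.1, 0.3],
    [0.8, 0.0, 0.2],
    [0.5, 0.2, 0.3],
]

shares = token_attention_share(attn, tokens)
ignored = under_attended(shares)
print(shares)
print(ignored)
```

In this toy case the attribute token "red" receives far less attention than the object tokens, which is the kind of imbalance the summary describes as the source of misalignment. A real diagnostic would read these maps from the model's cross-attention layers instead of a hand-written matrix.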