Enhancing Text-to-Image Alignment with Discriminative Probing and Tuning
The author argues that improving the discriminative abilities of text-to-image (T2I) models can enhance text-image alignment for better generation results. By introducing a Discriminative Probing and Tuning (DPT) paradigm, the author aims to boost both generative and discriminative performance of T2I models.