CAMANet enhances cross-modal alignment in radiology report generation by leveraging class activation maps and attention consistency.
The author proposes CAMANet to enhance cross-modal alignment and discriminative representation in radiology report generation, outperforming previous methods on benchmark datasets.