Incorporating textual information alongside visual features to enhance the model's understanding of the data and improve its generalization across diverse clinical domains.