Clinical Prior Guided Hierarchical Vision-Language Pre-training for Medical Imaging Analysis
IMITATE is a novel clinical prior guided hierarchical vision-language pre-training framework that aligns multi-level visual features from medical images with the descriptive and conclusive textual features of hierarchically structured medical reports. It outperforms state-of-the-art methods across various medical imaging downstream tasks.
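To make the idea of hierarchical alignment concrete, the following is a minimal sketch, not the IMITATE implementation. It assumes that each image yields two pooled feature vectors (an intermediate-level one and a high-level one), that each report yields one embedding for its descriptive part and one for its conclusive part, and that each pair is aligned with a symmetric contrastive (InfoNCE-style) loss. The pairing of levels, the function names, the temperature value, and the unweighted sum of the two losses are all illustrative assumptions. The clinical prior guidance is not modeled here.

```python
import numpy as np


def l2_normalize(x, eps=1e-8):
    """Scale each row to unit length so dot products become cosine similarities."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


def log_softmax(logits, axis):
    """Numerically stable log-softmax along the given axis."""
    shifted = logits - logits.max(axis=axis, keepdims=True)
    return shifted - np.log(np.exp(shifted).sum(axis=axis, keepdims=True))


def symmetric_info_nce(image_feats, text_feats, temperature=0.07):
    """Symmetric contrastive loss; row i of each input is a matched pair.

    The temperature of 0.07 is a common default, assumed here for illustration.
    """
    image_feats = l2_normalize(image_feats)
    text_feats = l2_normalize(text_feats)
    logits = image_feats @ text_feats.T / temperature  # shape (batch, batch)
    idx = np.arange(logits.shape[0])
    loss_i2t = -log_softmax(logits, axis=1)[idx, idx].mean()  # image -> text
    loss_t2i = -log_softmax(logits, axis=0)[idx, idx].mean()  # text -> image
    return 0.5 * (loss_i2t + loss_t2i)


def hierarchical_alignment_loss(mid_visual, high_visual,
                                descriptive_text, conclusive_text,
                                temperature=0.07):
    """Hypothetical two-level objective (assumed pairing, unweighted sum):
    intermediate visual features <-> descriptive text,
    high-level visual features   <-> conclusive text.
    """
    loss_descriptive = symmetric_info_nce(mid_visual, descriptive_text, temperature)
    loss_conclusive = symmetric_info_nce(high_visual, conclusive_text, temperature)
    return loss_descriptive + loss_conclusive


if __name__ == "__main__":
    # Random stand-ins for encoder outputs: batch of 8, embedding size 16.
    rng = np.random.default_rng(0)
    mid, high = rng.normal(size=(8, 16)), rng.normal(size=(8, 16))
    desc, concl = rng.normal(size=(8, 16)), rng.normal(size=(8, 16))
    print(hierarchical_alignment_loss(mid, high, desc, concl))
```

In this sketch each loss term decreases as matched image and text embeddings become more similar than mismatched ones within the batch, which is the general behavior a contrastive alignment objective is meant to have.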