The article introduces ManiNeg, a novel approach that utilizes manifestations (observable symptoms or signs of a disease) as proxies to mine hard negative samples for contrastive learning in mammography analysis. This addresses the challenges posed by the small size and obscured nature of breast lumps, which undermine the assumptions of traditional contrastive learning methods.
The key highlights are:
The authors critically evaluate the limitations of conventional hard negative sampling methods in contrastive learning for mammographic data analysis, and advocate the use of manifestations as a viable proxy to overcome these challenges.
They introduce the ManiNeg framework, which strategically samples hard negative samples based on the Hamming distance between manifestation vectors. This approach leverages the structured and semantically meaningful nature of manifestations to enhance representation learning.
The authors have developed the Mammography Visual-Knowledge-Linguistic (MVKL) dataset, which includes multi-view mammograms, corresponding radiology reports, meticulously annotated manifestations, and pathologically confirmed benign-malignant outcomes. This comprehensive dataset supports the evaluation of ManiNeg and future research in this domain.
Empirical studies demonstrate that ManiNeg significantly improves representation learning in both unimodal and multimodal settings, and exhibits strong generalization across datasets.
Till ett annat språk
från källinnehåll
arxiv.org
Djupare frågor