The paper presents a semi-supervised learning framework, SelectiveKD, for building a cancer detection model for Digital Breast Tomosynthesis (DBT) that leverages unlabeled slices in a DBT stack. The key insights are:
Obtaining large-scale accurate annotations for DBT is challenging due to the volumetric nature of the data. Existing approaches often annotate only a sparse set of slices, which limits the scale of annotated datasets and introduces potential noise.
SelectiveKD builds upon knowledge distillation (KD) and pseudo-labeling (PL) to effectively utilize unannotated slices in a DBT stack. The teacher model provides a supervisory signal to the student model for all slices, and PL is used to selectively include unannotated slices with high-confidence predictions.
Experiments on a large real-world dataset of over 10,000 DBT exams show that SelectiveKD significantly improves cancer detection performance and generalization across different device manufacturers, without requiring annotations from the target devices.
The framework can achieve similar cancer detection performance with considerably fewer labeled images, leading to large potential annotation cost savings for building practical CAD systems for DBT.
To Another Language
from source content
arxiv.org
Key Insights Distilled From
by Laurent Dill... at arxiv.org 09-26-2024
https://arxiv.org/pdf/2409.16581.pdfDeeper Inquiries