The authors propose a new lymph node (LN) detection transformer model, LN-DETR, to address the challenging task of accurately detecting LNs in CT scans. The key contributions are:
Enhancing the 2D backbone with a multi-scale 2.5D feature fusion to incorporate 3D context explicitly.
Introducing a location debiased query selection module to choose LN queries with higher localization accuracy as the decoder query's initialization. This addresses the misalignment between classification confidence and localization quality.
Employing a query contrastive learning module at the decoder's output to explicitly reinforce LN queries towards their best-matched ground-truth queries, reducing false positives and duplicate predictions.
The proposed LN-DETR is trained and evaluated on a large-scale dataset of 1067 patients with over 10,000 labeled LNs across different body parts and diseases. Compared to previous leading methods, LN-DETR significantly improves the average recall by over 4-5% at the same false positive rates in both internal and external testing. It also achieves the top performance on the universal lesion detection task using the NIH DeepLesion benchmark.
Para Outro Idioma
do conteúdo original
arxiv.org
Principais Insights Extraídos De
by Qinji Yu,Yir... às arxiv.org 04-08-2024
https://arxiv.org/pdf/2404.03819.pdfPerguntas Mais Profundas