核心概念
A new transformer-based lymph node detection model, LN-DETR, that significantly improves performance by incorporating location-debiased query selection and contrastive query representation learning.
要約
The authors propose a new lymph node (LN) detection transformer model, LN-DETR, to address the challenging task of accurately detecting LNs in CT scans. The key contributions are:
-
Enhancing the 2D backbone with a multi-scale 2.5D feature fusion to incorporate 3D context explicitly.
-
Introducing a location debiased query selection module to choose LN queries with higher localization accuracy as the decoder query's initialization. This addresses the misalignment between classification confidence and localization quality.
-
Employing a query contrastive learning module at the decoder's output to explicitly reinforce LN queries towards their best-matched ground-truth queries, reducing false positives and duplicate predictions.
The proposed LN-DETR is trained and evaluated on a large-scale dataset of 1067 patients with over 10,000 labeled LNs across different body parts and diseases. Compared to previous leading methods, LN-DETR significantly improves the average recall by over 4-5% at the same false positive rates in both internal and external testing. It also achieves the top performance on the universal lesion detection task using the NIH DeepLesion benchmark.
統計
LN-DETR is trained and evaluated on a dataset of 1067 patients with over 10,000 labeled lymph nodes.
The dataset covers different body parts (neck, chest, abdomen) and various diseases (head & neck cancer, esophageal cancer, lung cancer, COVID, etc.).
LN-DETR achieves an average recall of 56.27% across 0.5 to 4 false positives per patient in internal testing, outperforming previous leading methods by over 4-5%.
On the external testing set, LN-DETR maintains a high average recall of 52.04%, demonstrating strong generalization ability.
On the NIH DeepLesion benchmark for universal lesion detection, LN-DETR achieves the top performance of 88.46% average recall across 0.5 to 4 false positives per image.
引用
"Lymph node (LN) assessment is a critical, indispensable yet very challenging task in the routine clinical workflow of radiology and oncology."
"Finding scatteredly distributed, low-contrast clinically relevant LNs in 3D CT is difficult even for experienced physicians under high inter-observer variations."
"Previous automatic LN detection works typically yield limited recall and high false positives (FPs) due to adjacent anatomies with similar image intensities, shapes or textures (vessels, muscles, esophagus, etc)."