Simons, S. J., & Papiez, B. W. (2024). SpineFM: Leveraging Foundation Models for Automatic Spine X-ray Segmentation. arXiv preprint arXiv:2411.00326.
This research paper introduces a novel pipeline called SpineFM for automatic segmentation and identification of vertebral bodies in cervical and lumbar spine radiographs. The study aims to address the limitations of existing methods by leveraging foundation models and an inductive approach for improved accuracy and efficiency.
SpineFM utilizes a multi-step pipeline combining different models: a Mask R-CNN for initial vertebrae localization, Medical-SAM-Adaptor (Med-SA) as a foundation model for refined segmentation, a ResNet classifier for vertebra type identification, and a small fully connected neural network for predicting subsequent vertebrae locations based on previous ones. This inductive approach exploits the spine's anatomical structure for efficient and robust segmentation. The method was trained and evaluated on two publicly available datasets: NHANES II and CSXA.
SpineFM achieved state-of-the-art performance on both datasets, demonstrating superior accuracy compared to existing methods like VertXNet, Mask R-CNN, nnUNet, and MDR2U-Net. The pipeline achieved an overall Dice Similarity Coefficient (DSC) of 0.925 and 0.916 for cervical and lumbar sections respectively on the NHANES II dataset, and successfully identified 99.5% of labeled vertebrae on the CSXA dataset.
SpineFM presents a significant advancement in automated spine X-ray segmentation by effectively leveraging foundation models and an inductive approach. The method's robustness, accuracy, and efficiency make it a promising tool for clinical applications, potentially aiding in the diagnosis and treatment of spinal conditions.
This research contributes significantly to the field of medical image analysis, particularly in the area of spine X-ray interpretation. The proposed SpineFM pipeline has the potential to automate and expedite the process of vertebrae segmentation, leading to faster and more accurate diagnoses, reduced clinician workload, and improved patient care.
While SpineFM demonstrates impressive performance, the study acknowledges limitations related to image quality and ground truth mask accuracy in the datasets used. Future research could explore the application of SpineFM to larger and more diverse datasets, investigate the impact of different foundation models, and refine the pipeline for improved handling of challenging cases with poor image quality or anatomical variations.
To Another Language
from source content
arxiv.org
Deeper Inquiries