Bibliographic Information: Mok, D., Bum, J., Tai, L.D., & Choo, H. (2024). Cross Feature Fusion of Fundus Image and Generated Lesion Map for Referable Diabetic Retinopathy Classification. arXiv preprint arXiv:2411.03618v1.
Research Objective: This paper proposes a novel cross-learning framework for referable diabetic retinopathy (DR) classification by leveraging lesion segmentation information obtained through a Swin U-Net architecture and integrating it with a Swin-T classification model using a cross-attention mechanism.
Methodology: The proposed method consists of two main steps. First, a Swin U-Net model is trained on the FGADR dataset to segment lesion maps from fundus images. This pre-trained model is then used to generate lesion maps for the EyePACS dataset. Second, a Swin-T model, pre-trained on ImageNet, is used for classification. This model takes both the original fundus image and the generated lesion map as input and utilizes a cross-attention mechanism to effectively combine features from both sources. The model is then fine-tuned on the EyePACS dataset for referable DR classification.
Key Findings: The proposed method achieves state-of-the-art performance on the EyePACS dataset for referable DR classification, surpassing existing methods by a significant margin. The integration of lesion segmentation information through the generated lesion maps and the use of cross-attention are identified as key contributors to the improved performance.
Main Conclusions: The proposed cross-learning framework effectively leverages lesion segmentation information and cross-attention to enhance the accuracy of referable DR classification. This approach offers a promising solution for improving automated DR screening and diagnosis.
Significance: This research significantly contributes to the field of medical image analysis, particularly in the context of DR classification. The proposed method addresses the limitations of traditional methods by incorporating lesion-specific information and utilizing advanced deep learning techniques.
Limitations and Future Research: The study is limited by its reliance on two specific datasets. Future research could explore the generalizability of the proposed method on larger and more diverse datasets. Additionally, investigating the interpretability of the model's decisions could further enhance its clinical applicability.
To Another Language
from source content
arxiv.org
Key Insights Distilled From
by Dahyun Mok, ... at arxiv.org 11-07-2024
https://arxiv.org/pdf/2411.03618.pdfDeeper Inquiries