Bibliographic Information: Lyu, P., Yeung, P., Cheng, X., Yu, X., Wu, C., & Rajapakse, J. C. (2020). Efficient Fourier Filtering Network with Contrastive Learning for UAV-based Unaligned Bi-modal Salient Object Detection. Journal of LaTeX Class Files, 18(9), 1-8.
Research Objective: This paper addresses the challenge of real-time salient object detection in unaligned RGB and thermal images captured by UAVs, aiming to improve both accuracy and efficiency compared to existing methods.
Methodology: The authors propose AlignSal, a novel deep learning model that leverages contrastive learning and Fourier filtering for efficient and effective bi-modal salient object detection. The model consists of a dual-stream encoder, a semantic contrastive alignment loss (SCAL), a synchronized alignment fusion (SAF) module, and a decoder. SCAL aligns RGB and thermal modalities at the semantic level, while SAF performs pixel-level alignment and fusion using an FFT-based multiple-filtering strategy.
Key Findings: Extensive experiments on the UAV RGB-T 2400 dataset and three weakly aligned datasets demonstrate that AlignSal achieves state-of-the-art performance across various evaluation metrics while maintaining real-time inference speed. Notably, AlignSal outperforms the previous top-performing model (MROS) in terms of accuracy and efficiency, with a significant reduction in parameters and floating point operations.
Main Conclusions: AlignSal effectively addresses the challenges of unaligned bi-modal salient object detection in UAV images by leveraging contrastive learning and Fourier filtering. The proposed model achieves a balance between accuracy and efficiency, making it suitable for real-time applications on UAVs.
Significance: This research contributes to the field of computer vision, specifically in the area of salient object detection, by introducing a novel and efficient model for processing unaligned bi-modal images captured by UAVs. The proposed approach has potential applications in various domains, including surveillance, search and rescue, and environmental monitoring.
Limitations and Future Research: While AlignSal demonstrates promising results, future research could explore the integration of additional modalities, such as depth or LiDAR data, to further enhance the model's robustness and accuracy in complex environments. Additionally, investigating the model's performance on other UAV-based datasets and real-world scenarios would provide valuable insights into its practical applicability.
To Another Language
from source content
arxiv.org
Key Insights Distilled From
by Pengfei Lyu,... at arxiv.org 11-07-2024
https://arxiv.org/pdf/2411.03728.pdfDeeper Inquiries