Core Concepts
The proposed end-to-end Transformer-based model efficiently translates visible light images into high-fidelity infrared images by leveraging a Dynamic Fusion Aggregation Module and an Enhanced Perception Attention Module to capture and preserve crucial textural and color features.
Abstract
The paper introduces a novel end-to-end Transformer-based model for efficiently translating visible light images into high-quality infrared images. The key highlights are:
The model incorporates a Color Perception Adapter (CPA) to extract and adapt RGB information from visible light images to the infrared domain, and an Enhanced Feature Mapping Module (EFM) to capture intricate textural details.
The Dynamic Fusion Aggregation Module (DFA) integrates the features extracted from visible light and maps them onto a latent space, enabling a more precise capture and characterization of imagery information across diverse environments and conditions.
The Enhanced Perception Attention Module (EPA) mitigates information loss due to obstructions or low-light conditions, enhancing the image's details and structure to augment the textural detail features.
The Transformer module integrates global contextual information to refine the final image output.
Comprehensive experiments on multiple datasets demonstrate the superior performance of the proposed model compared to existing methods, both qualitatively and quantitatively. The model's efficiency, with low computational overhead, makes it a practical and scalable solution for real-world applications requiring high-quality infrared imaging.
Stats
The paper reports the following key metrics:
PSNR: 14.01
SSIM: 0.48
Quotes
"The Dynamic Fusion Aggregation Module (DFA) plays an essential role in integrating features extracted from the visible spectrum and projecting them into a latent space that mediates between visible and infrared domains."
"The Enhanced Perception Attention Module (EPA) significantly contributes to the model's effectiveness by mitigating information loss due to occlusions or low-light conditions."