The proposed VIFNet is an end-to-end multimodal fusion network that effectively combines visible and infrared modalities to restore high-quality haze-free images, outperforming state-of-the-art single-modality dehazing methods.
The paper proposes a novel framework that directly estimates the intermediate distortion flow from the underlying global shutter image to the rolling shutter image, enabling efficient and high-quality rolling shutter correction.
The core message of this paper is to propose an automated framework for generating realistic image splicing datasets using state-of-the-art image composition techniques, in order to bridge the gap between the quality and quantity of existing image forgery datasets and the real-world manipulations.
The core message of this paper is to introduce a novel problem called 'Small Object Semantic Correspondence (SOSC)' and propose a Keypoint Bounding box-centered Cropping (KBC) method to address the challenge of closely located keypoints associated with small objects, which leads to the fusion of their features and makes it difficult to identify the corresponding keypoints.
The proposed OBJ-GSP algorithm leverages semantic segmentation to extract object-level geometric structures and preserves them during image stitching, achieving superior alignment and distortion prevention compared to existing methods.