Core Concepts
The core message of this paper is to propose an automated framework for generating realistic image splicing datasets using state-of-the-art image composition techniques, in order to bridge the gap between the quality and quantity of existing image forgery datasets and the real-world manipulations.
Abstract
The paper discusses the problem of image forgery detection, where the advancement in image editing tools has outpaced the progress in manipulation detection techniques. The key challenge is the lack of large, high-quality, and realistic image forgery datasets for training deep learning models.
The authors propose an automated framework to generate a splicing dataset using image composition techniques. The framework involves the following steps:
Object Placement: The authors leverage the Object Placement Assessment (OPA) dataset to obtain rational object placements and background images.
Image Matting: The authors use the MatteFormer model to enhance the segmentation masks of the foreground objects, producing higher-quality alpha mattes.
Image Blending: The authors combine the foreground objects and background images using alpha blending.
Image Harmonization: The authors employ the Harmonizer model to harmonize the composite images, making the spliced regions more seamless and difficult to detect.
The authors evaluate the generated dataset using the Early Fusion image manipulation detection model, which achieves a lower detection accuracy compared to other existing datasets, indicating the increased realism and difficulty of the generated images.
The paper also discusses future work, including expanding the dataset coverage by using a wider range of image composition techniques and object sources, as well as exploring the potential of image composition methods for other types of image forgery, such as copy-move.
Stats
The authors generated a dataset of 24,964 spliced images, with 3,588 test images and 21,376 training images.
Quotes
"None of the previously published splicing datasets have the characteristics of artifact diversity, high number of images, ground truth and realistic manipulations at the same time."
"Employing image composition methods for image forgery data generation should not be limited to image splicing. There is also a potential in copy move forgery for object placement to generate rational forgeries and image matting to reduce clipping artifacts."