Temel Kavramlar
This work proposes a CLIP-based framework, OCC-CLIP, to determine if a given image was generated by the same model as a set of few-shot examples, even when the target model cannot be accessed.
Özet
The paper addresses the problem of origin attribution for generated images, where the goal is to identify the model that generated a given image. The authors formulate this as a few-shot one-class classification task, where only a few images generated by a source model are available, and the source model cannot be accessed.
To solve this task, the authors propose OCC-CLIP, a CLIP-based framework that enables the identification of an image's source model, even among multiple candidates. The key components are:
- Treating the few images from the source model as the target class, and randomly sampled images as the non-target class.
- Optimizing learnable prompts for the target and non-target classes, respectively.
- Employing an adversarial data augmentation (ADA) technique to extend the coverage of the non-target class space and better approximate the boundary to the target space.
Extensive experiments on various generative models, including diffusion models and GANs, verify the effectiveness of the OCC-CLIP framework. The authors also demonstrate the applicability of their solution to a real-world commercial image generation system, DALL·E-3.
The paper further explores the sensitivity of OCC-CLIP to factors such as the choice of source models, non-target datasets, number of shots, and image preprocessing. It also shows the effectiveness of OCC-CLIP in multi-source origin attribution scenarios.
İstatistikler
Recent visual generative models can produce high-quality images, raising concerns about intellectual property protection and accountability for misuse.
The authors formulate the origin attribution problem as a few-shot one-class classification task, where only a few images generated by a source model are available, and the source model cannot be accessed.
Extensive experiments are conducted on 8 generative models, including diffusion models and GANs, as well as the recently released DALL·E-3 API.
Alıntılar
"Recent progress in visual generative models enables the generation of high-quality images. To prevent the misuse of generated images, it is important to identify the origin model that generates them."
"We aim to conduct origin attribution in a practical open-world setting (Fig. 1), where model parameters cannot be accessed and only a few samples generated by the model are available."