The paper proposes a novel framework called AdvLogo for generating adversarial patches against object detectors. The key ideas are:
Hypothesis: Every semantic space contains an adversarial subspace where images can cause detectors to fail in recognizing objects.
Approach: Leverage the semantic understanding of the diffusion denoising process to drive the process to adversarial subareas by perturbing the latent and unconditional embeddings at the last timestep.
Optimization in Frequency Domain: Apply perturbation to the latent in the frequency domain using Fourier Transform to mitigate the distribution shift and preserve image quality.
Unconditional Embeddings Optimization: Synchronize the optimization of unconditional embeddings with the latent variables to significantly boost the adversarial effectiveness while maintaining visual quality.
Gradient Approximation: Derive a simple gradient approximation method based on the chain rule to efficiently update both the latent variables and unconditional embeddings during the denoising process.
Extensive experiments demonstrate that AdvLogo achieves strong attack performance against diverse object detectors while maintaining high visual quality, outperforming existing patch attack methods.
Til et annet språk
fra kildeinnhold
arxiv.org
Viktige innsikter hentet fra
by Boming Miao,... klokken arxiv.org 09-12-2024
https://arxiv.org/pdf/2409.07002.pdfDypere Spørsmål