The paper proposes a novel framework called AdvLogo for generating adversarial patches against object detectors. The key ideas are:
Hypothesis: Every semantic space contains an adversarial subspace where images can cause detectors to fail in recognizing objects.
Approach: Leverage the semantic understanding of the diffusion denoising process to drive the process to adversarial subareas by perturbing the latent and unconditional embeddings at the last timestep.
Optimization in Frequency Domain: Apply perturbation to the latent in the frequency domain using Fourier Transform to mitigate the distribution shift and preserve image quality.
Unconditional Embeddings Optimization: Synchronize the optimization of unconditional embeddings with the latent variables to significantly boost the adversarial effectiveness while maintaining visual quality.
Gradient Approximation: Derive a simple gradient approximation method based on the chain rule to efficiently update both the latent variables and unconditional embeddings during the denoising process.
Extensive experiments demonstrate that AdvLogo achieves strong attack performance against diverse object detectors while maintaining high visual quality, outperforming existing patch attack methods.
toiselle kielelle
lähdeaineistosta
arxiv.org
Tärkeimmät oivallukset
by Boming Miao,... klo arxiv.org 09-12-2024
https://arxiv.org/pdf/2409.07002.pdfSyvällisempiä Kysymyksiä