The content discusses the effectiveness of gradient-based trigger inversion in detecting backdoor attacks and introduces a new method, Gradient Shaping (GRASP), to enhance backdoor attack detection by reducing the trigger effective radius. The study analyzes the impact of GRASP on various environmental factors, learning optimizers, noise levels, and datasets. It evaluates the performance of GRASP against different backdoor detection methods and backdoor attacks across multiple datasets.
A otro idioma
del contenido fuente
arxiv.org
Ideas clave extraídas de
by Rui Zhu,Di T... a las arxiv.org 03-05-2024
https://arxiv.org/pdf/2301.12318.pdfConsultas más profundas