The content discusses why gradient-based trigger inversion is effective at detecting backdoor attacks and introduces Gradient Shaping (GRASP), an attack enhancement that makes backdoors harder for trigger inversion to detect by reducing the trigger effective radius. The study analyzes how environmental factors, learning optimizers, noise levels, and datasets affect GRASP. It also evaluates GRASP-enhanced backdoor attacks against different backdoor detection methods across multiple datasets.
by Rui Zhu, Di T... at arxiv.org 03-05-2024
https://arxiv.org/pdf/2301.12318.pdf