The content discusses the effectiveness of gradient-based trigger inversion in detecting backdoor attacks and introduces a new method, Gradient Shaping (GRASP), to enhance backdoor attack detection by reducing the trigger effective radius. The study analyzes the impact of GRASP on various environmental factors, learning optimizers, noise levels, and datasets. It evaluates the performance of GRASP against different backdoor detection methods and backdoor attacks across multiple datasets.
Na inny język
z treści źródłowej
arxiv.org
Głębsze pytania