The content discusses the effectiveness of gradient-based trigger inversion in detecting backdoor attacks and introduces a new method, Gradient Shaping (GRASP), to enhance backdoor attack detection by reducing the trigger effective radius. The study analyzes the impact of GRASP on various environmental factors, learning optimizers, noise levels, and datasets. It evaluates the performance of GRASP against different backdoor detection methods and backdoor attacks across multiple datasets.
Іншою мовою
із вихідного контенту
arxiv.org
Ключові висновки, отримані з
by Rui Zhu,Di T... о arxiv.org 03-05-2024
https://arxiv.org/pdf/2301.12318.pdfГлибші Запити