This summary covers the effectiveness of gradient-based trigger inversion in detecting backdoor attacks and introduces Gradient Shaping (GRASP), a method that strengthens backdoor attacks against such detection by reducing the trigger effective radius. The study analyzes GRASP across various environmental factors, learning optimizers, noise levels, and datasets. It also evaluates GRASP against different backdoor detection methods and in combination with different backdoor attacks on multiple datasets.
arxiv.org