Key Concepts
This paper introduces an attention-mask-guided Projected Gradient Descent (PGD) adversarial attack. According to the authors, it balances stealth, efficiency, and explainability better than existing methods, and it fools safety monitors for image classification that are based on explainable AI (XAI).
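To make the idea of a mask-guided PGD attack concrete, here is a minimal NumPy sketch. It is an illustration under stated assumptions, not the authors' implementation: the classifier is a toy linear softmax model with random weights, the function names (`loss_and_input_grad`, `masked_pgd`) are hypothetical, and the mask is a simple top-25% gradient-saliency mask standing in for whatever attention map the paper derives from its XAI method. The only change relative to standard L-infinity PGD is that each update is multiplied by a binary mask, so pixels outside the mask are never perturbed.

```python
import numpy as np


def loss_and_input_grad(W, b, x, y):
    """Cross-entropy loss of a linear softmax classifier and its gradient w.r.t. x."""
    z = W @ x + b
    z_shift = z - z.max()                          # stabilise the exponentials
    log_sum_exp = z.max() + np.log(np.exp(z_shift).sum())
    p = np.exp(z - log_sum_exp)                    # softmax probabilities
    onehot = np.zeros_like(p)
    onehot[y] = 1.0
    return log_sum_exp - z[y], W.T @ (p - onehot)


def masked_pgd(W, b, x, y, mask, eps=0.1, alpha=0.02, steps=20):
    """L-infinity PGD in which only pixels with mask == 1 may change."""
    x_adv = x.copy()
    for _ in range(steps):
        _, grad = loss_and_input_grad(W, b, x_adv, y)
        x_adv = x_adv + alpha * mask * np.sign(grad)   # masked ascent step
        x_adv = x + np.clip(x_adv - x, -eps, eps)      # project onto the eps-ball
        x_adv = np.clip(x_adv, 0.0, 1.0)               # keep a valid pixel range
    return x_adv


# Toy setup (all values are assumptions for illustration only).
rng = np.random.default_rng(0)
W = 0.05 * rng.normal(size=(10, 784))   # stand-in classifier on flattened 28x28 inputs
b = np.zeros(10)
x = rng.uniform(0.0, 1.0, size=784)
y = int(np.argmax(W @ x + b))           # use the model's own prediction as the label

loss_before, g0 = loss_and_input_grad(W, b, x, y)
mask = np.zeros_like(x)
mask[np.argsort(np.abs(g0))[-196:]] = 1.0   # hypothetical mask: top 25% saliency pixels

eps = 0.1
x_adv = masked_pgd(W, b, x, y, mask, eps=eps)
loss_after, _ = loss_and_input_grad(W, b, x_adv, y)
print(f"loss before: {loss_before:.4f}, loss after: {loss_after:.4f}")
print(f"pixels changed: {int(np.sum(x_adv != x))} of {x.size}")
```

Restricting the update to masked pixels is what trades a little attack strength for a smaller, more localized perturbation. In a real setting the mask would come from the attention or explanation map of the model under attack rather than from raw gradient magnitude.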
Statistics
17% faster than benchmark PGD.
0.01% less effective, in exchange for 14% more stealth.
12% increase in attack efficiency (clean accuracy baseline: 55%).
10% increase in attack stealth.
97% confidence in fooling the XAI-based safety monitor.
CIFAR-10 image resolution: 32×32 pixels.
MNIST image resolution: 28×28 pixels.