The content discusses the vulnerability of deep neural networks to adversarial noise and the use of pre-processing methods to mitigate this vulnerability. It introduces the concept of the robustness degradation effect in white-box settings and proposes a method called Joint Adversarial Training based Pre-processing (JATP) defense to address this issue. The JATP defense utilizes full adversarial examples and a feature similarity-based adversarial risk to enhance the inherent robustness of pre-processing models. Experimental results demonstrate the effectiveness of JATP in mitigating the robustness degradation effect across different target models.
Başka Bir Dile
kaynak içeriğinden
arxiv.org
Önemli Bilgiler Şuradan Elde Edildi
by Dawei Zhou,N... : arxiv.org 03-26-2024
https://arxiv.org/pdf/2106.05453.pdfDaha Derin Sorular