The content discusses the vulnerability of deep neural networks to adversarial noise and the use of pre-processing methods to mitigate this vulnerability. It introduces the concept of the robustness degradation effect in white-box settings and proposes a method called Joint Adversarial Training based Pre-processing (JATP) defense to address this issue. The JATP defense utilizes full adversarial examples and a feature similarity-based adversarial risk to enhance the inherent robustness of pre-processing models. Experimental results demonstrate the effectiveness of JATP in mitigating the robustness degradation effect across different target models.
Іншою мовою
із вихідного контенту
arxiv.org
Ключові висновки, отримані з
by Dawei Zhou,N... о arxiv.org 03-26-2024
https://arxiv.org/pdf/2106.05453.pdfГлибші Запити