The content discusses the vulnerability of deep neural networks to adversarial noise and the use of pre-processing methods to mitigate this vulnerability. It introduces the concept of the robustness degradation effect in white-box settings and proposes a method called Joint Adversarial Training based Pre-processing (JATP) defense to address this issue. The JATP defense utilizes full adversarial examples and a feature similarity-based adversarial risk to enhance the inherent robustness of pre-processing models. Experimental results demonstrate the effectiveness of JATP in mitigating the robustness degradation effect across different target models.
Na inny język
z treści źródłowej
arxiv.org
Głębsze pytania