The content discusses the vulnerability of deep neural networks to adversarial noise and the use of pre-processing methods to mitigate this vulnerability. It introduces the concept of the robustness degradation effect in white-box settings and proposes a method called Joint Adversarial Training based Pre-processing (JATP) defense to address this issue. The JATP defense utilizes full adversarial examples and a feature similarity-based adversarial risk to enhance the inherent robustness of pre-processing models. Experimental results demonstrate the effectiveness of JATP in mitigating the robustness degradation effect across different target models.
เป็นภาษาอื่น
จากเนื้อหาต้นฉบับ
arxiv.org
ข้อมูลเชิงลึกที่สำคัญจาก
by Dawei Zhou,N... ที่ arxiv.org 03-26-2024
https://arxiv.org/pdf/2106.05453.pdfสอบถามเพิ่มเติม