ADVREPAIR: A Novel Approach for Provable Repair of Adversarial Attacks in Deep Neural Networks
ADVREPAIR is a novel approach that leverages formal verification to construct patch modules that can be seamlessly integrated into the original neural network, delivering provable and specialized repairs within the robustness neighborhood. Additionally, ADVREPAIR incorporates a heuristic mechanism for assigning patch modules, allowing this defense against adversarial attacks to generalize to other inputs, significantly improving the overall robustness of the network.