Alapfogalmak
A novel one-class graph embedding classification (OCGEC) framework that leverages graph neural networks to effectively detect backdoor attacks in deep neural network models without requiring any knowledge of the attack strategy or poisoned training data.
Kivonat
The paper proposes a novel OCGEC framework for detecting backdoor attacks in deep neural network (DNN) models. The key highlights are:
-
A novel model-to-graph approach is developed to efficiently capture the structural information and weight features of DNN models, which proves highly effective for backdoor detection.
-
OCGEC utilizes a pre-trained graph auto-encoder (GAE) to learn meaningful representations of the DNN graphs, and combines it with a one-class classification optimization objective to form a classification boundary between backdoor and benign models.
-
OCGEC only requires a small amount of clean data and does not rely on any knowledge of the backdoor attacks, making it well-suited for real-world applications.
-
Extensive experiments show that OCGEC achieves excellent performance in detecting backdoor models against various backdoor attacks across diverse datasets, outperforming state-of-the-art backdoor detection techniques.
-
OCGEC exhibits strong generalization capabilities in identifying previously unseen backdoors, demonstrating its effectiveness and robustness.
Statisztikák
Deep Neural Networks (DNNs) have demonstrated remarkable performance in solving various real-world problems.
The high cost of training DNNs has led to the rise of third-party online machine learning platforms, which creates opportunities for attackers to manipulate DNN models through backdoor attacks.
Backdoor attacks can grant the attacker complete control over the model's outputs when triggered by special inputs, while the model works well on normal inputs.
Existing backdoor detection methods often rely on specific assumptions about the attack strategies and require full access to the datasets, limiting their practicality in real-world scenarios.
Idézetek
"Deep Neural Networks (DNNs) have demonstrated remarkable performance in solving various real-world problems."
"Backdoor attacks can manipulate DNN models by injecting specific triggers into the training dataset or creating a backdoor neural network. Models under backdoor attacks work well on normal inputs. However, when triggered by special inputs, these backdoors grant the attacker complete control over the model's outputs."
"Existing detection methods typically require training data access, neural network architectures, types of triggers, target classes, etc. Our OCGEC, however, is capable of overcoming these issues."