A lightweight defense mechanism, PAD-FT, that effectively disinfects poisoned deep neural network models without requiring additional clean data.
Venom proposes a novel approach to enhance the survivability of backdoor attacks against model reconstruction-based defenses by coupling decision paths, preserving attack capabilities while improving survivability.
The author proposes using model merging as an effective defense against backdoor attacks on language models, showcasing robustness and versatility in various contexts.