The author proposes using model merging as an effective defense against backdoor attacks on language models, showcasing robustness and versatility in various contexts.
Venom proposes a novel approach to enhance the survivability of backdoor attacks against model reconstruction-based defenses by coupling decision paths, preserving attack capabilities while improving survivability.
A lightweight defense mechanism, PAD-FT, that effectively disinfects poisoned deep neural network models without requiring additional clean data.