Effective and Covert Clean-Label Attacks on Prompt-Based Learning Models
Contrastive Shortcut Injection (CSI) is an effective and stealthy clean-label attack method that leverages activation values to craft stronger shortcut features, enabling high attack success rates at low poisoning rates across full-shot and few-shot scenarios.