Unveiling the Reliability of Concept Removal Methods for Diffusion Models
The author investigates the effectiveness of safety mechanisms in dealing with a wide range of prompts for T2I diffusion models, introducing Ring-A-Bell as a red-teaming tool to assess and reveal limitations. The approach involves generating problematic prompts to evaluate concept removal methods and online services.