Crescendo: A Novel Multi-Turn Jailbreak Attack Targeting Aligned Large Language Models
Crescendo is a multi-turn jailbreaking technique that uses benign, human-readable prompts to gradually steer aligned large language models into performing unintended and potentially harmful tasks, thereby bypassing their safety measures.