Gradient Cuff proposes a two-step method to detect jailbreak attempts on Large Language Models by exploring refusal loss landscapes.