Zer0-Jack: Jailbreaking Black-Box Multi-Modal Large Language Models Using a Memory-Efficient Gradient-Based Method
Zer0-Jack, a novel jailbreaking method, effectively attacks black-box Multi-modal Large Language Models (MLLMs) by leveraging zeroth-order optimization and patch coordinate descent to generate malicious image inputs with high success rates and low memory usage.