SALAD-Bench: A Comprehensive Safety Benchmark for Large Language Models
Large Language Models (LLMs) require robust safety evaluations, leading to the development of SALAD-Bench, a comprehensive benchmark for assessing LLMs' safety, attack, and defense methods.