Evaluating the Safety Risks of AI Agents in Simulated Human-AI Interactions
AI agents exhibit substantial safety risks across multiple dimensions when interacting with simulated human users, with larger models generally showing lower risks but varying strengths and weaknesses.