LLM Agents Can Autonomously Exploit Real-World One-Day Vulnerabilities
LLM agents, specifically GPT-4, can autonomously exploit real-world one-day vulnerabilities in various systems, including websites, container management software, and vulnerable Python packages, with an 87% success rate. This capability far exceeds that of other LLMs and open-source vulnerability scanners.