Leading AI models are more vulnerable to malicious prompts than vendors claim

Kylie Bielby 2026-05-28T12:46:30+00:00

Full Report

Cisco’s evaluation of 15 leading AI models from OpenAI, Anthropic, Google, Amazon and xAI “found that single-turn attack success rate (ASR) is not a reliable proxy for what happens when an attacker can adapt across turns,” researchers Nicholas Conley and Amy Chang wrote. Their tests revealed that AI models were much more susceptible to multi-turn…

Analysis Summary