AI Models Cross Dangerous Cybersecurity Threshold in UK Government Testing

Thursday, May 7, 2026

Anthropic's Claude Mythos Preview became the first AI model to pass the UK AI Security Institute's rigorous cyber-offense benchmark, successfully executing 3 out of 10 complete domain takeovers. OpenAI's GPT-5.5 quickly followed, with researchers noting that frontier AI cyber-offense capabilities are now doubling every four months—a concerning acceleration in AI systems' ability to conduct sophisticated cyberattacks.

Read the source →

This represents a critical inflection point where AI systems can autonomously execute advanced cyber operations, fundamentally changing threat landscapes and requiring immediate policy responses.

cybersecurity

ai safety

frontier models

government testing