AI Models Cross Dangerous Cybersecurity Threshold in UK Government Testing
Anthropic's Claude Mythos Preview became the first AI model to pass the UK AI Security Institute's rigorous cyber-offense benchmark, successfully executing 3 out of 10 complete domain takeovers. OpenAI's GPT-5.5 quickly followed, with researchers noting that frontier AI cyber-offense capabilities are now doubling every four months—a concerning acceleration in AI systems' ability to conduct sophisticated cyberattacks.
This represents a critical inflection point where AI systems can autonomously execute advanced cyber operations, fundamentally changing threat landscapes and requiring immediate policy responses.
cybersecurity
ai safety
frontier models
government testing