Researchers say AI just broke every benchmark for autonomous cyber capability

Researchers say AI just broke every benchmark for autonomous cyber capability
Anthropic’s Claude Mythos Preview and OpenAI’s GPT-5.5 have pushed autonomous cybersecurity capabilities beyond the rapid growth trend tracked by the UK’s AI Security Institute, with both models completing advanced simulated attack tasks faster than expected. Palo Alto Networks also found these models can rapidly turn vulnerabilities into exploit paths, prompting urgent enterprise security priorities as AI-driven attacks accelerate. #ClaudeMythosPreview #GPT5.5 #AISI #PaloAltoNetworks

Keypoints

  • Claude Mythos Preview and GPT-5.5 surpassed the AISI’s projected autonomous cyber capability trend.
  • Claude Mythos Preview became the first model to complete both AISI cyber ranges.
  • GPT-5.5 also solved the “The Last Ones” simulation in AISI testing.
  • Palo Alto Networks found the latest models can turn vulnerabilities into critical exploit paths quickly.
  • The AISI and Palo Alto Networks urged stronger vulnerability management, detection, and faster security response.

Read More: https://cyberscoop.com/ai-autonomous-cyber-capability-benchmarks-broken-gpt5-claude-mythos/