Anthropic Data Leak Reveals Upcoming Mythos AI Model


TL;DR

  • Data Breach: A CMS misconfiguration exposed nearly 3,000 Anthropic internal documents, including details of an unreleased model called Claude Mythos.
  • Model Capabilities: Anthropic describes Mythos as a step change in reasoning and cybersecurity, positioning it above its current Opus lineup.
  • Release Strategy: Anthropic plans to release Mythos first to cyber defense organizations before making it broadly available.
  • Competitive Context: OpenAI finished pretraining its own frontier model, codenamed Spud, as both companies prepare for IPOs later in 2026.

AI safety startup Anthropic confirmed on March 27 that a basic content management misconfiguration had exposed nearly 3,000 internal documents, inadvertently revealing details of an unreleased model called Claude Mythos that Anthropic describes as a step change in reasoning and cybersecurity capabilities. Anthropic now faces the uncomfortable task of defending both its operational security and its competitive strategy at the same time.

According to Fortune, which broke the story on March 26, a default setting in Anthropic’s content management system made all uploaded assets publicly accessible. Among the exposed files were draft blog posts revealing details about a model with the internal product name Capybara. Both Anthropic and OpenAI are racing to release flagship models ahead of planned IPOs later in 2026.

Leaked Details Reveal Anthropic’s Strongest Model

Leaked draft blog posts describe Capybara as a new tier of model above Opus, which was previously Anthropic’s top-performing offering. Internal documents position Capybara as larger and more intelligent than the Opus lineup, with dramatically higher scores in software coding, academic reasoning, and cybersecurity. One draft described Mythos as “currently far ahead of any other AI model in cyber capabilities” and warned it presages a wave of models that can exploit vulnerabilities in ways that far outpace defenders.

Leaked benchmark comparisons showed gains across multiple domains, though Anthropic has not publicly released specific numbers or methodology details. Introducing a fourth tier above Opus positions Anthropic to compete directly with OpenAI’s frontier models on cybersecurity, a domain where government and enterprise contracts carry outsized value.

Given those capabilities, Anthropic plans to release Mythos first to defense organizations, giving them a head start at hardening their systems before broader public availability. Security researchers Roy Paz of LayerX Security and Alexandre Pauwels of the University of Cambridge independently reviewed the leaked data for Fortune, confirming the authenticity of the exposed documents. Anthropic is already testing Mythos with a small group of early access customers, suggesting the model is further along in development than competitors may have expected.