Anthropic Says Claude Now Writes 80% of Its Production Code, Targets Full Self-Improving AI


TL;DR

  • Code Authorship: Anthropic says Claude now writes more than 80% of code merged into its production systems.
  • Review Risk: The disclosure shifts the enterprise question from whether AI can generate code to whether teams can review, test, and approve it safely.
  • Autonomy Caveat: Anthropic still frames full recursive self-improvement as a future possibility, not something Claude has already achieved.
  • Control Gates: Teams using AI coding agents need audit trails, security checks, rollback paths, and human approval before AI-authored code reaches production.

Anthropic says Claude wrote more than 80% of the code it merged in May 2026, turning its own engineering workflow into a test of AI coding at production scale. For enterprise teams, the disclosure moves the practical question from whether models can generate code to whether review, security testing, and approval gates can keep pace before AI-written changes reach live systems.

Engineers remain inside the loop. They choose the work, review generated changes, and decide what is merged. Anthropic says code shipped per engineer per quarter has increased eightfold from its 2021-2025 baseline, a gain it connects to a broader self-optimizing loop in which AI helps build and test AI systems. For now, that loop still runs through human review before production code lands.

How Claude Changed Anthropic’s Engineering Loop

Claude Code is the product behind that shift. What began as a coding agent now feeds Anthropic’s own production workflow, with engineers reviewing model-written changes before merge. The tool has also become a successful commercial developer product, so the same control problem follows enterprise customers: generated code needs a clear path through review, security checks, and approval before it reaches production.

Anthropic says Claude’s success rate on its most open-ended internal engineering tasks reached 76% in May after a 50-point rise over six months. That helps explain why the company is treating AI coding as a development loop, but the remaining human role is still larger than checking syntax or approving volume. An employee quoted by Anthropic described that advantage as “seeing the bigger picture” beyond the immediate task.

Control now depends on review capacity as much as generation speed. Anthropic’s current internal workflow uses an automated reviewer to inspect proposed code changes for bugs, security flaws, and other defects before merge. Claude Code also asks for permission before modifying files or running commands.