Overview
On May 22–23, 2025, Anthropic released Claude Opus 4 and Claude Sonnet 4 simultaneously — its most capable model family to date. The release marked a definitive shift in Anthropic’s positioning: Claude was no longer primarily a conversational assistant but a designed-for-agentic-work AI capable of sustained, multi-hour autonomous task completion.
Claude Opus 4 achieved 72.5% on SWE-bench Verified — the highest score any model had achieved on this coding benchmark at the time of release, surpassing Claude 3.7’s previous record of 70.3%.
Key Capabilities
Extended Agentic Workflows
Claude 4’s primary design goal, according to Anthropic, was reliability in multi-hour autonomous tasks. Previous models (including Claude 3.7) could handle extended reasoning in a single session, but would degrade in quality or lose context in sustained workflows spanning hours or days. Claude 4 addressed:
- Context persistence: Maintaining coherent task state over extended operations
- Error recovery: Detecting when sub-tasks had failed and replanning without human intervention
- Tool use fidelity: More consistent and accurate use of code execution, file access, web browsing, and external APIs
Claude Code
Anthropic simultaneously expanded Claude Code — the terminal-based AI coding agent launched in preview with Claude 3.7 — into a full product. Claude Code could:
- Navigate and modify large codebases autonomously
- Write, test, and debug code in iterative loops
- Handle multi-file refactors and architectural changes
- Run in the background as a software engineering “teammate”
Safety Architecture
Consistent with Anthropic’s “responsible scaling policy,” Claude 4’s release was accompanied by a detailed safety card documenting:
- Dangerous capability evaluations (bioweapons uplift, cyberoffense, CBRN risks)
- Behavioral testing results for deception, manipulation, and autonomy thresholds
- Pre-deployment red-teaming methodology
The Claude 4.x Iteration Cycle
Following the initial release, Anthropic shipped successive capability updates:
| Model | Release Date | Key Addition |
|---|---|---|
| Claude Opus 4 | May 22, 2025 | Initial release, SWE-bench 72.5% |
| Claude Sonnet 4 | May 23, 2025 | Fast/cost-efficient tier |
| Claude Opus 4.5 | November 24, 2025 | Enhanced long-context handling |
| Claude Opus 4.6 | February 5, 2026 | Improved tool use, reduced refusals |
| Claude Sonnet 4.6 | February 17, 2026 | Production-tier capability update |
| Claude Opus 4.7 | April 16, 2026 | Latest frontier model |
Context: Anthropic’s Mission and Commercial Reality
Anthropic’s stated mission is the “responsible development and maintenance of advanced AI for the long-term benefit of humanity.” Claude 4 represented its most direct argument that this mission is commercially compatible with frontier capability development.
The timing was notable: Anthropic had recently completed a major funding round, and Claude’s API revenue was its primary commercial validation. Claude 4’s agentic capabilities — particularly for enterprise software development, research, and data analysis workflows — were positioned as the core commercial justification for continued frontier investment.