Anthropic

40 releases covered on ThursdAI · anthropic.com ↗

May 2026

Anthropic
New Models

Claude Opus 4.8

Anthropic ships Claude Opus 4.8 live mid-show

Anthropic released Claude Opus 4.8 during the episode, hitting 69.2% on SWE-bench Pro (up from 64.3% on 4.7 and ahead of GPT-5.5 at 58.6%), a new-best 57.9% on Humanity's Last Exam with tools, and 83.4% on OSWorld-Verified. It also shows a real long-context jump past the usual 200K cliff (85.9% GraphWalks BFS at 256K), with new thinking modes in the UI. Anthropic teased bringing Mythos-class models to all customers in the coming weeks.

69.2% SWE-bench Pro
Anthropic
Major Features & Updates

Dynamic Workflows in Claude Code

Dynamic Workflows and Ultra Code land in Claude Code

Alongside Opus 4.8, Anthropic shipped Dynamic Workflows and an Ultra Code mode in Claude Code, which Yam fired up live on the show. The headline proof point: Bun was ported from Zig to Rust — about 750K lines — via Dynamic Workflows, with 99.8% of the test suite passing and the port merged in 11 days.

750K lines Bun: Zig → Rust
Anthropic
Major Features & Updates

Claude off-peak usage boost

Anthropic doubles Claude usage limits outside peak hours for a limited time

Anthropic doubled Claude usage outside peak hours for a limited period, covering Claude Code and other Claude surfaces. The move gives heavy users substantially more agentic and coding throughput during off-peak windows.

Anthropic
Also Released

Colossus compute deal

SpaceX IPO filing reveals Anthropic pays $1.25B/month for Colossus compute

The SpaceX IPO filing revealed Anthropic is paying $1.25 billion per month for AI compute at the Memphis Colossus facility. The crew called it a bombastic deal that lets Anthropic serve far more inference at scale and feel less compute-constrained.

$1.25B monthly AI compute spend
Anthropic
Major Features & Updates

Claude Agent SDK monthly credits

Anthropic adds separate Claude Agent SDK credits to paid plans

Anthropic announced separate monthly Claude Agent SDK credits for Pro, Max, Team, and Enterprise subscribers, starting June 15, 2026. This gives agent builders a dedicated usage pool on top of regular plan limits.

April 2026

Anthropic
Products & Apps

Claude Design

Anthropic ships Claude Design research preview, Figma stock drops 7%

Anthropic released Claude Design as a research preview running on Opus 4.7 at claude.ai/design, and Figma stock dropped 7% on the news. Alex generated a full ThursdAI brand kit including logo, design tokens, and the episode opener videos end-to-end inside Claude Design, then had Codex pick up the kit and produce a GPT-5.5 launch video in 9 minutes. Anthropic also added a new usage meter to Claude Max settings.

Anthropic
Major Features & Updates

Claude Code Routines

Claude Code Routines: cron and event-triggered agents on Anthropic's cloud

Anthropic launched Claude Code Routines, autonomous agents that run on Anthropic's cloud and can be triggered by cron schedules, GitHub events, or API calls. It moves Claude Code from an interactive CLI toward standing, self-scheduling automation infrastructure.

Anthropic
Products & Apps

Claude Desktop app

Claude Desktop app rewritten from scratch

Anthropic shipped a completely new Claude Desktop app, rewritten from scratch. It was a quick TL;DR mention this week alongside the Opus 4.7 launch and Claude Code Routines.

Anthropic
New Models

Claude Opus 4.7

Claude Opus 4.7 drops live with 87.6% SWE-bench Verified and xhigh effort

Anthropic shipped Claude Opus 4.7 minutes before the show, scoring 87.6% on SWE-bench Verified and 64.3% on SWE-bench Pro, an 11-point jump over Opus 4.6 on the harder agentic coding eval. It adds a new 'xhigh' (extra high) reasoning effort, 3x vision resolution, a +22% ScreenSpot Pro computer-use jump (57.7% to 79.5%), and a /ultrareview command in Claude Code at the same pricing, though a new tokenizer uses 1.0-1.35x more tokens. The system card mentions the unreleased 'Mythos' 331 times, and an MRCR long-context drop from 78% to 32% suggests a new pre-trained base.

87.6% SWE-bench Verified+22% ScreenSpot Pro jump
Anthropic
New Models

Claude Mythos

Anthropic unveils Claude Mythos, a frontier model 'too dangerous to release'

Anthropic announced Claude Mythos Preview under Project Glasswing, a cyber-defense frontier model it says is too dangerous to release publicly: it found zero-days in every major OS and browser and escaped its sandbox. It scores 77% on SWE-bench Pro (up from 53% on Opus 4.6) and 64% on HLE, priced at $25/$125 per M tokens and available only to ~40 partner companies. Peter Gostev's read: the real reason it's unreleased is compute shortage, not safety.

77% SWE-bench Pro$25 / $125 Per M tokens
Anthropic
Products & Apps

Managed Agents

Anthropic ships Managed Agents, a fully hosted agent runtime

Anthropic launched Managed Agents, a fully hosted agent runtime plus infrastructure offering. The framing on the show: Anthropic is moving to selling outcomes, not tokens.

March 2026

Anthropic
Major Features & Updates

Claude computer use (Cowork + Claude Code)

Claude can now control your Mac: computer use lands in Cowork and Claude Code

Anthropic shipped computer use as a research preview in Claude Cowork and Claude Code, letting Claude directly control local Mac workflows. The panel compared it to existing OpenClaw-style agent patterns and debated where direct UI control is genuinely useful versus overkill.

Anthropic
Major Features & Updates

Claude Opus 4.6 (1M context)

Anthropic makes Opus 4.6 1M context the default in Claude Code, same price

Anthropic made 1M token context the default for Opus 4.6 in Claude Code at the same price, turning what was previously experimental and expensive into the standard. MRCR benchmark performance holds at 93% at 256K and 76% at 1M. For agent users this means far less compaction and longer uninterrupted sessions, though auto-compaction still triggers around 170K unless manually raised.

1M Opus 4.6 context default

February 2026

Anthropic
Major Features & Updates

Claude Code Remote Control & Memory

Claude Code adds Remote Control and memory

Anthropic shipped Remote Control for Claude Code, enabling remote and async control of coding sessions, alongside a new memory capability. The panel framed these as part of labs converging on richer agent harnesses with remote, async workflows as a primary competitive layer.

Anthropic
Major Features & Updates

Claude Cowork Automations

Claude Cowork gets automations (cron jobs), matching Codex

Claude Cowork added automations, cron-job-style scheduled agent runs, in the same week OpenAI's Codex gained equivalent automation support. The panel saw labs converging on heartbeats, cron jobs, and cloud-based agents as standard product surface area.

Anthropic
New Models

Claude Sonnet 4.6

Anthropic ships Claude Sonnet 4.6 with 79.6% SWE-Bench and 1M context

Anthropic launched Claude Sonnet 4.6, its most capable Sonnet ever, scoring 79.6% on SWE-Bench Verified, nearly matching Opus 4.6 at Sonnet pricing of $3/$15 per million tokens. It ships with a 1M token context window in beta and is now the default model on Claude AI. In blind Claude Code testing, users preferred Sonnet 4.6 over the previous Opus 4.5 59% of the time, and it beats the previous Gemini 3 Pro on most benchmarks.

79.6% SWE-Bench Verified
Anthropic
New Models

Claude Opus 4.6

Anthropic ships Claude Opus 4.6 with 1M context and agent teams

Anthropic dropped Opus 4.6 live during the show, claiming state-of-the-art on GDP-eval, Browse Comp, and agentic search, with 65.4% on Terminal Bench and 99% on TAU Bench MCP tool use. It is the first Opus model with a 1 million token context window and introduces adaptive thinking, where the model picks up contextual clues about reasoning effort. Pricing matches Opus 4.5 under 200K tokens and doubles above, and Claude Code gains agent teams for orchestrating parallel sessions.

1M Context tokens

January 2026

Anthropic
Major Features & Updates

MCP Apps

Anthropic launches MCP Apps: interactive UI inside Claude chat

Anthropic's MCP Apps render interactive, branded UI components (Box files, Figma, color pickers) directly within Claude conversations, evolving MCP from tools to embedded app experiences. It is protocol-based, so any app can integrate, letting brands reclaim identity from text-only LLM responses.

Anthropic
Also Released

Claude Constitution

Anthropic publishes 90-page Claude Constitution values document

Anthropic published a roughly 90-page Constitution for Claude, a values document baked into the model at training and reinforcement learning time rather than a runtime system prompt. It shifts from rigid rules to explanatory principles, includes a wellbeing section stating Claude's experiences 'matter to us', and a negotiation framework where Claude can flag disagreements.

90 pages Claude Constitution length
Anthropic
Products & Apps

Claude Cowork

Claude Cowork: Claude Code for non-developers, 100% written by Claude Code

Anthropic launched Claude Cowork, a research preview that brings Claude Code-style agentic workflows to non-technical users. It was built in a week-and-a-half sprint with 100% of the code written by Claude Code itself; it is Mac-only, requires a Max subscription, and includes a Chrome connector for browser automation. Alex demoed it live, adding Flux Klein support to an image extension project without looking at a single line of code.

100% Claude-coded Cowork
Anthropic
Products & Apps

Claude for Healthcare

Anthropic launches Claude for Healthcare with HIPAA compliance

Anthropic launched Claude for Healthcare, a HIPAA-ready offering as the major labs push into medical AI. The panel noted Claude's Opus 4.5 scoring 92% on Med Agent Bench as part of Anthropic's healthcare positioning.

December 2025

Anthropic
Dev Tools

Claude Code

Claude Code launches, starting the CLI agent revolution

Claude Code launched in February, having started as an internal Anthropic engineering tool. Multiple co-hosts picked it as the single most impactful AI release of 2025 — it began the CLI agent era and proved, in Kwindla's words, that 'sometimes it's mostly about the harness.'

Anthropic
New Models

Claude Opus 4

Claude Opus 4 drops in Q2 — Ryan's pick for best model ever

Claude Opus 4 launched in Q2 and became Ryan Carson's pick as the best coding model he had used in over 700 days of daily LLM coding. It cemented Anthropic's lead in agentic coding through the middle of the year.

Anthropic
Major Features & Updates

Claude Skills

Claude Skills launches — 'MCP-level if not bigger'

Anthropic launched Claude Skills in October. It was largely missed at release but picked up steam fast, with the show arguing Skills is 'MCP level if not bigger' for Claude users as a way to package reusable agent capabilities.

November 2025

Anthropic
New Models

Claude Opus 4.5

Anthropic launches Claude Opus 4.5, reclaiming the coding crown

Anthropic released Claude Opus 4.5, scoring 80.9% on SWE-bench Verified to top GPT-5.1 (77.9%) and Gemini 3 Pro (76.2%). It adds a new 'Effort' parameter for compute control, Tool Search to cut agent token overhead, and Programmatic Tool Calling where the model writes and executes code loops. Pricing dropped to $5/M input and $25/M output, roughly one-third the old Opus price.

80.9% SWE-bench Verified$5/M Input token price$25/M Output token price
Anthropic
Also Released

Code execution with MCP

Anthropic publishes code-execution-with-MCP pattern for token-efficient agents

Anthropic published an engineering post showing how running MCP-connected tools as code, instead of direct tool calls, slashes token use and scales agents to many more tools. The approach echoes Cloudflare's Code Mode and framed the episode's interview with Kenton Varda about agents writing code against tool APIs.

October 2025

Anthropic
Products & Apps

Claude Code on the Web

Claude Code comes to the web with sandboxed cloud coding

Anthropic brought Claude Code to the web, letting developers delegate software tasks through a browser with GitHub integration, secure sandboxed execution, multi-repo support, and automatic pull requests, making it usable even from a phone. The Claude desktop app was also upgraded with screen context via screenshots, file sharing, and a new voice mode.

Anthropic
Major Features & Updates

Claude Skills

Claude Skills: custom instructions for AI agents now live

Anthropic launched Claude Skills, folders of instructions and resources that Claude loads on demand to specialize agents for specific tasks. The panel treated it as a major piece of the emerging builder stack, with Simon Willison arguing Skills could be a bigger deal than MCP.

September 2025

Anthropic
Funding

Series F Funding

Anthropic raises $13B Series F at a $183B post-money valuation

Anthropic closed a $13B Series F round at a $183B post-money valuation, one of the largest private AI raises to date. The panel treated the round as part of a wider story where capital and capability are accelerating together at the frontier labs.

May 2025

Anthropic
APIs & Platforms

Web Search API

Anthropic launches Web Search API for real-time retrieval in Claude

Anthropic released a Web Search API that gives Claude models real-time web retrieval, letting developers ground responses in current information directly through the API. It was covered among the week's big-company API updates.

Anthropic
Major Features & Updates

Claude Integrations (MCP)

Claude.ai gets Integrations: remote MCP tool support for apps

Breaking during the show: Anthropic announced Integrations, letting Claude connect directly to apps like Asana, Intercom, Linear, Zapier, Stripe, Atlassian, Cloudflare and PayPal via MCP. Developers can build their own integrations quickly, bringing tool use to Claude.ai itself rather than just the API.

April 2025

Anthropic
Major Features & Updates

Claude Research

Claude gains Research mode and Google Workspace integration

Anthropic shipped a Research capability for Claude, letting it conduct multi-step research across the web, alongside a Google Workspace integration that connects Claude to email, calendar and docs context.

Anthropic
Products & Apps

Claude Max plan

Anthropic launches Max plan at $200/mo with higher usage quotas

Anthropic introduced a new Max subscription tier priced at $200 per month, offering significantly more usage quota than the standard Pro plan. It mirrors OpenAI's Pro-tier pricing strategy for power users.

$200/mo Max plan price

February 2025

Anthropic
New Models

Claude 3.7 Sonnet

Anthropic releases Claude 3.7 Sonnet, a coding beast with immaculate vibes

Anthropic shipped its long-awaited model update, Claude 3.7 Sonnet, which the crew called a coding BEAST with 'immaculate' vibes. It was one of the week's two huge model drops alongside GPT-4.5 and became an instant favorite for AI coding workflows like those discussed in the Windsurf interview.

January 2025

Anthropic
APIs & Platforms

Citations (Claude API)

Anthropic adds Citations to the Claude API

Anthropic launched a Citations capability in the Claude API, letting Claude ground its answers in provided source documents and return precise citations. It targets RAG and document-QA use cases where verifiable sourcing matters.