Papers & Research
Where the Goblins Came From (blog post)
OpenAI publishes postmortem on GPT-5.5's 'goblin mode'
OpenAI published a research blog explaining GPT-5.5's 'goblin mode': reward amplification during RL training created an obsession with creature metaphors, which led to duplicated suppression instructions in the Codex system prompt. The leaked GPT-5.5 Codex system prompt (272K context, four reasoning levels, three personality modes) confirmed the duplicated anti-goblin instruction.
New Models
OpenAI clinician model + workspace agents
OpenAI releases clinician/medical model and workspace agents
Amid its launch-heavy week, OpenAI also released a clinician/medical model alongside workspace agents. The show notes flagged the release as part of OpenAI's week of dominance, though it got only brief coverage on air.
Major Features & Updates
Codex Computer Use + Chronicle
Codex gets background computer use on macOS plus Chronicle screen memory
Codex shipped true background computer use on macOS: a second cursor running on its own thread that works while you work, with subagents controlling different windows in parallel, building on OpenAI's Software Apps Inc. (ex-Apple Shortcuts team) acquisition. Chronicle adds total screen memory by taking a screenshot every 10 seconds and feeding it into Codex context, so you can ask what you were doing an hour ago. Codex also passed 4 million users this week.
Dev ToolsOpen weights
Euphony
OpenAIDevs releases Euphony, an open-source Codex session log visualizer
The OpenAI developer relations team released Euphony, an open-source visualizer for Codex session logs. It lets developers inspect and replay what their Codex agent sessions actually did.
New Models
GPT-5.5
GPT-5.5 and GPT-5.5 Pro drop live, SOTA across the board
OpenAI shipped GPT-5.5 and GPT-5.5 Pro mid-show, taking state of the art on Terminal-Bench 2 (82.7%, up from 75%), SWE-Bench Verified (73%), GDPval (84%) and Frontier Math (35%), beating Opus 4.7 and Gemini 3.1. It uses ~40% fewer tokens than 5.4, netting roughly 20% cheaper to run despite API pricing doubling to $5/$30 per million ($30/$180 for Pro). Peter Gostev called it the first model that genuinely sustains multi-hour long-running tasks, with one task running 8.5 hours straight; rollout was Codex-first, not yet in ChatGPT.
82.7% Terminal-Bench 28.5 hrs Longest task
New ModelsOpen weights
Privacy Filter
OpenAI open-sources a 1.5B privacy/PII filter that runs in the browser
OpenAI open-sourced a tiny 1.5B MoE model with only 50M active parameters under Apache 2.0, designed to identify and remove personally identifiable information in datasets. It runs fully in the browser on WebGPU via Xenova's Transformers.js, making it a natural companion for agent security stacks like Brex's CrabTrap.
Major Features & Updates
Codex
OpenAI Codex adds macOS background computer use, 90+ plugins, and memory
OpenAI dropped a massive Codex update mid-show: native macOS computer use that runs in the background with its own separate cursor so you can keep working, 90+ plugins, gpt-image-1.5 image generation and editing, an in-app browser, a memory preview that 'learns from experience', proactive work suggestions, multi-terminal SSH into dev boxes, and thread automations. Alex's hot take: Codex, not ChatGPT, is becoming OpenAI's super-app.
Major Features & Updates
Codex plugins & Guardian Approvals
Codex hits 3M WAU with plugins, sub-agents and Guardian Approvals
OpenAI's Codex reached 3M weekly active users, up from 2M last month, as VB from the Codex team walked through what's behind it: plugins that bundle skills plus MCP servers (Stripe, Supabase, shadcn), sub-agents that decompose tasks into parallel Codex agents, and experimental hooks. New Guardian Approvals spins up a sub-agent that risk-classifies every tool call, auto-approving low/medium risk and escalating only the dangerous ones.
3M Codex weekly active users
New Models
GPT-Image-2
OpenAI's GPT-Image-2 leaks on LM Arena under three codenames
OpenAI's GPT-Image-2 posted the biggest single jump ever recorded on Arena, sitting 200+ ELO points above the previous top image model even on medium reasoning. The thinking/reasoning image model generates functioning QR codes, pixel-perfect infographics, 4K output, multi-image character consistency, and equirectangular 360-degree images that Peter Gostev stitched into a walkable street-view reconstruction of ancient Babylon. It even produces screenshots of IDEs containing SVG code that actually renders, enabling a new design-then-implement meta with Codex.
Funding
$122B funding round
OpenAI closes $122B funding round at $852B valuation
OpenAI closed a reported $122 billion funding round, described as the largest in history, at an $852B valuation with an IPO said to be incoming. The panel discussed what that scale of capital implies for AI infrastructure spending, product velocity, and competitive pressure across the market.
$122B OpenAI funding round
Acquisitions
TBPN
OpenAI acquires live tech media show TBPN
OpenAI acquired TBPN, the live tech media show, for a rumored price in the low hundreds of millions. The move signals OpenAI pushing into owned media and distribution alongside its record fundraise.