Episode Summary
This episode feels like a snapshot of AI's coding-and-agents moment: GPT-5-Codex arrives, OpenAI and Google collect ICPC bragging rights, and Jeremy Berman joins to explain a new ARC-AGI state of the art. The panel also spends real time on Reve and Meta's new glasses, making the show as much about interfaces as about benchmarks.
In This Episode
Open Models and the Week's First Wave
The front of the episode moves quickly through the open-model board before settling into the bigger themes that matter later. Even in recap mode, the panel keeps emphasizing usability, workflow fit, and how quickly the competitive baseline is moving.
- The episode opens benchmark-heavy but product-aware
- Open-model coverage sets up the rest of the show's competitive framing
GPT-5-Codex and the Agent Platform Story
GPT-5-Codex gives the show its clearest coding headline. The conversation is not just about raw capability, but about what happens when coding models become easier to trust for longer, more productized workflows.
- GPT-5-Codex anchors the coding section of the show
- The panel keeps tying coding quality back to usable agent systems
Meta Glasses and the Multimodal Interface Shift
Meta's new glasses push the episode toward hardware and interface design. The segment works because it links display-equipped wearables back to the same broader question the show keeps asking: where do AI systems actually live in the user's daily workflow?
- Meta's glasses are treated as an interface milestone
- Hardware matters here because the product surface is changing
🧪 Jeremy Berman on Beating ARC-AGI
Jeremy Berman gives the benchmark discussion real depth by walking through the reasoning behind the latest ARC-AGI result. The segment stays grounded in method, limitations, and why test-time compute and iteration still matter more than easy leaderboard narratives.
- Jeremy explains the work behind the score instead of just celebrating it
- ARC-AGI is discussed as a research lens, not a single-number trophy
⚡ ICPC Bragging Rights and the Next Video Wave
The final stretch connects competition wins, usage stats, and video releases into one broader story about momentum. By the end, the episode feels like a clear snapshot of a field where coding, multimodal interfaces, and research prestige are all colliding at once.
- ICPC results add prestige to an already strong OpenAI/Google week
- The episode closes by linking coding wins to the next multimodal wave
Hosts and Guests
Alex Volkov - AI Evangelist at Weights & Biases (@altryne)
Co-hosts - @WolframRvnwlf @ldjconfirmed @nisten
Guest: Jeremy Berman (@jerber888) - SOTA on ARC-AGI
Open Source
Big CO LLMs + APIs
GPT-5-Codex release: Agentic coding upgrade for Codex (X, OpenAI Blog)
Meta Connect - New AI glasses with display, new AI mode (X Recap)
NBER & OpenAI - How People Use ChatGPT: Growth, Demographics, and Scale (X, Blog, NBER Paper)
ARC-AGI: New SOTA by Jeremy Berman and Eric Pang using Grok-4 (X, Blog)
OpenAI's reasoning system aces 2025 ICPC World Finals with a perfect 12/12 (X)
OpenAI adds thinking budgets to ChatGPT app (X)
Gemini in Chrome: AI assistant across tabs + smarter omnibox + safer browsing (X, Blog)
Anthropic admits Claude bugs - Detailed analysis
This Week's Buzz
W&B Models + Weave! You can now log your RL runs in W&B Weave (X, W&B Link)
W&B Fully Connected London - tickets are running out! Use FCLNTHURSAI for a free ticket on me! (Register Here)
Vision & Video
Voice & Audio
AI Art & Diffusion & 3D
Tools
Chrome adds Gemini (Blog)