Episode Summary
Thanksgiving comes every Thursday, and ThursdAI's third annual Thanksgiving special delivered a feast of AI releases to be genuinely thankful for. Anthropic finally brought back Opus 4.5, and it's reclaiming the coding crown with 80.9% SWE-bench Verified at a third of the old price. Open source had its own feast: Prime Intellect's INTELLECT-3 (106B MoE), DeepSeek Math V2, Microsoft's Fara-7B, and BFL's FLUX.2 all dropped in one week. Plus, Ido Salomon and Liad Yosef returned to discuss MCP-UI becoming the official 'MCP Apps' standard adopted by both Anthropic and OpenAI, the foundation of what Alex calls 'the agentic web.'
In This Episode
Hosts & Guests
By The Numbers
Breaking During The Show
Open Source LLMs
A Thanksgiving feast of open-source drops: Prime Intellect's INTELLECT-3 (106B MoE) shows a small lab can train frontier-scale models, DeepSeek surfaces a 685B math model with IMO gold performance, and Microsoft's Fara-7B brings on-device computer use to 7B parameters. Z-Image Turbo from Tongyi makes image generation sub-second, and FLUX.2 from BFL enables multi-reference image editing at 32B scale.
- INTELLECT-3: 106B MoE, 90% AIME 2024/2025, fully open-sourced training stack
- DeepSeek Math V2: 685B, Apache-2.0, IMO gold-level; first open-weights math champion
- Fara-7B: Microsoft's 7B on-device computer use agent, 73.5% WebVoyager
- Z-Image Turbo: sub-second image generation from Tongyi/Alibaba
- FLUX.2: 32B multi-reference image editing from Black Forest Labs
This Week's Buzz - W&B Serverless LoRA
Alex previews the brand-new Serverless LoRA Inference launch from Weights & Biases on CoreWeave: upload a LoRA adapter to W&B Artifacts, serve it instantly on top of any base model with no cold starts and no dedicated GPU. Alex demos a 'Mocking SpongeBob' LoRA he trained in 25 minutes.
- W&B + CoreWeave: upload LoRA adapters, serve instantly via API
- No cold starts, no dedicated GPU instances needed
- Demo: SaRcAsTiC SpongeBob LoRA on Qwen 2.5 base
Interview: MCP Apps & the Agentic Web
Ido Salomon (and Liad Yosef off-camera) return to the show to discuss MCP-UI's transformation into 'MCP Apps', now an official standard jointly adopted by Anthropic and OpenAI. The pair explain how agents can now render full interactive HTML UIs directly inside chat, ending the era of tool outputs being just plain text.
- MCP-UI → MCP Apps: jointly standardized by Anthropic and OpenAI
- Agents can now render full interactive HTML UIs in-chat
- Avoids 'iOS vs Android' fragmentation: one open standard
- mcpui.dev already has demos running with Qwen and Claude
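To make the "plain text vs. interactive HTML" shift concrete, here is a rough sketch of the two kinds of tool result. It is not the official MCP Apps schema: the field names follow the embedded-resource shape as commonly shown for MCP-UI, and the `ui://flights/results` URI and both helper functions are hypothetical names made up for illustration.

```python
import json


def text_result(message: str) -> dict:
    """Classic tool result: a single plain-text content item."""
    return {"content": [{"type": "text", "text": message}]}


def html_ui_result(uri: str, html: str) -> dict:
    """Tool result that embeds an HTML resource for the chat host to render.

    Assumption: the host treats a resource with mimeType "text/html"
    as renderable UI. Check the MCP Apps spec for the real contract.
    """
    return {
        "content": [
            {
                "type": "resource",
                "resource": {"uri": uri, "mimeType": "text/html", "text": html},
            }
        ]
    }


if __name__ == "__main__":
    plain = text_result("3 flights found")
    rich = html_ui_result("ui://flights/results", "<button>Book flight</button>")
    print(json.dumps(plain, indent=2))
    print(json.dumps(rich, indent=2))
```

The point of the contrast: the first result can only be read by the model or shown as text, while the second hands the host something a user can click on.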
Big CO LLMs - Claude Opus 4.5
Anthropic's Opus 4.5 is finally here and it's reclaiming the coding throne: 80.9% SWE-bench Verified, a new 'Effort' parameter for compute control, Tool Search to cut agent overhead, and Programmatic Tool Calling for code-loop data management, all at one-third the old Opus price. Yam and Wolfram both stress-tested it; Yam was blown away by the depth of detail it holds for complex stacks.
- Opus 4.5: 80.9% SWE-bench Verified, tops GPT-5.1 (77.9%) and Gemini 3 Pro (76.2%)
- New 'Effort' parameter: control thinking depth like o1 reasoning tokens
- Tool Search: massively cuts token overhead for agents with many tools
- Programmatic Tool Calling: Opus writes and executes code loops
- $5/M input, $25/M output: 3x cheaper than old Opus
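For a feel of what the pricing bullet means per request, here is a small back-of-the-envelope sketch. The per-million prices come from the notes above; the old-Opus figures are simply derived as 3x the new ones (an assumption based on the "3x cheaper" claim), and the token counts in the example are made up.

```python
# Prices in USD per million tokens, as quoted in the notes.
OPUS_45_INPUT_PER_M = 5.0
OPUS_45_OUTPUT_PER_M = 25.0

# Assumption: "3x cheaper" means the old price was exactly 3x the new one.
OLD_PRICE_MULTIPLIER = 3.0


def request_cost(
    input_tokens: int,
    output_tokens: int,
    input_per_m: float = OPUS_45_INPUT_PER_M,
    output_per_m: float = OPUS_45_OUTPUT_PER_M,
) -> float:
    """Return the USD cost of one request at the given per-million prices."""
    return (input_tokens * input_per_m + output_tokens * output_per_m) / 1_000_000


if __name__ == "__main__":
    # Hypothetical agent turn: 200k tokens in, 50k tokens out.
    new_cost = request_cost(200_000, 50_000)
    old_cost = request_cost(
        200_000,
        50_000,
        input_per_m=OPUS_45_INPUT_PER_M * OLD_PRICE_MULTIPLIER,
        output_per_m=OPUS_45_OUTPUT_PER_M * OLD_PRICE_MULTIPLIER,
    )
    print(f"Opus 4.5: ${new_cost:.2f} vs old Opus: ${old_cost:.2f}")
```

Because both input and output prices scale by the same factor, the saving is 3x regardless of the input/output mix.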
Vision & Video - HunyuanOCR + LTX Retake
Tencent's HunyuanOCR (1B) scores 860 on OCRBench, beating 72B models, a stunning example of task-specialized small models. HunyuanVideo 1.5 brings lightweight open video generation. LTX Studio's Retake enables Photoshop-style editing of specific objects within video frames, and a mysterious 'Whisper Thunder' tops the video arena leaderboard.
- HunyuanOCR 1B: 860 OCRBench, beats Qwen3-VL-72B
- HunyuanVideo 1.5: lightweight open-source video generation
- LTX Retake: video inpainting/object editing, Photoshop for video
- Whisper Thunder: mystery model at #1 on video arena
Hosts and Guests
Alex Volkov - AI Evangelist & Weights & Biases (@altryne)
Co-Hosts - @WolframRvnwlf @yampeleg @nisten @ldjconfirmed
Guests: @idosal1 @liadyosef - MCP-UI/MCP Apps
Big CO LLMs + APIs
Anthropic launches Claude Opus 4.5 - world's top model for coding, agents, and tool use (X, Announcement, Blog)
OpenAI Integrates ChatGPT Voice Mode Directly into Chats (X)
Open Source LLMs
Interview: MCP Apps
MCP-UI standardized as MCP Apps by Anthropic and OpenAI (X, Blog, Announcement)
Vision & Video
AI Art & Diffusion
This Week's Buzz