Episode Summary

A quiet-looking week turns into a surprisingly dense ThursdAI once Kimi K2 Thinking lands, Apple-Gemini-for-Siri rumors heat up, and Amazon pushes back on Perplexity. Kenton Varda joins to talk Code Mode, Workers, and why real deployment primitives matter, while Alex closes with a practical N8N automation walkthrough for agent builders.

Hosts & Guests

Alex Volkov
Host · W&B / CoreWeave
@altryne
Kenton Varda
Principal Engineer / Architect · Cloudflare Workers
@KentonVarda
Wolfram Ravenwolf
Weekly co-host, AI model evaluator
@WolframRvnwlf
Yam Peleg
AI builder & founder
@Yampeleg
Nisten Tahiraj
AI operator & builder
@nisten

📰 Nov 6 Highlights

Alex, Yam, Wolfram, and Nisten use the opening stretch to sort the week into the themes that will matter most: open models, platform moves, coding tools, voice, and automation. The episode feels calm at first, but the news density rises fast once the panel gets into the details.

  • A slow week quickly stops looking slow
  • The panel frames the episode around practical signal, not just launch noise

🔓 Kimi K2 Thinking Changes the Open-Source Mood

The Kimi K2 Thinking release becomes the clear open-source centerpiece of the show. The panel treats it as a real momentum moment because the conversation is not just about benchmark screenshots, but about reasoning, coding, and whether open models can keep closing the usability gap.

  • Kimi K2 Thinking becomes the standout model story of the week
  • The discussion keeps returning to reasoning quality and coding utility

๐Ÿข Apple, Gemini, Amazon, and Perplexity

The big-company section is driven by interface control: who owns the assistant layer, who gets distribution, and how much leverage platform companies still have. Apple's Gemini rumor and Amazon's pressure on Perplexity both fit that same broader fight.

  • Apple-Gemini rumors raise the stakes around Siri
  • Amazon vs. Perplexity becomes a platform and distribution story

🛠️ Kenton on Code Mode and Cloudflare Workers

Kenton Varda joins for the most builder-focused segment of the episode. The discussion ties coding assistants back to the infrastructure they need underneath: execution environments, deployment surfaces, and the difference between a flashy demo and something you can actually trust in production.

  • Kenton connects code generation to real runtime constraints
  • Workers-style primitives stay relevant because agent tools need dependable execution

🤖 Voice AI Demos and Alex's N8N Workflow

Maya One and Inworld TTS show how quickly voice quality is moving, but Alex does not leave the conversation at demos. He pivots into an N8N walkthrough that grounds the episode in practical automation and shows listeners how these tools can become real workflows, not just novelty clips.

  • Voice quality keeps improving across new demos
  • The N8N walkthrough turns the finale into a usable automation segment

TL;DR and Show Notes + Links

  • Hosts and Guests

  • Open Source LLMs

    • Smol Training Playbook - a 200+ page, end-to-end guide to reliably pretrain and operate LLMs (X, Announcement)

    • Ai2 launches OlmoEarth - foundation models + open, end-to-end platform for fast, high-resolution Earth intelligence (X, Blog)

    • Moonshot AI releases Kimi K2 Thinking - an open-source 1T-parameter MoE agent with 256K context and huge tool-calling capacity (X, HF, Blog, Arxiv)

    • LongCat Flash Omni - 560B (27B active) omni model (text, audio, video input)

  • Big CO LLMs + APIs

    • Apple will pay roughly $1B/year to license a custom 1.2 trillion-parameter Google Gemini model to power a revamped Siri (X, Announcement)

    • Perplexity says Amazon issued a legal threat to block Comet AI assistants from shopping on Amazon (X, Blog)

    • AWS announces multi-year strategic infrastructure partnership with OpenAI to power ChatGPT inference, training, and agentic AI (X)

  • Robotics

    • Xpeng unveils ‘Iron’ humanoid claiming ‘most human-like’ design with soft skin, bionic muscles, VLT brain and a 2026 production plan (X)

  • Coding with AI

    • Anthropic shows how running MCP-connected tools as code slashes token use and scales agents (X, Blog)

    • Windsurf Codemaps - AI-annotated, navigable maps of your codebase powered by SWE-1.5 (Fast) and Sonnet 4.5 (Smart) (X, Announcement)

    • Conversation with Kenton Varda (@KentonVarda) from Cloudflare about MCP and Code Mode

    • Cursor added an in-IDE browser - very performant!

  • Audio & Video

    • Maya-1 - Open source voice generation model.

    • Inworld TTS - new #1 on the Artificial Analysis benchmark.

  • Tools & Gadgets

    • Sandbar launches Stream, a voice-first personal assistant, and Stream Ring, a wearable ‘mouse for voice’, available for preorder (X, Blog)