Episode Summary
TL;DR Hosts and Guests Alex Volkov - AI Evangelist & Weights & Biases ( @altryne ) Co-Hosts: @WolframRvnwlf , @yampeleg , @nisten , @ldjconfirmed Guest: Max Spero ( @max_spero_ ) - Co-founder, Pangram Labs Healthcare AI Mayo Clinicβs REDMOD detects pancreatic cancer up to 3 years before clinical diagnosis with 73% sensitivity vs 39% for radiologists ( Announcement ) Open Source LLMs DeepSeek V4 paper drops with CSA+HCA attention, 1M context at 5.7GB KV cache, possibly first frontier model trained across multiple datacenters ( Arxiv ) SenseTime open-sources SenseNova U1 - unified multimodal 8B/3B-active MoE with no encoder/VAE ( HF , GitHub ) IBM releases Granite 4.1 family (3B/8B/30B) - non-thinking dense models with 20x token efficiency over Qwen3.5 9B, Apache 2.0 ( Blog , HF ) Mistral launches Medium 3.5 - 128B dense flagship with 256K context, configurable reasoning, plus Vibe coding
In This Episode
- β‘ Cold Open & Welcome
- 𧬠Mayo Clinic: AI Detects Pancreatic Cancer
- β‘ Top News Round Table
- π° TLDR: This Week in AI
- π Open Source: SenseNova U1
- π Open Source: DeepSeek V4
- π Open Source: IBM Granite 4.1
- π’ OpenAI Goblin Mode
- β‘ Guest: Max Spero (Pangram) - AI Detection
- π€ Stripe Link: Wallets for AI Agents
- π€ This Week's Buzz: Wolf Bench & Cursor Agent
- π 11 Labs Music & Outro
Hosts & Guests
By The Numbers
β‘ Cold Open & Welcome
Welcome to ThursdAI, your weekly live AI news show. This has been a long week, but this is why we're here.
- As a reminder, this is why we're here.
- You remember when we used to cover everything that happens in the ai, including some time for papers?
- This, Alex Volkov: no, it's impossible.
𧬠Mayo Clinic: AI Detects Pancreatic Cancer
Alex Volkov: There's announcement from Mayo Clinic that AI with Mayo Clinic. this is published in an actual medical journal.
- Mayo Clinic AI detects pancreatic cancer up to three years before diagnosis.
- Red mod achieves 73% sensitivity versus 39 for radiologists on the same scans, 73 versus 39 for human radiologist.
- And specifically this is, as far as I'm concerned, this is the one use of AI that we need to accelerate even more.
β‘ Top News Round Table
Alex Volkov: very quick round of, what's the top news in AI from, since last we chatted. And then we're gonna dive into the TLDR.
- We also have a guest on the show today.
- If you ever looked at a piece of text like wondered, I wonder if this is AI written or not.
- sometimes you can know if people use like the lower models.
π° TLDR: This Week in AI
This is the TLDR of Thursday Eye for, April 30th. This is everything that we've gathered from this week's news.
- there's a lot, so I cannot claim anymore that we're covering everything, but this is the top news that we're going to cover.
- As we already mentioned, Mayo Clinic Redmont.
- This is an AI that detects pancreatic cancer on routine CT scans up to three years before radiologist.
π Open Source: SenseNova U1
Alex Volkov: All right, let's start with open source. All right, let's talk about open source.
- Listen, I think you started with Sense Nova.
- This one is open source as well, right?
- So they had a paper at ICLR last year, but, they didn't show a whole lot at, at that paper.
π Open Source: DeepSeek V4
Let's talk about DeepSeek, the Chinese whale company that we've known and loved and told you about that broke the internet and the stock market, a year and something ago, came back with Dipe for, let me go find the Dipe announcement. this is the Dipsy paper itself, Yam Peleg: and it's free.
- So everyone, should act, absolutely use these tokens that they give you with 80% discount all over being already like 30 times cheaper.
- So yeah, everyone go, go burn some Chinese GPUs, please.
- while you still can, because it's not gonna last for long because that thing is expensive to run.
π Open Source: IBM Granite 4.1
Alex Volkov: let's talk about the next piece of news, quickly here, IBM releases granite 4.1. It's a dense non-thinking model with best in class tool calling,is what they call this.
- we have this, up on Weights, & Biases, because of partnership with IBM, I'm assuming, at 5 cents per input token to 10 cents per million output tokens.
- granted, I, it is very interesting to release a non-thinking model.
- but they're bidding their own previous M Moes with just 8 billion parameters, and getting 73 on BFCL tool.
π’ OpenAI Goblin Mode
Alex Volkov: let's talk about gobbling mode. I know you wanted to talk about gobbling mode, but especially 'cause they released the blog post.
- I dunno if you guys seen the blog post.
- but how about you present this piece of news and then we'll talk about Golin mode?
- 'cause this is still in big companies and APIs.
β‘ Guest: Max Spero (Pangram) - AI Detection
Alex Volkov: I'm g doing good, man. I've been wanting to talk to you for a while, but I think before I introduce you, I'll do this thing.
- I'll click right and I'll do check for AI content.
- Oh, you guys don't see my menu, but hopefully you'll see this thing that shows up.
- and then we're gonna run this whole block from OpenAI through Pangram Labs to see that, according to Pangram, this is a hundred percent human written and this is great.
π€ Stripe Link: Wallets for AI Agents
Alex Volkov: I wanna talk about Stripe. I think this is the most important thing.
- This was the number one highlight for me.
- Stripe had their, annual Stripe sessions, conference where they released a bunch of stuff.
- It is very obvious how fast Stripe is moving in Agent engineering as well.
π€ This Week's Buzz: Wolf Bench & Cursor Agent
Alex Volkov: this week's vow super quick. we have a few things to announce Wolf and I really would love to talk about the virality of Wolf Bench and the findings as well.
- And then I think we're good to conclude the show.
- Like we've been at this for two hours already from CoreWeave.
- I have to rerecord this because Visor is from CoreWeave.
π 11 Labs Music & Outro
Alex Volkov: this 11 labs slow, dreamy indie rock with reverse vocals. it's very, it's worth saying that 11 labs, oh wait, it responded.
- I created a link spend request for $10, to cover the six monthly.
- If I press this, and I can press this quicker than you guys can see, so nobody can snipe this link.
- I can approve, I can show you how this looks.
Hosts and Guests
Alex Volkov - AI Evangelist & Weights & Biases (@altryne)
Co-Hosts: @WolframRvnwlf, @yampeleg, @nisten, @ldjconfirmed
Guest: Max Spero (@max_spero_) - Co-founder, Pangram Labs
Healthcare AI
Mayo Clinicβs REDMOD detects pancreatic cancer up to 3 years before clinical diagnosis with 73% sensitivity vs 39% for radiologists (Announcement)
Open Source LLMs
DeepSeek V4 paper drops with CSA+HCA attention, 1M context at 5.7GB KV cache, possibly first frontier model trained across multiple datacenters (Arxiv)
SenseTime open-sources SenseNova U1 - unified multimodal 8B/3B-active MoE with no encoder/VAE (HF, GitHub)
IBM releases Granite 4.1 family (3B/8B/30B) - non-thinking dense models with 20x token efficiency over Qwen3.5 9B, Apache 2.0 (Blog, HF)
Mistral launches Medium 3.5 - 128B dense flagship with 256K context, configurable reasoning, plus Vibe coding agent (HF, Blog)
Baidu ERNIE 5.1 Preview hits #13 on Arena (#1 Chinese lab) using just 6% of comparable pretraining compute (ernie.baidu.com)
Big CO LLMs + APIs
OpenAI publishes blog explaining GPT-5.5βs βgoblin modeβ - reward amplification during RL training created an obsession with creature metaphors, leading to duplicated suppression instructions in the Codex system prompt
OpenAI ends Microsoft Azure exclusivity, AWS announces GPT-5.5 and Codex on Bedrock; AGI clause removed from contract (Sam tweet)
Gemini can now generate and export Docs, Sheets, Slides, PDFs, .docx, .xlsx, LaTeX directly from chat - free for all users globally (Blog)
NVIDIA releases Nemotron 3 Nano Omni - 30B/3B-active hybrid Transformer-Mamba MoE with 256K context, 9x throughput on consumer hardware (Blog)
Agentic Commerce & Tools
Stripe launches Link wallet for agents at Sessions 2026 - AI agents get scoped payment credentials with mandatory human approval, real card never exposed (Blog)
Stripe removes waitlist on Projects.dev - 32 infrastructure providers (Cloudflare, WorkOS, ElevenLabs, Twilio, Daytona, Browserbase, AgentMail, etc.) provisionable via CLI for AI agents
Cursor launches SDK exposing the same runtime, harness, and models that power Cursor IDE - now embeddable in any product (Docs)
Cognition launches Devin for Terminal - local CLI coding agent with
/handoffcommand for seamless cloud transfer (cli.devin.ai)
Evals & Benchmarks
WolfBench tests 23 models across 300+ runs on Terminal-Bench 2.0 - Cursor Agent + GPT-5.5 is the #1 combination (wolfbench.ai)
Microsoftβs DELEGATE-52 benchmark shows GPT-5.4 loses 28% of document content after 20 iterative edits, frontier models corrupt stealthily while preserving structure
This Weekβs Buzz - Weights & Biases
IBM Granite 4.1 live on W&B Inference at $0.05/$0.10 per million input/output tokens with 128K context
WolfBench results going viral with Cursor + GPT-5.5 dominance, Codex and Devin testing in the pipeline
AI Detection & Cognitive Security
Pangram Labs launches Chrome extension auto-flagging AI content in real time on X, LinkedIn, Reddit, Substack, Medium with 99.98% accuracy and 1-in-10,000 false positive rate (pangramlabs.com)
Taylor Lorenz uses Pangram API to analyze top 25 Substack bestsellers, finding many popular newsletters are near-fully AI-generated
AI Art, Video & Audio
ElevenLabs launches ElevenMusic - full music platform with discovery, remixing, royalties; 4,000+ indie artists at launch (elevenmusic.io)
HeyGen HyperFrames integrates natively with Claude Design - HTML-to-MP4 motion graphics via single CLI command (hyperframes.dev)
xAI drops Grok Imagine update with dramatically improved lip sync, sound, and 30-second video extensions
OpenAI engineer confirms team is actively fixing GPT-Image-2βs noise artifact issue
Other
Talkie - 13B open-weight LLM trained exclusively on pre-1930 text, by Alec Radford and David Duvenaud (talkie-lm.com)
GPT-5.5 Codex full system prompt leaked from OpenAIβs open-source repo, revealing 272K context window, four reasoning levels, three personality modes, and the duplicated anti-goblin instruction