Episode Summary

TL;DR Hosts and Guests Alex Volkov - AI Evangelist & Weights & Biases ( @altryne ) Co-Hosts: @WolframRvnwlf , @yampeleg , @nisten , @ldjconfirmed Guest: Max Spero ( @max_spero_ ) - Co-founder, Pangram Labs Healthcare AI Mayo Clinic’s REDMOD detects pancreatic cancer up to 3 years before clinical diagnosis with 73% sensitivity vs 39% for radiologists ( Announcement ) Open Source LLMs DeepSeek V4 paper drops with CSA+HCA attention, 1M context at 5.7GB KV cache, possibly first frontier model trained across multiple datacenters ( Arxiv ) SenseTime open-sources SenseNova U1 - unified multimodal 8B/3B-active MoE with no encoder/VAE ( HF , GitHub ) IBM releases Granite 4.1 family (3B/8B/30B) - non-thinking dense models with 20x token efficiency over Qwen3.5 9B, Apache 2.0 ( Blog , HF ) Mistral launches Medium 3.5 - 128B dense flagship with 256K context, configurable reasoning, plus Vibe coding

Hosts & Guests

Alex Volkov
Alex Volkov
Host - W&B / CoreWeave
@altryne
Max Spero
Max Spero
Co-founder - Pangram Labs
@max_spero_
Wolfram Ravenwolf
Wolfram Ravenwolf
Weekly co-host, AI model evaluator - Independent AI evaluator (r/LocalLLaMA)
@WolframRvnwlf
Yam Peleg
Yam Peleg
Weekly co-host of ThursdAI - AI builder & founder
@Yampeleg
Nisten Tahiraj
Nisten Tahiraj
Weekly co-host of ThursdAI - AI operator & builder
@nisten
LDJ
LDJ
Weekly co-host of ThursdAI - Nous Research
@ldjconfirmed

By The Numbers

before clinical diagnosis with 73
3 years
founder, Pangram Labs Healthcare AI Mayo Clinic’s REDMOD detects pancreatic cancer up to 3 years before clinical diagnosis with 73% sensitivity vs 39% for radiologists ( Announceme
vs 39
73%
yo Clinic’s REDMOD detects pancreatic cancer up to 3 years before clinical diagnosis with 73% sensitivity vs 39% for radiologists ( Announcement ) Open Source LLMs DeepSeek V4 pape
radiologists
39%
detects pancreatic cancer up to 3 years before clinical diagnosis with 73% sensitivity vs 39% for radiologists ( Announcement ) Open Source LLMs DeepSeek V4 paper drops with CSA+HC
at 5
1M
logists ( Announcement ) Open Source LLMs DeepSeek V4 paper drops with CSA+HCA attention, 1M context at 5.7GB KV cache, possibly first frontier model trained across multiple datace
KV cache
5.7GB
uncement ) Open Source LLMs DeepSeek V4 paper drops with CSA+HCA attention, 1M context at 5.7GB KV cache, possibly first frontier model trained across multiple datacenters ( Arxiv
Key metric
8B
s multiple datacenters ( Arxiv ) SenseTime open-sources SenseNova U1 - unified multimodal 8B/3B-active MoE with no encoder/VAE ( HF , GitHub ) IBM releases Granite 4.1 family (3B/8

⚑ Cold Open & Welcome

Welcome to ThursdAI, your weekly live AI news show. This has been a long week, but this is why we're here.

  • As a reminder, this is why we're here.
  • You remember when we used to cover everything that happens in the ai, including some time for papers?
  • This, Alex Volkov: no, it's impossible.
Alex Volkov
Alex Volkov
"Good morning. Good morning everyone. This is Alex ov. Welcome to ThursdAI, April 30th. tomorrow is already may. Can you believe it?"

🧬 Mayo Clinic: AI Detects Pancreatic Cancer

Alex Volkov: There's announcement from Mayo Clinic that AI with Mayo Clinic. this is published in an actual medical journal.

  • Mayo Clinic AI detects pancreatic cancer up to three years before diagnosis.
  • Red mod achieves 73% sensitivity versus 39 for radiologists on the same scans, 73 versus 39 for human radiologist.
  • And specifically this is, as far as I'm concerned, this is the one use of AI that we need to accelerate even more.
Alex Volkov
Alex Volkov
"There's announcement from Mayo Clinic that AI with Mayo Clinic."

⚑ Top News Round Table

Alex Volkov: very quick round of, what's the top news in AI from, since last we chatted. And then we're gonna dive into the TLDR.

  • We also have a guest on the show today.
  • If you ever looked at a piece of text like wondered, I wonder if this is AI written or not.
  • sometimes you can know if people use like the lower models.
Alex Volkov
Alex Volkov
"very quick round of, what's the top news in AI from, since last we chatted. And then we're gonna dive into the TLDR. There's a lot to talk about. We also have a guest on the show today. If you ever looked at a piece of t"

πŸ“° TLDR: This Week in AI

This is the TLDR of Thursday Eye for, April 30th. This is everything that we've gathered from this week's news.

  • there's a lot, so I cannot claim anymore that we're covering everything, but this is the top news that we're going to cover.
  • As we already mentioned, Mayo Clinic Redmont.
  • This is an AI that detects pancreatic cancer on routine CT scans up to three years before radiologist.

πŸ”“ Open Source: SenseNova U1

Alex Volkov: All right, let's start with open source. All right, let's talk about open source.

  • Listen, I think you started with Sense Nova.
  • This one is open source as well, right?
  • So they had a paper at ICLR last year, but, they didn't show a whole lot at, at that paper.

πŸ”“ Open Source: DeepSeek V4

Let's talk about DeepSeek, the Chinese whale company that we've known and loved and told you about that broke the internet and the stock market, a year and something ago, came back with Dipe for, let me go find the Dipe announcement. this is the Dipsy paper itself, Yam Peleg: and it's free.

  • So everyone, should act, absolutely use these tokens that they give you with 80% discount all over being already like 30 times cheaper.
  • So yeah, everyone go, go burn some Chinese GPUs, please.
  • while you still can, because it's not gonna last for long because that thing is expensive to run.

πŸ”“ Open Source: IBM Granite 4.1

Alex Volkov: let's talk about the next piece of news, quickly here, IBM releases granite 4.1. It's a dense non-thinking model with best in class tool calling,is what they call this.

  • we have this, up on Weights, & Biases, because of partnership with IBM, I'm assuming, at 5 cents per input token to 10 cents per million output tokens.
  • granted, I, it is very interesting to release a non-thinking model.
  • but they're bidding their own previous M Moes with just 8 billion parameters, and getting 73 on BFCL tool.
Alex Volkov
Alex Volkov
"let's talk about the next piece of news, quickly here, IBM releases granite 4.1. It's a dense non-thinking model with best in class tool calling,is what they call this. we have this, up on Weights, & Biases, because of p"

🏒 OpenAI Goblin Mode

Alex Volkov: let's talk about gobbling mode. I know you wanted to talk about gobbling mode, but especially 'cause they released the blog post.

  • I dunno if you guys seen the blog post.
  • but how about you present this piece of news and then we'll talk about Golin mode?
  • 'cause this is still in big companies and APIs.
Alex Volkov
Alex Volkov
"let's talk about gobbling mode. Yeah. I know you wanted to talk about gobbling mode, but especially 'cause they released the blog post. I dunno if you guys seen the blog post. but how about you present this piece of news"

⚑ Guest: Max Spero (Pangram) - AI Detection

Alex Volkov: I'm g doing good, man. I've been wanting to talk to you for a while, but I think before I introduce you, I'll do this thing.

  • I'll click right and I'll do check for AI content.
  • Oh, you guys don't see my menu, but hopefully you'll see this thing that shows up.
  • and then we're gonna run this whole block from OpenAI through Pangram Labs to see that, according to Pangram, this is a hundred percent human written and this is great.
Alex Volkov
Alex Volkov
"Max Spiro. Welcome. Max is, from Pengu Labs."

πŸ€– This Week's Buzz: Wolf Bench & Cursor Agent

Alex Volkov: this week's vow super quick. we have a few things to announce Wolf and I really would love to talk about the virality of Wolf Bench and the findings as well.

  • And then I think we're good to conclude the show.
  • Like we've been at this for two hours already from CoreWeave.
  • I have to rerecord this because Visor is from CoreWeave.
Alex Volkov
Alex Volkov
"this week's vow super quick. we have a few things to announce Wolf and I really would love to talk about the virality of Wolf Bench and the findings as well."

πŸ”Š 11 Labs Music & Outro

Alex Volkov: this 11 labs slow, dreamy indie rock with reverse vocals. it's very, it's worth saying that 11 labs, oh wait, it responded.

  • I created a link spend request for $10, to cover the six monthly.
  • If I press this, and I can press this quicker than you guys can see, so nobody can snipe this link.
  • I can approve, I can show you how this looks.
Alex Volkov
Alex Volkov
"this 11 labs slow, dreamy indie rock with reverse vocals."
TL;DR

Hosts and Guests

Healthcare AI

  • Mayo Clinic’s REDMOD detects pancreatic cancer up to 3 years before clinical diagnosis with 73% sensitivity vs 39% for radiologists (Announcement)

Open Source LLMs

  • DeepSeek V4 paper drops with CSA+HCA attention, 1M context at 5.7GB KV cache, possibly first frontier model trained across multiple datacenters (Arxiv)

  • SenseTime open-sources SenseNova U1 - unified multimodal 8B/3B-active MoE with no encoder/VAE (HF, GitHub)

  • IBM releases Granite 4.1 family (3B/8B/30B) - non-thinking dense models with 20x token efficiency over Qwen3.5 9B, Apache 2.0 (Blog, HF)

  • Mistral launches Medium 3.5 - 128B dense flagship with 256K context, configurable reasoning, plus Vibe coding agent (HF, Blog)

  • Baidu ERNIE 5.1 Preview hits #13 on Arena (#1 Chinese lab) using just 6% of comparable pretraining compute (ernie.baidu.com)

Big CO LLMs + APIs

  • OpenAI publishes blog explaining GPT-5.5’s β€œgoblin mode” - reward amplification during RL training created an obsession with creature metaphors, leading to duplicated suppression instructions in the Codex system prompt

  • OpenAI ends Microsoft Azure exclusivity, AWS announces GPT-5.5 and Codex on Bedrock; AGI clause removed from contract (Sam tweet)

  • Gemini can now generate and export Docs, Sheets, Slides, PDFs, .docx, .xlsx, LaTeX directly from chat - free for all users globally (Blog)

  • NVIDIA releases Nemotron 3 Nano Omni - 30B/3B-active hybrid Transformer-Mamba MoE with 256K context, 9x throughput on consumer hardware (Blog)

Agentic Commerce & Tools

  • Stripe launches Link wallet for agents at Sessions 2026 - AI agents get scoped payment credentials with mandatory human approval, real card never exposed (Blog)

  • Stripe removes waitlist on Projects.dev - 32 infrastructure providers (Cloudflare, WorkOS, ElevenLabs, Twilio, Daytona, Browserbase, AgentMail, etc.) provisionable via CLI for AI agents

  • Cursor launches SDK exposing the same runtime, harness, and models that power Cursor IDE - now embeddable in any product (Docs)

  • Cognition launches Devin for Terminal - local CLI coding agent with /handoff command for seamless cloud transfer (cli.devin.ai)

Evals & Benchmarks

  • WolfBench tests 23 models across 300+ runs on Terminal-Bench 2.0 - Cursor Agent + GPT-5.5 is the #1 combination (wolfbench.ai)

  • Microsoft’s DELEGATE-52 benchmark shows GPT-5.4 loses 28% of document content after 20 iterative edits, frontier models corrupt stealthily while preserving structure

This Week’s Buzz - Weights & Biases

  • IBM Granite 4.1 live on W&B Inference at $0.05/$0.10 per million input/output tokens with 128K context

  • WolfBench results going viral with Cursor + GPT-5.5 dominance, Codex and Devin testing in the pipeline

AI Detection & Cognitive Security

  • Pangram Labs launches Chrome extension auto-flagging AI content in real time on X, LinkedIn, Reddit, Substack, Medium with 99.98% accuracy and 1-in-10,000 false positive rate (pangramlabs.com)

  • Taylor Lorenz uses Pangram API to analyze top 25 Substack bestsellers, finding many popular newsletters are near-fully AI-generated

AI Art, Video & Audio

  • ElevenLabs launches ElevenMusic - full music platform with discovery, remixing, royalties; 4,000+ indie artists at launch (elevenmusic.io)

  • HeyGen HyperFrames integrates natively with Claude Design - HTML-to-MP4 motion graphics via single CLI command (hyperframes.dev)

  • xAI drops Grok Imagine update with dramatically improved lip sync, sound, and 30-second video extensions

  • OpenAI engineer confirms team is actively fixing GPT-Image-2’s noise artifact issue

Other

  • Talkie - 13B open-weight LLM trained exclusively on pre-1930 text, by Alec Radford and David Duvenaud (talkie-lm.com)

  • GPT-5.5 Codex full system prompt leaked from OpenAI’s open-source repo, revealing 272K context window, four reasoning levels, three personality modes, and the duplicated anti-goblin instruction