What happened in AI the week of September 18, 2025?

This episode feels like a snapshot of AI's coding-and-agents moment: GPT-5-Codex arrives, OpenAI and Google collect ICPC bragging rights, and Jeremy Berman joins to explain a new ARC-AGI state of the art. The panel also spends real time on Reve and Meta's new glasses, making the show as much about interfaces as about benchmarks.

Open Models and the Week's First Wave: what should I know?

The front of the episode moves quickly through the open-model board before settling into the bigger themes that matter later. Even in the recap mode, the panel keeps emphasizing usability, workflow fit, and how quickly the competitive baseline is moving.

GPT-5-Codex and the Agent Platform Story: what should I know?

GPT-5-Codex gives the show its clearest coding headline. The conversation is not just about raw capability, but about what happens when coding models become easier to trust for longer, more productized workflows.

Meta Glasses and the Multimodal Interface Shift: what should I know?

Meta's new glasses push the episode toward hardware and interface design. The segment works because it links display-equipped wearables back to the same broader question the show keeps asking: where do AI systems actually live in the user's daily workflow?

Jeremy Berman on Beating ARC-AGI: what should I know?

Jeremy Berman gives the benchmark discussion real depth by walking through the reasoning behind the latest ARC-AGI result. The segment stays grounded in method, limitations, and why test-time compute and iteration still matter more than easy leaderboard narratives.

ICPC Bragging Rights and the Next Video Wave: what should I know?

The final stretch connects competition wins, usage stats, and video releases into one broader story about momentum. By the end, the episode feels like a clear snapshot of a field where coding, multimodal interfaces, and research prestige are all colliding at once.

Gpt-5-Codex, OAI wins ICPC, Reve, ARC-AGI SOTA Interview

Alex Volkov 0:32

All righty, let's go.

0:35

Welcome everyone to Thursday I for September 18th. My name's Alex. Ow. I'm in Ai Avengers with Weights, & Biases from CoreWeave with me, my trusted cohost, Wolfram. What's up Wolfram?

Wolfram Ravenwlf 0:48

Hi man.

0:49

You remind me that I need to get some glasses.

Alex Volkov 0:51

Oh, yeah, yeah.

0:52

We're definitely gonna chat about this. I'm sitting here wearing my, meta Houston and, yeah, I just wanna, say to everyone, welcome to the stream today. such a busy week, man. Such a busy week. it feels like the slump of the summer is over and we're well into the fall craziness where people supposed to lock in and bring a lot of ai, updates. But, definitely a lot on my plate. we'll wait for some other hosts, but I will say that very interestingly, we have a great interview today. You guys should stick around for, we're chatting with Jeremy Berman at the end of the second hour. Jeremy has just broke the top score on R-K-G-I-R-K-G-I is this Kind of, multi model test for ai. he broke the top score on first one and he used GR for it. And then, space Uncle Elon reposted everything that he did. so that's, very interesting and we will chat with him about, why grog specifically and how he did it, et cetera. So we're definitely gonna have a very interesting conversation there. Besides this, it's been a whole week full of very interesting things. not withstanding the Meta Connect presentation from yesterday with the AI glasses, which we've talked about before, which I'm donning right now. but I don't have the newest generation of obviously, so we're definitely gonna chat about that as well. and also always a great week when there's a new model from OpenAI. So we got GBT five Codex, which we're gonna also mention, probably more and more stuff.

Wolfram Ravenwlf 2:22

open source stuff as well.

Alex Volkov 2:23

And then they didn't talk about it for a day and then it was