Everything AI Released in July 2025

14 releases covered live on the show — every model, product, paper and tool that mattered, with links and our analysis.

🧠 New Models 8

Agentica
New Models · Open weights

DeepSWE-Preview

DeepSWE-Preview hits 59% SWE-Bench Verified with pure RL on Qwen3-32B

Agentica and collaborators released DeepSWE-Preview, a fully open-source coding agent trained with reinforcement learning on top of Qwen3-32B. It reached 59% on SWE-Bench Verified, one of the top open results on a benchmark dominated by closed systems. The team, joined on the show by guest Michael Luo of UC Berkeley, published its training methodology and weights, emphasizing reproducible reward design and verification over sealed benchmark numbers.

59% SWE-Bench Verified
Baidu
New Models · Open weights

ERNIE 4.5

Baidu open-sources ERNIE 4.5, a 10-model multimodal family

Baidu open-sourced the ERNIE 4.5 series, a family of 10 models with multimodal capabilities, ranging from 424B down to 0.3B parameters. The family reportedly beats o1 on DocVQA. The release marks a sharp reversal of Baidu's previous anti-open-source posture and is another sign that Chinese labs are setting the pace in open source.

10 ERNIE 4.5 models
Chai Discovery
New Models

Chai-2

Chai Discovery's Chai-2 enables zero-shot antibody design

Chai Discovery introduced Chai-2, a model for zero-shot antibody design that generates candidate antibodies without iterative lab screening. It was mentioned in the tools section of the show notes as one of the week's notable science releases.

Huawei
New Models · Open weights

Pangu Pro MoE

Huawei's Pangu Pro MoE: 72B model trained entirely on Ascend NPUs

Huawei released Pangu Pro MoE, a 72B-parameter mixture-of-experts model pretrained on 13T tokens using its own Ascend NPUs rather than Nvidia or AMD hardware, with a reported throughput of 1,528 tokens/sec. The panel framed it as the geopolitical open-model story of the week, showing how far Chinese compute stacks have advanced under sanctions.

OpenRouter
New Models

Cypher Alpha

Mystery 1M-context model 'Cypher Alpha' appears free on OpenRouter

A stealth model called Cypher Alpha showed up on OpenRouter, free to use and with a 1M-token context window. The panel speculated that it could be Amazon Titan. Alex used it as an example of how model releases increasingly arrive as anonymous market probes rather than tidy launches.

Tencent
New Models · Open weights

Hunyuan-A13B-Instruct

Tencent ships Hunyuan-A13B: 80B MoE with only 13B active params

Tencent released Hunyuan-A13B-Instruct, an 80B-parameter MoE that activates only 13B parameters at inference while keeping a 256K context window. Built by a team with WizardLM lineage, it posts strong reasoning benchmark scores and comes across as unusually practical for its class, though the panel flagged its license restrictions.

13B Hunyuan active params
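
To make the "80B total, 13B active" figure concrete, here is a minimal sketch of the arithmetic. The function name `active_fraction` is hypothetical and used only for illustration. The idea that per-token compute scales roughly with active parameters is a common rule of thumb for MoE models, not a figure from Tencent.

```python
def active_fraction(total_params_b: float, active_params_b: float) -> float:
    """Return the share of an MoE model's parameters used per token.

    Both arguments are parameter counts in billions. This is a
    hypothetical helper for illustration, not part of any library.
    """
    if total_params_b <= 0:
        raise ValueError("total_params_b must be positive")
    if not 0 <= active_params_b <= total_params_b:
        raise ValueError("active_params_b must be between 0 and total_params_b")
    return active_params_b / total_params_b


# Figures quoted in the item above: 80B total parameters, 13B active.
share = active_fraction(total_params_b=80, active_params_b=13)
print(f"Active share per token: {share:.2%}")
```

Under that rule of thumb, the model does per-token work closer to a 13B dense model, while still needing memory for all 80B parameters.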

🚀 Products & Apps 1

Dynamics Lab
Products & Apps

Mirage

Mirage debuts as the first AI-native UGC game engine

Dynamics Lab unveiled Mirage, billed as the world's first AI-native user-generated-content game engine, with real-time photorealistic playable demos powered by world-model-style generation. Alex reacted to it live as the most visibly fun demo of the week and a preview of where interactive media is headed.

✨ Major Features & Updates 3

Cloudflare
Major Features & Updates

One-Click AI Bot Blocking

Cloudflare launches one-click AI bot blocking for the web

Cloudflare announced a one-click feature letting site owners block AI scraping bots, a direct response to the economics of perpetual web scraping by AI labs. The move puts a default-off switch in front of a large share of the internet and highlights the tension between open research norms and commercial scraping.

Cursor (Anysphere)
Major Features & Updates

Cursor Agents on Web, Mobile & Slack

Cursor rolls out coding agents on web, mobile, and Slack

Cursor launched its AI coding agents on web and mobile with Slack integration, extending code agents beyond the editor window into ambient, always-on workflow software. The launch landed the same week Cursor poached key creators of Claude Code, making it product-strategy news as much as HR news.

Google DeepMind
Major Features & Updates

Gemini 2.5 Pro (free tier)

Gemini 2.5 Pro returns to Google's free tier

Google brought Gemini 2.5 Pro back to its free tier, making its flagship reasoning model available again to consumer users at no cost. It was a quick-hit item in the big-company segment of the show.

📄 Papers & Research 1

Microsoft
Papers & Research

MAI-DxO

Microsoft's MAI-DxO hits 85.5% on NEJM diagnostic cases vs 20% for doctors

Microsoft AI published MAI-DxO, a medical diagnostic orchestration system that reached 85.5% accuracy on challenging NEJM-style cases, compared with roughly 20% for practicing physicians. The result is framed as a systems win rather than a single-model win, suggesting that orchestration may outperform individual models in high-stakes expert workflows.

85.5% MAI-DxO accuracy

🌀 Also Released 1

Meta
Also Released

Meta Superintelligence Labs (MSL)

Meta launches Superintelligence Labs with up to $300M comp packages

Zuckerberg formally assembled Meta Superintelligence Labs, recruiting a dream team of researchers from OpenAI and other labs with rumored compensation packages of up to $300M. The panel treated the spree as proof that the AI talent war has entered full wartime economics, debating whether money alone can buy research momentum.

$300M Rumored Meta packages