New Models | Open weights
Qwen2.5-Omni-7B
Qwen launches Omni 7B: sees, hears, reads, and talks back
Qwen released Qwen2.5-Omni-7B, an open-weights omni-modal model that perceives text, images, audio, and video, and generates both text and speech. It packs end-to-end multimodal perception and spoken output into a 7B parameter model available on Hugging Face.
7B parameters
New Models | Open weights
DeepSeek-V3-0324
DeepSeek silently drops V3-0324, 685B params under MIT license
DeepSeek quietly updated its V3 base model with DeepSeek-V3-0324, a 685B-parameter MoE released on Hugging Face under the MIT license. This is not R1 (DeepSeek's reasoning model) but an update to the V3 base model that R1 was built on, and reportedly the base for a future R2.
685B parameters
New Models
Gemini 2.5 Pro
Google reclaims #1 with Gemini 2.5 Pro thinking model
Google released Gemini 2.5 Pro, a thinking model that took the #1 spot as the best all-around LLM available, with large jumps on benchmarks like AIME (up nearly 20 points) and GPQA. It inherits native multimodality and a 1M-token context window, maintains high accuracy even at 120k+ tokens on needle-in-a-haystack tests, and responds with surprisingly low latency (~13 seconds on hard reasoning questions vs 45+ seconds for others). Tulsee Doshi, head of product for Gemini models, joined the show to give the inside scoop.
Nearly 20-point jump on AIME benchmark | 1M-token context window | ~13 seconds latency on hard reasoning questions (vs 45+ for others)
New Models
Ideogram 3.0
Ideogram 3.0 launches with strong text, logos, and style references
Ideogram launched version 3.0 of its image generation model with another SOTA claim. It is particularly strong on text and logo rendering, photorealism, and style references, continuing Ideogram's edge in typography-heavy image generation.
New Models
GPT-4o (2025-03-26)
GPT-4o gets an update, ties for #1 on LMArena beating GPT-4.5
OpenAI shipped a new GPT-4o checkpoint (2025-03-26) that jumped over GPT-4.5 to tie for #1 on LMArena. The update landed as the show was being written and reads as a direct response to Gemini 2.5's launch in the escalating frontier-model race.
New Models
Reve Image
Reve emerges with SOTA diffusion image generation claims
Reve launched a new diffusion image generation model claiming state-of-the-art quality, reportedly beating heavyweights like Midjourney and Flux at roughly a penny per image. The previously low-profile lab made a splash with strong prompt adherence and image quality.
New Models | Open weights
Orpheus 3B
Canopy Labs drops Orpheus 3B natural-sounding speech model
Canopy Labs released Orpheus, an open speech language model that produces natural, human-sounding speech, headlined by a 3B model with smaller variants (1B, 500M, 150M) in the family. Weights are on Hugging Face with a Colab for trying it out, discussed on the show with Daily.co CEO Kwindla Kramer in the voice AI segment.
New Models | Open weights
EXAONE Deep 32B
LG open sources EXAONE and EXAONE Deep 32B reasoning model
LG AI Research open sourced its EXAONE family, headlined by EXAONE Deep 32B, a thinking/reasoning model. The release puts a large Korean lab's reasoning model in open weights on Hugging Face, and Alex published a live reaction video to the launch.
New Models | Open weights
Mistral Small 3.1
Mistral Small 3.1 24B: open-weights multimodal model
Mistral released Mistral Small 3.1, a 24B-parameter open-weights model that adds multimodal (vision) capabilities to the Small line. Both instruct and base checkpoints were published on Hugging Face, making it a strong local multimodal option at the 24B size class.
New Models | Open weights
Canary 1B/180M Flash
NVIDIA Canary Flash: Apache 2 speech recognition and translation
NVIDIA released Canary 1B Flash and 180M Flash, Apache 2.0 licensed speech recognition and translation models built on an encoder-decoder architecture (a FastConformer encoder with a Transformer decoder). The permissive license makes them freely usable for commercial ASR and translation workloads.
New Models | Open weights
Llama-Nemotron (Super 49B, Nano 8B)
NVIDIA drops Llama-Nemotron reasoning models plus training dataset
NVIDIA released the Llama-Nemotron family, including Super 49B and Nano 8B reasoning models, announced around GTC. Alongside the open weights, NVIDIA published the Llama-Nemotron post-training dataset, giving the community both the models and the data recipe behind them.
New Models
Next-gen audio models (gpt-4o-mini-tts & transcription)
OpenAI launches steerable voice model and two new transcription models
OpenAI launched a new emotionally steerable text-to-speech model plus two new transcription models, and the show covered the launch live as a watch party. The TTS model can be instructed how to speak (tone, emotion, character), is demoed at openai.fm, and is available alongside the transcription models through the API for voice agents.
New Models | Open weights
RF-DETR
Roboflow drops RF-DETR, a SOTA open-source object detection model
Roboflow released RF-DETR, a state-of-the-art real-time object detection model, announced as breaking news on the show by CEO Joseph Nelson. The model is fully open source on GitHub and targets practical, deployable computer vision workloads.
New Models | Open weights
Step-Video-TI2V
StepFun releases Step-Video-TI2V image-to-video model
Chinese lab StepFun dropped Step-Video-TI2V, an open text/image-to-video generation model. Weights are on Hugging Face with code on GitHub, adding another open-weights option to the fast-moving video generation space.
New Models | Open weights
Hunyuan3D 2.0 MV & Turbo
Tencent updates Hunyuan3D 2.0 with MultiView and Turbo variants
Tencent updated its Hunyuan3D 2.0 image-to-3D model with an MV (MultiView) version that conditions on multiple input views, plus a faster Turbo variant. The show highlighted it as new SOTA for 3D generation, available to try in a Hugging Face space.
New Models | Open weights
OLMo 2 32B
AllenAI ships OLMo 2 32B, a fully open model that beats GPT-4o mini
The Allen Institute for AI released OLMo 2 32B, its biggest fully open model yet, with weights, code, and dataset all published under Apache 2.0. Announced by Nathan Lambert as a last-second addition, it reportedly beats GPT-3.5 and GPT-4o mini as well as leading open-weight models like Qwen and Mistral at its size.
New Models
Seedream 2.0
ByteDance unveils Seedream 2.0 bilingual image generation foundation model
ByteDance released Seedream 2.0, a native Chinese-English bilingual image generation foundation model, alongside a technical paper. It emphasizes excellent text rendering (especially Chinese), cultural nuance, and human preference alignment, generating high-quality, culturally relevant images from prompts in either language.
New Models | Open weights
Command A
Cohere Command A: 111B enterprise model with 256K context on just 2 GPUs
Cohere announced Command A, a 111B parameter open-weights model with a 256K context window, presented on the show by Cohere's Sandra Kublik. It runs on only two GPUs where models of this size typically require around 32, and is built for enterprise use: agentic tasks, tool use, multilingual performance, and secure private deployments.
New Models | Open weights
EuroBERT
EuroBERT: multilingual encoder models from 210M to 2.1B parameters
EuroBERT is a new family of multilingual encoder models ranging from 210M to 2.1B parameters, trained on a 5 trillion-token dataset across 15 languages with 8K context support. It targets European and global language NLP tasks like retrieval and RAG, where properly encoding non-English character sets matters.
New Models | Open weights
Gemma 3
Google open sources Gemma 3, 1B-27B multimodal family with 128K context
Google released Gemma 3, an open-weights model family spanning 1B to 27B parameters with multimodal (text, image, video) capabilities, support for over 140 languages, and a 128K context window. The 27B model runs on a single GPU, with Sundar Pichai claiming competitors need roughly 10x the compute for similar performance. It shipped with day-one open source ecosystem support (Hugging Face, Ollama, Kaggle) plus ShieldGemma 2 for content moderation.
New Models | Open weights
Open-Sora 2.0
Open-Sora 2.0: 11B open-source video model trained for $200K
Open-Sora 2.0 is an 11B-parameter open-source video generation model that cost only about $200,000 to train. The team claims state-of-the-art results, with performance approaching OpenAI's Sora on some benchmarks, underscoring how fast open-source video generation is improving.
New Models | Open weights
DeepHermes 3 (24B / 3B)
Nous Research releases DeepHermes 24B and 3B hybrid reasoning models
Nous Research released DeepHermes hybrid reasoners at 24B (Mistral-based) and 3B sizes, models that can toggle between standard chat responses and long chain-of-thought reasoning. The 24B preview is available on Hugging Face as part of the week's wave of open-source reasoning model releases.
New Models | Open weights
Reka Flash 3
Reka Flash 3: 21B open-source reasoning model under Apache 2.0
Reka AI open sourced Reka Flash 3, a 21B-parameter reasoning model released under an Apache 2.0 license and trained with the REINFORCE Leave-One-Out (RLOO) reinforcement learning technique. It excels at chat, coding, instruction following, and function calling, with Nisten calling it possibly one of the best ~20B models available.
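For readers unfamiliar with RLOO, the core idea can be sketched in a few lines: sample several completions for the same prompt, then score each one against the average reward of the others. This is a minimal illustration of the general leave-one-out baseline, not Reka's training code; the function name `rloo_advantages` and the example rewards are hypothetical.

```python
from typing import List


def rloo_advantages(rewards: List[float]) -> List[float]:
    """Leave-one-out advantages for k sampled completions of one prompt.

    Hypothetical helper for illustration only. Each sample's baseline is
    the mean reward of the other k - 1 samples, so no learned value
    model is needed.
    """
    k = len(rewards)
    if k < 2:
        raise ValueError("RLOO needs at least two samples per prompt")
    total = sum(rewards)
    # Advantage of sample i = its reward minus the mean of all other rewards.
    return [r - (total - r) / (k - 1) for r in rewards]


if __name__ == "__main__":
    # Four made-up rewards for four completions of the same prompt.
    print(rloo_advantages([1.0, 0.0, 0.5, 0.5]))
```

In a full training loop, each advantage would weight the log-probability gradient of its completion. The advantages for a prompt always sum to zero, so above-average samples are reinforced and below-average ones are pushed down.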
New Models | Open weights
Wan 2.1 14B I2V LoRA video effects
Remade AI releases 8 open LoRA video effects for Wan 2.1
Remade AI published eight LoRA video effects for Alibaba's Wan 2.1 14B image-to-video model, including effects like squish, inflate, deflate, and cakeify. The open release shows video effects becoming trainable and customizable via LoRAs on top of open video models.
New Models | Open weights
Jamba 1.6 Large & Mini
AI21 releases Jamba 1.6 Large and Jamba 1.6 Mini open-weights models
AI21 Labs released Jamba 1.6 in Large and Mini sizes, updating its hybrid SSM-Transformer (Mamba-based) model family with open weights on Hugging Face. The Jamba architecture targets long-context efficiency compared to pure transformer models.
New Models | Open weights
QwQ-32B
Qwen releases QwQ-32B reasoning model that matches R1 on some evals
Alibaba's Qwen team released QwQ-32B, an open-weights reasoning model that matches DeepSeek R1 on several evals despite being roughly 20x smaller at 32B parameters. Qwen tech lead Junyang Lin joined the show to announce it, and the episode dubbed it Alibaba's 'R1 killer' for bringing strong reasoning to a size that runs on consumer hardware.
New Models | Open weights
Aya Vision
Cohere For AI releases Aya Vision 8B and 32B open multilingual vision models
Cohere For AI released Aya Vision in 8B and 32B sizes, extending the multilingual Aya family with open-weights vision-language capabilities. The models target image understanding across many languages.
New Models | Open weights
NotaGen
NotaGen open symbolic music model generates classical sheet music
NotaGen is an open symbolic music generation model that produces high-quality classical sheet music rather than raw audio. The release includes code on GitHub, weights on Hugging Face, and a browser demo.
New Models
Image-01
MiniMax launches Image-01 text-to-image model at 1/10 the cost
MiniMax released Image-01, a versatile text-to-image model the company positions at roughly one tenth the cost of competing image generation offerings. It is available through MiniMax's hosted platform.
New Models | Open weights
HunyuanVideo-I2V
Tencent releases HunyuanVideo-I2V open image-to-video model
Tencent finally shipped the long-awaited image-to-video version of HunyuanVideo, with open weights on Hugging Face and a hosted try-it experience. It lets users animate still images using one of the strongest open video generation models.
New Models | Open weights
CogView 4 (6B)
Zhipu AI open-sources CogView 4, a 6B text-to-image model
Zhipu AI released CogView 4, a 6B-parameter open text-to-image model in the CogView family, with code available on GitHub. It is notable as an open-weights image generation option with strong Chinese and English prompt support.