New ModelsOpen weights
Qwen 3
Alibaba open-weights the full Qwen 3 family under Apache 2.0
Alibaba released the entire Qwen 3 stack: two MoE models (235B total/22B active and 30B/3B active) plus six dense siblings from 32B down to 0.6B, all Apache 2.0 with day-one support in LM Studio, Ollama, vLLM, MLX and llama.cpp. The headline feature is a runtime hybrid 'thinking' toggle (/think and /no_think) that trades latency for reasoning depth. Trained on ~36T tokens with 128K context and 119-language coverage, the 235B MoE rivals DeepSeek-R1, o1, o3-mini and Gemini 2.5 Pro on coding and math.
235 B Flagship MoE total parameters (22B active)30 B Qwen3-30B-A3B hit 57 tok/s on a Mac with speculative decoding36 Trillions of pre-training tokens (2x Qwen 2.5)
Major Features & Updates
NotebookLM Audio Overviews
NotebookLM AI Audio Overviews go multilingual with 50+ languages
Google expanded NotebookLM's AI audio overviews (the podcast-style summaries) to support more than 50 languages, taking the feature global beyond its English-only debut.
New ModelsOpen weights
Helium-1
Kyutai releases Helium-1, a 2B European-language model plus dactory pipeline
Kyutai released Helium-1, a 2B-parameter model distilled from Gemma-2-9B and purpose-built for Europe's 24 official languages, under CC-BY 4.0. It sets a new state of the art for its size class on MMLU-EU, ARC-EU and FLORES translation while fitting in under 2GB VRAM for edge and phone deployment. They also open-sourced 'dactory' (MIT), their full Common Crawl data-processing pipeline that scores, dedups and tags webpages.