New ModelsOpen weights
PersonaPlex-7B
NVIDIA releases PersonaPlex-7B voice model
NVIDIA released PersonaPlex-7B, an open voice/audio model published on Hugging Face with code on GitHub. Listed in the week's Voice & Audio releases.
New ModelsOpen weights
Alpha Mayo
NVIDIA Alpha Mayo: open source reasoning self-driving models
NVIDIA announced Alpha Mayo at CES, a family of open source reasoning-based self-driving AI models. The models perform end-to-end autonomous driving with explicit reasoning steps, like identifying jaywalkers and stopping accordingly, demoed in a Mercedes-Benz.
Acquisitions
Groq acquisition
NVIDIA acquires Groq team and licenses its tech for ~$20B
NVIDIA entered an exclusive licensing deal with Groq and acquired most of its team for approximately $20B. Groq's inference-optimized chips, created by former Google TPU lead Jonathan Ross, complement NVIDIA's training dominance as inference demand grows exponentially across AI use cases.
New ModelsOpen weights
Nemotron Speech ASR
Nemotron Speech ASR: 600M streaming model with 24ms latency
NVIDIA released Nemotron Speech ASR, a 600M parameter open source streaming speech recognition model with 24ms median latency and support for 900 concurrent streams on a single H100. Kwindla Hultman Kramer of Daily.co demoed sub-500ms voice-to-voice latency using a three-model pipeline of Nemotron ASR, Nemotron Nano LLM, and Magpie TTS.
24ms Nemotron Speech latency
Products & Apps
Vera Rubin
NVIDIA Vera Rubin platform: 5x Blackwell inference at CES 2026
Jensen Huang unveiled the Vera Rubin platform at CES 2026, NVIDIA's next-gen AI computer delivering 50 PFLOPS and 5x inference performance over Blackwell while adding only ~200W of power draw. It needs 75% fewer GPUs for 10 trillion parameter MoE training, packs 72 GPUs per rack with 20.7TB memory and 13 TB/s bandwidth, is 100% liquid cooled, and entered full production just four months after the B300.
5x Vera Rubin vs Blackwell75% Fewer GPUs needed