IBM

3 releases covered on ThursdAI · ibm.com ↗

April 2026

IBM Apr 30, 2026

New Models · Open weights

Granite 4.1

IBM Granite 4.1: dense non-thinking models with top tool calling

IBM released the Granite 4.1 family (3B/8B/30B): dense, non-thinking models under Apache 2.0 with best-in-class tool calling. The 8B model scores 73 on the Berkeley Function Calling Leaderboard (BFCL). IBM claims 20x token efficiency over Qwen3.5 9B, and the models are live on W&B Inference at $0.05 per million input tokens and $0.10 per million output tokens, with 128K context.

IBM Granite blog ↗ · Hugging Face ↗ · W&B Inference ↗


#open-source #agents #industry
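
To make the Granite 4.1 pricing concrete, here is a minimal sketch of the per-request arithmetic at the quoted W&B Inference rates ($0.05 per million input tokens, $0.10 per million output tokens). The function name `estimate_cost` and the sample token counts are illustrative assumptions, not part of any W&B or IBM API, and actual billing may differ.

```python
# Rates as quoted in the entry above (USD per one million tokens).
INPUT_PRICE_PER_M = 0.05
OUTPUT_PRICE_PER_M = 0.10


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request at the quoted rates.

    Hypothetical helper for illustration only; it ignores any
    minimums, rounding, or discounts a provider might apply.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a long 100,000-token prompt with a 2,000-token reply.
    cost = estimate_cost(100_000, 2_000)
    print(f"Estimated cost: ${cost:.4f}")
```

At these rates, even a prompt that fills most of the 128K context costs well under one cent per request.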

October 2025

IBM Oct 30, 2025

New Models · Open weights

Granite 4.0 Nano

IBM Granite 4.0 Nano: ultra-efficient tiny models for edge deployment

IBM released Granite 4.0 Nano, a set of ultra-efficient tiny open models aimed at edge deployment. The release continues the trend of capable sub-billion-to-few-billion parameter models that can run locally on constrained hardware.

Artificial Analysis on X ↗ · Artificial Analysis: Granite ↗


#open-source #on-device

September 2025

IBM Sep 25, 2025

New Models · Open weights

Granite Docling 258M

IBM releases Granite Docling 258M, a compact document-parsing VLM

IBM published Granite Docling 258M, an ultra-compact open-source vision-language model for document understanding that converts documents into structured output. At just 258M parameters, it reinforced the show's point that tiny specialized models are becoming genuinely useful workflow tools.

Hugging Face ↗


#vision #on-device #open-source
