Episode Summary
This week's ThursdAI went deep on agent skills – the open standard that's turning general-purpose AI agents into domain experts with nothing more than markdown files and a directory structure. Eleanor Berger from Agentic Ventures joined for a masterclass on skills, while Alex demoed adding skill support to the Chorus app in just 3.5 hours using a Ralph loop. The show also covered Claude Cowork (a week-and-a-half sprint, 100% written by Claude Code), GPT 5.2 Codex hitting the API where Cursor used it to build a full browser from scratch with 330,000 commits, and Google rolling out Gemini personalized intelligence across Gmail, YouTube, and Search.
In This Episode
- 📰 TL;DR
- Open Source AI Models
- MedGemma
- 📰 Drama Corner & Partnerships
- 🛠️ Claude Cowork
- 🟢 GPT 5.2 Codex
- 🟢 Gemini Personal Intelligence
- 🤖 Agent Skills Deep Dive
- 🤖 Skills Adoption & Platform Support
- 🤖 What is a Skill? Structure Explained
- 🤖 Scripts, References & Assets
- 🤖 Creating Skills with AI
- 🤖 Practical Examples & Use Cases
- 🛠️ Demo: Adding Skills to Chorus
- 🤖 Future of Skills: Marketplaces & Sharing
- Hosts & Guests
- By The Numbers
- 🔥 Breaking During The Show
📰 TL;DR
Alex opens the show with a call to non-developers to dive into AI agents, introduces the panel, and runs through a packed week: open-source medical LLMs, Claude Cowork launch, GPT 5.2 Codex in the API, Gemini personal intelligence, drama between Anthropic and Open Code, and a deep dive into agent skills with guest Eleanor Berger.
- Agent skills deep dive announced as the main topic
- Claude Cowork launched for non-technical users
- GPT 5.2 Codex finally released via API
- Gemini personal intelligence across Google services
Open Source AI Models
The panel covers open-source releases: Byte's M3, a 235B-parameter medical LLM fine-tuned from Qwen3 that claims to beat GPT 5.2 on HealthBench, plus Anthropic and OpenAI both pushing into healthcare with HIPAA-ready products. Nisten highlights that M3 can run on an M3 Ultra at usable speeds.
- M3: 235B medical LLM, Apache 2.0, beats GPT 5.2 on HealthBench
- 22B active parameters → runnable on M3 Ultra
- Anthropic launches Claude for Healthcare with HIPAA compliance
MedGemma
Google releases MedGemma 1.5 for medical use cases, while Nisten and Wolfram clarify it's a completely different model class (4B for imaging) that pairs well with the much larger M3. Also covered: OpenAI acquiring Torch Health and Anthropic's Claude achieving 92% on Med Agent Bench with Opus 4.5.
- MedGemma 1.5: small enough for offline medical imaging
- Opus 4.5 hits 92% on Med Agent Bench
- OpenAI acquires Torch Health for GPT Health
📰 Drama Corner & Partnerships
Spicy industry news: Thinking Machines co-founders return to OpenAI, Soumith Chintala becomes their CTO. Anthropic blocks Open Code from using Max subscription as a wrapper and blocks xAI from using Claude Code. Apple announces Gemini will power Siri. OpenAI inks a $10B deal with Cerebras for 2028.
- Anthropic blocks Open Code and xAI from Claude services
- Apple partners with Google – Gemini to power Siri
- OpenAI × Cerebras: $10B for 750MW compute (2028)
- Thinking Machines co-founders return to OpenAI
🛠️ Claude Cowork
Anthropic launches Claude Cowork โ Claude Code for non-developers, built in a week-and-a-half sprint with 100% of the code written by Claude Code itself. Alex demos it live, adding Flux Klein support to an image extension project without seeing a single line of code. The panel discusses the security implications and the dangerously-skip-permissions debate.
- 100% coded by Claude Code in a 1.5-week sprint
- Research preview, Mac-only, requires Max subscription
- Chrome connector enables browser automation
- Live demo: added Flux model support without viewing code
🟢 GPT 5.2 Codex
OpenAI finally releases GPT 5.2 Codex via API after months of exclusivity in the Codex app. Cursor used it to build a complete browser from scratch in Rust with 330,000 commits and hundreds of concurrent agents. LDJ and Ryan debate context compaction – Ryan drops the hot take that compaction doesn't work and atomic Ralph-style tasks are the real solution.
- GPT 5.2 Codex now in Cursor, GitHub Copilot, and VS Code
- Cursor built a browser from scratch: ~3M lines of Rust
- Native context compaction support for long sessions
- Ryan's hot take: auto compaction doesn't work
🟢 Gemini Personal Intelligence
Google ships personalized AI in Gemini, reasoning across Gmail, YouTube, Photos, and Search with explicit opt-in. Alex tests it – it figured out he drives a Tesla Model Y from emails and noticed his recent Honda Odyssey search. The panel discusses Google's massive data moat and LDJ predicts MCPs for everything.
- Gemini reasons across Gmail, YouTube, Photos, Search
- Explicit opt-in for US Pro and Ultra users
- Google's data moat vs OpenAI and Anthropic
- LDJ: MCPs for everything, cross-platform personal AI
🤖 Agent Skills Deep Dive
Eleanor Berger from Agentic Ventures joins to kick off the skills deep dive. She explains that skills are an admission that we now have general-purpose agents – they do everything except know what you want. Skills are the missing piece: simple markdown files in a directory that give agents domain expertise via progressive disclosure.
- Skills = admission we have general-purpose agents
- Simple markdown + directory structure, universally adopted
- Progressive disclosure: agents load skills on demand
- Every major coding agent now supports the standard
🤖 Skills Adoption & Platform Support
Alex walks through the current adoption landscape: Claude is the only chat interface supporting skills, but virtually every coding IDE (Cursor, Windsurf, Anti-Gravity) and CLI (Claude Code, AMP, Open Code, Codex) now supports the standard. Eleanor gives a shout-out to AMP as one of the first adopters.
- Cursor, Anti-Gravity, and Gemini CLI added support this week
- AMP was one of the first adopters
- Skills work cross-platform: same skills, any agent
🤖 What is a Skill? Structure Explained
Eleanor walks through the anatomy of a skill: a directory with a skill.md file containing YAML front matter (name + description of when to use it). The magic is that each skill takes only 50–100 tokens of metadata, so you can have hundreds without polluting context. Alex compares it to Neo in The Matrix: the model decides when to load domain knowledge.
- A skill is a directory with skill.md + optional scripts/references
- 50–100 tokens per skill metadata → hundreds fit in context
- Progressive disclosure: agent loads full skill only when needed
- Skill creator skill: self-reflecting AI that builds skills
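To make the anatomy concrete, here is a minimal hypothetical skill.md; the name, description, and steps are invented for illustration, not taken from the show:

```markdown
---
name: pdf-form-filler
description: Fill out PDF forms. Use when the user asks to complete, fill, or populate fields in a PDF document.
---

# PDF Form Filler

1. Inspect the form with scripts/list_fields.py to enumerate its fields.
2. Ask the user for any values you cannot infer from context.
3. Save the filled copy next to the original with a "-filled" suffix.
```

Only the front matter is loaded upfront; the body below it is read when the agent decides the skill applies.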
🤖 Scripts, References & Assets
Eleanor explains the three optional directories in a skill: scripts (Python/TypeScript code for API calls or computations), references (additional markdown for progressive loading), and assets (templates, images, static files). Ryan highlights that experts like Vercel are now releasing skill packs for frameworks like Next.js and React.
- Scripts: runnable code for APIs, calculations, tools
- References: additional markdown loaded on demand
- Vercel releasing official Next.js/React skill packs
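Putting the three optional directories together, a full skill might be laid out like this (a hypothetical example; only skill.md is required):

```
pdf-form-filler/
├── skill.md              # front matter + core instructions
├── scripts/
│   └── list_fields.py    # runnable code the agent can execute
├── references/
│   └── field-types.md    # extra markdown, loaded on demand
└── assets/
    └── template.pdf      # static file shipped with the skill
```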
🤖 Creating Skills with AI
Eleanor reveals the key insight: you don't have to manually create skills – agents are really good at building them. She argues this solves continual learning: teach by doing, then tell the agent to package what you just did as a reusable skill. Alex explains that Claude's chat interface supports skills directly for Max subscribers.
- Agents can create skills from your workflows
- "Continual learning? It's solved. The problem is solved."
- Teach by doing: work with the agent, then package as skill
- Claude web/Mac chat supports skills for Max subscribers
🤖 Practical Examples & Use Cases
Eleanor shares her skills portfolio: flashcard apps turned into skills, image generation via Nano Banana, MCP replacements, and driving multiple models from Claude. Wolfram describes his to-do list manager skill and screenshot-based workflows. Eleanor drops the key insight: skills are the joker card of customization – they replace commands, hooks, MCPs, and even small apps.
- Eleanor replaced a full app with a 10-minute skill
- Skills can replace MCP servers, hooks, and commands
- Wolfram: to-do list manager built entirely as a skill
- Skills are portable between different agents and models
🛠️ Demo: Adding Skills to Chorus
Alex reveals his big project: he used a Ralph loop with Claude Code to add full skill support to Chorus, an open-source app that compares answers across multiple LLMs. In 3.5 hours, Claude built a settings panel, skill discovery from the filesystem, front-matter extraction, and cross-model skill injection – making skills work with GPT 5.2 Codex, Gemini, and every OpenRouter model.
- 3.5 hours via Ralph loop to add full skill support
- Skills now work across any LLM via Chorus + OpenRouter
- Settings UI, filesystem discovery, and front-matter parsing
- GPT 5.2 Codex using Claude-style skills for the first time
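The discovery and front-matter steps described above can be sketched in a few lines. This is a minimal Python sketch under assumed conventions (flat `key: value` front matter between `---` fences, one skill.md per directory), not Chorus's actual implementation:

```python
import re
from pathlib import Path


def parse_front_matter(text: str) -> dict:
    """Extract the YAML-style front matter between the opening '---' fences.

    A tiny hand-rolled parser for flat `key: value` pairs, so the sketch
    needs no third-party YAML dependency.
    """
    match = re.match(r"^---\n(.*?)\n---\n", text, re.DOTALL)
    if not match:
        return {}
    meta = {}
    for line in match.group(1).splitlines():
        if ":" in line:
            key, _, value = line.partition(":")
            meta[key.strip()] = value.strip()
    return meta


def discover_skills(root: Path) -> list[dict]:
    """Find every <skill>/skill.md under `root` and keep only its name and
    description, the small metadata that goes into the system prompt.
    The full body stays on disk and is read only when the model asks."""
    skills = []
    for skill_md in sorted(root.glob("*/skill.md")):
        meta = parse_front_matter(skill_md.read_text())
        if "name" in meta and "description" in meta:
            skills.append({
                "name": meta["name"],
                "description": meta["description"],
                "path": skill_md,  # loaded in full on demand
            })
    return skills
```

The same two functions work regardless of which model sits behind the chat, which is what makes the cross-model injection in the demo possible: the metadata list is just text appended to any system prompt.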
🤖 Future of Skills: Marketplaces & Sharing
Ryan asks if we're heading toward a skill marketplace – he already spent $200 on skills from The Boring Marketer. Alex predicts a mix: companies turning docs into skills, free community-shared skill packs via Git, and paid specialist collections. Ryan closes by telling Alex to sell his podcast production skills.
- Ryan spent $200 on marketing skills pack – worth it
- Skills shareable via Git, local per project or global per user
- Skill marketplaces coming alongside free community sharing
- WeaveHacks 3 announced: Jan 31–Feb 1, Self-Improving Agents
Hosts & Guests
Alex Volkov - AI Evangelist at Weights & Biases (@altryne)
Co-hosts - @WolframRvnwlf @yampeleg @nisten @ldjconfirmed @ryancarson
Vaibhav Srivastav (VB) - DX at OpenAI (@reach_vb)
Open Source LLMs
Z.ai GLM-OCR: 0.9B parameter model achieves #1 ranking on OmniDocBench V1.5 for document understanding (X, HF, Announcement)
Alibaba Qwen3-Coder-Next, an 80B MoE coding agent model with just 3B active params that scores 70%+ on SWE-Bench Verified (X, Blog, HF)
Intern-S1-Pro: a 1-trillion-parameter open-source MoE achieving SOTA scientific reasoning across chemistry, biology, materials, and earth sciences (X, HF, Arxiv, Announcement)
StepFun Step 3.5 Flash: 196B sparse MoE model with only 11B active parameters, achieving frontier reasoning at 100-350 tok/s (X, HF)
Agentic AI segment
Big CO LLMs + APIs
OpenAI launches Codex App: A dedicated command center for managing multiple AI coding agents in parallel (X, Announcement)
OpenAI launches Frontier, an enterprise platform to build, deploy, and manage AI agents as "AI coworkers" (X, Blog)
Anthropic launches Claude Opus 4.6 with state-of-the-art agentic coding, 1M token context, and agent teams for parallel autonomous work (X, Blog)
OpenAI releases GPT-5.3-Codex with record-breaking coding benchmarks and mid-task steerability (X)
This week's Buzz - Weights & Biases update
Links to the gallery of our hackathon winners (Gallery)
Vision & Video
xAI launches Grok Imagine 1.0 with 10-second 720p video generation, native audio, and API that tops Artificial Analysis benchmarks (X, Announcement, Benchmark)
Kling 3.0 launches as all-in-one AI video creation engine with native multimodal generation, multi-shot sequences, and built-in audio (X, Announcement)
Voice & Audio
Mistral AI launches Voxtral Transcribe 2 with state-of-the-art speech-to-text, sub-200ms latency, and open weights under Apache 2.0 (X, Blog, Announcement, Demo)
ACE-Step 1.5: Open-source AI music generator runs full songs in under 10 seconds on consumer GPUs with MIT license (X, GitHub, HF, Blog)
OpenBMB releases MiniCPM-o 4.5 - the first open-source full-duplex omni-modal LLM that can see, listen, and speak simultaneously (X, HF, Blog)
AI Art & Diffusion & 3D