New Models
Seaweed-7B
ByteDance publishes Seaweed-7B video generation foundation model
ByteDance publicly presented Seaweed-7B, a 7B parameter video generation foundation model, showing competitive video quality from a comparatively small model. Details and demos were published at seaweed.video.
New Models
Seedream 3.0
ByteDance Seedream 3.0: bilingual 2K text-to-image model
ByteDance's Seed team announced Seedream 3.0, a powerful bilingual (Chinese/English) text-to-image model that generates native 2048x2048 images with fast inference of around 3 seconds for a 1K image on an A100. It challenges the top closed image generation models.
Papers & Research
Seed-Thinking-v1.5
ByteDance publishes Seed-Thinking-v1.5 reasoning model tech report
ByteDance's Seed team published Seed-Thinking-v1.5, a new reasoning model announced via a technical report on GitHub. It was mentioned among the week's open-source LLM news, though weights were not released at the time.
Products & Apps
OmniHuman (via Dreamina)
ByteDance's OmniHuman image-to-avatar model goes public via Dreamina
ByteDance's impressive OmniHuman model, which turns a single image plus audio into a realistic talking avatar video, became publicly usable through the Dreamina (CapCut) website. The results land squarely in uncanny-valley territory, as Alex demonstrated with his own avatar thread.