Bernini-R

Name: Bernini-R
Author: ByteDance

ByteDance · 🇨🇳 China

Status: active


Version History

1.0 · major · June 3, 2026

Initial release of Bernini-R inference code and model weights with MLLM-based semantic planner and DiT-based renderer architecture. Supports seven task types including text-to-video, video editing, and reference-guided generation.

Coverage

model release · ByteDance

ByteDance Open-Sources Bernini-R Video Diffusion Model With Semantic Planning Architecture

ByteDance released Bernini-R, an open-source video generation and editing model that combines an MLLM-based semantic planner with a DiT-based renderer. The model requires Hopper-class GPUs (H100/H800/H200) for optimal performance and supports multiple tasks including text-to-video, video editing, and reference-guided generation.

June 3, 2026 · 8:51 AM · 2 min read

ByteDance video-generation diffusion-models