Bernini-R

ByteDance🇨🇳 China
active

Version History

1.0major

Initial release of Bernini-R inference code and model weights with MLLM-based semantic planner and DiT-based renderer architecture. Supports seven task types including text-to-video, video editing, and reference-guided generation.

Coverage

model releaseByteDance

ByteDance Open-Sources Bernini-R Video Diffusion Model With Semantic Planning Architecture

ByteDance released Bernini-R, an open-source video generation and editing model that combines an MLLM-based semantic planner with a DiT-based renderer. The model requires Hopper-class GPUs (H100/H800/H200) for optimal performance and supports multiple tasks including text-to-video, video editing, and reference-guided generation.

2 min read