Lance 3B

ByteDance 🇨🇳 China
active

Version History

1.0 major

Initial release of Lance, a 3B-parameter unified multimodal model supporting image and video generation, editing, and understanding tasks. Trained from scratch on 128 A100 GPUs with a staged multi-task training recipe.

Coverage

model release · ByteDance

ByteDance releases Lance, 3B-parameter unified multimodal model handling image and video generation, editing, and understanding

ByteDance has released Lance, a 3-billion-parameter multimodal model that performs image and video generation, editing, and understanding within a single framework. The model was trained entirely from scratch on 128 A100 GPUs and achieves 84.67% on DPG-Bench and 74% on GenEval, competing with larger models despite its compact size.
