Cosmos 3 Super Image2Video

NVIDIA 🇺🇸 United States
active
Context window: 262K tokens

Version History

3.0 (major)

Initial release of Cosmos 3, NVIDIA's omnimodal world foundation model platform for Physical AI. It features 64B-parameter variants built on a Mixture-of-Transformers architecture that support video, image, audio, and robot action generation.

Coverage

model release (NVIDIA)

NVIDIA Releases Cosmos 3: 64B-Parameter Omnimodal World Model for Physical AI

NVIDIA released Cosmos 3, an omnimodal world foundation model platform for Physical AI spanning robotics, autonomous driving, and industrial environments. The flagship Cosmos3-Super variant has 64 billion parameters and uses a Mixture-of-Transformers architecture to generate video, images, audio, and action commands from text, image, video, and action-trajectory inputs.
