Cosmos3-Nano

NVIDIA (🇺🇸 United States)
Status: active
Context window: 256K tokens

Version History

3-nano (major)

Initial release of Cosmos3-Nano, a 16B-parameter omnimodal world model for Physical AI applications. It has a 256K-token context window and generates video, audio, images, and robot actions from multimodal inputs.

Coverage

Model release (NVIDIA)

NVIDIA Releases Cosmos3-Nano: 16B-Parameter Omnimodal World Model for Physical AI with 256K Token Context

NVIDIA has released Cosmos3-Nano, a 16-billion-parameter omnimodal world model that generates video, audio, images, and robot action commands from combinations of text, image, video, and action-trajectory inputs. The model supports a 256K-token context window and is designed for Physical AI applications, including robotics, autonomous vehicles, and smart manufacturing environments.
