NVIDIA Nemotron 3 Ultra

NVIDIA🇺🇸 United States
active
Context window1000K tokens

Version History

3-ultramajor

Initial release of Nemotron 3 Ultra featuring hybrid Transformer-Mamba MoE architecture with 550B total parameters, 55B active parameters, and 1M token context window optimized for agentic workloads.

Coverage