NVIDIA Nemotron-3-Super-120B-A12B

Name: NVIDIA Nemotron-3-Super-120B-A12B
Price: 0.2 USD per 1M tokens (input and output)
Author: NVIDIA

NVIDIA · 🇺🇸 United States

Status: active


Context window: 1000K tokens

Input / 1M tokens: $0.2

Output / 1M tokens: $0.2
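As a rough illustration of how the per-token pricing translates into a per-request cost, the sketch below multiplies token counts by the listed rates. The function name `estimate_cost_usd` and the example token counts are hypothetical; only the two $0.2 per 1M token rates come from the listing, and actual billing may differ by provider.

```python
# Rates copied from the listing: USD per 1M tokens.
INPUT_PRICE_PER_M = 0.2
OUTPUT_PRICE_PER_M = 0.2


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    input_cost = input_tokens / 1_000_000 * INPUT_PRICE_PER_M
    output_cost = output_tokens / 1_000_000 * OUTPUT_PRICE_PER_M
    return input_cost + output_cost


# Example: a 500K-token prompt with a 20K-token completion (made-up sizes).
cost = estimate_cost_usd(500_000, 20_000)
print(f"{cost:.4f}")  # prints 0.1040
```

Because the input and output rates are the same here, the cost depends only on the total token count; the two rates are kept separate so the sketch still applies if they diverge.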

Version History

A12B-BF16 · major · March 10, 2026

NVIDIA releases Nemotron-3-Super-120B-A12B-BF16, a 120 billion parameter model with latent MoE architecture for efficient text generation across 8 languages.

Benchmark Scores


AIME 2025: 90.0%

Arena Elo: 1362.0

GPQA: 79.2%

MMLU-Pro: 83.7%

Speed: 152.0 tok/s

SWE-bench Verified: 60.5%

Coverage

model release · NVIDIA

NVIDIA releases Nemotron-3-Super-120B, a 120B parameter model with latent MoE architecture

NVIDIA has released Nemotron-3-Super-120B-A12B-BF16, a 120 billion parameter model designed for text generation and conversational tasks. The model employs a latent mixture-of-experts (MoE) architecture and supports multiple languages including English, French, Spanish, Italian, German, Japanese, and Chinese.

March 11, 2026 · 11:50 PM · 1 min read

nvidia model-release text-generation