Nvidia to spend $26B on open-weight AI models, filing reveals

TL;DR

Nvidia will invest $26 billion over the next five years to build open-weight AI models, according to a 2025 financial filing confirmed by executives. The move signals a strategic shift from chipmaker to AI frontier lab, with the company releasing Nemotron 3 Super (128B parameters) and claiming it outperforms GPT-OSS on multiple benchmarks.

March 14, 2026 · 7:08 PM3 min read

NVIDIA Nemotron 3 Super — Quick Specs

Context window256K tokens

Compare NVIDIA Nemotron 3 Super with other models →

Nvidia to Spend $26 Billion on Open-Weight AI Models

Nvidia will invest $26 billion over the next five years to develop open-source artificial intelligence models, according to a 2025 financial filing. The company's executives confirmed the investment in interviews with WIRED, marking a significant strategic pivot toward competing directly with frontier labs like OpenAI and DeepSeek.

What This Means for Nvidia's Strategy

The investment signals Nvidia's evolution from a chipmaker into a full AI research organization. By releasing open-weight models—where parameters and architectural details are publicly available—Nvidia gains several advantages: the models are optimized for Nvidia hardware, creating a built-in market advantage, while the company simultaneously addresses competition from Chinese open models gaining adoption among startups and researchers globally.

This approach differs sharply from OpenAI and Anthropic, which restrict their best models to cloud access. Meta, which pioneered open models with Llama in 2023, has signaled it may not continue releasing future models openly. Meanwhile, Chinese companies including DeepSeek, Alibaba, Moonshot AI, and MiniMax have released powerful open models freely, attracting Western developers and researchers.

Nemotron 3 Super Details

Nvidia released Nemotron 3 Super on Wednesday, its most capable open-weight model to date. The model features 128 billion parameters, comparable in scale to OpenAI's GPT-OSS but claims superior performance across benchmarks.

Benchmark performance:

AI Index score: 37 (GPT-OSS scored 33)
PinchBench rank: #1 (a new benchmark measuring model control of OpenClaw)
Nvidia has also completed pretraining of a 550-billion-parameter model

The company implemented several technical innovations in Nemotron 3 to improve reasoning abilities, long-context handling, and responsiveness to reinforcement learning.

Strategic Context

Nvidia's hardware remains the gold standard for training large language models, with customers spending billions on the company's chips. However, the rise of efficient open models from China—particularly DeepSeek's January 2025 release demonstrating cheaper training methods—threatens Nvidia's dominance. DeepSeek's rumored next model may have been trained exclusively on sanctioned Huawei chips, potentially shifting developer preference.

By providing a robust, well-maintained open alternative tuned to Nvidia hardware, the company aims to retain ecosystem loyalty while positioning itself as supporting "openness" in AI development.

Nvidia VP Bryan Catanzaro stated: "It's in our interest to help the ecosystem develop." Kari Briski, VP of generative AI software, added that open models help Nvidia test and refine its datacenter infrastructure and hardware roadmaps.

Industry Response

AI researchers including Nathan Lambert (Allen Institute for AI) and Andy Konwinski (Laude Institute) view Nvidia's investment as strategically significant. Konwinski noted Nvidia's unique position "at the nexus of AI research" makes this commitment to openness "an unprecedented signal."

Industry observers warn that if open innovation continues shifting to China, it could disadvantage the US long-term. Nvidia frames its investment as providing an American alternative to Chinese open models while maintaining global ecosystem diversity.

What This Means

Nvidia is making a calculated bet: by dominating open-weight model development, it locks in hardware dependence while defending against Chinese model adoption. This converts Nvidia's chip advantage into a research advantage. Success requires Nemotron models matching or exceeding Chinese alternatives in capability and usability—a challenge given Alibaba's Qwen and DeepSeek's momentum with Western developers. The $26B commitment signals Nvidia sees this as essential infrastructure defense.

Source: wired.com ↗

nvidia open-weight-models funding ai-research nemotron model-release frontier-models open-source-ai

model releaseMarch 23, 2026

NVIDIA releases Nemotron-3-Nano-4B, a 4B parameter model for edge AI with 262K context window

NVIDIA released Nemotron-3-Nano-4B-GGUF on March 16, 2026, a 4-billion parameter small language model (SLM) designed for edge deployment on devices like Jetson Thor and GeForce RTX. The model features a hybrid Mamba-2 and Transformer architecture with a 262K token context window and supports both reasoning and non-reasoning modes via system prompts.

product updateMarch 12, 2026

Nvidia to spend $26B on open-weight AI models, targeting Chinese competition and developer lock-in

An SEC filing reveals Nvidia plans to spend $26 billion on open-weight AI models over the next five years. The investment targets the open-source gap left by OpenAI, Meta, and Anthropic while countering the rise of Chinese open-source models and deepening developer dependence on Nvidia hardware.

model releaseApril 29, 2026

NVIDIA Releases Nemotron 3 Nano Omni: 31B Multimodal Model With 256K Context and Reasoning Mode

NVIDIA released Nemotron 3 Nano Omni, a 31B parameter (30B active, 3B per token) multimodal model supporting video, audio, image, and text inputs. The model features a 256K token context window, reasoning mode with chain-of-thought, and tool calling capabilities.