model releaseNVIDIA

NVIDIA Releases Nemotron 3.5 Content Safety: 4B-Parameter Multimodal Model with Custom Policy Enforcement and 140-Langua

TL;DR

NVIDIA has released Nemotron 3.5 Content Safety, a 4B-parameter model built on Google Gemma 3 4B IT that provides multimodal safety classification across approximately 140 languages. The model includes a 128K context window, custom enterprise policy enforcement, auditable reasoning traces, and is releasing its training dataset.

June 4, 2026 · 7:06 PM3 min read

Nemotron 3.5 Content Safety — Quick Specs

Context window128K tokens

Compare Nemotron 3.5 Content Safety with other models →

NVIDIA Releases Nemotron 3.5 Content Safety: 4B-Parameter Multimodal Model with Custom Policy Enforcement and 140-Language Coverage

Core Specifications

Nemotron 3.5 Content Safety uses a LoRA adapter fine-tuned on Google's Gemma 3 4B IT base model. The model processes text, images, and combined inputs in a single inference call and runs on GPUs with 8GB+ VRAM. NVIDIA has not disclosed pricing for API access.

The model provides explicit training coverage for 12 languages—English, French, Spanish, German, Chinese, Japanese, Korean, Arabic, Hindi, Russian, Portuguese, and Italian—while inheriting zero-shot generalization across approximately 140 languages from the Gemma 3 base model.

Custom Policy Enforcement

The primary architectural addition in version 3.5 is custom policy enforcement. According to NVIDIA, the model accepts custom policy specifications alongside input and reasons over those policies when producing verdicts, rather than relying solely on its built-in taxonomy.

This addresses enterprise deployments where different applications require different risk profiles. For example, a healthcare platform's safety requirements differ from those of a developer tools IDE or children's education app. The model can suppress irrelevant categories or inject proprietary risk categories specific to organizational policies.

Three Output Modes

Nemotron 3.5 operates in three modes:

Low-latency binary verdict: Returns safe/unsafe labels for user input and assistant responses
Binary verdict with categories: Adds violated category labels from the 13-category Aegis 2.0 taxonomy
THINK mode: Provides step-by-step reasoning traces before final verdicts

The THINK mode outputs auditable reasoning traces that document why specific verdicts were reached, which NVIDIA states is necessary for compliance logging and human review in regulated industries. When latency is critical, THINK mode can be disabled for faster binary verdicts.

Multimodal Integration

The model evaluates user prompts, images, and assistant responses as a single unified context rather than scoring each independently. According to NVIDIA, this approach catches policy violations that only emerge from interactions between text and images or between requests and responses.

The safety taxonomy follows the Aegis 2.0 framework with 13 core categories aligned with the MLCommons safety taxonomy, plus 10 fine-grained subcategories.

Dataset Release

NVIDIA is releasing the Nemotron 3.5 Content Safety Dataset, which includes multimodal and multilingual training data with safety reasoning traces. The reasoning traces were generated using a two-step process: first using larger models like Qwen 397B to generate chain-of-thought reasoning with ground-truth labels, then condensing those traces using Qwen 80B to fit within three sentences for efficiency.

NVIDIA states that most open-source safety models do not provide training or evaluation sets, and the problem is more severe for multimodal datasets where licensing restrictions often apply to image and video artifacts.

What This Means

Nemotron 3.5 represents NVIDIA's attempt to consolidate multiple safety capabilities—multimodal input, multilingual coverage, custom policies, and explainable reasoning—into a single 4B-parameter model that can run on modest GPU hardware. The custom policy enforcement feature addresses a genuine enterprise need: most production AI systems cannot operate under universal safety rules and require domain-specific risk profiles.

The dataset release is significant for reproducibility in AI safety research, though NVIDIA has not yet disclosed benchmark scores comparing Nemotron 3.5 to competing safety classifiers like OpenAI's Moderation API or Anthropic's content filtering. The model's practical utility will depend on how its accuracy compares to existing solutions and whether the reasoning traces genuinely improve human review efficiency in production deployments.

Source: huggingface.co ↗

NVIDIA content moderation safety multimodal multilingual Gemma open source dataset release

model releaseJuly 14, 2026

Google releases Gemma 4 E2B, optimized to run natively on Pixel 10's Tensor G5 TPU

Google has released Gemma 4 E2B for TPU, a variant of its open-source Gemma 4 model optimized to run natively on the Tensor G5 chip in Pixel 10 devices. The multimodal model enables completely offline AI chat, image recognition, and audio transcription on Pixel 10, 10 Pro, 10 Pro XL, and 10 Pro Fold.

model releaseJuly 18, 2026

Moonshot AI releases Kimi K3 open source model, claims frontier-level performance

Chinese company Moonshot AI released Kimi K3, an open source model that the company claims demonstrates frontier-level performance while trailing only Claude Fable 5 and GPT 5.6 Sol. Independent analyses from Arena.ai and Vals AI suggest the model is competitive with flagship frontier models, reigniting debate about Chinese AI capabilities and open source model development.

product updateJuly 17, 2026

NVIDIA NeMo Automodel integrates with Hugging Face Diffusers for distributed video and image model fine-tuning

NVIDIA and Hugging Face have integrated NeMo Automodel with the Diffusers library, enabling distributed fine-tuning of video and image diffusion models without checkpoint conversion. The integration supports models including FLUX.1-dev (12B), Wan 2.1 (1.3B/14B), and HunyuanVideo (13B) with full fine-tuning and LoRA options.

benchmarkJuly 16, 2026

NVIDIA Nemotron 3 Embed 8B Tops RTEB Leaderboard with 78.5% Score, 1B Variant Cuts Error Rate 27%

NVIDIA's Nemotron-3-Embed-8B-BF16 ranks #1 on the RTEB leaderboard with a 78.5% score, while the 1B variant reduces error rate by 27% over its predecessor. The open-weight models feature 32k context windows and production-ready deployment options including a Blackwell-optimized NVFP4 variant.