content-moderation

5 articles tagged with content-moderation

March 26, 2026
product updateAmazon Web Services

Amazon Bedrock Guardrails now supports age-responsive, context-aware safety policies

Amazon has released a serverless architecture solution using Bedrock Guardrails that dynamically selects safety policies based on user age, role, and industry. The solution enforces five specialized guardrails—including COPPA-compliant child protection and healthcare-specific policies—at inference time to prevent prompt injection attacks and ensure context-appropriate responses.

March 23, 2026
model releaseNVIDIA

NVIDIA releases Nemotron 3 Content Safety 4B for multimodal, multilingual moderation

NVIDIA released Nemotron 3 Content Safety 4B, an open-source multimodal safety model designed to moderate content across text, images, and multiple languages. Built on Gemma-3 4B-IT with a 128K context window, the model achieved 84% average accuracy on multimodal safety benchmarks and supports over 140 languages through culturally-aware training data.

March 17, 2026
product updateOpenAI

OpenAI's adult mode will allow erotic text but blocks explicit image, audio, and video generation

OpenAI confirmed its forthcoming "adult mode" will permit text-based erotic conversations in ChatGPT but explicitly block generation of pornographic images, audio, and video. The feature, first announced by CEO Sam Altman in October 2024, has been delayed multiple times—most recently in March 2025—as the company grapples with safety concerns including a 12% error rate in age verification systems.

March 16, 2026
product updateOpenAI

OpenAI delays adult mode launch, will limit to text-based erotica only

OpenAI has delayed its planned "adult mode" for ChatGPT, originally announced for this quarter. The feature will support text-based adult conversations only—not images, voice, or video—due to internal concerns about child safety and technical challenges with age verification systems that misclassify minors as adults about 12% of the time.

March 14, 2026
product updateAmazon Web Services

Amazon's Alexa+ adds 'Sassy' personality for adults with explicit language but content guardrails

Amazon announced a new "Sassy" personality for Alexa+ on Thursday, marketed toward adult users and protected by additional security checks including Face ID on iOS. The personality uses explicit language and wit but explicitly excludes sexual content, hate speech, and harmful material.