AWS releases Nova Forge SDK data mixing guide to preserve general capabilities during fine-tuning

TL;DR

Amazon Web Services published a practical guide for fine-tuning Amazon Nova models using the Nova Forge SDK's data mixing capabilities. According to AWS, blending customer data with Amazon-curated datasets preserved near-baseline MMLU scores while delivering a 12-point F1 improvement on a Voice of Customer classification task spanning 1,420 leaf categories.

April 17, 2026 · 5:35 PM2 min read

AWS releases Nova Forge SDK data mixing guide to preserve general capabilities during fine-tuning

Amazon Web Services published a hands-on guide for fine-tuning Amazon Nova models using the Nova Forge SDK's data mixing capabilities, which allows developers to fine-tune on domain-specific data without losing general model capabilities.

Performance claims

According to AWS, blending customer data with Amazon-curated datasets preserved near-baseline MMLU scores while delivering a 12-point F1 improvement on a Voice of Customer classification task spanning 1,420 leaf categories. By contrast, AWS claims fine-tuning an open-source model on customer data alone caused a near-total loss of general capabilities.

Technical implementation

The guide covers a five-stage workflow: environment setup with Nova Forge SDK installation, data preparation with sanitization and validation, training configuration including SageMaker HyperPod runtime setup, model training using supervised fine-tuning with Low-Rank Adaptation (LoRA), and model evaluation against public benchmarks.

The SDK enforces token-level validation on training data to prevent conflicts with Nova's internal chat template. Special delimiters like System:, User:, and Assistant: must be sanitized before training to avoid corrupting the training signal.

Infrastructure requirements

The walkthrough uses 4 ml.p5.48xlarge GPU instances for both training and evaluation. AWS recommends starting with a short test run (max_steps=5) to validate configuration before committing to full training runs. Prerequisites include an AWS account with Amazon Nova Forge access, a provisioned SageMaker HyperPod cluster with GPU instances, an Amazon SageMaker MLflow application for experiment tracking, and appropriate IAM permissions.

Dataset example

The guide demonstrates the workflow using the MedReason dataset from Hugging Face, which contains approximately 32,700 medical question-answer pairs. The Nova Forge SDK supports JSONL, JSON, and CSV input formats and provides a JSONLDatasetLoader that converts raw data into the structured turn-based format Nova models expect during training.

What this means

Data mixing addresses a critical challenge in model fine-tuning: maintaining general capabilities while adapting to specific domains. AWS's 12-point F1 improvement claim suggests meaningful performance gains are possible without catastrophic forgetting. However, the requirement for expensive GPU infrastructure (ml.p5.48xlarge instances) and the proprietary nature of Amazon's curated datasets may limit adoption to larger organizations already invested in AWS infrastructure. The detailed sanitization requirements highlight the fragility of chat template-based training approaches.

Source: aws.amazon.com ↗

Amazon Nova AWS Fine-tuning Nova Forge SDK SageMaker LoRA Data Mixing HyperPod

product updateJuly 16, 2026

AWS launches Managed Knowledge Base for Bedrock with 6 enterprise connectors and automatic ACL enforcement

Amazon Web Services launched Managed Knowledge Base for Bedrock in general availability, offering a fully managed retrieval solution with six native enterprise connectors including SharePoint, Confluence, and Google Drive. The service handles document parsing up to 500 MB for PDFs, 2 GB for audio, and 10 GB for video, with real-time access control list verification at query time.

product updateJuly 16, 2026

xAI's Grok 4.3 now available on AWS Bedrock with 1M token context and configurable reasoning

xAI has made Grok 4.3 generally available on Amazon Bedrock, marking xAI's debut as a Bedrock model provider. The multimodal model offers a 1 million token context window, configurable reasoning effort (none/low/medium/high), and runs on Bedrock's Mantle inference engine using OpenAI-compatible APIs.

product updateJuly 16, 2026

AWS launches AgentCore platform for building voice AI agents with Amazon Nova 2 Sonic

AWS has released AgentCore, a new platform for hosting and running voice-based AI agents, integrated with Amazon Nova 2 Sonic for real-time speech capabilities. The platform uses the open Model Context Protocol (MCP) to connect agents to backend systems and deploys each conversation in isolated microVMs.

product updateJuly 14, 2026

AWS Extends QA Studio with Test Suites and CI/CD CLI for Automated Regression Testing

AWS has extended its QA Studio reference solution with test suite functionality and a command-line interface for CI/CD integration. The updates enable parallel execution of regression tests on Amazon ECS Fargate and bring Amazon Nova Act-powered visual testing into automated deployment pipelines.

AWS releases Nova Forge SDK data mixing guide to preserve general capabilities during fine-tuning

AWS releases Nova Forge SDK data mixing guide to preserve general capabilities during fine-tuning

Performance claims

Technical implementation

Infrastructure requirements

Dataset example

What this means

Related Articles

AWS launches Managed Knowledge Base for Bedrock with 6 enterprise connectors and automatic ACL enforcement

xAI's Grok 4.3 now available on AWS Bedrock with 1M token context and configurable reasoning

AWS launches AgentCore platform for building voice AI agents with Amazon Nova 2 Sonic

AWS Extends QA Studio with Test Suites and CI/CD CLI for Automated Regression Testing

Comments