AWS enables fine-tuning of Amazon Nova models for email extraction, achieving 94.77% accuracy with 50% cost reduction
AWS released guidance on fine-tuning Amazon Nova Micro and Nova Lite models for automated email data extraction using SageMaker AI. In collaboration with Parcel Perform, the fine-tuned Nova Micro achieved 94.77% extraction accuracy—a 16.6 percentage point improvement—while reducing inference costs by 50% and latency by 30% compared to previous models.
AWS enables fine-tuning of Amazon Nova models for email extraction, achieving 94.77% accuracy with 50% cost reduction
AWS published detailed guidance on fine-tuning Amazon Nova models for automated email data extraction, demonstrating significant improvements in accuracy and cost efficiency. The approach uses Amazon SageMaker AI with Parameter-Efficient Fine-Tuning (PEFT) through Low-Rank Adaptation (LoRA).
Production results from Parcel Perform
Parcel Perform, an AI delivery experience platform processing millions of emails daily, collaborated with AWS Generative AI Innovation Center to optimize Nova models for extracting structured data from diverse email formats. According to AWS, the fine-tuned Nova Micro model achieved 94.77% extraction accuracy on the testing dataset—an improvement of 16.6 percentage points over baseline performance.
The fine-tuned Nova Micro reduced inference latency by more than 30% and cut costs by 50% compared to Parcel Perform's previous model. The company has moved the solution into production for e-commerce logistics operations.
Technical implementation
The solution uses supervised fine-tuning with LoRA on Amazon SageMaker AI Training, then deploys the customized model to Amazon Bedrock with on-demand inference priced per token. AWS tested two training datasets: 1,300 samples and 4,900 samples to evaluate how data volume impacts performance.
Key training parameters include:
- Base models: amazon.nova-lite-v1:0:300k and amazon.nova-micro-v1:0:128k
- Context length: 8,192 tokens on g5/g6 instances, 32,768 tokens on p5 instances
- Global batch size: 64
- Training epochs: 2
- LoRA alpha: 32
- LoRAPlus learning rate ratio: 8.0
Training data must follow Amazon Bedrock's conversation schema format with email content as user input and extracted entities as assistant responses in JSON format.
Deployment options
With PEFT, models deploy to Amazon Bedrock using on-demand inference. Full-rank supervised fine-tuning supports deployment through either Provisioned Throughput on Amazon Bedrock or a SageMaker AI endpoint. The approach addresses common challenges including model hallucinations, confusion between similar data types (order numbers versus tracking numbers), and high token costs when processing HTML-formatted emails.
Requirements
Developers need an AWS account with permissions for Amazon Bedrock and SageMaker AI, an IAM service role for model customization, an S3 bucket for training data storage, training data in JSONL format, and sufficient Service Quotas for chosen instance types. AWS provides Amazon Nova recipes—YAML configuration files specifying base model names, training hyperparameters, and optimization settings.
What this means
This marks AWS's first detailed public guidance on fine-tuning its Nova model family for production use cases. The 94.77% accuracy with simultaneous 50% cost reduction and 30% latency improvement demonstrates that smaller, fine-tuned models can outperform larger base models on domain-specific tasks. The LoRA approach requires minimal training data (1,300-4,900 samples) compared to full fine-tuning, making specialized model development more accessible. For organizations processing high volumes of structured data extraction, this approach provides a clear path to improving accuracy while controlling inference costs.
Related Articles
AWS launches managed entitlements for Bedrock to distribute third-party model access across multi-account organizations
AWS has introduced managed entitlements for Amazon Bedrock, allowing organizations to subscribe to third-party models like Anthropic Claude and Cohere from a central account and distribute access across member accounts without requiring AWS Marketplace permissions. The feature uses AWS License Manager to create grants that share model entitlements with specific accounts or entire organizational units.
AWS demonstrates two-model pipeline using Nova 2 Lite and Claude Sonnet 4.6 that cuts document processing costs by 67%
AWS published a technical demonstration showing that pairing Amazon Nova 2 Lite with Anthropic's Claude Sonnet 4.6 reduces document processing costs by approximately two-thirds compared to single-model approaches. The two-stage pipeline processed 336 scanned yearbook pages at $0.0027 per page, producing 3,122 name-to-face associations with 93% scoring at or above 0.95 confidence.
AWS releases automated healthcare claims pipeline using Amazon Bedrock Data Automation and AgentCore
AWS has published a technical implementation guide for automating healthcare claims processing using Amazon Bedrock Data Automation and Amazon Bedrock AgentCore. The pipeline extracts data from CMS-1500 claim forms, validates against AWS HealthLake records, and generates FHIR-compliant claim resources with automated notifications.
Anthropic launches Claude Science desktop app with native access to 60+ scientific databases
Anthropic released Claude Science, a specialized desktop application for macOS and Linux that connects Claude models to scientific databases and compute infrastructure. The public beta app includes analysis specialists for genomics, single-cell biology, proteomics, and structural biology, with native connections to over 60 scientific databases.
Comments
Loading...