Amazon Nova Micro Fine-Tuned Text-to-SQL Models Now Available on Bedrock On-Demand Inference at $0.80/Month for 22,000 Queries
AWS has enabled fine-tuned Amazon Nova Micro models to run on Bedrock's on-demand inference for text-to-SQL generation. According to AWS testing, a sample workload of 22,000 queries per month costs $0.80 monthly using the serverless approach, compared to higher costs with persistent model hosting. The solution uses LoRA fine-tuning on the sql-create-context dataset containing over 78,000 SQL examples.
Amazon Web Services has announced that fine-tuned Amazon Nova Micro models can now be deployed on Bedrock's on-demand inference infrastructure for custom text-to-SQL generation. According to AWS testing, a sample workload of 22,000 queries per month incurred costs of $0.80 monthly, compared to higher costs with persistent model hosting infrastructure.
The solution applies LoRA (Low-Rank Adaptation) fine-tuning to Nova Micro, enabling organizations to customize the model for proprietary SQL dialects and domain-specific database schemas while maintaining serverless, pay-per-token pricing.
Technical Implementation
AWS provides two implementation paths for fine-tuning Nova Micro:
Bedrock Model Customization: Fully managed fine-tuning through the AWS console or API, with training data uploaded to S3. AWS manages the underlying infrastructure, and the resulting custom model deploys at the same token-based pricing as the base Nova Micro, with no additional markup.
SageMaker AI Training Jobs: Provides granular control over hyperparameters and training infrastructure for organizations requiring customization beyond managed options.
Both approaches use the same data preparation pipeline and deploy to Bedrock for on-demand inference.
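Once deployed, the custom model is invoked like any other Bedrock model, using the deployment's ARN as the model ID. A minimal sketch of assembling a Converse API request for text-to-SQL, where the ARN, system prompt, and inference settings are illustrative placeholders rather than values from the article:

```python
# Hypothetical placeholder ARN; on-demand inference for a custom model
# uses the custom model deployment's ARN as the modelId.
CUSTOM_MODEL_ARN = "arn:aws:bedrock:us-east-1:123456789012:custom-model-deployment/example"

def build_sql_request(question: str, schema: str) -> dict:
    """Assemble keyword arguments for bedrock-runtime's converse() call."""
    return {
        "modelId": CUSTOM_MODEL_ARN,
        "system": [{"text": "You translate natural language questions into SQL."}],
        "messages": [
            {"role": "user",
             "content": [{"text": f"Schema: {schema}\nQuestion: {question}"}]},
        ],
        # Deterministic, short completions suit SQL generation.
        "inferenceConfig": {"maxTokens": 256, "temperature": 0.0},
    }

request = build_sql_request(
    "How many singers are there?",
    "CREATE TABLE singer (singer_id INT)",
)
# With boto3 installed and AWS credentials configured:
#   client = boto3.client("bedrock-runtime")
#   response = client.converse(**request)
#   sql = response["output"]["message"]["content"][0]["text"]
```

Because pricing is per token, this request costs nothing while idle; the same payload shape works against the base model by swapping the `modelId`.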
Training Configuration
The demonstration uses the sql-create-context dataset, combining WikiSQL and Spider datasets with over 78,000 examples of natural language questions paired with SQL queries. Training data is formatted as JSONL files with system prompts, user queries, and SQL responses.
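The article does not reproduce the exact record schema. As a sketch, assuming the converse-style JSONL layout (a `system` block plus alternating `user`/`assistant` messages with `text` parts) that Bedrock model customization accepts for Nova models, a formatter for sql-create-context examples might look like:

```python
import json

def to_training_record(question: str, schema: str, sql: str) -> str:
    """Format one sql-create-context example as a JSONL line.

    The field layout (system/messages with text parts) is an assumption
    modeled on Bedrock's converse-style customization format; verify it
    against the current AWS documentation before uploading to S3.
    """
    record = {
        "system": [{"text": "You translate natural language questions into SQL."}],
        "messages": [
            {"role": "user",
             "content": [{"text": f"Schema: {schema}\nQuestion: {question}"}]},
            {"role": "assistant", "content": [{"text": sql}]},
        ],
    }
    return json.dumps(record)

# One record per line; the assembled file is uploaded to S3 for training.
line = to_training_record(
    "How many singers are there?",
    "CREATE TABLE singer (singer_id INT)",
    "SELECT COUNT(*) FROM singer",
)
```

Including the `CREATE TABLE` context in the user turn mirrors how sql-create-context pairs each question with its schema, so the model learns to condition on table structure.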
Configurable hyperparameters for Nova Micro fine-tuning:
- Epochs: 1-5 (AWS used 5 in testing)
- Batch Size: Fixed at 1 for Nova Micro
- Learning Rate: 0.000001-0.0001 (AWS used 0.00001)
- Learning Rate Warmup Steps: 0-100 (AWS used 10)
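The ranges above can be encoded as a small validated builder for the customization job's hyperparameters. The key names (`epochCount`, `batchSize`, `learningRate`, `learningRateWarmupSteps`) are assumptions modeled on Bedrock's customization API and should be checked against the documentation; Bedrock expects all values as strings:

```python
def nova_micro_hyperparameters(epochs: int = 5,
                               learning_rate: float = 1e-5,
                               warmup_steps: int = 10) -> dict:
    """Build a hyperParameters dict for a Bedrock model-customization job.

    Defaults match the values AWS used in testing; ranges match the
    article. Key names are assumptions to verify against the AWS docs.
    """
    if not 1 <= epochs <= 5:
        raise ValueError("epochs must be in 1-5 for Nova Micro")
    if not 1e-6 <= learning_rate <= 1e-4:
        raise ValueError("learning rate must be in 0.000001-0.0001")
    if not 0 <= warmup_steps <= 100:
        raise ValueError("warmup steps must be in 0-100")
    return {
        "epochCount": str(epochs),
        "batchSize": "1",  # fixed at 1 for Nova Micro
        "learningRate": f"{learning_rate:.6f}",
        "learningRateWarmupSteps": str(warmup_steps),
    }

# Passed as hyperParameters= to bedrock.create_model_customization_job(...).
params = nova_micro_hyperparameters()
```

Validating ranges client-side fails fast instead of waiting for the managed job to reject an out-of-range value.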
Training completion time: approximately 2-3 hours according to AWS.
Infrastructure Requirements
Deployment requires:
- AWS account with billing enabled
- IAM permissions for Bedrock Nova Micro, SageMaker AI, and Bedrock Model Customization
- Quota for an ml.g5.48xlarge instance for SageMaker AI training
Amazon Bedrock automatically generates training and validation loss metrics, stored in S3. AWS reports that successful training shows both losses decreasing consistently and converging to comparable final values.
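That convergence pattern can be checked programmatically once the metrics are downloaded from S3. A minimal sketch, assuming hypothetical CSV column names (`step_number`, `training_loss`, `validation_loss`) that should be verified against the files Bedrock actually writes:

```python
import csv
import io

def losses_converged(csv_text: str, tolerance: float = 0.1) -> bool:
    """Return True if training and validation loss both decrease overall
    and end within `tolerance` of each other -- the pattern AWS describes
    for a successful run. Column names are assumptions.
    """
    rows = list(csv.DictReader(io.StringIO(csv_text)))
    train = [float(r["training_loss"]) for r in rows]
    val = [float(r["validation_loss"]) for r in rows]
    decreasing = train[-1] < train[0] and val[-1] < val[0]
    comparable = abs(train[-1] - val[-1]) <= tolerance
    return decreasing and comparable

# Illustrative metrics: both losses fall and converge, so the check passes.
metrics = """step_number,training_loss,validation_loss
1,2.10,2.25
50,0.60,0.72
100,0.31,0.35
"""
```

A validation loss that plateaus or climbs while training loss keeps falling would fail the `comparable` check, flagging likely overfitting.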
What This Means
The on-demand inference option removes the primary cost barrier to deploying fine-tuned models for specialized use cases. Organizations with variable text-to-SQL workloads can now customize models for proprietary SQL dialects without maintaining persistent infrastructure. The $0.80/month figure for 22,000 queries demonstrates viability for production workloads with intermittent usage patterns, though AWS does not disclose baseline costs for comparison, nor whether the figure covers only inference or total end-to-end expenses. The serverless approach trades higher per-query latency for zero idle costs, making it suitable for applications that do not require consistently sub-second response times.
Related Articles
AWS launches Automated Reasoning checks in Amazon Bedrock for mathematically verified AI compliance
AWS has released Automated Reasoning checks in Amazon Bedrock Guardrails, a feature that uses formal mathematical verification to validate AI outputs against defined rules. Unlike LLM-as-a-judge approaches that use one probabilistic model to validate another, Automated Reasoning provides mathematically proven, auditable compliance evidence for regulated industries.
Perplexity launches Personal Computer AI assistant for Mac with multi-agent orchestration
Perplexity released Personal Computer for Mac, an AI assistant that can control applications, manage files, and execute multi-step workflows across the desktop environment. The software is initially available to Max subscribers ($200/month) and employs multiple agents to complete tasks.
Cline v3.79.0 adds Claude Opus 4.7 support, Azure Blob Storage integration
Cline, the AI coding assistant, released version 3.79.0 on April 16, 2025, adding support for Anthropic's Claude Opus 4.7 model and Azure Blob Storage as a storage provider. The update also patches an action injection security vulnerability and fixes cache reflection issues.
Google connects Gemini chatbot to personal Google Photos for AI-generated images
Google announced Thursday that users can connect their Google Photos library to the Gemini chatbot for personalized image generation through its Nano Banana feature. Users must opt in to Personal Intelligence, and the feature will roll out to paid subscribers in the coming days.