fine-tuning
9 articles tagged with fine-tuning
Mistral AI fine-tunes Pixtral-12B on satellite imagery, boosting classification accuracy from 56% to 91%
Mistral AI has published research showing that fine-tuning its Pixtral-12B vision language model on satellite imagery increases classification accuracy from 56% to 91% on the Aerial Image Dataset. Using Low-Rank Adaptation (LoRA) with 8,000 training samples across 30 scene categories, the company reduced hallucinations from 5% to 0.1% for under $10 in compute costs.
AWS launches hyperparameter optimization guide for Amazon Nova Forge custom model training
AWS has published a technical guide on hyperparameter optimization for Amazon Nova Forge, its platform for building custom frontier models from Amazon Nova checkpoints. The guide addresses three core challenges: catastrophic forgetting during domain specialization, learning rate calibration when mixing proprietary and curated training data, and baseline performance constraints for reinforcement fine-tuning.
Mistral AI fine-tunes Pixtral-12B on satellite imagery, boosting classification accuracy from 56% to 91%
Mistral AI reports that fine-tuning its Pixtral-12B vision model on satellite imagery increased classification accuracy from 56% to 91% on the Aerial Image Dataset. The company used LoRA (Low-Rank Adaptation) to train on 8,000 samples for under $10, reducing hallucinations from 5% to 0.1%.
NVIDIA releases LoRA/DoRA fine-tuning guide for Cosmos Predict 2.5 to generate synthetic robot training data
NVIDIA published a technical guide for parameter-efficient fine-tuning of its Cosmos Predict 2.5 world model using LoRA and DoRA adapters. The method allows teams to adapt the 2B-parameter model to robot manipulation tasks on a single 80GB GPU, generating synthetic training trajectories from just 92 demonstration videos.
AWS launches agent-guided workflows in SageMaker AI to automate model fine-tuning
Amazon Web Services has released agent-guided workflows in SageMaker AI that use AI coding agents to automate model customization. The feature includes nine pre-built skills covering use case definition, data preparation, fine-tuning technique selection (SFT, DPO, RLVR), evaluation, and deployment to Amazon Bedrock or SageMaker endpoints.
Amazon Bedrock now supports fine-tuning for Nova models with three customization approaches
Amazon Bedrock now enables fine-tuning of Amazon Nova models using supervised fine-tuning (SFT), reinforcement fine-tuning (RFT), and model distillation. The service automates infrastructure provisioning and training orchestration, requiring only data upload to S3 and a single API call. Fine-tuned models run on-demand at standard inference pricing without provisioned capacity requirements.
Amazon Bedrock adds reinforcement fine-tuning best practices for Nova and open source models
Amazon Bedrock now supports Reinforcement Fine-Tuning (RFT) for customizing Amazon Nova and open source models using reward signals instead of labeled datasets. AWS reports up to 66% accuracy improvements over base models with reduced customization complexity. The approach works best for tasks with verifiable correctness (code, math) or subjective evaluation (moderation, summarization).
Gemma 4 success hinges on tooling and fine-tuning ease, not benchmark scores
Google's Gemma 4 release marks a shift in open model strategy with Apache 2.0 licensing and competitive benchmarks, but real success depends on factors rarely measured: tooling stability, fine-tuning ease, and ecosystem adoption. The open model landscape is now crowded with alternatives like Qwen 3.5, Nemotron 3, and others—a maturation that changes what separates winners from the field.
Amazon Bedrock adds reinforcement fine-tuning with OpenAI-compatible APIs
Amazon Bedrock now enables reinforcement fine-tuning (RFT) across multiple model families including Amazon Nova, open-weight models like OpenAI's GPT-OSS 20B, and Qwen 3 32B. The service automates the end-to-end customization workflow using GRPO optimization, allowing models to learn from feedback on multiple responses rather than static training datasets, with support for OpenAI-compatible APIs.