RLVR

2 articles tagged with RLVR

May 4, 2026

AWS launches agent-guided workflows in SageMaker AI to automate model fine-tuning

Amazon Web Services has released agent-guided workflows in SageMaker AI that use AI coding agents to automate model customization. The feature includes nine pre-built skills covering use case definition, data preparation, fine-tuning technique selection (SFT, DPO, RLVR), evaluation, and deployment to Amazon Bedrock or SageMaker endpoints.

May 4, 2026 · 5:20 PM

April 13, 2026

product update

AWS Lambda enables serverless reward functions for Amazon Nova model customization

AWS has introduced Lambda-based reward functions for Amazon Nova model customization through reinforcement fine-tuning (RFT). The serverless architecture automatically scales from 10 concurrent evaluations per second during experimentation to 400+ during production training, supporting both objective RLVR and subjective RLAIF approaches.

April 13, 2026 · 4:20 PM

← Back to all news