aws
17 articles tagged with aws
Amazon QuickSight launches autonomous AI agents that work continuously in background
Amazon has launched autonomous agents in QuickSight (branded as Quick) that execute tasks continuously in the background while users attend meetings or focus on other work. The update includes 16 new data source integrations, an activity feed that consolidates communications across tools, and cross-system query capabilities that join data from multiple sources in real time.
AWS releases open-source test harness for evaluating Amazon Nova Sonic voice agents at scale
Amazon has released an open-source testing framework for Nova Sonic voice agents that automates multi-turn conversation evaluation without requiring human testers. The harness uses LLM-as-judge techniques to assess voice agents across six metrics including goal achievement, response accuracy, and tool usage, addressing a critical QA bottleneck in voice AI development.
AWS launches hyperparameter optimization guide for Amazon Nova Forge custom model training
AWS has published a technical guide on hyperparameter optimization for Amazon Nova Forge, its platform for building custom frontier models from Amazon Nova checkpoints. The guide addresses three core challenges: catastrophic forgetting during domain specialization, learning rate calibration when mixing proprietary and curated training data, and baseline performance constraints for reinforcement fine-tuning.
AWS adds Policy Engine and Lambda interceptors to Bedrock AgentCore gateway for agent security controls
Amazon Web Services launched Policy Engine and Lambda interceptors for Bedrock AgentCore gateway, enabling enterprises to control which tools AI agents can access and validate requests dynamically. The Policy Engine uses Cedar declarative policy language for deterministic access decisions, while Lambda interceptors run custom code before or after each tool call for validation, token exchange, and response filtering.
AWS launches dataset management in Bedrock AgentCore for versioned agent test suites
Amazon Web Services introduced dataset management in Bedrock AgentCore, enabling developers to build versioned test suites with immutable baselines for agent evaluation. The feature supports predefined scenarios with ground truth assertions and user simulation scenarios where LLM-backed actors conduct multi-turn conversations.
Anthropic's Claude Opus 4.8 launches on AWS Bedrock in four regions
Anthropic's Claude Opus 4.8 is now available on Amazon Bedrock and Claude Platform on AWS. The model is designed for autonomous multi-stage tasks, agentic coding, and long-running workflows with reduced supervision.
AWS deploys AgentCore orchestration layer across 20+ sales agents, cutting latency 41% and saving reps 2 hours weekly
AWS has deployed Amazon Bedrock AgentCore to orchestrate more than 20 specialized AI agents across its global sales organization through a unified interface called Field Advisor. The system has processed over 120,000 prompts since launch, delivering a 41% latency reduction compared to previous infrastructure and saving large-scale sales reps up to 2 hours per week on CRM tasks.
AWS launches real-time voice agent framework combining Stream Vision Agents with Nova 2 Sonic
Amazon has released Stream's Vision Agents, an open-source Python framework for building real-time voice AI agents that integrates with Amazon Nova 2 Sonic through Bedrock. The system delivers end-to-end latency under 500 milliseconds using Stream's global edge network with sub-30ms audio latency and typically sub-500ms join times.
Anthropic's Mythos bug-hunting model accessed by unauthorized users, early tests show performance on par with human rese
Anthropic confirmed unauthorized users accessed its Mythos vulnerability detection model through a third-party vendor environment by guessing URL patterns. Early analysis from Mozilla and AWS indicates Mythos performs on par with elite human security researchers rather than surpassing them, despite Anthropic's claims of identifying thousands of critical vulnerabilities.
Amazon Bedrock AgentCore now supports stateful MCP with user input, LLM sampling, and progress streaming
Amazon has introduced stateful MCP client capabilities on Bedrock AgentCore Runtime, enabling agents to pause mid-execution for user input, request LLM-generated content, and stream real-time progress updates. The update transforms one-way tool execution into bidirectional conversations between MCP servers and clients, supporting interactive workflows previously impossible with stateless implementations.
Amazon Bedrock adds reinforcement fine-tuning best practices for Nova and open source models
Amazon Bedrock now supports Reinforcement Fine-Tuning (RFT) for customizing Amazon Nova and open source models using reward signals instead of labeled datasets. AWS reports up to 66% accuracy improvements over base models with reduced customization complexity. The approach works best for tasks with verifiable correctness (code, math) or subjective evaluation (moderation, summarization).
Amazon Bedrock Guardrails now supports age-responsive, context-aware safety policies
Amazon has released a serverless architecture solution using Bedrock Guardrails that dynamically selects safety policies based on user age, role, and industry. The solution enforces five specialized guardrails—including COPPA-compliant child protection and healthcare-specific policies—at inference time to prevent prompt injection attacks and ensure context-appropriate responses.
Amazon Polly adds bidirectional streaming API for real-time speech synthesis in conversational AI
Amazon has released a new Bidirectional Streaming API for Amazon Polly that enables simultaneous text input and audio output over a single HTTP/2 connection. The API reduces end-to-end latency by 39% compared to traditional request-response TTS by allowing text to be sent word-by-word as LLMs generate tokens, rather than waiting for complete sentences. The feature is available in Java, JavaScript, .NET, C++, Go, Kotlin, PHP, Ruby, Rust, and Swift SDKs.
Amazon Bedrock adds three video analysis workflows for multimodal understanding at scale
Amazon Bedrock has introduced three distinct video analysis workflows that leverage multimodal foundation models to extract insights from video content at scale. The approaches—frame-based, shot-based, and multimodal embedding—are designed for different use cases and cost-performance trade-offs, with open-source reference implementations available on GitHub.
AWS adds Claude tool use to Bedrock for custom entity extraction from documents
Amazon Web Services has integrated Claude's tool use (function calling) capability into Bedrock, enabling serverless document processing for custom entity recognition. The solution uses Claude 3.5 Sonnet to extract structured data like names, dates, and addresses from driver's licenses and other documents without traditional model training.
NVIDIA Nemotron 3 Nano now available on Amazon Bedrock as serverless model
Amazon Bedrock now offers NVIDIA's Nemotron 3 Nano as a fully managed serverless model, expanding its Nemotron portfolio alongside previously available Nemotron 2 Nano 9B and Nemotron 2 Nano VL 12B variants. The addition enables developers to deploy NVIDIA's smallest inference-optimized model without managing infrastructure.
Anthropic Claude models now available in India via Amazon Bedrock with cross-region inference
Amazon Bedrock now enables access to Anthropic Claude models in India with global cross-region inference support. The service allows developers to build generative AI applications with Claude variants across AWS regions.