AWS

41 articles tagged with AWS

July 1, 2026

product update · NVIDIA

AWS brings NVIDIA Nemotron and OpenAI GPT OSS models to GovCloud for secure government AI workloads

Amazon Bedrock now supports NVIDIA Nemotron and OpenAI GPT OSS models in AWS GovCloud (US) Regions. The launch includes OpenAI's GPT OSS models (120B and 20B parameters, 128K context) and the NVIDIA Nemotron 3 family (9B to 120B parameters, 1M context), giving government agencies AI inference on U.S. soil that complies with FedRAMP High and DoD SRG Level 5.

July 1, 2026 · 6:21 PM

product update · Amazon Web Services

AWS adds metadata filtering to AgentCore Memory, improving agent retrieval accuracy from 40% to 64%

Amazon has added metadata filtering to its AgentCore Memory service for AI agents. In AWS evaluations across 151 questions, the feature improved overall question-answering accuracy from 40% to 64%, with context-dependent questions jumping from 16% to 69% accuracy. The update allows agents to filter memory retrieval by attributes like priority, department, or time range before semantic search runs.
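
The filter-then-search order described above can be illustrated with a small, self-contained sketch. It does not use the AgentCore Memory API; the `MemoryRecord` class, the `retrieve` function, and the toy two-dimensional embeddings are all hypothetical, chosen only to show why narrowing by metadata first can change which record wins the similarity ranking.

```python
import math
from dataclasses import dataclass, field


@dataclass
class MemoryRecord:
    # Hypothetical record shape, not the AgentCore Memory schema.
    text: str
    embedding: list[float]
    metadata: dict[str, str] = field(default_factory=dict)


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def retrieve(records, query_embedding, filters, top_k=1):
    # Step 1: keep only records whose metadata matches every filter.
    candidates = [
        r for r in records
        if all(r.metadata.get(key) == value for key, value in filters.items())
    ]
    # Step 2: rank the remaining records by similarity to the query.
    ranked = sorted(
        candidates,
        key=lambda r: cosine(r.embedding, query_embedding),
        reverse=True,
    )
    return ranked[:top_k]


records = [
    MemoryRecord("Finance budget is frozen", [1.0, 0.0], {"department": "finance"}),
    MemoryRecord("Engineering budget approved", [0.9, 0.1], {"department": "engineering"}),
]

unfiltered = retrieve(records, [1.0, 0.0], {})
filtered = retrieve(records, [1.0, 0.0], {"department": "engineering"})
```

Without a filter, the finance record ranks first on similarity alone; with the `department` filter it is excluded before ranking, so the engineering record is returned.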

July 1, 2026 · 6:20 PM

June 29, 2026

product update · Amazon Web Services

AWS releases automated healthcare claims pipeline using Amazon Bedrock Data Automation and AgentCore

AWS has published a technical implementation guide for automating healthcare claims processing using Amazon Bedrock Data Automation and Amazon Bedrock AgentCore. The pipeline extracts data from CMS-1500 claim forms, validates against AWS HealthLake records, and generates FHIR-compliant claim resources with automated notifications.

June 29, 2026 · 5:50 PM

June 19, 2026

product update · Amazon Web Services

AWS launches Web Search on Amazon Bedrock AgentCore with tens of billions of documents, no external API required

Amazon Web Services launched Web Search on Amazon Bedrock AgentCore, a fully managed web search capability that gives AI agents access to tens of billions of documents without requiring external search APIs. The service, now generally available, runs entirely within AWS infrastructure and refreshes its index within minutes of new content appearing online.

June 19, 2026 · 2:35 PM

June 18, 2026

product update · Amazon Web Services

AWS Releases AgentCore Harness for Production AI Agents with Two-API Setup

Amazon Web Services made its AgentCore harness generally available, reducing production AI agent deployment to two API calls: CreateHarness and InvokeHarness. The managed service handles sandboxed execution, memory, tool integration, and observability, eliminating infrastructure setup for teams building LLM agents.

June 18, 2026 · 5:50 PM

June 17, 2026

product update · Amazon Web Services

AWS launches Bedrock AgentCore with managed knowledge base, web search, and payment infrastructure for AI agents

Amazon Web Services has released new capabilities for Bedrock AgentCore, its platform for building AI agents. The update includes a managed knowledge base that handles vector storage and retrieval across enterprise data sources, native web search using Amazon's Alexa infrastructure, and a payment system enabling agents to access paid content and APIs.

June 17, 2026 · 3:35 PM

June 16, 2026

product update · Amazon Web Services

AWS launches InvokeGuardrailChecks API for per-request safety controls in AI agents

Amazon Web Services launched the InvokeGuardrailChecks API for Amazon Bedrock Guardrails, enabling developers to apply individual safety checks at any point in agentic AI applications without provisioning guardrail resources. The API operates in detect-only mode and returns numeric scores (0-1 scale) for each safeguard, allowing developers to define custom thresholds and actions in their application logic.
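
Because the API only detects and scores, the decision about what to do with a score lives in application code. The following is a minimal sketch of that thresholding step, and it does not call the API. The safeguard names, the shape of the `scores` dictionary, and the threshold values are assumptions made for illustration, not the service's actual response format.

```python
# Hypothetical per-safeguard thresholds on the 0-1 scale.
# The keys are illustrative names, not identifiers from the API.
THRESHOLDS = {"prompt_attack": 0.5, "hate": 0.7}


def decide(scores: dict[str, float], thresholds: dict[str, float]) -> str:
    """Return "block" if any score meets or exceeds its threshold, else "allow"."""
    for name, score in scores.items():
        limit = thresholds.get(name)
        # Safeguards without a configured threshold are ignored.
        if limit is not None and score >= limit:
            return "block"
    return "allow"


risky = decide({"prompt_attack": 0.9, "hate": 0.1}, THRESHOLDS)
benign = decide({"prompt_attack": 0.2, "hate": 0.1}, THRESHOLDS)
```

An application could return richer actions than "block" or "allow" (for example, routing to human review), which is the flexibility a detect-only mode leaves open.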

June 16, 2026 · 11:05 PM

June 15, 2026

model release · Google DeepMind

Amazon Bedrock adds Gemma 4 models with 256K context and built-in reasoning mode

Amazon Web Services today announced availability of Google DeepMind's Gemma 4 family on Amazon Bedrock. The open-weight models include three instruction-tuned variants spanning 2.3B to 30.7B parameters, with 256K context windows, multimodal input support, and built-in reasoning mode.

June 15, 2026 · 8:35 PM

product update

AWS Launches Deep Agents Framework with Bedrock AgentCore for Isolated Multi-Agent Workflows

AWS has launched LangChain Deep Agents, an orchestration framework that spawns isolated subagents using Amazon Bedrock AgentCore infrastructure. AgentCore provides dedicated browser MicroVMs running Chromium via Playwright and full Python environments with pre-installed data science libraries, enabling parallel agent execution without context window competition.

June 15, 2026 · 2:05 PM

June 10, 2026

product update

AWS launches Neuron Agentic Development for automated Trainium kernel optimization

AWS announced Neuron Agentic Development, a collection of AI agents that automate kernel optimization for Trainium and Inferentia chips. The toolkit includes five specialized skills that handle kernel writing, debugging, profiling, and analysis, accessible through coding agents in Kiro and Claude.

June 10, 2026 · 3:35 PM

June 8, 2026

product update · Amazon Web Services

AWS Launches AgentCore Runtime for Persistent Coding Agent Sessions That Don't Die When Laptops Close

Amazon Web Services has launched AgentCore Runtime on Bedrock, providing dedicated Linux microVMs with persistent 14-day storage for coding agents. The service eliminates the need to keep laptops open during agent sessions and supports parallel execution of Claude Code, Codex, Kiro, OpenCode, and other coding agents with isolated environments.

June 8, 2026 · 4:50 PM

June 4, 2026

model release · NVIDIA

NVIDIA Nemotron 3 Ultra launches on AWS SageMaker with 550B parameters, 1M token context window

NVIDIA Nemotron 3 Ultra is now available on Amazon SageMaker JumpStart with 550 billion total parameters and 55 billion active parameters. The model features a hybrid Transformer-Mamba Mixture-of-Experts architecture and supports context windows up to 1 million tokens, targeting agentic AI workloads.

June 4, 2026 · 5:06 PM

June 1, 2026

product update · OpenAI

OpenAI GPT-5.5 and GPT-5.4 Launch on Amazon Bedrock at Parity Pricing

OpenAI's GPT-5.5 and GPT-5.4 models are now generally available on Amazon Bedrock, with pricing matching OpenAI's first-party rates. Codex, OpenAI's coding agent used by 5 million developers weekly, is also available with pay-per-token pricing and no seat licenses.

June 1, 2026 · 9:50 PM

May 27, 2026

product update · Amazon Web Services

AWS launches Amazon Bedrock Data Automation for financial document processing with custom blueprint system

Amazon Web Services released Amazon Bedrock Data Automation (BDA), a foundation model-powered service designed to extract and validate structured data from financial documents. The service uses custom blueprints to process bank statements, W-2 tax forms, 1099-B forms, and vendor contracts, offering what AWS claims is industry-leading accuracy at lower cost than using foundation models directly.

May 27, 2026 · 9:35 PM

May 21, 2026

product update · Amazon Web Services

Amazon Nova Act Becomes HIPAA Eligible for Healthcare Workflows

Amazon Nova Act, AWS's browser-based AI agent service, is now HIPAA eligible, allowing healthcare organizations to deploy autonomous agents for workflows involving electronic protected health information (ePHI). The service automates repetitive browser tasks including claims processing, referral coordination, and prior authorization.

May 21, 2026 · 10:35 PM

product update · Amazon Web Services

AWS Launches Amazon Bedrock AgentCore for Deploying Production AI Agents

AWS has launched Amazon Bedrock AgentCore, a serverless runtime environment for deploying production AI agents. Turkish fulfillment company OPLOG demonstrated the platform's capabilities by building three business intelligence agents using Anthropic's Claude Sonnet, achieving a 35% reduction in sales cycles and 98% reduction in manual research time.

May 21, 2026 · 4:20 PM

May 20, 2026

product update

AWS releases four multimodal evaluators for image-to-text AI tasks in Strands Evals SDK

AWS has added four multimodal evaluators to its Strands Evals SDK that judge image-to-text AI outputs by directly analyzing source images. The evaluators—Overall Quality, Correctness, Faithfulness, and Instruction Following—use multimodal large language models to detect visual hallucinations, factual errors, and instruction violations that text-only judges miss.

May 20, 2026 · 6:20 PM

product update · Amazon Web Services

AWS SageMaker AI adds bidirectional streaming for real-time speech transcription with vLLM

Amazon SageMaker AI has launched bidirectional streaming support for real-time inference, enabling WebSocket-based voice applications through vLLM integration. The feature uses HTTP/2 on port 8443 to bridge client connections with vLLM's Realtime API, allowing audio to stream in while transcription streams back simultaneously over a single persistent connection.

May 20, 2026 · 5:20 PM

May 19, 2026

product update · Amazon Web Services

AWS launches Nova Sonic voice agent framework with AgentCore Runtime and three integration patterns

AWS released Amazon Nova Sonic, a speech-to-speech foundation model for voice agents, alongside AgentCore Runtime, a serverless hosting environment with WebSocket streaming and microVM isolation. The framework supports three integration patterns: direct tool calls via AgentCore Gateway using Model Context Protocol (MCP), sub-agent delegation with Agent-to-Agent (A2A) protocol, and session segmentation for multi-step workflows.

May 19, 2026 · 3:36 PM

May 14, 2026

product update · Amazon Web Services

AWS Bedrock AgentCore adds Chrome enterprise policy support with 450+ browser settings

Amazon Bedrock AgentCore Browser now supports Chrome enterprise policies and custom root CA certificates, giving organizations control over 450+ browser settings for AI agents. The feature enables URL filtering, download restrictions, password manager controls, and connectivity to internal services through custom certificate authorities.

May 14, 2026 · 5:35 PM

May 13, 2026

product update · Amazon Web Services

AWS Launches WebRTC Integration for Amazon Nova Sonic Real-Time Voice Streaming

AWS has integrated WebRTC protocol support with Amazon Nova Sonic, its speech-to-speech model, through Amazon Kinesis Video Streams. The integration delivers real-time voice streaming with sub-second latency and includes adaptive bitrate control, forward error correction, and Voice Activity Detection for mobile and IoT applications.

May 13, 2026 · 6:05 PM

May 11, 2026

product update · Anthropic

AWS launches Claude Platform on AWS, bringing Anthropic's native APIs and features directly to AWS accounts

AWS announced general availability of Claude Platform on AWS, enabling direct access to Anthropic's native APIs, tools, and console through existing AWS accounts. The service includes the Messages API, Claude Managed Agents, web search, MCP connector, and code execution, authenticated via AWS IAM and billed through AWS Marketplace.

May 11, 2026 · 6:50 PM

May 7, 2026

product update · Amazon Web Services

AWS launches Amazon Bedrock AgentCore Payments with Coinbase and Stripe for autonomous agent transactions

AWS announced Amazon Bedrock AgentCore Payments (preview), enabling AI agents to autonomously discover and pay for APIs, web content, MCP servers, and other agents. Built with Coinbase and Stripe, the service supports micropayments through the x402 protocol with per-session spending limits and full transaction observability.

May 7, 2026 · 1:05 PM

May 5, 2026

product update · Amazon Web Services

AWS Adds OS-Level Control to Bedrock AgentCore Browser for Native UI Automation

AWS announced OS Level Actions for Amazon Bedrock AgentCore Browser, extending agent automation beyond the browser's Document Object Model to interact with native operating system UI. The feature enables agents to control system dialogs, security prompts, and context menus through direct mouse and keyboard commands at the OS level.

May 5, 2026 · 5:05 PM

May 4, 2026

product update · Amazon AWS

AWS Launches AgentCore Optimization: Automated Performance Loop for Production AI Agents

Amazon Web Services released AgentCore Optimization in preview, introducing an automated performance loop that generates configuration recommendations from production traces, validates them through batch evaluation and A/B testing, and enables continuous agent optimization. The system targets the quality drift problem where AI agents degrade as models evolve and user behavior shifts.

May 4, 2026 · 5:21 PM

product update · Amazon Web Services

AWS launches agent-guided workflows in SageMaker AI to automate model fine-tuning

Amazon Web Services has released agent-guided workflows in SageMaker AI that use AI coding agents to automate model customization. The feature includes nine pre-built skills covering use case definition, data preparation, fine-tuning technique selection (SFT, DPO, RLVR), evaluation, and deployment to Amazon Bedrock or SageMaker endpoints.

May 4, 2026 · 5:20 PM

product update

AWS SageMaker adds automatic instance fallback to prevent GPU capacity failures

Amazon SageMaker AI now supports capacity-aware instance pools that automatically try alternative GPU instance types when primary choices lack capacity. The feature works across endpoint creation, autoscaling, and scale-in operations, eliminating the manual retry cycles that previously left endpoints stuck in failed states.
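
The fallback behavior can be pictured as walking an ordered preference list until one entry has capacity. The sketch below is only a conceptual illustration, not SageMaker's implementation or API: `first_with_capacity` and the `has_capacity` callback are hypothetical, and the instance type names are example values.

```python
from typing import Callable, Optional, Sequence


def first_with_capacity(
    instance_types: Sequence[str],
    has_capacity: Callable[[str], bool],
) -> Optional[str]:
    """Return the first instance type in preference order that has capacity."""
    for instance_type in instance_types:
        if has_capacity(instance_type):
            return instance_type
    # No candidate had capacity; the caller decides how to handle this.
    return None


# Example preference order, most preferred first (illustrative values).
preferences = ["ml.p5.48xlarge", "ml.g6.12xlarge", "ml.g5.12xlarge"]

# Stand-in for a real capacity check: pretend only one type is available.
available = {"ml.g6.12xlarge"}
chosen = first_with_capacity(preferences, lambda t: t in available)
```

Here the first choice is unavailable, so the second entry in the list is selected without a manual retry.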

May 4, 2026 · 4:20 PM

April 30, 2026

product update · Amazon Web Services

Amazon Q Developer IDE plugins to be discontinued April 30, 2027 as AWS shifts to Kiro

AWS announced that Amazon Q Developer IDE plugins and paid subscriptions will reach end of support on April 30, 2027, with new account creation blocked starting May 15, 2026. The company is transitioning users to Kiro, a new agentic development environment built for spec-driven development.

April 30, 2026 · 8:35 PM

April 29, 2026

product update · Amazon Web Services

AWS Launches Serverless MCP Proxy on Bedrock AgentCore Runtime for Custom Agent Controls

AWS has released support for custom Model Context Protocol (MCP) proxies on Amazon Bedrock AgentCore Runtime, allowing organizations to implement custom governance and security controls on AI agent tool interactions without modifying upstream MCP servers. The serverless proxy runs on AgentCore Runtime with automatic scaling and built-in observability through CloudWatch and OpenTelemetry.

April 29, 2026 · 12:05 PM

April 28, 2026

product update · Amazon Web Services

AWS Bedrock adds OpenAI models, Codex, and managed agents service following revised Microsoft agreement

AWS has added OpenAI's latest models, Codex, and a new managed agents service to its Bedrock platform, one day after OpenAI revised its agreement with Microsoft. The integration follows OpenAI's up-to-$50 billion deal with Amazon.

April 28, 2026 · 8:05 PM

product update · Amazon Web Services

Amazon Nova 2 Sonic Unifies Speech Recognition, Reasoning, and TTS in Single Streaming Model

Amazon Web Services released technical guidance for migrating text agents to voice assistants using Amazon Nova 2 Sonic, a native speech-to-speech model that combines automatic speech recognition, reasoning, tool calling, and text-to-speech in a single bidirectional streaming interface. The model supports asynchronous tool calling and built-in voice activity detection for handling interruptions.

April 28, 2026 · 6:06 PM

model release · NVIDIA

NVIDIA Nemotron 3 Nano Omni: 30B-parameter multimodal model launches on AWS SageMaker with 131K token context

NVIDIA has launched Nemotron 3 Nano Omni on Amazon SageMaker JumpStart, a multimodal model with 30 billion total parameters (3 billion active) that processes video, audio, images, and text in a single inference pass. The model features a 131K token context window and uses a Mamba2 Transformer Hybrid MoE architecture combining three specialized encoders.

April 28, 2026 · 4:51 PM

April 21, 2026

product update · Anthropic

Anthropic's Claude Cowork now runs on Amazon Bedrock with consumption-based pricing

Anthropic announced Claude Cowork is now available on Amazon Bedrock, allowing organizations to deploy the desktop AI assistant through their AWS infrastructure with consumption-based pricing. Unlike Claude Enterprise, pricing flows through existing AWS agreements with no per-seat licensing from Anthropic.

April 21, 2026 · 7:21 PM

April 17, 2026

product update · Amazon Web Services

AWS Reduces Video Search Routing Cost 95% Using Nova Premier-to-Micro Model Distillation

Amazon Web Services released a model distillation pipeline on Amazon Bedrock that transfers video search routing intelligence from Nova Premier to Nova Micro. According to AWS, the approach reduces inference cost by over 95% and latency by 50% compared to using Claude Haiku for intent routing.

April 17, 2026 · 7:51 PM

product update · Amazon Web Services

Amazon Launches Nova Multimodal Embeddings for Video Semantic Search Across Visual, Audio, and Text Signals

Amazon released Nova Multimodal Embeddings on Amazon Bedrock, a unified embedding model that processes text, documents, images, video, and audio into a shared 1024-dimensional semantic vector space. The model supports up to 30 seconds of video per embedding and enables semantic search across all modalities simultaneously without converting video to text first.

April 17, 2026 · 7:50 PM

product update · Amazon Web Services

AWS releases Nova Forge SDK data mixing guide to preserve general capabilities during fine-tuning

Amazon Web Services published a practical guide for fine-tuning Amazon Nova models using the Nova Forge SDK's data mixing capabilities. According to AWS, blending customer data with Amazon-curated datasets preserved near-baseline MMLU scores while delivering a 12-point F1 improvement on a Voice of Customer classification task spanning 1,420 leaf categories.

April 17, 2026 · 5:35 PM

April 16, 2026

product update · Amazon Web Services

Amazon Nova Micro Fine-Tuned Text-to-SQL Models Now Available on Bedrock On-Demand Inference at $0.80/Month for 22,000 Q

AWS has enabled fine-tuned Amazon Nova Micro models to run on Bedrock's on-demand inference for text-to-SQL generation. According to AWS testing, a sample workload of 22,000 queries per month costs $0.80 monthly using the serverless approach, compared to higher costs with persistent model hosting. The solution uses LoRA fine-tuning on the sql-create-context dataset containing over 78,000 SQL examples.
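
To put the quoted figures on a per-query basis, the arithmetic is a single division. The snippet uses only the two numbers reported above ($0.80 per month, 22,000 queries per month); the variable names are illustrative.

```python
# Figures taken from the AWS sample workload quoted above.
monthly_cost_usd = 0.80
queries_per_month = 22_000

# Unit cost per query and per thousand queries.
cost_per_query = monthly_cost_usd / queries_per_month
cost_per_thousand = cost_per_query * 1_000

print(f"Per query: ${cost_per_query:.6f}")
print(f"Per 1,000 queries: ${cost_per_thousand:.4f}")
```

That works out to roughly 3.6 cents per thousand queries for this sample workload.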

April 16, 2026 · 5:51 PM

product update · Amazon Web Services

AWS launches Automated Reasoning checks in Amazon Bedrock for mathematically verified AI compliance

AWS has released Automated Reasoning checks in Amazon Bedrock Guardrails, a feature that uses formal mathematical verification to validate AI outputs against defined rules. Unlike LLM-as-a-judge approaches that use one probabilistic model to validate another, Automated Reasoning provides mathematically proven, auditable compliance evidence for regulated industries.

April 16, 2026 · 5:50 PM

April 13, 2026

product update

AWS Lambda enables serverless reward functions for Amazon Nova model customization

AWS has introduced Lambda-based reward functions for Amazon Nova model customization through reinforcement fine-tuning (RFT). The serverless architecture automatically scales from 10 concurrent evaluations per second during experimentation to 400+ during production training, supporting both objective RLVR and subjective RLAIF approaches.
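
For the objective (RLVR) case, a reward function can be as simple as checking a model response against a known reference answer. The handler below is a rough sketch under stated assumptions: `lambda_handler(event, context)` is the standard Lambda entry point for Python, but the event and return shapes shown here (`samples`, `response`, `reference`, `rewards`) are hypothetical and are not taken from the Nova customization documentation.

```python
def lambda_handler(event, context):
    """Score each sample 1.0 on an exact match with its reference, else 0.0.

    Assumed (hypothetical) event shape:
        {"samples": [{"id": "...", "response": "...", "reference": "..."}]}
    """
    results = []
    for sample in event.get("samples", []):
        # Ignore surrounding whitespace so formatting noise is not penalized.
        matched = sample["response"].strip() == sample["reference"].strip()
        results.append({"id": sample["id"], "reward": 1.0 if matched else 0.0})
    return {"rewards": results}


# Local invocation with a made-up event; no AWS resources are needed.
output = lambda_handler(
    {
        "samples": [
            {"id": "a", "response": " 42 ", "reference": "42"},
            {"id": "b", "response": "41", "reference": "42"},
        ]
    },
    None,
)
```

A subjective (RLAIF) variant would replace the exact-match check with a call to a judge model, which is where the serverless scaling described above matters most.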

April 13, 2026 · 4:20 PM

April 8, 2026

product update · Amazon Web Services

Amazon Bedrock now supports fine-tuning for Nova models with three customization approaches

Amazon Bedrock now enables fine-tuning of Amazon Nova models using supervised fine-tuning (SFT), reinforcement fine-tuning (RFT), and model distillation. The service automates infrastructure provisioning and training orchestration, requiring only data upload to S3 and a single API call. Fine-tuned models run on-demand at standard inference pricing without provisioned capacity requirements.

April 8, 2026 · 8:05 PM

April 2, 2026

product update · Amazon Web Services

AWS Bedrock AgentCore adds persistent filesystem storage and shell command execution

Amazon Bedrock AgentCore Runtime now offers managed session storage to persist agent filesystem state across stop/resume cycles and InvokeAgentRuntimeCommand for executing shell commands directly within agent microVMs. The features address two core challenges in production agent workflows: ephemeral filesystems that reset between sessions and the inability to execute deterministic operations without routing them through LLMs.

April 2, 2026 · 1:05 PM
