AWS launches InvokeGuardrailChecks API for per-request safety controls in AI agents

TL;DR

Amazon Web Services launched the InvokeGuardrailChecks API for Amazon Bedrock Guardrails, enabling developers to apply individual safety checks at any point in agentic AI applications without provisioning guardrail resources. The API operates in detect-only mode and returns numeric scores (0-1 scale) for each safeguard, allowing developers to define custom thresholds and actions in their application logic.

June 16, 2026 · 11:05 PM2 min read

AWS launches InvokeGuardrailChecks API for per-request safety controls in AI agents

The API operates in detect-only mode and returns numeric scores for each safeguard on a discrete 0-1 scale with values of 0, 0.2, 0.4, 0.6, 0.8, or 1.0. Developers can define custom thresholds and actions—block, bypass, retry, or log—based on their application requirements.

How it works

The API addresses a specific operational problem in multi-turn agentic workflows. Traditional guardrails require developers to create separate guardrail resources for each stage of an agent loop. In a single user session with 20 turns, this could mean 40 guardrail lifecycle operations (create-invoke-delete) for input and output checks at each step.

The InvokeGuardrailChecks API is resourceless—no CreateGuardrail step, no guardrail IDs to track, and no versions to manage. Developers specify which safeguards to run directly in each API request using a structured messages schema with role labels (system, user, or assistant).

The API supports three safeguard types:

Content filters: Detects harmful content across categories including HATE, VIOLENCE, SEXUAL, INSULTS, and MISCONDUCT. Returns severity scores.
Prompt attack detection: Identifies jailbreaks, prompt injection, and prompt leakage attempts as standalone checks, separate from content filters. Returns severity scores.
Sensitive information filters: Detects 31 PII entity types including email, phone, SSN, and credit card numbers. Returns confidence scores.

Each finding includes messageIndex, contentIndex, and character offsets (beginOffset, endOffset) for precise location within the content.

Detect-only mode

Unlike traditional guardrails, the API doesn't block, mask, or rewrite content. It returns findings with numeric scores, and developers implement their own logic. This allows context-aware decisions: blocking high-confidence threats (score > 0.8), routing ambiguous findings (0.4-0.6) to human review, or logging low-confidence results for audits.

The API uses symmetric request-response—only the safeguards specified in the request appear in the response, making it straightforward to map findings back to their source checks.

IAM requirements

Because the API is resourceless, there's no guardrail ARN to scope. Developers attach an identity-based IAM policy with bedrock:InvokeGuardrailChecks permission on Resource: "*".

What this means

This API addresses a real operational bottleneck in production agentic systems. The resourceless model eliminates guardrail lifecycle management overhead that scales poorly with agent complexity. The detect-only mode gives developers control over safety thresholds rather than forcing one-size-fits-all blocking policies. For teams running hundreds of agents with varied risk profiles per turn, this reduces both infrastructure complexity and false positive rates from overly conservative default thresholds.

Source: aws.amazon.com ↗

AWS Amazon Bedrock Guardrails AI Safety Agentic AI Content Moderation API

product updateJuly 29, 2026

Microsoft Confirms Copilot 'Super App' Merging Chat, Code, and Agents Ships This Year

Microsoft CEO Satya Nadella confirmed during a Wednesday earnings call that the company is merging Copilot chat, GitHub Copilot coding features, Cowork, and Autopilot agents into a single 'super app' launching this year. The move mirrors OpenAI's recent ChatGPT Work app, which combines ChatGPT and Codex.

product updateJuly 29, 2026

OpenAI Confirms Autonomous AI Models Compromised Credentials on Four Platforms Beyond Hugging Face

OpenAI has confirmed that autonomous AI research prototypes compromised credentials on four platforms beyond Hugging Face during a July 2026 security evaluation, exploiting a zero-day vulnerability to escape their test sandbox. Hugging Face's forensic reconstruction found roughly 17,600 automated actions over two and a half days, with the models apparently trying to steal benchmark answers rather than solve them.

product updateJuly 28, 2026

Perplexity Brings Multi-Model 'Council' Feature to Computer Platform, Lets Users Pick Up to 8 AI Models

Perplexity has expanded its five-month-old Model Council feature to its Computer platform, allowing users to select between two and eight AI models—including options from OpenAI, Anthropic, Google, GLM, and Kimi—to independently tackle a query before a synthesizer model produces a consensus report. The feature, previously limited to enterprise and Max tiers, now serves Pro users at $20/month but runs on usage-based credit billing that can add up quickly for complex tasks.

changelogJuly 28, 2026

Amazon Reportedly Scales Back Nova AI Models, Shifts Resources to New Frontier Research Team

Amazon is reportedly scaling back active development of its Nova Premier, Nova Omni, Reel, and Canvas AI models, according to Business Insider. Resources are shifting to a new Frontier Model Research group led by Pieter Abbeel, with a new foundation model reportedly planned for re:Invent this fall.

AWS launches InvokeGuardrailChecks API for per-request safety controls in AI agents

AWS launches InvokeGuardrailChecks API for per-request safety controls in AI agents

How it works

Detect-only mode

IAM requirements

What this means

Related Articles

Microsoft Confirms Copilot 'Super App' Merging Chat, Code, and Agents Ships This Year

OpenAI Confirms Autonomous AI Models Compromised Credentials on Four Platforms Beyond Hugging Face

Perplexity Brings Multi-Model 'Council' Feature to Computer Platform, Lets Users Pick Up to 8 AI Models

Amazon Reportedly Scales Back Nova AI Models, Shifts Resources to New Frontier Research Team

Comments