product updateAmazon Web Services

AWS launches InvokeGuardrailChecks API for per-request safety controls in AI agents

TL;DR

Amazon Web Services launched the InvokeGuardrailChecks API for Amazon Bedrock Guardrails, enabling developers to apply individual safety checks at any point in agentic AI applications without provisioning guardrail resources. The API operates in detect-only mode and returns numeric scores (0-1 scale) for each safeguard, allowing developers to define custom thresholds and actions in their application logic.

2 min read
0

AWS launches InvokeGuardrailChecks API for per-request safety controls in AI agents

Amazon Web Services launched the InvokeGuardrailChecks API for Amazon Bedrock Guardrails, enabling developers to apply individual safety checks at any point in agentic AI applications without provisioning guardrail resources.

The API operates in detect-only mode and returns numeric scores for each safeguard on a discrete 0-1 scale with values of 0, 0.2, 0.4, 0.6, 0.8, or 1.0. Developers can define custom thresholds and actions—block, bypass, retry, or log—based on their application requirements.

How it works

The API addresses a specific operational problem in multi-turn agentic workflows. Traditional guardrails require developers to create separate guardrail resources for each stage of an agent loop. In a single user session with 20 turns, this could mean 40 guardrail lifecycle operations (create-invoke-delete) for input and output checks at each step.

The InvokeGuardrailChecks API is resourceless—no CreateGuardrail step, no guardrail IDs to track, and no versions to manage. Developers specify which safeguards to run directly in each API request using a structured messages schema with role labels (system, user, or assistant).

The API supports three safeguard types:

  • Content filters: Detects harmful content across categories including HATE, VIOLENCE, SEXUAL, INSULTS, and MISCONDUCT. Returns severity scores.
  • Prompt attack detection: Identifies jailbreaks, prompt injection, and prompt leakage attempts as standalone checks, separate from content filters. Returns severity scores.
  • Sensitive information filters: Detects 31 PII entity types including email, phone, SSN, and credit card numbers. Returns confidence scores.

Each finding includes messageIndex, contentIndex, and character offsets (beginOffset, endOffset) for precise location within the content.

Detect-only mode

Unlike traditional guardrails, the API doesn't block, mask, or rewrite content. It returns findings with numeric scores, and developers implement their own logic. This allows context-aware decisions: blocking high-confidence threats (score > 0.8), routing ambiguous findings (0.4-0.6) to human review, or logging low-confidence results for audits.

The API uses symmetric request-response—only the safeguards specified in the request appear in the response, making it straightforward to map findings back to their source checks.

IAM requirements

Because the API is resourceless, there's no guardrail ARN to scope. Developers attach an identity-based IAM policy with bedrock:InvokeGuardrailChecks permission on Resource: "*".

What this means

This API addresses a real operational bottleneck in production agentic systems. The resourceless model eliminates guardrail lifecycle management overhead that scales poorly with agent complexity. The detect-only mode gives developers control over safety thresholds rather than forcing one-size-fits-all blocking policies. For teams running hundreds of agents with varied risk profiles per turn, this reduces both infrastructure complexity and false positive rates from overly conservative default thresholds.

Related Articles

product update

AWS Launches Deep Agents Framework with Bedrock AgentCore for Isolated Multi-Agent Workflows

AWS has launched LangChain Deep Agents, an orchestration framework that spawns isolated subagents using Amazon Bedrock AgentCore infrastructure. AgentCore provides dedicated browser MicroVMs running Chromium via Playwright and full Python environments with pre-installed data science libraries, enabling parallel agent execution without context window competition.

product update

Meta launches 'AI Mode' search on Facebook to surface answers from public posts

Meta launched AI Mode on Facebook, a search feature that uses Meta AI to generate answers from public posts, Groups, and Reels instead of displaying traditional search results. The feature allows users to ask questions in plain language and receive synthesized responses based on platform discussions.

product update

AWS launches Neuron Agentic Development for automated Trainium kernel optimization

AWS announced Neuron Agentic Development, a collection of AI agents that automate kernel optimization for Trainium and Inferentia chips. The toolkit includes five specialized skills that handle kernel writing, debugging, profiling, and analysis, accessible through coding agents in Kiro and Claude.

product update

Meta launches AI Mode search on Facebook, sourcing answers from public posts across platforms

Meta is launching AI Mode as a new search option on Facebook, powered by its Muse Spark model. The feature generates answers by pulling from publicly-posted content across Meta's platforms including Facebook, Instagram, and Threads, similar to how Google has integrated Reddit content into its search results.

Comments

Loading...