product updateAnthropic

AWS adds Claude tool use to Bedrock for custom entity extraction from documents

TL;DR

Amazon Web Services has integrated Claude's tool use (function calling) capability into Bedrock, enabling serverless document processing for custom entity recognition. The solution uses Claude 3.5 Sonnet to extract structured data like names, dates, and addresses from driver's licenses and other documents without traditional model training.

2 min read
0

AWS Adds Claude Tool Use to Bedrock for Custom Entity Extraction

Amazon Web Services has published a production-ready implementation guide for using Anthropic's Claude model with tool use (function calling) capabilities in Amazon Bedrock to extract structured data from unstructured documents.

What's New

The solution demonstrates how to build a serverless document processing pipeline that extracts custom fields from documents like driver's licenses without requiring model training or fine-tuning. Claude evaluates prompts and automatically determines whether to invoke predefined tools to complete extraction tasks.

Technical Implementation

The architecture combines AWS services into an automated workflow:

  1. Document Upload: Users upload documents to Amazon S3
  2. Event Triggering: S3 PUT event automatically triggers an AWS Lambda function
  3. Model Invocation: Lambda sends the document to Claude via Amazon Bedrock's invoke-model API
  4. Entity Extraction: Claude processes the image and extracts fields using defined tool schemas
  5. Monitoring: Results are logged in Amazon CloudWatch

The implementation uses Claude 3.5 Sonnet (global.anthropic.claude-sonnet-4-5-20250929-v1:0) with the tool_choice: auto parameter, allowing Claude to decide whether tool invocation is necessary. For deterministic behavior in production, developers can hardcode specific tool selection.

Key Technical Details

Tool Definition: Developers define tools with JSON schemas specifying field names, types, and requirements. The example extracts driver's license fields including first name, last name, issue date, license number, and address components.

Lambda Configuration: The guide recommends setting Lambda timeout to 30 seconds minimum (1-2 minutes for high-resolution images) to accommodate Claude's processing latency. Default 3-second timeouts will cause failures.

Image Processing: Documents are base64-encoded and sent with text prompts to Claude. The model returns structured JSON matching the defined tool schema.

IAM Permissions: Required permissions include bedrock:InvokeModel for Claude access and s3:GetObject for document retrieval.

Capabilities

The tool use approach supports:

  • Dynamic field extraction without retraining
  • Multiple document types (driver's licenses, insurance cards, ID badges, business forms)
  • Scalable, serverless processing
  • Real-time extraction with no batch processing required
  • Flexible schema definition based on use case

What This Means

This integration reduces barrier to entry for document automation. Previously, extracting data from images required either rule-based OCR systems, fine-tuned models, or manual setup. Claude's tool use with vision capabilities removes the training requirement—developers can define extraction schemas in minutes and deploy production pipelines immediately. The serverless architecture eliminates infrastructure management while AWS Lambda's automatic scaling handles variable document volumes. Organizations processing hundreds or thousands of documents can now automate extraction without building custom ML pipelines.

Related Articles

product update

AWS Releases AgentCore Harness for Production AI Agents with Two-API Setup

Amazon Web Services made its AgentCore harness generally available, reducing production AI agent deployment to two API calls: CreateHarness and InvokeHarness. The managed service handles sandboxed execution, memory, tool integration, and observability, eliminating infrastructure setup for teams building LLM agents.

product update

AWS launches Web Search on Amazon Bedrock AgentCore with tens of billions of documents, no external API required

Amazon Web Services launched Web Search on Amazon Bedrock AgentCore, a fully managed web search capability that gives AI agents access to tens of billions of documents without requiring external search APIs. The service, now generally available, runs entirely within AWS infrastructure and refreshes its index within minutes of new content appearing online.

analysis

US export controls force Anthropic to take Claude Fable 5 offline indefinitely

The US government imposed export controls on Anthropic's newly released Claude Fable 5 and underlying Mythos models on Friday, restricting access even for foreign nationals working at Anthropic in the United States. Anthropic took both models completely offline rather than risk non-compliance, leaving Fable unavailable to all users as of this writing.

product update

Google expands Gemini Android overlay menu with six new tools accessible without opening app

Google has expanded the Gemini overlay plus menu on Android to include six tools: Videos, Music, Canvas, and Guided Learning join the existing Images and Personal Intelligence options. The update, rolling out in Google app version 17.32, allows users to access most Gemini features from anywhere on Android without opening the full app.

Comments

Loading...