OpenAI Agents SDK adds native sandbox execution and governance controls for enterprise deployment
OpenAI has added native sandbox execution and governance controls to its Agents SDK, allowing enterprises to deploy AI agents with isolated compute environments and credential separation. The SDK now supports major cloud storage providers including AWS S3, Azure Blob Storage, Google Cloud Storage, and Cloudflare R2, with built-in integrations for sandbox providers like E2B, Modal, Blaxel, and Vercel.
OpenAI Agents SDK adds native sandbox execution and governance controls for enterprise deployment
OpenAI has released new capabilities for its Agents SDK that separate credential management from code execution environments and add native sandbox support through partnerships with providers including E2B, Modal, Blaxel, Cloudflare, Daytona, Runloop, and Vercel.
The SDK now includes a model-native harness with configurable memory, sandbox-aware orchestration, and filesystem tools. Developers can integrate tool use via Model Context Protocol (MCP), custom instructions via AGENTS.md files, and file edits using an apply patch tool.
Architecture changes
The updated SDK introduces a Manifest abstraction that standardizes workspace configuration, allowing teams to mount local files and define output directories. Direct connections to AWS S3, Azure Blob Storage, Google Cloud Storage, and Cloudflare R2 are supported.
The separation of the control harness from the compute layer means credentials remain isolated from environments where model-generated code executes. According to OpenAI, this architecture prevents injected malicious commands from accessing the central control plane or stealing API keys.
State persistence and cost reduction
The SDK implements snapshotting and rehydration capabilities. If a container crashes during a long-running task, the system can restore state in a fresh container and resume from the last checkpoint without restarting the entire process. OpenAI claims this reduces cloud compute spending by eliminating the need to re-run failed multi-step operations.
Production deployment example
Oscar Health tested the infrastructure to automate clinical records workflows. Rachael Burns, Staff Engineer and AI Tech Lead at Oscar Health, said the system now extracts metadata and identifies patient encounter boundaries in complex medical files that previous approaches could not handle reliably.
"The difference was not just extracting the right metadata, but correctly understanding the boundaries of each encounter in long, complex records," Burns said.
Availability and pricing
The new capabilities are generally available through OpenAI's API with standard token-based pricing and no custom procurement contracts required. The harness and sandbox features launch first for Python developers, with TypeScript support planned for a future release.
OpenAI states it will add code mode and subagents to both Python and TypeScript libraries, expand sandbox provider support, and offer additional integration methods for existing internal systems.
What this means
The credential isolation architecture addresses a genuine security concern for enterprises running AI agents in production: the risk of prompt injection attacks accessing sensitive credentials. The state persistence feature solves a real cost problem with long-running agent tasks. However, the actual reliability improvements and cost savings compared to existing solutions remain unverified beyond Oscar Health's reported experience. The SDK's success will depend on whether the standardized approach proves more maintainable than custom-built solutions across diverse enterprise environments.
Related Articles
OpenAI's Codex for Windows gains Computer Use and remote control from ChatGPT mobile apps
OpenAI has expanded its Codex desktop app to Windows with Computer Use capabilities and remote control from ChatGPT mobile apps. The features, previously Mac-only, allow Codex to operate Windows desktop applications autonomously and enable iPhone, iPad, and Android users to initiate and monitor Codex tasks on Windows devices.
Mistral AI Releases MCP Connectors in Studio with Direct Tool Calling and Human-in-the-Loop Workflows
Mistral AI has released Connectors in Studio, allowing developers to integrate custom MCP (Model Context Protocol) servers alongside built-in connectors for enterprise AI applications. The release includes direct tool calling, human-in-the-loop approval flows, and programmatic connector management via API and SDK.
Mistral AI launches Forge, enterprise platform for training custom models on proprietary data
Mistral AI has launched Forge, a platform for enterprises to train custom AI models on proprietary data including codebases, compliance policies, and operational records. Early partners include ASML, DSO National Laboratories Singapore, Ericsson, European Space Agency, and HTX Singapore.
AWS launches Amazon Bedrock Data Automation for financial document processing with custom blueprint system
Amazon Web Services released Amazon Bedrock Data Automation (BDA), a foundation model-powered service designed to extract and validate structured data from financial documents. The service uses custom blueprints to process bank statements, W-2 tax forms, 1099-B forms, and vendor contracts, offering what AWS claims is industry-leading accuracy at lower cost than using foundation models directly.
Comments
Loading...