api

25 articles tagged with api

June 18, 2026
product updateMistral AI

Mistral AI launches Connectors in Studio with MCP protocol integration and direct tool calling

Mistral AI has released Connectors in Studio, allowing developers to integrate custom MCP (Model Context Protocol) servers and built-in connectors via API/SDK. The release includes direct tool calling for deterministic workflows and human-in-the-loop approval flows for sensitive operations.

model releaseMistral AI

Mistral OCR 3 launches at $2 per 1,000 pages with 74% win rate over previous version

Mistral AI released Mistral OCR 3, a document extraction model priced at $2 per 1,000 pages ($1 with Batch API discount). The model achieves a 74% overall win rate over its predecessor on forms, scanned documents, complex tables, and handwriting according to internal benchmarks.

product updateMistral AI

Mistral Launches Agents API with Code Execution, Web Search, and MCP Tool Integration

Mistral AI has released its Agents API, a framework for building AI agents with built-in connectors for code execution, web search, image generation, and Model Context Protocol tools. The API includes persistent conversation memory and multi-agent orchestration capabilities, enabling agents to maintain context across interactions and coordinate complex workflows.

product updateMistral AI

Mistral Launches OCR API at $1 Per 1,000 Pages, Claims 94.89% Accuracy on Document Benchmarks

Mistral AI has released Mistral OCR, an API for extracting text and images from documents at $1 per 1,000 pages (approximately $0.50 with batch inference). The company claims 94.89% overall accuracy on its internal test set, comparing favorably to GPT-4o (89.77%), Gemini 2.0 Flash (88.69%), and Azure OCR (89.52%).

June 9, 2026
product update

Google launches Gemini 3.5 Live Translate with continuous speech-to-speech in 70+ languages

Google announced Gemini 3.5 Live Translate, a speech-to-speech translation model supporting over 70 languages with continuous audio generation. The model rolls out today to Google Translate on Android and iOS, with Google Meet integration coming in private preview this month for select Workspace customers.

May 28, 2026
changelogAnthropic

Anthropic Launches Claude Opus 4.8 Fast Mode at 2x Price for Higher Output Speed

Anthropic has released a fast-mode variant of Claude Opus 4.8 that delivers higher output speed at double the pricing of the standard version. The model offers identical capabilities to regular Opus 4.8 with input pricing at $10 per million tokens and output at $50 per million tokens.

model releaseMistral AI

Mistral AI Releases Voxtral: Apache 2.0 Speech Models with 32K Token Context at $0.001/Minute

Mistral AI released Voxtral, a family of open-source speech understanding models available in 24B and 3B parameter variants under Apache 2.0 license. The models support up to 32K token context (30 minutes of audio for transcription, 40 minutes for understanding) and are priced at $0.001 per minute via API—less than half the cost of comparable proprietary systems according to Mistral.

product updateMistral AI

Mistral Releases OCR API at $1 per 1,000 Pages, Claims 94.89% Accuracy on Document Benchmarks

Mistral AI has released an OCR API priced at $1 per 1,000 pages with batch inference costs approximately half that rate. The company claims 94.89% overall accuracy on internal benchmarks, ahead of GPT-4o (89.77%), Gemini 2.0 Flash (88.69%), and Azure OCR (89.52%). The model processes up to 2,000 pages per minute on a single node.

May 27, 2026
changelogOpenAI

OpenAI investigating elevated latency across ChatGPT and API

OpenAI confirmed it is investigating elevated latency issues affecting both ChatGPT and its API as of May 27, 2026. The company is monitoring a separate issue for FEDRamp users, though a fix has been applied to that specific problem.

May 21, 2026
changelogAnthropic

Anthropic Python SDK v0.104.0 adds thinking token count estimates for streaming responses

Anthropic released version 0.104.0 of its Python SDK on May 21, 2026. The update adds support for a thinking-token-count beta feature that provides estimated token counts in thinking block deltas when streaming responses from reasoning models.

May 19, 2026
model release

Google releases Gemini 3.5 Flash with 4x faster output and agentic capabilities, 3.5 Pro coming June

Google released Gemini 3.5 Flash today with 4x faster output token generation than competing frontier models while surpassing Gemini 3.1 Pro on coding, agentic, and multimodal benchmarks. The company announced Gemini 3.5 Pro will launch next month and introduced Gemini Omni, a new multimodal series that outputs video.

May 6, 2026
changelogAnthropic

Anthropic doubles Claude Code usage limits for paid users, increases API capacity by up to 1500%

Anthropic has doubled Claude Code's five-hour usage limits for Pro, Max, Team, and Enterprise users while removing peak hour restrictions for Pro and Max plans. The company also increased API limits by up to 1500% for input tokens per minute through a compute capacity deal with SpaceX's Colossus 1 data center.

changelogAnthropic

Anthropic doubles Claude Code rate limits, secures 220,000 Nvidia GPUs via SpaceX Colossus 1 deal

Anthropic doubled Claude Code's five-hour rate limits across Pro, Max, Team, and Enterprise plans effective Tuesday, removing peak-hours throttling for Pro and Max users. The capacity expansion comes from an exclusive agreement securing all compute at SpaceX's Colossus 1 data center, which provides over 300 megawatts and more than 220,000 Nvidia GPUs.

April 27, 2026
model releaseOpenAI

OpenAI Launches GPT Mini Latest with 400,000 Token Context Window

OpenAI released GPT Mini Latest on April 27, 2025, featuring a 400,000 token context window. The model automatically redirects to the latest version in the OpenAI GPT Mini family, allowing developers to stay current without manual updates.

April 21, 2026
product updateOpenrouter

OpenRouter Launches Pareto Code Router with Dynamic Model Selection Based on Quality Threshold

OpenRouter has released Pareto Code Router, a dynamic routing system that automatically selects from a curated list of coding models based on a user-defined quality threshold. Users set a min_coding_score between 0 and 1, and the router selects an appropriate model from its shortlist without requiring commitment to a specific model.

April 16, 2026
changelogAnthropic

Anthropic removes bundled tokens from enterprise seats, shifts to metered billing

Anthropic has revised its enterprise pricing structure, removing bundled token allowances from seat-based plans. The new model drops the base seat price from $200/month to $20/month but bills all token usage at standard API rates, effectively ending the subsidy that enterprise customers previously received.

April 15, 2026
model release

Google releases Gemini 3.1 Flash TTS with prompt-directed voice control

Google released Gemini 3.1 Flash TTS, a text-to-speech model that accepts detailed prompts to control voice characteristics, speaking style, accent, and delivery. The model is available through the standard Gemini API using the model ID 'gemini-3.1-flash-tts-preview'.

April 9, 2026
product updateAnthropic

Anthropic launches Claude Managed Agents for autonomous AI agents in public beta

Anthropic has launched Claude Managed Agents as a public beta, providing developers with a hosted platform to build and run autonomous AI agents without managing their own infrastructure. The service costs $0.08 per session hour on top of standard token pricing and is currently available exclusively on Anthropic's infrastructure.

April 8, 2026
changelogAnthropic

Anthropic Python SDK v0.92.0 adds Claude Managed Agents support

Anthropic released version 0.92.0 of its Python SDK on April 8, 2026, introducing support for Claude Managed Agents. The update enables developers to integrate agent capabilities directly through the SDK.

April 7, 2026
changelogAnthropic

Anthropic Python SDK v0.90.0 adds Claude Mythos preview support

Anthropic released version 0.90.0 of its Python SDK on April 7, 2026, adding support for the claude-mythos-preview model. The update also includes a bug fix for query parameter merging in the client.

April 6, 2026
product updateAnthropic

Anthropic blocks Claude subscriptions for OpenClaw, citing capacity constraints

Anthropic has disallowed subscription-based pricing for users accessing Claude through open-source agentic tools like OpenClaw, effective April 4, 2026. The restriction comes as the company faces elevated service errors and struggles to balance capacity with demand. Third-party tool usage will now draw from pay-per-token rates instead of subscription limits.

April 4, 2026
changelogAnthropic

Anthropic ends free Claude access through third-party tools like OpenClaw

Anthropic has ended free access to Claude through third-party applications like OpenClaw, effective April 4, 2025 at 3PM ET. Users of third-party tools integrating Claude must now purchase usage bundles or use an API key. The change stems from capacity constraints and usage patterns incompatible with subscription tiers.

April 1, 2026
model release

Arcee AI releases Trinity Large Thinking, open-source reasoning model with 262K context window

Arcee AI has released Trinity Large Thinking, an open-source reasoning model featuring a 262,144 token context window. The model is priced at $0.25 per million input tokens and $0.90 per million output tokens, with free access available through OpenRouter for the first five days.

March 28, 2026
product update

Google's Gemini API Agent Skill boosts coding task success from 28% to 97%

Google has released an Agent Skill for the Gemini API that provides language models with current information about their own APIs, SDKs, and best practices. Testing across 117 coding tasks showed Gemini 3.1 Pro's success rate jumped from 28.2% to 96.6%, though older 2.5-series models showed minimal improvement.

March 14, 2026
product updateOpenAI

OpenAI Python SDK v2.27.0 adds Sora video API improvements and character support

OpenAI released version 2.27.0 of its Python SDK on March 13, 2026, adding significant improvements to the Sora video generation API. The update introduces character API support, video extension and editing capabilities, and higher resolution export options.