Mistral Launches OCR API at $1 Per 1,000 Pages, Claims 94.89% Accuracy on Document Benchmarks

TL;DR

Mistral AI has released Mistral OCR, an API for extracting text and images from documents at $1 per 1,000 pages (approximately $0.50 with batch inference). The company claims 94.89% overall accuracy on its internal test set, comparing favorably to GPT-4o (89.77%), Gemini 2.0 Flash (88.69%), and Azure OCR (89.52%).

June 18, 2026 · 8:38 AM · 2 min read

The API accepts images and PDFs as input and outputs interleaved text and images in markdown format. Mistral has deployed the model as the default document understanding system on Le Chat, its chatbot platform.
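
To make "interleaved text and images in markdown" concrete, the following minimal sketch splits such a string into ordered text and image segments. The sample page, the image file name, and the `split_interleaved` helper are hypothetical, and the sketch assumes images appear as standard markdown image references of the form `![alt](target)`; it does not call the Mistral API.

```python
import re
from typing import List, Tuple

# Standard markdown image syntax: ![alt text](target)
IMAGE_REF = re.compile(r"!\[([^\]]*)\]\(([^)\s]+)\)")


def split_interleaved(markdown: str) -> List[Tuple[str, str]]:
    """Return ordered ("text", content) and ("image", target) segments."""
    segments: List[Tuple[str, str]] = []
    cursor = 0
    for match in IMAGE_REF.finditer(markdown):
        # Text that sits between the previous image (or start) and this one.
        text = markdown[cursor:match.start()].strip()
        if text:
            segments.append(("text", text))
        segments.append(("image", match.group(2)))
        cursor = match.end()
    tail = markdown[cursor:].strip()
    if tail:
        segments.append(("text", tail))
    return segments


# Hypothetical OCR output for a single page (not real API output).
sample = (
    "# Quarterly report\n\nRevenue grew.\n\n"
    "![chart](img-0.jpeg)\n\nSee the chart above."
)
for kind, value in split_interleaved(sample):
    print(kind, "->", value)
```

Keeping the segments in document order preserves the relationship between a figure and the text around it, which is the point of interleaved output.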

Performance Claims

According to Mistral, the model achieved the following scores on its internal "text-only" test set:

Overall accuracy: 94.89% (vs GPT-4o's 89.77%)
Math extraction: 94.29% (vs GPT-4o's 87.55%)
Scanned documents: 98.96% (vs GPT-4o's 94.58%)
Tables: 96.12% (vs GPT-4o's 91.70%)
Multilingual: 89.55% (vs GPT-4o's 86.00%)
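
One way to read these figures is to convert each pair into a percentage-point gap and a relative reduction in error rate. The sketch below does that arithmetic on the scores quoted above; the `compare` helper is illustrative, and treating `100 - accuracy` as an error rate is an assumption about how the benchmark is scored.

```python
from typing import Dict, Tuple

# Scores as quoted in the article, in percent: (Mistral OCR, GPT-4o)
SCORES: Dict[str, Tuple[float, float]] = {
    "Overall": (94.89, 89.77),
    "Math extraction": (94.29, 87.55),
    "Scanned documents": (98.96, 94.58),
    "Tables": (96.12, 91.70),
    "Multilingual": (89.55, 86.00),
}


def compare(mistral: float, other: float) -> Tuple[float, float]:
    """Return (percentage-point gap, relative error reduction in percent)."""
    gap = mistral - other
    # Assumption: error rate is simply 100 minus the reported accuracy.
    err_mistral = 100.0 - mistral
    err_other = 100.0 - other
    reduction = (1.0 - err_mistral / err_other) * 100.0
    return round(gap, 2), round(reduction, 1)


for name, (mistral, gpt4o) in SCORES.items():
    gap, reduction = compare(mistral, gpt4o)
    print(f"{name}: +{gap:.2f} points, {reduction:.1f}% fewer errors")
```

On the overall figure, the 5.12-point gap corresponds to roughly half as many errors (5.11% versus 10.23%), which is a more telling comparison than the raw accuracy difference.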

The company claims processing speeds of up to 2,000 pages per minute on a single node. Mistral states it extracts embedded images from documents alongside text, a capability not present in the compared models.
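
As a rough illustration of what the throughput claim implies, the sketch below computes a best-case processing time. It treats the "up to" figure as a sustained rate, which is an optimistic assumption, and `best_case_minutes` is a hypothetical helper rather than anything from Mistral.

```python
PAGES_PER_MINUTE = 2000  # claimed peak for a single node, per the article


def best_case_minutes(pages: int) -> float:
    """Lower bound on minutes needed if the peak rate were sustained."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    return pages / PAGES_PER_MINUTE


print(best_case_minutes(1_000_000))  # 500.0 minutes, a little over 8 hours
```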

Multilingual Support

Mistral claims 99.02% fuzzy match accuracy across multiple languages on its benchmarks, compared to 96.53% for Gemini 2.0 Flash and 97.31% for Azure OCR. The company reports accuracy above 97% for 11 tested languages, including Russian (99.09%), German (99.51%), Spanish (99.54%), Chinese (97.11%), and Hindi (97.55%).
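
The article does not say how the fuzzy match score is computed. As a stand-in, the sketch below uses the similarity ratio from Python's standard `difflib` module to show the general idea: near-identical strings score close to 1.0 instead of counting as outright failures. The `fuzzy_match` helper and the sample strings are hypothetical, and Mistral's actual metric may differ.

```python
from difflib import SequenceMatcher


def fuzzy_match(reference: str, extracted: str) -> float:
    """Similarity in [0, 1]; 1.0 means identical after whitespace cleanup."""
    # Collapse whitespace so layout differences don't dominate the score.
    ref = " ".join(reference.split())
    ext = " ".join(extracted.split())
    if not ref and not ext:
        return 1.0
    return SequenceMatcher(None, ref, ext, autojunk=False).ratio()


# Hypothetical ground truth and OCR output with two dropped umlauts.
reference = "Die Rechnung ist fällig am 3. März."
extracted = "Die Rechnung ist fallig am 3. Marz."
print(f"{fuzzy_match(reference, extracted):.3f}")
```

A metric of this kind rewards partial correctness, so it usually yields higher numbers than an exact-match score on the same output.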

Technical Capabilities

The model handles:

Mathematical expressions and LaTeX formatting
Complex tables and interleaved imagery
Documents as prompts with structured JSON output
Multiple scripts and fonts across languages

Mistral positions the API for use in RAG (Retrieval-Augmented Generation) systems processing multimodal documents like slides and complex PDFs. Users can chain extracted outputs into downstream function calls for agent-based workflows.
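
For the RAG use case, extracted markdown typically has to be split into retrievable chunks before indexing. The sketch below shows one simple approach, assuming the OCR output is markdown with `#` headings and blank-line-separated blocks. The `chunk_markdown` helper, the size limit, and the sample page are all hypothetical.

```python
from typing import List


def chunk_markdown(markdown: str, max_chars: int = 500) -> List[str]:
    """Group blank-line-separated blocks into chunks of about max_chars.

    A new chunk starts at every markdown heading. A single block longer
    than max_chars is kept whole rather than cut mid-sentence.
    """
    chunks: List[str] = []
    current = ""
    for block in markdown.split("\n\n"):
        block = block.strip()
        if not block:
            continue
        starts_section = block.startswith("#")
        too_long = len(current) + len(block) + 2 > max_chars
        if current and (starts_section or too_long):
            chunks.append(current)
            current = ""
        current = f"{current}\n\n{block}" if current else block
    if current:
        chunks.append(current)
    return chunks


# Hypothetical OCR output; the image reference stays with its section.
page = (
    "# Intro\n\nFirst paragraph.\n\n![fig](img-0.jpeg)\n\n"
    "# Methods\n\nSecond paragraph."
)
for chunk in chunk_markdown(page):
    print(repr(chunk))
```

Splitting on headings keeps each image reference next to the text that discusses it, so a retriever can return both together.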

Availability and Deployment

The API is available today on la Plateforme, Mistral's developer platform. The company plans to extend availability to cloud and inference partners, plus on-premises deployment on a selective basis for organizations handling classified information.

Mistral has not disclosed the model's parameter count, architecture details, or training data composition.

What This Means

Mistral OCR enters a competitive market dominated by Google Document AI, Azure OCR, and general-purpose multimodal models like GPT-4o and Gemini. The pricing of $1 per 1,000 pages undercuts typical enterprise OCR pricing, though direct cost comparisons depend on specific use cases and batch processing capabilities. The claimed accuracy advantages—particularly on mathematical content (94.29% vs GPT-4o's 87.55%)—could make it viable for scientific and technical document processing if the benchmarks prove reproducible on external test sets. The key differentiation appears to be simultaneous text and image extraction in a single pass, which existing general-purpose LLMs don't natively support.
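
To put the pricing in concrete terms, the sketch below estimates spend from the two rates quoted in this article. It assumes strictly linear per-page pricing with no minimums, tiers, or taxes, which the article does not confirm, and `estimate_cost` is a hypothetical helper.

```python
STANDARD_USD_PER_1K = 1.00  # list price quoted in the article
BATCH_USD_PER_1K = 0.50     # approximate batch price quoted in the article


def estimate_cost(pages: int, batch: bool = False) -> float:
    """Estimated cost in USD, assuming linear per-page pricing."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    rate = BATCH_USD_PER_1K if batch else STANDARD_USD_PER_1K
    return round(pages / 1000 * rate, 2)


print(estimate_cost(250_000))              # 250.0
print(estimate_cost(250_000, batch=True))  # 125.0
```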

Source: mistral.ai
