Mistral OCR 3 launches at $2 per 1,000 pages with 74% win rate over previous version
Mistral AI released Mistral OCR 3, a document extraction model priced at $2 per 1,000 pages ($1 with the Batch API discount). According to Mistral's internal benchmarks, the model achieves a 74% overall win rate over its predecessor on forms, scanned documents, complex tables, and handwriting.
Mistral AI released Mistral OCR 3 on December 17, 2025. The document extraction model is priced at $2 per 1,000 pages, dropping to $1 per 1,000 pages with the Batch API discount. It is available through the API (identifier: mistral-ocr-2512) and via the Document AI Playground in Mistral AI Studio.
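For readers who want to try the model identifier, here is a minimal sketch of how a request body might be assembled. The endpoint URL, the `document` / `document_url` payload shape, and the `include_image_base64` flag are assumptions based on Mistral's earlier OCR API and should be checked against the current API reference; `build_ocr_request` is a hypothetical helper, not part of any SDK. The sketch only builds the JSON and does not call the network.

```python
import json

# Assumed endpoint and payload shape; verify against Mistral's API reference.
OCR_ENDPOINT = "https://api.mistral.ai/v1/ocr"
MODEL_ID = "mistral-ocr-2512"  # identifier quoted in the article


def build_ocr_request(document_url: str, include_images: bool = False) -> dict:
    """Return a JSON-serializable body for one OCR call on a hosted document."""
    return {
        "model": MODEL_ID,
        "document": {"type": "document_url", "document_url": document_url},
        "include_image_base64": include_images,
    }


if __name__ == "__main__":
    # Hypothetical document URL, used only to show the resulting payload.
    body = build_ocr_request("https://example.com/sample.pdf")
    print(f"POST {OCR_ENDPOINT}")
    print(json.dumps(body, indent=2))
```

Sending the request would additionally require an API key passed as a bearer token, which is left out here.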
Performance claims
According to Mistral AI, the model achieves a 74% overall win rate compared to Mistral OCR 2 across forms, scanned documents, complex tables, and handwriting. The company claims state-of-the-art accuracy compared to both enterprise document processing solutions and AI-native OCR solutions, though specific benchmark scores against named competitors were not disclosed.
Mistral evaluated the model using internal benchmarks based on customer use cases, comparing outputs to ground truth using fuzzy-match metrics for accuracy.
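Mistral has not published the exact metric, so the following is only an illustration of how a fuzzy-match comparison and a win rate could be computed. It assumes a character-level similarity from Python's `difflib` as the fuzzy score and counts ties as half a win; both choices are assumptions, not Mistral's method, and the function names are hypothetical.

```python
from difflib import SequenceMatcher


def fuzzy_score(predicted: str, truth: str) -> float:
    """Similarity in [0, 1] between an OCR output and its ground truth."""
    return SequenceMatcher(None, predicted, truth).ratio()


def win_rate(new_outputs, old_outputs, truths) -> float:
    """Share of documents where the new model scores higher than the old one.

    Ties count as half a win (an assumption made for this sketch).
    """
    if not (len(new_outputs) == len(old_outputs) == len(truths)):
        raise ValueError("all three lists must have the same length")
    if not truths:
        raise ValueError("need at least one document")
    points = 0.0
    for new, old, truth in zip(new_outputs, old_outputs, truths):
        new_score = fuzzy_score(new, truth)
        old_score = fuzzy_score(old, truth)
        if new_score > old_score:
            points += 1.0
        elif new_score == old_score:
            points += 0.5
    return points / len(truths)


if __name__ == "__main__":
    truths = ["Invoice 1042", "Total: $15.00"]
    old = ["Invoice 1O42", "Total: $15.00"]  # letter O misread for zero
    new = ["Invoice 1042", "Total: $15.00"]
    print(win_rate(new, old, truths))
```

A team validating the vendor's claims on its own documents could swap in whatever similarity measure matches its downstream tolerance for errors.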
Technical capabilities
Mistral OCR 3 extracts text and embedded images from documents, outputting markdown enriched with HTML-based table reconstruction. The model handles:
- Handwriting: Cursive, mixed-content annotations, and handwritten text over printed forms
- Forms: Box detection, labels, handwritten entries, invoices, receipts, compliance forms
- Scanned documents: Compression artifacts, skew, distortion, low DPI, background noise
- Complex tables: Headers, merged cells, multi-row blocks, and column hierarchies, reconstructed as HTML tables with colspan/rowspan
Mistral says the model supports all languages and document form factors, and describes it as a significant upgrade over OCR 2.
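Because tables come back as HTML with colspan/rowspan, downstream code usually needs to expand merged cells into a rectangular grid. The sketch below shows one way to do that with Python's standard `html.parser`. The sample table is invented for illustration and is not actual model output, and `table_to_grid` is a hypothetical helper that assumes a single, non-nested table.

```python
from html.parser import HTMLParser


class TableGridParser(HTMLParser):
    """Collect the cells of a single table as (text, colspan, rowspan) per row."""

    def __init__(self):
        super().__init__()
        self.rows = []      # one list of cells per <tr>
        self._cell = None   # [text_parts, colspan, rowspan] while inside a cell

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self.rows.append([])
        elif tag in ("td", "th") and self.rows:
            a = dict(attrs)
            self._cell = [[], int(a.get("colspan") or 1), int(a.get("rowspan") or 1)]

    def handle_data(self, data):
        if self._cell is not None:
            self._cell[0].append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            parts, colspan, rowspan = self._cell
            self.rows[-1].append(("".join(parts).strip(), colspan, rowspan))
            self._cell = None


def table_to_grid(html: str):
    """Expand merged cells so every grid slot holds the text that covers it."""
    parser = TableGridParser()
    parser.feed(html)
    parser.close()
    grid = {}  # (row, col) -> text
    for r, row in enumerate(parser.rows):
        c = 0
        for text, colspan, rowspan in row:
            while (r, c) in grid:  # skip slots claimed by an earlier rowspan
                c += 1
            for dr in range(rowspan):
                for dc in range(colspan):
                    grid[(r + dr, c + dc)] = text
            c += colspan
    if not grid:
        return []
    n_rows = max(r for r, _ in grid) + 1
    n_cols = max(c for _, c in grid) + 1
    return [[grid.get((r, c), "") for c in range(n_cols)] for r in range(n_rows)]


# Invented sample in the style described above, not real model output.
SAMPLE = """
<table>
<tr><th rowspan="2">Item</th><th colspan="2">Price</th></tr>
<tr><th>Net</th><th>Gross</th></tr>
<tr><td>Widget</td><td>10</td><td>12</td></tr>
</table>
"""

if __name__ == "__main__":
    for row in table_to_grid(SAMPLE):
        print(row)
```

Repeating the spanned text in every covered slot is one design choice among several; leaving covered slots empty may suit some spreadsheets better.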
Pricing structure
- Standard API: $2 per 1,000 pages
- Batch API: $1 per 1,000 pages (50% discount)
- Annotations: $3 per 1,000 pages
- Self-hosting option available for organizations with data privacy requirements
The model is fully backward compatible with Mistral OCR 2.
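To make the per-1,000-page rates above concrete, here is a small cost estimator. It assumes billing scales linearly with page count, with no minimums or rounding to whole thousands; the article does not state how partial thousands are billed or whether the annotations rate is charged instead of or on top of the base rate, so treat the result as a rough estimate. `estimate_cost` and the tier names are hypothetical.

```python
from decimal import Decimal

# USD per 1,000 pages, as quoted in the article.
RATES_PER_1000 = {
    "standard": Decimal("2"),
    "batch": Decimal("1"),
    "annotations": Decimal("3"),
}


def estimate_cost(pages: int, tier: str = "standard") -> Decimal:
    """Estimated USD cost for a page count, assuming linear per-page billing."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    if tier not in RATES_PER_1000:
        raise ValueError(f"unknown tier: {tier!r}")
    return RATES_PER_1000[tier] * pages / 1000


if __name__ == "__main__":
    for tier in RATES_PER_1000:
        print(tier, estimate_cost(250_000, tier))
```

Under these assumptions, 250,000 pages would come to $500 at the standard rate and $250 through the Batch API.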
What this means
At the standard price of $2 per 1,000 pages, or $0.002 per page, Mistral OCR 3 undercuts enterprise document processing solutions that often charge per-page rates in the cents range. The 50% Batch API discount makes it particularly competitive for high-volume workflows. However, Mistral has not published benchmark comparisons against named products such as Amazon Textract or Google's OCR offerings, so customers will need to validate its performance claims on their own documents. The model's ability to handle multiple document types could also simplify pipelines that currently rely on specialized tools for different formats.