
Mistral OCR 3 launches at $2 per 1,000 pages with 74% win rate over previous version

TL;DR

Mistral AI released OCR 3, a document parsing model priced at $2 per 1,000 pages with a 50% batch API discount. The company claims a 74% overall win rate compared to Mistral OCR 2 on forms, scanned documents, complex tables, and handwriting.

Mistral AI released OCR 3 (model ID: mistral-ocr-2512), a document parsing model priced at $2 per 1,000 pages, or $1 per 1,000 pages using the batch API. The company claims a 74% overall win rate compared to its previous OCR 2 model across forms, scanned documents, complex tables, and handwriting.
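As a rough illustration of how the model ID might be used, the sketch below builds (but does not send) an HTTP request for a hosted OCR endpoint. Only the model ID comes from the announcement. The endpoint URL, the payload shape, and the helper name build_ocr_request are assumptions for illustration and should be checked against Mistral's API reference before use.

```python
import json
import urllib.request

# Assumed endpoint; verify against Mistral's API reference.
OCR_ENDPOINT = "https://api.mistral.ai/v1/ocr"
# Model ID as reported in the announcement.
MODEL_ID = "mistral-ocr-2512"


def build_ocr_request(document_url: str, api_key: str) -> urllib.request.Request:
    """Build a POST request for one document (hypothetical helper, not sent here)."""
    # Assumed payload shape: a model name plus a document referenced by URL.
    payload = {
        "model": MODEL_ID,
        "document": {"type": "document_url", "document_url": document_url},
    }
    return urllib.request.Request(
        OCR_ENDPOINT,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    req = build_ocr_request("https://example.com/sample.pdf", "YOUR_API_KEY")
    print(req.get_method(), req.full_url)
    print(req.data.decode("utf-8"))
```

Passing the resulting request to urllib.request.urlopen would perform the call; the response format is not described in the announcement and is left out here.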

Pricing and availability

  • Standard API: $2 per 1,000 pages
  • Batch API: $1 per 1,000 pages (50% discount)
  • Available now via API and Document AI Playground in Mistral AI Studio
  • Self-hosting option available for organizations with data privacy requirements
  • Fully backward compatible with Mistral OCR 2
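
To put the listed rates in context, here is a minimal cost estimate. It assumes simple linear per-page billing with no minimums, rounding, or volume tiers, none of which the announcement specifies; the function name estimate_cost_usd is illustrative.

```python
# Rates as listed in the announcement (USD per 1,000 pages).
STANDARD_USD_PER_1K_PAGES = 2.00
BATCH_USD_PER_1K_PAGES = 1.00  # 50% batch discount


def estimate_cost_usd(pages: int, batch: bool = False) -> float:
    """Estimate OCR cost, assuming billing scales linearly with page count."""
    if pages < 0:
        raise ValueError("pages must be non-negative")
    rate = BATCH_USD_PER_1K_PAGES if batch else STANDARD_USD_PER_1K_PAGES
    return pages * rate / 1000


if __name__ == "__main__":
    for pages in (1_000, 250_000, 1_000_000):
        standard = estimate_cost_usd(pages)
        batch = estimate_cost_usd(pages, batch=True)
        print(f"{pages:>9,} pages: ${standard:,.2f} standard, ${batch:,.2f} batch")
```

Under these assumptions, a one-million-page archive would cost about $2,000 through the standard API and about $1,000 through the batch API.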

Technical capabilities

Mistral OCR 3 extracts text and embedded images from documents and outputs Markdown, with tables reconstructed as HTML. The model handles:

  • Handwriting: Cursive text, mixed-content annotations, and handwritten entries on printed forms
  • Forms: Invoices, receipts, compliance forms, and government documents, with improved box and label detection
  • Low-quality scans: Compression artifacts, skew, distortion, low DPI, and background noise
  • Complex tables: Reconstructs structures with headers, merged cells, multi-row blocks, and column hierarchies using HTML tags with colspan/rowspan attributes
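
Because tables arrive as HTML with colspan/rowspan attributes, downstream code typically needs to expand them into a rectangular grid before loading them into a spreadsheet or database. The sketch below shows one way to do that with Python's standard library. The sample table is invented for illustration and is not actual model output, and the parser assumes a single, non-nested table.

```python
from html.parser import HTMLParser


class TableGrid(HTMLParser):
    """Expand one HTML table into a rectangular grid, repeating spanned cells."""

    def __init__(self):
        super().__init__()
        self.grid = []      # completed rows
        self._pending = {}  # (row, col) -> text carried down by a rowspan
        self._row = None    # cells of the row currently being built
        self._cell = None   # [text parts, colspan, rowspan] of the open cell

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            a = dict(attrs)
            self._cell = [[], int(a.get("colspan") or 1), int(a.get("rowspan") or 1)]

    def handle_data(self, data):
        if self._cell is not None:
            self._cell[0].append(data)

    def handle_endtag(self, tag):
        if tag in ("td", "th") and self._cell is not None:
            parts, colspan, rowspan = self._cell
            text = "".join(parts).strip()
            r = len(self.grid)
            self._fill_pending(r)
            c = len(self._row)
            for dc in range(colspan):
                self._row.append(text)
                # Reserve the same column in the following rows.
                for dr in range(1, rowspan):
                    self._pending[(r + dr, c + dc)] = text
            self._cell = None
        elif tag == "tr" and self._row is not None:
            self._fill_pending(len(self.grid))
            self.grid.append(self._row)
            self._row = None

    def _fill_pending(self, r):
        # Insert cells carried down from earlier rowspans at the current column.
        while (r, len(self._row)) in self._pending:
            self._row.append(self._pending.pop((r, len(self._row))))


def expand_table(html: str) -> list[list[str]]:
    parser = TableGrid()
    parser.feed(html)
    parser.close()
    return parser.grid


# Invented sample with a merged header, not real OCR output.
SAMPLE = """
<table>
<tr><th rowspan="2">Region</th><th colspan="2">Revenue</th></tr>
<tr><th>Q1</th><th>Q2</th></tr>
<tr><td>EMEA</td><td>10</td><td>12</td></tr>
</table>
"""

if __name__ == "__main__":
    for row in expand_table(SAMPLE):
        print(row)
```

Repeating the text of a merged cell across every position it covers keeps each row the same length, which is usually the easiest shape for tabular tools to ingest.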

Benchmarks

Mistral AI evaluated OCR 3 on internal benchmarks based on customer use cases, comparing outputs to ground truth using fuzzy-match metrics. The company claims the model outperforms "enterprise document processing solutions as well as AI-native OCR solutions," though specific competitor comparisons and benchmark scores were not disclosed.
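
Mistral did not publish its exact metric, so the following is only a sketch of how a fuzzy-match win rate could be computed. It assumes a character-level similarity ratio from Python's difflib as the fuzzy-match score and counts a win only when the new model scores strictly higher than the old one on a document; ties count as non-wins. The sample strings are invented.

```python
from difflib import SequenceMatcher


def fuzzy_score(predicted: str, truth: str) -> float:
    """Similarity in [0, 1]; 1.0 means the strings are identical."""
    return SequenceMatcher(None, predicted, truth).ratio()


def win_rate(new_outputs, old_outputs, truths) -> float:
    """Share of documents where the new output beats the old one (ties are not wins)."""
    if not (len(new_outputs) == len(old_outputs) == len(truths)):
        raise ValueError("all three sequences must have the same length")
    if not truths:
        raise ValueError("at least one document is required")
    wins = sum(
        fuzzy_score(new, truth) > fuzzy_score(old, truth)
        for new, old, truth in zip(new_outputs, old_outputs, truths)
    )
    return wins / len(truths)


if __name__ == "__main__":
    # Invented examples: ground truth plus two hypothetical model outputs.
    truths = ["Invoice 1042", "Total: $58.00", "Due 2025-01-31", "ACME Corp"]
    new = ["Invoice 1042", "Total: $58.00", "Due 2025-01-31", "ACNE Corp"]
    old = ["Invoice 1O42", "Total: $53.00", "Due 2025-01-31", "ACME Corp"]
    print(f"win rate: {win_rate(new, old, truths):.0%}")
```

How ties and near-ties are scored can shift a headline win rate noticeably, which is one reason an undisclosed methodology is hard to compare across vendors.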

According to Mistral AI, OCR 3 represents "a significant upgrade across all languages and document form factors" compared to OCR 2. The company describes it as "a much smaller model than most competitive solutions."

Customer applications

Early customers are using Mistral OCR 3 for:

  • Invoice processing into structured fields
  • Company archive digitization
  • Clean text extraction from technical and scientific reports
  • Enterprise search enhancement
  • Document-to-knowledge transformation pipelines

What this means

Mistral's aggressive pricing of $1 to $2 per 1,000 pages positions OCR 3 as a cost-competitive alternative to existing document processing services. The claimed 74% win rate suggests a substantial improvement over the previous generation, though the absence of third-party benchmarks or specific competitor comparisons makes independent verification difficult. If the model is as small as the company says, it could enable faster processing and lower compute costs for high-volume document workflows. Self-hosting availability addresses the enterprise compliance requirements that often block adoption of cloud-based document processing.
