model release

Baidu Releases Free Qianfan-OCR-Fast Model with 65K Context Window

TL;DR

Baidu has released Qianfan-OCR-Fast, a specialized OCR model with a 65,536 token context window, available at zero cost through OpenRouter. The model launched on April 20, 2026, and is positioned as a performance upgrade over the original Qianfan-OCR.

April 23, 2026 · 2:20 AM2 min read

Qianfan-OCR-Fast — Quick Specs

Context window66K tokens

Input$0.68/1M tokens

Output$2.81/1M tokens

Compare Qianfan-OCR-Fast with other models →

Baidu Releases Free Qianfan-OCR-Fast Model with 65K Context Window

Baidu has released Qianfan-OCR-Fast, a domain-specific multimodal model designed exclusively for optical character recognition tasks. The model launched on April 20, 2026, with a 65,536 token context window and zero-cost pricing through OpenRouter.

Technical Specifications

Context window: 65,536 tokens
Pricing: $0 per million input tokens, $0 per million output tokens
Model type: Multimodal (OCR-focused)
Availability: Via OpenRouter API

Model Architecture and Purpose

According to Baidu, Qianfan-OCR-Fast is purpose-built for OCR applications using specialized OCR training data. The company claims the model provides "a powerful performance upgrade" over its predecessor, Qianfan-OCR, while maintaining multimodal intelligence capabilities.

The model is positioned as a domain-specific solution rather than a general-purpose multimodal model, indicating focused optimization for text extraction and document understanding tasks.

Distribution and Access

The model is available exclusively through OpenRouter, which provides an OpenAI-compatible API interface. OpenRouter routes requests across multiple providers with automatic fallbacks to maximize uptime. The platform normalizes requests and responses, allowing developers to access the model using OpenAI SDK, Anthropic SDK, or direct API calls.

The "free" designation in the model name (baidu/qianfan-ocr-fast:free) suggests this may be a tier within Baidu's model lineup, though no paid alternative has been announced.

What This Means

Baidu's release of a zero-cost OCR model with a substantial 65K context window addresses a specific enterprise need: document processing at scale without API costs. The free pricing makes it viable for high-volume OCR applications like document digitization pipelines and automated data extraction systems. However, without published benchmark scores comparing it to competitors like GPT-4 Vision or Claude 3's OCR capabilities, developers will need to conduct their own performance evaluations. The OpenRouter-only distribution is notable, suggesting Baidu may be testing international market reception before broader deployment.

Source: openrouter.ai ↗

Baidu OCR multimodal free model OpenRouter Qianfan document AI

model releaseJuly 20, 2026

Alibaba releases Qwen 3.8, a 2.4 trillion parameter open-weight model claiming second place behind Fable 5

Alibaba has released Qwen 3.8, a 2.4 trillion parameter open-weight model that the company claims trails only Fable 5. The multimodal model processes images, videos, and documents, with a preview available through Alibaba's platforms at 10 percent of standard pricing.

model releaseJuly 20, 2026

Thinking Machines releases Inkling: 975B-parameter MoE model with Apache 2.0 license, first major US open-weight multimo

Thinking Machines Lab released Inkling, a mixture-of-experts model with 975B total parameters and 41B active parameters, trained on 45 trillion tokens across text, images, audio, and video. The Apache 2.0-licensed model supports up to 1M context and debuts alongside Inkling-Small (276B-A12B), marking what observers call the strongest US-based open-weight release to date.

model releaseJuly 20, 2026

Meituan launches LongCat 2.0: 1.6T parameter MoE model with 1M+ context window at $0.30 per 1M input tokens

Meituan has released LongCat 2.0, a sparse mixture-of-experts language model with 48 billion active parameters out of 1.6 trillion total. The model features a 1,049,000 token context window and costs $0.30 per 1M input tokens and $1.20 per 1M output tokens.

model releaseJuly 20, 2026

Alibaba previews Qwen3.8 with 2.4 trillion parameters, claims second place without benchmark data

Alibaba unveiled Qwen3.8 at the World Artificial Intelligence Conference in Shanghai, claiming the 2.4 trillion parameter model ranks second only to Anthropic's Fable 5. The company provided no benchmark scores, model card, or independent verification to support the claim.

Baidu Releases Free Qianfan-OCR-Fast Model with 65K Context Window

Qianfan-OCR-Fast — Quick Specs

Baidu Releases Free Qianfan-OCR-Fast Model with 65K Context Window

Technical Specifications

Model Architecture and Purpose

Distribution and Access

What This Means

Related Articles

Alibaba releases Qwen 3.8, a 2.4 trillion parameter open-weight model claiming second place behind Fable 5

Thinking Machines releases Inkling: 975B-parameter MoE model with Apache 2.0 license, first major US open-weight multimo

Meituan launches LongCat 2.0: 1.6T parameter MoE model with 1M+ context window at $0.30 per 1M input tokens

Alibaba previews Qwen3.8 with 2.4 trillion parameters, claims second place without benchmark data

Comments