OpenRouter

25 articles tagged with OpenRouter

July 9, 2026

analysis

OpenAI Launches GPT-5.6 Series with Five Model Variants

OpenAI has released five variants of GPT-5.6, according to listings on OpenRouter. The new series includes Pro and standard versions named Sol, Terra, and Luna, though official specifications and pricing remain undisclosed.

July 9, 2026 · 5:51 PM

July 7, 2026

model releaseAionlabs

Aion Labs Releases Aion-3.0-Mini: Multi-Model Storytelling System Built on DeepSeek

Aion Labs has released Aion-3.0-Mini, a multi-model system designed for roleplaying and storytelling applications. The system uses multiple specialized models working collaboratively on the DeepSeek architecture, with a 131K context window and pricing at $0.70 per 1M input tokens and $1.40 per 1M output tokens.

July 7, 2026 · 7:36 PM

analysisOpenAI

Chinese AI Models Capture 30%+ of U.S. Developer Token Usage as OpenAI, Anthropic Costs Rise

Chinese AI models including DeepSeek and Z.ai have captured over 30% of weekly token usage by U.S. companies on OpenRouter since February 2025, up from 4.5% in the first half of the year. The shift comes as companies seek alternatives 60-90% cheaper than leading models from OpenAI and Anthropic, while Chinese models close the performance gap to within 6-9 months of U.S. frontier systems.

July 7, 2026 · 9:50 AM

June 30, 2026

model release

Google launches Gemini 3.1 Flash Lite Image with 4-second generation time, $0.25 per 1M input tokens

Google has released Gemini 3.1 Flash Lite Image, a text-to-image model that generates 1K resolution images in approximately 4 seconds — 2.7× faster than Gemini 3.1 Flash Image. The model is priced at $0.25 per 1M input tokens and $1.50 per 1M output tokens, with a 66K context window and knowledge cutoff of January 2025.

June 30, 2026 · 7:35 PM

June 24, 2026

model release

Sakana AI Releases Fugu Ultra: Multi-Agent Orchestration System with 1M Context Window at $5/$30 per Million Tokens

Sakana AI has released Fugu Ultra, a multi-agent orchestration system that routes tasks across pools of underlying models rather than operating as a single monolithic model. The system supports a 1M token context window and is priced at $5 per million input tokens and $30 per million output tokens.

June 24, 2026 · 5:05 AM

June 3, 2026

model release

Alibaba's Qwen Releases Qwen3.7 Plus: 1M Context Window at $0.40 Per Million Input Tokens

Alibaba's Qwen has released Qwen3.7 Plus, a multimodal model with a 1 million token context window. The model accepts text and image input with text output, priced at $0.40 per million input tokens and $1.60 per million output tokens through OpenRouter's API.

June 3, 2026 · 1:20 PM

June 2, 2026

product updateOpenrouter

OpenRouter Launches Fusion: Multi-Model Consensus System That Runs Expert Panels in Parallel

OpenRouter has released Fusion, a multi-model routing system that processes prompts through parallel expert model panels with web search enabled, then uses a judge model to synthesize consensus, contradictions, and unique insights. Users pay the sum of all underlying model completions rather than a single model price.

June 2, 2026 · 7:50 PM

June 1, 2026

model release+1

MiniMax Launches M3 Model With 1M Context Window at $0.30 Per Million Input Tokens

MiniMax has released M3, a multimodal foundation model supporting text, image, and video inputs with a 1-million-token context window. The model costs $0.30 per million input tokens and $1.20 per million output tokens, available through OpenRouter.

June 1, 2026 · 1:05 AM

May 20, 2026

model releasexAI

xAI Launches Grok Build 0.1: Coding Model with 256K Context for Agentic Workflows

xAI has released Grok Build 0.1, a coding-specialized model with a 256K context window and unlimited text output. The model is designed for agentic software engineering workflows and powers xAI's Grok Build CLI tool.

May 20, 2026 · 5:50 PM

May 14, 2026

model release

Baidu Releases Qianfan-OCR-Fast Model with 66K Context at $0.68 Per 1M Input Tokens

Baidu has released Qianfan-OCR-Fast, a multimodal model specialized for optical character recognition tasks. The model offers a 66,000 token context window and is priced at $0.68 per 1M input tokens and $2.81 per 1M output tokens.

May 14, 2026 · 2:20 PM

May 13, 2026

model releaseDeepSeek

DeepSeek Releases V4 Flash: 284B-Parameter MoE Model with 1M Context Window, Free via OpenRouter

DeepSeek has released V4 Flash, a Mixture-of-Experts model with 284B total parameters and 13B activated parameters per forward pass. The model supports a 1M-token context window and is available free through OpenRouter, targeting high-throughput coding and chat applications.

May 13, 2026 · 11:50 PM

May 12, 2026

changelogAnthropic

Anthropic releases Claude Opus 4.7 Fast with 6x pricing for higher output speed

Anthropic has released Claude Opus 4.7 Fast, a speed-optimized variant of its Opus 4.7 model. The fast-mode version delivers identical capabilities with higher output speed at premium pricing: $30 per 1M input tokens and $150 per 1M output tokens, representing a 6x increase over standard pricing.

May 12, 2026 · 7:20 PM

May 11, 2026

model releaseArcee Ai

Arcee AI Releases Trinity Large Thinking: Free 262K Context Reasoning Model

Arcee AI has released Trinity Large Thinking, an open source reasoning model with a 262,144-token context window. The model is available free via OpenRouter and claims strong performance in PinchBench, agentic workloads, and reasoning tasks.

May 11, 2026 · 3:50 PM

May 6, 2026

model release

Baidu Launches CoBuddy Code Generation Model with 131K Context Window, Free on OpenRouter

Baidu has released CoBuddy, a code generation model optimized for coding tasks and AI agent workflows. The model features a 131K token context window, up to 65K output tokens, and runs on fp8 quantization with native support for tool calling and reasoning.

May 6, 2026 · 3:05 AM

April 30, 2026

model releaseOpenrouter

OpenRouter Launches Owl Alpha: Free Foundation Model for Agentic Workflows with 1M Context

OpenRouter has released Owl Alpha, a foundation model specifically designed for agentic workloads with native tool use support and a 1,048,756 token context window. The model is currently free for both input and output tokens and is compatible with Claude Code, OpenClaw, and other productivity tools.

April 30, 2026 · 2:50 PM

April 28, 2026

model release

Poolside releases Laguna XS.2, free fp8-quantized coding agent with 128K context

Poolside has released Laguna XS.2, the second-generation model in its XS size class for agentic coding workflows. The model offers 128K context window, up to 8K output tokens, and is quantized to fp8 for efficiency, available free via OpenRouter.

April 28, 2026 · 3:20 PM

changelogOpenAI

OpenAI Makes Whisper Speech Recognition Available on OpenRouter at $0.006 per Minute

OpenAI's Whisper 1 automatic speech recognition model is now accessible through OpenRouter's API routing service. The model supports transcription and translation across 50+ languages from audio files up to 25 MB, priced at $0.006 per minute of audio.

April 28, 2026 · 12:35 AM

April 27, 2026

changelog

Google Releases Gemini Flash Latest Router with 1M+ Token Context Window

Google released Gemini Flash Latest on April 27, 2026, a dynamic router that automatically redirects to the newest model in the Gemini Flash family. The model supports 1,048,576 token context window and includes reasoning capabilities.

April 27, 2026 · 7:51 PM

analysis

Qwen releases three new Qwen3.6 models ranging from 27B to flagship Max Preview

Qwen has released three models in its Qwen3.6 series: a flagship Max Preview model, a 35B parameter A3B variant, and a 27B parameter base model. All three models are now accessible through OpenRouter's API platform.

April 27, 2026 · 3:51 AM

April 23, 2026

model releaseTencent

Tencent Releases Hy3 Preview MoE Model with 262K Context and Three Reasoning Modes

Tencent has released Hy3 Preview, a Mixture-of-Experts model offering 262,144 token context window and three configurable reasoning modes (disabled, low, high) for production agentic workflows. The model is available for free through OpenRouter.

April 23, 2026 · 5:20 AM

model release

Baidu Releases Free Qianfan-OCR-Fast Model with 65K Context Window

Baidu has released Qianfan-OCR-Fast, a specialized OCR model with a 65,536 token context window, available at zero cost through OpenRouter. The model launched on April 20, 2026, and is positioned as a performance upgrade over the original Qianfan-OCR.

April 23, 2026 · 2:20 AM

April 21, 2026

model releaseOpenAI

OpenAI Releases GPT-5.4 Image 2 with 272K Context Window and Image Generation

OpenAI has released GPT-5.4 Image 2, combining the GPT-5.4 reasoning model with image generation capabilities. The multimodal model features a 272K token context window and is priced at $8 per million input tokens and $15 per million output tokens.

April 21, 2026 · 9:35 PM

model release

InclusionAI releases Ling-2.6-flash: 104B parameter model with 7.4B active parameters, free on OpenRouter

InclusionAI has released Ling-2.6-flash, an instruction-tuned model with 104 billion total parameters and 7.4 billion active parameters, available free through OpenRouter. The model features a 262,144-token context window and is designed for agent workflows requiring fast responses and high token efficiency.

April 21, 2026 · 6:50 PM

April 15, 2026

model release

Meta releases Llama Guard 4, a 12B parameter multimodal safety classifier with 164K context window

Meta has released Llama Guard 4, a 12-billion parameter content safety classifier derived from Llama 4 Scout. The model features a 163,840 token context window and can classify both text and image content, available free through OpenRouter with an August 31, 2024 knowledge cutoff.

April 15, 2026 · 5:51 PM

April 13, 2026

model release+1

OpenRouter Releases Elephant Alpha: 100B-Parameter Model with 256K Context Window and Free Pricing

OpenRouter has released Elephant Alpha, a 100B-parameter text model with a 256K context window and 32K output token limit. The model is available at no cost through OpenRouter's platform, supporting function calling, structured output, and prompt caching.

April 13, 2026 · 3:36 PM

← Back to all news