model release

Moonshot AI Launches 'Kimi Latest' Router Model with 262K Context Window

TL;DR

Moonshot AI released Kimi Latest, a router endpoint that automatically redirects to the most recent model in the Kimi family. The model features a 262,144 token context window, though specific pricing and performance benchmarks have not been disclosed.

April 27, 2026 · 7:51 PM2 min read

Kimi Latest — Quick Specs

Context window262K tokens

Input$3/1M tokens

Output$15/1M tokens

Compare Kimi Latest with other models →

Moonshot AI Launches 'Kimi Latest' Router Model with 262K Context Window

Moonshot AI released Kimi Latest on April 27, 2026, a router endpoint that automatically redirects requests to the most recent model in the company's Kimi family. The model offers a 262,144 token context window.

Technical Specifications

The router model is designed to always point to the latest version of Moonshot AI's Kimi models, eliminating the need for developers to manually update model endpoints when new versions are released. This approach mirrors practices used by other AI providers for maintaining current model access.

The 262,144 token context window (approximately 262K tokens) positions the model among large context offerings in the market, though Moonshot AI has not disclosed whether this represents an increase from previous Kimi models.

Availability and Pricing

The model is available through OpenRouter, which normalizes API requests across providers. Pricing per million tokens has not been disclosed. OpenRouter indicates the model supports reasoning-enabled features, allowing access to step-by-step thinking processes through the reasoning_details array in API responses.

No usage activity data is currently available on OpenRouter's platform, as indicated by the provider's dashboard.

Integration Details

Developers can access Kimi Latest through OpenRouter's API using the model identifier ~moonshotai/kimi-latest. The platform supports both OpenRouter SDK and OpenAI SDK integration methods, along with various third-party frameworks.

What This Means

Router endpoints that auto-update to the latest model version reduce integration maintenance for developers but sacrifice version pinning control. The 262K context window suggests Moonshot AI is competing in the long-context segment, though without disclosed pricing or benchmark scores, it's difficult to assess competitiveness against established models from Anthropic (Claude 3.5 with 200K context) or Google (Gemini 1.5 Pro with 2M context). The lack of usage data and performance metrics on OpenRouter may indicate a very recent release or limited early adoption.

Source: openrouter.ai ↗

moonshot-ai kimi router-model long-context openrouter model-release

model releaseJuly 24, 2026

Anthropic Launches Claude Opus 5 (Fast) at $10/$50 per Million Tokens, 1M Context Window

Anthropic has released Claude Opus 5 (Fast), a higher-throughput variant of Opus 5 that carries identical capabilities but runs at roughly 2x the price of the standard model. The model ships with a 1 million token context window and is available now through OpenRouter.

model releaseJuly 26, 2026

Microsoft Releases Mage-Flow: Compact 4B Image Generation and Editing Models Matching Systems 5-8x Larger

Microsoft has released Mage-Flow, a family of 4B-parameter image generation and editing models built on a shared tokenizer-transformer stack. According to Microsoft, the Turbo variants match or beat open-source systems with 5-8x more parameters while running in 4 diffusion steps.

model releaseJuly 25, 2026

Microsoft Releases Fara1.5-27B, a 27B Vision-Only Web Browsing Agent with 262K Context

Microsoft Research AI Frontiers has released Fara1.5-27B, a 27-billion-parameter multimodal agent that completes web tasks by reading screenshots and emitting click/type/scroll commands. The model, fine-tuned from Qwen3.5-27B, ships under MIT license with a 262K-token context window and is designed to run alongside Microsoft's MagenticLite sandbox.

model releaseJuly 25, 2026

Anthropic's Claude Opus 5 Hits 0% Prompt Injection Success Rate in Browser Agent Tests, With Defenses Enabled

Anthropic's system card for Claude Opus 5 reports a 0% prompt injection success rate across 129 browser agent test scenarios when Auto Mode is enabled. On Gray Swan's broader indirect prompt injection benchmark, Opus 5 posted a 2.0% attacker success rate after 15 attempts, the lowest among tested frontier models.

Moonshot AI Launches 'Kimi Latest' Router Model with 262K Context Window

Kimi Latest — Quick Specs

Moonshot AI Launches 'Kimi Latest' Router Model with 262K Context Window

Technical Specifications

Availability and Pricing

Integration Details

What This Means

Related Articles

Anthropic Launches Claude Opus 5 (Fast) at $10/$50 per Million Tokens, 1M Context Window

Microsoft Releases Mage-Flow: Compact 4B Image Generation and Editing Models Matching Systems 5-8x Larger

Microsoft Releases Fara1.5-27B, a 27B Vision-Only Web Browsing Agent with 262K Context

Anthropic's Claude Opus 5 Hits 0% Prompt Injection Success Rate in Browser Agent Tests, With Defenses Enabled

Comments