model release

Moonshot AI Launches 'Kimi Latest' Router Model with 262K Context Window

TL;DR

Moonshot AI released Kimi Latest, a router endpoint that automatically redirects to the most recent model in the Kimi family. The model features a 262,144 token context window, though specific pricing and performance benchmarks have not been disclosed.

2 min read
0

Moonshot AI Launches 'Kimi Latest' Router Model with 262K Context Window

Moonshot AI released Kimi Latest on April 27, 2026, a router endpoint that automatically redirects requests to the most recent model in the company's Kimi family. The model offers a 262,144 token context window.

Technical Specifications

The router model is designed to always point to the latest version of Moonshot AI's Kimi models, eliminating the need for developers to manually update model endpoints when new versions are released. This approach mirrors practices used by other AI providers for maintaining current model access.

The 262,144 token context window (approximately 262K tokens) positions the model among large context offerings in the market, though Moonshot AI has not disclosed whether this represents an increase from previous Kimi models.

Availability and Pricing

The model is available through OpenRouter, which normalizes API requests across providers. Pricing per million tokens has not been disclosed. OpenRouter indicates the model supports reasoning-enabled features, allowing access to step-by-step thinking processes through the reasoning_details array in API responses.

No usage activity data is currently available on OpenRouter's platform, as indicated by the provider's dashboard.

Integration Details

Developers can access Kimi Latest through OpenRouter's API using the model identifier ~moonshotai/kimi-latest. The platform supports both OpenRouter SDK and OpenAI SDK integration methods, along with various third-party frameworks.

What This Means

Router endpoints that auto-update to the latest model version reduce integration maintenance for developers but sacrifice version pinning control. The 262K context window suggests Moonshot AI is competing in the long-context segment, though without disclosed pricing or benchmark scores, it's difficult to assess competitiveness against established models from Anthropic (Claude 3.5 with 200K context) or Google (Gemini 1.5 Pro with 2M context). The lack of usage data and performance metrics on OpenRouter may indicate a very recent release or limited early adoption.

Related Articles

model release

Anthropic releases Fable 5, bringing capabilities of restricted Mythos model to public with $10/$50 per 1M token pricing

Anthropic has released Fable 5, making capabilities from its previously restricted Mythos model available to the public. The company claims Fable 5 beats GPT-5.5, Gemini 3.1 Pro, and its own Opus 4.8 in internal testing, with pricing set at $10 per million input tokens and $50 per million output tokens after a free trial period ending June 22.

model release

Anthropic releases Claude Fable 5, first public Mythos-class model at $10/$50 per million tokens

Anthropic has released Claude Fable 5, its first publicly available Mythos-class model, at $10 per million input tokens and $50 per million output tokens—less than half the price of Claude Mythos Preview. The model includes safeguards that redirect sensitive queries to Claude Opus 4.8 in less than 5% of sessions.

model release

Anthropic releases Claude Fable 5 with Mythos-class capabilities at $10/$50 per million tokens

Anthropic released Claude Fable 5, a Mythos-class model, to enterprise customers and paid subscribers two months after limiting its advanced Mythos model to select users. The new model costs $10 per million input tokens and $50 per million output tokens—twice the price of Claude Opus 4.8—and includes safeguards that block responses in high-risk areas like cybersecurity and biology.

model release

Anthropic releases Claude Fable 5, a safety-limited version of Mythos, at $10/$50 per million tokens

Anthropic released Claude Fable 5, the first publicly available version of its Mythos model, with built-in safety restrictions that automatically block high-risk queries in cybersecurity, biology, chemistry, and related fields. The model costs $10 per million input tokens and $50 per million output tokens, double the price of Claude Opus 4.8.

Comments

Loading...