Moonshot AI Launches 'Kimi Latest' Router Model with 262K Context Window
Moonshot AI released Kimi Latest, a router endpoint that automatically redirects to the most recent model in the Kimi family. The model features a 262,144 token context window, though specific pricing and performance benchmarks have not been disclosed.
Moonshot AI Launches 'Kimi Latest' Router Model with 262K Context Window
Moonshot AI released Kimi Latest on April 27, 2026, a router endpoint that automatically redirects requests to the most recent model in the company's Kimi family. The model offers a 262,144 token context window.
Technical Specifications
The router model is designed to always point to the latest version of Moonshot AI's Kimi models, eliminating the need for developers to manually update model endpoints when new versions are released. This approach mirrors practices used by other AI providers for maintaining current model access.
The 262,144 token context window (approximately 262K tokens) positions the model among large context offerings in the market, though Moonshot AI has not disclosed whether this represents an increase from previous Kimi models.
Availability and Pricing
The model is available through OpenRouter, which normalizes API requests across providers. Pricing per million tokens has not been disclosed. OpenRouter indicates the model supports reasoning-enabled features, allowing access to step-by-step thinking processes through the reasoning_details array in API responses.
No usage activity data is currently available on OpenRouter's platform, as indicated by the provider's dashboard.
Integration Details
Developers can access Kimi Latest through OpenRouter's API using the model identifier ~moonshotai/kimi-latest. The platform supports both OpenRouter SDK and OpenAI SDK integration methods, along with various third-party frameworks.
What This Means
Router endpoints that auto-update to the latest model version reduce integration maintenance for developers but sacrifice version pinning control. The 262K context window suggests Moonshot AI is competing in the long-context segment, though without disclosed pricing or benchmark scores, it's difficult to assess competitiveness against established models from Anthropic (Claude 3.5 with 200K context) or Google (Gemini 1.5 Pro with 2M context). The lack of usage data and performance metrics on OpenRouter may indicate a very recent release or limited early adoption.
Related Articles
Anthropic releases Fable 5, bringing capabilities of restricted Mythos model to public with $10/$50 per 1M token pricing
Anthropic has released Fable 5, making capabilities from its previously restricted Mythos model available to the public. The company claims Fable 5 beats GPT-5.5, Gemini 3.1 Pro, and its own Opus 4.8 in internal testing, with pricing set at $10 per million input tokens and $50 per million output tokens after a free trial period ending June 22.
Anthropic releases Claude Fable 5, first public Mythos-class model at $10/$50 per million tokens
Anthropic has released Claude Fable 5, its first publicly available Mythos-class model, at $10 per million input tokens and $50 per million output tokens—less than half the price of Claude Mythos Preview. The model includes safeguards that redirect sensitive queries to Claude Opus 4.8 in less than 5% of sessions.
Anthropic releases Claude Fable 5 with Mythos-class capabilities at $10/$50 per million tokens
Anthropic released Claude Fable 5, a Mythos-class model, to enterprise customers and paid subscribers two months after limiting its advanced Mythos model to select users. The new model costs $10 per million input tokens and $50 per million output tokens—twice the price of Claude Opus 4.8—and includes safeguards that block responses in high-risk areas like cybersecurity and biology.
Anthropic releases Claude Fable 5, a safety-limited version of Mythos, at $10/$50 per million tokens
Anthropic released Claude Fable 5, the first publicly available version of its Mythos model, with built-in safety restrictions that automatically block high-risk queries in cybersecurity, biology, chemistry, and related fields. The model costs $10 per million input tokens and $50 per million output tokens, double the price of Claude Opus 4.8.
Comments
Loading...