model release

Moonshot AI Launches 'Kimi Latest' Router Model with 262K Context Window

TL;DR

Moonshot AI released Kimi Latest, a router endpoint that automatically redirects to the most recent model in the Kimi family. The model features a 262,144 token context window, though specific pricing and performance benchmarks have not been disclosed.

2 min read
0

Moonshot AI Launches 'Kimi Latest' Router Model with 262K Context Window

Moonshot AI released Kimi Latest on April 27, 2026, a router endpoint that automatically redirects requests to the most recent model in the company's Kimi family. The model offers a 262,144 token context window.

Technical Specifications

The router model is designed to always point to the latest version of Moonshot AI's Kimi models, eliminating the need for developers to manually update model endpoints when new versions are released. This approach mirrors practices used by other AI providers for maintaining current model access.

The 262,144 token context window (approximately 262K tokens) positions the model among large context offerings in the market, though Moonshot AI has not disclosed whether this represents an increase from previous Kimi models.

Availability and Pricing

The model is available through OpenRouter, which normalizes API requests across providers. Pricing per million tokens has not been disclosed. OpenRouter indicates the model supports reasoning-enabled features, allowing access to step-by-step thinking processes through the reasoning_details array in API responses.

No usage activity data is currently available on OpenRouter's platform, as indicated by the provider's dashboard.

Integration Details

Developers can access Kimi Latest through OpenRouter's API using the model identifier ~moonshotai/kimi-latest. The platform supports both OpenRouter SDK and OpenAI SDK integration methods, along with various third-party frameworks.

What This Means

Router endpoints that auto-update to the latest model version reduce integration maintenance for developers but sacrifice version pinning control. The 262K context window suggests Moonshot AI is competing in the long-context segment, though without disclosed pricing or benchmark scores, it's difficult to assess competitiveness against established models from Anthropic (Claude 3.5 with 200K context) or Google (Gemini 1.5 Pro with 2M context). The lack of usage data and performance metrics on OpenRouter may indicate a very recent release or limited early adoption.

Related Articles

model release

DeepSeek V4 Pro launches with 1.6T parameters at $1.74/M tokens, undercutting Claude Sonnet 4.6 by 42%

DeepSeek released two preview models: V4 Pro (1.6T total parameters, 49B active) and V4 Flash (284B total, 13B active), both with 1 million token context windows. V4 Pro is priced at $1.74/M input tokens and $3.48/M output—42% cheaper than Claude Sonnet 4.6—while V4 Flash at $0.14/$0.28 per million tokens undercuts all small frontier models.

model release

Xiaomi Releases MiMo-V2.5-Pro: 1.02T Parameter MoE Model with 1M Context Window

Xiaomi has released MiMo-V2.5-Pro, an open-source Mixture-of-Experts model with 1.02 trillion total parameters and 42 billion active parameters. The model supports up to 1 million tokens context length and claims 99.6% on GSM8K and 86.2% on MATH benchmarks.

model release

OpenAI Launches GPT Mini Latest with 400,000 Token Context Window

OpenAI released GPT Mini Latest on April 27, 2025, featuring a 400,000 token context window. The model automatically redirects to the latest version in the OpenAI GPT Mini family, allowing developers to stay current without manual updates.

model release

Alibaba Qwen Releases Qwen3.6 Flash with 1M Context Window at $0.25 per 1M Input Tokens

Alibaba's Qwen team has released Qwen3.6 Flash, a multimodal language model supporting text, image, and video input with a 1 million token context window. The model is priced at $0.25 per 1M input tokens and $1.50 per 1M output tokens, with tiered pricing above 256K tokens.

Comments

Loading...