model releaseTencent

Tencent Releases Hy3 Preview: Mixture-of-Experts Model with 262K Context and Configurable Reasoning

TL;DR

Tencent has released Hy3 preview, a Mixture-of-Experts model with a 262,144 token context window priced at $0.066 per million input tokens and $0.26 per million output tokens. The model features three configurable reasoning modes—disabled, low, and high—designed for agentic workflows and production environments.

2 min read
0

Hy3 Preview — Quick Specs

Context window262K tokens
Input$0.066/1M tokens
Output$0.26/1M tokens

Tencent Releases Hy3 Preview: Mixture-of-Experts Model with 262K Context and Configurable Reasoning

Tencent has released Hy3 preview, a Mixture-of-Experts (MoE) model with a 262,144 token context window, priced at $0.066 per million input tokens and $0.26 per million output tokens.

Key Specifications

The model supports three configurable reasoning levels: disabled, low, and high modes. According to Tencent, this allows users to balance computational speed against reasoning depth depending on task requirements.

Hy3 preview is designed specifically for agentic workflows and production environments. Tencent claims the model delivers strong code generation capabilities and reliable performance across multi-step, real-world workflows.

Reasoning Architecture

The model exposes its reasoning process through a reasoning_details array in API responses. When enabled, the model shows step-by-step thinking before producing final answers. To maintain reasoning continuity across conversation turns, developers must preserve the complete reasoning_details when passing messages back to the model.

The reasoning feature is controlled via a reasoning parameter in API requests, allowing developers to toggle between the three modes based on task complexity.

Availability

Hy3 preview is available through OpenRouter's API, which normalizes requests and responses across multiple model providers. The model was released on April 22, 2026, according to the OpenRouter model registry.

Model weights are available, though distribution details were not specified in the release information.

Pricing Context

At $0.066 per million input tokens, Hy3 preview positions itself in the lower cost tier for frontier models. The 3.9x multiplier between input and output pricing ($0.26 per million output tokens) is standard for models with generation-focused workloads.

What This Means

Tencent's entry with a configurable reasoning model signals continued competition in the agent-focused AI space. The three-tier reasoning system is a practical approach to the speed-versus-accuracy tradeoff that developers face when building production systems. The 262K context window places it in the extended-context category, though still below the 1M+ context leaders. The combination of MoE architecture, configurable reasoning, and competitive pricing makes this a relevant option for developers building multi-step agentic applications who need cost-effective inference with reasoning capabilities.

Related Articles

model release

Poolside releases Laguna M.1: 225B parameter MoE model scores 74.6% on SWE-bench Verified

Poolside has released Laguna M.1, a 225B total parameter Mixture-of-Experts model with 23B activated parameters per token, designed for agentic coding tasks. The model scores 74.6% on SWE-bench Verified and 63.1% on SWE-bench Multilingual, released under Apache 2.0 license.

model release

Mistral Releases Mistral 3 Family: 675B-Parameter Large 3 MoE and Three Edge Models Under Apache 2.0

Mistral has released Mistral 3, including Mistral Large 3—a sparse mixture-of-experts model with 41B active and 675B total parameters—and three Ministral 3 edge models (3B, 8B, 14B). All models are released under Apache 2.0 license with multimodal capabilities and are available today on multiple platforms.

model release

Cohere releases North Mini Code, a 30B-parameter sparse MoE coding model with 256K context window, free on OpenRouter

Cohere has released North Mini Code, the first model in its North family and its first agentic coding model. The sparse mixture-of-experts architecture features 30B total parameters with 3B active, a 256K-token context window, and up to 64K tokens of output, available free via OpenRouter under Apache 2.0 license.

product update

Tencent tests AI assistant Xiaowei in WeChat's 1.4 billion user base

Tencent is testing an AI assistant called Xiaowei in Weixin, the Chinese version of WeChat, which has over 1.4 billion monthly active users combined with WeChat. Users can interact with Xiaowei through text or voice, communicate with friends, and launch mini-programs within the app.

Comments

Loading...