model release

Alibaba's Qwen3.6 Plus reaches 78.8 on SWE-bench with 1M context window

TL;DR

Alibaba released Qwen3.6 Plus on April 2, 2026, featuring a 1 million token context window at $0.50 per million input tokens and $3 per million output tokens. The model combines linear attention with sparse mixture-of-experts routing to achieve a 78.8 score on SWE-bench Verified, with significant improvements in agentic coding, front-end development, and reasoning tasks.

April 8, 2026 · 1:05 AM2 min read

Qwen 3.6 Plus — Quick Specs

Context window1000K tokens

Input$0.5/1M tokens

Output$3/1M tokens

Compare Qwen 3.6 Plus with other models →

Alibaba's Qwen3.6 Plus Reaches 78.8 on SWE-bench with Million-Token Context

Alibaba released Qwen3.6 Plus on April 2, 2026, introducing a model that combines efficient linear attention with sparse mixture-of-experts routing to handle complex reasoning and coding tasks at scale.

Key Specifications

Context and Pricing:

Context window: 1,000,000 tokens
Input pricing: $0.50 per million tokens
Output pricing: $3.00 per million tokens

Performance: The model achieves a 78.8 score on SWE-bench Verified, positioning it alongside leading state-of-the-art models. Alibaba claims major improvements over the 3.5 series in agentic coding, front-end development, and overall reasoning capabilities.

Architecture and Capabilities

Qwen3.6 Plus uses a hybrid architecture combining:

Efficient linear attention mechanisms for scalability
Sparse mixture-of-experts routing for high-performance inference

According to Alibaba, the model excels at complex tasks including 3D scene generation, game development, and repository-level problem solving. The company describes particular improvements in "vibe coding experience," though this term lacks precise technical definition.

The model claims substantial performance gains in both pure-text and multimodal tasks, though specific benchmark comparisons to previous versions remain undisclosed.

Data Collection Notice

Alibaba explicitly states that Qwen3.6 Plus collects prompt and completion data for model improvement purposes. Users should review privacy implications before deploying in sensitive applications.

Deployment Availability

Qwen3.6 Plus is available through OpenRouter, which routes requests across multiple providers to optimize for context window support and uptime. The platform provides normalized API access across providers and supports reasoning-enabled inference with step-by-step thinking visibility.

What This Means

Qwen3.6 Plus positions Alibaba's Qwen line as a competitive option in the 1M-context segment, matching context window sizes offered by Claude 3.5 and other leaders. The 78.8 SWE-bench score places it in the tier of capable coding models, though detailed comparisons to other 1M-context models remain unavailable. Pricing at $0.50/$3.00 per million tokens is competitive with other high-context models. The explicit data collection policy requires careful consideration for enterprises handling proprietary code or sensitive information.

Source: openrouter.ai ↗

qwen alibaba model-release 1m-context coding swe-bench moe april-2026

model releaseMay 21, 2026

Alibaba Releases Qwen3.7 Max with 1M Token Context Window for Agent and Coding Tasks

Alibaba has released Qwen3.7 Max, the flagship model in its Qwen3.7 series, featuring a 1 million token context window. The text-only model is designed for agent-centric workloads with strengths in coding, office productivity, and long-horizon autonomous execution, and includes explicit prompt caching support.

model releaseMay 19, 2026

Google releases Gemini 3.5 Flash with autonomous coding and agent capabilities, claims 4x speed boost

Google released Gemini 3.5 Flash, positioning it as an agent-first model designed for autonomous coding and multi-hour workflows. The company claims the model outperforms its 3.1 Pro predecessor on coding and agentic benchmarks while running 4x faster than competing frontier models, with an optimized version achieving 12x speed gains.

model releaseMay 22, 2026

Tencent Releases Hy-MT2: 1.8B Translation Model Compressed to 440MB With 1.25-Bit Quantization

Tencent has open-sourced Hy-MT2, a family of multilingual translation models available in 1.8B, 7B, and 30B-A3B parameter sizes. The models support translation across 33 languages and include extreme quantization down to 1.25-bit, reducing the 1.8B model to 440MB storage while increasing inference speed by 1.5x.

model releaseMay 20, 2026

NemoStation releases Marlin-2B: 2-billion parameter video VLM achieves dense captioning performance between Tarsier-34B

NemoStation has released Marlin-2B, a 2-billion parameter video vision-language model that produces structured scene and event captions with second-precise timestamps. The model tops the CaReBench dense captioning leaderboard and sits between Tarsier-34B and Gemini-1.5-Pro on DREAM-1K, while matching Gemini-2.0-Flash on temporal grounding benchmarks.