Alibaba's Qwen3.6 Plus reaches 78.8 on SWE-bench with 1M context window
Alibaba released Qwen3.6 Plus on April 2, 2026, featuring a 1 million token context window at $0.50 per million input tokens and $3 per million output tokens. The model combines linear attention with sparse mixture-of-experts routing to achieve a 78.8 score on SWE-bench Verified, with significant improvements in agentic coding, front-end development, and reasoning tasks.
Qwen 3.6 Plus — Quick Specs
Alibaba's Qwen3.6 Plus Reaches 78.8 on SWE-bench with Million-Token Context
Alibaba released Qwen3.6 Plus on April 2, 2026, introducing a model that combines efficient linear attention with sparse mixture-of-experts routing to handle complex reasoning and coding tasks at scale.
Key Specifications
Context and Pricing:
- Context window: 1,000,000 tokens
- Input pricing: $0.50 per million tokens
- Output pricing: $3.00 per million tokens
Performance: The model achieves a 78.8 score on SWE-bench Verified, positioning it alongside leading state-of-the-art models. Alibaba claims major improvements over the 3.5 series in agentic coding, front-end development, and overall reasoning capabilities.
Architecture and Capabilities
Qwen3.6 Plus uses a hybrid architecture combining:
- Efficient linear attention mechanisms for scalability
- Sparse mixture-of-experts routing for high-performance inference
According to Alibaba, the model excels at complex tasks including 3D scene generation, game development, and repository-level problem solving. The company describes particular improvements in "vibe coding experience," though this term lacks precise technical definition.
The model claims substantial performance gains in both pure-text and multimodal tasks, though specific benchmark comparisons to previous versions remain undisclosed.
Data Collection Notice
Alibaba explicitly states that Qwen3.6 Plus collects prompt and completion data for model improvement purposes. Users should review privacy implications before deploying in sensitive applications.
Deployment Availability
Qwen3.6 Plus is available through OpenRouter, which routes requests across multiple providers to optimize for context window support and uptime. The platform provides normalized API access across providers and supports reasoning-enabled inference with step-by-step thinking visibility.
What This Means
Qwen3.6 Plus positions Alibaba's Qwen line as a competitive option in the 1M-context segment, matching context window sizes offered by Claude 3.5 and other leaders. The 78.8 SWE-bench score places it in the tier of capable coding models, though detailed comparisons to other 1M-context models remain unavailable. Pricing at $0.50/$3.00 per million tokens is competitive with other high-context models. The explicit data collection policy requires careful consideration for enterprises handling proprietary code or sensitive information.
Related Articles
Alibaba Releases Qwen3.7 Max with 1M Token Context Window for Agent and Coding Tasks
Alibaba has released Qwen3.7 Max, the flagship model in its Qwen3.7 series, featuring a 1 million token context window. The text-only model is designed for agent-centric workloads with strengths in coding, office productivity, and long-horizon autonomous execution, and includes explicit prompt caching support.
Google releases Gemini 3.5 Flash with autonomous coding and agent capabilities, claims 4x speed boost
Google released Gemini 3.5 Flash, positioning it as an agent-first model designed for autonomous coding and multi-hour workflows. The company claims the model outperforms its 3.1 Pro predecessor on coding and agentic benchmarks while running 4x faster than competing frontier models, with an optimized version achieving 12x speed gains.
Tencent Releases Hy-MT2: 1.8B Translation Model Compressed to 440MB With 1.25-Bit Quantization
Tencent has open-sourced Hy-MT2, a family of multilingual translation models available in 1.8B, 7B, and 30B-A3B parameter sizes. The models support translation across 33 languages and include extreme quantization down to 1.25-bit, reducing the 1.8B model to 440MB storage while increasing inference speed by 1.5x.
NemoStation releases Marlin-2B: 2-billion parameter video VLM achieves dense captioning performance between Tarsier-34B
NemoStation has released Marlin-2B, a 2-billion parameter video vision-language model that produces structured scene and event captions with second-precise timestamps. The model tops the CaReBench dense captioning leaderboard and sits between Tarsier-34B and Gemini-1.5-Pro on DREAM-1K, while matching Gemini-2.0-Flash on temporal grounding benchmarks.
Comments
Loading...