model release

Alibaba releases Qwen3.6-Plus with 1M token context, claims performance near Claude 4.5 Opus

TL;DR

Alibaba has released Qwen3.6-Plus, its third proprietary AI model in days, featuring a 1 million token context window available via Alibaba Cloud Model Studio API. The model claims improved agentic coding capabilities and partially outperforms Anthropic's Claude 4.5 Opus in Alibaba-conducted benchmarks, though trails Claude 4.6 Opus released in December 2025.

April 2, 2026 · 3:05 PM1 min read

Qwen 3.6 Plus — Quick Specs

Context window1000K tokens

Input$0.5/1M tokens

Output$3/1M tokens

Compare Qwen 3.6 Plus with other models →

Alibaba Launches Qwen3.6-Plus with 1M Token Context Window

Alibaba has released Qwen3.6-Plus, its third proprietary AI model announcement within days, marking an acceleration in the company's shift toward monetized, closed-source models.

Specifications and Capabilities

Qwen3.6-Plus offers a context window of 1 million tokens and is available through Alibaba Cloud Model Studio API. According to the Qwen team, the model prioritizes improved capabilities for agentic coding tasks, including frontend development and complex code generation.

The model will be integrated into Alibaba's Qwen chatbot application and its enterprise AI service Wukong.

Benchmark Claims

In benchmarks published by Alibaba, Qwen3.6-Plus partially outperforms Anthropic's Claude 4.5 Opus. However, Alibaba notes that these measurements were conducted internally. Claude 4.6 Opus, released in December 2025, scores 65.4 percent on Terminal-Bench 2.0, placing it ahead of Qwen3.6-Plus on that metric.

The model outperforms Alibaba's own Qwen3.5 model in tested benchmarks.

Strategic Shift to Proprietary Models

Alibaba has fundamentally changed its Qwen model strategy. Historically, the company released Qwen models as open-source. The latest Qwen3.5-Omni is also unavailable as open-source, reflecting the company's decision to monetize AI capabilities for enterprise customers.

This shift occurs as Alibaba Cloud faces intense competition from ByteDance. According to Bloomberg, Alibaba is targeting $100 billion in AI revenue over the next five years, requiring proprietary products with clear monetization paths.

What This Means

Alibaba is aggressively consolidating AI product offerings while competing directly against well-funded ByteDance operations. The three-model-in-days release cadence signals accelerated iteration but raises questions about differentiation. Qwen3.6-Plus's claimed parity with Claude 4.5 Opus—while behind the December 2025 Claude 4.6—positions it as a viable enterprise alternative, particularly in Asian markets where Alibaba maintains infrastructure advantages. The shift from open-source to proprietary models prioritizes near-term revenue over ecosystem building, a reversal of past strategy.

Source: the-decoder.com ↗

alibaba qwen proprietary-models 1m-context agentic-ai coding benchmarks

model releaseMay 19, 2026

Google releases Gemini 3.5 Flash with autonomous coding and agent capabilities, claims 4x speed boost

Google released Gemini 3.5 Flash, positioning it as an agent-first model designed for autonomous coding and multi-hour workflows. The company claims the model outperforms its 3.1 Pro predecessor on coding and agentic benchmarks while running 4x faster than competing frontier models, with an optimized version achieving 12x speed gains.

model releaseMay 20, 2026

NemoStation releases Marlin-2B: 2-billion parameter video VLM achieves dense captioning performance between Tarsier-34B

NemoStation has released Marlin-2B, a 2-billion parameter video vision-language model that produces structured scene and event captions with second-precise timestamps. The model tops the CaReBench dense captioning leaderboard and sits between Tarsier-34B and Gemini-1.5-Pro on DREAM-1K, while matching Gemini-2.0-Flash on temporal grounding benchmarks.

model releaseMay 19, 2026

Google Releases Gemini 3.5 Flash with 1M Token Context and Configurable Thinking Modes at $1.50/$9 Per Million Tokens

Google has released Gemini 3.5 Flash, a multimodal model with a 1 million token context window priced at $1.50 per million input tokens and $9 per million output tokens. The model supports text, image, video, audio, and PDF inputs with configurable thinking effort levels from minimal to high.

model releaseMay 15, 2026

Microsoft Releases Fara-7B: 7B Parameter Computer Use Agent Trained in 2.5 Days on 64 H100s

Microsoft Research has released Fara-7B, a 7-billion parameter small language model designed for computer automation tasks. The model, which took 2.5 days to train on 64 H100 GPUs, can navigate websites to complete tasks like booking restaurants and shopping, using screenshots as input with a 128K token context window.