model release

Alibaba Qwen Releases 27B Parameter Model with 262K Context Window, Claims 77.2% on SWE-bench Verified

TL;DR

Alibaba Qwen released Qwen3.6-27B, a 27-billion parameter model with a 262,144 token context window extensible to 1,010,000 tokens. The model claims 77.2% on SWE-bench Verified and 53.5% on SWE-bench Pro, with open weights available on Hugging Face.

April 22, 2026 · 2:06 PM2 min read

Qwen3.6 27B — Quick Specs

Context window262K tokens

Input$0.195/1M tokens

Output$1.56/1M tokens

Compare Qwen3.6 27B with other models →

Qwen3.6-27B Released with 262K Context Window

Alibaba Qwen released Qwen3.6-27B, a 27-billion parameter open-weight language model with a 262,144 token context window natively, extensible to 1,010,000 tokens. The model represents the first release in the Qwen3.6 series following the February 2025 Qwen3.5 launch.

Architecture Details

Qwen3.6-27B uses a non-standard transformer architecture with 64 layers and a 5,120 hidden dimension. The model employs a hybrid attention mechanism: 16 blocks of 3 "Gated DeltaNet" layers followed by 1 "Gated Attention" layer per block.

The Gated DeltaNet uses 48 linear attention heads for V and 16 for QK with 128 head dimension. The Gated Attention uses 24 attention heads for Q and 4 for KV with 256 head dimension. The model has 248,320 tokens in its vocabulary (padded) and uses rotary position embeddings with 64 dimensions.

Benchmark Performance

According to Alibaba, Qwen3.6-27B achieves:

SWE-bench Verified: 77.2% (vs. 80.9% for Claude 4.5 Opus)
SWE-bench Pro: 53.5% (vs. 57.1% for Claude 4.5 Opus)
SWE-bench Multilingual: 71.3%
Terminal-Bench 2.0: 59.3% (tied with Claude 4.5 Opus)
SkillsBench Avg5: 48.2%
MMLU-Pro: 86.2%
GPQA Diamond: 87.8%
AIME 2026: 94.1%

The company evaluated models using internal agent scaffolds with temperature 1.0, top_p 0.95, and 200K context windows for SWE-bench series tests. All benchmarks used 256K context windows unless specified otherwise.

Vision Capabilities

The model includes a vision encoder for multimodal tasks. According to Alibaba's benchmarks:

MMMU: 82.9%
MathVista mini: 87.4%
VideoMME (with subtitles): 87.7%
AndroidWorld: 70.3%
Visual Agent V*: 94.7%

Key Features

Qwen3.6-27B introduces "thinking preservation" to retain reasoning context from historical messages during iterative development. The model supports multi-token prediction (MTP) during training and can be deployed with speculative decoding for faster inference.

The model is compatible with Hugging Face Transformers, vLLM (version 0.19.0+), SGLang (version 0.5.10+), and KTransformers. Alibaba recommends maintaining at least 128K token context length to preserve reasoning capabilities, though the default is 262K tokens.

Availability

Pricing has not been disclosed. Open weights are available on Hugging Face under the repository Qwen/Qwen3.6-27B. The model can be served via OpenAI-compatible APIs using standard inference frameworks.

What This Means

Qwen3.6-27B enters the competitive 20-30B parameter space with strong coding performance claims, particularly on repository-level tasks. The 262K native context window and 1M token extensibility position it for long-context applications, though real-world performance at extended lengths requires independent verification. The hybrid Gated DeltaNet/Attention architecture is unconventional and may offer efficiency advantages, but deployment complexity compared to standard transformers remains to be seen in production environments.

Source: huggingface.co ↗

qwen alibaba open-weights coding multimodal long-context model-release

model releaseJuly 20, 2026

Alibaba releases Qwen 3.8, a 2.4 trillion parameter open-weight model claiming second place behind Fable 5

Alibaba has released Qwen 3.8, a 2.4 trillion parameter open-weight model that the company claims trails only Fable 5. The multimodal model processes images, videos, and documents, with a preview available through Alibaba's platforms at 10 percent of standard pricing.

model releaseJuly 20, 2026

Alibaba previews Qwen3.8 with 2.4 trillion parameters, claims second place without benchmark data

Alibaba unveiled Qwen3.8 at the World Artificial Intelligence Conference in Shanghai, claiming the 2.4 trillion parameter model ranks second only to Anthropic's Fable 5. The company provided no benchmark scores, model card, or independent verification to support the claim.

model releaseJuly 21, 2026

Alibaba Releases Qwen-Image-3.0, an Image Generator That Renders 10-Pixel Text and 3x3 Infographic Grids in One Pass

Alibaba's Qwen team has released Qwen-Image-3.0, an image generator that accepts prompts up to 4,500 tokens and can render legible text as small as ten pixels, complex LaTeX formulas, and twelve languages in a single pass. The model is currently invite-only via API, and unlike its predecessor, it likely won't ship with open weights.

model releaseJuly 20, 2026

Thinking Machines releases Inkling: 975B-parameter MoE model with Apache 2.0 license, first major US open-weight multimo

Thinking Machines Lab released Inkling, a mixture-of-experts model with 975B total parameters and 41B active parameters, trained on 45 trillion tokens across text, images, audio, and video. The Apache 2.0-licensed model supports up to 1M context and debuts alongside Inkling-Small (276B-A12B), marking what observers call the strongest US-based open-weight release to date.

Alibaba Qwen Releases 27B Parameter Model with 262K Context Window, Claims 77.2% on SWE-bench Verified

Qwen3.6 27B — Quick Specs

Qwen3.6-27B Released with 262K Context Window

Architecture Details

Benchmark Performance

Vision Capabilities

Key Features

Availability

What This Means

Related Articles

Alibaba releases Qwen 3.8, a 2.4 trillion parameter open-weight model claiming second place behind Fable 5

Alibaba previews Qwen3.8 with 2.4 trillion parameters, claims second place without benchmark data

Alibaba Releases Qwen-Image-3.0, an Image Generator That Renders 10-Pixel Text and 3x3 Infographic Grids in One Pass

Thinking Machines releases Inkling: 975B-parameter MoE model with Apache 2.0 license, first major US open-weight multimo

Comments