model release

Alibaba releases Qwen 3.6 Plus Preview with 1M token context, free via OpenRouter

TL;DR

Alibaba's Qwen division has released Qwen 3.6 Plus Preview, a free multimodal model available via OpenRouter with a 1,000,000 token context window. The model claims stronger reasoning and more reliable agentic behavior compared to the 3.5 series, with particular strength in coding and complex problem-solving tasks.

March 30, 2026 · 6:50 PM2 min read

Qwen 3.6 Plus Preview — Quick Specs

Context window1000K tokens

Compare Qwen 3.6 Plus Preview with other models →

Alibaba Releases Qwen 3.6 Plus Preview at No Cost

Alibaba's Qwen team has made Qwen 3.6 Plus Preview available as a free model through OpenRouter's API. The model is accessible as qwen/qwen3.6-plus-preview:free.

Technical Specifications

Qwen 3.6 Plus Preview features a 1,000,000 token context window, enabling processing of substantially longer documents and code repositories in single requests. The model operates as a text-to-text system with a hybrid architecture designed to improve both efficiency and scalability.

According to Alibaba, the model demonstrates performance at or above state-of-the-art benchmarks, though specific benchmark scores have not been disclosed in available materials.

Key Capabilities

The company emphasizes three primary use cases:

Agentic coding: The model handles complex code generation, debugging, and autonomous code tasks
Front-end development: Specialized performance for web interface development workflows
Complex problem-solving: Enhanced reasoning capabilities for multi-step logical tasks

Alibaba claims the model delivers "stronger reasoning" and "more reliable agentic behavior" than its Qwen 3.5 predecessor, though quantitative comparisons have not been provided.

Data Collection Notice

Users should note that Qwen 3.6 Plus Preview collects prompt and completion data that Alibaba states will be used to improve the model. This is a standard practice among providers offering free model access but represents a consideration for users handling sensitive information.

Pricing and Availability

The model is free to use via OpenRouter's API with no disclosed usage limits or rate restrictions. This positions it competitively against other free-tier model offerings from major providers.

Access requires an OpenRouter API key. The preview designation suggests this may represent an early-stage release with potential for changes or discontinuation, typical for model preview programs.

What This Means

Alibaba continues expanding Qwen's presence in the open-access model ecosystem. A 1M context window at no cost represents genuine capacity for long-document processing tasks—useful for summarizing codebases, processing full books, or analyzing extended conversations. The focus on agentic behavior and coding aligns with current market demand for models that can function as autonomous agents rather than simple text processors.

The data collection caveat means organizations must evaluate whether their use cases align with Alibaba's model improvement protocols. For non-sensitive applications, the combination of scale (1M tokens), claimed performance parity with SOTA models, and zero cost makes this worth testing against current paid alternatives.

Source: openrouter.ai ↗

qwen alibaba free-model context-window openrouter multimodal agentic-ai coding

model releaseMay 14, 2026

Baidu Releases Qianfan-OCR-Fast Model with 66K Context at $0.68 Per 1M Input Tokens

Baidu has released Qianfan-OCR-Fast, a multimodal model specialized for optical character recognition tasks. The model offers a 66,000 token context window and is priced at $0.68 per 1M input tokens and $2.81 per 1M output tokens.

model releaseMay 7, 2026

Google releases Gemini 3.1 Flash Lite with 1M context at $0.25 per million input tokens

Google has released Gemini 3.1 Flash Lite, a high-efficiency multimodal model with a 1,048,576 token context window priced at $0.25 per million input tokens and $1.50 per million output tokens. The model supports text, image, video, audio, and PDF inputs with four thinking levels for cost-performance optimization.

model releaseMay 13, 2026

DeepSeek Releases V4 Flash: 284B-Parameter MoE Model with 1M Context Window, Free via OpenRouter

DeepSeek has released V4 Flash, a Mixture-of-Experts model with 284B total parameters and 13B activated parameters per forward pass. The model supports a 1M-token context window and is available free through OpenRouter, targeting high-throughput coding and chat applications.

model releaseMay 12, 2026

Perceptron Launches Mk1 Vision-Language Model with Video Reasoning at $0.15/$1.50 per 1M Tokens

Perceptron has released Perceptron Mk1, a vision-language model designed for video understanding and embodied reasoning tasks. The model accepts image and video inputs with 33K context window, priced at $0.15 per 1M input tokens and $1.50 per 1M output tokens, and supports structured spatial annotations on demand.