Alibaba Qwen Releases Qwen3.6 Flash with 1M Context Window at $0.25 per 1M Input Tokens
Alibaba's Qwen team has released Qwen3.6 Flash, a multimodal language model supporting text, image, and video input with a 1 million token context window. The model is priced at $0.25 per 1M input tokens and $1.50 per 1M output tokens, with tiered pricing above 256K tokens.
Qwen3.6 Flash — Quick Specs
Alibaba Qwen Releases Qwen3.6 Flash with 1M Context Window
Alibaba's Qwen team has released Qwen3.6 Flash, a multimodal language model that processes text, image, and video inputs with a 1 million token context window. Released on April 27, 2026, the model is positioned as a fast, efficient option in the Qwen 3.6 series.
Pricing and Technical Specifications
The model is priced at $0.25 per 1 million input tokens and $1.50 per 1 million output tokens for prompts up to 256K tokens. According to the release information, tiered pricing applies for requests exceeding 256K tokens, though specific rates for higher tiers were not disclosed.
The 1M token context window places Qwen3.6 Flash among models with extended context capabilities, though it falls short of some competitors offering 2M+ token windows. The model supports prompt caching with separate pricing for cache read and cache creation operations.
Multimodal Capabilities
Qwen3.6 Flash handles three input modalities: text, images, and video. This positions it as a general-purpose multimodal model, though specific benchmark scores and performance metrics were not provided in the release announcement.
The model is available through OpenRouter, which routes requests across multiple providers with automatic fallback for uptime optimization. OpenRouter's implementation supports reasoning-enabled features, allowing the model to display step-by-step thinking processes through a reasoning_details array in API responses.
API Integration
Developers can access Qwen3.6 Flash through OpenRouter's normalized API, which maintains compatibility with OpenAI SDK conventions. The platform provides request routing to optimize for prompt size and parameters, with provider fallbacks to maintain service availability.
What This Means
Qwen3.6 Flash represents Alibaba's continued push into competitive AI model pricing while expanding multimodal capabilities. The $0.25 per 1M input tokens rate undercuts several major competitors, though direct performance comparisons remain unclear without published benchmark scores. The tiered pricing structure for larger contexts suggests the model is optimized for shorter interactions, with the 256K threshold marking a significant cost increase point. Video input support is notable, as this capability remains relatively uncommon among broadly available language models.
Related Articles
Anthropic releases Fable 5, bringing capabilities of restricted Mythos model to public with $10/$50 per 1M token pricing
Anthropic has released Fable 5, making capabilities from its previously restricted Mythos model available to the public. The company claims Fable 5 beats GPT-5.5, Gemini 3.1 Pro, and its own Opus 4.8 in internal testing, with pricing set at $10 per million input tokens and $50 per million output tokens after a free trial period ending June 22.
Anthropic releases Claude Fable 5, first public Mythos-class model at $10/$50 per million tokens
Anthropic has released Claude Fable 5, its first publicly available Mythos-class model, at $10 per million input tokens and $50 per million output tokens—less than half the price of Claude Mythos Preview. The model includes safeguards that redirect sensitive queries to Claude Opus 4.8 in less than 5% of sessions.
Anthropic releases Claude Fable 5 with Mythos-class capabilities at $10/$50 per million tokens
Anthropic released Claude Fable 5, a Mythos-class model, to enterprise customers and paid subscribers two months after limiting its advanced Mythos model to select users. The new model costs $10 per million input tokens and $50 per million output tokens—twice the price of Claude Opus 4.8—and includes safeguards that block responses in high-risk areas like cybersecurity and biology.
Anthropic releases Claude Fable 5, a safety-limited version of Mythos, at $10/$50 per million tokens
Anthropic released Claude Fable 5, the first publicly available version of its Mythos model, with built-in safety restrictions that automatically block high-risk queries in cybersecurity, biology, chemistry, and related fields. The model costs $10 per million input tokens and $50 per million output tokens, double the price of Claude Opus 4.8.
Comments
Loading...