Alibaba's Qwen Releases Qwen3.7 Plus: 1M Context Window at $0.40 Per Million Input Tokens
Alibaba's Qwen has released Qwen3.7 Plus, a multimodal model with a 1 million token context window. The model accepts text and image input with text output, priced at $0.40 per million input tokens and $1.60 per million output tokens through OpenRouter's API.
Qwen3.7 Plus — Quick Specs
Alibaba's Qwen Releases Qwen3.7 Plus: 1M Context Window at $0.40 Per Million Input Tokens
Alibaba's Qwen has released Qwen3.7 Plus, a multimodal model with a 1 million token context window. The model accepts text and image input with text output, priced at $0.40 per million input tokens and $1.60 per million output tokens through OpenRouter's API.
Specifications
Qwen3.7 Plus operates with a 1,000K (1 million) token context window, positioning it among models capable of processing lengthy documents and extended conversations. The model processes both text and image inputs while generating text outputs.
According to Alibaba, Qwen3.7 Plus represents a "cost-effective model" in the Qwen3.7 series, building on the series' text capabilities with what the company describes as a "comprehensive upgrade" to its multimodal processing.
Pricing and Availability
The model is currently available through OpenRouter's API under the identifier qwen/qwen3.7-plus. Pricing is set at:
- Input: $0.40 per million tokens
- Output: $1.60 per million tokens
This pricing positions Qwen3.7 Plus below many frontier models while maintaining a large context window. For comparison, Claude 3.5 Sonnet costs $3.00 per million input tokens with a 200K context window, while GPT-4o costs $2.50 per million input tokens with a 128K context window.
Technical Details
The model's parameter count, training data cutoff date, and benchmark scores have not been disclosed. Information about the model's performance on standard benchmarks such as MMLU, HumanEval, or vision benchmarks remains unavailable at this time.
The designation "Qwen3.7" suggests this is part of Alibaba's third major model generation, though the company has not confirmed whether this represents a 3.7 billion parameter model or a different versioning scheme.
What This Means
Qwen3.7 Plus expands the availability of million-token context models at competitive pricing points, particularly for developers working through OpenRouter's unified API. The combination of multimodal capabilities and extended context at under $0.50 per million input tokens could make it viable for applications requiring large document analysis or extended multi-turn conversations with vision capabilities. However, without published benchmark results, developers will need to conduct their own evaluations to assess the model's performance against alternatives in their specific use cases.
Related Articles
Microsoft releases MAI-Thinking-1, its first reasoning AI model trained without third-party distillation
Microsoft announced MAI-Thinking-1, its first advanced reasoning AI model, at Build 2026. The company claims it's a medium-sized model matching leading models on key software engineering benchmarks, trained from scratch without distillation from third-party models.
MiniMax Launches M3 Model With 1M Context Window at $0.30 Per Million Input Tokens
MiniMax has released M3, a multimodal foundation model supporting text, image, and video inputs with a 1-million-token context window. The model costs $0.30 per million input tokens and $1.20 per million output tokens, available through OpenRouter.
ByteDance Open-Sources Bernini-R Video Diffusion Model With Semantic Planning Architecture
ByteDance released Bernini-R, an open-source video generation and editing model that combines an MLLM-based semantic planner with a DiT-based renderer. The model requires Hopper-class GPUs (H100/H800/H200) for optimal performance and supports multiple tasks including text-to-video, video editing, and reference-guided generation.
Microsoft launches MAI-Code-1 and MAI-Thinking-1 models to reduce OpenAI dependence
Microsoft announced two proprietary AI models at its Build developer conference: MAI-Code-1 for code generation and MAI-Thinking-1 for reasoning tasks. The models are designed to run on Azure infrastructure, allowing Microsoft to reduce costs from its $13 billion OpenAI investment while competing directly with Anthropic and Google.
Comments
Loading...