Alibaba releases Qwen3.5-27B, a 27B multimodal model with Apache 2.0 license
Alibaba Qwen has released Qwen3.5-27B, a 27-billion parameter model capable of processing both images and text. The model is available under an Apache 2.0 open license and is compatible with standard transformer endpoints.
Alibaba's Qwen team has published Qwen3.5-27B, a 27-billion parameter model designed to handle both image and text inputs. The release marks the latest iteration in Alibaba's open-source model lineup.
Model Specifications
Qwen3.5-27B is a multimodal model with an architecture supporting image-text-to-text tasks. The model carries an Apache 2.0 license, making it freely available for both research and commercial use. It is compatible with standard transformer endpoints and follows the safetensors format for model weights.
At 27 billion parameters, the model sits in the mid-range segment: larger than models such as Mistral 7B, but well below the 70B class of instruction-tuned variants. This size targets deployment scenarios where computational resources are constrained but model capability remains a priority.
Capability Profile
Qwen3.5-27B carries tags for conversational tasks and multimodal understanding, indicating it can hold a dialogue while processing images alongside text prompts. The image-text-to-text classification means the model accepts combined image and text inputs and generates text responses.
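As a rough illustration, an image-text-to-text input typically pairs an image with a text prompt in a single chat turn. The sketch below uses the chat-message schema common to many Hugging Face image-text-to-text models; the exact format Qwen3.5-27B expects is an assumption, so consult the model card before relying on it.

```python
# Hypothetical single-turn image+text payload. The field names ("role",
# "content", "type", "url", "text") follow the common Hugging Face
# multimodal chat-message convention, not a confirmed Qwen3.5-27B schema.

def build_message(image_url: str, question: str) -> list[dict]:
    """Return a one-turn conversation pairing one image with a text prompt."""
    return [
        {
            "role": "user",
            "content": [
                {"type": "image", "url": image_url},
                {"type": "text", "text": question},
            ],
        }
    ]

messages = build_message("https://example.com/chart.png", "Summarize this chart.")
print(messages[0]["content"][1]["text"])  # → Summarize this chart.
```

A payload like this is what a chat template would flatten into the model's actual prompt tokens before generation.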
Specific benchmark scores, training data composition, knowledge cutoff date, and maximum context window length have not been disclosed in the initial release metadata.
Availability and Licensing
The model is hosted on Hugging Face and is immediately available for download. The Apache 2.0 license removes legal barriers to commercial deployment, distinguishing this release from many restricted-license models. Support for standard transformer inference frameworks means existing tooling can run the model without custom implementations.
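If the model follows the standard transformers conventions the release describes, loading it should look like any other image-text-to-text checkpoint. This is a minimal sketch under two assumptions: the Hub repo ID is "Qwen/Qwen3.5-27B" (not confirmed in the release), and the model registers with the transformers image-text-to-text pipeline. It requires `pip install transformers accelerate`.

```python
# Minimal loading sketch. MODEL_ID is an assumed repo ID, not confirmed
# by the release metadata.

MODEL_ID = "Qwen/Qwen3.5-27B"  # assumed Hugging Face Hub repo ID

def load_pipeline(model_id: str = MODEL_ID):
    """Build an image-text-to-text pipeline; weights download on first call."""
    from transformers import pipeline  # imported lazily: weights are large
    return pipeline("image-text-to-text", model=model_id, device_map="auto")

# Usage (downloads the full 27B-parameter weights on first run):
#   pipe = load_pipeline()
#   print(pipe(text="Describe this image.", images="photo.jpg"))
```

Because the weights ship as safetensors and the model targets standard endpoints, no custom loading code should be needed beyond an up-to-date transformers install.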
No pricing information or commercial hosting details have been announced.
What This Means
Qwen3.5-27B extends Alibaba's open-weight lineup into the mid-range multimodal segment, where it competes with offerings from Mistral, Meta, and others, as well as Qwen's own larger variants. The 27B parameter count targets developers who need multimodal capability without the compute overhead of 70B+ models, and the Apache 2.0 license removes deployment friction compared to restricted models. However, without disclosed benchmarks or performance data, comparative positioning against competing 27-30B multimodal models remains unclear. Organizations evaluating this model should establish baselines on their specific use cases before production deployment.
Related Articles
Tencent Releases HY-World 2.0: Open-Source Multi-Modal Model Generates 3D Worlds from Text and Images
Tencent has released HY-World 2.0, an open-source multi-modal world model that generates navigable 3D environments from text prompts, single images, multi-view images, or video. The model produces editable 3D assets including meshes and 3D Gaussian Splattings that can be directly imported into game engines like Unity and Unreal Engine.
OpenAI Releases GPT-5.4 Image 2 with 272K Context Window and Image Generation
OpenAI has released GPT-5.4 Image 2, combining the GPT-5.4 reasoning model with image generation capabilities. The multimodal model features a 272K token context window and is priced at $8 per million input tokens and $15 per million output tokens.
OpenAI releases ChatGPT Images 2.0 with 3840x2160 resolution at $30 per 1M output tokens
OpenAI released ChatGPT Images 2.0, pricing output tokens at $30 per million with maximum resolution of 3840x2160 pixels. CEO Sam Altman claims the improvement from gpt-image-1 to gpt-image-2 equals the jump from GPT-3 to GPT-5.
OpenAI announces gpt-image-2 model with improved text rendering and UI generation
OpenAI is set to announce gpt-image-2, its next-generation image generation model, on April 21, 2026 at 12pm PT. The company's teaser demonstrates improved capabilities in rendering text and generating realistic user interfaces from text prompts.