Alibaba releases Qwen3.5-27B, a 27B multimodal model with Apache 2.0 license
Alibaba Qwen has released Qwen3.5-27B, a 27-billion parameter model capable of processing both images and text. The model is available under an Apache 2.0 open license and is compatible with standard transformer endpoints.
Alibaba Qwen Releases Qwen3.5-27B Multimodal Model
Alibaba's Qwen team has published Qwen3.5-27B, a 27-billion parameter model designed to handle both image and text inputs. The release marks the latest iteration in Alibaba's open-source model lineup.
Model Specifications
Qwen3.5-27B is a multimodal model with an architecture supporting image-text-to-text tasks. The model carries an Apache 2.0 license, making it freely available for both research and commercial use. It is compatible with standard transformer endpoints and follows the safetensors format for model weights.
The model's parameter count of 27 billion positions it in the mid-range segment—larger than models like Mistral 7B but smaller than many instruction-tuned variants in the 70B range. This size targets deployment scenarios where computational resources are constrained but model capability remains a priority.
Capability Profile
Qwen3.5-27B is tagged for conversational tasks and multimodal understanding, suggesting it can engage in dialogue while processing images alongside text prompts. The image-text-to-text classification indicates the model accepts images and text as combined inputs and generates text responses.
Specific benchmark scores, training data composition, knowledge cutoff date, and maximum context window length have not been disclosed in the initial release metadata.
Availability and Licensing
The model is hosted on Hugging Face and is immediately available for download. The Apache 2.0 license removes legal barriers to commercial deployment, distinguishing this release from many restricted-license models. Support for standard transformer inference frameworks means existing tooling can run the model without custom implementations.
No pricing information or commercial hosting details have been announced.
What This Means
Qwen3.5-27B expands Alibaba's open competition with other mid-range multimodal models like Qwen's own larger variants and offerings from Mistral, Meta, and others. The 27B parameter count targets developers who need multimodal capability without the compute overhead of 70B+ models. The Apache 2.0 license removes deployment friction compared to restricted models. However, without disclosed benchmarks or performance data, comparative positioning against competing 27-30B multimodal models remains unclear. Organizations evaluating this model should establish baselines on their specific use cases before production deployment.
Related Articles
Google DeepMind releases Gemma 4 12B Unified: encoder-free multimodal model with 256K context window
Google DeepMind has released Gemma 4 12B Unified, an encoder-free multimodal model that processes text, images, and audio through a single decoder-only transformer. The model features 11.95 billion parameters, a 256K token context window, and achieves 77.2% on MMLU Pro and 72.0% on LiveCodeBench v6.
Alibaba's Qwen Releases Qwen3.7 Plus: 1M Context Window at $0.40 Per Million Input Tokens
Alibaba's Qwen has released Qwen3.7 Plus, a multimodal model with a 1 million token context window. The model accepts text and image input with text output, priced at $0.40 per million input tokens and $1.60 per million output tokens through OpenRouter's API.
ByteDance Open-Sources Bernini-R Video Diffusion Model With Semantic Planning Architecture
ByteDance released Bernini-R, an open-source video generation and editing model that combines an MLLM-based semantic planner with a DiT-based renderer. The model requires Hopper-class GPUs (H100/H800/H200) for optimal performance and supports multiple tasks including text-to-video, video editing, and reference-guided generation.
NVIDIA Releases Nemotron 3.5 Content Safety: 4B-Parameter Multimodal Model with Custom Policy Enforcement and 140-Langua
NVIDIA has released Nemotron 3.5 Content Safety, a 4B-parameter model built on Google Gemma 3 4B IT that provides multimodal safety classification across approximately 140 languages. The model includes a 128K context window, custom enterprise policy enforcement, auditable reasoning traces, and is releasing its training dataset.
Comments
Loading...