model release

Alibaba releases Qwen3.5-4B, a 4B multimodal model for vision and text tasks

TL;DR

Alibaba's Qwen team has released Qwen3.5-4B, a 4 billion parameter multimodal model capable of processing both images and text. The model is available on Hugging Face under an Apache 2.0 license, making it freely available for commercial and research use.

March 2, 2026 · 3:20 PM2 min read

Qwen3.5-4B — Quick Specs

Context window262K tokens

Compare Qwen3.5-4B with other models →

Alibaba Releases Qwen3.5-4B Multimodal Model

Alibaba's Qwen team has released Qwen3.5-4B, a 4 billion parameter multimodal model designed to handle both image and text inputs. The model was published on Hugging Face on February 27, 2026.

Model Specifications

Qwen3.5-4B is positioned as a lightweight multimodal model with 4 billion parameters. It supports image-text-to-text tasks, enabling users to provide images and text prompts and receive text responses. The model is available in base form (Qwen3.5-4B-Base) with instruction-tuned variants also released.

The model uses the safetensors format for model weights and is compatible with standard transformers pipelines and Hugging Face Endpoints.

Licensing and Availability

Qwen3.5-4B is released under the Apache 2.0 license, permitting free use for both commercial and non-commercial applications. This represents a fully open release with no usage restrictions. The model is available directly from Hugging Face's model hub.

Architecture and Capabilities

The model is tagged for conversational use cases and image-text-to-text applications. At 4 billion parameters, it targets the efficiency segment of the market—suitable for deployment on resource-constrained hardware while maintaining multimodal capabilities.

As of publication, the model has received 60 likes and 41 downloads on Hugging Face, indicating early interest from the open-source community.

Community Reception

The release includes evaluation results published alongside the model weights, following Alibaba's standard practice of providing benchmark data for model transparency. The model is marked as compatible with Hugging Face Endpoints for easy deployment.

What This Means

Qwen3.5-4B extends Alibaba's Qwen family into the efficient multimodal space at a smaller scale than previous releases. The 4B parameter count makes it suitable for edge deployment and fine-tuning on limited hardware, while Apache 2.0 licensing removes legal barriers to adoption. This positions the model as a competitive option for developers needing lightweight vision-language capabilities without commercial restrictions. The release reflects continued competition in the open-source multimodal space, where parameter efficiency and licensing terms are becoming primary differentiators.

Source: huggingface.co ↗

qwen alibaba multimodal vision-language 4b-parameters open-source apache-2.0

model releaseJuly 20, 2026

Alibaba releases Qwen 3.8, a 2.4 trillion parameter open-weight model claiming second place behind Fable 5

Alibaba has released Qwen 3.8, a 2.4 trillion parameter open-weight model that the company claims trails only Fable 5. The multimodal model processes images, videos, and documents, with a preview available through Alibaba's platforms at 10 percent of standard pricing.

model releaseJuly 20, 2026

Alibaba previews Qwen3.8 with 2.4 trillion parameters, claims second place without benchmark data

Alibaba unveiled Qwen3.8 at the World Artificial Intelligence Conference in Shanghai, claiming the 2.4 trillion parameter model ranks second only to Anthropic's Fable 5. The company provided no benchmark scores, model card, or independent verification to support the claim.

model releaseJuly 20, 2026

Moonshot AI and Alibaba release 2.8T and 2.4T parameter models, claim performance near GPT-5.6 and Claude Fable 5

Within days, Moonshot AI and Alibaba unveiled what they claim are frontier-class models. Moonshot's Kimi K3, at 2.8 trillion parameters, and Alibaba's Qwen3.8, at 2.4 trillion parameters, will both be released as open-weight models with full weights available for download.

model releaseJuly 20, 2026

Thinking Machines releases Inkling: 975B-parameter MoE model with Apache 2.0 license, first major US open-weight multimo

Thinking Machines Lab released Inkling, a mixture-of-experts model with 975B total parameters and 41B active parameters, trained on 45 trillion tokens across text, images, audio, and video. The Apache 2.0-licensed model supports up to 1M context and debuts alongside Inkling-Small (276B-A12B), marking what observers call the strongest US-based open-weight release to date.

Alibaba releases Qwen3.5-4B, a 4B multimodal model for vision and text tasks

Qwen3.5-4B — Quick Specs

Alibaba Releases Qwen3.5-4B Multimodal Model

Model Specifications

Licensing and Availability

Architecture and Capabilities

Community Reception

What This Means

Related Articles

Alibaba releases Qwen 3.8, a 2.4 trillion parameter open-weight model claiming second place behind Fable 5

Alibaba previews Qwen3.8 with 2.4 trillion parameters, claims second place without benchmark data

Moonshot AI and Alibaba release 2.8T and 2.4T parameter models, claim performance near GPT-5.6 and Claude Fable 5

Thinking Machines releases Inkling: 975B-parameter MoE model with Apache 2.0 license, first major US open-weight multimo

Comments