Alibaba releases Qwen3.5-9B, a multimodal 9B parameter model

Alibaba has released Qwen3.5-9B, a 9-billion-parameter multimodal language model that processes both images and text. The model is available on Hugging Face under the Apache 2.0 license, with an architecture compatible with the transformers library.

Alibaba's Qwen team has released Qwen3.5-9B, a 9-billion parameter multimodal model designed for image-text-to-text tasks. The model arrived on Hugging Face on February 27, 2026.

Model Specifications

Qwen3.5-9B is a multimodal language model that accepts both image and text inputs, classifying it as an image-text-to-text model. A base variant is published as Qwen/Qwen3.5-9B-Base, and the released model appears to be fine-tuned from it for improved performance on downstream tasks.

The model uses a standard transformer architecture and is distributed in the SafeTensors format, compatible with the Hugging Face transformers library. According to the model card, it also supports inference endpoints.

Licensing and Availability

Qwen3.5-9B is released under the Apache 2.0 license, which permits commercial use, modification, and redistribution with attribution. The model is hosted on Hugging Face, where it had received 65 likes and 10 downloads shortly after release.

Technical Details

The model carries the qwen3_5 tag and is labeled conversational, indicating it is designed for dialogue and interactive use. Its 9-billion-parameter scale places it in the lightweight-to-mid-range segment, suitable for deployment on consumer hardware and edge devices with moderate GPU resources.

The pipeline tag specifies image-text-to-text functionality, meaning the model accepts images and text as input and generates text output. This multimodal capability differentiates it from text-only models of similar size.
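In practice, an image-text-to-text pipeline tag suggests the model can be driven through the Hugging Face transformers `pipeline` API with chat-style messages that mix image and text content. The sketch below is a minimal, hypothetical example under that assumption: the message format is the one transformers multimodal pipelines generally accept, but exact pipeline support for this particular model, and the helper names `build_messages` and `describe_image`, are illustrative assumptions rather than details from the model card.

```python
from typing import Any


def build_messages(image_url: str, question: str) -> list[dict[str, Any]]:
    # Chat-format input used by transformers multimodal pipelines:
    # a single user turn carrying both an image and a text prompt.
    return [
        {
            "role": "user",
            "content": [
                {"type": "image", "url": image_url},
                {"type": "text", "text": question},
            ],
        }
    ]


def describe_image(
    image_url: str,
    question: str,
    model_id: str = "Qwen/Qwen3.5-9B",  # model id from the article; exact repo name unverified
) -> str:
    # Heavy step: downloads the ~9B-parameter weights on first use,
    # so this requires substantial disk space and GPU memory.
    from transformers import pipeline  # deferred import; requires `transformers`

    pipe = pipeline("image-text-to-text", model=model_id)
    out = pipe(text=build_messages(image_url, question), max_new_tokens=64)
    return out[0]["generated_text"]
```

Deferring the transformers import keeps the message-building helper usable (for example, to inspect or log the prompt structure) without pulling in the model weights.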

Context and Positioning

Qwen3.5-9B represents Alibaba's continued expansion in the open-source language model space. The Qwen series has established itself as a competitive alternative to models from OpenAI, Google, and other providers, particularly for developers requiring open-source options with permissive licensing.

The model's size and architecture suggest it is aimed at scenarios where compute efficiency matters: fine-tuning, deployment in resource-constrained environments, and use cases where a small model footprint is an advantage. Multimodal capability at the 9B scale addresses demand for models that handle both vision and language tasks without the resource overhead of larger multimodal systems.

What this means

Qwen3.5-9B extends Alibaba's open-source model lineup with a multimodal option in an accessible size range. For practitioners, it is another option for running image-text understanding locally or at minimal compute cost. The Apache 2.0 license removes licensing barriers for commercial applications, putting the model in direct competition with similarly sized releases from other providers. Specific performance characteristics, including benchmark scores, context length, and inference speed, have not been published, so empirical comparison is necessary before adoption decisions.