Alibaba releases Qwen3.5-35B-A3B, a 35B multimodal model with Apache 2.0 license
Alibaba has released Qwen3.5-35B-A3B, a 35-billion parameter multimodal model capable of processing images and text. The model is published under an Apache 2.0 license and available on Hugging Face with Transformers and SafeTensors format support.
Alibaba's Qwen team has released Qwen3.5-35B-A3B, a 35-billion parameter multimodal model designed to process both images and text inputs. The model was published on February 24, 2026, and is available via Hugging Face.
Model Specifications
Qwen3.5-35B-A3B operates as an image-text-to-text model, meaning it accepts images and text as input and generates text responses. The model uses a mixture-of-experts (MoE) architecture, as indicated by the qwen3_5_moe tag in its Hugging Face metadata.
The 35-billion total parameter count positions this model in the mid-to-large range for open-weight deployments. Because an MoE model activates only a subset of its experts for each token, per-token inference compute is lower than for a dense model of the same size, offering a balance between hardware requirements and capability for enterprises and researchers with moderate infrastructure.
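The divergence between total parameters and per-token compute comes from the router in each MoE layer, which sends a token's hidden state to only the top-k scoring experts. A minimal sketch of top-k gating (all shapes, names, and the toy expert count are illustrative, not Qwen's actual configuration):

```python
import numpy as np

def moe_gate(x, gate_w, k=2):
    """Route one token's hidden state to the top-k of n experts.

    x: (d,) token hidden state; gate_w: (d, n) router weights.
    Returns the chosen expert indices and their softmax mixing weights.
    Illustrative only; real MoE layers batch this and add load balancing.
    """
    logits = x @ gate_w                       # (n,) router scores
    top = np.argsort(logits)[-k:]             # indices of the top-k experts
    w = np.exp(logits[top] - logits[top].max())
    return top, w / w.sum()                   # normalized mixing weights

def moe_layer(x, gate_w, experts, k=2):
    """Evaluate only the selected experts and combine their outputs."""
    idx, w = moe_gate(x, gate_w, k)
    return sum(wi * experts[i](x) for wi, i in zip(w, idx))

# Toy usage: 4 experts over hidden size 8, each expert a linear map.
rng = np.random.default_rng(0)
experts = [lambda x, W=rng.normal(size=(8, 8)): x @ W for _ in range(4)]
gate_w = rng.normal(size=(8, 4))
y = moe_layer(rng.normal(size=8), gate_w, experts)
```

Only k of the experts run per token, so parameters scale with the expert count while compute scales with k.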
Licensing and Access
The model is released under the permissive Apache 2.0 license, allowing commercial and research use with minimal restrictions. This licensing choice contrasts with some recent model releases that employ more restrictive agreements.
The model is published in SafeTensors format and fully compatible with the Hugging Face Transformers library, enabling straightforward integration into existing ML pipelines. Hugging Face Inference Endpoints compatibility is confirmed, making deployment accessible for users without dedicated infrastructure.
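As a sketch of that integration path, the snippet below uses Transformers' generic image-text-to-text interface. The repo id `Qwen/Qwen3.5-35B-A3B` is inferred from the model name in this article and may not match the actual Hugging Face identifier; consult the model card for the exact id and processor usage.

```python
MODEL_ID = "Qwen/Qwen3.5-35B-A3B"  # assumed repo id, check the model card

def build_messages(image_url: str, question: str):
    """Standard Transformers multimodal chat format: typed content parts."""
    return [{
        "role": "user",
        "content": [
            {"type": "image", "url": image_url},
            {"type": "text", "text": question},
        ],
    }]

def run(image_url: str, question: str) -> str:
    """Load the model and answer a question about an image."""
    # Heavy imports live inside the function so build_messages() can be
    # used without transformers installed.
    from transformers import AutoModelForImageTextToText, AutoProcessor

    processor = AutoProcessor.from_pretrained(MODEL_ID)
    model = AutoModelForImageTextToText.from_pretrained(
        MODEL_ID, device_map="auto", torch_dtype="auto"
    )
    inputs = processor.apply_chat_template(
        build_messages(image_url, question),
        add_generation_prompt=True,
        tokenize=True,
        return_dict=True,
        return_tensors="pt",
    ).to(model.device)
    out = model.generate(**inputs, max_new_tokens=256)
    new_tokens = out[:, inputs["input_ids"].shape[-1]:]
    return processor.batch_decode(new_tokens, skip_special_tokens=True)[0]
```

The same checkpoint should also work behind Hugging Face Inference Endpoints without local weights, per the compatibility noted above.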
Technical Details
The model card tags the release for conversational use, suggesting it has been tuned for dialogue applications. Specific details on context window length, training data cutoff, and benchmark scores had not been disclosed by Alibaba at the time of release.
The "A3B" suffix follows the naming convention of earlier Qwen MoE releases such as Qwen3-30B-A3B, where it denotes roughly 3 billion activated parameters per token, though Alibaba has not explicitly confirmed this reading for the new release.
What This Means
Qwen3.5-35B-A3B expands the open-weight multimodal model landscape with an Apache 2.0-licensed option suitable for commercial applications. The mid-size 35B parameter count fills a practical deployment niche for organizations seeking multimodal capabilities without enterprise-grade infrastructure. However, without published benchmarks or detailed capability comparisons, the model's competitive positioning relative to other open multimodal models remains unclear. The lack of disclosed context window size and training specifics limits technical evaluation at launch.
Related Articles
OpenAI Releases GPT-5.4 Image 2 with 272K Context Window and Image Generation
OpenAI has released GPT-5.4 Image 2, combining the GPT-5.4 reasoning model with image generation capabilities. The multimodal model features a 272K token context window and is priced at $8 per million input tokens and $15 per million output tokens.
OpenAI releases ChatGPT Images 2.0 with 3840x2160 resolution at $30 per 1M output tokens
OpenAI released ChatGPT Images 2.0, pricing output tokens at $30 per million with maximum resolution of 3840x2160 pixels. CEO Sam Altman claims the improvement from gpt-image-1 to gpt-image-2 equals the jump from GPT-3 to GPT-5.
OpenAI announces gpt-image-2 model with improved text rendering and UI generation
OpenAI is set to announce gpt-image-2, its next-generation image generation model, on April 21, 2026 at 12pm PT. The company's teaser demonstrates improved capabilities in rendering text and generating realistic user interfaces from text prompts.
Moonshot AI Releases Kimi K2.6: 1T-Parameter MoE Model with 256K Context and Agent Swarm Capabilities
Moonshot AI has released Kimi K2.6, an open-source multimodal model with 1 trillion total parameters (32B activated) and 256K context window. The model achieves 80.2% on SWE-Bench Verified, 58.6% on SWE-Bench Pro, and supports horizontal scaling to 300 sub-agents executing 4,000 coordinated steps.