Qwen3.5-35B-A3B-FP8

Alibaba / Qwen
active

Version History

FP8 Quantized (patch), March 1, 2026

Released FP8-quantized version of Qwen3.5-35B-A3B, reducing memory requirements while maintaining multimodal capabilities. Compatible with Transformers endpoints and Azure deployment.

Coverage

model release

Alibaba releases Qwen3.5-35B-A3B-FP8, a quantized multimodal model for efficient deployment

Alibaba's Qwen team released Qwen3.5-35B-A3B-FP8 on Hugging Face, a quantized version of their 35-billion-parameter multimodal model. FP8 quantization reduces the model's size and memory requirements while preserving the base model's image-text-to-text capabilities. The model is compatible with standard Transformers endpoints and Azure deployment.
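To make the memory claim concrete, a rough weights-only estimate can be sketched: FP8 stores one byte per parameter versus two for BF16, roughly halving weight storage. This is illustrative arithmetic only; it ignores activations, KV cache, runtime overhead, and any mixed-precision layers the actual checkpoint may keep in higher precision.

```python
# Illustrative, weights-only memory estimate for a 35B-parameter model.
# Assumption: every parameter is stored at the stated precision
# (real checkpoints often keep some layers in higher precision).
PARAMS = 35e9

def weight_memory_gib(bytes_per_param: float) -> float:
    """Approximate weight storage in GiB at the given bytes-per-parameter."""
    return PARAMS * bytes_per_param / 2**30

bf16_gib = weight_memory_gib(2)  # BF16: 2 bytes per parameter
fp8_gib = weight_memory_gib(1)   # FP8: 1 byte per parameter
print(f"BF16 ~ {bf16_gib:.0f} GiB, FP8 ~ {fp8_gib:.0f} GiB")
```

Under these assumptions the weights drop from roughly 65 GiB (BF16) to roughly 33 GiB (FP8), which is the kind of saving that moves a model from multi-GPU to single-GPU serving.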
