Qwen2.5-VL 72B
Alibaba / Qwen🇨🇳 China
Alibaba 72B vision-language model. Understands images, hour-long video, documents, and structured data.
Context window131K tokens
Input / 1M tokens$0.4
Output / 1M tokens$1.2
Alibaba 72B vision-language model. Understands images, hour-long video, documents, and structured data.