Qwen3 72B
Alibaba / Qwen🇨🇳 China
Alibaba's top open-weights model. Hybrid thinking/non-thinking mode. Multilingual.
Context window128K tokens
Input / 1M tokens$0.4
Output / 1M tokens$1.2
Version History
Qwen3-72B-0806patch
Qwen3 72B patch release with bug fixes and improved instruction-following stability across multilingual prompts.
Qwen3-72Bmajor
Qwen3 72B ships with hybrid thinking/non-thinking modes. Claims top open-weights position on coding, math, and multilingual benchmarks.
Benchmark Scores
Full leaderboard →82.5%
AIME 2025
72.3%
GPQA
91.2%
HumanEval
87.5%
MATH
87.1%
MMLU
79.5%
MMLU-Pro
65.0 tokens_per_sec
Speed (tok/s)