DeepSeek R1 Distill Qwen 32B
DeepSeek (🇨🇳 China)
Context window: 33K tokens
Input / 1M tokens: $0.29
Output / 1M tokens: $0.29
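As a rough illustration of how the per-1M-token prices above translate into a per-request cost, here is a minimal sketch. The function name `estimate_cost` and the constant names are hypothetical, chosen for this example only. It assumes the listed $0.29 rates apply uniformly, with no minimum charge or volume discount.

```python
# Prices copied from the listing above (USD per 1M tokens).
INPUT_PRICE_PER_M = 0.29
OUTPUT_PRICE_PER_M = 0.29


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


if __name__ == "__main__":
    # Example: a 20,000-token prompt with a 5,000-token completion.
    print(f"${estimate_cost(20_000, 5_000):.5f}")
```

Because input and output are priced identically here, the cost depends only on the total token count, but keeping the two rates separate makes the sketch easy to adapt if they diverge.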
Version History
1.0 (major)
DeepSeek releases a distilled 32B model, built by training Qwen 2.5 32B on reasoning outputs from R1. DeepSeek claims state-of-the-art performance among dense models, at a price of $0.29 per 1M tokens.