DeepSeek R1 Distill Qwen 32B

DeepSeek🇨🇳 China
active
Context window33K tokens
Input / 1M tokens$0.29
Output / 1M tokens$0.29

Version History

1.0major

DeepSeek releases distilled 32B model using R1 reasoning outputs applied to Qwen 2.5 32B. Claims state-of-the-art performance for dense models at $0.29/M tokens.