DeepSeek

Chinese AI lab, maker of DeepSeek-V3 and DeepSeek-R1

News

model release · DeepSeek

DeepSeek Releases V4-Flash: 284B Parameter MoE Model with 1M Context Window at Q8 162GB

Unsloth has released optimized GGUF quantizations of DeepSeek-V4-Flash, a 284B parameter Mixture-of-Experts model that activates 13B parameters and supports 1 million token context windows. The Q8 quantization (UD-Q8_K_XL) is 162GB and is claimed to be lossless, only 7GB larger than the Q4 variant.

July 8, 2026 · 2:51 PM · 2 min read

DeepSeek MoE GGUF

product update · DeepSeek

Cline v4.0.2 Adds DeepSeek Reasoning Effort Controls, Including 'xhigh' Setting

Cline, the autonomous AI coding assistant, released v4.0.2 with support for reasoning effort controls on DeepSeek thinking models, including the new 'xhigh' setting. The update also improves the ClinePass provider experience with clearer reasoning controls and model selection.

June 29, 2026 · 4:20 AM · 1 min read

cline deepseek coding-assistant

model release · DeepSeek

DeepSeek Releases V4 Models: 1M Context Window, 90% Less KV Cache Than V3.2

DeepSeek has released two new MoE models: DeepSeek-V4-Pro with 1.6T parameters (49B activated) and DeepSeek-V4-Flash with 284B parameters (13B activated). Both models support a one million token context window and use a hybrid attention architecture that requires only 27% of the single-token inference FLOPs and 10% of the KV cache of DeepSeek-V3.2.

June 29, 2026 · 12:36 AM · 2 min read

deepseek model-release moe

model release · DeepSeek

DeepSeek Releases V4-Pro with 1.6T Parameters, 1M Token Context at 27% Inference Cost of V3.2

DeepSeek has released two Mixture-of-Experts models: V4-Pro with 1.6 trillion parameters (49B activated) and V4-Flash with 284B parameters (13B activated), both supporting 1 million token context windows. At 1M token context, V4-Pro requires only 27% of the inference FLOPs and 10% of the KV cache of V3.2, and was trained on over 32 trillion tokens.

June 27, 2026 · 3:51 PM · 2 min read

DeepSeek mixture-of-experts long-context

model release · DeepSeek

DeepSeek-V4-Fable: Offensive Security Model Trained on 80,000 CTF Trajectories Achieves 58.7% Solve Rate

Chunjiang Intelligence has released DeepSeek-V4-Fable, an autonomous agent model designed for offensive security research and CTF challenges. The model, distilled from Claude-5-Fable and built on DeepSeek-V4-Flash, was trained on 80,000 verified CTF trajectories and achieves a 58.7% solve rate across held-out security challenges.

June 25, 2026 · 3:06 PM · 2 min read

deepseek security model-release

changelog · DeepSeek

DeepSeek cuts V4 Pro pricing by 75% to $0.003625 per million input tokens

DeepSeek has permanently reduced pricing for its V4 Pro model by 75%, bringing input token costs down to $0.003625 per million tokens from $0.0145. The move makes permanent a promotional discount that was set to expire May 31, 2026.

May 23, 2026 · 3:50 PM · 2 min read

deepseek pricing v4-pro

model release · DeepSeek

DeepSeek Releases V4 Flash: 284B-Parameter MoE Model with 1M Context Window, Free via OpenRouter

DeepSeek has released V4 Flash, a Mixture-of-Experts model with 284B total parameters and 13B activated parameters per forward pass. The model supports a 1M-token context window and is available free through OpenRouter, targeting high-throughput coding and chat applications.

May 13, 2026 · 11:50 PM · 2 min read

DeepSeek V4 Flash MoE

model release · DeepSeek

DeepSeek V4 cuts inference costs with 1.6T parameter model using 13.7x less memory than V3

DeepSeek released V4 in two versions: a 284 billion parameter Flash model and a 1.6 trillion parameter Pro model with 49 billion active parameters. According to DeepSeek, the models use 9.5x-13.7x less memory than V3 through compressed attention mechanisms and FP4/FP8 mixed precision, while supporting a 1 million token context window.

April 24, 2026 · 9:36 PM · 2 min read

deepseek llm mixture-of-experts

model release · DeepSeek

DeepSeek V4 Pro launches with 1.6 trillion parameters, 1M token context at $0.145 per million input tokens

Chinese AI lab DeepSeek has released preview versions of DeepSeek V4 Flash and V4 Pro, mixture-of-experts models with 1 million token context windows. The V4 Pro has 1.6 trillion total parameters (49 billion active), making it the largest open-weight model available, while both models significantly undercut frontier model pricing.

April 24, 2026 · 1:50 PM · 2 min read

DeepSeek V4 mixture-of-experts

model release · DeepSeek

DeepSeek releases V4 preview, claims parity with GPT-4o and Claude 3.5 Sonnet

DeepSeek released a preview of its V4 model on April 24, 2026, claiming the open-source system matches leading closed-source models from Anthropic, Google, and OpenAI. The company emphasized improved coding capabilities and compatibility with domestic Huawei chips, but did not disclose training costs or hardware specifications.

April 24, 2026 · 10:05 AM · 2 min read

DeepSeek V4 open-source

model release · DeepSeek

DeepSeek Releases V4-Flash-Base: 292B Parameter Base Model

DeepSeek has released V4-Flash-Base, a 292 billion parameter base model now available on Hugging Face. The model uses BF16, I64, F32, and F8_E4M3 tensor types and is distributed in Safetensors format.

April 24, 2026 · 6:50 AM · 1 min read

DeepSeek V4-Flash base-model

model release · DeepSeek

DeepSeek V4 Pro launches with 1.6T parameters at $1.74/M tokens, undercutting Claude Sonnet 4.6 by 42%

DeepSeek released two preview models: V4 Pro (1.6T total parameters, 49B active) and V4 Flash (284B total, 13B active), both with 1 million token context windows. V4 Pro is priced at $1.74/M input tokens and $3.48/M output tokens, 42% cheaper than Claude Sonnet 4.6, while V4 Flash at $0.14/$0.28 per million input/output tokens undercuts all small frontier models.

April 24, 2026 · 6:21 AM · 2 min read

deepseek model-release pricing

model release · DeepSeek

DeepSeek releases V4 model preview with agent optimization, pricing undisclosed

DeepSeek released a preview of its V4 large language model on April 24, 2026, available in 'pro' and 'flash' versions. The Hangzhou-based company claims the open-source model achieves strong performance on agent-based tasks and has been optimized for tools like Anthropic's Claude Code and OpenClaw.

April 24, 2026 · 5:05 AM · 2 min read

DeepSeek V4 open-source

model release · DeepSeek

DeepSeek Releases V4-Pro-Base with 1.6 Trillion Parameters

DeepSeek has released DeepSeek-V4-Pro-Base, a 1.6 trillion parameter foundation model now available on Hugging Face. The base model uses BF16 precision for weights and includes support for F8_E4M3, I64, and F32 tensor types.

April 24, 2026 · 4:50 AM · 1 min read

DeepSeek base-model open-source

model release · DeepSeek

DeepSeek Releases V4 Pro: 1.6T Parameter MoE Model with 1M Token Context at $1.74/M Input Tokens

DeepSeek has released V4 Pro, a Mixture-of-Experts model with 1.6 trillion total parameters and 49 billion activated parameters. The model supports a 1-million-token context window and costs $1.74 per million input tokens and $3.48 per million output tokens.

April 24, 2026 · 4:21 AM · 2 min read

DeepSeek V4 Pro MoE

model release · DeepSeek

DeepSeek V4 Flash Released: 284B Parameter MoE Model with 1M Context Window at $0.14 per Million Tokens

DeepSeek has released V4 Flash, a Mixture-of-Experts model with 284B total parameters and 13B activated parameters per token. The model supports a 1,048,576-token context window and is priced at $0.14 per million input tokens and $0.28 per million output tokens.

April 24, 2026 · 4:21 AM · 2 min read

DeepSeek V4 Flash Mixture-of-Experts

model release · DeepSeek

DeepSeek Releases V4-Flash: 284B-Parameter MoE Model With 1M Token Context at 27% Inference Cost

DeepSeek released two Mixture-of-Experts models: V4-Flash with 284B total parameters (13B activated) and V4-Pro with 1.6T parameters (49B activated). Both models support one million token context windows and use a hybrid attention architecture that, at 1M token context, requires only 27% of the inference FLOPs of DeepSeek-V3.2.

April 24, 2026 · 3:36 AM · 2 min read

deepseek moe long-context

model release · DeepSeek

DeepSeek Releases V4-Pro: 1.6T Parameter MoE Model with 1M Token Context

DeepSeek released two new Mixture-of-Experts models: DeepSeek-V4-Pro with 1.6 trillion parameters (49B activated) and DeepSeek-V4-Flash with 284B parameters (13B activated), both supporting one million token context length. At 1M context, the models require only 27% of the inference FLOPs and 10% of the KV cache of DeepSeek-V3.2, through a hybrid attention architecture combining Compressed Sparse Attention and Heavily Compressed Attention.

April 24, 2026 · 3:21 AM · 2 min read

deepseek moe long-context

model release · DeepSeek

DeepSeek V4 launching on Huawei chips exclusively, signaling China's AI independence progress

DeepSeek V4 is launching in the coming weeks running exclusively on Huawei chips, marking a major milestone in China's effort to reduce dependency on foreign semiconductors. Chinese tech giants including Alibaba, ByteDance, and Tencent have ordered hundreds of thousands of Huawei Ascend 950PR units to deploy the model through their cloud services.

April 3, 2026 · 7:20 PM · 2 min read

deepseek huawei chips

model release · DeepSeek

DeepSeek releases R1 reasoning model with chain-of-thought capabilities

DeepSeek has released DeepSeek-R1, a text generation model featuring reasoning capabilities through chain-of-thought processing. The model was published January 20, 2025 and has accumulated over 830,000 downloads on Hugging Face.

February 27, 2026 · 11:05 AM · 2 min read

deepseek model-release reasoning