model releaseDeepSeek

DeepSeek releases V4 model preview with agent optimization, pricing undisclosed

TL;DR

DeepSeek released a preview of its V4 large language model on April 24, 2026, available in 'pro' and 'flash' versions. The Hangzhou-based company claims the open-source model achieves strong performance on agent-based tasks and has been optimized for tools like Anthropic's Claude Code and OpenClaw.

2 min read
0

DeepSeek released a preview version of its V4 large language model on April 24, 2026, marking the company's first major model update since its R1 reasoning model in January 2025. The model is available in two variants: "pro" and "flash," though the company has not disclosed technical specifications, pricing, or context window sizes.

Model Capabilities

According to DeepSeek, V4 delivers improved performance against domestic Chinese competitors, particularly in agent-based tasks, knowledge processing, and inference. The company specifically optimized the model for compatibility with popular agent tools including Anthropic's Claude Code and OpenClaw.

Like its predecessor V3, DeepSeek-V4 is open source, allowing developers to download the code, run it locally, and modify it. The company has not released benchmark scores or comparative performance data.

Market Context

The release comes 15 months after DeepSeek's R1 reasoning model disrupted global tech markets in January 2025. R1 matched or outperformed leading models from OpenAI and Google on several benchmarks, despite DeepSeek's claims of development costs far below U.S. competitors.

DeepSeek's V3 model, released in late 2024, gained attention for reportedly being trained with less powerful chips and at a fraction of the cost of comparable models. However, the company's subsequent model releases have not replicated R1's market impact.

Competitive Landscape

DeepSeek now faces intensifying competition in China's AI sector. Alibaba and ByteDance have both released new models in 2026, competing for market share in the rapidly growing domestic AI market.

Founded in 2023 and based in Hangzhou, DeepSeek continues its strategy of open-source releases, contrasting with the closed-source approaches of many Western AI labs.

What This Means

The V4 preview extends DeepSeek's open-source model lineup but lacks the specificity needed to assess its competitive position. Without disclosed benchmarks, pricing, or technical specifications, it's unclear whether V4 represents a meaningful advance over V3 or how it compares to recent releases from competitors like Alibaba's Qwen and international models. The focus on agent optimization suggests DeepSeek is targeting enterprise and developer use cases, though the absence of pricing information makes cost comparisons to Western alternatives impossible.

Source: cnbc.com

Related Articles

model release

NVIDIA Releases Nemotron-3-Ultra: 550B Parameter Model with 1M Token Context and Configurable Reasoning

NVIDIA released Nemotron-3-Ultra-550B-A55B-NVFP4, a 550B parameter model with 55B active parameters, featuring a 1M token context window and configurable reasoning mode. The model uses a hybrid LatentMoE architecture combining Mamba-2, Mixture-of-Experts, and Attention layers with Multi-Token Prediction, trained with NVIDIA's NVFP4 quantization-aware approach.

model release

Ideogram 4: 9.3B parameter open-weight text-to-image model with native 2K resolution and structured JSON prompting

Ideogram has released Ideogram 4, its first open-weight text-to-image model with 9.3 billion parameters. The model supports native 2K resolution, structured JSON prompting with bounding-box layout controls, and is available in nf4 and fp8 quantizations under a non-commercial license.

model release

Google DeepMind releases Gemma 4 12B Unified: encoder-free multimodal model with 256K context window

Google DeepMind has released Gemma 4 12B Unified, an encoder-free multimodal model that processes text, images, and audio through a single decoder-only transformer. The model features 11.95 billion parameters, a 256K token context window, and achieves 77.2% on MMLU Pro and 72.0% on LiveCodeBench v6.

model release

Alibaba's Qwen Releases Qwen3.7 Plus: 1M Context Window at $0.40 Per Million Input Tokens

Alibaba's Qwen has released Qwen3.7 Plus, a multimodal model with a 1 million token context window. The model accepts text and image input with text output, priced at $0.40 per million input tokens and $1.60 per million output tokens through OpenRouter's API.

Comments

Loading...