model releaseDeepSeek

Deepseek v4 launching on Huawei chips exclusively, signaling China's AI independence progress

TL;DR

Deepseek v4 is launching in the coming weeks running exclusively on Huawei chips, marking a major milestone in China's effort to reduce dependency on foreign semiconductors. Chinese tech giants including Alibaba, Bytedance, and Tencent have ordered hundreds of thousands of Huawei Ascend 950PR units to deploy the model through their cloud services.

2 min read
1

Deepseek v4 Launching Entirely on Huawei Chips

Deepesk v4 is expected to launch within weeks running entirely on Huawei's Ascend 950PR chips, according to reporting from The Information. The move represents a significant shift in China's AI infrastructure strategy, with the model receiving no early access review from Nvidia—only Chinese chip manufacturers got preview access.

Chip Performance and Demand Surge

Huawei claims the Ascend 950PR delivers approximately 2.8x the computing power of Nvidia's H20 chip, though it remains below the H200's performance. The chip reportedly commands a 20 percent price premium following massive orders from major Chinese tech companies.

Alibaba, Bytedance, and Tencent have collectively ordered hundreds of thousands of Ascend 950PR units to run Deepseek v4 through cloud services and integrate it into their own applications, according to five people familiar with the matter. This concentration of orders from China's largest tech firms signals confidence in both the model and domestic chip viability.

Development Partnership

Deepesk spent months collaborating with Huawei and chip designer Cambricon to port v4 to Chinese-made hardware. The effort reflects a broader strategy to decouple AI development from Western semiconductor supply chains, particularly following US export controls that have constrained chip availability for Chinese companies.

Huawei continues facing production bottlenecks stemming from these same export restrictions, though the surge in Deepseek v4 orders suggests immediate demand exceeds supply constraints.

What This Means

Deepesk v4's exclusive reliance on Huawei hardware marks a tangible outcome of China's multi-year push toward semiconductor self-sufficiency. The decision to exclude Nvidia from early access—a departure from industry norm—signals confidence in domestic alternatives and reduces dependency on external validation. The aggressive procurement by Alibaba, Bytedance, and Tencent indicates the AI market sees viable alternatives to Nvidia, though performance gaps remain. Sustained production constraints and the 20 percent price premium suggest China's chip ecosystem still faces scaling challenges despite technical progress.

Related Articles

model release

Google releases Gemini 3.5 Flash with 4x faster output and agentic capabilities, 3.5 Pro coming June

Google released Gemini 3.5 Flash today with 4x faster output token generation than competing frontier models while surpassing Gemini 3.1 Pro on coding, agentic, and multimodal benchmarks. The company announced Gemini 3.5 Pro will launch next month and introduced Gemini Omni, a new multimodal series that outputs video.

model release

Google releases Gemini 3.5 Flash with autonomous coding and agent capabilities, claims 4x speed boost

Google released Gemini 3.5 Flash, positioning it as an agent-first model designed for autonomous coding and multi-hour workflows. The company claims the model outperforms its 3.1 Pro predecessor on coding and agentic benchmarks while running 4x faster than competing frontier models, with an optimized version achieving 12x speed gains.

model release

Google releases Gemini 3.5 Flash at half the price of frontier models, announces Omni world model

Google released Gemini 3.5 Flash, priced at half to one-third the cost of comparable frontier models, and announced it will become the default model in the Gemini app globally. The company also unveiled Omni, a world model for simulating physical environments, and Gemini Spark, an AI agent in beta testing.

model release

ByteDance releases Lance, 3B-parameter unified multimodal model handling image and video generation, editing, and unders

ByteDance has released Lance, a 3-billion parameter multimodal model that performs image and video generation, editing, and understanding within a single framework. The model was trained entirely from scratch using 128 A100 GPUs and achieves 84.67% on DPG-Bench and 74% on GenEval, competing with larger models despite its compact size.

Comments

Loading...