Deepseek v4 launching on Huawei chips exclusively, signaling China's AI independence progress
Deepseek v4 is launching in the coming weeks running exclusively on Huawei chips, marking a major milestone in China's effort to reduce dependency on foreign semiconductors. Chinese tech giants including Alibaba, Bytedance, and Tencent have ordered hundreds of thousands of Huawei Ascend 950PR units to deploy the model through their cloud services.
Deepseek v4 Launching Entirely on Huawei Chips
Deepseek v4 is expected to launch within weeks running entirely on Huawei's Ascend 950PR chips, according to reporting from The Information. The move represents a significant shift in China's AI infrastructure strategy: Nvidia received no early access to the model for review, and only Chinese chip manufacturers got preview access.
Chip Performance and Demand Surge
Huawei claims the Ascend 950PR delivers approximately 2.8x the computing power of Nvidia's H20 chip, though it remains below the H200's performance. The chip reportedly commands a 20 percent price premium following massive orders from major Chinese tech companies.
Alibaba, Bytedance, and Tencent have collectively ordered hundreds of thousands of Ascend 950PR units to run Deepseek v4 through cloud services and integrate it into their own applications, according to five people familiar with the matter. This concentration of orders from China's largest tech firms signals confidence in both the model and domestic chip viability.
Development Partnership
Deepseek spent months collaborating with Huawei and chip designer Cambricon to port v4 to Chinese-made hardware. The effort reflects a broader strategy to decouple AI development from Western semiconductor supply chains, particularly following US export controls that have constrained chip availability for Chinese companies.
Huawei continues to face production bottlenecks stemming from these same export restrictions, and the surge in Deepseek v4 orders suggests that near-term demand is outstripping what it can supply.
What This Means
Deepseek v4's exclusive reliance on Huawei hardware marks a tangible outcome of China's multi-year push toward semiconductor self-sufficiency. The decision to exclude Nvidia from early access, a departure from the industry norm, signals confidence in domestic alternatives and reduces dependency on external validation. The aggressive procurement by Alibaba, Bytedance, and Tencent indicates the AI market sees viable alternatives to Nvidia, though performance gaps remain. Sustained production constraints and the 20 percent price premium suggest China's chip ecosystem still faces scaling challenges despite technical progress.