model releaseDeepSeek

DeepSeek Releases V4-Flash-Base: 292B Parameter Base Model

TL;DR

DeepSeek has released V4-Flash-Base, a 292 billion parameter base model now available on Hugging Face. The model uses BF16, I64, F32, and F8_E4M3 tensor types and is distributed in Safetensors format.

1 min read
0

DeepSeek V4-Flash-Base: 292B Parameter Base Model Released

DeepSeek has released V4-Flash-Base, a 292 billion parameter base model now available on Hugging Face. The model represents the base version of DeepSeek's V4-Flash series.

Technical Specifications

The model contains 292 billion parameters and supports multiple tensor types: BF16 (bfloat16), I64 (64-bit integer), F32 (32-bit float), and F8_E4M3 (8-bit float in E4M3 format). Files are distributed in the Safetensors format, which provides safer serialization than traditional pickle-based formats.

Availability and Deployment

The model weights are available for download on Hugging Face as part of a collection containing 4 items. According to the Hugging Face listing, no inference providers currently support deployment of this model. The collection was last updated approximately 4 hours ago and has 307 downloads.

Missing Information

DeepSeek has not yet published a model card with detailed information about training data, benchmark performance, capabilities, or pricing. Context window size, training cutoff date, and specific use cases remain undisclosed. As a base model, V4-Flash-Base typically requires fine-tuning for specific tasks, unlike instruction-tuned variants.

What This Means

The release of a 292B parameter base model signals DeepSeek's continued development of large-scale models, though the lack of documentation makes technical evaluation impossible at this stage. The "Flash" designation suggests optimization for speed, consistent with other models in the industry using similar naming conventions. The use of F8_E4M3 tensor types indicates potential support for efficient inference through quantization. Without benchmark scores or a detailed model card, organizations should wait for complete documentation before considering deployment.

Related Articles

model release

NVIDIA Releases Nemotron-3-Ultra: 550B Parameter Model with 1M Token Context and Configurable Reasoning

NVIDIA released Nemotron-3-Ultra-550B-A55B-NVFP4, a 550B parameter model with 55B active parameters, featuring a 1M token context window and configurable reasoning mode. The model uses a hybrid LatentMoE architecture combining Mamba-2, Mixture-of-Experts, and Attention layers with Multi-Token Prediction, trained with NVIDIA's NVFP4 quantization-aware approach.

model release

Ideogram 4: 9.3B parameter open-weight text-to-image model with native 2K resolution and structured JSON prompting

Ideogram has released Ideogram 4, its first open-weight text-to-image model with 9.3 billion parameters. The model supports native 2K resolution, structured JSON prompting with bounding-box layout controls, and is available in nf4 and fp8 quantizations under a non-commercial license.

model release

Alibaba's Qwen Releases Qwen3.7 Plus: 1M Context Window at $0.40 Per Million Input Tokens

Alibaba's Qwen has released Qwen3.7 Plus, a multimodal model with a 1 million token context window. The model accepts text and image input with text output, priced at $0.40 per million input tokens and $1.60 per million output tokens through OpenRouter's API.

model release

Microsoft launches MAI-Code-1 and MAI-Thinking-1 models to reduce OpenAI dependence

Microsoft announced two proprietary AI models at its Build developer conference: MAI-Code-1 for code generation and MAI-Thinking-1 for reasoning tasks. The models are designed to run on Azure infrastructure, allowing Microsoft to reduce costs from its $13 billion OpenAI investment while competing directly with Anthropic and Google.

Comments

Loading...