Mistral Releases Mistral 3 Family: 675B-Parameter Large 3 MoE and Three Edge Models Under Apache 2.0
Mistral has released Mistral 3, which includes Mistral Large 3 (a sparse mixture-of-experts model with 41B active and 675B total parameters) and three Ministral 3 edge models (3B, 8B, 14B). All models are multimodal, are released under the Apache 2.0 license, and are available today on multiple platforms.
Mistral has released Mistral 3, a model family spanning from 3B to 675B parameters, all under the Apache 2.0 license. The release includes Mistral Large 3, a sparse mixture-of-experts architecture with 41B active parameters and 675B total parameters, alongside three Ministral 3 edge models at 3B, 8B, and 14B sizes.
Mistral Large 3 Specifications
Mistral Large 3 was trained from scratch on 3,000 NVIDIA H200 GPUs. According to Mistral, the model ranks #2 among open-source non-reasoning models on LMArena and #6 among all open-source models. The company claims the model achieves "parity with the best instruction-tuned open-weight models" on general prompts.
Pricing for Mistral Large 3:
- Input: $0.50 per 1M tokens
- Output: $1.50 per 1M tokens
Both base and instruction-tuned versions are available. A reasoning version is planned for a future release.
Ministral 3 Edge Models
The Ministral 3 series includes three parameter sizes: 3B, 8B, and 14B. Each size offers base, instruct, and reasoning variants with image understanding capabilities. Mistral claims the 14B reasoning variant achieves 85% on AIME 2025.
Pricing for Ministral 3 8B (pricing for other sizes not disclosed):
- Input: $0.15 per 1M tokens
- Output: $0.15 per 1M tokens
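The per-token prices listed for Mistral Large 3 and Ministral 3 8B can be turned into a per-request estimate with simple arithmetic. The sketch below uses only the prices quoted in this article; the `Pricing` class, the `request_cost` helper, and the example workload of 2,000 input and 500 output tokens are hypothetical and chosen purely for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    input_per_million: float   # USD per 1M input tokens
    output_per_million: float  # USD per 1M output tokens


# Prices as listed in the article.
LARGE_3 = Pricing(input_per_million=0.50, output_per_million=1.50)
MINISTRAL_3_8B = Pricing(input_per_million=0.15, output_per_million=0.15)


def request_cost(pricing: Pricing, input_tokens: int, output_tokens: int) -> float:
    """Return the estimated USD cost of one request at the listed rates."""
    total = (input_tokens * pricing.input_per_million
             + output_tokens * pricing.output_per_million)
    return total / 1_000_000


# ASSUMPTION: a hypothetical request size, not a figure from Mistral.
INPUT_TOKENS, OUTPUT_TOKENS = 2_000, 500

for name, pricing in [("Mistral Large 3", LARGE_3),
                      ("Ministral 3 8B", MINISTRAL_3_8B)]:
    cost = request_cost(pricing, INPUT_TOKENS, OUTPUT_TOKENS)
    print(f"{name}: ${cost:.6f} per request")
```

Actual bills may differ from this estimate, since it ignores any volume discounts, caching, or platform-specific pricing that a given provider might apply.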
According to Mistral, the instruct models generate "an order of magnitude fewer tokens" than comparable models while matching or exceeding performance.
Technical Implementation
Mistral partnered with NVIDIA, vLLM, and Red Hat for deployment optimization. The company released a checkpoint in NVFP4 format using llm-compressor, enabling Mistral Large 3 to run on a single 8×A100 or 8×H100 node via vLLM. NVIDIA integrated Blackwell attention and MoE kernels for efficient inference on GB200 NVL72 systems.
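A rough memory estimate helps explain why a 4-bit checkpoint matters for the single-node claim. The sketch below is a back-of-the-envelope calculation, not a figure from Mistral. It assumes 80 GB per GPU, assumes every weight is stored in NVFP4, and assumes roughly 4.5 bits per weight (4-bit values plus one 8-bit scale per 16-value block). It also ignores KV cache, activations, and runtime overhead.

```python
TOTAL_PARAMS = 675e9   # total parameters, from the article
GPUS_PER_NODE = 8      # 8×A100 or 8×H100, from the article
GPU_MEMORY_GB = 80     # ASSUMPTION: 80 GB cards
NVFP4_BITS = 4.5       # ASSUMPTION: 4-bit values + an 8-bit scale per 16 values
BF16_BITS = 16         # 16-bit weights, for comparison


def weight_memory_gb(params: float, bits_per_weight: float) -> float:
    """Return the approximate weight storage in GB (1 GB = 1e9 bytes)."""
    return params * bits_per_weight / 8 / 1e9


node_gb = GPUS_PER_NODE * GPU_MEMORY_GB
nvfp4_gb = weight_memory_gb(TOTAL_PARAMS, NVFP4_BITS)
bf16_gb = weight_memory_gb(TOTAL_PARAMS, BF16_BITS)

print(f"Node memory:   {node_gb} GB")
print(f"NVFP4 weights: {nvfp4_gb:.1f} GB (fits: {nvfp4_gb < node_gb})")
print(f"16-bit weights: {bf16_gb:.1f} GB (fits: {bf16_gb < node_gb})")
```

Under these assumptions the 4-bit weights leave headroom on an eight-GPU node, while 16-bit weights alone would exceed it. Real deployments need additional memory for the KV cache and activations, so the usable margin is smaller than this estimate suggests.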
For edge deployment, NVIDIA delivers optimized deployments on DGX Spark, RTX PCs, and Jetson devices.
Availability
All Mistral 3 models are available today on Mistral AI Studio, Amazon Bedrock, Azure Foundry, Hugging Face, Modal, IBM WatsonX, OpenRouter, Fireworks, Unsloth AI, and Together AI. NVIDIA NIM and AWS SageMaker availability is coming soon.
What This Means
By releasing a 675B-parameter model under Apache 2.0, Mistral has put one of the largest open-weight models to date under a permissive license, which could accelerate enterprise adoption of open-weight alternatives to proprietary models. Because the sparse MoE architecture activates only 41B of its parameters at a time, Large 3 should be computationally cheaper to run than a dense model of similar capability. Real-world cost-effectiveness will still depend on the serving infrastructure required and on how much efficiency the optimized NVFP4 format actually delivers.
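The efficiency argument can be made concrete with the two parameter counts from the release. The sketch below computes the active fraction and compares per-token compute against a hypothetical dense model of the same total size. It relies on the common rule of thumb of about 2 FLOPs per active parameter per token for a forward pass, which is an approximation that ignores attention and routing costs and is not a figure from Mistral.

```python
TOTAL_PARAMS = 675e9   # total parameters, from the article
ACTIVE_PARAMS = 41e9   # active parameters, from the article


def forward_flops_per_token(active_params: float) -> float:
    """Approximate forward-pass FLOPs per token (ASSUMPTION: ~2 per parameter)."""
    return 2 * active_params


active_fraction = ACTIVE_PARAMS / TOTAL_PARAMS
moe_flops = forward_flops_per_token(ACTIVE_PARAMS)
# Hypothetical dense model with the same total parameter count.
dense_flops = forward_flops_per_token(TOTAL_PARAMS)
compute_ratio = dense_flops / moe_flops

print(f"Active fraction: {active_fraction:.1%}")
print(f"Dense-to-MoE compute ratio per token: {compute_ratio:.1f}x")
```

The ratio only describes compute per token. All 675B parameters still have to be stored and served, which is why memory footprint and infrastructure remain the deciding factors for cost.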