Mistral Releases Codestral Embed, Code-Specialized Embedding Model at $0.15 Per Million Tokens
Mistral AI has released Codestral Embed, its first code-specialized embedding model, priced at $0.15 per million tokens. The model features an 8192-token context window and claims to outperform Voyage Code 3, Cohere Embed v4.0, and OpenAI's large embedding model on code retrieval benchmarks.
Codestral Embed — Quick Specs
Mistral Releases Codestral Embed, Code-Specialized Embedding Model at $0.15 Per Million Tokens
Mistral AI has released Codestral Embed, its first embedding model specialized for code retrieval and semantic understanding. The model is available via API under the identifier codestral-embed-2505 at $0.15 per million tokens, with a 50% discount available through Mistral's batch API.
Technical Specifications
Codestral Embed supports an 8192-token context window and offers flexible output configurations. The model can generate embeddings at different dimensions and precisions, with dimensions ordered by relevance. According to Mistral, even at reduced dimension 256 with int8 precision, the model outperforms competing code embedding models.
For retrieval use cases, Mistral recommends chunking datasets into 3000-character segments with 1000-character overlap rather than using the full context window, as larger chunks reportedly degrade retrieval performance.
Benchmark Performance
Mistral claims Codestral Embed outperforms Voyage Code 3, Cohere Embed v4.0, and OpenAI's large embedding model across multiple code retrieval benchmarks. The company highlights particularly strong performance on:
- SWE-Bench Lite: Real-world GitHub issue resolution, identifying files requiring modification
- CodeSearchNet: Code-to-code and documentation-to-code retrieval from GitHub repositories
- Text2SQL tasks: Spider, WikiSQL, and synthetic SQL generation benchmarks
- Algorithm challenges: APPS, CodeChef, MBPP+, and competitive programming problems
- Data science: DS-1000 matching questions to implementations
Mistral evaluated the model across categories including SWE-Bench, code-to-code retrieval, Text2Code from GitHub data, Text2SQL, algorithmic problems, and data science tasks. Specific numerical scores were not disclosed in the announcement.
Use Cases
The company positions Codestral Embed for four primary applications:
- Retrieval-augmented generation: Context retrieval for code completion, editing, and explanation in coding assistants and agent frameworks
- Semantic code search: Natural language or code-based queries across documentation and development tools
- Duplicate detection: Identifying functionally similar code segments for reuse optimization and license enforcement
- Code analytics: Unsupervised clustering for repository analysis and architecture pattern identification
Availability
Codestral Embed is accessible through Mistral's API and batch API. On-premises deployments require direct contact with Mistral's applied AI team. The company has published documentation and a cookbook with examples for code agent retrieval implementations.
What This Means
Codestral Embed represents Mistral's first specialized product for developer tooling, competing directly with established code embedding models from Voyage AI, Cohere, and OpenAI. The $0.15 per million token pricing positions it competitively in the embedding market, though direct cost comparisons depend on competitors' pricing for comparable dimensions and precision levels. The model's flexible dimension reduction feature could provide cost savings for applications where storage efficiency matters more than maximum retrieval accuracy. Its strong claimed performance on SWE-Bench Lite, a benchmark using real GitHub issues, suggests potential utility for coding agents and automated software engineering tools.
Related Articles
Mistral releases Vibe 2.0 terminal coding agent with custom subagents and Devstral 2 API pricing
Mistral AI released Vibe 2.0, a terminal-native coding agent powered by Devstral 2, adding custom subagents, multi-choice clarifications, and slash-command skills. Devstral 2 API pricing is now $0.40/M input tokens and $2.00/M output tokens, with a smaller variant at $0.10/$0.30 per million tokens.
Mistral Launches OCR API at $1 Per 1,000 Pages, Claims 94.89% Accuracy on Document Benchmarks
Mistral AI has released Mistral OCR, an API for extracting text and images from documents at $1 per 1,000 pages (approximately $0.50 with batch inference). The company claims 94.89% overall accuracy on its internal test set, comparing favorably to GPT-4o (89.77%), Gemini 2.0 Flash (88.69%), and Azure OCR (89.52%).
Mistral AI launches Connectors in Studio with MCP protocol integration and direct tool calling
Mistral AI has released Connectors in Studio, allowing developers to integrate custom MCP (Model Context Protocol) servers and built-in connectors via API/SDK. The release includes direct tool calling for deterministic workflows and human-in-the-loop approval flows for sensitive operations.
Mistral Releases Voxtral TTS: 4B Parameter Text-to-Speech Model at $0.016 per 1k Characters
Mistral AI has released Voxtral TTS, a 4B parameter text-to-speech model supporting 9 languages including English, French, German, Spanish, Dutch, Portuguese, Italian, Hindi, and Arabic. The model achieves 70ms latency for typical inputs and can clone voices from as little as 3 seconds of audio, priced at $0.016 per 1,000 characters.
Comments
Loading...