Model release | Mistral AI

Mistral Releases Medium 3.5: 128B Model with Cloud Coding Agents and 77.6% SWE-Bench Verified

TL;DR

Mistral AI released Medium 3.5, a 128B dense model with a 256K-token context window that scores 77.6% on SWE-Bench Verified. The model powers new remote coding agents in Mistral Vibe that run asynchronously in the cloud, plus a new Work mode in Le Chat for multi-step agentic tasks.

Mistral Medium 3.5 — Quick Specs

Context window: 256K tokens
Input: $1.50 per 1M tokens
Output: $7.50 per 1M tokens

Mistral AI released Mistral Medium 3.5, a 128B dense model with a 256K-token context window, built for long-running coding and productivity tasks. The model scores 77.6% on SWE-Bench Verified and 91.4 on τ³-Telecom, according to Mistral.

Mistral Medium 3.5 is now available in public preview under a modified MIT open-weights license. The company claims the model can run self-hosted on as few as four GPUs and combines instruction-following, reasoning, and coding in a single set of weights.
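
To put the four-GPU claim in context, the short calculation below estimates how much memory the weights alone would occupy per GPU. It is a rough sketch, not a deployment guide: the article does not say which numeric precision or GPU model Mistral has in mind, so the precisions listed are assumptions, and the figures exclude the KV cache, activations, and runtime overhead.

```python
PARAMS = 128e9  # parameter count stated in the article

# Bytes per parameter for common weight formats (assumed; not from Mistral).
BYTES_PER_PARAM = {"bf16": 2.0, "fp8": 1.0, "int4": 0.5}


def weight_gb_per_gpu(precision: str, num_gpus: int = 4) -> float:
    """Return weight memory per GPU in GB (1 GB = 1e9 bytes), split evenly."""
    total_bytes = PARAMS * BYTES_PER_PARAM[precision]
    return total_bytes / num_gpus / 1e9


for precision in BYTES_PER_PARAM:
    print(f"{precision}: {weight_gb_per_gpu(precision):.0f} GB per GPU")
# bf16: 64 GB per GPU
# fp8: 32 GB per GPU
# int4: 16 GB per GPU
```

Because a dense model uses every parameter for every token, the full weight footprint must fit across the GPUs, which is why the per-GPU figure depends so heavily on precision.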

Performance and Technical Details

According to Mistral, the 77.6% score puts the model ahead of Devstral 2 and Qwen 3.5 397B A17B on SWE-Bench Verified. The model includes a vision encoder trained from scratch to handle variable image sizes and aspect ratios.

Reasoning effort is configurable per request, allowing the same model to handle quick chat replies or complex agentic workflows. Mistral built the model specifically for long-horizon tasks with reliable tool calling and structured output.

Pricing and Availability

Mistral Medium 3.5 is priced at $1.50 per million input tokens and $7.50 per million output tokens via API. Open weights are available on Hugging Face. The model is also available through NVIDIA's build.nvidia.com platform and as NVIDIA NIM containerized inference microservices.
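
As a worked example of the API pricing, the sketch below computes the cost of a single request from the two listed rates. The token counts are hypothetical and chosen only for illustration; actual usage depends on the task.

```python
INPUT_USD_PER_MILLION = 1.50   # listed input price
OUTPUT_USD_PER_MILLION = 7.50  # listed output price


def request_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Return the cost in USD of one request at the listed per-token rates."""
    total = (input_tokens * INPUT_USD_PER_MILLION
             + output_tokens * OUTPUT_USD_PER_MILLION)
    return total / 1_000_000


# Hypothetical agent run: 200,000 tokens read, 20,000 tokens generated.
print(f"${request_cost_usd(200_000, 20_000):.2f}")  # $0.45
```

Output tokens cost five times as much as input tokens, so in this example the 20,000 generated tokens account for a third of the total even though they are a tenth of the input volume.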

Remote Coding Agents in Vibe

Mistral launched remote coding agents in the Vibe CLI that run asynchronously in the cloud. Users can start coding sessions from either the CLI or the Le Chat web interface, and the sessions run in isolated sandboxes while users step away.

Local CLI sessions can be "teleported" to the cloud, preserving session history, task state, and approvals. The agents integrate with GitHub for pull requests, Linear and Jira for issues, Sentry for incidents, and Slack or Teams for notifications.

According to Mistral, the system is designed for high-volume coding work like module refactors, test generation, dependency upgrades, and bug fixes. Multiple coding sessions can run in parallel.

Work Mode in Le Chat

Mistral introduced Work mode in Le Chat (Preview), powered by Medium 3.5, for multi-step agentic tasks beyond coding. The mode enables cross-tool workflows, research and synthesis, inbox triage, and issue creation in project management tools.

In Work mode, connectors are enabled by default, allowing the agent to access documents, mailboxes, calendars, and other systems. Every tool call and reasoning step is visible to users, with explicit approval required for sensitive actions like sending messages or modifying data.
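
The approval behavior described above follows a common human-in-the-loop pattern, which can be sketched generically as follows. This is an illustration only, not Mistral's implementation: the `ApprovalGate` class, the tool names, and the `SENSITIVE_TOOLS` set are all hypothetical.

```python
from dataclasses import dataclass, field
from typing import Callable

# Hypothetical tool names treated as sensitive (not from Mistral's docs).
SENSITIVE_TOOLS = {"send_message", "modify_record"}


@dataclass
class ToolCall:
    name: str
    arguments: dict


@dataclass
class ApprovalGate:
    approve: Callable[[ToolCall], bool]      # asks the user; True allows the call
    log: list = field(default_factory=list)  # every call is recorded for display

    def run(self, call: ToolCall, tools: dict) -> object:
        """Execute a tool call, asking for approval first if it is sensitive."""
        if call.name in SENSITIVE_TOOLS and not self.approve(call):
            self.log.append((call.name, "denied"))
            return None
        result = tools[call.name](**call.arguments)
        self.log.append((call.name, "executed"))
        return result


tools = {
    "search_docs": lambda query: f"results for {query}",
    "send_message": lambda to, body: f"sent to {to}",
}
gate = ApprovalGate(approve=lambda call: False)  # a user who declines everything
print(gate.run(ToolCall("search_docs", {"query": "Q3 roadmap"}), tools))
print(gate.run(ToolCall("send_message", {"to": "team", "body": "hi"}), tools))
print(gate.log)
```

In this sketch the read-only search runs without a prompt, while the message send is blocked because the stand-in user declines it; both outcomes land in the log, mirroring the idea that every call stays visible.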

What This Means

Mistral's combination of a strong open-weights coding model with cloud-based agent infrastructure directly addresses the friction in current AI coding workflows, where developers must babysit agent runs. The 77.6% SWE-Bench Verified score positions Medium 3.5 competitively against larger models, while the claimed four-GPU deployment requirement could enable wider self-hosting.

The pricing of $1.50 per million input tokens and $7.50 per million output tokens undercuts similar-capability models from competitors, though real-world performance on complex codebases will determine adoption. The integration of coding agents into Le Chat and the new Work mode signals Mistral's push beyond chat interfaces into persistent, multi-step autonomous workflows.
