model releaseCohere

Cohere Releases North Mini Code 1.0: 30B-Parameter MoE Model With 256K Context for Agentic Coding

TL;DR

Cohere Labs has released North Mini Code 1.0, a 30B-parameter sparse Mixture-of-Experts model with 3B active parameters and a 256K context window. The Apache 2.0-licensed model is optimized for agentic software engineering, featuring 128 experts with 8 activated per token, and trained specifically for tool use in coding tasks.

June 9, 2026 · 5:21 PM2 min read

North Mini Code 1.0 — Quick Specs

Context window256K tokens

Compare North Mini Code 1.0 with other models →

Cohere Releases North Mini Code 1.0: 30B-Parameter MoE Model With 256K Context for Agentic Coding

Cohere Labs has released North Mini Code 1.0, a 30B-parameter sparse Mixture-of-Experts (MoE) model with 3B active parameters designed for code generation and agentic software engineering tasks.

Model Architecture and Specifications

North Mini Code 1.0 uses a decoder-only Transformer architecture with 128 experts, activating 8 per token. The model features:

Total parameters: 30B (3B active)
Context window: 256K tokens with 64K max output
License: Apache 2.0
Architecture: Sparse MoE with interleaved attention (3:1 ratio of sliding-window with RoPE to global attention without positional embeddings)
Training: Two-stage post-training with supervised fine-tuning (SFT) followed by reinforcement learning with verifiable rewards (RLVR)

Performance on Agentic Coding Benchmarks

Cohere evaluated the model on SWE-Bench Verified, SWE-Bench Pro, Terminal-Bench v2, and Terminal-Bench Hard using the Swe-Agent harness v1.1.0. The company also tested on SciCode and LiveCodeBench v6 for complex code generation. All benchmarks used temperature=1.0 and top_p=0.95 across 3 seeds. Specific benchmark scores were not disclosed in the model card.

Tool Use and Integration

The model supports native tool-use capabilities through chat templates in Transformers. According to Cohere, North Mini Code 1.0 features "interleaved thinking" where the model generates reasoning content alongside tool calls. The company recommends passing all model-generated thinking content to future agentic steps for optimal performance.

Integration requires installing Transformers from source and, for vLLM deployment, using the main branch with Cohere's melody library (version 0.9.0+). The model uses a bash tool for terminal command execution.

Availability

North Mini Code 1.0 is available on Hugging Face and can be tested in OpenCode and Cohere's hosted Hugging Face Space. The model requires tensor parallelism (recommended -tp 2) for vLLM serving with a max model length of 320,000 tokens.

What This Means

This release targets the growing market for AI-powered software engineering tools, competing with models like GitHub Copilot and Amazon CodeWhisperer. The 30B-parameter count with 3B active parameters via sparse MoE suggests Cohere is prioritizing inference efficiency over raw model size—a practical choice for deployment in development environments. The Apache 2.0 license and open weights make this accessible for commercial use, though the lack of disclosed pricing and specific benchmark comparisons to competing models leaves performance questions open. The emphasis on agentic capabilities and tool use reflects the industry shift from simple code completion to multi-step reasoning and execution workflows.

Source: huggingface.co ↗

Cohere North Mini Code code generation MoE open source agentic AI tool use SWE-Bench

model releaseJuly 31, 2026

DeepSeek Releases V4-Flash-0731, a 304B-Parameter Model Claiming to Beat Its Own Pro Preview on Agentic Benchmarks

DeepSeek has released DeepSeek-V4-Flash-0731, a 304-billion-parameter model that supersedes its earlier preview version with what the company describes as substantially enhanced agentic capabilities. According to DeepSeek's technical report, the model outperforms the larger DeepSeek-V4-Pro (Preview) on several coding and agent benchmarks despite a far smaller activated parameter count.

model releaseAugust 1, 2026

OpenAI Reportedly Developing 'Astra' Model Family for Multi-Day Autonomous Problem-Solving

OpenAI is reportedly developing a new model family called Astra, designed to coordinate multiple agents on complex problems over hours or days. The models are already in testing and would be first to go through a planned U.S. government pre-release review, according to The Information.

model releaseJuly 31, 2026

DeepSeek Releases V4-Flash-0731, a 284B-Parameter Model That Beats Its Own Larger Pro Variant on Agentic Benchmarks

DeepSeek has shipped the full release of DeepSeek-V4-Flash-0731, a 284B-parameter model that according to DeepSeek outperforms its own larger V4-Pro (Preview) on agentic and coding benchmarks. Unsloth has published quantized GGUF versions, with lossless 8-bit weights requiring 162GB of storage.

model releaseJuly 31, 2026

Thinking Machines Lab Releases Inkling Small: 276B MoE Model with 524K Context Window

Thinking Machines Lab has released Inkling Small, an open-weight multimodal mixture-of-experts model with 12B active parameters out of 276B total and a 524K token context window. The model targets reasoning, coding, agentic workflows, and multilingual use cases at $0.58 per 1M input tokens and $1.44 per 1M output tokens.

Cohere Releases North Mini Code 1.0: 30B-Parameter MoE Model With 256K Context for Agentic Coding

North Mini Code 1.0 — Quick Specs

Cohere Releases North Mini Code 1.0: 30B-Parameter MoE Model With 256K Context for Agentic Coding

Model Architecture and Specifications

Performance on Agentic Coding Benchmarks

Tool Use and Integration

Availability

What This Means

Related Articles

DeepSeek Releases V4-Flash-0731, a 304B-Parameter Model Claiming to Beat Its Own Pro Preview on Agentic Benchmarks

OpenAI Reportedly Developing 'Astra' Model Family for Multi-Day Autonomous Problem-Solving

DeepSeek Releases V4-Flash-0731, a 284B-Parameter Model That Beats Its Own Larger Pro Variant on Agentic Benchmarks

Thinking Machines Lab Releases Inkling Small: 276B MoE Model with 524K Context Window

Comments