model release

Sakana AI Releases Fugu Ultra: Multi-Agent Orchestration System with 1M Context Window at $5/$30 per Million Tokens

TL;DR

Sakana AI has released Fugu Ultra, a multi-agent orchestration system that routes tasks across pools of underlying models rather than operating as a single monolithic model. The system supports a 1M token context window and is priced at $5 per million input tokens and $30 per million output tokens.

2 min read
0

Fugu Ultra — Quick Specs

Context window1000K tokens
Input$5/1M tokens
Output$30/1M tokens

Sakana AI Releases Fugu Ultra: Multi-Agent Orchestration System with 1M Context Window

Sakana AI has released Fugu Ultra, the higher-performance model in its Fugu family, now available through OpenRouter. Unlike traditional language models, Fugu Ultra is a learned multi-agent orchestration system trained to route tasks across a swappable pool of underlying models and recursively call instances of itself.

Architecture and Capabilities

According to Sakana AI, Fugu Ultra prioritizes answer quality on complex, multi-step reasoning, coding, and agentic workflows. The system supports:

  • 1 million token context window
  • Configurable reasoning effort
  • Tool calling
  • Built-in web search capabilities

Orchestration tokens consumed by the system are billed as standard input/output tokens, with no separate pricing tier for the routing logic.

Pricing

Fugu Ultra is priced at:

  • Input: $5 per million tokens
  • Output: $30 per million tokens

The effective price can be 60-80% lower when prompt caching is applied for repeated context, according to OpenRouter's monitoring data.

Technical Approach

Rather than training a single large model, Sakana AI's approach involves training a language model to act as an orchestrator, deciding which underlying models to route tasks to and when to recursively invoke additional instances of itself. This architecture represents a departure from the monolithic model paradigm adopted by most major AI labs.

The model was released June 24, 2026, according to OpenRouter's listing. Sakana AI has not disclosed benchmark scores, parameter count, or details about the underlying model pool at this time.

Availability

Fugu Ultra is currently available exclusively through OpenRouter, which forwards requests directly to Sakana AI's infrastructure with no intermediate routing layer.

What This Means

Sakana AI's orchestration approach represents a significant architectural departure from the single-model paradigm. By routing tasks to specialized models rather than attempting to embed all capabilities in one system, Fugu Ultra could potentially offer better cost-performance ratios for complex workflows. However, without published benchmarks, it's unclear how the system compares to frontier models like Claude 3.5 Sonnet or GPT-4 on standardized reasoning and coding tasks. The success of this approach will depend on whether the orchestration overhead is offset by more efficient task routing.

Related Articles

model release

Sakana AI releases Fugu orchestration model to route tasks across multiple AI vendors

Sakana AI released Fugu, an orchestration language model that routes tasks across multiple AI providers to reduce vendor lock-in risks. The Japanese AI firm positions Fugu as a solution to enterprise dependency on single monolithic AI APIs.

model release

Cohere releases North Mini Code, a 30B-parameter sparse MoE coding model with 256K context window, free on OpenRouter

Cohere has released North Mini Code, the first model in its North family and its first agentic coding model. The sparse mixture-of-experts architecture features 30B total parameters with 3B active, a 256K-token context window, and up to 64K tokens of output, available free via OpenRouter under Apache 2.0 license.

model release

Krea Releases 12-Billion Parameter Text-to-Image Model with 8-Step Generation

Krea.ai released Krea 2 Turbo, a 12-billion parameter diffusion transformer model for text-to-image generation. The open-weight model generates images in 8 inference steps and supports resolutions up to 2048x2048 pixels.

model release

Mistral OCR 4 Launches With Bounding Boxes, 170 Language Support at $2-4 Per 1,000 Pages

Mistral AI released OCR 4, a compact document extraction model that returns bounding boxes, block classification, and inline confidence scores alongside text. The model supports 170 languages, scores 85.20 on OlmOCRBench, and is priced at $4 per 1,000 pages via API ($2 with batch discount) or $5 per 1,000 pages through Document AI.

Comments

Loading...