model release

Kwaipilot releases KAT-Coder-Pro V2 with 256K context for enterprise coding

TL;DR

Kwaipilot released KAT-Coder-Pro V2, the latest model in its KAT-Coder series, on March 27, 2026. The model features a 256,000-token context window and is priced at $0.30 per million input tokens and $1.20 per million output tokens. It targets enterprise-grade software engineering with focus on multi-system coordination and web aesthetics generation.


KAT-Coder-Pro V2 — Quick Specs

  • Context window: 256K tokens
  • Input: $0.30/1M tokens
  • Output: $1.20/1M tokens

Kwaipilot Releases KAT-Coder-Pro V2 for Enterprise Software Engineering

Kwaipilot has released KAT-Coder-Pro V2, positioning it as the latest iteration in its KAT-Coder series designed for complex enterprise software engineering and SaaS integration.

Specifications

KAT-Coder-Pro V2 features a 256,000-token context window—sufficient for handling large codebases and extended development sessions. Pricing is set at $0.30 per million input tokens and $1.20 per million output tokens, making it competitively positioned for production use cases.
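At these rates, per-request cost is simple arithmetic. A back-of-envelope sketch (token counts are illustrative, not benchmarks):

```python
# Cost estimate at the published KAT-Coder-Pro V2 rates:
# $0.30 per 1M input tokens, $1.20 per 1M output tokens.
INPUT_RATE = 0.30 / 1_000_000   # dollars per input token
OUTPUT_RATE = 1.20 / 1_000_000  # dollars per output token

def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Dollar cost of a single request."""
    return input_tokens * INPUT_RATE + output_tokens * OUTPUT_RATE

# Example: a large-codebase prompt of 200K input tokens with a 10K-token reply.
cost = request_cost(200_000, 10_000)
print(f"${cost:.3f}")  # $0.060 input + $0.012 output = $0.072
```

Filling most of the 256K window on every call, in other words, costs a few cents per request rather than dollars, which is the main practical upside of the mid-range pricing.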

The model was released on March 27, 2026, and is available through OpenRouter alongside other providers.
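Since OpenRouter exposes an OpenAI-compatible chat-completions endpoint, calling the model should follow the usual pattern. A minimal sketch; the model slug below is an assumption for illustration, so check OpenRouter's model list for the actual identifier:

```python
import json

# OpenRouter's OpenAI-compatible chat-completions endpoint.
OPENROUTER_URL = "https://openrouter.ai/api/v1/chat/completions"
MODEL_SLUG = "kwaipilot/kat-coder-pro-v2"  # hypothetical slug, verify before use

def build_request(prompt: str) -> dict:
    """Assemble the request payload; POST it with any HTTP client,
    adding an `Authorization: Bearer <OPENROUTER_API_KEY>` header."""
    return {
        "model": MODEL_SLUG,
        "messages": [{"role": "user", "content": prompt}],
    }

payload = build_request("Refactor this function to be thread-safe.")
print(json.dumps(payload, indent=2))
```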

Key Capabilities

According to Kwaipilot, the model builds on the "agentic coding strengths of earlier versions," with emphasis on:

  • Large-scale production environments
  • Multi-system coordination
  • Integration across modern software stacks
  • Web aesthetics generation for production-grade landing pages and presentation decks

The inclusion of web design capabilities distinguishes it from traditional code-focused models, suggesting broader developer experience applications beyond backend systems.

Market Context

Kwaipilot is not currently listed in major AI model directories, making KAT-Coder-Pro V2 a less widely recognized entrant than established coding models from Anthropic, OpenAI, and Meta. Its positioning around enterprise multi-system coordination and visual component generation suggests it is aimed at full-stack development teams rather than specialized coding roles.

What This Means

KAT-Coder-Pro V2 enters a competitive space occupied by Claude 3.5 Sonnet (200K context), GPT-4o (128K context), and Llama 3.1-405B. Its 256K context window exceeds those alternatives, though not by a wide margin. Pricing at $0.30/$1.20 per 1M tokens positions it as mid-range: not the cheapest option, but less expensive than flagship models from tier-one providers. The emphasis on multi-system coordination and web design suggests Kwaipilot is targeting teams building full-stack applications, though the model's performance benchmarks remain undisclosed. Builders should verify it against existing models on their specific use cases before migrating.

