model releaseOpenAI

OpenAI's GPT-5.4 mini now available in GitHub Copilot

TL;DR

OpenAI has released GPT-5.4 mini, the lightweight variant of its agentic coding model GPT-5.4, in GitHub Copilot. The model represents OpenAI's highest-performing mini offering to date for code generation and completion tasks.

1 min read
0

GPT-5.4 mini — Quick Specs

Context window400K tokens
Input$0.75/1M tokens
Output$4.5/1M tokens

OpenAI's GPT-5.4 mini Now Generally Available in GitHub Copilot

OpenAI's GPT-5.4 mini has begun rolling out to GitHub Copilot users as a generally available option. According to GitHub, the model is the latest fast-optimized version of GPT-5.4, OpenAI's agentic coding model designed for code generation and completion.

Performance Claims

GitHub states that GPT-5.4 mini represents OpenAI's highest-performing mini model to date in early internal testing. The company has not yet disclosed specific benchmark scores, latency metrics, or token pricing for the model variant.

What's Included

GPT-5.4 mini joins GitHub Copilot's existing model roster, which already includes access to various OpenAI models. The release appears focused on providing developers with a faster, more cost-efficient option for real-time code completion while maintaining competitive performance compared to previous mini variants.

No specific context window size, input/output pricing per million tokens, or parameter count has been disclosed in the announcement.

What This Means

GPT-5.4 mini positions GitHub Copilot users to access a presumably faster inference speed relative to full GPT-5.4, which could reduce latency in IDE integration. However, without published benchmarks or pricing, the concrete advantages over existing models remain unclear. The release follows the industry pattern of offering tiered model variants—full-scale and mini editions—to balance performance with speed and cost across different use cases. Developers using GitHub Copilot should expect this as an additional option within their existing subscription tier.

Related Articles

model release

MiniMax Releases M3: 428B-Parameter Multimodal Model with 1M Context Window and 15× Decode Speedup

MiniMax has released M3, a multimodal model with approximately 428 billion parameters and 23 billion activated parameters. The model supports a 1 million token context window and uses MiniMax Sparse Attention to achieve 9× prefill and 15× decode speedups compared to its predecessor M2.

model release

Anthropic releases Claude Fable 5, first public version of Mythos model for code generation

Anthropic has released Claude Fable 5, the first publicly available version of its Mythos model line. University of Pennsylvania AI researcher Ethan Mollick reports the model can execute multi-page specifications for up to 12 hours and generate complete video games from single prompts in Claude Code.

model release

Anthropic releases Fable 5, bringing capabilities of restricted Mythos model to public with $10/$50 per 1M token pricing

Anthropic has released Fable 5, making capabilities from its previously restricted Mythos model available to the public. The company claims Fable 5 beats GPT-5.5, Gemini 3.1 Pro, and its own Opus 4.8 in internal testing, with pricing set at $10 per million input tokens and $50 per million output tokens after a free trial period ending June 22.

model release

Anthropic releases Claude Fable 5, first public Mythos-class model at $10/$50 per million tokens

Anthropic has released Claude Fable 5, its first publicly available Mythos-class model, at $10 per million input tokens and $50 per million output tokens—less than half the price of Claude Mythos Preview. The model includes safeguards that redirect sensitive queries to Claude Opus 4.8 in less than 5% of sessions.

Comments

Loading...