model releaseOpenAI

OpenAI Releases GPT-5.4 Image 2 with 272K Context Window and Image Generation

TL;DR

OpenAI has released GPT-5.4 Image 2, combining the GPT-5.4 reasoning model with image generation capabilities. The multimodal model features a 272K token context window and is priced at $8 per million input tokens and $15 per million output tokens.

April 21, 2026 · 9:35 PM1 min read

GPT-5.4 Image 2 — Quick Specs

Context window272K tokens

Input$8/1M tokens

Output$15/1M tokens

Compare GPT-5.4 Image 2 with other models →

OpenAI Releases GPT-5.4 Image 2 with 272K Context Window and Image Generation

OpenAI has released GPT-5.4 Image 2, a multimodal model that combines the company's GPT-5.4 reasoning capabilities with image generation. The model is available via OpenRouter's API under the identifier openai/gpt-5.4-image-2.

Technical Specifications

GPT-5.4 Image 2 features a 272,000 token context window and supports text, image, and file inputs with text and image outputs. OpenRouter lists pricing at $8.00 per million input tokens and $15.00 per million output tokens.

According to OpenRouter, the model "combines OpenAI's GPT-5.4 model with state-of-the-art image generation capabilities from GPT Image 2." The system is designed for what OpenRouter describes as "rich multimodal workflows," enabling users to move between reasoning, coding, and image generation tasks within the same context.

Availability and Access

The model is currently available exclusively through OpenRouter's API. OpenAI has not announced direct API access through its own platform, and no benchmark scores or parameter counts have been disclosed.

OpenRouter positions the model as suitable for workflows that require both analytical reasoning and visual content generation in a single session, leveraging the extended context window to maintain coherence across complex multimodal tasks.

What This Means

GPT-5.4 Image 2 represents OpenAI's continued expansion into multimodal AI, though the exclusive availability through OpenRouter raises questions about the model's official status and whether this is a full public release or a limited partnership rollout. The 272K context window is competitive with other frontier models, but without benchmark data or direct comparison to GPT-4 Vision or other multimodal systems, it's difficult to assess the model's capabilities independently. The pricing sits in the mid-range for multimodal models, making it accessible for production use cases that require both language understanding and image generation.

Source: openrouter.ai ↗

OpenAI GPT-5.4 multimodal image generation OpenRouter model release

model releaseJuly 20, 2026

Alibaba previews Qwen3.8 with 2.4 trillion parameters, claims second place without benchmark data

Alibaba unveiled Qwen3.8 at the World Artificial Intelligence Conference in Shanghai, claiming the 2.4 trillion parameter model ranks second only to Anthropic's Fable 5. The company provided no benchmark scores, model card, or independent verification to support the claim.

model releaseJuly 20, 2026

Alibaba releases Qwen 3.8, a 2.4 trillion parameter open-weight model claiming second place behind Fable 5

Alibaba has released Qwen 3.8, a 2.4 trillion parameter open-weight model that the company claims trails only Fable 5. The multimodal model processes images, videos, and documents, with a preview available through Alibaba's platforms at 10 percent of standard pricing.

model releaseJuly 20, 2026

Thinking Machines releases Inkling: 975B-parameter MoE model with Apache 2.0 license, first major US open-weight multimo

Thinking Machines Lab released Inkling, a mixture-of-experts model with 975B total parameters and 41B active parameters, trained on 45 trillion tokens across text, images, audio, and video. The Apache 2.0-licensed model supports up to 1M context and debuts alongside Inkling-Small (276B-A12B), marking what observers call the strongest US-based open-weight release to date.

model releaseJuly 20, 2026

Meituan launches LongCat 2.0: 1.6T parameter MoE model with 1M+ context window at $0.30 per 1M input tokens

Meituan has released LongCat 2.0, a sparse mixture-of-experts language model with 48 billion active parameters out of 1.6 trillion total. The model features a 1,049,000 token context window and costs $0.30 per 1M input tokens and $1.20 per 1M output tokens.

OpenAI Releases GPT-5.4 Image 2 with 272K Context Window and Image Generation

GPT-5.4 Image 2 — Quick Specs

OpenAI Releases GPT-5.4 Image 2 with 272K Context Window and Image Generation

Technical Specifications

Availability and Access

What This Means

Related Articles

Alibaba previews Qwen3.8 with 2.4 trillion parameters, claims second place without benchmark data

Alibaba releases Qwen 3.8, a 2.4 trillion parameter open-weight model claiming second place behind Fable 5

Thinking Machines releases Inkling: 975B-parameter MoE model with Apache 2.0 license, first major US open-weight multimo

Meituan launches LongCat 2.0: 1.6T parameter MoE model with 1M+ context window at $0.30 per 1M input tokens

Comments