model releaseOpenAI

OpenAI Releases GPT-5.4 Image 2 with 272K Context Window and Image Generation

TL;DR

OpenAI has released GPT-5.4 Image 2, combining the GPT-5.4 reasoning model with image generation capabilities. The multimodal model features a 272K token context window and is priced at $8 per million input tokens and $15 per million output tokens.

1 min read
0

GPT-5.4 Image 2 — Quick Specs

Context window272K tokens
Input$8/1M tokens
Output$15/1M tokens

OpenAI Releases GPT-5.4 Image 2 with 272K Context Window and Image Generation

OpenAI has released GPT-5.4 Image 2, a multimodal model that combines the company's GPT-5.4 reasoning capabilities with image generation. The model is available via OpenRouter's API under the identifier openai/gpt-5.4-image-2.

Technical Specifications

GPT-5.4 Image 2 features a 272,000 token context window and supports text, image, and file inputs with text and image outputs. OpenRouter lists pricing at $8.00 per million input tokens and $15.00 per million output tokens.

According to OpenRouter, the model "combines OpenAI's GPT-5.4 model with state-of-the-art image generation capabilities from GPT Image 2." The system is designed for what OpenRouter describes as "rich multimodal workflows," enabling users to move between reasoning, coding, and image generation tasks within the same context.

Availability and Access

The model is currently available exclusively through OpenRouter's API. OpenAI has not announced direct API access through its own platform, and no benchmark scores or parameter counts have been disclosed.

OpenRouter positions the model as suitable for workflows that require both analytical reasoning and visual content generation in a single session, leveraging the extended context window to maintain coherence across complex multimodal tasks.

What This Means

GPT-5.4 Image 2 represents OpenAI's continued expansion into multimodal AI, though the exclusive availability through OpenRouter raises questions about the model's official status and whether this is a full public release or a limited partnership rollout. The 272K context window is competitive with other frontier models, but without benchmark data or direct comparison to GPT-4 Vision or other multimodal systems, it's difficult to assess the model's capabilities independently. The pricing sits in the mid-range for multimodal models, making it accessible for production use cases that require both language understanding and image generation.

Related Articles

model release

OpenAI releases ChatGPT Images 2.0 with 3840x2160 resolution at $30 per 1M output tokens

OpenAI released ChatGPT Images 2.0, pricing output tokens at $30 per million with maximum resolution of 3840x2160 pixels. CEO Sam Altman claims the improvement from gpt-image-1 to gpt-image-2 equals the jump from GPT-3 to GPT-5.

model release

OpenAI releases ChatGPT Images 2.0 with integrated reasoning and text-image composition

OpenAI has released ChatGPT Images 2.0, which integrates reasoning capabilities to generate complex visual compositions combining text and images. The model supports aspect ratios from 3:1 to 1:3 and outputs up to 2K resolution, with advanced features available to Plus, Pro, Business, and Enterprise users.

product update

OpenAI's ChatGPT Images 2.0 adds web search and multi-image generation with reasoning mode

OpenAI released ChatGPT Images 2.0, powered by the new GPT Image 2 model. The update enables web search integration for paid subscribers in thinking mode, generates up to eight images from a single prompt while maintaining visual consistency, and supports 2K resolution output.

model release

InclusionAI releases Ling-2.6-flash: 104B parameter model with 7.4B active parameters, free on OpenRouter

InclusionAI has released Ling-2.6-flash, an instruction-tuned model with 104 billion total parameters and 7.4 billion active parameters, available free through OpenRouter. The model features a 262,144-token context window and is designed for agent workflows requiring fast responses and high token efficiency.

Comments

Loading...