model releaseOpenAI

OpenAI Releases GPT-5.4 Image 2 with 272K Context Window and Image Generation

TL;DR

OpenAI has released GPT-5.4 Image 2, combining the GPT-5.4 reasoning model with image generation capabilities. The multimodal model features a 272K token context window and is priced at $8 per million input tokens and $15 per million output tokens.

1 min read
0

GPT-5.4 Image 2 — Quick Specs

Context window272K tokens
Input$8/1M tokens
Output$15/1M tokens

OpenAI Releases GPT-5.4 Image 2 with 272K Context Window and Image Generation

OpenAI has released GPT-5.4 Image 2, a multimodal model that combines the company's GPT-5.4 reasoning capabilities with image generation. The model is available via OpenRouter's API under the identifier openai/gpt-5.4-image-2.

Technical Specifications

GPT-5.4 Image 2 features a 272,000 token context window and supports text, image, and file inputs with text and image outputs. OpenRouter lists pricing at $8.00 per million input tokens and $15.00 per million output tokens.

According to OpenRouter, the model "combines OpenAI's GPT-5.4 model with state-of-the-art image generation capabilities from GPT Image 2." The system is designed for what OpenRouter describes as "rich multimodal workflows," enabling users to move between reasoning, coding, and image generation tasks within the same context.

Availability and Access

The model is currently available exclusively through OpenRouter's API. OpenAI has not announced direct API access through its own platform, and no benchmark scores or parameter counts have been disclosed.

OpenRouter positions the model as suitable for workflows that require both analytical reasoning and visual content generation in a single session, leveraging the extended context window to maintain coherence across complex multimodal tasks.

What This Means

GPT-5.4 Image 2 represents OpenAI's continued expansion into multimodal AI, though the exclusive availability through OpenRouter raises questions about the model's official status and whether this is a full public release or a limited partnership rollout. The 272K context window is competitive with other frontier models, but without benchmark data or direct comparison to GPT-4 Vision or other multimodal systems, it's difficult to assess the model's capabilities independently. The pricing sits in the mid-range for multimodal models, making it accessible for production use cases that require both language understanding and image generation.

Related Articles

model release

Alibaba's Qwen Releases Qwen3.7 Plus: 1M Context Window at $0.40 Per Million Input Tokens

Alibaba's Qwen has released Qwen3.7 Plus, a multimodal model with a 1 million token context window. The model accepts text and image input with text output, priced at $0.40 per million input tokens and $1.60 per million output tokens through OpenRouter's API.

product update

OpenAI launches Lockdown Mode to block prompt injection data exfiltration attacks

OpenAI has released Lockdown Mode, an optional security setting that protects against prompt injection attacks by limiting network requests and image fetching in ChatGPT. The feature is designed for users handling sensitive data and disables some ChatGPT capabilities including Deep Research and Agent Mode.

product update

OpenAI upgrades ChatGPT memory architecture with automatic 'dreaming' synthesis, now available to free users

OpenAI is rolling out a new memory architecture for ChatGPT that automatically synthesizes information across conversations without explicit user prompts. The company announced free tier users will access memory features for the first time, while Plus and Pro users receive expanded memory capacity.

model release

NVIDIA Releases Nemotron 3.5 Content Safety: 4B-Parameter Multimodal Model with Custom Policy Enforcement and 140-Langua

NVIDIA has released Nemotron 3.5 Content Safety, a 4B-parameter model built on Google Gemma 3 4B IT that provides multimodal safety classification across approximately 140 languages. The model includes a 128K context window, custom enterprise policy enforcement, auditable reasoning traces, and is releasing its training dataset.

Comments

Loading...