model releaseOpenAI

OpenAI releases ChatGPT Images 2.0 with 3840x2160 resolution at $30 per 1M output tokens

TL;DR

OpenAI released ChatGPT Images 2.0, pricing output tokens at $30 per million with maximum resolution of 3840x2160 pixels. CEO Sam Altman claims the improvement from gpt-image-1 to gpt-image-2 equals the jump from GPT-3 to GPT-5.

2 min read
0

OpenAI releases ChatGPT Images 2.0 with 3840x2160 resolution at $30 per 1M output tokens

OpenAI released ChatGPT Images 2.0, its latest image generation model. CEO Sam Altman claims the leap from gpt-image-1 to gpt-image-2 is equivalent to jumping from GPT-3 to GPT-5 in text models.

Pricing and specifications

The model charges $30 per million output tokens. A maximum resolution 3840x2160 image generated 13,342 output tokens, costing approximately $0.40. The model includes an outputQuality parameter that can be set to "high" for improved results.

The OpenAI Python client library has not yet been updated to include gpt-image-2 in its validated model list, but the model ID can be used directly with the API.

Comparative testing results

Developer Simon Willison tested the model using a "Where's Waldo" style prompt: creating a busy scene where viewers must find a raccoon holding a ham radio.

gpt-image-1 (baseline): Failed to include an identifiable raccoon in the generated scene. Claude Opus 4.7 could not locate a raccoon when analyzing the output.

Google Nano Banana 2: Successfully generated a clearly visible raccoon at an "Amateur Radio Club" booth with callsign W6HAM. Claude described this as the easiest result, with the raccoon "not really hiding."

Google Nano Banana Pro: Produced the worst result across all models tested, with an oversized raccoon in the center surrounded by an "ugly white border."

gpt-image-2 (standard quality): Generated a complex scene but did not appear to include the requested raccoon.

gpt-image-2 (high quality, 3840x2160): Successfully generated a detailed Where's Waldo-style scene with the raccoon holding a ham radio visible in the bottom left corner. The output was a 17MB PNG file.

Technical implementation

Willison used a command-line wrapper around the OpenAI Python client:

OPENAI_API_KEY="$(llm keys get openai)" \
  uv run https://tools.simonwillison.net/python/openai_image.py \
  -m gpt-image-2 \
  --quality high --size 3840x2160 \
  "[prompt text]"

According to the updated OpenAI image generation cookbook, the model supports multiple output qualities and sizes, with 3840x2160 appearing to be the maximum resolution.

What this means

ChatGPT Images 2.0 demonstrates OpenAI's continued competition with Google's image generation models, particularly Nano Banana 2. The $30 per million token pricing makes high-resolution generation accessible for production use cases. The model's ability to handle complex compositions with both detailed illustrations and readable text represents measurable progress in image generation capability, though Altman's GPT-3 to GPT-5 comparison remains unverified by independent benchmarks.

Related Articles

product update

OpenAI launches Lockdown Mode to block prompt injection data exfiltration attacks

OpenAI has released Lockdown Mode, an optional security setting that protects against prompt injection attacks by limiting network requests and image fetching in ChatGPT. The feature is designed for users handling sensitive data and disables some ChatGPT capabilities including Deep Research and Agent Mode.

product update

OpenAI upgrades ChatGPT memory architecture with automatic 'dreaming' synthesis, now available to free users

OpenAI is rolling out a new memory architecture for ChatGPT that automatically synthesizes information across conversations without explicit user prompts. The company announced free tier users will access memory features for the first time, while Plus and Pro users receive expanded memory capacity.

product update

OpenAI expands ChatGPT memory to free users, doubles storage capacity for paid tiers

OpenAI is rolling out an upgraded memory system for ChatGPT that synthesizes context more efficiently across conversations. The company reduced compute requirements by approximately 5x, enabling it to offer the memory feature to free users for the first time while doubling storage capacity for Plus and Pro subscribers.

model release

Alibaba's Qwen Releases Qwen3.7 Plus: 1M Context Window at $0.40 Per Million Input Tokens

Alibaba's Qwen has released Qwen3.7 Plus, a multimodal model with a 1 million token context window. The model accepts text and image input with text output, priced at $0.40 per million input tokens and $1.60 per million output tokens through OpenRouter's API.

Comments

Loading...