model release

Z.ai Releases GLM-5.2 with 1M Token Context Window at $1.40/$4.40 per Million

TL;DR

Z.ai has released GLM-5.2, a model designed for long-horizon engineering tasks with a 1 million token context window. The model is priced at $1.40 per million input tokens and $4.40 per million output tokens, and was released on June 16, 2025.

2 min read
0

Z.ai Releases GLM-5.2 with 1M Token Context Window

Z.ai has released GLM-5.2, a model with a 1 million token context window designed for project-level engineering tasks. The model is priced at $1.40 per million input tokens and $4.40 per million output tokens.

Technical Specifications

  • Context window: 1 million tokens
  • Input pricing: $1.40 per 1M tokens
  • Output pricing: $4.40 per 1M tokens
  • Release date: June 16, 2025
  • Modalities: Text in/out

Claimed Capabilities

According to Z.ai, GLM-5.2 is positioned as their "flagship model for the era of long-horizon tasks." The company claims the model can:

  • Handle project-level engineering context
  • Execute long-running tasks with improved reliability
  • Follow engineering standards consistently
  • Complete full development workflows from requirements to multi-platform deployment

The model is currently hosted exclusively through OpenRouter, with all requests forwarded directly to Z.ai's infrastructure.

Pricing Context

At $1.40/$4.40 per million tokens, GLM-5.2's pricing positions it in the mid-range for large context window models. For comparison:

  • Anthropic's Claude 3.5 Sonnet (200K context): $3/$15 per 1M tokens
  • OpenAI's GPT-4o (128K context): $2.50/$10 per 1M tokens
  • Google's Gemini 1.5 Pro (2M context): $1.25/$5 per 1M tokens

OpenRouter notes that effective pricing can be 60-80% lower than list prices when prompt caching is applied for repeated context.

What This Means

GLM-5.2 enters a competitive market for long-context models, where context window size alone no longer differentiates offerings. The 1M token window matches several existing models, while models like Gemini 1.5 Pro already offer 2M tokens at comparable pricing. The real test will be whether Z.ai's claimed advantages in long-horizon task execution and engineering workflow completion translate to measurable performance improvements in production use cases. Without published benchmark scores or independent verification of the model's capabilities, its market position remains uncertain.

Related Articles

model release

GLM-5.2 Released with 1M Token Context and 753B Parameters Under MIT License

Zhipu AI has released GLM-5.2, a 753 billion parameter model featuring a 1 million token context window and MIT open-source license. The model scores 62.1% on SWE-bench Pro and 91.2% on GPQA-Diamond, with flexible reasoning effort levels for coding tasks.

model release

MiniMax Releases M3: 428B-Parameter Multimodal Model with 1M Context Window and 15× Decode Speedup

MiniMax has released M3, a multimodal model with approximately 428 billion parameters and 23 billion activated parameters. The model supports a 1 million token context window and uses MiniMax Sparse Attention to achieve 9× prefill and 15× decode speedups compared to its predecessor M2.

model release

Anthropic releases Fable 5, bringing capabilities of restricted Mythos model to public with $10/$50 per 1M token pricing

Anthropic has released Fable 5, making capabilities from its previously restricted Mythos model available to the public. The company claims Fable 5 beats GPT-5.5, Gemini 3.1 Pro, and its own Opus 4.8 in internal testing, with pricing set at $10 per million input tokens and $50 per million output tokens after a free trial period ending June 22.

model release

Anthropic releases Claude Fable 5, first public Mythos-class model at $10/$50 per million tokens

Anthropic has released Claude Fable 5, its first publicly available Mythos-class model, at $10 per million input tokens and $50 per million output tokens—less than half the price of Claude Mythos Preview. The model includes safeguards that redirect sensitive queries to Claude Opus 4.8 in less than 5% of sessions.

Comments

Loading...

Z.ai GLM-5.2: 1M Context Window Model Released | TPS