pricing

47 articles tagged with pricing

May 7, 2026
model release

Google releases Gemini 3.1 Flash Lite with 1M context at $0.25 per million input tokens

Google has released Gemini 3.1 Flash Lite, a high-efficiency multimodal model with a 1,048,576 token context window priced at $0.25 per million input tokens and $1.50 per million output tokens. The model supports text, image, video, audio, and PDF inputs with four thinking levels for cost-performance optimization.

May 5, 2026
changelog+1

Google preparing 'AI Ultra Lite' tier between $20 Pro and $250 Ultra plans, adding usage dashboard

Google is developing an intermediate subscription tier called 'AI Ultra Lite' to slot between its $20 Pro and $250 Ultra plans, according to code discovered in the Gemini macOS app. The company is also preparing a usage dashboard showing token budgets across five-hour and weekly limits.

April 28, 2026
product updateMicrosoft

GitHub Copilot switches to metered token billing June 1 as flat-rate model proves unsustainable

Microsoft's GitHub is ending flat-rate billing for Copilot on June 1, 2026, switching to usage-based metered tokens after acknowledging the request-based model is no longer sustainable. Copilot Pro subscribers ($10/month) will receive 1,000 GitHub AI Credits monthly, with each credit worth $0.01.

April 27, 2026
product updateGitHub

GitHub Copilot switches to token-based pricing June 1, ending unlimited usage model

GitHub Copilot transitions to token-based pricing effective June 1, 2026, replacing its premium request unit system. Base subscription prices remain unchanged at $10/month for Pro and $39/month for Pro+, but users now receive equivalent monthly AI Credits that deplete with usage—and service stops when credits run out.

product updateGitHub

GitHub Copilot switches to usage-based billing with AI Credits starting June 1, 2025

GitHub will replace Copilot's flat subscription model with usage-based billing starting June 1, 2025. Users will consume GitHub AI Credits based on their actual Copilot usage, marking a significant shift in the company's pricing strategy.

changelog

Alibaba releases Qwen3.5 Plus with 1M token context window at $0.40 per million input tokens

Alibaba released an updated version of Qwen3.5 Plus on April 27, 2026, with a 1 million token context window. The multimodal model accepts text, image, and video input and is priced at $0.40 per million input tokens and $2.40 per million output tokens, with tiered pricing above 256K tokens.

model release

Alibaba Qwen Releases Qwen3.6 Flash with 1M Context Window at $0.25 per 1M Input Tokens

Alibaba's Qwen team has released Qwen3.6 Flash, a multimodal language model supporting text, image, and video input with a 1 million token context window. The model is priced at $0.25 per 1M input tokens and $1.50 per 1M output tokens, with tiered pricing above 256K tokens.

April 24, 2026
model releaseOpenAI

OpenAI Releases GPT-5.5 Pro with 1M+ Token Context Window, $30 Per Million Input Tokens

OpenAI has released GPT-5.5 Pro, a high-capability model with a 1,050,000 token context window (922K input, 128K output) priced at $30 per million input tokens and $180 per million output tokens. The model supports text and image inputs and is optimized for deep reasoning, agentic coding, and multi-step workflows.

model releaseDeepSeek

DeepSeek V4 Pro launches with 1.6 trillion parameters, 1M token context at $0.145 per million input tokens

Chinese AI lab DeepSeek has released preview versions of DeepSeek V4 Flash and V4 Pro, mixture-of-experts models with 1 million token context windows. The V4 Pro has 1.6 trillion total parameters (49 billion active), making it the largest open-weight model available, while both models significantly undercut frontier model pricing.

model releaseDeepSeek

DeepSeek V4 Pro launches with 1.6T parameters at $1.74/M tokens, undercutting Claude Sonnet 4.6 by 42%

DeepSeek released two preview models: V4 Pro (1.6T total parameters, 49B active) and V4 Flash (284B total, 13B active), both with 1 million token context windows. V4 Pro is priced at $1.74/M input tokens and $3.48/M output—42% cheaper than Claude Sonnet 4.6—while V4 Flash at $0.14/$0.28 per million tokens undercuts all small frontier models.

April 22, 2026
model releaseXiaomi

Xiaomi Launches MiMo-V2.5-Pro with 1M Context Window for Complex Agentic Tasks

Xiaomi released MiMo-V2.5-Pro on April 22, 2026, its flagship model featuring a 1,048,576 token context window and pricing at $1 per million input tokens and $3 per million output tokens. According to Xiaomi, the model ranks highly on ClawEval, GDPVal, and SWE-bench Pro benchmarks, designed for autonomous completion of professional tasks requiring thousands of tool calls.

product updateAnthropic+1

Anthropic silently tests 5x price increase for Claude Code, reverses within hours after backlash

Anthropic updated its pricing page on April 22, 2026, removing Claude Code from the $20/month Pro plan and restricting it to $100-200/month Max plans. The company reversed the change within hours after significant backlash across Reddit, Hacker News, and Twitter.

changelogAnthropic

Anthropic removes Claude Code from Pro plan pricing page, says only 2% of new signups affected by test

Anthropic removed Claude Code from its Pro subscription plan pricing page on Tuesday, though the company claims the change is only a test affecting approximately 2% of new prosumer signups. Existing Pro and Max subscribers remain unaffected, according to head of growth Amol Avasare.

April 21, 2026
model releaseOpenAI

OpenAI releases ChatGPT Images 2.0 with 3840x2160 resolution at $30 per 1M output tokens

OpenAI released ChatGPT Images 2.0, pricing output tokens at $30 per million with maximum resolution of 3840x2160 pixels. CEO Sam Altman claims the improvement from gpt-image-1 to gpt-image-2 equals the jump from GPT-3 to GPT-5.

April 20, 2026
product updateMicrosoft

GitHub halts Copilot Pro signups as agentic AI workloads overwhelm infrastructure

GitHub has paused new subscriptions for Copilot Pro, Pro+, and Student plans due to compute capacity constraints. The company cites agentic workflows as consuming significantly more resources than its original pricing structure anticipated, forcing tighter usage limits and a shift away from flat-rate billing.

product updateGitHub

GitHub Copilot Individual Plans Change Structure, Details Not Yet Disclosed

GitHub has announced changes to its Copilot Individual subscription plans, citing the need for reliability and predictability for existing customers. The company has not yet disclosed specific details about pricing adjustments, feature modifications, or implementation timelines.

April 16, 2026
changelogAnthropic

Anthropic removes bundled tokens from enterprise seats, shifts to metered billing

Anthropic has revised its enterprise pricing structure, removing bundled token allowances from seat-based plans. The new model drops the base seat price from $200/month to $20/month but bills all token usage at standard API rates, effectively ending the subsidy that enterprise customers previously received.

model releaseAnthropic

Anthropic releases Claude Opus 4.7 with 1M context window for long-running agent tasks

Anthropic has released Claude Opus 4.7, the latest version of its flagship Opus family designed for long-running, asynchronous agent tasks. The model features a 1 million token context window and costs $5 per million input tokens and $25 per million output tokens.

April 15, 2026
product updateMicrosoft

GitHub Copilot imposes multi-day rate limits after fixing token counting bug that undercharged customers

GitHub Copilot customers are experiencing rate limits lasting up to 181 hours after the company fixed a token counting bug that had been undercharging for usage of newer models including Claude Opus 4.6 and GPT-5.4. GitHub has suspended Pro free trials and removed Anthropic's Opus 4.6 Fast model from Pro+ tiers.

model releaseAnthropic

Anthropic to launch Claude Opus 4.7 and AI design tool this week, valued at up to $800B

Anthropic is preparing to release Claude Opus 4.7 and an AI-powered design tool for websites and presentations as early as this week, according to The Information. The announcement comes as venture capitalists offer valuations up to $800 billion, more than double the $380 billion valuation from February 2026.

April 13, 2026
product update+1

Google adds Veo 3.1 Lite to Ultra subscriptions at zero credit cost starting May 10

Google is adding Veo 3.1 Lite to Ultra subscriptions at zero credit cost starting May 10, 2026. The model costs less than half of Veo 3.1 Fast but generates videos at the same speed according to Google, though quality tradeoffs remain unclear.

April 12, 2026
changelogOpenAI

OpenAI clarifies confusing ChatGPT Pro usage limits across $100 and $200 tiers

OpenAI's pricing page created confusion about usage limits across its new ChatGPT Pro tiers. An OpenAI employee clarified that the $100 plan provides at least 10x Plus usage while the $200 plan offers at least 20x, though both currently benefit from a temporary 2x boost expiring May 31.

April 10, 2026
product updateOpenAI+1

OpenAI launches $100/month ChatGPT Pro plan to compete with Claude's $100 tier

OpenAI has introduced a $100 per month ChatGPT Pro plan positioned between its $20 Plus and $200 Pro tiers. The new tier offers 5x more Codex capacity than Plus and access to the same advanced tools as the $200 plan, with a limited-time promotional doubling of Codex allocation.

April 9, 2026
changelogOpenAI

OpenAI launches $100/month ChatGPT Pro tier with 5x more Codex usage

OpenAI has introduced a new $100 per month ChatGPT Pro subscription tier offering 5x more usage of its Codex coding tool than the $20 Plus plan. The tier directly competes with Anthropic's Claude Max subscription at the same price point and sits between the $20 Plus and $200 Pro tiers.

changelogOpenAI

OpenAI launches $100/month ChatGPT Pro plan to compete with Anthropic's Claude pricing

OpenAI announced a new $100/month Pro plan for ChatGPT, filling the pricing gap between its $20/month Plus tier and $200/month option. The plan offers 5x more Codex capacity than Plus and directly targets developers, positioning it as a competitive alternative to Anthropic's $100/month Claude offering.

product updateOpenAI

OpenAI launches $100/month ChatGPT Pro tier to compete with Anthropic's Claude Code

OpenAI announced a $100 per month ChatGPT Pro subscription tier offering five times more Codex usage than its existing $20 Plus plan. The move directly challenges Anthropic's Claude Code, which generated over $2.5 billion in run-rate revenue in February with 100%+ growth since the start of 2026. Codex now has three million weekly active users.

product updateOpenAI

OpenAI cuts ChatGPT Pro price to $100/month, targets Codex developers with 5x usage limits

OpenAI has introduced a new $100/month ChatGPT Pro tier positioned for Codex users, cutting the previous $200/month Pro price in half. The new plan offers 5x higher usage limits than ChatGPT Plus (10x Codex-specific usage through May 31) while maintaining the same core capabilities as the $200 tier, which remains available for power users.

product updateOpenAI

OpenAI cuts Pro subscription to $100/month, halving price for heavy Codex users

OpenAI is restructuring its subscription tiers, introducing a new $100-per-month Pro plan for heavy Codex users—half the price of its previous $200 Pro tier. The new plan includes 5x more Codex usage than the $20 Plus plan and targets users requiring extended programming sessions. This pricing move undercuts comparable offerings from Anthropic and Google.

April 8, 2026
product update

Miro adds AI agents directly to collaborative whiteboards with context awareness

Miro has launched AI Workflows, a system of AI agents that operate directly on collaborative canvases using full visual and spatial context. The feature includes Sidekicks (conversational agents) and Flows (multi-step automated workflows), accessible through the Business + AI Workflows tier at $20 per member per month with 50 AI credits included.

April 6, 2026
product updateAnthropic

Anthropic blocks Claude subscriptions for OpenClaw, citing capacity constraints

Anthropic has disallowed subscription-based pricing for users accessing Claude through open-source agentic tools like OpenClaw, effective April 4, 2026. The restriction comes as the company faces elevated service errors and struggles to balance capacity with demand. Third-party tool usage will now draw from pay-per-token rates instead of subscription limits.

April 4, 2026
changelogAnthropic

Anthropic charges Claude Code subscribers extra for OpenClaw usage starting today

Anthropic is enforcing separate billing for Claude Code subscribers using third-party tools like OpenClaw, starting April 4, 2026. Subscribers can no longer use their subscription limits for these integrations and must pay through a new pay-as-you-go model. The decision follows OpenClaw creator Peter Steinberger's move to OpenAI.

changelogAnthropic

Anthropic ends free Claude access through third-party tools like OpenClaw

Anthropic has ended free access to Claude through third-party applications like OpenClaw, effective April 4, 2025 at 3PM ET. Users of third-party tools integrating Claude must now purchase usage bundles or use an API key. The change stems from capacity constraints and usage patterns incompatible with subscription tiers.

April 3, 2026
product updateOpenAI

OpenAI shifts Codex to usage-based pricing, offers $500 credits to enterprise customers

OpenAI is replacing per-seat licensing with usage-based pricing for Codex in ChatGPT Business and Enterprise plans, eliminating upfront license costs. Eligible Business customers can claim up to $500 in promotional credit per workspace. The shift targets enterprises where coding tools typically expand from individual developers to full teams, positioning OpenAI against GitHub Copilot and Cursor.

April 2, 2026
product update

Gemini 3.1 Pro launches in Augment Code at 2.6x cheaper than Claude Opus 4.6

Augment Code now offers Gemini 3.1 Pro alongside Claude Opus 4.6 and GPT-5.4. In head-to-head testing on structural refactoring tasks, Gemini matched or outperformed Opus while consuming 268 credits per task—46% cheaper than Opus's 488 credits—making it 2.6x more cost-effective per message in real-world usage.

March 31, 2026
model release

Google launches Veo 3.1 Lite, cutting video generation costs by half

Google announced Veo 3.1 Lite, a cost-reduced video generation model priced at less than 50% of Veo 3.1 Fast's cost. The model supports text-to-video and image-to-video generation at 720p or 1080p resolution with customizable durations of 4s, 6s, or 8s, rolling out today on the Gemini API and Google AI Studio.

March 24, 2026

Google DeepMind's Gemini 3.1 Flash-Lite generates websites in real time, 2.5x faster than predecessor

Google DeepMind released Gemini 3.1 Flash-Lite, a model that generates functional websites in real time through a new pseudo-browser demo. The model achieves first response token 2.5 times faster than Gemini 2.5 Flash and outputs over 360 tokens per second, though output pricing has tripled from $0.40 to $1.50 per million tokens.

March 23, 2026
model releaseXiaomi

Xiaomi releases MiMo-V2-Pro with 1M context window and 1T+ parameters

Xiaomi released MiMo-V2-Pro on March 18, 2026, a flagship foundation model with over 1 trillion total parameters and a 1,048,576 token context window. The model is priced at $1 per million input tokens and $3 per million output tokens, positioning it as an agent-focused system comparable to top-tier models.

model releaseXiaomi

Xiaomi launches MiMo-V2-Pro with 1T parameters, matches Claude Opus on coding at 80% lower cost

Xiaomi shipped three AI models simultaneously designed to form a complete agent platform. MiMo-V2-Pro, a 1-trillion-parameter Mixture-of-Experts model with 42 billion active parameters per request, scores 78% on SWE-bench Verified and 81 points on ClawEval—nearly matching Claude Opus 4.6 while costing $1 per million input tokens versus $5 for Opus.

March 19, 2026
model release

Cursor releases Composer 2 at $0.50/$2.50 per 1M tokens, undercutting Claude and GPT-4 on pricing

Cursor released Composer 2, a code-specialized model priced at $0.50 per million input tokens and $2.50 per million output tokens—roughly 90% cheaper than Claude Opus 4.6 ($5.00/$25.00) and 60% cheaper than GPT-5.4 ($2.50/$15.00). The model scores 61.3 on Cursor's internal CursorBench, competitive with Claude Opus 4.6 (58.2) but below GPT-5.4 Thinking (63.9).

product updateAmazon Web Services

Amazon launches Alexa+ in UK with £19.99/month pricing for non-Prime users

Amazon is rolling out Alexa+, its AI-powered conversational assistant, to the UK as the first market outside North America. The company is offering free early access to new Echo buyers and plans to reach hundreds of thousands of customers in the coming weeks. After the beta period ends, Prime subscribers will get free access while non-Prime users will pay £19.99 per month.

March 18, 2026
model releaseOpenAI

OpenAI releases GPT-4o mini with 128K context at $0.15/$0.60 per 1M tokens

OpenAI released GPT-4o mini on July 18, 2024, a compact multimodal model with 128,000 token context window priced at $0.15 per million input tokens and $0.60 per million output tokens. The model achieves 82% on MMLU and claims to rank higher than GPT-4 on chat preference leaderboards while costing 60% less than GPT-3.5 Turbo.

March 17, 2026
model releaseOpenAI

OpenAI releases GPT-5.4 mini and nano with 3-4x price increases but major performance gains

OpenAI has released GPT-5.4 mini and GPT-5.4 nano, compact models optimized for coding and subagent tasks. The new models deliver significant performance improvements—GPT-5.4 mini reaches 54.4% on SWE-Bench Pro versus 45.7% for GPT-5 mini—but cost 3-4x more per input token than their predecessors.

analysis

Mistral's Leanstral code verification agent outperforms Claude Sonnet at 15% of the cost

Mistral has released Leanstral, a 120B-parameter code verification agent built with the Lean programming language, claiming it outperforms larger open-source models and offers significant cost advantages over Anthropic's Claude suite. The model achieves a pass@2 score of 26.3—beating Claude Sonnet by 2.6 points—while costing $36 to run compared to Sonnet's $549.

product updateAnthropic

Anthropic launches Code Review service at $15-25 per PR, takes 20 minutes per review

Anthropic has launched Code Review, a new service that deploys multiple specialized agents to scan pull requests for bugs, security vulnerabilities, and regressions. According to Anthropic, reviews average $15-25 per PR and take approximately 20 minutes to complete, scaling with code complexity.

March 15, 2026
product updateMicrosoft

GitHub removes premium AI models from free Copilot Student plan

GitHub has removed premium AI models including GPT-5.4, Claude Opus, and Claude Sonnet from its free Copilot Student plan effective March 12, 2026. The change leaves students with access only to lower-cost models: Claude 4.5 Haiku, Gemini 3.1 Pro, and GPT-5.3 Codex. GitHub's decision triggered 2,874 downvotes versus 21 upvotes on the announcement, with students arguing they need premium models to learn industry-standard tools.

product update

Google Antigravity quotas drop dramatically as pricing shifts to paid credits model

Google has restructured pricing for its Antigravity agentic AI coding tool, shifting from quota-based access to a hybrid credits model. Developers on the $20/month AI Pro plan report weekly rate limits have tightened dramatically—one user tracked a drop from 300M input tokens weekly to under 9M. The company now directs heavy users toward the $249.99/month Ultra plan.

February 21, 2026
product update

Google AI Pro and Ultra tiers bring tiered Gemini access; Ultra tops out at advanced reasoning

Google has restructured its AI subscription offerings into two main tiers: Google AI Pro (formerly Gemini Advanced) and the higher-tier Google AI Ultra. The tiers provide different access levels to Gemini models and capabilities, with Ultra positioned as the premium option for advanced reasoning tasks.