pricing
47 articles tagged with pricing
Google releases Gemini 3.1 Flash Lite with 1M context at $0.25 per million input tokens
Google has released Gemini 3.1 Flash Lite, a high-efficiency multimodal model with a 1,048,576 token context window priced at $0.25 per million input tokens and $1.50 per million output tokens. The model supports text, image, video, audio, and PDF inputs with four thinking levels for cost-performance optimization.
Google preparing 'AI Ultra Lite' tier between $20 Pro and $250 Ultra plans, adding usage dashboard
Google is developing an intermediate subscription tier called 'AI Ultra Lite' to slot between its $20 Pro and $250 Ultra plans, according to code discovered in the Gemini macOS app. The company is also preparing a usage dashboard showing token budgets across five-hour and weekly limits.
GitHub Copilot switches to metered token billing June 1 as flat-rate model proves unsustainable
Microsoft's GitHub is ending flat-rate billing for Copilot on June 1, 2026, switching to usage-based metered tokens after acknowledging the request-based model is no longer sustainable. Copilot Pro subscribers ($10/month) will receive 1,000 GitHub AI Credits monthly, with each credit worth $0.01.
GitHub Copilot switches to token-based pricing June 1, ending unlimited usage model
GitHub Copilot transitions to token-based pricing effective June 1, 2026, replacing its premium request unit system. Base subscription prices remain unchanged at $10/month for Pro and $39/month for Pro+, but users now receive equivalent monthly AI Credits that deplete with usage—and service stops when credits run out.
GitHub Copilot switches to usage-based billing with AI Credits starting June 1, 2025
GitHub will replace Copilot's flat subscription model with usage-based billing starting June 1, 2025. Users will consume GitHub AI Credits based on their actual Copilot usage, marking a significant shift in the company's pricing strategy.
Alibaba releases Qwen3.5 Plus with 1M token context window at $0.40 per million input tokens
Alibaba released an updated version of Qwen3.5 Plus on April 27, 2026, with a 1 million token context window. The multimodal model accepts text, image, and video input and is priced at $0.40 per million input tokens and $2.40 per million output tokens, with tiered pricing above 256K tokens.
Alibaba Qwen Releases Qwen3.6 Flash with 1M Context Window at $0.25 per 1M Input Tokens
Alibaba's Qwen team has released Qwen3.6 Flash, a multimodal language model supporting text, image, and video input with a 1 million token context window. The model is priced at $0.25 per 1M input tokens and $1.50 per 1M output tokens, with tiered pricing above 256K tokens.
OpenAI Releases GPT-5.5 Pro with 1M+ Token Context Window, $30 Per Million Input Tokens
OpenAI has released GPT-5.5 Pro, a high-capability model with a 1,050,000 token context window (922K input, 128K output) priced at $30 per million input tokens and $180 per million output tokens. The model supports text and image inputs and is optimized for deep reasoning, agentic coding, and multi-step workflows.
DeepSeek V4 Pro launches with 1.6 trillion parameters, 1M token context at $0.145 per million input tokens
Chinese AI lab DeepSeek has released preview versions of DeepSeek V4 Flash and V4 Pro, mixture-of-experts models with 1 million token context windows. The V4 Pro has 1.6 trillion total parameters (49 billion active), making it the largest open-weight model available, while both models significantly undercut frontier model pricing.
DeepSeek V4 Pro launches with 1.6T parameters at $1.74/M tokens, undercutting Claude Sonnet 4.6 by 42%
DeepSeek released two preview models: V4 Pro (1.6T total parameters, 49B active) and V4 Flash (284B total, 13B active), both with 1 million token context windows. V4 Pro is priced at $1.74/M input tokens and $3.48/M output—42% cheaper than Claude Sonnet 4.6—while V4 Flash at $0.14/$0.28 per million tokens undercuts all small frontier models.
Xiaomi Launches MiMo-V2.5-Pro with 1M Context Window for Complex Agentic Tasks
Xiaomi released MiMo-V2.5-Pro on April 22, 2026, its flagship model featuring a 1,048,576 token context window and pricing at $1 per million input tokens and $3 per million output tokens. According to Xiaomi, the model ranks highly on ClawEval, GDPVal, and SWE-bench Pro benchmarks, designed for autonomous completion of professional tasks requiring thousands of tool calls.
Anthropic silently tests 5x price increase for Claude Code, reverses within hours after backlash
Anthropic updated its pricing page on April 22, 2026, removing Claude Code from the $20/month Pro plan and restricting it to $100-200/month Max plans. The company reversed the change within hours after significant backlash across Reddit, Hacker News, and Twitter.
Anthropic removes Claude Code from Pro plan pricing page, says only 2% of new signups affected by test
Anthropic removed Claude Code from its Pro subscription plan pricing page on Tuesday, though the company claims the change is only a test affecting approximately 2% of new prosumer signups. Existing Pro and Max subscribers remain unaffected, according to head of growth Amol Avasare.
OpenAI releases ChatGPT Images 2.0 with 3840x2160 resolution at $30 per 1M output tokens
OpenAI released ChatGPT Images 2.0, pricing output tokens at $30 per million with maximum resolution of 3840x2160 pixels. CEO Sam Altman claims the improvement from gpt-image-1 to gpt-image-2 equals the jump from GPT-3 to GPT-5.
GitHub halts Copilot Pro signups as agentic AI workloads overwhelm infrastructure
GitHub has paused new subscriptions for Copilot Pro, Pro+, and Student plans due to compute capacity constraints. The company cites agentic workflows as consuming significantly more resources than its original pricing structure anticipated, forcing tighter usage limits and a shift away from flat-rate billing.
GitHub Copilot Individual Plans Change Structure, Details Not Yet Disclosed
GitHub has announced changes to its Copilot Individual subscription plans, citing the need for reliability and predictability for existing customers. The company has not yet disclosed specific details about pricing adjustments, feature modifications, or implementation timelines.
Anthropic removes bundled tokens from enterprise seats, shifts to metered billing
Anthropic has revised its enterprise pricing structure, removing bundled token allowances from seat-based plans. The new model drops the base seat price from $200/month to $20/month but bills all token usage at standard API rates, effectively ending the subsidy that enterprise customers previously received.
Anthropic releases Claude Opus 4.7 with 1M context window for long-running agent tasks
Anthropic has released Claude Opus 4.7, the latest version of its flagship Opus family designed for long-running, asynchronous agent tasks. The model features a 1 million token context window and costs $5 per million input tokens and $25 per million output tokens.
GitHub Copilot imposes multi-day rate limits after fixing token counting bug that undercharged customers
GitHub Copilot customers are experiencing rate limits lasting up to 181 hours after the company fixed a token counting bug that had been undercharging for usage of newer models including Claude Opus 4.6 and GPT-5.4. GitHub has suspended Pro free trials and removed Anthropic's Opus 4.6 Fast model from Pro+ tiers.
Anthropic to launch Claude Opus 4.7 and AI design tool this week, valued at up to $800B
Anthropic is preparing to release Claude Opus 4.7 and an AI-powered design tool for websites and presentations as early as this week, according to The Information. The announcement comes as venture capitalists offer valuations up to $800 billion, more than double the $380 billion valuation from February 2026.
Google adds Veo 3.1 Lite to Ultra subscriptions at zero credit cost starting May 10
Google is adding Veo 3.1 Lite to Ultra subscriptions at zero credit cost starting May 10, 2026. The model costs less than half of Veo 3.1 Fast but generates videos at the same speed according to Google, though quality tradeoffs remain unclear.
OpenAI clarifies confusing ChatGPT Pro usage limits across $100 and $200 tiers
OpenAI's pricing page created confusion about usage limits across its new ChatGPT Pro tiers. An OpenAI employee clarified that the $100 plan provides at least 10x Plus usage while the $200 plan offers at least 20x, though both currently benefit from a temporary 2x boost expiring May 31.
OpenAI launches $100/month ChatGPT Pro plan to compete with Claude's $100 tier
OpenAI has introduced a $100 per month ChatGPT Pro plan positioned between its $20 Plus and $200 Pro tiers. The new tier offers 5x more Codex capacity than Plus and access to the same advanced tools as the $200 plan, with a limited-time promotional doubling of Codex allocation.
OpenAI launches $100/month ChatGPT Pro tier with 5x more Codex usage
OpenAI has introduced a new $100 per month ChatGPT Pro subscription tier offering 5x more usage of its Codex coding tool than the $20 Plus plan. The tier directly competes with Anthropic's Claude Max subscription at the same price point and sits between the $20 Plus and $200 Pro tiers.
OpenAI launches $100/month ChatGPT Pro plan to compete with Anthropic's Claude pricing
OpenAI announced a new $100/month Pro plan for ChatGPT, filling the pricing gap between its $20/month Plus tier and $200/month option. The plan offers 5x more Codex capacity than Plus and directly targets developers, positioning it as a competitive alternative to Anthropic's $100/month Claude offering.
OpenAI launches $100/month ChatGPT Pro tier to compete with Anthropic's Claude Code
OpenAI announced a $100 per month ChatGPT Pro subscription tier offering five times more Codex usage than its existing $20 Plus plan. The move directly challenges Anthropic's Claude Code, which generated over $2.5 billion in run-rate revenue in February with 100%+ growth since the start of 2026. Codex now has three million weekly active users.
OpenAI cuts ChatGPT Pro price to $100/month, targets Codex developers with 5x usage limits
OpenAI has introduced a new $100/month ChatGPT Pro tier positioned for Codex users, cutting the previous $200/month Pro price in half. The new plan offers 5x higher usage limits than ChatGPT Plus (10x Codex-specific usage through May 31) while maintaining the same core capabilities as the $200 tier, which remains available for power users.
OpenAI cuts Pro subscription to $100/month, halving price for heavy Codex users
OpenAI is restructuring its subscription tiers, introducing a new $100-per-month Pro plan for heavy Codex users—half the price of its previous $200 Pro tier. The new plan includes 5x more Codex usage than the $20 Plus plan and targets users requiring extended programming sessions. This pricing move undercuts comparable offerings from Anthropic and Google.
Miro adds AI agents directly to collaborative whiteboards with context awareness
Miro has launched AI Workflows, a system of AI agents that operate directly on collaborative canvases using full visual and spatial context. The feature includes Sidekicks (conversational agents) and Flows (multi-step automated workflows), accessible through the Business + AI Workflows tier at $20 per member per month with 50 AI credits included.
Anthropic blocks Claude subscriptions for OpenClaw, citing capacity constraints
Anthropic has disallowed subscription-based pricing for users accessing Claude through open-source agentic tools like OpenClaw, effective April 4, 2026. The restriction comes as the company faces elevated service errors and struggles to balance capacity with demand. Third-party tool usage will now draw from pay-per-token rates instead of subscription limits.
Anthropic charges Claude Code subscribers extra for OpenClaw usage starting today
Anthropic is enforcing separate billing for Claude Code subscribers using third-party tools like OpenClaw, starting April 4, 2026. Subscribers can no longer use their subscription limits for these integrations and must pay through a new pay-as-you-go model. The decision follows OpenClaw creator Peter Steinberger's move to OpenAI.
Anthropic ends free Claude access through third-party tools like OpenClaw
Anthropic has ended free access to Claude through third-party applications like OpenClaw, effective April 4, 2025 at 3PM ET. Users of third-party tools integrating Claude must now purchase usage bundles or use an API key. The change stems from capacity constraints and usage patterns incompatible with subscription tiers.
OpenAI shifts Codex to usage-based pricing, offers $500 credits to enterprise customers
OpenAI is replacing per-seat licensing with usage-based pricing for Codex in ChatGPT Business and Enterprise plans, eliminating upfront license costs. Eligible Business customers can claim up to $500 in promotional credit per workspace. The shift targets enterprises where coding tools typically expand from individual developers to full teams, positioning OpenAI against GitHub Copilot and Cursor.
Gemini 3.1 Pro launches in Augment Code at 2.6x cheaper than Claude Opus 4.6
Augment Code now offers Gemini 3.1 Pro alongside Claude Opus 4.6 and GPT-5.4. In head-to-head testing on structural refactoring tasks, Gemini matched or outperformed Opus while consuming 268 credits per task—46% cheaper than Opus's 488 credits—making it 2.6x more cost-effective per message in real-world usage.
Google launches Veo 3.1 Lite, cutting video generation costs by half
Google announced Veo 3.1 Lite, a cost-reduced video generation model priced at less than 50% of Veo 3.1 Fast's cost. The model supports text-to-video and image-to-video generation at 720p or 1080p resolution with customizable durations of 4s, 6s, or 8s, rolling out today on the Gemini API and Google AI Studio.
Google DeepMind's Gemini 3.1 Flash-Lite generates websites in real time, 2.5x faster than predecessor
Google DeepMind released Gemini 3.1 Flash-Lite, a model that generates functional websites in real time through a new pseudo-browser demo. The model achieves first response token 2.5 times faster than Gemini 2.5 Flash and outputs over 360 tokens per second, though output pricing has tripled from $0.40 to $1.50 per million tokens.
Xiaomi releases MiMo-V2-Pro with 1M context window and 1T+ parameters
Xiaomi released MiMo-V2-Pro on March 18, 2026, a flagship foundation model with over 1 trillion total parameters and a 1,048,576 token context window. The model is priced at $1 per million input tokens and $3 per million output tokens, positioning it as an agent-focused system comparable to top-tier models.
Xiaomi launches MiMo-V2-Pro with 1T parameters, matches Claude Opus on coding at 80% lower cost
Xiaomi shipped three AI models simultaneously designed to form a complete agent platform. MiMo-V2-Pro, a 1-trillion-parameter Mixture-of-Experts model with 42 billion active parameters per request, scores 78% on SWE-bench Verified and 81 points on ClawEval—nearly matching Claude Opus 4.6 while costing $1 per million input tokens versus $5 for Opus.
Cursor releases Composer 2 at $0.50/$2.50 per 1M tokens, undercutting Claude and GPT-4 on pricing
Cursor released Composer 2, a code-specialized model priced at $0.50 per million input tokens and $2.50 per million output tokens—roughly 90% cheaper than Claude Opus 4.6 ($5.00/$25.00) and 60% cheaper than GPT-5.4 ($2.50/$15.00). The model scores 61.3 on Cursor's internal CursorBench, competitive with Claude Opus 4.6 (58.2) but below GPT-5.4 Thinking (63.9).
Amazon launches Alexa+ in UK with £19.99/month pricing for non-Prime users
Amazon is rolling out Alexa+, its AI-powered conversational assistant, to the UK as the first market outside North America. The company is offering free early access to new Echo buyers and plans to reach hundreds of thousands of customers in the coming weeks. After the beta period ends, Prime subscribers will get free access while non-Prime users will pay £19.99 per month.
OpenAI releases GPT-4o mini with 128K context at $0.15/$0.60 per 1M tokens
OpenAI released GPT-4o mini on July 18, 2024, a compact multimodal model with 128,000 token context window priced at $0.15 per million input tokens and $0.60 per million output tokens. The model achieves 82% on MMLU and claims to rank higher than GPT-4 on chat preference leaderboards while costing 60% less than GPT-3.5 Turbo.
OpenAI releases GPT-5.4 mini and nano with 3-4x price increases but major performance gains
OpenAI has released GPT-5.4 mini and GPT-5.4 nano, compact models optimized for coding and subagent tasks. The new models deliver significant performance improvements—GPT-5.4 mini reaches 54.4% on SWE-Bench Pro versus 45.7% for GPT-5 mini—but cost 3-4x more per input token than their predecessors.
Mistral's Leanstral code verification agent outperforms Claude Sonnet at 15% of the cost
Mistral has released Leanstral, a 120B-parameter code verification agent built with the Lean programming language, claiming it outperforms larger open-source models and offers significant cost advantages over Anthropic's Claude suite. The model achieves a pass@2 score of 26.3—beating Claude Sonnet by 2.6 points—while costing $36 to run compared to Sonnet's $549.
Anthropic launches Code Review service at $15-25 per PR, takes 20 minutes per review
Anthropic has launched Code Review, a new service that deploys multiple specialized agents to scan pull requests for bugs, security vulnerabilities, and regressions. According to Anthropic, reviews average $15-25 per PR and take approximately 20 minutes to complete, scaling with code complexity.
GitHub removes premium AI models from free Copilot Student plan
GitHub has removed premium AI models including GPT-5.4, Claude Opus, and Claude Sonnet from its free Copilot Student plan effective March 12, 2026. The change leaves students with access only to lower-cost models: Claude 4.5 Haiku, Gemini 3.1 Pro, and GPT-5.3 Codex. GitHub's decision triggered 2,874 downvotes versus 21 upvotes on the announcement, with students arguing they need premium models to learn industry-standard tools.
Google Antigravity quotas drop dramatically as pricing shifts to paid credits model
Google has restructured pricing for its Antigravity agentic AI coding tool, shifting from quota-based access to a hybrid credits model. Developers on the $20/month AI Pro plan report weekly rate limits have tightened dramatically—one user tracked a drop from 300M input tokens weekly to under 9M. The company now directs heavy users toward the $249.99/month Ultra plan.
Google AI Pro and Ultra tiers bring tiered Gemini access; Ultra tops out at advanced reasoning
Google has restructured its AI subscription offerings into two main tiers: Google AI Pro (formerly Gemini Advanced) and the higher-tier Google AI Ultra. The tiers provide different access levels to Gemini models and capabilities, with Ultra positioned as the premium option for advanced reasoning tasks.