pricing

12 articles tagged with pricing

March 24, 2026

Google DeepMind's Gemini 3.1 Flash-Lite generates websites in real time, 2.5x faster than predecessor

Google DeepMind released Gemini 3.1 Flash-Lite, a model that generates functional websites in real time through a new pseudo-browser demo. The model achieves first response token 2.5 times faster than Gemini 2.5 Flash and outputs over 360 tokens per second, though output pricing has tripled from $0.40 to $1.50 per million tokens.

March 23, 2026
model releaseXiaomi

Xiaomi releases MiMo-V2-Pro with 1M context window and 1T+ parameters

Xiaomi released MiMo-V2-Pro on March 18, 2026, a flagship foundation model with over 1 trillion total parameters and a 1,048,576 token context window. The model is priced at $1 per million input tokens and $3 per million output tokens, positioning it as an agent-focused system comparable to top-tier models.

model releaseXiaomi

Xiaomi launches MiMo-V2-Pro with 1T parameters, matches Claude Opus on coding at 80% lower cost

Xiaomi shipped three AI models simultaneously designed to form a complete agent platform. MiMo-V2-Pro, a 1-trillion-parameter Mixture-of-Experts model with 42 billion active parameters per request, scores 78% on SWE-bench Verified and 81 points on ClawEval—nearly matching Claude Opus 4.6 while costing $1 per million input tokens versus $5 for Opus.

March 19, 2026
model release

Cursor releases Composer 2 at $0.50/$2.50 per 1M tokens, undercutting Claude and GPT-4 on pricing

Cursor released Composer 2, a code-specialized model priced at $0.50 per million input tokens and $2.50 per million output tokens—roughly 90% cheaper than Claude Opus 4.6 ($5.00/$25.00) and 60% cheaper than GPT-5.4 ($2.50/$15.00). The model scores 61.3 on Cursor's internal CursorBench, competitive with Claude Opus 4.6 (58.2) but below GPT-5.4 Thinking (63.9).

product updateAmazon Web Services

Amazon launches Alexa+ in UK with £19.99/month pricing for non-Prime users

Amazon is rolling out Alexa+, its AI-powered conversational assistant, to the UK as the first market outside North America. The company is offering free early access to new Echo buyers and plans to reach hundreds of thousands of customers in the coming weeks. After the beta period ends, Prime subscribers will get free access while non-Prime users will pay £19.99 per month.

March 18, 2026
model releaseOpenAI

OpenAI releases GPT-4o mini with 128K context at $0.15/$0.60 per 1M tokens

OpenAI released GPT-4o mini on July 18, 2024, a compact multimodal model with 128,000 token context window priced at $0.15 per million input tokens and $0.60 per million output tokens. The model achieves 82% on MMLU and claims to rank higher than GPT-4 on chat preference leaderboards while costing 60% less than GPT-3.5 Turbo.

March 17, 2026
model releaseOpenAI

OpenAI releases GPT-5.4 mini and nano with 3-4x price increases but major performance gains

OpenAI has released GPT-5.4 mini and GPT-5.4 nano, compact models optimized for coding and subagent tasks. The new models deliver significant performance improvements—GPT-5.4 mini reaches 54.4% on SWE-Bench Pro versus 45.7% for GPT-5 mini—but cost 3-4x more per input token than their predecessors.

analysis

Mistral's Leanstral code verification agent outperforms Claude Sonnet at 15% of the cost

Mistral has released Leanstral, a 120B-parameter code verification agent built with the Lean programming language, claiming it outperforms larger open-source models and offers significant cost advantages over Anthropic's Claude suite. The model achieves a pass@2 score of 26.3—beating Claude Sonnet by 2.6 points—while costing $36 to run compared to Sonnet's $549.

product updateAnthropic

Anthropic launches Code Review service at $15-25 per PR, takes 20 minutes per review

Anthropic has launched Code Review, a new service that deploys multiple specialized agents to scan pull requests for bugs, security vulnerabilities, and regressions. According to Anthropic, reviews average $15-25 per PR and take approximately 20 minutes to complete, scaling with code complexity.

March 15, 2026
product updateMicrosoft

GitHub removes premium AI models from free Copilot Student plan

GitHub has removed premium AI models including GPT-5.4, Claude Opus, and Claude Sonnet from its free Copilot Student plan effective March 12, 2026. The change leaves students with access only to lower-cost models: Claude 4.5 Haiku, Gemini 3.1 Pro, and GPT-5.3 Codex. GitHub's decision triggered 2,874 downvotes versus 21 upvotes on the announcement, with students arguing they need premium models to learn industry-standard tools.

product update

Google Antigravity quotas drop dramatically as pricing shifts to paid credits model

Google has restructured pricing for its Antigravity agentic AI coding tool, shifting from quota-based access to a hybrid credits model. Developers on the $20/month AI Pro plan report weekly rate limits have tightened dramatically—one user tracked a drop from 300M input tokens weekly to under 9M. The company now directs heavy users toward the $249.99/month Ultra plan.

February 21, 2026
product update

Google AI Pro and Ultra tiers bring tiered Gemini access; Ultra tops out at advanced reasoning

Google has restructured its AI subscription offerings into two main tiers: Google AI Pro (formerly Gemini Advanced) and the higher-tier Google AI Ultra. The tiers provide different access levels to Gemini models and capabilities, with Ultra positioned as the premium option for advanced reasoning tasks.