pricing
12 articles tagged with pricing
Google DeepMind's Gemini 3.1 Flash-Lite generates websites in real time, 2.5x faster than predecessor
Google DeepMind released Gemini 3.1 Flash-Lite, a model that generates functional websites in real time through a new pseudo-browser demo. The model achieves first response token 2.5 times faster than Gemini 2.5 Flash and outputs over 360 tokens per second, though output pricing has tripled from $0.40 to $1.50 per million tokens.
Xiaomi releases MiMo-V2-Pro with 1M context window and 1T+ parameters
Xiaomi released MiMo-V2-Pro on March 18, 2026, a flagship foundation model with over 1 trillion total parameters and a 1,048,576 token context window. The model is priced at $1 per million input tokens and $3 per million output tokens, positioning it as an agent-focused system comparable to top-tier models.
Xiaomi launches MiMo-V2-Pro with 1T parameters, matches Claude Opus on coding at 80% lower cost
Xiaomi shipped three AI models simultaneously designed to form a complete agent platform. MiMo-V2-Pro, a 1-trillion-parameter Mixture-of-Experts model with 42 billion active parameters per request, scores 78% on SWE-bench Verified and 81 points on ClawEval—nearly matching Claude Opus 4.6 while costing $1 per million input tokens versus $5 for Opus.
Cursor releases Composer 2 at $0.50/$2.50 per 1M tokens, undercutting Claude and GPT-4 on pricing
Cursor released Composer 2, a code-specialized model priced at $0.50 per million input tokens and $2.50 per million output tokens—roughly 90% cheaper than Claude Opus 4.6 ($5.00/$25.00) and 60% cheaper than GPT-5.4 ($2.50/$15.00). The model scores 61.3 on Cursor's internal CursorBench, competitive with Claude Opus 4.6 (58.2) but below GPT-5.4 Thinking (63.9).
Amazon launches Alexa+ in UK with £19.99/month pricing for non-Prime users
Amazon is rolling out Alexa+, its AI-powered conversational assistant, to the UK as the first market outside North America. The company is offering free early access to new Echo buyers and plans to reach hundreds of thousands of customers in the coming weeks. After the beta period ends, Prime subscribers will get free access while non-Prime users will pay £19.99 per month.
OpenAI releases GPT-4o mini with 128K context at $0.15/$0.60 per 1M tokens
OpenAI released GPT-4o mini on July 18, 2024, a compact multimodal model with 128,000 token context window priced at $0.15 per million input tokens and $0.60 per million output tokens. The model achieves 82% on MMLU and claims to rank higher than GPT-4 on chat preference leaderboards while costing 60% less than GPT-3.5 Turbo.
OpenAI releases GPT-5.4 mini and nano with 3-4x price increases but major performance gains
OpenAI has released GPT-5.4 mini and GPT-5.4 nano, compact models optimized for coding and subagent tasks. The new models deliver significant performance improvements—GPT-5.4 mini reaches 54.4% on SWE-Bench Pro versus 45.7% for GPT-5 mini—but cost 3-4x more per input token than their predecessors.
Mistral's Leanstral code verification agent outperforms Claude Sonnet at 15% of the cost
Mistral has released Leanstral, a 120B-parameter code verification agent built with the Lean programming language, claiming it outperforms larger open-source models and offers significant cost advantages over Anthropic's Claude suite. The model achieves a pass@2 score of 26.3—beating Claude Sonnet by 2.6 points—while costing $36 to run compared to Sonnet's $549.
Anthropic launches Code Review service at $15-25 per PR, takes 20 minutes per review
Anthropic has launched Code Review, a new service that deploys multiple specialized agents to scan pull requests for bugs, security vulnerabilities, and regressions. According to Anthropic, reviews average $15-25 per PR and take approximately 20 minutes to complete, scaling with code complexity.
GitHub removes premium AI models from free Copilot Student plan
GitHub has removed premium AI models including GPT-5.4, Claude Opus, and Claude Sonnet from its free Copilot Student plan effective March 12, 2026. The change leaves students with access only to lower-cost models: Claude 4.5 Haiku, Gemini 3.1 Pro, and GPT-5.3 Codex. GitHub's decision triggered 2,874 downvotes versus 21 upvotes on the announcement, with students arguing they need premium models to learn industry-standard tools.
Google Antigravity quotas drop dramatically as pricing shifts to paid credits model
Google has restructured pricing for its Antigravity agentic AI coding tool, shifting from quota-based access to a hybrid credits model. Developers on the $20/month AI Pro plan report weekly rate limits have tightened dramatically—one user tracked a drop from 300M input tokens weekly to under 9M. The company now directs heavy users toward the $249.99/month Ultra plan.
Google AI Pro and Ultra tiers bring tiered Gemini access; Ultra tops out at advanced reasoning
Google has restructured its AI subscription offerings into two main tiers: Google AI Pro (formerly Gemini Advanced) and the higher-tier Google AI Ultra. The tiers provide different access levels to Gemini models and capabilities, with Ultra positioned as the premium option for advanced reasoning tasks.