LLM News

Every LLM release, update, and milestone.

0
product update

Google testing Gemini app for macOS with Desktop Intelligence feature

Google is testing a native Gemini app for macOS, according to Bloomberg. The app would compete directly with OpenAI's ChatGPT and Anthropic's Claude, both of which offer standalone Mac applications. A key differentiator is 'Desktop Intelligence,' which allows Gemini to view screen context and pull content from apps to personalize responses.

1 min readvia engadget.com
0
product update

ElevenLabs launches music marketplace for AI-generated tracks with no copyright protection

ElevenLabs has launched a music marketplace where users can publish and sell tracks created with its ElevenCreative AI music model, with the company claiming to have already generated nearly 14 million songs on the platform. The company has paid out over $11 million through its Voice Marketplace using the same model. However, AI-generated music lacks legal copyright protection, leaving all legal risk on users.

2 min readvia the-decoder.com
0
product update

Google begins beta testing dedicated Gemini app for macOS

Google has begun beta testing a dedicated Gemini app for macOS with select users, according to Bloomberg. The early version includes only critical features and hints at a "Desktop Intelligence" capability that lets Gemini see screen context. The move addresses a competitive gap, as Anthropic and OpenAI already offer native Mac apps for Claude and ChatGPT.

2 min readvia 9to5mac.com
0
model release

Cursor releases Composer 2 at $0.50/$2.50 per 1M tokens, undercutting Claude and GPT-4 on pricing

Cursor released Composer 2, a code-specialized model priced at $0.50 per million input tokens and $2.50 per million output tokens—roughly 90% cheaper than Claude Opus 4.6 ($5.00/$25.00) and 60% cheaper than GPT-5.4 ($2.50/$15.00). The model scores 61.3 on Cursor's internal CursorBench, competitive with Claude Opus 4.6 (58.2) but below GPT-5.4 Thinking (63.9).

2 min readvia the-decoder.com
0
product updateAmazon Web Services

Amazon's Alexa+ launches Early Access in UK with British localization

Amazon's Alexa+, its conversational AI assistant, begins Early Access in the UK on March 19, rolling out to hundreds of thousands of users. The updated assistant includes British English localization with regional speech patterns, slang understanding, and agentic capabilities. Free for Prime members; £20/month for non-members.

0
product updateAmazon Web Services

Amazon launches Alexa+ in UK with £19.99/month pricing for non-Prime users

Amazon is rolling out Alexa+, its AI-powered conversational assistant, to the UK as the first market outside North America. The company is offering free early access to new Echo buyers and plans to reach hundreds of thousands of customers in the coming weeks. After the beta period ends, Prime subscribers will get free access while non-Prime users will pay £19.99 per month.

2 min readvia techcrunch.com
0
product update

Okta launches agent management platform with discovery, governance, and kill-switch controls

Okta has released Okta for AI Agents, a management platform that addresses three core requirements: discovering deployed agents, monitoring their activities and access permissions, and terminating agent access when needed. The platform integrates with Salesforce, ServiceNow, Google, and AWS to import agents and their metadata, while providing continuous background scanning for unmanaged agents.

0
product update

Google tests prominent Temporary Chat button in Gemini app homepage

Google is testing a UI redesign for its Gemini app that moves the Temporary chat feature from a navigation drawer icon to a prominent button on the homepage. The change aims to make privacy-focused conversations more discoverable while maintaining Temporary chats' core privacy guarantee: conversations aren't stored in history, used for model training, or personalization.

2 min readvia 9to5google.com
0
product updateGoogle DeepMind

Google Deepmind adds multi-tool chaining and context circulation to Gemini API

Google Deepmind has expanded the Gemini API to enable multi-tool chaining, allowing developers to combine built-in tools like Google Search and Google Maps with custom functions in a single request. Results from one tool now automatically pass to the next through context circulation, eliminating the need for separate sequential handling.

0
product update

Perplexity's Comet AI browser launches free iOS app after $200/month PC debut

Perplexity has released Comet, its AI-powered browser, as a free standalone app for iPhone users. Originally launched on PC at $200 per month, the iOS version joins recently-released Android and existing Windows and Mac versions. The browser combines web browsing with AI assistance for summarization, research, and task automation.

0
model releaseOpenAI

OpenAI releases GPT-4o mini with 128K context at $0.15/$0.60 per 1M tokens

OpenAI released GPT-4o mini on July 18, 2024, a compact multimodal model with 128,000 token context window priced at $0.15 per million input tokens and $0.60 per million output tokens. The model achieves 82% on MMLU and claims to rank higher than GPT-4 on chat preference leaderboards while costing 60% less than GPT-3.5 Turbo.

2 min readvia openrouter.ai
0
product updateAmazon Web Services

Amazon Nova 2 Lite surpasses Nova 1 Pro with 1M token context and extended thinking at 7x lower cost

Amazon Nova 2 Lite expands context window to 1 million tokens, introduces extended thinking with developer controls, and adds native tool use and web grounding. AWS claims Nova 2 Lite surpasses Nova 1 Pro on multi-step reasoning while costing 7x less and running up to 5x faster.

0
product update

Midjourney V8 achieves 5x faster generation but premium features cost 4x more

Midjourney has released an early version of V8 for community testing, achieving roughly 5x faster image generation and introducing native 2K resolution via --hd mode. However, premium features including --hd, --q 4, style references, and mood boards cost four times as much as standard generation, with Relax mode unavailable at launch.

0
product update

Meta's Manus launches desktop app enabling AI agents to access local files and applications

Meta's recently acquired AI startup Manus launched a desktop application enabling its AI agent to directly access local files, tools, and applications on personal computers through a 'My Computer' feature. Previously cloud-only, the move positions Manus to compete with OpenClaw, the open-source AI agent that sparked recent industry momentum. Unlike OpenClaw's free, MIT-licensed offering, Manus operates as a paid subscription service.

2 min readvia cnbc.com
0
model releaseOpenAI

OpenAI releases GPT-5.4 mini and nano with 3-4x price increases but major performance gains

OpenAI has released GPT-5.4 mini and GPT-5.4 nano, compact models optimized for coding and subagent tasks. The new models deliver significant performance improvements—GPT-5.4 mini reaches 54.4% on SWE-Bench Pro versus 45.7% for GPT-5 mini—but cost 3-4x more per input token than their predecessors.

2 min readvia the-decoder.com
0
analysis

Mistral's Leanstral code verification agent outperforms Claude Sonnet at 15% of the cost

Mistral has released Leanstral, a 120B-parameter code verification agent built with the Lean programming language, claiming it outperforms larger open-source models and offers significant cost advantages over Anthropic's Claude suite. The model achieves a pass@2 score of 26.3—beating Claude Sonnet by 2.6 points—while costing $36 to run compared to Sonnet's $549.