Breaking

Mistral's Leanstral code verification agent outperforms Claude Sonnet at 15% of the cost

Mistral has released Leanstral, a 120B-parameter code verification agent built with the Lean programming language, claiming it outperforms larger open-source models and offers significant cost advantages over Anthropic's Claude suite. The model achieves a pass@2 score of 26.3—beating Claude Sonnet by 2.6 points—while costing $36 to run compared to Sonnet's $549.

March 17, 2026

Latest News

All news →
product update

Google cuts Gemini for Home latency 40%, adds smarter alarms and calendar features

Google has deployed its second major Gemini for Home update this month, reducing response latency by 40% for common commands and substantially shortening voice responses for alarms, timers, and calendar events. The update adds five new alarm/timer capabilities including world event-based triggers and recurring alarms, while expanding Gemini Live translation to 30 languages and rolling out Home features to 19 additional countries.

3 min readvia 9to5google.com
product update

Google expands Personal Intelligence to all US free-tier users via Gemini app and Chrome

Google announced Tuesday that all US users, including free-tier subscribers, now have access to Personal Intelligence in the Gemini app, Chrome, and AI Mode in Search. The feature, previously limited to paid AI Pro and AI Ultra subscribers, connects Gmail, Google Photos, YouTube, and other Google services to automatically personalize Gemini's responses without manual prompt engineering.

2 min readvia theverge.com
product updateMicrosoft

Microsoft launches Copilot Health to synthesize medical records and fitness data with AI

Microsoft has launched Copilot Health, an AI-powered tool designed to integrate fitness data from 50+ devices, medical records from 50,000+ US healthcare providers, and health history into unified summaries. The company plans to charge for access via subscription after a free trial period, though pricing remains undisclosed.

product update

Perplexity launches Computer for Enterprise, claims $1.6M labor savings in internal test

Perplexity made Computer for Enterprise generally available to enterprise customers on March 12, claiming an internal study of 16,000+ queries showed $1.6 million in labor cost savings and 3.2 years of equivalent work completed in four weeks. The service integrates with Gmail, Outlook, GitHub, Linear, Slack, Notion, Snowflake, Databricks, and Salesforce, orchestrating tasks across 20 frontier models with agentic internet access.

product updateAnthropic

Anthropic launches Code Review service at $15-25 per PR, takes 20 minutes per review

Anthropic has launched Code Review, a new service that deploys multiple specialized agents to scan pull requests for bugs, security vulnerabilities, and regressions. According to Anthropic, reviews average $15-25 per PR and take approximately 20 minutes to complete, scaling with code complexity.

product updateOpenAI

OpenAI's adult mode will allow erotic text but blocks explicit image, audio, and video generation

OpenAI confirmed its forthcoming "adult mode" will permit text-based erotic conversations in ChatGPT but explicitly block generation of pornographic images, audio, and video. The feature, first announced by CEO Sam Altman in October 2024, has been delayed multiple times—most recently in March 2025—as the company grapples with safety concerns including a 12% error rate in age verification systems.

2 min readvia engadget.com

Latest Models

All →