LLM News

Every LLM release, update, and milestone.

product update

Google announces Spark AI agent, Information agents, and Android Halo at I/O 2026—all paywalled behind $100/month Ultra

Google announced multiple AI agent products at I/O 2026, including Spark for managing digital tasks, Information agents for 24/7 topic monitoring, and Android Halo for notifications. All features remain paywalled behind the $100/month Gemini Ultra plan, and Google has given no timeline for free access.

May 21, 2026 · 2:05 PM · 3 min read

google google-deepmind ai-agents

via techcrunch.com ↗

product update

Google cuts AI Ultra plan to $200/month, launches new $100 developer tier

Google announced pricing changes to its Gemini AI subscription tiers at I/O 2026, cutting its top AI Ultra plan from $250 to $200 per month while introducing a new $100/month developer-focused tier. All plans now get access to Gemini 3.5 Flash and the new Gemini Omni video generation model.

May 21, 2026 · 1:36 PM · 3 min read

google-deepmind gemini pricing

via zdnet.com ↗

analysis · OpenAI

OpenAI reasoning model solves 80-year math problem as Anthropic hits $10.9B quarterly revenue

In a two-hour span Wednesday, OpenAI announced its reasoning model autonomously solved an 80-year-old geometry problem while Anthropic reported it's on track for $10.9 billion in Q2 revenue with $559 million in operating profit—two years ahead of internal projections. The developments came alongside Nvidia's $81.6 billion quarter, Anthropic's $1.25 billion monthly SpaceX compute deal, and a White House AI executive order signing.

May 21, 2026 · 9:35 AM · 2 min read

OpenAI Anthropic SpaceX

via axios.com ↗

model release · Cohere

Cohere Releases Command A+ Open Source Model with 25B Active Parameters, 128K Context

Cohere has released Command A+ as an open source model under Apache 2.0 license. The sparse mixture-of-experts architecture features 25 billion active parameters out of 218B total parameters, supports 128K input context length, and includes vision capabilities alongside tool use and reasoning features.

May 21, 2026 · 1:20 AM · 2 min read

cohere command-a-plus open-source

via huggingface.co ↗

model release · Cohere

Cohere Releases Command A+: 218B-Parameter MoE Model With 4-Bit Quantization Runs on Single B200 GPU

Cohere has released Command A+, an open-source sparse mixture-of-experts model with 218 billion total parameters and 25 billion active parameters. The model features W4A4 quantization allowing deployment on a single Nvidia B200 GPU, supports 128K input context, and includes built-in chain-of-thought reasoning with vision capabilities.

May 20, 2026 · 10:51 PM · 2 min read

cohere command-a-plus mixture-of-experts

via huggingface.co ↗

research · OpenAI

OpenAI claims reasoning model disproved 80-year-old Erdős conjecture in geometry

OpenAI claims its new reasoning model has produced an original mathematical proof disproving a geometry conjecture first posed by Paul Erdős in 1946. The company says this is the first time AI has autonomously solved a prominent open problem central to a field of mathematics, with verification from mathematicians including Thomas Bloom and Noga Alon.

May 20, 2026 · 8:35 PM · 2 min read

OpenAI reasoning models mathematics

via techcrunch.com ↗

product update

AWS releases four multimodal evaluators for image-to-text AI tasks in Strands Evals SDK

AWS has added four multimodal evaluators to its Strands Evals SDK that judge image-to-text AI outputs by directly analyzing source images. The evaluators—Overall Quality, Correctness, Faithfulness, and Instruction Following—use multimodal large language models to detect visual hallucinations, factual errors, and instruction violations that text-only judges miss.

May 20, 2026 · 6:20 PM · 2 min read

AWS Amazon Bedrock Strands Evals

via aws.amazon.com ↗

model release · xAI

xAI Launches Grok Build 0.1: Coding Model with 256K Context for Agentic Workflows

xAI has released Grok Build 0.1, a coding-specialized model with a 256K context window and unlimited text output. The model is designed for agentic software engineering workflows and powers xAI's Grok Build CLI tool.

May 20, 2026 · 5:50 PM · 2 min read

xAI Grok coding models

via openrouter.ai ↗

product update · Amazon Web Services

AWS SageMaker AI adds bidirectional streaming for real-time speech transcription with vLLM

Amazon SageMaker AI has launched bidirectional streaming support for real-time inference, enabling WebSocket-based voice applications through vLLM integration. The feature uses HTTP/2 on port 8443 to bridge client connections with vLLM's Realtime API, allowing audio to stream in while transcription streams back simultaneously over a single persistent connection.

May 20, 2026 · 5:20 PM · 2 min read

AWS SageMaker vLLM

via aws.amazon.com ↗

product update

Google launches Universal Cart, an AI agent that shops across multiple retailers in one checkout

Google announced Universal Cart at its I/O developer conference, an AI-powered shopping system that consolidates purchases from multiple retailers including Target, Shopify, Wayfair, and Etsy into a single checkout. The feature uses Gemini's agentic AI to verify product compatibility, suggest better deals, and automate routine purchases.

May 20, 2026 · 4:05 PM · 2 min read

google gemini ai-agents

via zdnet.com ↗

product update

Google Announces Gemini Spark Agent and Antigravity Platform at I/O, Launch Date Not Disclosed

Google announced Gemini Spark at I/O 2026, positioning it as a competitor to OpenAI's Claude-based agents. The service will integrate with Gmail, Calendar, Drive, and other Google apps, running on Gemini 3.5 Flash and a new platform called Antigravity. No general availability date has been disclosed.

May 20, 2026 · 3:51 PM · 2 min read

google gemini agent

via simonwillison.net ↗

model release · Stability AI

Stability AI Releases Stable Audio 3.0 Model Family Trained on Licensed Data

Stability AI has released Stable Audio 3.0, a model family for audio generation trained on fully licensed data. The company positions the release as a foundation for commercial audio applications, though specific technical specifications have not yet been disclosed.

May 20, 2026 · 3:05 PM · 1 min read

stability-ai stable-audio audio-generation

via stability.ai ↗

analysis

Google bets Gemini Spark and 3.5 Flash can catch OpenClaw's agentic AI success

Google announced Gemini Spark, a cloud-based AI agent that runs 24/7 across Gmail, Drive, and 30+ external partners, powered by the upcoming Gemini 3.5 Flash model. The company claims the new model is four times faster than competing frontier models and costs less than half as much, a direct response to OpenClaw's viral success since November 2025.

May 20, 2026 · 1:35 PM · 2 min read

Google Gemini AI Agents

via theverge.com ↗

analysis

Google I/O 2026 announces Gemini Omni model and AI-powered search integration

Google's I/O 2026 developer conference centered entirely on AI announcements, including a new Gemini Omni model, expanded AI capabilities in Google Search, an agentic personal assistant called Spark, and the first Android XR glasses.

May 20, 2026 · 12:35 PM · 2 min read

google gemini io-2026

via engadget.com ↗

model release

Google releases Gemini Omni Flash video generation model with conversational editing, withholds speech synthesis

Google DeepMind released Gemini Omni Flash, the first model in its new Omni family that generates and edits video from image, audio, video, and text inputs. The model is rolling out to Gemini app subscribers and YouTube Shorts with a 10-second clip limit, while speech-editing capabilities remain withheld pending safety testing.

May 20, 2026 · 9:20 AM · 2 min read

google-deepmind video-generation gemini

via thenextweb.com ↗

model release

NemoStation releases Marlin-2B: 2-billion parameter video VLM places between Tarsier-34B and Gemini-1.5-Pro on DREAM-1K dense captioning

NemoStation has released Marlin-2B, a 2-billion parameter video vision-language model that produces structured scene and event captions with second-precise timestamps. The model tops the CaReBench dense captioning leaderboard and sits between Tarsier-34B and Gemini-1.5-Pro on DREAM-1K, while matching Gemini-2.0-Flash on temporal grounding benchmarks.

May 20, 2026 · 7:51 AM · 2 min read

marlin nemostation video-vlm

via huggingface.co ↗

product update

llm-gemini Plugin Adds Support for Google's Gemini 3.5 Flash Model

Developer Simon Willison released version 0.32 of the llm-gemini plugin, which adds support for Google's Gemini 3.5 Flash model. The plugin enables command-line access to Google's Gemini model family through the LLM tool.

May 20, 2026 · 12:05 AM · 1 min read

gemini google-deepmind plugin

via simonwillison.net ↗

product update · OpenAI

OpenAI adopts C2PA metadata standard and Google's SynthID watermarking for AI image detection

OpenAI is joining the C2PA open standard and embedding Google DeepMind's invisible SynthID watermark in all AI-generated images from its models. The company is launching a public verification tool that checks for both C2PA metadata and SynthID watermarks, though detection only works for images created by OpenAI's own products.

May 19, 2026 · 11:05 PM · 2 min read

openai google c2pa

via thenextweb.com ↗

product update

Google launches Pics, AI design app for Workspace powered by Nano Banana 2

Google announced Pics at I/O 2026, an AI design app for Google Workspace that generates social media graphics, invitations, and marketing materials from text prompts. Powered by Nano Banana 2 for generation and Gemini for editing, the app launches to testers now and rolls out to Google AI Ultra subscribers this summer.

May 19, 2026 · 9:50 PM · 2 min read

google generative-ai design-tools

via techcrunch.com ↗

product update

Google launches Gmail Live, voice-powered AI inbox assistant for Ultra subscribers this summer

Google announced Gmail Live at I/O 2026, a Gemini-powered conversational AI feature that lets users ask natural-language questions about their inbox instead of typing search terms. The voice-powered tool will roll out this summer exclusively to Google AI Ultra subscribers.

May 19, 2026 · 9:35 PM · 2 min read

Gmail Google Gemini

via techcrunch.com ↗
