LLM News

Every LLM release, update, and milestone.

research

Major AI models mention religion 5-16% of the time when humans expect it 45-59%, multi-university study finds

Large language models systematically exclude religious perspectives when answering questions about grief, ethics, and family, according to new research from a multi-university consortium. Americans expected religion in AI responses 45-59% of the time depending on topic, but models mentioned it only 5-16% of the time.

June 1, 2026 · 9:35 AM3 min read

AI bias religious AI LLM research

via axios.com ↗

model releaseStepFun

StepFun Releases Step-3.7-Flash: 198B-Parameter Sparse MoE Model With 256K Context in GGUF Format

StepFun has released Step-3.7-Flash, a 198B-parameter sparse Mixture-of-Experts vision-language model that activates approximately 11B parameters per token. The model supports a 256K context window, native image understanding via a 1.8B-parameter vision encoder, and offers three selectable reasoning levels.

June 1, 2026 · 8:06 AM2 min read

StepFun Step-3.7-Flash MoE

via huggingface.co ↗

model releaseNVIDIA

NVIDIA Releases Cosmos 3: 8B and 32B Omni-Models Combining Video Generation, Reasoning, and Action in Single Architectur

NVIDIA has released Cosmos 3, a unified omni-model that combines world generation, physical reasoning, and action generation in a single architecture. Available in 8B (Nano) and 32B (Super) parameter versions on Hugging Face, Cosmos 3 uses a Mixture-of-Transformers architecture to process text, image, video, audio, and action modalities without switching between separate models.

June 1, 2026 · 4:51 AM2 min read

nvidia multimodal video-generation

via huggingface.co ↗

model release

MiniMax Launches M3 Model With 1M Context Window at $0.30 Per Million Input Tokens

MiniMax has released M3, a multimodal foundation model supporting text, image, and video inputs with a 1-million-token context window. The model costs $0.30 per million input tokens and $1.20 per million output tokens, available through OpenRouter.

June 1, 2026 · 1:05 AM2 min read

MiniMax M3 multimodal

via openrouter.ai ↗

changelogAnthropic

OpenCode v1.15.13 Adds Session Metadata API, Fixes Anthropic Opus 4.7 Adaptive Reasoning Bug

OpenCode v1.15.13 introduces custom session metadata storage through its API and SDK. The release fixes a bug where Anthropic's Opus 4.7+ adaptive reasoning returned empty thinking blocks instead of summarized thinking.

May 30, 2026 · 11:50 PM1 min read

opencode changelog anthropic

via github.com ↗

product updateMicrosoft

GitHub Copilot switches to token-based billing June 1, some users report costs jumping from $50 to $3,000

Microsoft is ending GitHub Copilot's flat-rate subscription model in favor of token-based billing starting June 1. Some developers report monthly costs rising from approximately $29-50 to $750-3,000, while others claim the increases only affect inefficient "vibe-coders" who iterate excessively without clear direction.

May 30, 2026 · 4:35 PM2 min read

GitHub Copilot Microsoft pricing

via techcrunch.com ↗

changelog

Vercel AI SDK Deprecates searchParameters for xAI, Adds Image Search Support

Vercel released AI SDK version 4.0.0-canary.69 with breaking changes to xAI integration. The update deprecates the searchParameters option for xAI live search, replacing it with dedicated web_search and x_search agent tools, and adds image search capability through a new enableImageSearch parameter.

May 30, 2026 · 1:21 AM1 min read

vercel xai ai-sdk

via github.com ↗

changelog

Vercel AI SDK Adds Support for Google's Deep Research Models and Gemini Embedding-2

Vercel released version 5.0.0-canary.98 of its Google Vertex AI SDK, adding support for three new Google models: deep-research-max-preview-04-2026, deep-research-preview-04-2026, and gemini-embedding-2. The update enables developers to integrate Google's research-focused models and latest embedding model into applications using Vercel's AI SDK.

May 30, 2026 · 1:21 AM1 min read

vercel google ai-sdk

via github.com ↗

changelog

Vercel AI SDK Adds Support for Gemini Embedding 2 and Deep Research Models

Vercel released version 4.0.0-canary.75 of its AI SDK Google package on May 30, adding support for three new Google models: gemini-embedding-2, deep-research-max-preview-04-2026, and deep-research-preview-04-2026. The update enables developers to integrate Google's latest embedding and deep research capabilities into applications built with the Vercel AI SDK.

May 30, 2026 · 1:20 AM2 min read

vercel google gemini

via github.com ↗

product updateOpenAI

OpenAI's Codex for Windows gains Computer Use and remote control from ChatGPT mobile apps

OpenAI has expanded its Codex desktop app to Windows with Computer Use capabilities and remote control from ChatGPT mobile apps. The features, previously Mac-only, allow Codex to operate Windows desktop applications autonomously and enable iPhone, iPad, and Android users to initiate and monitor Codex tasks on Windows devices.

May 29, 2026 · 6:35 PM2 min read

openai codex chatgpt

via 9to5mac.com ↗

product update

Google launches Gemini Spark AI agent for Ultra subscribers in US with automated task execution

Google has launched Gemini Spark, a 24/7 AI agent for Google AI Ultra subscribers in the US. The service automates tasks across Google Workspace apps with remote browser control, supporting up to 15 concurrent tasks with compute-based usage limits.

May 29, 2026 · 5:50 PM2 min read

gemini google ai-agent

via 9to5google.com ↗

fundingAnthropic

Anthropic raises $65B at $965B valuation, releases Claude Opus 4.8, plans wider Mythos rollout

Anthropic closed a $65 billion Series H at a $965 billion valuation, making it the most valuable AI startup globally and surpassing OpenAI's $852 billion March valuation. The company simultaneously released Claude Opus 4.8 and announced plans to bring its Mythos cyber-focused model to all customers within weeks.

May 29, 2026 · 4:51 PM3 min read

anthropic funding claude-opus

via fortune.com ↗

model release

StepFun releases Step-3.7-Flash: 198B-parameter MoE model with 256K context at $0.20/M input tokens

StepFun has released Step-3.7-Flash, a 198B-parameter sparse Mixture-of-Experts vision-language model that activates 11B parameters per token and delivers up to 400 tokens per second. The model supports a 256K context window, three selectable reasoning levels, and is priced at $0.20 per million input tokens (cache miss) and $1.15 per million output tokens.

May 29, 2026 · 12:51 PM2 min read

stepfun mixture-of-experts vision-language-model

via huggingface.co ↗

model releaseLiquid Ai

Liquid AI Releases LFM2.5-8B: 8-Billion Parameter Hybrid Model Optimized for Edge Deployment

Liquid AI has released LFM2.5-8B-A1B, an 8-billion parameter hybrid model designed specifically for edge AI and on-device deployment. The model is available in multiple GGUF quantized formats ranging from 4-bit (4.84 GB) to 16-bit (16.9 GB), optimized for memory efficiency.

May 29, 2026 · 4:21 AM2 min read

Liquid AI LFM2.5 edge AI

via huggingface.co ↗

changelog

Google caps single-prompt quota for Gemini 3.1 Pro, makes Flash-Lite free after usage limit complaints

Google has modified Gemini's compute-based usage limits introduced at I/O 2026 after users reported depleting quotas too quickly. The company is now capping how much quota a single Gemini 3.1 Pro prompt can consume and making all 3.1 Flash-Lite prompts free.

May 29, 2026 · 2:20 AM2 min read

google-deepmind gemini usage-limits

via 9to5google.com ↗

model releaseStepFun

StepFun launches Step 3.7 Flash: 196B MoE model with 256K context and adjustable reasoning levels at $0.20/$1.15 per 1M

StepFun has released Step 3.7 Flash, a 196B-parameter Mixture-of-Experts model that activates approximately 11B parameters per token. The multimodal model supports a 256K context window and introduces selectable reasoning levels (high/medium/low), priced at $0.20 per 1M input tokens and $1.15 per 1M output tokens.

May 29, 2026 · 12:20 AM2 min read

StepFun Step 3.7 Flash Mixture-of-Experts

via openrouter.ai ↗

model releaseAnthropic

Anthropic's Opus 4.8 matches Claude Mythos Preview in alignment, cuts thinking mode costs by 67%

Anthropic released Claude Opus 4.8 on May 28, 2026, replacing Opus 4.7 at unchanged pricing. The company claims the model's misalignment rates match those of Claude Mythos Preview, the experimental model deemed too dangerous for public release in April 2026. Opus 4.8 delivers faster thinking modes at one-third the cost of version 4.7.

May 28, 2026 · 9:21 PM2 min read

anthropic claude opus-4-8

via zdnet.com ↗

product updateMicrosoft

Microsoft strips color from Copilot interface in pursuit of 'intelligence that feels present but not imposing'

Microsoft has rolled out a visual overhaul for Copilot in Microsoft 365, replacing the colorful interface with a predominantly black-and-white, text-forward design. The redesign, aimed at making the AI assistant feel "present but not imposing," includes a new adaptive prompt surface and consistent side panel placement across Word, PowerPoint, and Excel.

May 28, 2026 · 8:21 PM2 min read

microsoft copilot ui-redesign

via engadget.com ↗

product updateMicrosoft

Microsoft 365 Copilot gains 2x faster load times and progressive disclosure interface

Microsoft is rolling out a redesigned Microsoft 365 Copilot that loads twice as fast, according to the company. The update introduces "progressive disclosure" — showing tools and controls contextually based on prompts rather than displaying all options at once.

May 28, 2026 · 8:20 PM1 min read

Microsoft Copilot Microsoft 365

via theverge.com ↗

model releaseAnthropic

Anthropic releases Claude Opus 4.8 with improved agentic coding and reasoning benchmarks

Anthropic released Claude Opus 4.8 on May 28, 2026, with improved performance in agentic coding, computer use, and reasoning benchmarks. Pricing remains at $5 per million input tokens and $25 per million output tokens, while the model's fast mode is now three times cheaper than previous versions.

May 28, 2026 · 7:05 PM2 min read

Claude Anthropic model release

via 9to5google.com ↗

← PreviousPage 18 of 47Next →