LLM News

Every LLM release, update, and milestone.

researchAi2

AI2 Releases DiScoFormer: Single Transformer Estimates Density and Score Across Distributions Without Retraining

Allen Institute for AI (AI2) has released DiScoFormer, a transformer model that estimates both the density and score of any distribution from a sample in a single forward pass without retraining. In 100 dimensions, the model reduces score estimation error by 6.5x and density error by 37x compared to classical kernel density estimation.

June 29, 2026 · 6:20 PM2 min read

research transformers density-estimation

via huggingface.co ↗

product updateAmazon Web Services

AWS demonstrates two-model pipeline using Nova 2 Lite and Claude Sonnet 4.6 that cuts document processing costs by 67%

AWS published a technical demonstration showing that pairing Amazon Nova 2 Lite with Anthropic's Claude Sonnet 4.6 reduces document processing costs by approximately two-thirds compared to single-model approaches. The two-stage pipeline processed 336 scanned yearbook pages at $0.0027 per page, producing 3,122 name-to-face associations with 93% scoring at or above 0.95 confidence.

June 29, 2026 · 6:05 PM3 min read

amazon-nova claude-sonnet document-processing

via aws.amazon.com ↗

product update

Cursor launches iOS app for mobile code review and agent management after SpaceX acquisition

Cursor has released its first iOS app for iPhone and iPad, allowing developers to launch coding agents, review pull requests, and manage engineering work from mobile devices. The app arrives weeks after SpaceX acquired the AI coding company in June 2026.

June 29, 2026 · 5:51 PM2 min read

cursor ios spacex

via 9to5mac.com ↗

product updateAmazon Web Services

AWS releases automated healthcare claims pipeline using Amazon Bedrock Data Automation and AgentCore

AWS has published a technical implementation guide for automating healthcare claims processing using Amazon Bedrock Data Automation and Amazon Bedrock AgentCore. The pipeline extracts data from CMS-1500 claim forms, validates against AWS HealthLake records, and generates FHIR-compliant claim resources with automated notifications.

June 29, 2026 · 5:50 PM2 min read

AWS Amazon Bedrock AgentCore

via aws.amazon.com ↗

product updateAmazon Web Services

AWS launches AgentCore Observability for Amazon Bedrock to debug production AI agents

Amazon Web Services launched AgentCore Observability for Amazon Bedrock, a debugging tool that provides visibility into AI agent execution through OpenTelemetry traces, CloudWatch metrics, and structured logs. The tool addresses silent failures in production agents including infinite reasoning loops, incorrect tool selection, and plausible but incorrect answers.

June 29, 2026 · 5:35 PM2 min read

amazon-aws observability debugging

via aws.amazon.com ↗

product update

Cursor launches iOS app for managing AI coding agents from mobile devices

Cursor released an iOS app Monday that allows developers to prompt and manage coding agents from their phones. The app integrates with Cursor 2.0's agent-based workflow, enabling users to initiate new agents or interact with existing desktop-initiated agents remotely.

June 29, 2026 · 5:20 PM2 min read

cursor mobile-app ios

via techcrunch.com ↗

model release

DeepReinforce Releases Ornith-1.0, Open-Source Agentic Coding Model in 9B to 397B Sizes

DeepReinforce has released Ornith-1.0, an MIT-licensed model designed for agentic coding tasks with variants ranging from 9B to 397B parameters. Built on top of Apache 2.0-licensed Gemma 4 and Qwen 3.5 base models, the company claims it achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks.

June 29, 2026 · 4:36 PM2 min read

ornith deepreinforce agentic-coding

via simonwillison.net ↗

product update

Google expands Gemini-powered meeting transcription to AI Pro ($19.99/month) and Ultra subscribers

Google has expanded access to its Gemini-powered 'Take notes for me' feature in Google Meet to AI Pro and Ultra subscribers. The tool, previously limited to Workspace customers since 2024, automatically transcribes calls and generates summaries with action items for $19.99 per month.

June 29, 2026 · 4:35 PM1 min read

google gemini google-meet

via 9to5google.com ↗

product update

Scam.ai launches Halo on-device deepfake detection model for live video calls with Qualcomm partnership

Scam.ai announced a partnership with Qualcomm and launched Halo, an on-device deepfake detection model for live video calls. The announcements were made at Computex 2026 in Taipei, where Scam.ai was featured at Qualcomm's booth.

June 29, 2026 · 8:05 AM1 min read

deepfake detection on-device AI video security

via artificialintelligence-news.com ↗

product updateDeepSeek

Cline v4.0.2 Adds DeepSeek Reasoning Effort Controls, Including 'xhigh' Setting

Cline, the autonomous AI coding assistant, released v4.0.2 with support for reasoning effort controls on DeepSeek thinking models, including the new 'xhigh' setting. The update also improves the ClinePass provider experience with clearer reasoning controls and model selection.

June 29, 2026 · 4:20 AM1 min read

cline deepseek coding-assistant

via github.com ↗

model releaseDeepSeek

DeepSeek Releases V4 Models: 1M Context Window, 90% Less KV Cache Than V3

DeepSeek has released two new MoE models: DeepSeek-V4-Pro with 1.6T parameters (49B activated) and DeepSeek-V4-Flash with 284B parameters (13B activated). Both models support a one million token context window and use a hybrid attention architecture that requires only 27% of single-token inference FLOPs and 10% of KV cache compared to DeepSeek-V3.2.

June 29, 2026 · 12:36 AM2 min read

deepseek model-release moe

via huggingface.co ↗

benchmarkZhipu AI

China's Zhipu AI releases GLM-5.2, claims parity with Mythos on cybersecurity benchmarks

Zhipu AI released its open-weight GLM-5.2 model, with researchers claiming it matches Anthropic's Mythos on certain bug-finding and cybersecurity tasks. The model lags behind Anthropic and OpenAI models on general benchmarks but represents a significant narrowing of capabilities between Chinese and US AI systems.

June 28, 2026 · 9:50 PM2 min read

zhipu-ai glm-5.2 mythos

via theverge.com ↗

analysisZyphra

Open Model Ecosystem Shifts from Chinese Dominance to Global Diversity with 550B NVIDIA, 218B Cohere Releases

The open model ecosystem is diversifying beyond its previous Chinese dominance, with companies like NVIDIA, Cohere, Poolside, and Zyphra releasing models under permissive licenses. NVIDIA's 550B parameter Nemotron-3-Ultra uses LatentMoE architecture and switches to the OpenMDW license, while Cohere released Command A+ as a 218B-A25B MoE under Apache 2.0.

June 28, 2026 · 5:20 PM2 min read

open-source model-releases nvidia

via interconnects.ai ↗

model releaseDeepSeek

DeepSeek Releases V4-Pro with 1.6T Parameters, 1M Token Context at 27% Inference Cost of V3

DeepSeek has released two Mixture-of-Experts models: V4-Pro with 1.6 trillion parameters (49B activated) and V4-Flash with 284B parameters (13B activated), both supporting 1 million token context windows. V4-Pro requires only 27% of inference FLOPs and 10% of KV cache compared to V3.2 at 1M token context, trained on over 32 trillion tokens.

June 27, 2026 · 3:51 PM2 min read

DeepSeek mixture-of-experts long-context

via huggingface.co ↗

analysisAnthropic

U.S. clears Anthropic's Mythos 5 cybersecurity model for limited deployment after two-week ban

The U.S. Commerce Department has cleared Anthropic to restore access to its Mythos 5 AI model for select cybersecurity partners, two weeks after imposing export controls over jailbreak concerns. The related Fable 5 model remains under government restrictions.

June 27, 2026 · 2:20 PM3 min read

Anthropic Mythos 5 Fable 5

via fortune.com ↗

model releaseAnthropic

Anthropic's Fable 5 model expected to return next week after 15-day government shutdown

The Trump administration is close to allowing Anthropic to restore access to its Fable 5 model, which has been offline for 15 days due to national security concerns. Insiders expect restrictions could be lifted as soon as next week, though Pentagon and NSA approval is still required.

June 27, 2026 · 1:06 PM3 min read

Anthropic Fable 5 Mythos 5

via axios.com ↗

model releaseOpenAI

OpenAI previews GPT-5.6 to select partners with three variants priced from $1 to $30 per million tokens

OpenAI has begun previewing its GPT-5.6 series to a limited group of trusted partners after government review. The release includes three variants: Sol at $5 input/$30 output per million tokens, Terra at $2.50/$15, and Luna at $1/$6.

June 27, 2026 · 4:35 AM2 min read

OpenAI GPT-5.6 pricing

via engadget.com ↗

product updateAnthropic

US government authorizes Anthropic to restore Mythos 5 cybersecurity model to 100+ institutions

The US government has authorized Anthropic to redeploy its Mythos 5 cybersecurity AI model to more than 100 US institutions, including major corporations and government agencies, following a two-week suspension. Commerce Secretary Howard Lutnick approved the redeployment after Anthropic implemented safeguards and committed to work with the government on release protocols.

June 27, 2026 · 2:05 AM2 min read

anthropic mythos-5 cybersecurity

via engadget.com ↗

product updateAnthropic

Trump Administration Permits Anthropic's Claude Mythos 5 for 100+ US Organizations After Two-Week Ban

The Trump administration is allowing Anthropic to deploy Claude Mythos 5 to over 100 specific US government agencies and companies, two weeks after banning the cybersecurity model. Commerce Secretary Howard Lutnick approved access for organizations operating critical infrastructure, including non-American employees, though Fable 5 remains unavailable.

June 27, 2026 · 1:20 AM2 min read

anthropic claude mythos-5

via techcrunch.com ↗

model releaseAnthropic

US government allows Anthropic to release Claude Mythos 5 to 100+ institutions after two-week export control block

The US Commerce Department has partially lifted export controls on Anthropic's Claude Mythos 5 model, permitting its release to over 100 US institutions including major companies and government agencies. The restrictions, imposed two weeks ago alongside a block on Claude Fable 5, reportedly stemmed from concerns about potential jailbreaks and Chinese access.

June 26, 2026 · 11:20 PM2 min read

anthropic claude export-controls

via 9to5mac.com ↗

← PreviousPage 4 of 47Next →