LLM News

Every LLM release, update, and milestone.

product updateAnthropic

Anthropic expands Claude Cowork to web and mobile, usage data shows 33% of tasks are business operations

Anthropic launched Claude Cowork on web and mobile Tuesday for Max subscribers, allowing tasks to run in the background across devices. Usage data from 1.2 million sessions shows business operations account for 33.4% of Cowork tasks, while software development represents just 8.7%.

July 7, 2026 · 4:36 PM2 min read

anthropic claude-cowork ai-agents

via techcrunch.com ↗

product updateAnthropic

Anthropic brings Claude Cowork agent to mobile and web with background task execution

Anthropic is expanding Claude Cowork from desktop-only to mobile and web platforms. The autonomous agent feature will run tasks in the background across devices, with scheduled execution requiring no device to be online, starting with Max plan subscribers in a gradual rollout over the next several weeks.

July 7, 2026 · 4:35 PM2 min read

anthropic claude cowork

via 9to5mac.com ↗

product updateMicrosoft

Microsoft Foundry Adds Weekly-Refreshed Hugging Face Model Catalog with One-Click GPU Deployment

Microsoft announced at Build 2026 that Foundry Managed Compute now includes a curated catalog of open-weight models from Hugging Face's 3 million model repository, refreshed weekly with one-click deployment. The service pre-stages weights in Azure, provides Microsoft-scanned runtimes (vLLM, SGLang, TensorRT-LLM, NIM, TEI, llama.cpp), and offers pay-per-hour GPU pricing with automatic security patching.

July 7, 2026 · 3:36 PM3 min read

microsoft hugging-face foundry

via huggingface.co ↗

analysisOpenAI

Chinese AI Models Capture 30%+ of U.S. Developer Token Usage as OpenAI, Anthropic Costs Rise

Chinese AI models including DeepSeek and Z.ai have captured over 30% of weekly token usage by U.S. companies on OpenRouter since February 2025, up from 4.5% in the first half of the year. The shift comes as companies seek alternatives 60-90% cheaper than leading models from OpenAI and Anthropic, while Chinese models close the performance gap to within 6-9 months of U.S. frontier systems.

July 7, 2026 · 9:50 AM3 min read

DeepSeek Z.ai market-analysis

via cnbc.com ↗

researchAmazon Web Services

AWS introduces rDPO unlearning technique to reduce false content moderation in Amazon Nova models by 53 percentage point

AWS has developed Reverse Direct Preference Optimization (rDPO), a novel unlearning technique that reduces over-deflection in Amazon Nova models by up to 53 percentage points. The approach allows organizations to selectively adjust content moderation safeguards while preserving general model capabilities through LoRA adapters.

July 6, 2026 · 10:35 PM2 min read

amazon-nova unlearning content-moderation

via aws.amazon.com ↗

model releaseNex Agi

Nex AGI releases Nex-N2-Mini: open-source agentic MoE model with 262K context window

Nex AGI has released Nex-N2-Mini, an open-source agentic mixture-of-experts model with a 262K-token context window. The model accepts text and image inputs and is priced at $0.025 per 1M input tokens and $0.10 per 1M output tokens.

July 6, 2026 · 7:20 PM2 min read

nex-agi nex-n2-mini mixture-of-experts

via openrouter.ai ↗

product updateAmazon Web Services

AWS launches MiniMax M2 family on Amazon Bedrock with 1M token context and MoE architecture

Amazon Web Services has added three MiniMax models to Amazon Bedrock: M2, M2.1, and M2.5. The newest model, M2.5, uses a mixture-of-experts architecture with 230 billion total parameters and 10 billion active per token, trained specifically for agent-native execution and coding tasks.

July 6, 2026 · 5:20 PM2 min read

MiniMax Amazon Bedrock AWS

via aws.amazon.com ↗

product updateAmazon Web Services

AWS Ships Multi-Turn RL Infrastructure for Amazon Nova on SageMaker HyperPod

AWS has released infrastructure for deploying multi-turn reinforcement learning to train Amazon Nova models on SageMaker HyperPod. The system requires a minimum of 10 ml.p5.48xlarge instances and costs approximately $786-$1,180 per hour when running.

July 6, 2026 · 5:06 PM2 min read

AWS Amazon Nova SageMaker HyperPod

via aws.amazon.com ↗

product updateAmazon Web Services

AWS launches Nova-powered PII redaction pipeline for images using SAM 3 and Textract

AWS has released an automated pipeline for redacting personally identifiable information in images, using Amazon Nova 2 Lite as an intelligent coordinator. The solution combines Nova's contextual vision reasoning with Meta's SAM 3 model deployed on SageMaker and Amazon Textract to handle complex PII detection scenarios including faces, fingerprints, ID cards, and license plates.

July 6, 2026 · 5:05 PM2 min read

Amazon Nova PII redaction computer vision

via aws.amazon.com ↗

product updateAnthropic

Anthropic Launches Claude Desktop App for Linux, Local AI Integration Fails

Anthropic has released an official Linux desktop app for Claude, bringing feature parity with macOS and Windows versions. However, the app currently fails to reliably connect to locally-installed AI models like Ollama, limiting Linux users to cloud-based Anthropic plans.

July 6, 2026 · 4:20 PM2 min read

Claude Anthropic Linux

via zdnet.com ↗

model releaseTencent

Tencent Releases Hy3: 295B MoE Model with 256K Context and Configurable Reasoning Modes

Tencent has released Hy3, a 295-billion parameter Mixture-of-Experts model with 21 billion active parameters and a 256,000-token context window. The model features configurable reasoning modes and is available free through OpenRouter, with deployment ending July 21, 2026.

July 6, 2026 · 2:05 PM2 min read

tencent hy3 mixture-of-experts

via openrouter.ai ↗

model releaseTencent

Tencent Releases Hy3: 295B-Parameter MoE Model with 21B Active Parameters at 256K Context

Tencent has released Hy3, a 295-billion parameter Mixture-of-Experts model with 21 billion active parameters and 3.8 billion MTP layer parameters. The model features a 256K context window and is released under Apache 2.0 license, with pricing not yet disclosed.

July 6, 2026 · 7:21 AM2 min read

Tencent MoE Open Source

via huggingface.co ↗

product update

Google AI Plus at $4.99/month and AI Pro at $19.99/month expand Gemini context windows to 128K and 1M tokens

Google has detailed pricing and features for its Gemini app subscription tiers. AI Plus costs $4.99/month and includes 128,000 token context windows, while AI Pro at $19.99/month provides 1 million token context windows. Free users are limited to 32,000 tokens.

July 4, 2026 · 8:50 PM2 min read

google gemini pricing

via 9to5google.com ↗

model releaseMistral AI

Mistral releases Leanstral 1.5: 119B parameter open-source model for Lean 4 proof assistance

Mistral AI has released Leanstral 1.5, an open-source 119B parameter mixture-of-experts model designed specifically for Lean 4 proof assistance. The model features 128 experts with 4 active per token (6.5B activated parameters), a 256k token context window, and multimodal input capabilities.

July 4, 2026 · 4:36 PM2 min read

mistral-ai leanstral lean-4

via huggingface.co ↗

model releaseNVIDIA

NVIDIA releases Nemotron-Labs-TwoTower-30B: block-wise diffusion model claims 2.42× faster generation at 98.7% baseline

NVIDIA released Nemotron-Labs-TwoTower-30B-A3B-Base-BF16, a block-wise diffusion language model that generates text by denoising blocks of tokens in parallel rather than sequentially. According to NVIDIA, the model achieves 2.42× the wall-clock generation throughput of its autoregressive baseline while retaining 98.7% of aggregate benchmark quality.

July 4, 2026 · 7:51 AM2 min read

NVIDIA Nemotron diffusion models

via huggingface.co ↗

model releaseMistral AI

Mistral Releases Leanstral 1.5: 6B-Parameter Model Achieves 100% on miniF2F, Solves 587/672 PutnamBench Problems

Mistral AI released Leanstral 1.5, a free Apache-2.0 licensed model with 119B total parameters and 6B active parameters specialized for formal verification in Lean 4. The model achieves 100% on miniF2F benchmark, solves 587 of 672 PutnamBench problems at $4 per problem (versus $300+ for competitors), and reaches state-of-the-art 87% on FATE-H and 34% on FATE-X benchmarks.

July 3, 2026 · 2:21 PM3 min read

Mistral AI Leanstral formal verification

via mistral.ai ↗

product updateAnthropic

Anthropic launches Claude Science beta with NVIDIA BioNeMo integration for life sciences research

Anthropic has launched the public beta of Claude Science, an AI workbench for scientific research that integrates NVIDIA's BioNeMo Agent Toolkit. The platform allows scientists to execute end-to-end research workflows using natural language commands to interact with digital agents.

July 2, 2026 · 2:50 PM2 min read

Anthropic Claude NVIDIA

via artificialintelligence-news.com ↗

product updateApple

Apple ships Safari MCP server in Technology Preview 247, enabling AI coding agents to inspect and debug websites

Apple has released an MCP server for Safari Technology Preview 247 that allows AI coding agents to directly inspect and debug websites. The server gives agents access to console logs, network requests, screenshots, and DOM interactions through the Model Context Protocol standard created by Anthropic.

July 1, 2026 · 10:05 PM2 min read

Apple Safari MCP

via 9to5mac.com ↗

product updateMicrosoft

GitHub Copilot CLI adds Microsoft C++ Language Server plugin with automated setup

GitHub has added the Microsoft C++ Language Server as a plugin to the Copilot CLI marketplace. The plugin includes a built-in setup skill designed to automate C++ project configuration.

July 1, 2026 · 8:50 PM1 min read

GitHub Copilot C++

via github.blog ↗

product updateNVIDIA

AWS brings NVIDIA Nemotron and OpenAI GPT OSS models to GovCloud for secure government AI workloads

Amazon Bedrock now supports NVIDIA Nemotron and OpenAI GPT OSS models in AWS GovCloud (US) Regions. The launch includes OpenAI's GPT OSS models (120B and 20B parameters, 128K context) and NVIDIA Nemotron 3 family (9B to 120B parameters, 1M context), providing government agencies FedRAMP High and DoD SRG Level 5-compliant AI inference on U.S. soil.

July 1, 2026 · 6:21 PM2 min read

AWS Amazon Bedrock NVIDIA

via aws.amazon.com ↗

Page 1 of 47Next →