LLM News

Every LLM release, update, and milestone.

0
model releaseAnthropic

Anthropic's Unreleased Claude Mythos Preview Finds 10,000+ Vulnerabilities in One Month

Anthropic's unreleased Claude Mythos Preview model has discovered more than 10,000 vulnerabilities across partner organizations in its first month of deployment through Project Glasswing. The company reports partners are finding bugs at 10x their previous rate, with Cloudflare discovering 2,000 bugs and Mozilla finding 271 Firefox vulnerabilities — 10x more than with previous Claude models.

2 min readvia engadget.com
0
model releaseTencent

Tencent Releases Hy-MT2 Translation Models: 1.8B, 7B, and 30B-A3B Support 33 Languages

Tencent released Hy-MT2, a family of multilingual translation models available in 1.8B, 7B, and 30B-A3B (MoE) sizes. All models support translation among 33 languages and follow translation instructions in multiple languages. The 1.8B model can be compressed to 440MB using 1.25-bit AngelSlim quantization.

0
researchNVIDIA

NVIDIA Releases Nemotron-Labs Diffusion Models With 6.4× Faster Token Generation Than Autoregressive Decoding

NVIDIA has released Nemotron-Labs Diffusion, a family of diffusion language models at 3B, 8B, and 14B scales that generate multiple tokens in parallel rather than one at a time. The 8B model achieves 6.4× higher tokens per forward pass than autoregressive models in self-speculation mode while maintaining comparable accuracy.

0
product update

Google Gemini Mac app adding 'Spark' AI agent and voice control features in summer 2026

Google announced two major features coming to its Gemini Mac app this summer: the Spark AI agent that can automate desktop workflows and access local files, and an enhanced voice control system. Spark will be available to Google AI Ultra subscribers ($100/month) and can integrate with Workspace apps and third-party services.

2 min readvia 9to5google.com
0
model releaseNVIDIA

NVIDIA releases Nemotron-Labs-Diffusion-14B with tri-mode decoding achieving 3.3x speed-up on GB200

NVIDIA released Nemotron-Labs-Diffusion-14B, a 14-billion parameter language model that supports three decoding modes by switching attention patterns during inference. The model achieves 850 tokens per second on GB200 hardware at concurrency 1, representing a 3.3x speed-up over standard autoregressive decoding and outperforming Qwen3-8B-Eagle3 by 2.2x in self-speculation mode.

0
model releaseTencent

Tencent Releases Hy-MT2: 1.8B Translation Model Compressed to 440MB With 1.25-Bit Quantization

Tencent has open-sourced Hy-MT2, a family of multilingual translation models available in 1.8B, 7B, and 30B-A3B parameter sizes. The models support translation across 33 languages and include extreme quantization down to 1.25-bit, reducing the 1.8B model to 440MB storage while increasing inference speed by 1.5x.

0
product updateAmazon Web Services

Amazon Nova Act Becomes HIPAA Eligible for Healthcare Workflows

Amazon Nova Act, AWS's browser-based AI agent service, now qualifies as HIPAA eligible, allowing healthcare organizations to deploy autonomous agents for workflows involving electronically protected health information. The service automates repetitive browser tasks including claims processing, referral coordination, and prior authorization.

2 min readvia aws.amazon.com
0
product update

Google opens 'Gemini built in' program to third-party speaker manufacturers with turnkey reference designs

Google is expanding its 'Gemini built in' program to include speaker reference designs, allowing third-party manufacturers to build Gemini-powered smart speakers without lengthy development cycles. The program, which previously launched cameras through Walmart's Onn brand, now provides turnkey hardware solutions for both speakers and cameras.

2 min readvia 9to5google.com
0
product updateAmazon Web Services

AWS launches AgentCore Code Interpreter to process documents beyond context window limits using recursive LLM architectu

Amazon Web Services released AgentCore Code Interpreter, a sandboxed Python environment that enables recursive language models to process documents of unlimited length by treating context as an external environment rather than loading it into the model's context window. The system orchestrates sub-LLM calls from within the sandbox, maintaining intermediate results as Python variables across a persistent session.

0
product updateAmazon Web Services

AWS Launches Amazon Bedrock AgentCore for Deploying Production AI Agents

AWS has launched Amazon Bedrock AgentCore, a serverless runtime environment for deploying production AI agents. Turkish fulfillment company OPLOG demonstrated the platform's capabilities by building three business intelligence agents using Anthropic's Claude Sonnet, achieving a 35% reduction in sales cycles and 98% reduction in manual research time.

2 min readvia aws.amazon.com
0
product update

Perplexity upgrades Comet iOS browser with phone number actions, iPad sidebar polish, Finance Deep Dive tabs

Perplexity has released a major update to its Comet AI browser for iOS, adding eight new features including one-tap phone number actions, a redesigned iPad sidebar, and Finance Deep Dive analysis that opens as browser tabs. The update also fixes persistent bugs with recently closed tabs and deleted conversation threads.

2 min readvia 9to5mac.com
Page 1 of 47Next →