LLM News

Every LLM release, update, and milestone.

research · ByteDance

ByteDance study: reasoning models know when to stop, but sampling methods force continued thinking

A new ByteDance study reveals that large reasoning models actually know when they've reached the correct answer, but common sampling methods prevent them from stopping. The models engage in unnecessary cross-checking and reformulation despite having already solved the problem correctly.

product update · Anthropic

Anthropic launches Claude Code Remote Control for device automation

Anthropic has released Claude Code Remote Control, a new feature that lets users initiate remote control sessions on their computers and manage them via Claude Code on the web, iOS, and native apps. The feature is in its early stages, with reported stability issues including API 500 errors and frequent permission-approval prompts.

2 min read · via simonwillison.net
product update

Adobe Firefly adds Quick Cut feature to auto-generate video drafts from raw footage

Adobe has added Quick Cut to Firefly, an AI-powered feature that automatically generates first-draft videos from raw footage based on user instructions. The tool is designed to reduce manual editing time by processing footage and applying cuts, transitions, and a basic structure without requiring frame-by-frame work.

2 min read · via techcrunch.com
research · Apple

Apple research identifies 'text-speech understanding gap' limiting LLM speech performance

Apple researchers have identified a fundamental limitation in speech-adapted large language models: they consistently underperform their text-based counterparts on language understanding tasks. The team terms this the 'text-speech understanding gap' and documents that speech-adapted LLMs lag behind both their original text versions and cascaded speech-to-text pipelines.

benchmark · OpenAI

OpenAI says SWE-bench Verified is broken—most tasks reject correct solutions

OpenAI is calling for the retirement of SWE-bench Verified, the widely used AI coding benchmark, claiming that most tasks are flawed enough to reject correct solutions. The company also argues that leading AI models have likely seen the answers during training, meaning benchmark scores measure memorization rather than genuine coding ability.

2 min read · via the-decoder.com