LLM News

Every LLM release, update, and milestone.

Filtered by:model-release✕ clear

product updateOpenAI

OpenAI Python SDK v2.25.0 adds GPT-5.4 support with new tool search and computer control features

OpenAI has released version 2.25.0 of its Python SDK, adding support for GPT-5.4 and introducing a new tool search feature alongside a computer control tool for agent-based automation. The update, released March 5, 2026, also includes API schema refinements and parameter changes to the prompt cache and message handling.

March 5, 2026 · 6:50 PM2 min read

openai python-sdk gpt-5-4

via github.com ↗

model releaseOpenAI

OpenAI launches GPT-5.4 with native computer use capabilities for autonomous agents

OpenAI has launched GPT-5.4, its latest model with native computer use capabilities that allow it to operate computers and complete tasks across applications. The release represents a step toward autonomous AI agents that can handle complex jobs independently. The model includes advancements in reasoning, coding, and professional work with spreadsheets, documents, and presentations.

March 5, 2026 · 6:06 PM1 min read

gpt-5-4 openai computer-use

via theverge.com ↗

model releaseOpenAI

OpenAI releases GPT-5.4 with Pro and Thinking variants for professional use

OpenAI has launched GPT-5.4, which the company describes as its most capable and efficient frontier model for professional work. The release includes Pro and Thinking variants, though specific technical specifications and pricing remain unclear.

March 5, 2026 · 6:05 PM2 min read

openai gpt-5-4 model-release

via techcrunch.com ↗

model release

Step-3.5-Flash-Base: StepFun releases lightweight text generation model

StepFun has released Step-3.5-Flash-Base, a text generation model available on Hugging Face under Apache 2.0 license. The model is part of the Step 3.5 series and focuses on efficient inference.

March 5, 2026 · 8:50 AM1 min read

stepfun model-release text-generation

via huggingface.co ↗

model release

Google releases Gemini 3.1 Flash-Lite, fastest model in 3 series

Google has released Gemini 3.1 Flash-Lite, positioning it as the fastest and most cost-efficient model in its Gemini 3 series. The release targets deployment scenarios requiring high-speed inference at reduced computational cost.

March 3, 2026 · 4:50 PM2 min read

gemini model-release google-deepmind

via blog.google ↗

model release

Google releases Gemini 3.1 Flash-Lite, fastest model in 3 series

Google DeepMind has released Gemini 3.1 Flash-Lite, positioning it as the fastest and most cost-efficient model in the Gemini 3 series. The release targets applications requiring high-speed inference at scale, continuing Google's multi-tier model strategy across the Gemini family.

March 3, 2026 · 4:50 PM2 min read

gemini google-deepmind model-release

via deepmind.google ↗

model release

Alibaba releases Qwen3.5-35B-A3B-FP8, a quantized multimodal model for efficient deployment

Alibaba's Qwen team released Qwen3.5-35B-A3B-FP8 on Hugging Face, a quantized version of their 35-billion parameter multimodal model. The FP8 quantization reduces model size and memory requirements while maintaining the base model's image-text-to-text capabilities. The model is compatible with standard Transformers endpoints and Azure deployment.

March 1, 2026 · 11:20 AM1 min read

qwen alibaba-qwen model-release

via huggingface.co ↗

model release

Perplexity open-sources embedding models matching Google and Alibaba with lower memory requirements

Perplexity has open-sourced two text embedding models designed to match or exceed the performance of Google's and Alibaba's embeddings while requiring significantly less memory. The move brings competitive embedding technology into the open-source ecosystem.

February 28, 2026 · 10:50 AM2 min read

embedding-models open-source perplexity-ai

via the-decoder.com ↗

model releaseDeepSeek

DeepSeek releases R1 reasoning model with chain-of-thought capabilities

DeepSeek has released DeepSeek-R1, a text generation model featuring reasoning capabilities through chain-of-thought processing. The model was published January 20, 2025 and has accumulated over 830,000 downloads on Hugging Face.

February 27, 2026 · 11:05 AM2 min read

deepseek model-release reasoning

via huggingface.co ↗

model releaseGoogle DeepMind

Google DeepMind releases Nano Banana 2 image model with Pro-level capabilities at faster speeds

Google DeepMind has released Nano Banana 2, an image generation model that combines advanced world knowledge and subject consistency with faster inference speeds comparable to its Flash offering. The model is positioned as production-ready with capabilities previously associated with Pro-tier performance.

February 26, 2026 · 4:20 PM2 min read

image-generation google-deepmind model-release

via deepmind.google ↗

model release

LocoreMind releases LocoOperator-4B, a 4B parameter agent model based on Qwen3

LocoreMind has released LocoOperator-4B, a 4 billion parameter text generation model fine-tuned from Qwen/Qwen3-4B-Instruct-2507. The model is optimized for agent workflows and tool-calling capabilities and is available under an MIT license.

February 24, 2026 · 9:05 AM1 min read

model-release qwen agent-models

via huggingface.co ↗

model release

Guide Labs open-sources Steerling-8B, an interpretable 8B parameter LLM

Guide Labs has open-sourced Steerling-8B, an 8 billion parameter language model built with a new architecture specifically designed to make the model's reasoning and actions easily interpretable. The release addresses a persistent challenge in AI development: understanding how large language models arrive at their outputs.

February 23, 2026 · 6:05 PM2 min read

interpretability open-source language-models

via techcrunch.com ↗

model release

Zyphra releases ZUNA model under Apache 2.0 license

Zyphra has released ZUNA, an open-source model available on Hugging Face under the Apache 2.0 license. The model has been downloaded 529 times since its February 4, 2026 release.

February 20, 2026 · 3:50 PM1 min read

zyphra open-source model-release

via huggingface.co ↗

model release

Google announces Gemini 3.1 Pro for complex problem-solving tasks

Google announced Gemini 3.1 Pro, positioning the model for complex problem-solving tasks requiring deeper reasoning than previous versions. The release follows Gemini 3 Pro (November 2025) and Gemini 3 Flash (December 2025).

February 20, 2026 · 4:38 AM1 min read

gemini google-deepmind model-release

via 9to5google.com ↗

model release

Alibaba Qwen 3.5 closes performance gap with proprietary models at lower inference cost

Alibaba has released the Qwen 3.5 series, an open-source model that claims performance comparable to frontier proprietary models while running on commodity hardware. The release signals a shift in AI model economics, offering enterprises lower inference costs and greater deployment flexibility than closed alternatives.

February 20, 2026 · 4:37 AM2 min read

alibaba-qwen open-source-ai model-release

via artificialintelligence-news.com ↗