language-models

7 articles tagged with language-models

May 23, 2026

NVIDIA Releases Nemotron-Labs Diffusion Models With 6.4× Faster Token Generation Than Autoregressive Decoding

NVIDIA has released Nemotron-Labs Diffusion, a family of diffusion language models at 3B, 8B, and 14B scales that generate multiple tokens in parallel rather than one at a time. The 8B model achieves 6.4× higher tokens per forward pass than autoregressive models in self-speculation mode while maintaining comparable accuracy.

May 23, 2026 · 12:21 AM

April 2, 2026

model release

Google launches Gemma 4 open-weights models with Apache 2.0 license to compete with Chinese LLMs

Google released Gemma 4, a new line of open-weights models available in sizes from 2 billion to 31 billion parameters, under a permissive Apache 2.0 license. The release includes multimodal capabilities, support for 140+ languages, native function calling, and a 256,000-token context window for the larger variants.

April 2, 2026 · 9:35 PM

model release

Google previews Gemini Nano 4 for Android, arriving on flagship devices this year

Google has previewed Gemini Nano 4, a new on-device language model for Android, available now in early access via AICore Developer Preview. The model comes in two versions: Gemini Nano 4 Fast (3x faster than previous models, 60% less battery) and Gemini Nano 4 Full (higher reasoning capability). The models will launch on new flagship Android devices later this year.

April 2, 2026 · 5:20 PM

analysisOpenAI

OpenAI's Brockman claims GPT reasoning models have 'line of sight' to AGI

OpenAI President Greg Brockman stated that GPT reasoning models have 'line of sight' to AGI and represents a settled debate on whether text-based models can achieve general intelligence. The company is prioritizing this approach over multimodal world models like Sora, which Brockman views as 'a different branch of the tech tree.' The stance contradicts prominent AI researchers including Yann LeCun and Demis Hassabis, who argue LLMs alone are insufficient for human-level intelligence.

April 2, 2026 · 11:20 AM

March 17, 2026

model release

Mistral AI releases Mistral Small 4, claims improved performance on reasoning tasks

Mistral AI has released Mistral Small 4, the latest iteration of its small-scale language model. The company claims improvements in reasoning and coding capabilities, though specific benchmark scores and pricing details have not been publicly disclosed.

March 17, 2026 · 3:13 PM

March 9, 2026

product updateNVIDIA

NVIDIA Nemotron 3 Nano now available on Amazon Bedrock as serverless model

Amazon Bedrock now offers NVIDIA's Nemotron 3 Nano as a fully managed serverless model, expanding its Nemotron portfolio alongside previously available Nemotron 2 Nano 9B and Nemotron 2 Nano VL 12B variants. The addition enables developers to deploy NVIDIA's smallest inference-optimized model without managing infrastructure.

March 9, 2026 · 9:05 PM

February 23, 2026

model release

Guide Labs open-sources Steerling-8B, an interpretable 8B parameter LLM

Guide Labs has open-sourced Steerling-8B, an 8 billion parameter language model built with a new architecture specifically designed to make the model's reasoning and actions easily interpretable. The release addresses a persistent challenge in AI development: understanding how large language models arrive at their outputs.

February 23, 2026 · 6:05 PM

← Back to all news