language models

3 articles tagged with language models

May 22, 2026
model releaseNVIDIA

NVIDIA releases Nemotron-Labs-Diffusion-14B with tri-mode decoding achieving 3.3x speed-up on GB200

NVIDIA released Nemotron-Labs-Diffusion-14B, a 14-billion parameter language model that supports three decoding modes by switching attention patterns during inference. The model achieves 850 tokens per second on GB200 hardware at concurrency 1, representing a 3.3x speed-up over standard autoregressive decoding and outperforming Qwen3-8B-Eagle3 by 2.2x in self-speculation mode.

April 27, 2026
analysis

Qwen releases three new Qwen3.6 models ranging from 27B to flagship Max Preview

Qwen has released three models in its Qwen3.6 series: a flagship Max Preview model, a 35B parameter A3B variant, and a 27B parameter base model. All three models are now accessible through OpenRouter's API platform.

April 23, 2026
model releaseOpenAI

OpenAI releases GPT-5.5 with faster token efficiency and improved context understanding

OpenAI released GPT-5.5, which completes tasks at the same difficulty level faster than GPT-5.4 while using significantly fewer tokens. The model is available to ChatGPT Plus, Pro, Business, and Enterprise users, with API access delayed due to required safeguards.