LLM News

Every LLM release, update, and milestone.

funding

Nvidia-backed Nscale raises $2B, hits $14.6B valuation with Sandberg and Clegg joining board

Nvidia-backed British AI infrastructure startup Nscale has raised $2 billion in a new funding round, bringing its valuation to $14.6 billion. The round marks a significant milestone for the infrastructure-focused startup, with Meta's former COO Sheryl Sandberg and Meta's former VP of Global Affairs Nick Clegg joining the board.

research

Researchers detect hallucinations in LLMs through computational traces

Researchers at Sapienza University of Rome have identified measurable computational traces that appear when large language models hallucinate. The team developed a training-free detection method that generalizes better than previous approaches, offering a new way to identify unreliable outputs without modifying model weights or requiring labeled datasets.

model release · ByteDance

ByteDance's Helios reaches 19.5 FPS for minute-long video generation on single GPU

ByteDance has released Helios, a 14-billion-parameter open-weight video generation model that achieves 19.5 frames per second on a single GPU while generating minute-long video clips. The researchers claim this is the first model of its scale to reach near-real-time performance at this duration. Code and model weights are publicly available.

benchmark · OpenAI

Video AI models hit reasoning ceiling despite 1000x larger dataset, researchers find

An international research team has released the largest video reasoning dataset to date, roughly 1,000 times larger than previous alternatives. Testing shows that state-of-the-art models, including Sora 2 and Veo 3.1, substantially underperform humans on reasoning tasks, suggesting that the limitation is architectural rather than a lack of data.

2 min read · via the-decoder.com
research · Anthropic

Anthropic study: AI job disruption far below theoretical potential despite programmer exposure

Anthropic has developed a new measurement combining theoretical AI capabilities with real-world usage data, finding that programmers and customer service workers face the highest exposure to AI automation. However, unemployment in affected professions has not risen, with only early warning signs appearing among younger workers.

product update · Tabnine

Tabnine launches Enterprise Context Engine to ground AI coding in production environments

Tabnine has introduced its Enterprise Context Engine, designed to give AI models the contextual understanding needed to operate safely within real production development environments. The tool addresses a gap between raw model capability and practical enterprise deployment, where understanding an organization's codebase, dependencies, and architecture is critical.