LLM News

Every LLM release, update, and milestone.

Filtered by:research✕ clear

Anthropic study: AI job disruption far below theoretical potential despite programmer exposure

Anthropic has developed a new measurement combining theoretical AI capabilities with real-world usage data, finding that programmers and customer service workers face the highest exposure to AI automation. However, unemployment in affected professions has not risen, with only early warning signs appearing among younger workers.

March 6, 2026 · 11:50 AM2 min read

anthropic ai-labor job-displacement

via the-decoder.com ↗

research

Timer-S1: 8.3B time series foundation model achieves state-of-the-art forecasting on GIFT-Eval

Researchers have introduced Timer-S1, a Mixture-of-Experts time series foundation model with 8.3 billion total parameters and 750 million activated parameters per token. The model achieves state-of-the-art forecasting performance on the GIFT-Eval leaderboard, with the best MASE and CRPS scores among pre-trained models.

March 6, 2026 · 6:09 AM2 min read

LLM News

Anthropic study: AI job disruption far below theoretical potential despite programmer exposure

Timer-S1: 8.3B time series foundation model achieves state-of-the-art forecasting on GIFT-Eval

Research: Token-wise KV cache compression cuts memory to 6% while retaining 94% performance

New method uses structural graphs to fix LLM reasoning collapse in multi-step theorem prediction

EvoTool optimizes LLM agent tool-use policies via evolutionary algorithms without gradients

New technique extends LLM context windows to 128K tokens without expensive retraining

1.58-bit BitNet models naturally support structured sparsity with minimal accuracy loss

Researchers propose WIM rating system to replace subjective numerical scores in LLM training

Progressive Residual Warmup improves LLM pretraining stability and convergence speed

Researchers Identify 'Contextual Inertia' Bug in LLMs During Multi-Turn Conversations

BandPO improves LLM reinforcement learning by replacing fixed clipping with probability-aware bounds

Researchers develop controllable full-duplex speech model trainable on 2,000 hours of data

Stable-LoRA addresses feature learning instability in low-rank adaptation fine-tuning

Vevo2 unifies speech and singing voice generation with prosody and style control

RealWonder generates physics-accurate videos in real-time from single images

POET-X reduces LLM training memory by 40%, enables billion-parameter models on single H100

ms-Mamba outperforms Transformer models on time-series forecasting with fewer parameters

Vevo2 unifies speech and singing voice generation with controllable prosody and style

New framework improves VLM spatial reasoning through minimal information selection

FLoC reduces video AI token load by 50%+ without retraining using facility location algorithm

RePo: Research Shows Dynamic Positional Encoding Improves LLM Context Understanding

ButterflyMoE achieves 150× memory reduction for mixture-of-experts models via geometric rotations

FreeAct framework relaxes quantization constraints for multimodal and diffusion LLMs

Meta-Reinforcement Learning Framework MAGE Enables LLM Agents to Adapt and Strategize

New benchmark reveals LLMs struggle with genuine knowledge discovery in biology

Research: Contrastive refinement reduces AI model over-refusal without sacrificing safety

New Method Reduces AI Over-Refusal Without Sacrificing Safety Alignment

Researchers develop inference-time personality sliders for LLMs without retraining

StructLens reveals hidden structural patterns across language model layers

ByteFlow Net removes tokenizers, learns adaptive byte compression for language models

New world model architecture maintains 3D consistency across extended video generation

Pointer-CAD unifies B-Rep and command sequences for LLM-based CAD generation

WebDS benchmark reveals 80% performance gap between AI agents and humans on real-world data science tasks

Researchers introduce RDB-PFN, first relational database foundation model trained entirely on synthetic data

Researchers identify 'Lazy Attention' problem in multimodal AI training, boost reasoning by 7%

Study shows RL training enables LLMs to abstain on unanswerable temporal questions, outperforming GPT-4o

SureLock cuts masked diffusion language model decoding compute by 30-50%

Spectral Surgery: Training-Free Method Improves LoRA Adapters Without Retraining

Study reveals preference leakage bias when LLMs judge synthetically-trained models

OSCAR: New RAG compression method achieves 2-5x speedup with minimal accuracy loss

Researchers identify and fix critical toggle control failure in multimodal GUI agents

Researchers develop data synthesis method to improve multimodal AI reasoning on charts and documents

Knowledge graphs enable smaller models to outperform GPT-5.2 on complex reasoning

Research reveals LLMs internalize logic as geometric flows in representation space

NeuroProlog framework combines neural networks with symbolic reasoning to fix LLM math errors

Researchers expose 'preference leakage' bias in LLM judging systems

Meta's NLLB-200 learns universal language structure, study finds

Diffusion language models memorize less training data than autoregressive models, study finds

CoDAR framework shows continuous diffusion language models can match discrete approaches

New benchmark reveals LLMs lose controllability at finer behavioral levels