LLM News

Every LLM release, update, and milestone.

Filtered by: LLM-training
research

DiaBlo: Diagonal Block Fine-Tuning Matches Full Model Performance With Lower Cost

Researchers introduce DiaBlo, a parameter-efficient fine-tuning method that updates only the diagonal blocks of model weight matrices instead of the full parameter set. The approach matches full-model fine-tuning performance across reasoning, code generation, and safety tasks while keeping memory usage and training speed comparable to LoRA's.
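To make the core idea concrete, here is a minimal sketch (not DiaBlo's actual implementation) of what "updating only diagonal blocks" means: partition a square weight matrix into a grid of blocks and mark only the blocks on the diagonal as trainable. The function name and block layout below are illustrative assumptions.

```python
import numpy as np

def diagonal_block_mask(dim: int, num_blocks: int) -> np.ndarray:
    """Boolean mask selecting the diagonal blocks of a (dim x dim) weight matrix.

    Illustrative sketch: split the matrix into a num_blocks x num_blocks grid
    and mark only the on-diagonal blocks as trainable; off-diagonal blocks
    stay frozen.
    """
    assert dim % num_blocks == 0, "dim must divide evenly into blocks"
    b = dim // num_blocks  # side length of each block
    mask = np.zeros((dim, dim), dtype=bool)
    for i in range(num_blocks):
        mask[i * b:(i + 1) * b, i * b:(i + 1) * b] = True
    return mask

# A 1024x1024 weight split into an 8x8 grid of blocks:
# only the 8 diagonal blocks (1/8 of all entries) would receive gradient updates.
mask = diagonal_block_mask(1024, 8)
trainable_fraction = mask.mean()  # 1/8 of the parameters
```

With `num_blocks` blocks, the trainable fraction of each matrix is `1/num_blocks`, which is where the parameter savings come from.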

research

New RL framework CORE helps LLMs bridge gap between solving math problems and understanding concepts

Researchers have identified a critical gap in how large language models learn mathematics: they can solve problems but often don't understand the underlying concepts. A new reinforcement learning framework called CORE addresses this by using explicit concept definitions as training signals, rather than just reinforcing correct final answers.
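The paper's exact reward design is not reproduced here, but the general idea of scoring a stated concept definition alongside the final answer can be sketched as follows. The function, its weighting, and the token-overlap score are all hypothetical simplifications, not CORE's actual training signal.

```python
def combined_reward(answer_correct: bool, concept_def: str,
                    reference_def: str, weight: float = 0.5) -> float:
    """Hypothetical RL reward blending two signals:

    1. whether the final answer is correct, and
    2. how well the model's stated concept definition overlaps a reference
       definition (a crude token-overlap proxy, for illustration only).
    """
    pred_tokens = set(concept_def.lower().split())
    ref_tokens = set(reference_def.lower().split())
    # Fraction of reference-definition tokens the model's definition covers.
    concept_score = len(pred_tokens & ref_tokens) / max(len(ref_tokens), 1)
    return (1 - weight) * float(answer_correct) + weight * concept_score

# A correct answer with an imperfect definition still earns partial reward,
# unlike answer-only reinforcement, which would score both cases identically.
r = combined_reward(True, "slope of tangent line", "slope of the tangent line")
```

The point of such a blended signal is that two policies with identical answer accuracy can be distinguished by how well they articulate the underlying concept.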