LLM News

Every LLM release, update, and milestone.

research

Stable-LoRA addresses feature learning instability in low-rank adaptation fine-tuning

Researchers have identified a fundamental instability in Low-Rank Adaptation (LoRA), the widely used parameter-efficient fine-tuning method, and proposed Stable-LoRA as a solution. The new approach uses dynamic weight shrinkage to maintain stable feature learning during training while preserving LoRA's efficiency benefits.
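A hypothetical sketch of the "dynamic weight shrinkage" idea (the norm-based rule and all names below are assumptions, not the paper's exact formulation): LoRA's effective update is delta_W = B @ A, and after each optimizer step the two factors are rescaled so the update's scale, and hence feature learning, stays bounded.

```python
import numpy as np

rng = np.random.default_rng(0)
d, r = 16, 4                       # model dim, LoRA rank
A = rng.normal(0, 0.2, (r, d))     # down-projection factor
B = np.zeros((d, r))               # up-projection factor (standard zero init)

def shrink(A, B, target_norm=1.0):
    """Rescale both factors so that ||B @ A||_F never exceeds target_norm."""
    n = np.linalg.norm(B @ A)
    if n > target_norm:
        s = (target_norm / n) ** 0.5   # split the shrinkage across the factors
        A, B = A * s, B * s
    return A, B

# One illustrative "training step": the factors grow, then get shrunk back.
B = B + rng.normal(0, 1.0, B.shape)
A, B = shrink(A, B)
print(np.linalg.norm(B @ A) <= 1.0 + 1e-9)   # True
```

Shrinking both factors by the same scalar keeps the factorization balanced while bounding the update's Frobenius norm.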

research

Spectral Surgery: Training-Free Method Improves LoRA Adapters Without Retraining

Researchers propose Spectral Surgery, a training-free refinement method that improves Low-Rank Adaptation (LoRA) adapters by decomposing trained weights via SVD and selectively reweighting singular values based on gradient-estimated component sensitivity. The approach achieves consistent gains across Llama-3.1-8B and Qwen3-8B—up to +4.4 points on CommonsenseQA and +2.4 pass@1 on HumanEval—by adjusting only ~1,000 scalar coefficients.
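The decompose-and-reweight step can be sketched in a few lines; the sensitivity scores below are random placeholders standing in for the paper's gradient-based estimates, and the dimensions are illustrative.

```python
import numpy as np

rng = np.random.default_rng(0)
d, r = 16, 4
B = rng.normal(size=(d, r))
A = rng.normal(size=(r, d))
delta_W = B @ A                                  # trained LoRA update

# Decompose the update via SVD.
U, S, Vt = np.linalg.svd(delta_W, full_matrices=False)

# Reweight singular values by per-component sensitivity (placeholder scores).
sensitivity = rng.uniform(size=S.shape)          # stand-in for gradient-estimated scores
gamma = 1.0 + 0.1 * (sensitivity - 0.5)          # small scalar coefficients
delta_W_refined = U @ np.diag(gamma * S) @ Vt    # no retraining: only len(S) scalars change

print(delta_W_refined.shape)                     # (16, 16)
```

At real model scale the number of singular values across adapted layers is on the order of the ~1,000 scalar coefficients the summary mentions.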

research

Crab+: New audio-visual model solves negative transfer problem in multimodal learning

A new audio-visual large language model called Crab+ addresses a critical problem in multimodal learning: negative transfer, where training on multiple tasks simultaneously causes performance degradation on nearly 55% of tasks. The model uses a new dataset of 222K samples and a technique called Interaction-aware LoRA to coordinate different audio-visual tasks, reversing the degradation trend to achieve positive transfer on 88% of tasks.
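The summary does not detail how Interaction-aware LoRA works, so the following is only a schematic guess at the general shape of the idea: per-task LoRA adapters whose updates are mixed by task-interaction weights, so tasks can share or suppress each other's features. All names and the gating rule are assumptions.

```python
import numpy as np

rng = np.random.default_rng(0)
d, r, n_tasks = 16, 4, 3
# One LoRA adapter (B, A) per audio-visual task.
adapters = [(rng.normal(0, 0.02, (d, r)), rng.normal(0, 0.02, (r, d)))
            for _ in range(n_tasks)]

def adapt(x, task_weights):
    """Mix task-specific LoRA updates according to interaction weights."""
    out = np.zeros_like(x)
    for w, (B, A) in zip(task_weights, adapters):
        out = out + w * ((B @ A) @ x)
    return out

x = rng.normal(size=(d,))
# Interaction weights would come from a learned gate in a real system.
y = adapt(x, task_weights=[0.6, 0.3, 0.1])
print(y.shape)   # (16,)
```

Coordinating the mixture rather than training one shared adapter is one plausible way to counteract negative transfer between tasks.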

research

DiaBlo: Diagonal Block Finetuning Matches Full Model Performance With Lower Cost

Researchers propose DiaBlo, a parameter-efficient fine-tuning (PEFT) method that updates only the diagonal blocks of model weight matrices, matching full-model fine-tuning performance while retaining LoRA-level efficiency. The approach avoids low-rank factor matrices entirely and comes with theoretical convergence guarantees.
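A minimal sketch of the diagonal-block idea (the block size and dimensions are illustrative assumptions): only the blocks on the main diagonal of a square weight matrix receive trainable updates, so the parameter count scales with d * b rather than d * d.

```python
import numpy as np

d, b = 16, 4                        # matrix dim, block size (d % b == 0)
rng = np.random.default_rng(0)
W = rng.normal(size=(d, d))         # frozen pretrained weight

# Build an update that is nonzero only on the diagonal blocks.
delta = np.zeros_like(W)
for i in range(0, d, b):
    delta[i:i+b, i:i+b] = rng.normal(0, 0.01, (b, b))   # trainable block

W_new = W + delta

# Trainable parameters: (d // b) blocks of b*b entries vs. d*d for full FT.
print((d // b) * b * b, d * d)      # 64 256
```

Unlike LoRA, there is no pair of low-rank factors to multiply; the update is applied directly to the blocks.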