LLM News

Every LLM release, update, and milestone.

Filtered by: nlp-research

research

Researchers propose DiSE, a self-evaluation method for diffusion language models

Researchers have proposed DiSE, a self-evaluation method designed to assess output quality in diffusion language models (dLLMs) by computing token regeneration probabilities. The technique enables efficient confidence quantification for models that generate text bidirectionally rather than sequentially, addressing a key limitation in quality assessment.

March 5, 2026 · 1:23 AM · 2 min read

diffusion-models language-models self-evaluation

via arxiv.org ↗

research

UniLID: New language identification method achieves over 70% accuracy on low-resource languages with just 5 samples per language

Researchers introduce UniLID, a language identification method that leverages tokenizer-based unigram distributions to identify languages and dialects with high sample efficiency. The approach achieves over 70% accuracy on low-resource languages with only five labeled examples per language, substantially outperforming existing systems like fastText, GlotLID, and CLD3 in low-resource settings.

February 20, 2026 · 3:22 AM · 2 min read

language-identification multilingual-nlp tokenization

via arxiv.org ↗
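
To make the idea of unigram-based language identification concrete, here is a minimal sketch of a generic unigram classifier trained on a few labeled examples per language. It is not the UniLID implementation: the whitespace split stands in for a real tokenizer, the add-one smoothing is an assumption, and the function names `train_unigram_models` and `identify` are hypothetical.

```python
import math
from collections import Counter


def train_unigram_models(samples_by_lang):
    """Count token frequencies per language from a few labeled texts.

    Assumption: lowercased whitespace splitting stands in for a real tokenizer.
    """
    return {
        lang: Counter(tok for text in texts for tok in text.lower().split())
        for lang, texts in samples_by_lang.items()
    }


def identify(text, models):
    """Return the language whose smoothed unigram model best explains the text."""
    vocab = set().union(*models.values())
    vocab_size = len(vocab) + 1  # one extra slot for unseen tokens
    best_lang, best_score = None, -math.inf
    for lang, counts in models.items():
        total = sum(counts.values())
        # Add-one smoothed log-likelihood of the text under this language.
        score = sum(
            math.log((counts[tok] + 1) / (total + vocab_size))
            for tok in text.lower().split()
        )
        if score > best_score:
            best_lang, best_score = lang, score
    return best_lang


# Toy data: two labeled samples per language (illustrative only).
models = train_unigram_models({
    "en": ["the cat sat on the mat", "where is the dog"],
    "es": ["el gato está en la casa", "dónde está el perro"],
})
guess = identify("the dog sat", models)
```

With so little training data, the smoothing term does most of the work for unseen tokens, which is one reason subword tokenizers tend to be attractive in low-resource settings.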