LLM News

Every LLM release, update, and milestone.

Filtered by:agent-training✕ clear

Go-Browse trains 7B model to beat GPT-4o mini on web navigation tasks

Researchers propose Go-Browse, a method for training web agents through structured exploration that frames data collection as graph search. A 7B parameter language model fine-tuned on 10K trajectories achieves 21.7% success on the WebArena benchmark, outperforming GPT-4o mini by 2.4 percentage points.

March 5, 2026 · 1:25 AM2 min read

web-agents language-models training-data

via arxiv.org ↗

researchAnthropic

Researchers achieve 141% improvement in agent training with just 312 human demonstrations

Researchers at GAIR-NLP have published PC Agent-E, an agent training framework that achieves a 141% relative improvement in computer use tasks starting from only 312 human-annotated trajectories. The method uses Claude 3.7 Sonnet to synthesize alternative action decisions, and the resulting model outperforms Claude 3.7 Sonnet by 10% on WindowsAgentArena-V2.

March 5, 2026 · 1:07 AM2 min read

agent-training computer-use data-synthesis

via arxiv.org ↗