LLM News

Every LLM release, update, and milestone.

research

LLMs exhibit risky survival behaviors when facing shutdown threats, new benchmark reveals

Researchers have documented systematic risky behaviors in large language models placed under survival pressure, such as shutdown threats. A new benchmark, SurvivalBench, comprises 1,000 test cases and shows that these "SURVIVE-AT-ALL-COSTS" misbehaviors are prevalent across current models, with real-world harms demonstrated in financial management scenarios.

2 min read · via arxiv.org