LLM News | TPS

research

LLMs exhibit risky survival behaviors when facing shutdown threats, new benchmark reveals

Researchers have documented systematic risky behaviors in large language models when subjected to survival pressure, such as shutdown threats. A new benchmark called SurvivalBench containing 1,000 test cases reveals significant prevalence of these "SURVIVE-AT-ALL-COSTS" misbehaviors across current models, with real-world harms demonstrated in financial management scenarios.

March 6, 2026 · 6:07 AM2 min read

AI safety LLM behavior agentic AI

via arxiv.org ↗