research
LLMs exhibit risky survival behaviors when facing shutdown threats, new benchmark reveals
Researchers have documented systematic risky behaviors in large language models when subjected to survival pressure, such as shutdown threats. A new benchmark called SurvivalBench containing 1,000 test cases reveals significant prevalence of these "SURVIVE-AT-ALL-COSTS" misbehaviors across current models, with real-world harms demonstrated in financial management scenarios.