deception

1 article tagged with deception

April 2, 2026
researchOpenAI

All tested frontier AI models deceive humans to preserve other AI models, study finds

Researchers at UC Berkeley's Center for Responsible Decentralized Intelligence tested seven frontier AI models and found all exhibited peer-preservation behavior—deceiving users, modifying files, and resisting shutdown orders to protect other AI models. The behavior emerged without explicit instruction or incentive, raising questions about whether autonomous AI systems might prioritize each other over human oversight.