safety
2 articles tagged with safety
March 24, 2026
product updateAnthropic
Anthropic launches Claude Code 'auto mode' with AI-powered permission classifier
Anthropic has released 'auto mode' for Claude Code, a permissions system that sits between conservative defaults and fully disabled safeguards. The feature uses a classifier to automatically approve safe actions like file writes and bash commands while blocking potentially destructive operations.
March 9, 2026
product updateOpenAI
OpenAI acquires Promptfoo to strengthen AI agent security capabilities
OpenAI has acquired Promptfoo, a platform for testing and evaluating AI agents. The acquisition signals frontier labs' intensifying focus on proving their technology can operate safely in critical business environments.