ai-security

6 articles tagged with ai-security

April 1, 2026

Google Deepmind identifies six attack categories that can hijack autonomous AI agents

A Google Deepmind paper introduces the first systematic framework for 'AI agent traps'—attacks that exploit autonomous agents' vulnerabilities to external tools and internet access. The researchers identify six attack categories targeting perception, reasoning, memory, actions, multi-agent networks, and human supervisors, with proof-of-concept demonstrations for each.

March 14, 2026
product updateOpenAI

OpenAI launches Codex Security research preview for AI-powered vulnerability detection

OpenAI has released Codex Security as a research preview, an AI application security agent designed to detect and patch complex code vulnerabilities. The tool analyzes project context to reduce noise and increase confidence in vulnerability detection.

March 12, 2026
product updateOpenAI

OpenAI acquires Promptfoo, an AI security and testing platform

OpenAI is acquiring Promptfoo, an AI security platform that helps enterprises identify and remediate vulnerabilities in AI systems during development. Terms of the acquisition were not disclosed.

March 9, 2026
product updateOpenAI

OpenAI acquires Promptfoo, integrates security testing into Frontier platform

OpenAI is acquiring Promptfoo, an AI security platform, to integrate automated vulnerability testing directly into its Frontier enterprise offering. The acquisition adds jailbreak detection, prompt injection testing, and data leak identification capabilities to OpenAI's enterprise product.

March 7, 2026
researchAnthropic

Claude discovers 100+ Firefox vulnerabilities in security audit

Anthropic's Claude AI has identified over 100 security vulnerabilities in Firefox, including previously undetected bugs that traditional testing methods missed over decades. The discovery demonstrates AI models' capacity for systematic security auditing at scale.

February 21, 2026
product updateAnthropic

Anthropic launches Claude Code Security tool; cybersecurity stocks fall

Anthropic has released Claude Code Security, an AI tool designed to identify code vulnerabilities that traditional security scanners overlook. The announcement prompted an immediate decline in cybersecurity stock valuations.