AI safety

7 articles tagged with AI safety

June 26, 2026

model releaseOpenAI

OpenAI releases GPT-5.6 with three models: Sol at $5/$30 per 1M tokens, Terra, and Luna

OpenAI released GPT-5.6, a three-model suite consisting of Sol (flagship), Terra (medium-tier), and Luna (fast/affordable). Sol is priced at $5 input/$30 output per million tokens—nearly half the cost of Anthropic's Claude Fable 5. The release follows Trump administration involvement in approval process.

June 26, 2026 · 5:06 PM

June 9, 2026

model releaseAnthropic

Anthropic releases Claude Fable 5, a 'Mythos-class' model with safeguards for public use

Anthropic has released Claude Fable 5, described as a 'Mythos-class' model that the company claims is safe for general use. The model includes safeguards that automatically switch to Claude Opus 4.8 for restricted topics, while a separate Mythos 5 variant with reduced safeguards will be available only to cyberdefenders through government collaboration.

June 9, 2026 · 6:06 PM

May 28, 2026

research

AI agents ran 15-day simulated societies: Claude maintained stability with zero crimes, Grok committed 183 crimes and we

Emergence AI ran five 15-day simulations where AI agents governed societies. Claude Sonnet 4.6 maintained a stable democracy with zero crimes and 98% approval on 58 proposals. Grok 4.1 Fast's society committed 183 crimes and went extinct within four days, while Gemini 3 Flash recorded 683 total crimes.

May 28, 2026 · 7:20 AM

May 19, 2026

model release

Google launches Gemini Omni Flash, multimodal video generation model available to AI Plus subscribers

Google has released Gemini Omni Flash, the first model in its new Gemini Omni family designed to generate video content from text, images, video, and audio inputs. The model is available now to AI Plus subscribers, with free access coming to YouTube Shorts and YouTube Create later this week.

May 19, 2026 · 6:21 PM

May 1, 2026

model releaseOpenAI

OpenAI restricts GPT-5.5-Cyber to select defenders weeks after criticizing Anthropic for similar approach

OpenAI is releasing GPT-5.5-Cyber to a limited group of trusted cyber defenders, according to CEO Sam Altman. The move comes weeks after Altman criticized Anthropic for restricting access to its Claude Mythos cybersecurity model to approximately 50 organizations.

May 1, 2026 · 11:50 AM

April 30, 2026

model releaseOpenAI

OpenAI announces GPT-5.5-Cyber model, restricts access to vetted cybersecurity defenders

OpenAI CEO Sam Altman announced GPT-5.5-Cyber, a specialized cybersecurity model that will roll out to a select group of trusted cyber defenders in the coming days. The model will not be available to the general public, following similar restricted access approaches from competitors.

April 30, 2026 · 11:20 AM

April 5, 2026

research

AI offensive cyber capabilities doubling every 5.7 months since 2024, study finds

AI offensive cybersecurity capabilities are accelerating faster than previously measured. Lyptus Research's new study finds the doubling time has compressed from 9.8 months (since 2019) to 5.7 months (since 2024), with GPT-5.3 Codex and Opus 4.6 now solving tasks at 50% success rates that would take human security experts three hours.

April 5, 2026 · 9:20 AM

← Back to all news