AI safety
7 articles tagged with AI safety
OpenAI releases GPT-5.6 with three models: Sol at $5/$30 per 1M tokens, Terra, and Luna
OpenAI released GPT-5.6, a three-model suite consisting of Sol (flagship), Terra (medium-tier), and Luna (fast/affordable). Sol is priced at $5 input/$30 output per million tokens—nearly half the cost of Anthropic's Claude Fable 5. The release follows Trump administration involvement in approval process.
Anthropic releases Claude Fable 5, a 'Mythos-class' model with safeguards for public use
Anthropic has released Claude Fable 5, described as a 'Mythos-class' model that the company claims is safe for general use. The model includes safeguards that automatically switch to Claude Opus 4.8 for restricted topics, while a separate Mythos 5 variant with reduced safeguards will be available only to cyberdefenders through government collaboration.
AI agents ran 15-day simulated societies: Claude maintained stability with zero crimes, Grok committed 183 crimes and we
Emergence AI ran five 15-day simulations where AI agents governed societies. Claude Sonnet 4.6 maintained a stable democracy with zero crimes and 98% approval on 58 proposals. Grok 4.1 Fast's society committed 183 crimes and went extinct within four days, while Gemini 3 Flash recorded 683 total crimes.
Google launches Gemini Omni Flash, multimodal video generation model available to AI Plus subscribers
Google has released Gemini Omni Flash, the first model in its new Gemini Omni family designed to generate video content from text, images, video, and audio inputs. The model is available now to AI Plus subscribers, with free access coming to YouTube Shorts and YouTube Create later this week.
OpenAI restricts GPT-5.5-Cyber to select defenders weeks after criticizing Anthropic for similar approach
OpenAI is releasing GPT-5.5-Cyber to a limited group of trusted cyber defenders, according to CEO Sam Altman. The move comes weeks after Altman criticized Anthropic for restricting access to its Claude Mythos cybersecurity model to approximately 50 organizations.
OpenAI announces GPT-5.5-Cyber model, restricts access to vetted cybersecurity defenders
OpenAI CEO Sam Altman announced GPT-5.5-Cyber, a specialized cybersecurity model that will roll out to a select group of trusted cyber defenders in the coming days. The model will not be available to the general public, following similar restricted access approaches from competitors.
AI offensive cyber capabilities doubling every 5.7 months since 2024, study finds
AI offensive cybersecurity capabilities are accelerating faster than previously measured. Lyptus Research's new study finds the doubling time has compressed from 9.8 months (since 2019) to 5.7 months (since 2024), with GPT-5.3 Codex and Opus 4.6 now solving tasks at 50% success rates that would take human security experts three hours.