Breaking

Google adds Veo 3.1 Lite to Ultra subscriptions at zero credit cost starting May 10

Google is adding Veo 3.1 Lite to Ultra subscriptions at zero credit cost starting May 10, 2026. The model costs less than half of Veo 3.1 Fast but generates videos at the same speed according to Google, though quality tradeoffs remain unclear.

April 13, 2026

Latest News

All news →
0
product update

AWS Lambda enables serverless reward functions for Amazon Nova model customization

AWS has introduced Lambda-based reward functions for Amazon Nova model customization through reinforcement fine-tuning (RFT). The serverless architecture automatically scales from 10 concurrent evaluations per second during experimentation to 400+ during production training, supporting both objective RLVR and subjective RLAIF approaches.

2 min readvia aws.amazon.com
0
analysis

Google Gemma 4 Runs Locally on Edge Devices, Creating Enterprise Security Blind Spot

Google released Gemma 4, an open-weights model family that runs directly on edge devices with multi-step planning and autonomous workflow capabilities. The Apache 2.0 licensed model bypasses traditional cloud security controls by executing entirely on local hardware, creating a governance blind spot for enterprise security teams.

0
model releaseAnthropic

Anthropic launches Mythos AI model claiming zero-day vulnerability discovery capabilities

Anthropic has launched Mythos, an AI model the company claims can identify and exploit zero-day vulnerabilities with significant capability. The model has not been released publicly, with Anthropic citing security concerns. The announcement raises questions about the model's actual capabilities versus pre-IPO positioning.

2 min readvia go.theregister.com
0
model releaseAnthropic

Trump officials encourage banks to test Anthropic's Mythos model for security vulnerabilities

U.S. Treasury Secretary Scott Bessent and Federal Reserve Chair Jerome Powell summoned bank executives this week and encouraged them to test Anthropic's newly announced Mythos model for detecting security vulnerabilities. According to Bloomberg, major banks including Goldman Sachs, Citigroup, Bank of America, and Morgan Stanley are already testing the model alongside JPMorgan Chase, despite Anthropic's stated plan to limit initial access.

2 min readvia techcrunch.com
0
analysis

Enterprise AI gap widens as open-weight models mature into production-ready alternatives

Open-weight models from Google, Alibaba, Microsoft, and Nvidia have crossed a threshold from research projects to enterprise-grade systems. The shift reflects a growing divide: frontier models from OpenAI and Anthropic are too expensive and pose data security risks for most enterprises, while open alternatives now deliver sufficient capability at a fraction of the cost.

0
researchAnthropic

AI agent skills fail in real-world conditions, researchers find testing 34,000 skills

A large-scale study testing 34,198 real-world skills reveals that AI agent performance drops drastically when moving from curated benchmarks to realistic conditions. Claude Opus 4.6 saw pass rates fall from 55.4% with hand-selected skills to 38.4% in truly realistic scenarios, while weaker models like Kimi K2.5 actually perform below their no-skill baseline.

3 min readvia the-decoder.com
0
model releaseArcee Ai

Arcee AI releases Trinity-Large-Thinking, open reasoning model matching Claude Opus on agent tasks

Arcee AI has released Trinity-Large-Thinking, a 400-billion-parameter open-weight reasoning model with a mixture-of-experts architecture that activates only 13 billion parameters per token. The model matches Claude Opus 4.6 on agent benchmarks like Tau2 and PinchBench but lags on general reasoning tasks. The company spent approximately $20 million—roughly half its total venture capital—to train the model on 2,048 Nvidia B300 GPUs over 33 days.

3 min readvia the-decoder.com
0
model release

MiniMax releases M2.7, a 229B parameter model with self-evolving capabilities and agent teams

MiniMax has released MiniMax-M2.7, a 229-billion parameter model that uniquely participates in its own evolution during development. The model achieves 66.6% medal rate on MLE Bench Lite and 56.22% on SWE-Pro benchmarks, with native support for multi-agent collaboration and complex tool orchestration.

Latest Models

All →