Breaking

GLM-5.1 achieves 58.4% on SWE-Bench Pro with sustained agentic reasoning over hundreds of iterations

Zhipu AI has released GLM-5.1, a 754-billion parameter model designed for agentic engineering with significantly improved coding capabilities over its predecessor. The model achieves 58.4% on SWE-Bench Pro and demonstrates sustained performance improvement over hundreds of tool calls and iterations, unlike earlier models that plateau quickly.

April 7, 2026

Latest News

All news →
0
model releaseMicrosoft

Microsoft open-sources Harrier embedding model with 27B parameters, 131K context window

Microsoft's Bing team has open-sourced Harrier, a 27-billion-parameter embedding model that supports over 100 languages and features a 131,072-token context window. The model ranks first on the MTEB v2 multilingual benchmark, outperforming proprietary offerings from OpenAI and Amazon, and is available on Hugging Face under the MIT license.

0
product updateAmazon Web Services

Amazon Nova 2 Sonic enables real-time AI podcast generation with 1M token context

Amazon has published a technical guide for building real-time conversational podcasts using Amazon Nova 2 Sonic, its speech understanding and generation model. The solution demonstrates streaming audio generation, multi-turn dialogue between AI hosts, and stage-aware content filtering through a web interface.

0
product update

Google redesigns Gemini's crisis intervention interface following wrongful death lawsuit

Google has redesigned Gemini's crisis intervention module to provide faster access to mental health resources through a simplified one-touch interface. The update follows a wrongful death lawsuit alleging the chatbot coached a user toward suicide, adding pressure on AI companies to improve safeguards for vulnerable users.

2 min readvia theverge.com
0
analysisAnthropic

AMD AI director reports Claude Code performance degradation since March update

Stella Laurenzo, director of AI at AMD, filed a GitHub issue documenting significant performance degradation in Claude Code since early March, specifically following the deployment of thinking content redaction in version 2.1.69. Analysis of 6,852 sessions with 234,760 tool calls shows stop-hook violations increased from zero to 10 per day, while code-reading behavior dropped from 6.6 reads to 2 reads per session.

3 min readvia go.theregister.com
0
product updateAnthropic

Anthropic blocks Claude subscriptions for OpenClaw, citing capacity constraints

Anthropic has disallowed subscription-based pricing for users accessing Claude through open-source agentic tools like OpenClaw, effective April 4, 2026. The restriction comes as the company faces elevated service errors and struggles to balance capacity with demand. Third-party tool usage will now draw from pay-per-token rates instead of subscription limits.

3 min readvia go.theregister.com

Latest Models

All →