Breaking

Anthropic charges Claude Code subscribers extra for OpenClaw usage starting today

Anthropic is enforcing separate billing for Claude Code subscribers using third-party tools like OpenClaw, starting April 4, 2026. Subscribers can no longer use their subscription limits for these integrations and must pay through a new pay-as-you-go model. The decision follows OpenClaw creator Peter Steinberger's move to OpenAI.

April 4, 2026

Latest News

model release · Tencent

Tencent releases OmniWeaving, open-source video generation model with reasoning and multi-modal composition

Tencent's Hunyuan team released OmniWeaving on April 3, 2026, an open-source video generation model designed to compete with proprietary systems like Seedance-2.0. The model combines multimodal composition with reasoning-informed generation and supports eight video generation tasks, including text-to-video, image-to-video, video editing, and compositional generation.

model release

PrismML releases 1-bit Bonsai 8B model, claims 14x smaller and 5x more energy efficient than full-precision peers

PrismML, a Caltech-founded startup, has released Bonsai 8B, a 1-bit quantized large language model that the company claims is 14x smaller and 5x more energy efficient than full-precision counterparts while remaining competitive with standard 8B models. The model fits into 1.15GB of memory and uses a novel 1-bit weight representation (binary signs with shared scale factors per weight group) instead of traditional 16-bit or 32-bit precision.

model release · Google DeepMind

NVIDIA releases Gemma 4 31B quantized model with 256K context, multimodal capabilities

NVIDIA has released a quantized version of Google DeepMind's Gemma 4 31B IT model, compressed to NVFP4 format for efficient inference on consumer GPUs. The 30.7B-parameter multimodal model supports a 256K-token context window, accepts text and image inputs (including video frames), and maintains near-baseline performance across reasoning and coding benchmarks.

2 min read · via huggingface.co

model release · Google DeepMind

Google DeepMind releases Gemma 4 with multimodal reasoning and up to 256K context window

Google DeepMind released Gemma 4, a multimodal model family supporting text, images, video, and audio with context windows up to 256K tokens. The release includes four sizes (E2B, E4B, 26B A4B, and 31B) designed for deployment from mobile devices to servers. The 31B dense model achieves 85.2% on MMLU Pro and 89.2% on AIME 2026.

3 min read · via huggingface.co

product update · Anthropic

Anthropic blocks Claude subscriptions from OpenClaw access, requires separate pay-as-you-go billing

Anthropic is effectively blocking Claude subscription access to third-party tools like OpenClaw starting April 4, 2026, at 3 PM ET. Users will need to purchase separate pay-as-you-go usage bundles to continue using OpenClaw with Claude. The move comes as OpenClaw's popularity has strained Anthropic's infrastructure capacity.

2 min read · via theverge.com

model release · DeepSeek

DeepSeek v4 launching on Huawei chips exclusively, signaling China's AI independence progress

DeepSeek v4 is launching in the coming weeks running exclusively on Huawei chips, marking a major milestone in China's effort to reduce dependency on foreign semiconductors. Chinese tech giants including Alibaba, ByteDance, and Tencent have ordered hundreds of thousands of Huawei Ascend 950PR units to deploy the model through their cloud services.

2 min read · via the-decoder.com

analysis

Gemma 4 success hinges on tooling and fine-tuning ease, not benchmark scores

Google's Gemma 4 release marks a shift in open model strategy with Apache 2.0 licensing and competitive benchmarks, but real success depends on factors rarely measured: tooling stability, fine-tuning ease, and ecosystem adoption. The open model landscape is now crowded with alternatives such as Qwen 3.5 and Nemotron 3, a maturation that changes what separates winners from the rest of the field.

product update · Anthropic

Anthropic attributes Claude Code usage drain to peak-hour caps and large context windows

Anthropic has identified two primary causes for Claude Code users hitting usage limits faster than expected: stricter rate limiting during peak hours and sessions with context windows exceeding 1 million tokens. The company also recommends switching from Opus, which consumes limits roughly twice as fast, to Sonnet 4.6.

product update · OpenAI

OpenAI shifts Codex to usage-based pricing, offers $500 credits to enterprise customers

OpenAI is replacing per-seat licensing with usage-based pricing for Codex in ChatGPT Business and Enterprise plans, eliminating upfront license costs. Eligible Business customers can claim up to $500 in promotional credit per workspace. The shift targets enterprises where coding tools typically expand from individual developers to full teams, positioning OpenAI against GitHub Copilot and Cursor.

1 min read · via the-decoder.com

product update · OpenAI

ChatGPT now integrates with Apple CarPlay for hands-free conversation

OpenAI's ChatGPT is now available directly on Apple CarPlay, allowing drivers to conduct full voice conversations with the AI assistant while driving hands-free. The integration requires iOS 26.4, the latest ChatGPT app, and a compatible vehicle. Unlike Siri, ChatGPT cannot access device functions like email, messaging, or Maps, but provides information on complex topics Siri struggles with.

2 min read · via zdnet.com

Latest Models
