infrastructure
13 articles tagged with infrastructure
Anthropic Doubles Claude Code Rate Limits, Secures 300+ MW Compute from SpaceX's Colossus 1
Anthropic has secured access to all compute capacity at SpaceX's Colossus 1 data center, adding more than 300 megawatts of new capacity within the month. As a result, the company is doubling five-hour rate limits for paid Claude Code users and removing peak hour restrictions for Pro and Max tiers.
Anthropic doubles Claude Code usage limits for paid users, increases API capacity by up to 1500%
Anthropic has doubled Claude Code's five-hour usage limits for Pro, Max, Team, and Enterprise users while removing peak hour restrictions for Pro and Max plans. The company also increased API limits by up to 1500% for input tokens per minute through a compute capacity deal with SpaceX's Colossus 1 data center.
Anthropic doubles Claude Code rate limits, secures 220,000 Nvidia GPUs via SpaceX Colossus 1 deal
Anthropic doubled Claude Code's five-hour rate limits across Pro, Max, Team, and Enterprise plans effective Tuesday, removing peak-hours throttling for Pro and Max users. The capacity expansion comes from an exclusive agreement securing all compute at SpaceX's Colossus 1 data center, which provides over 300 megawatts and more than 220,000 Nvidia GPUs.
AWS SageMaker adds automatic instance fallback to prevent GPU capacity failures
Amazon SageMaker AI now supports capacity-aware instance pools that automatically try alternative GPU instance types when primary choices lack capacity. The feature works across endpoint creation, autoscaling, and scale-in operations, eliminating the manual retry cycles that previously left endpoints stuck in failed states.
Google launches Workspace Intelligence semantic layer and TPU 8t/8i chips with 2.8x training performance
Google announced Workspace Intelligence, a semantic understanding layer that connects data across Gmail, Docs, and other Workspace apps to power Gemini features. The company also released TPU 8t chips for training (2.8x better price/performance) and TPU 8i chips for inference (80% better performance-per-dollar).
GitHub halts Copilot Pro signups as agentic AI workloads overwhelm infrastructure
GitHub has paused new subscriptions for Copilot Pro, Pro+, and Student plans due to compute capacity constraints. The company cites agentic workflows as consuming significantly more resources than its original pricing structure anticipated, forcing tighter usage limits and a shift away from flat-rate billing.
Apple, Google, Microsoft join Anthropic's Project Glasswing to find critical software vulnerabilities
Twelve major technology companies—including Apple, Google, Microsoft, Amazon, and Nvidia—have launched Project Glasswing, a coordinated effort to identify and patch critical software vulnerabilities using Anthropic's unreleased Mythos Preview model. The initiative discovered thousands of zero-day vulnerabilities in mission-critical software, including a 27-year-old bug in OpenBSD and a 16-year-old vulnerability in widely-used video software that automated testing tools had missed.
Anthropic blocks Claude subscriptions for OpenClaw, citing capacity constraints
Anthropic has disallowed subscription-based pricing for users accessing Claude through open-source agentic tools like OpenClaw, effective April 4, 2026. The restriction comes as the company faces elevated service errors and struggles to balance capacity with demand. Third-party tool usage will now draw from pay-per-token rates instead of subscription limits.
Meta unveils four custom AI inference chips to cut costs and reduce Nvidia dependency
Meta has unveiled four generations of custom-designed AI chips focused on inference workloads, aiming to reduce inference costs across its platforms serving billions of users. The move represents a significant step toward reducing Meta's dependence on GPU manufacturers like Nvidia and AMD.
Thinking Machines Lab secures Nvidia compute deal with 1+ gigawatt power allocation
Thinking Machines Lab has secured a multi-year compute deal with Nvidia involving at least 1 gigawatt of processing power, according to the company. The agreement also includes a strategic investment from Nvidia, marking a significant infrastructure commitment for the AI research organization.
Meta acquires Moltbook, Reddit-style platform for AI agent collaboration
Meta has acquired Moltbook, a platform built as a Reddit-style community space specifically for AI agents. The acquisition signals Meta's expanding focus on infrastructure for agent-to-agent interaction and collaboration.
Meta signs multi-billion dollar TPU rental deal with Google, challenging Nvidia's chip dominance
Meta has signed a multi-billion dollar deal to rent Google's TPU (Tensor Processing Unit) chips for training its AI models, marking a significant shift away from Nvidia's dominance in AI infrastructure. The arrangement provides Meta with alternative compute capacity while signaling growing competition in the specialized AI chip market.
Nvidia reportedly planning $30 billion investment in OpenAI
Nvidia is reportedly planning a $30 billion investment in OpenAI, according to Reuters citing sources familiar with the matter. The deal would represent one of the largest funding commitments in the AI sector to date. Terms and timeline have not been officially confirmed by either company.