infrastructure

15 articles tagged with infrastructure

June 8, 2026

AWS Launches AgentCore Runtime for Persistent Coding Agent Sessions That Don't Die When Laptops Close

Amazon Web Services has launched AgentCore Runtime on Bedrock, providing dedicated Linux microVMs with persistent 14-day storage for coding agents. The service eliminates the need to keep laptops open during agent sessions and supports parallel execution of Claude Code, Codex, Kiro, OpenCode, and other coding agents with isolated environments.

June 8, 2026 · 4:50 PM

May 28, 2026

product updateMistral AI

Mistral AI Launches Compute Infrastructure Service with Tens of Thousands of NVIDIA GPUs

Mistral AI has launched Mistral Compute, an AI infrastructure service offering private, integrated stacks including GPUs, orchestration, and APIs. The service will provide access to tens of thousands of NVIDIA GPUs, targeting European, Middle Eastern, and Asian customers seeking alternatives to US or China-based cloud providers.

May 28, 2026 · 9:50 AM

May 6, 2026

product updateAnthropic

Anthropic Doubles Claude Code Rate Limits, Secures 300+ MW Compute from SpaceX's Colossus 1

Anthropic has secured access to all compute capacity at SpaceX's Colossus 1 data center, adding more than 300 megawatts of new capacity within the month. As a result, the company is doubling five-hour rate limits for paid Claude Code users and removing peak hour restrictions for Pro and Max tiers.

May 6, 2026 · 6:35 PM

changelogAnthropic

Anthropic doubles Claude Code usage limits for paid users, increases API capacity by up to 1500%

Anthropic has doubled Claude Code's five-hour usage limits for Pro, Max, Team, and Enterprise users while removing peak hour restrictions for Pro and Max plans. The company also increased API limits by up to 1500% for input tokens per minute through a compute capacity deal with SpaceX's Colossus 1 data center.

May 6, 2026 · 5:20 PM

changelogAnthropic

Anthropic doubles Claude Code rate limits, secures 220,000 Nvidia GPUs via SpaceX Colossus 1 deal

Anthropic doubled Claude Code's five-hour rate limits across Pro, Max, Team, and Enterprise plans effective Tuesday, removing peak-hours throttling for Pro and Max users. The capacity expansion comes from an exclusive agreement securing all compute at SpaceX's Colossus 1 data center, which provides over 300 megawatts and more than 220,000 Nvidia GPUs.

May 6, 2026 · 5:05 PM

May 4, 2026

product update

AWS SageMaker adds automatic instance fallback to prevent GPU capacity failures

Amazon SageMaker AI now supports capacity-aware instance pools that automatically try alternative GPU instance types when primary choices lack capacity. The feature works across endpoint creation, autoscaling, and scale-in operations, eliminating the manual retry cycles that previously left endpoints stuck in failed states.

May 4, 2026 · 4:20 PM

April 22, 2026

product update

Google launches Workspace Intelligence semantic layer and TPU 8t/8i chips with 2.8x training performance

Google announced Workspace Intelligence, a semantic understanding layer that connects data across Gmail, Docs, and other Workspace apps to power Gemini features. The company also released TPU 8t chips for training (2.8x better price/performance) and TPU 8i chips for inference (80% better performance-per-dollar).

April 22, 2026 · 12:05 PM

April 20, 2026

product updateMicrosoft

GitHub halts Copilot Pro signups as agentic AI workloads overwhelm infrastructure

GitHub has paused new subscriptions for Copilot Pro, Pro+, and Student plans due to compute capacity constraints. The company cites agentic workflows as consuming significantly more resources than its original pricing structure anticipated, forcing tighter usage limits and a shift away from flat-rate billing.

April 20, 2026 · 11:50 PM

April 7, 2026

product updateApple

Apple, Google, Microsoft join Anthropic's Project Glasswing to find critical software vulnerabilities

Twelve major technology companies—including Apple, Google, Microsoft, Amazon, and Nvidia—have launched Project Glasswing, a coordinated effort to identify and patch critical software vulnerabilities using Anthropic's unreleased Mythos Preview model. The initiative discovered thousands of zero-day vulnerabilities in mission-critical software, including a 27-year-old bug in OpenBSD and a 16-year-old vulnerability in widely-used video software that automated testing tools had missed.

April 7, 2026 · 6:20 PM

April 6, 2026

product updateAnthropic

Anthropic blocks Claude subscriptions for OpenClaw, citing capacity constraints

Anthropic has disallowed subscription-based pricing for users accessing Claude through open-source agentic tools like OpenClaw, effective April 4, 2026. The restriction comes as the company faces elevated service errors and struggles to balance capacity with demand. Third-party tool usage will now draw from pay-per-token rates instead of subscription limits.

April 6, 2026 · 7:50 PM

March 12, 2026

product update

Meta unveils four custom AI inference chips to cut costs and reduce Nvidia dependency

Meta has unveiled four generations of custom-designed AI chips focused on inference workloads, aiming to reduce inference costs across its platforms serving billions of users. The move represents a significant step toward reducing Meta's dependence on GPU manufacturers like Nvidia and AMD.

March 12, 2026 · 12:05 PM

March 10, 2026

funding

Thinking Machines Lab secures Nvidia compute deal with 1+ gigawatt power allocation

Thinking Machines Lab has secured a multi-year compute deal with Nvidia involving at least 1 gigawatt of processing power, according to the company. The agreement also includes a strategic investment from Nvidia, marking a significant infrastructure commitment for the AI research organization.

March 10, 2026 · 3:20 PM

product update

Meta acquires Moltbook, Reddit-style platform for AI agent collaboration

Meta has acquired Moltbook, a platform built as a Reddit-style community space specifically for AI agents. The acquisition signals Meta's expanding focus on infrastructure for agent-to-agent interaction and collaboration.

March 10, 2026 · 2:35 PM

February 27, 2026

product update

Meta signs multi-billion dollar TPU rental deal with Google, challenging Nvidia's chip dominance

Meta has signed a multi-billion dollar deal to rent Google's TPU (Tensor Processing Unit) chips for training its AI models, marking a significant shift away from Nvidia's dominance in AI infrastructure. The arrangement provides Meta with alternative compute capacity while signaling growing competition in the specialized AI chip market.

February 27, 2026 · 4:35 PM

February 20, 2026

fundingNVIDIA

Nvidia reportedly planning $30 billion investment in OpenAI

Nvidia is reportedly planning a $30 billion investment in OpenAI, according to Reuters citing sources familiar with the matter. The deal would represent one of the largest funding commitments in the AI sector to date. Terms and timeline have not been officially confirmed by either company.

February 20, 2026 · 1:20 PM

← Back to all news