product updateApple

Apple to Use Nvidia Blackwell B200 GPUs in Google Cloud for Gemini-Powered Siri

TL;DR

Apple will process some Siri queries using Nvidia's Blackwell B200 data center GPUs deployed in Google Cloud, according to The Information. The company plans to use Nvidia's confidential compute feature to encrypt data during processing on the chips.

2 min read
0

Apple to Use Nvidia Blackwell B200 GPUs in Google Cloud for Gemini-Powered Siri

Apple will process some Siri queries using Nvidia's Blackwell B200 data center GPUs deployed in Google Cloud, according to a report from The Information. The company plans to use Nvidia's confidential compute feature to encrypt data during processing on the chips.

Technical Infrastructure

According to sources familiar with the matter, Apple will tap into Google's fleet of Nvidia Blackwell B200 chips specifically. The Nvidia B200 is one of Nvidia's flagship data center GPUs for large-scale AI training and inference, positioned for running trillion-parameter models with improvements in inference speed, memory bandwidth, and multi-GPU scaling compared to the previous Hopper architecture.

Apple has approved the use of Nvidia's confidential compute technology in this setup. This hardware-based security system encrypts data while it is actively being processed by the GPUs. According to Nvidia, the feature "preserves the confidentiality and integrity of AI models deployed on Rubin, Blackwell, and Hopper GPUs" and allows "sensitive AI workloads to run securely at scale with near-native performance, even in shared or cloud environments."

Partnership Structure

Under the agreement with Google, certain user queries to a new version of Siri will run in Google Cloud on a licensed version of Google's Gemini model. This arrangement represents a departure from Apple's traditional strategy of controlling all critical components of its products.

The report notes it remains unclear how Apple's previously launched Private Cloud Compute server system will integrate with the upcoming Siri product launch. Apple announced Private Cloud Compute as a privacy-focused server infrastructure for processing sensitive AI workloads.

Deployment Timeline

Apple is expected to detail its AI plans at WWDC, though specific dates for the Gemini-powered Siri rollout have not been disclosed.

What This Means

Apple's reliance on Google Cloud and Nvidia hardware marks a significant shift from its vertical integration strategy. The company is betting that Nvidia's confidential compute can provide sufficient privacy guarantees for processing user data on third-party infrastructure—a claim that will face scrutiny given Apple's privacy positioning. The architecture suggests Apple either cannot or chooses not to run these workloads on its own silicon, raising questions about the performance requirements of Gemini integration versus the capabilities of Apple's server chips.

Related Articles

product update

Google launches Gemini Go for Android Go devices with 2GB+ RAM, replaces Assistant Go

Google is rolling out Gemini Go, a streamlined version of its Gemini AI assistant designed for Android Go devices with 2GB of RAM or more. The update replaces Assistant Go and integrates into the Google Search app on entry-level Android phones.

product update

Google's Gemini Android overlay adds Dynamic Color theming and relocates Screen content capture

Google is rolling out interface updates to the Gemini overlay on Android. The overlay now supports Dynamic Color theming in version 17.28 of the Google app, with the Screen content capture feature relocated from the tools menu to the main carousel alongside Photos, Camera, Files, Drive, and Notebooks.

product update

Google's Gemini Spark AI agent uses personal data to plan trips, raising privacy concerns

Google's Gemini Spark, an AI agent rolling out to the company's $99/month AI Ultra plan, demonstrates advanced capabilities by mining users' Gmail, calendar, photos, and location data to create detailed trip itineraries. The agent can perform actions across apps and operate computers, though third-party services like Airbnb currently block its booking attempts.

product update

Cline v3.87.0 Adds MiniMax M3 Support, Patches 5 Security Vulnerabilities

Cline, the open-source AI coding assistant, released v3.87.0 on June 3, 2025, adding support for MiniMax's M3 model. The update patches security vulnerabilities in five packages including @xmldom/xmldom, basic-ftp, axios, and undici.

Comments

Loading...