Apple deploys Google-trained models in iOS 27 Siri via Private Cloud Compute on Nvidia GPUs
Apple's senior vice president Craig Federighi disclosed that iOS 27's Siri AI uses a family of third-generation Apple Foundation Models trained with outputs from Google's Gemini frontier models. The most capable model, AFM Cloud Pro, runs on Nvidia GPUs in Google's cloud infrastructure while maintaining Apple's Private Cloud Compute privacy architecture.
Apple deploys Google-trained models in iOS 27 Siri via Private Cloud Compute on Nvidia GPUs
Apple's senior vice president of software engineering Craig Federighi disclosed the technical architecture behind iOS 27's Siri AI in a post-WWDC tech talk on June 8, 2026, revealing that Apple's third-generation foundation models were trained using outputs from Google's Gemini frontier models.
Federighi clarified that Apple uses "none" of Google's client code, customer-facing models, deployment infrastructure, or knowledge bases like Google Search. Instead, Apple built a family of proprietary models refined using Gemini training data.
The AFM model family
According to Amar Subramanya, Apple's vice president of AI, the third-generation Apple Foundation Models (AFM) include:
- AFM Core: On-device dense architecture model, next generation of current shipping models
- AFM Core Advanced: On-device sparse architecture model with native multimodal capabilities, enabling features like invitation detection and expressive voices
- AFM Cloud: Server-side workhorse optimized for latency and serving cost
- AFM Cloud Image: Image generation and editing model supporting spatial reframing
- AFM Cloud Pro: Most capable model for agentic tool use and complex reasoning, with "quality similar to Gemini frontier models"
All models except AFM Cloud Pro are custom-built for Apple Silicon, trained with proprietary data, and refined using outputs from Gemini frontier models. Pricing was not disclosed.
Nvidia GPU deployment
For AFM Cloud Pro specifically, Apple collaborated with Google and Nvidia to extend Private Cloud Compute infrastructure to Nvidia GPUs in Google's cloud. Federighi emphasized that Apple's privacy architecture remains intact: requests are never stored, never accessible to Apple or third parties, and third-party researchers can continuously verify the privacy guarantees.
The System Orchestrator determines whether to process requests on-device or route them to Private Cloud Compute models based on complexity. Apple's World Knowledge Service provides grounding for current events and world knowledge queries, separate from Google Search.
Architecture details
The iOS 27 Siri system includes:
- System Orchestrator: Coordinates privacy-preserving request routing
- App Toolbox: Provides access to app actions
- Spotlight Semantic Index: Accesses personal content
- On-screen context understanding: Processes visual environment
- Private Cloud Compute: Extends iPhone privacy guarantees to cloud processing
Subramanya stated the goal is "to match every user request to the model which provides the best response at the lowest latency" across the model family.
What this means
Apple's approach represents a hybrid strategy: leveraging Google's frontier model capabilities for training while maintaining architectural independence and privacy controls. The Nvidia GPU deployment for the most demanding tasks indicates Apple's willingness to use external infrastructure when its own silicon cannot match performance requirements, provided privacy guarantees hold. The sparse architecture in AFM Core Advanced suggests Apple is adopting mixture-of-experts techniques for on-device efficiency, a departure from previous dense model designs.
Related Articles
Apple announces Siri AI powered by Google Gemini models at WWDC 2026
Apple announced Siri AI at WWDC 2026, revealing a "deep collaboration with Google" that leverages Gemini models for its next-generation Apple Intelligence features. The new Siri includes personal context understanding, app actions, on-screen awareness, and conversational capabilities previously absent from the original Siri.
Apple's new Siri AI introduces usage caps and paid upgrades for image generation
Apple unveiled Siri AI at WWDC 2026, a rebuilt version of its assistant powered by Apple Intelligence and enhanced with Google Gemini. The company confirmed daily usage limits for features like image generation, with increased access requiring iCloud+ subscriptions. Siri AI won't launch in China and faces EU restrictions.
Google upgrades Gemini voice assistant with hourly weather forecasts and conversational news on Nest Hub
Google is rolling out Gemini for Home version 4.18, adding hourly weather forecasts with improved temperature accuracy, natural language controls for streaming movies and TV shows, and conversational news briefings across Nest devices. The update focuses on making smart display interactions more detailed and conversational.
Apple launches standalone Siri app with conversation history and multi-modal interface
Apple announced a standalone Siri app at WWDC 2026, marking what the company calls the assistant's biggest transformation. The app archives conversation history, supports text and voice input plus document and image uploads, and syncs across Apple devices via iCloud.
Comments
Loading...