Apple gains full Gemini access, uses distillation to build lightweight on-device models
Apple has secured full access to Google's Gemini models within its data centers and is using knowledge distillation to generate training data for smaller, on-device AI models. The approach allows Apple to create lightweight versions that approximate Gemini's reasoning patterns while running directly on Apple devices with significantly less processing power.
Apple has secured broad access rights to Google's Gemini models, according to reporting from The Information. The company now has full access to Gemini within its own data centers and, critically, permission to use knowledge distillation—a technique for extracting capabilities from larger models into smaller ones.
How the distillation approach works
Apple is using Gemini to generate high-quality training data, extracting both answers and reasoning chains from the larger model. This output serves as training data for smaller models that Apple builds internally. The result: lightweight models that closely match Gemini's answers and reasoning paths while consuming far less computational resources.
These distilled versions can run directly on Apple devices without requiring cloud connectivity, a key advantage for privacy and latency-sensitive applications.
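The pipeline described above can be sketched in a few lines of Python. This is a minimal, illustrative sketch of sequence-level distillation data generation, not Apple's actual pipeline: every name here (`query_teacher`, `build_distillation_dataset`, the `Q:`/`Thought:`/`A:` format) is an assumption, and the teacher call is stubbed out so the example runs without any external service.

```python
# Sketch: turning a large "teacher" model's answers and reasoning chains
# into supervised training examples for a smaller "student" model.
# All function names and the record format are illustrative assumptions.

from dataclasses import dataclass


@dataclass
class DistillationExample:
    prompt: str
    reasoning: str  # teacher's reasoning chain
    answer: str     # teacher's final answer


def query_teacher(prompt: str) -> tuple[str, str]:
    """Stand-in for a call to the large teacher model.

    In a real pipeline this would be an API call returning the model's
    reasoning chain and final answer; here it is a hard-coded stub.
    """
    reasoning = f"Step 1: parse the question '{prompt}'. Step 2: recall relevant facts."
    answer = f"Answer to: {prompt}"
    return reasoning, answer


def build_distillation_dataset(prompts: list[str]) -> list[DistillationExample]:
    """Collect teacher outputs for each prompt into training examples."""
    dataset = []
    for p in prompts:
        reasoning, answer = query_teacher(p)
        dataset.append(DistillationExample(prompt=p, reasoning=reasoning, answer=answer))
    return dataset


def to_training_text(ex: DistillationExample) -> str:
    """Format one example as a fine-tuning target: the student learns to
    emit the teacher's reasoning followed by the teacher's answer."""
    return f"Q: {ex.prompt}\nThought: {ex.reasoning}\nA: {ex.answer}"


if __name__ == "__main__":
    data = build_distillation_dataset(["What is 2 + 2?"])
    print(to_training_text(data[0]))
```

Fine-tuning a small model on records like these teaches it to reproduce the teacher's answer style and reasoning structure directly, which is what lets the distilled model run on-device without consulting the larger one at inference time.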
The strategic play
This approach mirrors tactics allegedly used by Chinese AI companies, but with a critical difference—Apple has paid for legitimate access rights to Gemini's outputs. The arrangement reflects a pragmatic strategy: rather than building reasoning capabilities from scratch, Apple taps Google's foundation models to train its own smaller variants optimized for device-side execution.
According to The Information, Gemini's design around chatbot and enterprise use cases doesn't perfectly align with Apple's Siri integration goals. This mismatch has motivated Apple to continue building its own models in parallel through its Apple Foundation Models team.
Timeline and expectations
Apple is expected to announce new AI features during its Worldwide Developers Conference in June 2026. The distillation work appears designed to power these announcements with practical, on-device capabilities.
The full scope of Apple's Gemini licensing agreement—including pricing, usage restrictions, and exclusivity terms—remains undisclosed.
What this means
Apple is adopting a hybrid approach: leveraging frontier models from an established leader (Google) while investing in proprietary on-device optimization. This reduces Apple's need to develop world-class reasoning capabilities independently while maintaining control over the user-facing models deployed on its devices. The strategy positions Apple to ship differentiated AI features by WWDC without bearing the full R&D cost of training large foundation models from scratch. For Google, the deal provides both revenue and a high-profile distribution channel for Gemini.
Related Articles
Gemini now imports chats and memory from ChatGPT, Claude, and other AI apps
Google is rolling out chat and memory import functionality to Gemini, allowing users to transfer conversation history from ChatGPT, Claude, and other AI apps. The feature supports zip file uploads up to 5 GB, with users able to upload up to 5 files per day. A companion memory import tool lets users generate context summaries from other chatbots to paste into Gemini.
Google launches Search Live globally, powered by Gemini 3.1 Flash Live
Google is rolling out Search Live globally, its conversational search feature powered by Gemini 3.1 Flash Live, which supports over 90 languages. Simultaneously, Google Translate's live headphones translation mode is launching on iOS after its Android debut, supporting over 70 languages across seven new countries.
Google's Gemini app now creates 3-minute songs with Lyria 3 Pro
Google announced Lyria 3 Pro, expanding the Gemini app's music generation capability from 30-second tracks to full 3-minute songs. The model improves structural understanding of musical composition, allowing users to prompt for specific elements like intros, verses, choruses, and bridges. Available now for Gemini subscribers with tier-based daily limits (10-50 tracks/day) and in Vertex AI, Google AI Studio, and the Gemini API for developers.
Google expands Search Live to 200+ countries with multilingual Gemini 3.1 Flash Live
Google is expanding Search Live, its voice and camera-based AI search assistant, to more than 200 countries and territories with support for dozens of languages. The expansion is powered by Gemini 3.1 Flash Live, a new audio-focused model that Google claims offers faster response times and more natural conversations.