Apple Uses Google Gemini to Distill On-Device AI for iOS 27

Apple is leveraging Google's Gemini AI models to create smaller, on-device versions through distillation. According to The Information, Google gave Apple "complete access" to Gemini in Google's own data centers, allowing Apple to customize the model for Siri and other AI features.

How the Distillation Process Works

Apple can ask the main Gemini model to perform tasks that provide high-quality results, including a rundown of the reasoning process. Apple then feeds the answers and reasoning information from Gemini to train smaller, cheaper models. This enables the smaller models to learn the internal computations used by Gemini, producing efficient models with Gemini-like performance but requiring less computing power.

Technical Details and Challenges

Apple can design models built to run on Apple devices without internet connectivity
Apple can edit Gemini as needed to ensure responses align with Apple's requirements
Apple has encountered issues because Gemini was tuned for chatbot and coding applications, which doesn't always meet Apple's needs
The smarter, chatbot version of Siri planned for iOS 27 will rely on Google's Gemini models

Capabilities and Development

Siri will be able to perform many of the same functions as Gemini and other chatbots, including:

Answering questions
Summarizing information
Scanning and understanding uploaded documents
Telling stories
Providing emotional support
Completing tasks like booking travel

The Apple Foundation Models team continues to work on Apple AI models distinct from Gemini models, indicating this is a transitional approach while Apple develops its own AI capabilities.

📖 Read the full source: HN AI Agents

Apple Using Google Gemini Access for On-Device AI Model Distillation

How the Distillation Process Works

Technical Details and Challenges

Capabilities and Development

👀 See Also

OpenClaw Codex OAuth returning billing errors despite valid account

OpenClaw: Dive Into the First AMA on r/clawdbot

Longitudinal study finds AI productivity gains at 10%, not 10x

Real-World Hourly Costs for Long-Running AI Agent Teams