Apple Using Google Gemini Access for On-Device AI Model Distillation

Apple is leveraging Google's Gemini AI models to create smaller, on-device versions through distillation. According to The Information, Google gave Apple "complete access" to Gemini in Google's own data centers, allowing Apple to customize the model for Siri and other AI features.
How the Distillation Process Works
Apple can ask the main Gemini model to perform tasks that provide high-quality results, including a rundown of the reasoning process. Apple then feeds the answers and reasoning information from Gemini to train smaller, cheaper models. This enables the smaller models to learn the internal computations used by Gemini, producing efficient models with Gemini-like performance but requiring less computing power.
Technical Details and Challenges
- Apple can design models built to run on Apple devices without internet connectivity
- Apple can edit Gemini as needed to ensure responses align with Apple's requirements
- Apple has encountered issues because Gemini was tuned for chatbot and coding applications, which doesn't always meet Apple's needs
- The smarter, chatbot version of Siri planned for iOS 27 will rely on Google's Gemini models
Capabilities and Development
Siri will be able to perform many of the same functions as Gemini and other chatbots, including:
- Answering questions
- Summarizing information
- Scanning and understanding uploaded documents
- Telling stories
- Providing emotional support
- Completing tasks like booking travel
The Apple Foundation Models team continues to work on Apple AI models distinct from Gemini models, indicating this is a transitional approach while Apple develops its own AI capabilities.
📖 Read the full source: HN AI Agents
👀 See Also

OpenClaw Codex OAuth returning billing errors despite valid account
OpenClaw Codex OAuth is returning a 429 error stating 'Your account is not active, please check your billing details' even though billing is confirmed valid and the exec command works. The issue persists across multiple OpenClaw versions.

OpenClaw: Dive Into the First AMA on r/clawdbot
In an exciting AMA session, the OpenClaw team discussed the future of AI coding agents on Reddit's r/clawdbot. Discover key insights and takeaways from this interactive event.

Longitudinal study finds AI productivity gains at 10%, not 10x
A longitudinal study tracking 40 companies from November 2024 through February 2026 found AI usage increased by 65% on average, but pull request throughput only increased by 9.97%. The data suggests coding was never the primary bottleneck in software development.

Real-World Hourly Costs for Long-Running AI Agent Teams
A developer shares actual hourly costs for AI agent teams running 5+ hour sessions with full Linux, browser, and tool access. Coding agents cost $10-$60/hr, marketing agents $10-$30/hr, and back-office agents $5-$15/hr.