Apple isn't renting Gemini. It's learning from it.

The Summary

  • Apple's Google deal includes full access to Gemini inside Apple's own data centers, allowing it to distill smaller, specialized models from the larger teacher model.
  • This isn't white-labeling. Apple can extract knowledge from Gemini to build proprietary models that run locally on devices or in Private Cloud Compute.
  • The architecture reveals Apple's actual AI strategy: learn from the giants, then own the execution layer where privacy and performance matter.

The Signal

Most observers saw Apple's Gemini partnership as an admission of weakness. Turns out it was a training program. The Information reports Apple has complete access to Gemini inside its own infrastructure, not just API calls to Google's servers. That access enables distillation, the process by which a large "teacher" model trains a smaller "student" to reproduce its core capabilities at a fraction of the compute cost.
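To make "teaches a smaller one" concrete: one common form of the distillation objective is the soft-target loss from Hinton et al., where the student is trained to match the teacher's temperature-softened output distribution rather than hard labels. A minimal numpy sketch (the logits and temperature here are illustrative, not anything from Gemini; the T² gradient scaling used in practice is omitted for brevity):

```python
import numpy as np

def softmax(logits, T=1.0):
    # Temperature-scaled softmax; higher T spreads probability mass
    # across classes, exposing the teacher's "dark knowledge".
    z = np.asarray(logits, dtype=float) / T
    z -= z.max()  # numerical stability
    e = np.exp(z)
    return e / e.sum()

def distill_loss(teacher_logits, student_logits, T=2.0):
    # KL divergence from the teacher's softened distribution to the
    # student's: zero when the student reproduces the teacher exactly.
    p = softmax(teacher_logits, T)
    q = softmax(student_logits, T)
    return float(np.sum(p * (np.log(p) - np.log(q))))

teacher = [4.0, 1.0, 0.2]  # hypothetical teacher logits
student = [3.5, 1.2, 0.1]  # hypothetical student logits
loss = distill_loss(teacher, student)
```

The point of the soft targets: a near-miss class the teacher rates at 20% carries information that a hard label throws away, which is why a small student can recover much of a large model's behavior.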

This is the smart play for anyone building at scale. Google spent billions training Gemini. Apple gets to compress that knowledge into task-specific models that run on A17 Pro chips or in Private Cloud Compute without sending user data to Mountain View. The economics are brutal for Google, elegant for Apple. Train once on Google's dime, deploy everywhere on Apple's terms.

The Private Cloud Compute detail matters more than it seems. Apple built PCC specifically to run AI workloads with verifiable privacy guarantees. Running distilled Gemini models there means Apple can offer cloud-powered intelligence without the privacy nightmare of third-party AI services. Users get capable AI, Apple keeps the trust moat intact, Google gets... a check and the knowledge that Apple is systematically learning to not need them.

Distillation is how agent infrastructure will scale. The big foundation models are too expensive, too slow, and too general for most real work. The winning pattern is clear: capture intelligence from foundation models, compress it into specialized agents that do one thing well, run them where the data lives. Apple just proved the playbook works at consumer scale.
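The capture-compress-deploy loop above can be sketched end to end in miniature. In this toy version (everything here is illustrative: a fixed sigmoid stands in for the expensive teacher, and the student is a two-parameter logistic model), the student never sees ground-truth labels, only the teacher's soft outputs, and ends up cheap enough to "run where the data lives":

```python
import numpy as np

rng = np.random.default_rng(0)

def teacher(x):
    # Stand-in for a large foundation model: a smooth, "knowledgeable"
    # probability that x belongs to the positive class.
    return 1.0 / (1.0 + np.exp(-3.0 * (x - 0.5)))

# Step 1: capture intelligence — query the teacher on unlabeled inputs.
X = rng.uniform(-1, 2, size=512)
soft_targets = teacher(X)

# Step 2: compress — fit a tiny student (logistic regression) to the
# teacher's soft outputs by gradient descent on cross-entropy.
w, b = 0.0, 0.0
lr = 0.5
for _ in range(2000):
    p = 1.0 / (1.0 + np.exp(-(w * X + b)))
    grad = p - soft_targets          # cross-entropy gradient signal
    w -= lr * np.mean(grad * X)
    b -= lr * np.mean(grad)

# Step 3: deploy — the "distilled" student is just two floats.
def student(x):
    return 1.0 / (1.0 + np.exp(-(w * x + b)))
```

Real LLM distillation swaps the sigmoid for billions of teacher logits and the two floats for a few billion student parameters, but the shape of the trade is the same: the expensive model is queried once, the cheap model runs forever.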

The Implication

Watch who else negotiates distillation rights into their AI partnerships. The foundation model race was act one. Act two is who can compress that intelligence into agents that run locally, cheaply, and privately. If you're building agent infrastructure, the question isn't which foundation model to use. It's how fast you can learn from them and stop needing them.


Source: Daring Fireball