Apple is using distillation to carve Google Gemini's massive models down into smaller, more secure components better suited to on-device Apple Intelligence processing.

The deal Apple made with Google allows the iPhone maker to use Google's Gemini AI models as a basis for its own updated AI. It now appears that the deal lets Apple use some techniques to make models that will work better for on-device processing than the mainly server-based Gemini itself.

It was previously known that Apple could adjust the Google Gemini model to respond to queries in specific ways. However, according to The Information on Wednesday, Apple has a lot more leeway in what it can do and access within the model.

This includes complete access to the model within its own data centers, which gives its engineers the ability to closely examine Gemini and how it functions.

A key element of the report is that Apple can perform distillation, a technique that transfers knowledge from a larger model into a smaller one. The idea is for some of the smaller models within Gemini to teach a smaller Apple model, gradually segmenting and separating out the model's knowledge base.

To do this, Apple's model acts as a student, learning how Gemini's internal computations handle a function and mimicking that processing. Apple can request high-quality results and the Gemini model's "chain of thought," which can then be used to train its own small model.

The end result is a smaller model that performs a specific function just like the larger one, running at about the same speed and accuracy as the original Gemini model for that task.
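The teacher-student training described above is a well-established technique in machine learning. As a rough illustration only, and not Apple's actual pipeline, the classic distillation objective trains the student to match the teacher's temperature-softened output distribution:

```python
# Generic knowledge-distillation sketch: a small "student" model is
# trained to match the softened output distribution of a large
# "teacher" model. All names here are illustrative, not Apple's or
# Google's actual code.
import numpy as np

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax; a higher temperature spreads
    probability mass, exposing more of the teacher's 'dark knowledge'."""
    z = np.asarray(logits, dtype=float) / temperature
    z -= z.max()  # subtract max for numerical stability
    e = np.exp(z)
    return e / e.sum()

def distillation_loss(student_logits, teacher_logits, temperature=2.0):
    """KL divergence from the teacher's softened distribution to the
    student's -- the standard distillation training signal. The student
    is penalized the further its distribution drifts from the teacher's."""
    p = softmax(teacher_logits, temperature)  # teacher "soft targets"
    q = softmax(student_logits, temperature)  # student predictions
    return float(np.sum(p * (np.log(p + 1e-12) - np.log(q + 1e-12))))

# A student that reproduces the teacher's logits incurs zero loss;
# one that disagrees incurs a large loss and gets pushed toward the teacher.
teacher = [4.0, 1.0, 0.2]
print(distillation_loss([4.0, 1.0, 0.2], teacher))  # 0.0: perfect mimicry
print(distillation_loss([0.2, 1.0, 4.0], teacher))  # large: poor mimicry
```

Minimizing this loss over many teacher outputs is what gradually copies the larger model's behavior for a given function into the smaller one.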

The benefit is that the smaller model is cheaper to run and can use less powerful hardware. That means models could be created to work on devices like an iPhone, instead of requiring a server.

Small models, but big ones are questionable

The distillation effort is one of the tasks being carried out by Apple Foundation Models, the team working on Apple's AI projects. The ultimate scope of the AFM team's remit, however, is still unknown.

According to the report's sources, the creation of mini models splintered off from Gemini is clearly for on-device processing purposes. But questions remain about making a bigger version.

One unnamed source doubted that the distilled models would be used as a base for creating new larger models. They don't believe the team is actually trying to make a direct competitor to Gemini at all at this time.

For the moment, Gemini will continue to provide responses for Siri. With expectations of Apple finally pulling off its big Siri refresh in June, Google's model may have more work to do in its undistilled form.

Apple clearly believes on-device processing is the path ahead for its AI success. It's in the process of acquiring model pieces to continue down that road, but a bigger, comprehensive Siri model is off the table for the moment at least.