Apple's first AI research paper focuses on computer vision

Apple last week published its first scholarly research paper, an article covering methods of improving recognition in computer vision systems, marking a new direction for the traditionally secretive company.

The paper, titled "Learning from Simulated and Unsupervised Images through Adversarial Training," was submitted for review in mid-November before seeing publication through the Cornell University Library on Dec. 22.

Apple's article arrives less than a month after the company said it would no longer bar employees from publishing research relating to artificial intelligence.

Spotted by Forbes on Monday, the first of Apple's public research papers describes techniques of training computer vision algorithms to recognize objects using synthetic, or computer generated, images.

Compared to training models based solely on real-world images, those leveraging synthetic data are often more efficient because computer generated images are usually labelled. For example, a synthetic image of an eye or hand is annotated as such, while real-world images depicting similar material are unknown to the algorithm and thus need to be described by a human operator.

As noted by Apple, however, relying completely on simulated images might yield unsatisfactory results, as computer generated content is sometimes not realistic enough to provide an accurate learning set. To help bridge the gap, Apple proposes a system of refining a simulator's output through "Simulated+Unsupervised learning."

In practice, this particular flavor of S+U learning combines unlabeled real image data with annotated synthetic images. The technique is based in large part on Generative Adversarial Networks (GANs), which applies two competing neural networks — generator and discriminator — against each other to better discern generated data from real data. A fairly recent development, the process has seen success in the generation of photorealistic "super-resolution" images.

While not necessarily presaging upcoming consumer technology, it is interesting to note Apple elected to apply its modified GAN to the evaluation of gaze and hand pose estimation. In addition, the company says it hopes to one day move S+U learning beyond static images to video input.

Apple's first public research paper was penned by vision expert Ashish Shrivastava and a team of engineers including Tomas Pfister, Oncel Tuzel, Wenda Wang, Russ Webb and Apple Director of Artificial Intelligence Research Josh Susskind. Of note, Susskind made the announcement of Apple's newfound interest in scholarly pursuits earlier this month, a move some believe will help future recruitment efforts.

Comments

cali · 3494 comments · 12 Years About 9 years ago

I don't get what the advantage of spilling your work to the enemy has.

imagine if the U.S. military published papers on what we were working on?

yojimbo007 · 1165 comments · 14 Years About 9 years ago

While we are at it Apple.. Can we get the most basic of AI: Apple Spell Check and contextual word recognition working...beyond the Jurassic state it is in now. Many Thanks , in advance .

ericthehalfbee · 4521 comments · 15 Years About 9 years ago

cali said:

I don't get what the advantage of spilling your work to the enemy has.

imagine if the U.S. military published papers on what we were working on?

A lot of people working on cutting edge stuff want to publish and get recognition for their work. And people in specific fields might want to work for someone (like Apple) if they know more about things they're working on. Especially if it aligns with their own ideas.

But that doesn't mean they're giving away ALL their secrets when they publish. This report is only a 15 page summary. I would think the data Apple has accumulated along with the source code for the software they're using would span 10's of thousands of pages. It's sort of like Apple explaining how their A Series processors work without giving away the actual blueprints to make one.

vvswarup · 338 comments · 16 Years About 9 years ago

Apple's AI ambitions require the expertise of some talented PhDs on computer science. Good luck getting those kinds of people if they can't talk about any of their work, especially when there are so many jobs in computer science.

cashxx · 115 comments · 14 Years About 9 years ago

Hurts them if anything releasing the data I think. Look at the rumors on hardware, etc ahead of time. People think Apple is doing badly and behind times....yea because we all know what is coming ahead of time! We don't get the surprises like we used too.

Share Your Thoughts on our Forums ->

News

Apple's first AI research paper focuses on computer vision

App Store ecosystem surges to $1.4 trillion globally in 2025, from a certain point of view

AirPods Max 2 plunge to $499 at Amazon, the lowest price ever

Apple Card Savings returns keep shrinking as APY falls to 3.4%

These photos are the best leak yet of iPhone 18 Pro colors

Run Google's Gemma LLMs right on your Mac with the new AI Edge Gallery

iPhone 18 Pro Max isn't getting any thinner as Apple focuses elsewhere

Revamped Siri will tap Nvidia chips for fast, private cloud computing

How to manage your privacy on iPhone and iPad

Follow us on Social Media