• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia
TechAI

Facebook says its new Instagram-trained A.I. represents a big leap forward for computer vision

Jeremy Kahn
By
Jeremy Kahn
Jeremy Kahn
Editor, AI
Down Arrow Button Icon
Jeremy Kahn
By
Jeremy Kahn
Jeremy Kahn
Editor, AI
Down Arrow Button Icon
March 4, 2021, 10:22 AM ET

Our mission to make business better is fueled by readers like you. To enjoy unlimited access to our journalism, subscribe today.

Facebook has created an artificial intelligence system that may make it much more efficient for companies to train such software for a range of computer vision tasks, from facial recognition to functions needed for self-driving cars.

The company unveiled the new system in a series of blog posts Thursday.

Today, training machine-learning systems for such tasks often requires hundreds of thousands or even millions of labeled data sets. Creating an accurately labeled data set for this training can be both expensive and time-consuming.

Learning in baby steps

Facebook’s breakthrough allows an A.I. model to be trained from a very large set of unlabeled image data and then fine-tuned for a wide range of specific vision-related tasks using just a tiny fraction of the amount of labeled data that such software typically requires.

Yann LeCun, Facebook’s chief A.I. scientist, said that the idea is to create artificial intelligence that can learn the way a human infant does. That is, through observation and by building a mental model of the relationships between objects.

“Babies learn how the world works by watching the spectacle of the world,” LeCun told Fortune. “Once you have a good understanding and representation of the world, you can learn any task relatively quickly.”

This is why most teenagers can learn to drive with only a few hours of lessons, LeCun said. Today’s software for self-driving cars, on the other hand, requires millions of simulated hours to equal the same level of performance.

The ability to learn from far fewer labeled examples is critical for a wide range of commercial A.I. applications, LeCun said. In medical imaging diagnostics, for example, much of today’s computer vision software requires tens of thousands of annotated examples to reach the same accuracy as a human radiologist. But for a rare lung condition, there might not be tens of thousands of examples available to train such a system.

What you SEER is what you get

In recent years, the use of similar techniques in natural language processing has resulted in giant leaps forward in the capabilities of A.I. software. The latest technology can perform tasks such as language translation, document summarization, answering questions about a text, and writing long passages of coherent text from a simple human-written prompt. The same techniques have also allowed big performance improvements in speech recognition for digital assistants such as Amazon’s Alexa and Google Assistant.

Now Facebook hopes that its new A.I. will result in a similar leap forward in the capabilities of computer vision systems, and possibly also systems that can learn the relationship between images and the words that describe those images.

The new A.I., which Facebook calls SEER, is a breakthrough in a type of machine learning called self-supervision. This type of A.I. model uncovers the relationships in data on its own, using statistical methods, without the need for labeled data to act as a kind of instructor that tells the system how to link a given input to a given output. (SEER is a shortening of the phrase “self-supervised,” according to a Facebook blog post announcing the A.I. system.)

In this case, SEER is an ultra-large vision model, taking in more than 1 billion variables and having been trained on more than 1 billion images from publicly available Instagram accounts. This has also been the trend with self-supervision in natural language processing. Some of the best systems take in hundreds of billions of variables and are trained from data sets that include almost everything publicly available on the Internet.

On ImageNet, the field’s signature image identification benchmark test, SEER achieved an 84.2% accuracy, even though it had not been trained on that data. The results outperformed the best previous self-supervised systems that had been trained for that task.

SEER also outperformed the best systems that have been trained from labeled data on tasks such as object detection, segmenting an image into component parts, and image classification. When given just 10% of the labeled ImageNet examples to train on, SEER still achieved 77.9% accuracy on the full ImageNet data set. With only 1% of the annotated ImageNet examples, the A.I. achieved a 60.5% accuracy.

Doing it for the ’Gram

Although Facebook is not yet using SEER or any other fully self-supervised computer vision A.I. on its own social networks, LeCun says that the company does use a system that is weakly supervised, having been trained on images paired with Instagram hashtags. It is this A.I. that allows Facebook to group users’ photos for them thematically and also allows the company to automatically detect hateful images or terrorist propaganda. LeCun said he thought SEER, or software based on the same underlying algorithms, would likely become the company’s base computer vision system, which will then be fine-tuned for specific use cases in the near future.

LeCun acknowledges that the size of these very large, self-supervised A.I. systems, and the expense of the computer hardware needed to train and run them, can intimidate business executives and academic researchers alike. But he pointed out that new advances in computer chips designed specifically to run large neural networks, a kind of machine-learning software design loosely based on the human brain that underpins most recent advances in A.I., including SEER, were significantly outpacing the growth of these large machine-learning systems. In other words, the cost of training should decline in the future.

LeCun also noted that even the most massive artificial neural networks in use today have about as many connections as a mouse brain. Creating machines that could equal human intelligence would almost certainly require much larger software systems.

LeCun, who is a past recipient of the Turing Award, computer science’s highest accolade, was dismissive of concerns about the carbon footprint of these large, self-supervised A.I. models. He said all the data centers in the world consumed only about 1% to 2% of the planet’s electricity, with the training and use of A.I. algorithms making up an even smaller fraction of that amount. He also said the new computer chips designed for A.I. were more energy efficient than older hardware at running large A.I. systems. So that even if this software kept growing in size, its energy footprint per decision should decline over time.

Ethical dilemmas

LeCun said he took more seriously another ethical concern raised about these ultra-large, self-supervised systems: that because they are trained from massive amounts of Internet data, they can pick up the biases, including racial and gender stereotypes, inherent in that data. Often these biases are not obvious until the system is deployed. Because the training data sets are so large, auditing them for bias can be difficult.

LeCun said that removing such biases from self-supervised systems might require a specialized training of the A.I. with an additional, smaller data set curated essentially to unteach the system a specific bias. More research would need to be done to figure out the best way to do this.

“It’s a complicated issue,” LeCun said. But self-supervised systems were, in his view, less likely to be biased than some A.I. software that learns from labeled examples, he said, because the labels are often applied by biased humans and the data sets were smaller, so each biased example would have a bigger impact.

The Facebook scientist’s views on the potential bias of A.I. systems have gotten him into trouble in the past. Last year, LeCun temporarily abandoned Twitter after a spat over the social media platform with Timnit Gebru—an A.I. ethics researcher who was recently, and controversially, pushed out from Google after raising concerns about large, self-supervised language models. Some other computer scientists accused LeCun of being tone-deaf and unfairly high-handed in his back-and-forth with Gebru, who is one of the few prominent Black women in A.I. research. The dispute centered on the nature of potential harms A.I. systems could cause and how much responsibility machine-learning researchers needed to take for addressing them.

LeCun says that the next steps for the self-supervised techniques behind SEER is to extend them from still images to video. That’s no easy feat, he said, acknowledging that trying to develop A.I. systems that have enough understanding of the world that they can accurately predict what will happen next in a video was a problem that had stymied computer scientists, himself included, for years. Another ripe research area is “multimodal learning,” in which an A.I. system is trained on both images and text simultaneously.

At the heart of the SEER system is an algorithm that Facebook calls SwAV, short for “swapping assignments between multiple views,” that involves clustering images. First the algorithm applies some distortions to an image; in this case, a series of crops that are used to create multiple “views” of the same picture. Then the algorithm tries to accurately determine which cluster it should assign the original image to based on these alternative views.

This new method allows the system to be trained much more efficiently. Training of this sort required a sixth of the data needed in previous methods that were based on comparing just two image views at a time, according to Facebook’s blog post.

Facebook is making the SwAV algorithm open source and free for anyone to use. The company is also making available a set of components for creating self-supervised computer vision systems and benchmarking tools for those systems, called VISSL, to the public.

This story has been updated to make clear that Facebook is not making its trained SEER A.I. model or the Instagram data set used to train it available to the public.

More must-read tech coverage from Fortune:

  • You aren’t alone: Why Americans will receive 48 billion robocalls this year
  • Instagram’s new streaming feature looks a lot like Zoom
  • Can we really banish email from the workplace? Author Cal Newport says yes
  • How Microsoft plans to give virtual reality apps a big boost
  • PlayStation is killing its on-demand video service


About the Author
Jeremy Kahn
By Jeremy KahnEditor, AI
LinkedIn iconTwitter icon

Jeremy Kahn is the AI editor at Fortune, spearheading the publication's coverage of artificial intelligence. He also co-authors Eye on AI, Fortune’s flagship AI newsletter.

See full bioRight Arrow Button Icon

Latest in Tech

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • Future 50
  • World’s Most Admired Companies
  • See All Rankings
Sections
  • Finance
  • Leadership
  • Success
  • Tech
  • Asia
  • Europe
  • Environment
  • Fortune Crypto
  • Health
  • Retail
  • Lifestyle
  • Politics
  • Newsletters
  • Magazine
  • Features
  • Commentary
  • Mpw
  • CEO Initiative
  • Conferences
  • Personal Finance
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
About Us
  • About Us
  • Editorial Calendar
  • Press Center
  • Work At Fortune
  • Diversity And Inclusion
  • Terms And Conditions
  • Site Map

Latest in Tech

Bill Gates speaks onstage at the Bloomberg Philanthropies Global Forum 2025 at The Plaza Hotel on September 24, 2025 in New York City.
AIBill Gates
Bill Gates says AI could be used as a bioterrorism weapon akin to the COVID pandemic if it falls into the wrong hands
By Eleanor PringleJanuary 9, 2026
13 hours ago
shapiro
Big TechMedia
Netflix’s competition isn’t sleep anymore. Its battle against YouTube is like fighting an ‘infinite number of monkeys,’ top strategist says
By Nick LichtenbergJanuary 9, 2026
13 hours ago
Bill Gates pictured in New York, May 2025.
SuccessBill Gates
Read Bill Gates’s 2026 annual letter in full
By Eleanor PringleJanuary 9, 2026
13 hours ago
A screen displays the Grok logo while a person holds another phone in front.
AIX
Lawmakers and victims criticize the choice to limit Grok’s AI image generation to paid users as ‘insulting’ and ‘not effective’
By Beatrice NolanJanuary 9, 2026
15 hours ago
NewslettersTerm Sheet
Andreessen Horowitz’s shiny, new $15 billion reveals where the firm sees the biggest opportunities
By Allie GarfinkleJanuary 9, 2026
16 hours ago
Chinese and U.S. flags wave outside a technology company in Beijing, on April 17, 2025. (Photo: Pedro Pardo/AFP/Getty Images)
NewslettersFortune Tech
‘Salt Typhoon’ hackers accessed email of U.S. congressional committee staff
By Andrew NuscaJanuary 9, 2026
16 hours ago

Most Popular

placeholder alt text
North America
Bill Gates warns the world is going 'backwards' and gives 5-year deadline before we enter a new Dark Age
By Eleanor PringleJanuary 9, 2026
13 hours ago
placeholder alt text
Success
Diary of a CEO founder says he hired someone with 'zero' work experience because she 'thanked the security guard by name' before the interview
By Emma BurleighJanuary 8, 2026
1 day ago
placeholder alt text
Workplace Culture
Amazon demands proof of productivity from employees, asking for list of accomplishments
By Jake AngeloJanuary 8, 2026
1 day ago
placeholder alt text
Crypto
Russia and Iran are increasingly turning to crypto—especially stablecoins—to avoid sanctions, report finds
By Carlos GarciaJanuary 8, 2026
2 days ago
placeholder alt text
Politics
White House says it's 'reviewing protocols' after Trump seemingly violated federal policy by disclosing jobs data early
By Eva RoytburgJanuary 9, 2026
8 hours ago
placeholder alt text
Real Estate
Google billionaire Larry Page copies the Jeff Bezos playbook, buying a $173 million Miami compound that will save him millions in taxes
By Nick LichtenbergJanuary 8, 2026
1 day ago

© 2025 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.