• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia
TechAI

Why synthetic data is such a hot topic in the artificial intelligence world

By
Jonathan Vanian
Jonathan Vanian
Down Arrow Button Icon
By
Jonathan Vanian
Jonathan Vanian
Down Arrow Button Icon
December 21, 2021, 12:01 PM ET

Farm machinery giant John Deere needed a huge amount of data to train its tractors to “see” like farmers. But lacking enough images, the company’s artificial intelligence sometimes failed to distinguish corn from invasive plants, like when spraying weed killer.

While A.I. can identify a weed called itchgrass in sunshine, it sometimes gets confused when it’s cloudy. To overcome that hurdle, John Deere trains its A.I. on synthetic photos—mathematical representations that only computers can understand—that show the weed under noonday sun, gray skies, and damp with rain.

Using images created by computers helps eliminate the “need to take a picture of every single possible weed” under every condition, says Julian Sanchez, John Deere’s director of emerging technology.

Synthetic data, a relatively new development, may be key to A.I.’s ultimate success, some A.I. researchers say. The excitement is so great that a cottage industry has emerged to sell synthetic data services to business customers. In addition to imagery, they offer synthetic versions of data typically found in corporate spreadsheets.

But it’s still too early to say whether those synthetic data vendors will succeed, several industry experts tell Fortune. It’s also unclear whether the technology itself is good enough to improve machine-learning models.

This data looks real, but it’s fake

Unity Technologies is best known for its software, which developers use to create video games. But recently, the company, which went public in 2020, opened a new business unit for creating synthetic visuals for businesses to feed into computer-vision systems.

Manufacturers, for instance, could create synthetic visuals to help train technology that automatically scans merchandise on the factory floor for flaws, says Danny Lange, Unity’s senior vice president of artificial intelligence. A Unity spokesperson says that Boeing has used Unity’s technology to create synthetic images for training software used by Boeing mechanics to spot production flaws in airplanes. 

Lange says that companies hate having to pay third parties $50,000 to add labels to training data so that machine-learning models can understand it when, in many cases, the models end up not working as well as hoped. This failure ends up forcing companies to spend even more on labeling.

Rendered.ai, a startup that just raised $6 million to help it grow its synthetic data business, makes tools for developers to create their own artificial imagery more easily, explains its CEO, Nathan Kundtz. X-ray technology company Quadridox used Rendered.ai’s software to generate fake airport X-ray scans of passenger bags that included images of banned weapons like explosives, guns, and knives.

This set of synthetic X-ray airport scans, which were spawned from an original data set of X-ray baggage scans, was used to improve Quadridox’s security screening technology, Kundtz says. In a real-world data set, he says, companies like Quadridox may only have access to real imagery of five different knives, but that “in the synthetic world, I can create an infinite number of variations,” he explains.

One synthetic photo resembles an X-ray scan of luggage, in which a handgun is located in the center of the image as if it were placed by a careless criminal. Thousands more could be created showing the weapon and others like it hidden in different parts of the luggage.

Another startup, Mostly AI, specializes in technology to help companies generate synthetic financial data sets, such as customer sales records. The idea behind the Austria-based company is that clients can create fake versions of their existing customer data to avoid violating Europe’s tough GDPR privacy laws.

Over the years, one unnamed European telecommunication client amassed an enormous amount of customer data, says Alexandra Ebert, chief trust officer at Mostly AI. But because many of the people on the list never consented to have their data collected, the client couldn’t use much of the information.

By creating synthetic data derived from the original data set for training its A.I., the company was able to do tasks like predict customer churn—business jargon for the number of people who would cancel their service in the future—without breaking GDPR rules, Ebert says.

“They could better cater to the needs of clients without risking privacy or being in conflict with privacy legislation,” she says.

Is it still too early for synthetic data?

Despite the enthusiasm over synthetic data, some experts are concerned about how useful the technology currently is. Sumit Agarwal, a senior analyst at Gartner who specializes in A.I., says that “the idea of synthetic data is very, very promising,” and that banks or health care companies could use artificial data to protect customer and patient privacy when they require data to develop machine-learning algorithms. But he says that startups offering synthetic data generating services need several more years to improve their technology and show that it works outside a few test cases.

“I think there’s a lot of work that has to happen for them to be in the state where companies can use them without any support,” Agarwal said of heavily regulated businesses using synthetic data services.

Duke Energy chief information officer Bonnie Titone said during a recent Fortune Brainstorm A.I. online event that its experiments using synthetic data resulted in A.I. systems that weren’t very good.

“We find it better to take our own data and kind of obfuscate it or use it in a non-identifiable manner, because there’s so much power in the information that was real life, whether it’s your customers or internal operations,” Titone said.

Some venture capitalists are hesitant to invest in synthetic data startups, particularly those selling services to create artificial financial data sets. Ariel Tseitlin, an investor with Scale Venture Partners, says he hasn’t yet found a compelling startup that creates synthetic data, and that there are other data anonymizing techniques that companies can already use to ensure that their real data is compliant with privacy laws.

“I don’t think that’s a great use case for synthetic data generation,” Tseitlin says.

Additionally, businesses with a bit of technical acumen can create their own fake data instead of using third parties. John Deere’s tech team, for instance, created its own synthetic images of crops and weeds to use.

Despite all these caveats, many businesses are optimistic about synthetic data’s potential.

Self-driving car companies like Waymo continue to use synthetic data that is gathered by “driving” their autonomous vehicles through virtual cities and roads, as a way to improve their A.I. systems. And American Express is creating synthetic financial data as part of its cybersecurity research. These companies are all hoping that the projects will eventually pay off.

As John Deere’s Sanchez says, his company will eventually test how well A.I. systems trained on synthetic data work compared with those using real-life data. Ultimately, he says, it will show “whether this form of sausage making works out and gives us some efficiency.”

Subscribe to Fortune Daily to get essential business stories delivered straight to your inbox each morning.

About the Author
By Jonathan Vanian
LinkedIn iconTwitter icon

Jonathan Vanian is a former Fortune reporter. He covered business technology, cybersecurity, artificial intelligence, data privacy, and other topics.

See full bioRight Arrow Button Icon

Latest in Tech

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • Future 50
  • World’s Most Admired Companies
  • See All Rankings
Sections
  • Finance
  • Leadership
  • Success
  • Tech
  • Asia
  • Europe
  • Environment
  • Fortune Crypto
  • Health
  • Retail
  • Lifestyle
  • Politics
  • Newsletters
  • Magazine
  • Features
  • Commentary
  • Mpw
  • CEO Initiative
  • Conferences
  • Personal Finance
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
About Us
  • About Us
  • Editorial Calendar
  • Press Center
  • Work At Fortune
  • Diversity And Inclusion
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in Tech

U.S. President Donald Trump speaks to the press, saying he's talking to NATO about Greenland, before he departs the White House en route Palm Beach, Florida on January 16, 2026, in Washington DC, United States.
PoliticsGreenland
The weak business case for Trump acquiring Greenland: a $1 trillion price tag and few returns for two decades
By Jordan BlumJanuary 17, 2026
18 hours ago
boardroom
CommentaryCorporate Governance
When AI decides how shareholders vote, boards need to rethink governance
By Jane SadowskyJanuary 17, 2026
19 hours ago
The CEO of Informatica, Amit Walia
SuccessCareers
Like DoorDash and Google’s CEOs, $7.6 billion Informatica boss is a McKinsey alum—he says being ‘pushed around’ by smart consultants helped him grow
By Emma BurleighJanuary 17, 2026
20 hours ago
photo of western union store
CryptoCryptocurrency
Stablecoins will shake up the $900 billion remittance market—setting up a fight between crypto firms and legacy brands like Western Union
By Carlos GarciaJanuary 17, 2026
21 hours ago
InnovationThe Boring Company
Exclusive: Elon Musk’s Boring Co. is studying a tunnel project to Tesla Gigafactory near Reno
By Jessica MathewsJanuary 16, 2026
1 day ago
AIOpenAI
ChatGPT tests ads as a new era of AI begins
By Sharon GoldmanJanuary 16, 2026
1 day ago

Most Popular

placeholder alt text
Newsletters
The oil CEO who stood up to Trump is a follower of the disciplined 'Exxon way' and has a history of blunt statements
By Jordan BlumJanuary 13, 2026
5 days ago
placeholder alt text
Politics
The Nobel Prize committee doesn't want Trump getting one, even as a gift—but they treated Obama very differently
By Nick LichtenbergJanuary 16, 2026
1 day ago
placeholder alt text
Banking
'Absolutely, positively no chance, no way, no how, for any reason': Dimon says he'd never run the Fed but 'would take the call' to lead Treasury
By Jacqueline MunisJanuary 16, 2026
2 days ago
placeholder alt text
Economy
America’s $38 trillion national debt is so big the nearly $1 trillion interest payment will be larger than Medicare soon
By Shawn TullyJanuary 15, 2026
3 days ago
placeholder alt text
Success
Jensen Huang tells Stanford students their high expectations may make it hard for them to succeed: 'I wish upon you ample doses of pain and suffering'
By Orianna Rosa RoyleJanuary 16, 2026
2 days ago
placeholder alt text
Innovation
Exclusive: Elon Musk’s Boring Co. is studying a tunnel project to Tesla Gigafactory near Reno
By Jessica MathewsJanuary 16, 2026
1 day ago

© 2025 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.