• Home
  • Latest
  • Fortune 500
  • Finance
  • Tech
  • Leadership
  • Lifestyle
  • Rankings
  • Multimedia
Tech

Google’s DeepMind Claims Massive Progress in Synthesized Speech

By
David Meyer
David Meyer
Down Arrow Button Icon
By
David Meyer
David Meyer
Down Arrow Button Icon
September 9, 2016, 4:26 AM ET
Photograph by Getty Images

Researchers at Google’s DeepMind artificial intelligence division claim to have come up with a way of producing much more natural-sounding synthesized speech, compared with the techniques that are currently in use.

Existing text-to-speech (TTS) systems tend to use a system called concatenative TTS, where the audio is generated by recombining fragments of recorded speech. There’s also a technique called parametric TTS that generates speech by passing information through a vocoder, but that sounds even less natural.

So DeepMind has come up with a new technique called WaveNet that learns from the audio it’s fed, and produces raw audio sample-by-sample. To give an idea of how detailed that is, we’re talking at least 16,000 samples per second.

Get Data Sheet, Fortune’s technology newsletter.

A WaveNet is a “neural network”—essentially an artificial brain—that is trained on real waveforms and then uses statistics to choose which samples of that audio to use when “speaking,” piece by piece.

“Building up samples one step at a time like this is computationally expensive, but we have found it essential for generating complex, realistic-sounding audio,” DeepMind’s researchers said in a post about their findings.

That post is well worth checking out, as it includes several clips of the same pieces of text, read out by different speech synthesis techniques. For both U.S. English and Mandarin Chinese, the WaveNet-generated audio is noticeably more realistic than that produced by concatenative TTS.

DeepMind claimed that blind tests with human subjects showed the WaveNet audio to be at least 50% closer to real human speech—though of course such tests are subjective.

For more on DeepMind, watch our video.

DeepMind’s researchers said they would be able to add emotions and accents as inputs, to make the speech sound even more realistic.

Fascinatingly, WaveNets can generate speech without text—or at least, what the neural networks think speech should sound like. As the clips show, these are word-like sounds that mean nothing, and they’re rather creepy.

The same techniques can also be used to create non-speech audio. The post includes clips of the “music” generated by WaveNets that were trained on classical music—again, a good approximation of actual music that might get away with it if you’re not listening too closely.

Again, all this requires an awful lot of computational power and isn’t being used in any real-world applications just yet. But as is always the case with such things, it’s really just a matter of time before computers will be able to sound extremely human.

About the Author
By David Meyer
LinkedIn icon
See full bioRight Arrow Button Icon

Latest in Tech

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025

Most Popular

Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Finance
Lorem ipsum dolor sit amet, consectetur adipiscing elit, sed do eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam
By Fortune Editors
October 20, 2025
Fortune Secondary Logo
Rankings
  • 100 Best Companies
  • Fortune 500
  • Global 500
  • Fortune 500 Europe
  • Most Powerful Women
  • World's Most Admired Companies
  • See All Rankings
  • Lists Calendar
Sections
  • Finance
  • Fortune Crypto
  • Features
  • Leadership
  • Health
  • Commentary
  • Success
  • Retail
  • Mpw
  • Tech
  • Lifestyle
  • CEO Initiative
  • Asia
  • Politics
  • Conferences
  • Europe
  • Newsletters
  • Personal Finance
  • Environment
  • Magazine
  • Education
Customer Support
  • Frequently Asked Questions
  • Customer Service Portal
  • Privacy Policy
  • Terms Of Use
  • Single Issues For Purchase
  • International Print
Commercial Services
  • Advertising
  • Fortune Brand Studio
  • Fortune Analytics
  • Fortune Conferences
  • Business Development
  • Group Subscriptions
About Us
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • About Us
  • Press Center
  • Work At Fortune
  • Terms And Conditions
  • Site Map
  • Facebook icon
  • Twitter icon
  • LinkedIn icon
  • Instagram icon
  • Pinterest icon

Latest in Tech

Man in suit coat with hands gesturing
Investingtech stocks
Supermicro CEO insists ‘no one’ beyond indicted employees were involved in alleged $2.5 billion smuggling scheme
By Amanda GerutMay 5, 2026
3 hours ago
Gen Alpha is using makeup to pass age verification tech online. One mom caught her son using an eyebrow pencil
CybersecuritySocial Media
Gen Alpha is using makeup to pass age verification tech online. One mom caught her son using an eyebrow pencil
By Catherina GioinoMay 5, 2026
9 hours ago
OpenAI cofounder and president Greg Brockman (left) and cofounder and CEO Sam Altman (right) dressed in suits and walking through the lobby of a court house.
NewslettersEye on AI
Musk’s court fight against OpenAI produces more heat than light on the control of advanced AI
By Jeremy KahnMay 5, 2026
9 hours ago
dimon, amodei
Cybersecuritycyber
Jamie Dimon and Dario Amodei sidestep question about whether the AI cyber ‘freakout’ is warranted
By Nick LichtenbergMay 5, 2026
10 hours ago
dario
Economydisruption
Dario Amodei spent last year warning of an AI white-collar bloodbath. Now he’s changing the narrative
By Nick LichtenbergMay 5, 2026
10 hours ago
Mark Zuckerberg
LawMeta
James Patterson, Biden publishers say Mark Zuckerberg ‘personally authorized’ copyright infringement in new lawsuit against Meta
By Hillel Italie and The Associated PressMay 5, 2026
10 hours ago

Most Popular

Clean energy's winning argument is the one it refuses to make
Commentary
Clean energy's winning argument is the one it refuses to make
By David CraneMay 5, 2026
18 hours ago
Current price of oil as of May 5, 2026
Personal Finance
Current price of oil as of May 5, 2026
By Joseph HostetlerMay 5, 2026
16 hours ago
Diary of a CEO founder says he hired someone with 'zero' work experience because she 'thanked the security guard by name' before the interview
Success
Diary of a CEO founder says he hired someone with 'zero' work experience because she 'thanked the security guard by name' before the interview
By Emma BurleighMay 3, 2026
3 days ago
Gen Z workers say showing up 10 minutes late to work is as good as on time—but baby boomer bosses have zero tolerance for tardiness, research reveals
Success
Gen Z workers say showing up 10 minutes late to work is as good as on time—but baby boomer bosses have zero tolerance for tardiness, research reveals
By Orianna Rosa RoyleMay 5, 2026
15 hours ago
China stopped issuing new robotaxi licenses over a glitch. America can't stop them from rolling into active shooter situations
Law
China stopped issuing new robotaxi licenses over a glitch. America can't stop them from rolling into active shooter situations
By Catherina GioinoMay 4, 2026
1 day ago
Current price of silver as of Monday, May 4, 2026
Personal Finance
Current price of silver as of Monday, May 4, 2026
By Joseph HostetlerMay 4, 2026
2 days ago

© 2026 Fortune Media IP Limited. All Rights Reserved. Use of this site constitutes acceptance of our Terms of Use and Privacy Policy | CA Notice at Collection and Privacy Notice | Do Not Sell/Share My Personal Information
FORTUNE is a trademark of Fortune Media IP Limited, registered in the U.S. and other countries. FORTUNE may receive compensation for some links to products and services on this website. Offers may be subject to change without notice.