Nonhuman Vocal Algorithms

The DeepMind AI Can Say Sentences and Play Piano Without a Sound Library

Joey Haar — Sep 12, 2016 — Tech

As algorithmic voices become a more common part of daily life, with Siri, Alexa and Cortana feeling as familiar as friends, the DeepMind AI's WaveNet program is focusing on creating an authentic sounding voice that is completely original. In other words, while Siri, Alexa, Cortana and others rely on processing recordings of a real human's voice, DeepMind AI's WaveNet creates an original voice by learning from a host of recordings.

The difference between WaveNet and other common systems is that WaveNet doesn't just put together sounds from a database. Rather, it learns speech sounds and puts those sounds together on its own. Because of this learning process, the DeepMind AI program can be used for anything that involves sound, including piano music (a sampling of which Google included on its blog.)

Trend Themes

Algorithmic Voices — Opportunity for businesses to develop original and authentic algorithmic voices for virtual assistants and other applications.
Voice Learning — Disruptive innovation opportunity to create AI programs that can learn speech sounds and generate original voices.
Sound Generation AI — Emerging trend of AI programs capable of creating sounds, including piano music, without relying on pre-existing sound libraries.

Industry Implications

Virtual Assistant Technology — Potential for disruption in the virtual assistant industry with the development of algorithmic voices that are more realistic and personalized.
Speech Recognition and Synthesis — Opportunity for advancements in speech recognition and synthesis technologies as AI programs learn to generate original voices.
Music Composition and Production — Disruption in the music industry with the emergence of AI programs capable of generating original music compositions without the need for pre-existing sound libraries.

GET A CUSTOM REPORT SUBSCRIBE TO ADVISORY

Related Ideas

Similar Ideas