Google’s DeepMind develops creepy, ultra-realistic human speech synthesis

We all become accustomed to the tone and pattern of human speech at an early age, and any deviations from what we have come to accept as “normal” are immediately recognizable. That’s why it has been so difficult to develop text-to-speech (TTS) that sounds authentically human. Google’s DeepMind AI research arm has turned its machine learning model on the problem, and the resulting “WaveNet” platform has produced some amazing (and slightly creepy) results. Google and other companies have made huge advances in making human speech understandable by machines, but making the reply sound realistic has proven more challenging. Most TTS systems are based on so-called concatenative technologies. This relies upon a database of speech fragments that are combined to form words. This tends to sound rather uneven and has odd…


Link to Full Article: Google’s DeepMind develops creepy, ultra-realistic human speech synthesis

Pin It on Pinterest

Share This

Join Our Newsletter

Sign up to our mailing list to receive the latest news and updates about homeAI.info and the Informed.AI Network of AI related websites which includes Events.AI, Neurons.AI, Awards.AI, and Vocation.AI

You have Successfully Subscribed!