3min chapter

Me, myself and AI

Google DeepMind: The Podcast

CHAPTER

How to Co-Train Voices Together

When DeepMind launched WaveNet in 2016, you needed about four hours' worth of audio samples from a person to model how their voice sounds. But now you can do it with just a few minutes' worth of audio. Google has built an enormous data set with professional voice actors reading out the same text. The model learns from all of these samples how particular words are pronounced. Now the third and final part is the acoustic modelling. Acoustic modelling focuses on who it sounds like. If I pretend to sound like my brother on the phone, it still sounds like me. My friend will be able to tell. If I say the sentence with a different tone of voice, you
