Changelog Master Feed cover image

Democratizing ML for speech (Practical AI #164)

Changelog Master Feed

00:00

The Multilingual Simplified Words Corpus

The multi lingual spoken words corpus is a data set generator, so you can sort of dial your own key words. Because of that capability, it's useful to know what parts of speech are there, and what are sort of the content types in the topics and domains. One of the things we did was randomly sampled about five thousand hours to find out what kind of background noise we had. And ultimately, is a dat a scientist just getting this is good.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app