648: VALL-E: Uncannily Realistic Voice Imitation from a 3-Second Clip
Jan 27, 2023
09:51
Episode notes
Text-to-speech gets a groundbreaking update with Microsoft’s VALL-E. On this Five-Minute Friday, Jon Krohn investigates how the Microsoft team designed the model to imitate a person’s natural speech from just a three-second sample of their voice.