Training AI to read your lips — in multiple languages
Nov 30, 2022
04:09
While widely used speech recognition tools like Siri or Otter generally analyze audio alone, researchers have also made progress in developing visual speech recognition (VSR) models, which rely on visual input to identify what a speaker is saying.