The Cloudcast

Enabling Voice AI in Applications

35 snips
Jul 24, 2024
Scott Stephenson, CEO of Deepgram, discusses the evolution and business applications of Voice AI, exploring its value in applications and common use cases. He also delves into the process of adding Voice AI to existing applications and collecting user feedback for voice-centric interactions.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ANECDOTE

From Dark Matter To Audio AI

  • Scott Stephenson moved from building deep underground dark matter detectors to founding DeepGram after realizing audio signal processing skills transferred to speech AI.
  • He and his team recorded and analyzed waveforms continuously, which sparked the idea to index and search audio at scale.
INSIGHT

Voice AI Has Turned On

  • Voice AI has reached a turning point where quality, latency, and cognitive ability combine to make interactions feel real.
  • Perception (speech-to-text) is mature, and rapid improvements in TTS and LLMs now enable human-like, low-latency voice experiences.
INSIGHT

Cost Determines Real-World Viability

  • Cost parity with human labor is essential: voice AI must stay below approximate human-hour costs to be competitive.
  • Achieving low enough inference and orchestration cost is as important as technical quality for adoption.
Get the Snipd Podcast app to discover more snips from this episode
Get the app