

The Challenge with Voice Agents
Feb 22, 2025
Paul van der Boor, VP of AI at Prosus Group, shares his experience with voice AI applications, while Floris Fok, an AI Engineer at the same company, digs into the technical hurdles of working with OpenAI’s real-time voice API. They discuss real-time voice interactions in demanding settings such as food delivery in Brazil and explore healthcare applications aimed at improving triage accuracy. They highlight challenges such as language processing and turn detection, and look at ongoing improvements and the future of voice technology.
AI Snips
Real-Time Voice Interaction
- Synthetic voice generation opens up new interface possibilities for AI.
- Real-time voice interaction is an important step, especially for B2C agents.
iFood's Real-World Voice AI Test
- iFood tested voice AI with delivery drivers in São Paulo, Brazil, under real-world conditions.
- The test probed the model's ability to handle accents, noise, and stress, pushing the limits of the technology.
Memory in Voice AI
- Voice AI models face memory challenges similar to text-based models.
- Context window limitations and prompt engineering are key considerations for maintaining conversation flow.
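
One common way to work within a context window is to keep the system prompt and drop the oldest turns once a token budget is exceeded. The sketch below illustrates that idea only; it is not the approach described in the episode and does not use the OpenAI API. The names `Turn`, `estimate_tokens`, and `trim_history` are hypothetical, and the four-characters-per-token estimate is a rough assumption standing in for a real tokenizer.

```python
from dataclasses import dataclass


@dataclass
class Turn:
    role: str  # "system", "user", or "assistant"
    text: str  # transcript of what was said in this turn


def estimate_tokens(text: str) -> int:
    # Assumption: roughly 4 characters per token; swap in a real tokenizer.
    return max(1, len(text) // 4)


def trim_history(turns: list[Turn], budget: int) -> list[Turn]:
    """Keep every system turn plus the newest turns that fit in `budget`."""
    system = [t for t in turns if t.role == "system"]
    remaining = budget - sum(estimate_tokens(t.text) for t in system)

    kept: list[Turn] = []
    # Walk backwards from the newest turn and stop at the first one
    # that no longer fits, so the kept history stays contiguous.
    for turn in reversed([t for t in turns if t.role != "system"]):
        cost = estimate_tokens(turn.text)
        if cost > remaining:
            break
        kept.append(turn)
        remaining -= cost

    # Restore chronological order before returning.
    return system + kept[::-1]
```

Calling `trim_history(history, budget=2000)` before each model request would keep the prompt within a fixed size at the cost of forgetting early turns. A summary of the dropped turns could be added as an extra system turn if that context still matters.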