

Cloning voices with Coqui
Jul 12, 2022
Josh Meyer, co-founder of Coqui, discusses the exciting world of voice cloning technology, highlighting how it can infuse emotional expression into AI-generated voices. He emphasizes the importance of community collaboration in speech technology, especially where marginalized languages and ethical considerations are concerned. The conversation touches on the transformative influence of speech tech in everyday applications, and Meyer reassures listeners that machine learning is here to enhance human roles rather than replace them. He also champions the idea that anyone can contribute to open-source projects, regardless of technical skill.
AI Snips
Shifting Interfaces
- Voice interfaces are becoming the primary computing interface for many people worldwide.
- This shift changes how we interact with technology, moving from the traditional keyboard and mouse to voice.
Augmenting Humans
- Machine learning and voice tech should augment human tasks, not replace humans entirely.
- Focus on using technology to improve tedious or difficult aspects of human work, like parallel parking.
Voice Assistant Adoption
- Josh Meyer recently started using an Apple Watch with voice control for its convenience while walking his dog.
- He finds the immediacy of voice assistants crucial but abandons them if the response is slow.