

EP 435: How 50X cheaper & faster AI transcription is changing enterprise work
Jan 8, 2025
Philip Kiely, Head of Developer Relations at Baseten, dives into the world of AI transcription and its game-changing potential for enterprises. He highlights the cost reductions and efficiency gains with AI tools like the Whisper model. Kiely discusses how AI can transform audio into actionable insights while also stressing the importance of human verification for accuracy. The conversation also touches on the future of voice technology and invites creative thinking about innovative applications of transcription in business settings.
AI Snips
Chapters
Transcript
Episode notes
Transcription Benefits
- Transcribing audio data unlocks valuable insights and searchability.
- It converts low-bandwidth audio into easily processed text data for both humans and machines.
Manual Transcription Woes
- Philip Kiely had to manually transcribe interviews for a book due to the poor quality of existing transcription technology.
- Whisper's 2022 release was a game-changer with its high accuracy and multilingual capabilities.
Whisper's Speed Advancements
- Whisper's speed improvements enable near-instant transcription, achieving real-time factors of up to 1000x.
- This allows for processing an hour of audio in mere seconds.