

EP97: Moore’s Law for AI agents, OpenAI's new audio models, o1-pro API & When Will AI Replace Us?
140 snips Mar 21, 2025
OpenAI's latest audio models are putting their pronunciation skills to the test, leading to some hilarious reactions. The podcast explores the balance of realism and accuracy in AI voice synthesis, while also diving into the financial implications of using these advanced models. There's a chaotic but amusing take on ambitious publicity stunts and the looming impact of AI on job security. Amid the serious topics, light-hearted merchandise discussions add a whimsical touch, revealing the quirky side of AI advancements.
AI Snips
Chapters
Transcript
Episode notes
Australian Accent Test
- OpenAI's new audio model was tested with a complex Australian script.
- It performed well, pronouncing difficult Aboriginal names like Kosciuszko.
Voice Inconsistency
- OpenAI's text-to-speech model's voice lacks consistency.
- Each run produces a completely different voice, impacting applications needing consistent pronunciation.
Text-to-Speech for Non-Real-Time
- Prioritize OpenAI's text-to-speech models for non-real-time audio applications.
- These models offer affordability and reliable pronunciation, ideal for longer audio clips and narrations.