
Smart Speakers Podcast

Latest episodes

Apr 7, 2025 • 43min

The 500ms Dash—Nikhil Gupta, VAPI

Nikhil Gupta is the cofounder and CTO of VAPI. He’s been in the trenches building and scaling one of the biggest voice platforms in the world.

On this episode, he explains how VAPI aims to create a voice-default future where we talk to all our computers, and goes deep into every step of VAPI’s voice pipelines and the technical challenges along the way.

Some highlights:

☎️ Massive scale: VAPI has processed 44 million voice calls on their platform, growing from a COVID-era one-click Zoom meeting button to a full voice infrastructure company used by thousands of developers.
⚡️ Latency matters: Voice AI needs to respond within 500 milliseconds to feel natural to humans. That means cleaning audio, detecting when users are done speaking, transcribing text, generating responses, converting to speech, and handling interruptions, all within a fraction of a second.
🗣️ Voice-first future: Nikhil is betting on a future where voice becomes our default interface with all computing systems.

If you’ve ever wondered how a voice API actually works, this is the episode for you.

Chapters
00:00 - Introducing VAPI
04:42 - Pivoting through COVID
05:42 - ChatGPT existential crisis
08:33 - Technical challenges of voice
12:42 - Anatomy of a voice call
14:46 - Knowing when someone is done speaking
18:37 - Routing to the fastest model
22:07 - Knowledge and context injection
26:47 - The text-to-speech bottleneck
31:14 - Handling interruptions gracefully
33:43 - The 500-millisecond barrier
36:56 - The DNS latency discovery
39:25 - Scaling the team and what's next

Links
* VAPI
* Nikhil Gupta on LinkedIn
* VAPI on Product Hunt
* Stratechery by Ben Thompson - Recommended reading from Nikhil

This is a public episode. If you would like to discuss this with other subscribers or get access to bonus episodes, visit www.smartspeakers.fm
Mar 31, 2025 • 40min

Your AI interviewer will see you now—Varun Khurana, Wayfaster

Varun Khurana wants every job candidate to have the best possible chance to prove themselves for every job they apply to. He’s been in the trenches building with voice AI for 2+ years, and had some great nuggets to share:

* 💬 How Wayfaster started: "Wayfaster started out as a way to interview your entire pipeline in a couple seconds... the only way you can really do that is using voice AI."
* ⛔️ Why AI video avatars are a no-go: "We tried avatars and every candidate hated it... I think it just makes candidates feel dumb. It's just like, why am I talking to an avatar? Like how stupid do you think I am?"
* 📑 How AI interviews give candidates a chance: "Candidates have just gotten really accustomed to getting this default auto-reject email... At least here, I know that if I do a good job on the interview, I have a shot at this opportunity."
* 🐻 Why he’s bearish on consumer voice AI: "I'm actually a little more bearish on the consumer voice AI use cases. I like voice AI in constrained environments, like B2B, there's an amount of intent... I don't think people are going to want to be talking to their phones all the time."

Hope you enjoy the conversation! As always you can subscribe at https://smartspeakers.fm.

Chapters
0:00 Welcome and intro
1:32 First big AI aha moment
5:08 Career background
7:40 Exploring startup ideas
10:34 How WayFaster was born
14:21 Initial target markets
16:45 Candidate response to AI interviews
19:07 Benefits for recruiters and candidates
23:30 How AI is transforming recruiting
26:51 The two-sided recruiting game
31:05 Why Varun is bearish on consumer voice AI
35:09 Future generations and AI adoption
38:09 Content recommendations and closing

Links
* WayFaster Website: wayfaster.com
* Varun Khurana on LinkedIn: https://www.linkedin.com/in/vkhurana2/
* South Park Commons
* Deepgram
* Charles Rubenfeld's Newsletter
Mar 24, 2025 • 57min

The pipeline for voice—Kwindla Hultman Kramer, CEO of Daily

Much of my childhood was spent in my basement, dialing up other people’s computers to trade messages and play games.

Kwin Kramer from Daily remembers that time, too, and says today's voice AI moment feels just like those early internet days. That sense of endless possibilities is back.

His open-source project PipeCat has become the standard toolkit for voice agents. What began as an experiment now powers voice AI for OpenAI, Google DeepMind, and countless startups, making conversations feel natural and responsive.

Some highlights:

* That early internet feeling is back: "1995 to 1999 felt a certain way. It never felt that way again until 2023 to 2025."
* GPT-4 transformed Daily's business by removing a key bottleneck: "Previously you needed two humans for a conversation. Now you only need one, maybe not even that."
* Voice AI's killer feature? Latency matters: "If response times are long, you're in that uncanny valley where people get uncomfortable."
* Kwin's bold prediction: "We're all going to have friends in our group chats that aren't human because LLMs are actually really entertaining."

Hope you enjoy it as much as we did.

Links
Daily: https://daily.co/
PipeCat: https://pipecat.ai/
Kwin on Twitter: https://twitter.com/kwindla

Chapters
0:00 Intro
2:02 First AI aha moment
5:43 MIT Media Lab beginnings
9:05 BBS and door games
15:26 The AllAfrica journey
18:54 Starting Daily
21:13 COVID's impact on WebRTC
22:36 GPT-4 transformation
31:26 Building voice for LLMs
35:17 PipeCat's key challenges
44:10 The future of speech-to-speech
47:10 Voice AI adoption trends
52:34 Vibe coding revolution
56:11 What's next for PipeCat
Mar 17, 2025 • 46min

The craft behind voice AI magic with Tom Shapland

This week, Tom Shapland shares his journey from agriculture tech to founding Canonical, a tool that helps voice AI developers understand and improve conversations by mapping call stages.

Tom has deep experience in the voice world. We loved this conversation and we’re sure you will too!

Links
https://canonical.chat
https://x.com/Tom_Shapland
https://www.linkedin.com/in/tom-shapland-b4494212/

Chapters
0:00 - Intro
1:05 - Welcome and first AI moment
3:03 - Computer vision for thirsty plants
5:50 - Explaining AI to farmers
7:52 - Hardware is hard
8:31 - Non-VC shaped business struggles
16:51 - Starting a new company
19:41 - From metrics to conversation stages
22:42 - Voice AI evolution
26:40 - Balancing determinism and freedom
29:53 - Who uses Canonical and when
32:39 - Don't make your agent spell anything
35:52 - Zero to 80% is easy, production is hard
39:28 - Future of voice AI adoption
42:21 - Book recommendations and closing
Mar 10, 2025 • 55min

Olivia Moore, a16z: Voice as a wedge

Olivia Moore, a Partner at Andreessen Horowitz and an expert on AI voice agents, discusses the transformative power of voice AI as it parallels the internet's evolution in the late '90s. She introduces the 'wedge' strategy for businesses, noting how voice AI is reshaping recruitment by enhancing candidate experiences. Olivia explores its compliance capabilities in regulated industries and critiques the dominance of big tech in specialized voice applications. She emphasizes a shift in pricing strategies as voice technology continues to redefine enterprise operations.
Mar 3, 2025 • 43min

Voice AI for Healthcare: Phil Markunas, Standard Practice

We chat with Phil Markunas about his wild journey from Army staff sergeant to voice AI founder.

Phil shares how Standard Practice evolved from a healthcare payment startup to building an AI assistant that handles complex insurance calls. Along the way, we learn about Phil's life in Japan, his Oreo obsession, and why he believes natural conversation is as hard as self-driving cars. I loved Phil’s concept of “success, but doof”: it neatly encapsulates just how hard it is to measure the success of AI voice calls.

Stay tuned next week for our chat with Olivia Moore from a16z. See you soon,
dave & harish

Follow Phil:
https://www.linkedin.com/in/phil-markunas
https://philm.io/
https://x.com/philmarkunas

Standard Practice:
https://standardpractice.ai

Chapters
0:00 - Introduction to Phil Markunas, CTO of Standard Practice
1:34 - AI icebreaker and Phil's emotional response to voice AI & Oreo obsession
5:17 - Military background and how Army service shaped Phil's people-first leadership
8:07 - Global citizen with 60+ moves and life in Japan
13:04 - The Nibble Health origin story and creating a medical bill payment card
17:59 - Facing challenges and developing the SimpleBuill medical analysis tool
22:07 - Pivoting to voice AI to solve the healthcare insurance call problem
26:25 - Why voice conversation is an AGI-level problem and deceptively complex
33:21 - Beyond "just prompt it up" to building sophisticated voice AI architecture
40:45 - "Success, but doof" moments and the future of Standard Practice

Subscribe and let us know what you think! https://smartspeakers.fm

This is a free preview of a paid episode. To hear more, visit www.smartspeakers.fm
Feb 24, 2025 • 48min

Episode 1: ChatGPT lies to us

Smart Speakers is a podcast about voice AI. Our first guest is, of course, an AI: ChatGPT in advanced voice mode.

We talk about:

* Our journey since selling Chartable to Spotify in 2022: life inside Spotify, and Harish’s explorations outside corporate life in construction and factory tech
* Starting to work together again last July
* Exploring projects: a Chartable-like analytics platform for audiobooks, a text-to-speech bookmarking tool called Earmark, and finally experiments with voice AI
* We interview ChatGPT, which is surprisingly good at podcast ad reads!
* Harish thinks ChatGPT lied to us at least twice. What do you think?

Stay tuned next week: Phil Markunas from Standard Practice discusses voice AI in healthcare billing.

Thanks for listening, reading, and watching. We’d love to hear your feedback! Subscribe and comment at https://smartspeakers.fm
Feb 5, 2025 • 43sec

Smart Speakers coming Monday Feb 24th!

Smart Speakers, the podcast about voice AI, is launching on Monday, February 24th! Subscribe now at https://smartspeakers.fm
