#151 – Dan Kokotov: Speech Recognition with AI and Humans

Lex Fridman Podcast

CHAPTER

ASR, Automatic Speech Recognition?

Real time is pretty difficult. It's not an easy job. Right now, I think it's maybe 14% word error rate on the test suite that we generally use to measure accuracy for ASR. Most people think that, realistically, a word error rate of 2% or 3% would be the best achievable. So there's still quite a gap, right?

Would you say that, so YouTube, when I upload videos, often generates automatic captions. Are you trying to beat YouTube? Google, I don't know how seriously they take this task, but I imagine it's quite serious.
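
For readers unfamiliar with the metric mentioned above, word error rate is conventionally the number of word substitutions, deletions, and insertions needed to turn the ASR output into the reference transcript, divided by the number of words in the reference. The following is a minimal sketch of that standard definition, not the test suite or tooling discussed in the episode; the function name `word_error_rate` and the sample sentences are made up for illustration, and the lowercase-and-split normalization is an assumption.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the reference length."""
    # Assumed normalization: lowercase and split on whitespace.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] is the edit distance between the reference words
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from hypothesis
                curr[j - 1] + 1,     # extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / len(ref)


# Hypothetical example: one substitution and one dropped word
# against a nine-word reference.
reference = "the quick brown fox jumps over the lazy dog"
hypothesis = "the quick brown fox jumped over lazy dog"
print(f"WER: {word_error_rate(reference, hypothesis):.1%}")
```

On this scale, a 14% word error rate corresponds to roughly one wrong word in seven, while 2% to 3% is about one in forty.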
