5min chapter

Lex Fridman Podcast cover image

#110 – Jitendra Malik: Computer Vision

Lex Fridman Podcast

CHAPTER

Six Lessons That We Can Learn From Children of Being Multimodal

So, speaking of neural networks, how much of this problem of computer vision can be, have a reconstruction? How much of it can be learned end to end, do you think, sort of set it and forget it. Have a giant data set, multiple, perhaps multimodal, and then just learn the entirety of it. So, I think that currently what that end to end learning means nowadays is end to end supervised learning. And that I would argue is a two narrow view of the problem. It's one where there are certain capabilities that are built up and then there arecertain capabilities which are built up on top of that. That's, that's what I believe in.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode