14-minute chapter

Beyond Language: Inside a Hundred-Trillion-Token Video Model

AI + a16z

CHAPTER

Advancements in Fine-tuning 2D Models for 3D Representations

This chapter explores fine-tuning 2D models on multi-view images so that they learn how objects appear in different sizes, an approach that has led to advances in several domains. It discusses the shift from reasoning about 2D images to learning 3D knowledge from video, and how large-scale computation can capture complex effects in 3D scenes. The Dream Machine video model is highlighted for its stronger 3D reasoning and for the simplicity with which it avoids the challenges of traditional 3D capture methods.
