3min chapter

The Timeline for Realistic 4-D: Devi Parikh from Meta on Research Hurdles for Generative AI in Video and Multimodality

No Priors: Artificial Intelligence | Technology | Startups

CHAPTER

The Future of Video Generating

I think control sort of tends to lag behind the core capability. Even with images, I feel like we first had to get to a point where these models can actually generate nice looking images before we start worrying about whether it's really doing what I wanted it to do. The other potential part of output for videos, obviously, is text to speech or some sort of voice or other ways to sort of accompany the video or animate it. What is your view in terms of the state of the art of tech to speech systems and how those are evolving?

00:00

Transcript

Episode notes

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

3min chapter

The Timeline for Realistic 4-D: Devi Parikh from Meta on Research Hurdles for Generative AI in Video and Multimodality

No Priors: Artificial Intelligence | Technology | Startups

Get the Snipdpodcast app

AI-poweredpodcast player

Discoverhighlights

Save anymoment

Share& Export

AI-poweredpodcast player

Discoverhighlights

Get the Snipd
podcast app

AI-powered
podcast player

Discover
highlights

Save any
moment

Share
& Export

AI-powered
podcast player

Discover
highlights