2024 in Vision [LS Live @ NeurIPS]

Latent Space: The AI Engineer Podcast

Exploring Transformative Trends in Computer Vision for 2024

This chapter explores the shift from image-based models to video-based frameworks in computer vision, highlighting key work such as Sora and DETRs. It also discusses the innovations MAGVIT brought to video generation, along with advances in real-time object detection.
