2024 in Vision [LS Live @ NeurIPS]

Latent Space: The AI Engineer Podcast

Innovations in Video Generation: Techniques and Model Architectures

This chapter covers techniques for video generation, focusing on LLM captioning and diffusion models. It examines video quality optimization and the evolution of model architectures, emphasizing the shift from autoregressive models to diffusion transformers.
