2024 in Vision [LS Live @ NeurIPS]

Latent Space: The AI Engineer Podcast

CHAPTER

Innovations in Video Generation: Techniques and Model Architectures

This chapter explores video generation methods, focusing on LLM captioning and diffusion models. It examines techniques for optimizing video quality and the evolution of model architectures, emphasizing the shift from autoregressive models to diffusion transformers.
