Training Data cover image

OpenAI Sora 2 Team: How Generative Video Will Unlock Creativity and World Models

Training Data

00:00

Data Choices and Pretraining at Scale

Bill discusses data mix decisions, video token density, and the abundance of video for pretraining.

Play episode from 10:44
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app