O3 and the Next Leap in Reasoning with OpenAI’s Eric Mitchell and Brandon McKinzie

No Priors: Artificial Intelligence | Technology | Startups

Navigating Challenges in Reinforcement Learning Infrastructure

This chapter explores the complexities of training reinforcement learning models at OpenAI, focusing on the challenges of asynchronous learning in a large-scale environment. It highlights the difficulties of managing infrastructure failures and offers insights on maintaining model integrity amid unexpected breakdowns.
