Why RL Won — Kyle Corbitt, OpenPipe (acq. CoreWeave)

Latent Space: The AI Engineer Podcast

GRPO: Pros, Cons, and Practical Limitations

Kyle explains GRPO's relative scoring benefits and why its need for reproducible parallel rollouts makes it impractical long-term.
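
As a rough illustration of the relative scoring mentioned above, here is a minimal Python sketch of the group-relative advantage commonly associated with GRPO: several rollouts are sampled for the same prompt, and each reward is normalized against the mean and standard deviation of its own group. The function name, the `eps` term, and the sample rewards are illustrative assumptions, not details taken from the episode.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Score each rollout relative to the other rollouts for the same prompt.

    `rewards` holds one scalar reward per rollout in the group.
    `eps` (an assumed small constant) avoids division by zero when
    every rollout in the group receives the same reward.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)  # population standard deviation of the group
    return [(r - mu) / (sigma + eps) for r in rewards]


# Hypothetical group of four rollouts for one prompt: two fail, two succeed.
rewards = [0.0, 1.0, 1.0, 0.0]
advantages = group_relative_advantages(rewards)
print(advantages)
```

Because each score only has meaning relative to sibling rollouts started from the same prompt and state, the whole group has to be run in parallel from a reproducible starting point, which is the practical limitation the summary refers to.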
