Kyle Corbitt

Co-founder and CEO of OpenPipe (recently acquired by CoreWeave), focused on reinforcement learning for agent reliability and continual learning for production agents.

Best podcasts with Kyle Corbitt

Ranked by the Snipd community

1,794 snips

Oct 16, 2025

Why RL Won — Kyle Corbitt, OpenPipe (acq. CoreWeave)

Kyle Corbitt, co-founder and CEO of OpenPipe, discusses the shift from fine-tuning to reinforcement learning in AI. He argues that many AI projects fail not for lack of capability but because of reliability issues, which continual learning can resolve. Kyle introduces RULER, a system that simplifies reward assignment by using LLMs to judge agent behaviors. He also critiques the practical drawbacks of GRPO and emphasizes the importance of realistic training environments for AI agents. Finally, he shares his views on the future of continual learning and serverless RL.
