Kyle Corbitt
Co-founder and CEO of OpenPipe (recently acquired by CoreWeave), focused on reinforcement learning for agent reliability and continual learning for production agents.
Best podcasts with Kyle Corbitt
Ranked by the Snipd community
1,498 snips
Oct 16, 2025
Why RL Won — Kyle Corbitt, OpenPipe (acq. CoreWeave)
Kyle Corbitt, co-founder and CEO of OpenPipe, discusses the shift from fine-tuning to reinforcement learning in AI. He argues that many AI projects fail because of reliability problems rather than a lack of capability, and that continual learning can resolve those problems. Kyle introduces RULER, a system that simplifies reward assignment by using LLMs to judge agent behavior. He also critiques the practical drawbacks of GRPO and emphasizes the importance of realistic training environments for AI agents. Finally, he shares insights on the future of continual learning and serverless RL.