Get the app
Mahesh Sathiamoorthy
Co-founder and CEO of Bespoke Labs, working on using reinforcement learning (RL) to reshape how custom agents are built on top of foundation models.
Best podcasts with Mahesh Sathiamoorthy
Ranked by the Snipd community
281 snips
May 13, 2025
• 1h 1min
From Prompts to Policies: How RL Builds Better AI Agents with Mahesh Sathiamoorthy - #731
chevron_right
Mahesh Sathiamoorthy, co-founder and CEO of Bespoke Labs, dives into the innovative world of reinforcement learning (RL) and its impact on AI agents. He highlights the importance of data curation and evaluation, asserting that RL outperforms traditional prompting methods. The conversation touches on limitations of supervised fine-tuning, reward-shaping strategies, and specialized models like MiniCheck for hallucination detection. Mahesh also discusses tools like Curator and the exciting future of automated AI engineering, promising to make powerful solutions accessible to all.
The AI-powered Podcast Player
Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
Get the app