Lenny's Podcast: Product | Career | Growth cover image

The 100-person AI lab that became Anthropic and Google's secret weapon | Edwin Chen (Surge AI)

Lenny's Podcast: Product | Career | Growth

00:00

The evolution of post-training techniques

Edwin maps SFT to RLHF to rubrics/verifiers and now RL environments as complementary learning stages.

Play episode from 41:34
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app