David Sacked by NYT, Sir Dylan Patel Joins, Kushner & Sama are Thriving | Ro Khanna, Jonathan Swerdlin, Cristóbal Valenzuela, Vincent Weisser, Ben Hylak, Alby Churven

TBPN

Post-training RL for Application-Specific Models

Vincent Weisser explains how RL environments let companies cheaply fine-tune models for application-specific capabilities.

Play episode from 02:52:10
