The Swyx Mixtape

[AI] Behind ChatGPT: RLHF and the Proximal Policy Optimization - Practical AI

Jan 24, 2023