Interconnects

Interviewing Louis Castricato of Synth Labs and Eleuther AI on RLHF, Gemini Drama, DPO, founding Carper AI, preference data, reward models, and everything in between

Mar 4, 2024
Louis Castricato, a researcher at EleutherAI and founder of Synth Labs, dives deep into the fascinating world of RLHF. He explores the complexities of preference learning and the shift from PPO to DPO in reinforcement learning. The conversation highlights the challenges of biases in AI, especially regarding representation in training data. Castricato also shares insights on Gemini's impact on data safety, the evolution of model evaluation techniques, and the importance of collaborative efforts in advancing AI research.
Ask episode
Chapters
Transcript
Episode notes