#159 – Jan Leike on OpenAI's massive push to make superintelligence safe in 4 years or less

80,000 Hours Podcast

OpenAI's Contribution to Reinforcement Learning from Human Feedback

OpenAI helped develop reinforcement learning from human feedback (RLHF) with deep learning systems, a method that has been central to making ChatGPT work effectively. The breakthrough came out of a collaboration between Paul Christiano, Dario Amodei, and the speaker.

