Four: Rohin Shah on DeepMind and trying to fairly hear out both AI doomers and doubters

The 80000 Hours Podcast on Artificial Intelligence

Developing Prompts for GPT and Reinforcement Learning from Human Feedback

This chapter covers prompt development for GPT and reinforcement learning from human feedback (RLHF). It looks at how human raters evaluate different model outputs and at the skepticism about the model's reliability.
