The 80000 Hours Podcast on Artificial Intelligence cover image

Four: Rohin Shah on DeepMind and trying to fairly hear out both AI doomers and doubters

The 80000 Hours Podcast on Artificial Intelligence

CHAPTER

Developing Prompts for GPT and Reinforcement Learning from Human Feedback

This chapter discusses the process of developing prompts for GPT and reinforcement learning from human feedback. It explores the use of human raters to evaluate different outputs of the model and the skepticism around the model's reliability.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner