Eye On A.I. cover image

#151 Asa Cooper: How Will We Know If AI Is Fooling Us?

Eye On A.I.

00:00

Automating Reinforcement Learning from Human Feedback

This chapter explores the concept of automating RLHF (Reinforcement Learning from Human Feedback) with AI, discussing the limitations of relying solely on human feedback and the lab's work on AI safety through debates. The speaker also discusses alternative architectures and avenues to achieve higher intelligence in machines.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app