Artificial General Intelligence (AGI) Show with Soroush Pour cover image

Ep 10 - Accelerated training to become an AI safety researcher w/ Ryan Kidd (Co-Director, MATS)

Artificial General Intelligence (AGI) Show with Soroush Pour

CHAPTER

Enhancing AI Training Signals and Aligning Models

The chapter discusses the challenges of aligning superhuman AI models and the limitations of reinforcement learning from human feedback. It emphasizes the need for scalable methods to align models and highlights the risks of undetected misbehavior in AI due to inadequate training.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner