Artificial General Intelligence (AGI) Show with Soroush Pour cover image

Ep 10 - Accelerated training to become an AI safety researcher w/ Ryan Kidd (Co-Director, MATS)

Artificial General Intelligence (AGI) Show with Soroush Pour

00:00

Enhancing AI Training Signals and Aligning Models

The chapter discusses the challenges of aligning superhuman AI models and the limitations of reinforcement learning from human feedback. It emphasizes the need for scalable methods to align models and highlights the risks of undetected misbehavior in AI due to inadequate training.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app