Artificial General Intelligence (AGI) Show with Soroush Pour cover image

Ep 10 - Accelerated training to become an AI safety researcher w/ Ryan Kidd (Co-Director, MATS)

Artificial General Intelligence (AGI) Show with Soroush Pour

CHAPTER

Exploring Human Feedback Mechanisms in AI Safety Research

Delving into the limitations and potential of using human feedback to guide AI systems towards safer behaviors. Highlighting the gaps in the alignment ecosystem and the necessity for research on interpretability and safety guarantees. Discussing advanced AI safety techniques and the importance of collaboration with key researchers to address gaps in the field.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner