Artificial General Intelligence (AGI) Show with Soroush Pour cover image

Ep 14 - Interp, latent robustness, RLHF limitations w/ Stephen Casper (PhD AI researcher, MIT)

Artificial General Intelligence (AGI) Show with Soroush Pour

00:00

Exploring AI Safety and Risks in the Field

The chapter delves into the technical aspects of AI safety, discussing tools for evaluations, robustness issues in AI systems, and methods to enhance their safety. The speakers reflect on their journey into the AI field, motivations for working on AI safety, and evolving perceptions of AI risks over time. They explore the importance of avoiding catastrophic events caused by AI technology and concerns regarding autonomous agents potentially being used maliciously.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app