Ep 14 - Interp, latent robustness, RLHF limitations w/ Stephen Casper (PhD AI researcher, MIT)

Artificial General Intelligence (AGI) Show with Soroush Pour

CHAPTER

Exploring AI Safety and Risks in the Field

This chapter covers technical aspects of AI safety: tools for evaluations, robustness issues in AI systems, and methods for making those systems safer. The speakers reflect on how they entered the AI field, what motivates their work on AI safety, and how their perception of AI risk has changed over time. They also discuss the importance of avoiding catastrophic events caused by AI technology and concerns that autonomous agents could be used maliciously.
