17min chapter

Artificial General Intelligence (AGI) Show with Soroush Pour cover image

Ep 14 - Interp, latent robustness, RLHF limitations w/ Stephen Casper (PhD AI researcher, MIT)

Artificial General Intelligence (AGI) Show with Soroush Pour

CHAPTER

Exploring AI Safety and Risks in the Field

The chapter delves into the technical aspects of AI safety, discussing tools for evaluations, robustness issues in AI systems, and methods to enhance their safety. The speakers reflect on their journey into the AI field, motivations for working on AI safety, and evolving perceptions of AI risks over time. They explore the importance of avoiding catastrophic events caused by AI technology and concerns regarding autonomous agents potentially being used maliciously.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode