Can we tell if an AI is loyal by reading its mind? DeepMind's Neel Nanda (part 1)

80,000 Hours Podcast

00:00

Navigating a Career in Mechanistic Interpretability

This chapter offers career guidance for those interested in mechanistic interpretability, highlighting a shift towards more cautious, strategic decision-making in this evolving field. The discussion emphasizes personal fit, motivation, and the value of exploring diverse opportunities for individual development in AI.
