

Charbel-Raphaël
Podcast author and narrator who produced a link-post episode for LessWrong summarizing the Global Call for AI Red Lines presented at the UN General Assembly, providing concise narration and context for the episode's sourced article.
Best podcasts with Charbel-Raphaël
Ranked by the Snipd community

Aug 21, 2023 • 1h 19min
"Against Almost Every Theory of Impact of Interpretability" by Charbel-Raphaël
Charbel-Raphaël critiques nearly every proposed theory of impact for interpretability, questioning its practicality in industry. He discusses the limitations of pixel-attribution techniques and the need for accuracy, explores the challenges of interpreting AI models for deception detection, and advocates for cognitive emulation over traditional visualization methods as a route to transparency. Throughout, he emphasizes the importance of balancing safety and capabilities in AI alignment research.


