Evaluating Misalignment in AI Systems

This chapter discusses the need to evaluate different potential forms of misalignment in AI systems separately and proposes a roadmap for developing models that demonstrate various subcomponents of AI takeover.

Transcript

Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app