AI Safety Fundamentals: Alignment cover image

Introduction to Logical Decision Theory for Computer Scientists

AI Safety Fundamentals: Alignment

00:00

Fixing the Infinite Loop Problem and Evaluating Modal Agents

This chapter discusses an algorithm for fixing the infinite loop problem in logical decision theory by using proof based decision theory and lobes theorem to ensure cooperation.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app