AI Safety Fundamentals: Alignment cover image

Introduction to Logical Decision Theory for Computer Scientists

AI Safety Fundamentals: Alignment

00:00

Fixing the Infinite Loop Problem and Evaluating Modal Agents

This chapter discusses an algorithm for fixing the infinite loop problem in logical decision theory by using proof based decision theory and lobes theorem to ensure cooperation.

Play episode from 09:40
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app