
38.2 - Jesse Hoogland on Singular Learning Theory
AXRP - the AI X-risk Research Podcast
00:00
Intro
This chapter introduces Singular Learning Theory (SLT) and its role in AI alignment, with emphasis on its foundations in Bayesian statistics. The discussion covers evaluating AI behavior, interpretability, and the difficulty of predicting downstream behavior and detecting problems in model performance.