
38.2 - Jesse Hoogland on Singular Learning Theory
AXRP - the AI X-risk Research Podcast
Intro
This chapter introduces Singular Learning Theory (SLT), its foundations in Bayesian statistics, and its role in AI alignment. The discussion covers evaluating AI behavior, interpretability, and the difficulty of predicting downstream behavior and detecting problems in model performance.
00:00