
AI testing, benchmarks and evals
Thoughtworks Technology Podcast
Exploring Mechanistic Interpretability in AI
This chapter discusses the emerging field of mechanistic interpretability in artificial intelligence, highlighting ongoing research efforts to align AI models with desired outcomes. It encourages listeners to explore current studies while acknowledging that the field is still developing.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.