Exploring Correlations and Misalignments in Large Language Models

This chapter examines low decoupling in machine learning, particularly in large language models. It describes how contextual cues shape model responses and how vulnerabilities in the training data can lead to misalignments ranging from superficial to profound.

Play episode from 24:39
