Exploring Hypotheses on LLMs' Behavior and Feature Directions

Exploring a hypothesis about the behavior of LLMs and their ability to distinguish basic feature directions, with a hypothetical example of a transformer model.

Play episode from 27:18

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app