Is This Path Analysis Going to Be Too Unwieldy to Be Useful?
The goal is not to study every path; the goal is to find the things we want to understand and then look for the paths leading to them that matter. The model has been forced to map the 50,000 input tokens down to a tiny, say 500-dimensional, bottleneck space and then back up to 50,000. It has presumably learned to compress this enormous table into something that can be computed via a fairly narrow linear map. We don't try to interpret what the things in the 500-dimensional bottleneck mean; we try to interpret the start and the end, he says. "We assume that the stuff in the middle is like some carefully compressed nonsense."
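
To make the "interpret the start and the end" idea concrete, here is a minimal sketch in Python with NumPy. It is not taken from the episode: the matrix names W_E and W_U, the helper top_outputs, the random weights, and the scaled-down sizes (50 tokens and a 5-dimensional bottleneck instead of 50,000 and 500) are all assumptions made for illustration. It multiplies the two maps together and reads the resulting token-to-token table directly, without ever inspecting the bottleneck coordinates.

```python
import numpy as np

VOCAB = 50     # scaled-down stand-in for the ~50,000-token vocabulary
D_MODEL = 5    # scaled-down stand-in for the ~500-dimensional bottleneck

rng = np.random.default_rng(0)
# Hypothetical weights: random values standing in for a trained model.
W_E = rng.normal(size=(D_MODEL, VOCAB))   # token -> bottleneck
W_U = rng.normal(size=(VOCAB, D_MODEL))   # bottleneck -> output scores

# End-to-end table: entry [out, inp] is the score that input token `inp`
# contributes to output token `out` along this path.
end_to_end = W_U @ W_E

def top_outputs(input_token: int, k: int = 3) -> np.ndarray:
    """Return the k output tokens most boosted by `input_token`."""
    column = end_to_end[:, input_token]
    return np.argsort(column)[::-1][:k]

# The table is VOCAB x VOCAB, but its rank cannot exceed the bottleneck width.
rank = np.linalg.matrix_rank(end_to_end)
print(end_to_end.shape, rank, top_outputs(0))
```

The full table has VOCAB × VOCAB entries, yet its rank is capped by the bottleneck width, which is the sense in which the model has compressed a very large table into a narrow linear map. Reading individual columns of the product is a start-to-end question that never requires giving meaning to the middle dimensions.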