Introduction

This is "Lost in the Middle: How Language Models Use Long Contexts." There's really not that much known about how well these models actually use their context, or even how they use it in general. So what does the paper do to figure out how well the models are using their context and where that sometimes fails? Well, they're testing four models. Two are open source: that's going to be MPT-30B-Instruct and LongChat-13B. And then there are two closed-source models: GPT-3.5 Turbo and Claude.
