The Jim Rutt Show cover image

Currents 088: Melanie Mitchell on AI Measurement and Understanding

The Jim Rutt Show

00:00

The Importance of Memorization in Standardized Exams

Chat TPT 3.5 did great on all of the standardized exams, except AP English which it didn't score well on at all. So you know, there's all kinds of issues with giving these language models these tests. And we know that these systems are very sensitive to the prompts that they're given. But none of us can really test that because number one, we don't have access to GPT 4 and we also don't know what exactly the material is that they tested on.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app