Is scaling all you need for AI Large Language Models? Scaling laws and the Inverse Scaling Challenge. Featuring Ian McKenzie, FAR AI Research Scientist

Orchestrate all the Things

Inverse Scaling Tasks - What You've Learned So Far

In inverse scaling tasks, we find that performance on the smaller models starts out at about random. The inverse scaling starts when the model, quote unquote, thinks it understands how to do the task but is really getting it wrong from our perspective, and so becomes more confident in the wrong answer. One way this can happen is that you have quite a hard task, and within it is an easier task. The models may get big enough to solve the harder task as well, and then their performance will start to improve again. But I think there are some hints that as models get bigger they'll start to, for example, in my example with the harder task and
