The Inside View cover image

Ethan Perez–Inverse Scaling, Language Feedback, Red Teaming

The Inside View

00:00

Using a Language Model for Evaluation?

The idea te the language model that you're to make up with as the same architecture as the language model will produce the code, and so it will be able to understand why the thing generated e code s. Ah, i think it can get, it can get tricky when you do this,. A may also be able to infer that you are using it to help in the evaluation process, or maybe youike specifically say that, and then then it can avoid pointing out certain important weaknesses. And so i think there you need to, for example, use many different kinds of models. Use also some smaller models that are maybe less likely to have these kinds of failures.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app