The Inside View cover image

Collin Burns On Discovering Latent Knowledge In Language Models Without Supervision

The Inside View

00:00

Is AI Feedback a Better Way to Evaluate Models?

Human evaluators just won't be able to evaluate many statements and so the model will just end up generating a bunch of incorrect things as well in cases where they can't tell. So ideally instead of having like humans giving feedback I heard a lot of people saying that we could maybe get AI is giving feedback in the future. do more complicated things otherwise by do or meta I might just come up with like a better but and eat you yeahYeah it's not clear what the limits are for an AI system, he says.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app