
Ethan Perez–Inverse Scaling, Language Feedback, Red Teaming

The Inside View


What Do You Mean by Offensive?

I think, even in the pre-training setting, there are other ways that we can learn from pre-training text. We could do things that maximize the likelihood of the good sequences in the data and minimize the likelihood of the bad sequences. But right now we do something that's much more naive, and it doesn't have to be that way. I think by really highlighting what goes wrong with the language modeling objective, we can then provide some tasks that motivate people to develop other kinds of pre-training objectives that don't have those limitations.
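
To make the contrast concrete, here is a minimal Python sketch of the two objectives. It is an illustration under stated assumptions, not a method described in the episode: the "good"/"bad" labels, the toy probabilities, and the function names are all hypothetical, and the penalty used for bad sequences (`-log(1 - p)` per token, in the style of unlikelihood training) is just one possible way to "minimize the likelihood of the bad sequences".

```python
import math

# Each item is (per-token probabilities the model assigns, is_good label).
# Probabilities must lie strictly between 0 and 1. All values are made up.

def standard_lm_loss(sequences):
    """Naive objective: average negative log-likelihood over every token,
    regardless of whether the sequence is good or bad."""
    token_probs = [p for probs, _ in sequences for p in probs]
    return -sum(math.log(p) for p in token_probs) / len(token_probs)

def quality_aware_loss(sequences):
    """Assumed alternative: reward likelihood on good sequences (-log p)
    and penalize likelihood on bad sequences (-log(1 - p))."""
    total, count = 0.0, 0
    for probs, is_good in sequences:
        for p in probs:
            total += -math.log(p) if is_good else -math.log(1.0 - p)
            count += 1
    return total / count

# The model is confident on both a good and a bad sequence.
confident_on_bad = [([0.9, 0.8], True), ([0.9, 0.8], False)]
# The model is confident on the good sequence only.
unsure_on_bad = [([0.9, 0.8], True), ([0.1, 0.2], False)]

for name, data in [("confident_on_bad", confident_on_bad),
                   ("unsure_on_bad", unsure_on_bad)]:
    print(name, standard_lm_loss(data), quality_aware_loss(data))
```

In this toy setup the standard loss is lowest when the model imitates everything, including the bad sequence, while the quality-aware loss is lowest when the model assigns low probability to the bad sequence.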
