Data Skeptic cover image

The NLP Community Metasurvey

Data Skeptic

CHAPTER

The Implications of AI Alignment

The field is sometimes called AI alignment because to the extent that an AI system is going to take care to optimize an objective effectively, you need that objective to be aligned with what you actually want. Examining the objective function isn't necessarily enough for a lot of reasons and it's quite contested. We can look inside of the systems and see what features they're picking up on,. If there are internal representations of objectives, we can think about ways of supervising and structuring systems which give them more transparent internal structure than black box optimizer.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner