2min chapter

The Inside View cover image

Curtis Huebner on Doom, AI Timelines and Alignment at EleutherAI

The Inside View

CHAPTER

How to Solve Alignment Problems With a Language Model

The focus has actually shifted from that and kind of asking on the reverse side now. Can you sort of like work backwards to say like what was the chain of events that led you to that bad state according to the probability distribution specified by the language model? So that is sort of like, you know, it's a very sort of exploratory kind of direction. But I do definitely think that having more lenses and kind of perspectives to be able to understand these systems is going to be useful in general.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode