Astral Codex Ten Podcast cover image

ELK And The Problem Of Truthful AI

Astral Codex Ten Podcast

00:00

The E L K Contest - What Would a Human Expect to Hear?

A r c team discussed two classes of translation system. The direct translator, good, looks at what the security a i is thinking and faithfully translates it to its programmes. The human simulator, bad, focuses on what it would expect a human to think in that situation and tells its programme as that. A l k contest was come up with a strategy that insures your reporter a i ends up the direct translator and not as the human simulator.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app