LessWrong (Curated & Popular) cover image

"Cyborgism" by Nicholas Kees & Janus

LessWrong (Curated & Popular)

00:00

Creating Cyborgs for Alignment

The object level plan of creating cyborgs for alignment boils down to two main directions. One, design more tools or methods like LUM which provide high bandwidth, human-in-the-loop ways for humans to interact with GPT as a simulator. Two, train alignment researchers to use these tools and leverage that understanding to exert fine-grained control over the model. Heading cyborg cognition is intended to help clarify what is meant by the term cyborg.

Play episode from 27:05
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app