LessWrong (Curated & Popular) cover image

"(My understanding of) What Everyone in Technical Alignment is Doing and Why" by Thomas Larsen & Eli Lifland

LessWrong (Curated & Popular)

00:00

Scalable Alignment Research

Fars curren aims to incubate new, scaleable alignment research. They are working on adversarial attacks against narrowly superhuman systems like alpago language model and bench marks for value learning. Mery thinks technical alignment is really hard and that we are very far from a solution. However, they think that policy solutions have even less hope.

Play episode from 52:00
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app