
17 - Training for Very High Reliability with Daniel Ziegler
AXRP - the AI X-risk Research Podcast
Is There Any Research on Scalable Oversight?
The team at redwood is working on a set of tasks that can be defined just by simple agrithmic predicates. They hope to figure out which kinds of adversary attacks and training techniques work really well in that setting. And then they'll hope to scale back up to more sophisticated tasks.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.