
AI #138 Part 2: Watch Out For Documents
Don't Worry About the Vase Podcast
00:00
Challenges aligning superhuman AIs
Zvi surveys alignment difficulties, datasets of reward-hacking, and tools like METEAR for evaluating agentic misbehavior.
Transcript
Play full episode