
43 - David Lindner on Myopic Optimization with Non-myopic Approval
AXRP - the AI X-risk Research Podcast
Interest in reasoning models and chain-of-thought failures
David flags unfaithful or steganographic chains of thought as potential modes of multi-step reward hacking.
Play episode from 01:07:53


