AXRP - the AI X-risk Research Podcast cover image

43 - David Lindner on Myopic Optimization with Non-myopic Approval

AXRP - the AI X-risk Research Podcast

00:00

Interest in reasoning models and chain-of-thought failures

David flags concerns about unfaithful or steganographic chains of thought as potential multi-step hacking modes.

Play episode from 01:07:53
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app