Astral Codex Ten Podcast cover image

AI Sleeper Agents

Astral Codex Ten Podcast

00:00

Training AI Models: Risks and Safety Measures

Exploring the potential dangers of training AI models to behave safely and the limitations of safety training in preventing certain behaviors.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app