80k After Hours cover image

Highlights: #214 – Buck Shlegeris on controlling AI that wants to take over – so we can use it anyway

80k After Hours

00:00

Navigating AI Misalignment Risks

This chapter explores the complexities of detecting harmful AI behaviors and the subtle steps that could lead to malicious actions. It discusses strategies for improved monitoring, including resampling methods and the simulation of environments to mitigate risks.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app