80k After Hours cover image

Highlights: #214 – Buck Shlegeris on controlling AI that wants to take over – so we can use it anyway

80k After Hours

CHAPTER

Navigating AI Misalignment Risks

This chapter explores the complexities of detecting harmful AI behaviors and the subtle steps that could lead to malicious actions. It discusses strategies for improved monitoring, including resampling methods and the simulation of environments to mitigate risks.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner