Exploiting Vulnerabilities in Superhuman Go-Playing AIs

This chapter discusses vulnerabilities in superhuman Go-playing AIs, introducing the concept of gray-box access and showing how adversaries can exploit weaknesses to defeat these systems. It explores the challenges of securing AI systems such as Go engines and how attacks on them continue to evolve. The conversation also touches on adversarial attacks on open-source models, the trade-offs between superhuman capability and robustness, and the importance of addressing safety issues in AI design.

Play episode from 01:02:19
