The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Adversarial Attacks Against Reinforcement Learning Agents with Ian Goodfellow & Sandy Huang

Mar 15, 2018

Guest

Sandy Huang

Guest

Ian Goodfellow

Ian Goodfellow, a Staff Research Scientist at Google Brain known for his work on adversarial machine learning, joins Sandy Huang, a PhD student at UC Berkeley who studies adversarial attacks in reinforcement learning. They discuss how altering a single pixel can drastically reduce the performance of an Atari-playing agent. The conversation also touches on the philosophy behind error assessment in AI, reward complexity in reinforcement learning, and the security implications of adversarial threats to AI systems, highlighting the urgent need for robust defenses.