Thinking Machines: AI & Philosophy

On Adversarial Training & Robustness with Bhavna Gopal

May 8, 2024
Bhavna Gopal, a PhD candidate at Duke with research experience at top tech companies, explores adversarial training and AI robustness. She explains how adversarial attacks threaten the integrity of AI models, especially in sensitive fields like healthcare and law. The conversation touches on the challenges of evaluating model performance and the ethical ramifications of AI deployment. It also covers the complexities of self-driving cars and the importance of interpretability for public trust in AI technologies.
44:05

Podcast summary created with Snipd AI

Quick takeaways

  • Adversarial training enhances the robustness of machine learning models by preparing them to withstand intentional input manipulations that could mislead predictions.
  • A deep understanding of model mechanisms is vital for effective fine-tuning, particularly in high-stakes fields like healthcare where misinterpretations can be detrimental.

Deep dives

Understanding Adversarial Training

Adversarial training is the process of preparing machine learning models to withstand intentional input perturbations that could mislead their predictions. It involves deliberately manipulating inputs to find cases where the model's outputs deviate from expected behavior, which matters most in high-stakes fields like medicine and law. One practical example is feeding biased or adversarially crafted resumes to an AI screening tool to exploit how it scores candidates, which shows how a model can be pushed into harmful or incorrect outputs. More broadly, such inputs can come from knowledgeable adversaries trying to manipulate results, but also from unaware users whose inputs inadvertently confuse the model.
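
To make the idea concrete, here is a minimal sketch of adversarial training on a toy logistic-regression classifier. It is not taken from the episode: the perturbation rule used here is the fast gradient sign method (FGSM), chosen only because it is short, and every name in it (make_toy_data, fgsm_perturb, train, accuracy, the eps budget) is a hypothetical helper invented for illustration.

```python
import math
import random


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_proba(x, w, b):
    return sigmoid(sum(wi * xi for wi, xi in zip(w, x)) + b)


def sign(v):
    return (v > 0) - (v < 0)


def fgsm_perturb(x, y, w, b, eps):
    """Shift each feature by eps in the direction that increases the loss."""
    p = predict_proba(x, w, b)
    # For binary cross-entropy, d(loss)/dx_i = (p - y) * w_i.
    return [xi + eps * sign((p - y) * wi) for xi, wi in zip(x, w)]


def train(data, eps=0.0, lr=0.1, epochs=200):
    """Plain SGD; with eps > 0 each example is perturbed before the update."""
    w, b = [0.0] * len(data[0][0]), 0.0
    for _ in range(epochs):
        for x, y in data:
            x_in = fgsm_perturb(x, y, w, b, eps) if eps > 0 else x
            p = predict_proba(x_in, w, b)
            w = [wi - lr * (p - y) * xi for wi, xi in zip(w, x_in)]
            b -= lr * (p - y)
    return w, b


def accuracy(data, w, b, eps=0.0):
    """Accuracy on clean inputs (eps = 0) or on FGSM-perturbed inputs."""
    correct = 0
    for x, y in data:
        x_in = fgsm_perturb(x, y, w, b, eps) if eps > 0 else x
        correct += int((predict_proba(x_in, w, b) > 0.5) == (y == 1))
    return correct / len(data)


def make_toy_data(n, seed=0):
    # Hypothetical toy data: two Gaussian blobs, one per class.
    rng = random.Random(seed)
    data = []
    for i in range(n):
        y = i % 2
        center = 1.0 if y == 1 else -1.0
        data.append(([rng.gauss(center, 1.0), rng.gauss(center, 1.0)], y))
    return data
```

Calling train(data, eps=0.0) gives an ordinary model, while train(data, eps=0.3) gives one trained on perturbed inputs; comparing accuracy(data, w, b) with accuracy(data, w, b, eps=0.3) for each shows how much an attacker with that budget can degrade it. Real systems use deep networks and stronger attacks, so treat this only as an illustration of the training loop's shape.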
