Building a Supportive Ecosystem for AI Ethics and Safety Research

Exploring the need for interdisciplinary platforms like the ACM Fenness Accountability Transparency Conference and structured training pipelines to attract talent to AI safety and ethics research.

Play episode from 01:37:09

chevron_right

Transcript

chevron_right

Transcript

Episode notes

Episode 124

You may think you’re doing a priori reasoning, but actually you’re just over-generalizing from your current experience of technology.

I spoke with Professor Seth Lazar about:

* Why managing near-term and long-term risks isn’t always zero-sum

* How to think through axioms and systems in political philosphy

* Coordination problems, economic incentives, and other difficulties in developing publicly beneficial AI

Seth is Professor of Philosophy at the Australian National University, an Australian Research Council (ARC) Future Fellow, and a Distinguished Research Fellow of the University of Oxford Institute for Ethics in AI. He has worked on the ethics of war, self-defense, and risk, and now leads the Machine Intelligence and Normative Theory (MINT) Lab, where he directs research projects on the moral and political philosophy of AI.

Reach me at editor@thegradient.pub for feedback, ideas, guest suggestions.

Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSSFollow The Gradient on Twitter

Outline:

* (00:00) Intro

* (01:32) The allocation of attention — attention, moral skill, and algorithmic recommendation

* (03:53) Attention allocation as an independent good (or bad)

* (08:22) Axioms in political philosophy

* (11:55) Explaining judgments, multiplying entities, parsimony, intuitive disgust

* (15:05) AI safety / catastrophic risk concerns

* (22:10) Superintelligence arguments, reasoning about technology

* (28:42) Attacking current and future harms from AI systems — does one draw resources from the other?

* (35:55) GPT-2, model weights, related debates

* (39:11) Power and economics—coordination problems, company incentives

* (50:42) Morality tales, relationship between safety and capabilities

* (55:44) Feasibility horizons, prediction uncertainty, and doing moral philosophy

* (1:02:28) What is a feasibility horizon?

* (1:08:36) Safety guarantees, speed of improvements, the “Pause AI” letter

* (1:14:25) Sociotechnical lenses, narrowly technical solutions

* (1:19:47) Experiments for responsibly integrating AI systems into society

* (1:26:53) Helpful/honest/harmless and antagonistic AI systems

* (1:33:35) Managing incentives conducive to developing technology in the public interest

* (1:40:27) Interdisciplinary academic work, disciplinary purity, power in academia

* (1:46:54) How we can help legitimize and support interdisciplinary work

* (1:50:07) Outro

Links:

* Seth’s Linktree and Twitter

* Resources

* Attention, moral skill, and algorithmic recommendation

* Catastrophic AI Risk slides

Get full access to The Gradient at thegradientpub.substack.com/subscribe

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app

Home Top podcasts Popular guests Top books