AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Building a Supportive Ecosystem for AI Ethics and Safety Research
Exploring the need for interdisciplinary platforms like the ACM Fenness Accountability Transparency Conference and structured training pipelines to attract talent to AI safety and ethics research.
Episode 124
You may think you’re doing a priori reasoning, but actually you’re just over-generalizing from your current experience of technology.
I spoke with Professor Seth Lazar about:
* Why managing near-term and long-term risks isn’t always zero-sum
* How to think through axioms and systems in political philosphy
* Coordination problems, economic incentives, and other difficulties in developing publicly beneficial AI
Seth is Professor of Philosophy at the Australian National University, an Australian Research Council (ARC) Future Fellow, and a Distinguished Research Fellow of the University of Oxford Institute for Ethics in AI. He has worked on the ethics of war, self-defense, and risk, and now leads the Machine Intelligence and Normative Theory (MINT) Lab, where he directs research projects on the moral and political philosophy of AI.
Reach me at editor@thegradient.pub for feedback, ideas, guest suggestions.
Subscribe to The Gradient Podcast: Apple Podcasts | Spotify | Pocket Casts | RSSFollow The Gradient on Twitter
Outline:
* (00:00) Intro
* (00:54) Ad read — MLOps conference
* (01:32) The allocation of attention — attention, moral skill, and algorithmic recommendation
* (03:53) Attention allocation as an independent good (or bad)
* (08:22) Axioms in political philosophy
* (11:55) Explaining judgments, multiplying entities, parsimony, intuitive disgust
* (15:05) AI safety / catastrophic risk concerns
* (22:10) Superintelligence arguments, reasoning about technology
* (28:42) Attacking current and future harms from AI systems — does one draw resources from the other?
* (35:55) GPT-2, model weights, related debates
* (39:11) Power and economics—coordination problems, company incentives
* (50:42) Morality tales, relationship between safety and capabilities
* (55:44) Feasibility horizons, prediction uncertainty, and doing moral philosophy
* (1:02:28) What is a feasibility horizon?
* (1:08:36) Safety guarantees, speed of improvements, the “Pause AI” letter
* (1:14:25) Sociotechnical lenses, narrowly technical solutions
* (1:19:47) Experiments for responsibly integrating AI systems into society
* (1:26:53) Helpful/honest/harmless and antagonistic AI systems
* (1:33:35) Managing incentives conducive to developing technology in the public interest
* (1:40:27) Interdisciplinary academic work, disciplinary purity, power in academia
* (1:46:54) How we can help legitimize and support interdisciplinary work
* (1:50:07) Outro
Links:
* Resources
* Attention, moral skill, and algorithmic recommendation
Listen to all your favourite podcasts with AI-powered features
Listen to the best highlights from the podcasts you love and dive into the full episode
Hear something you like? Tap your headphones to save it with AI-generated key takeaways
Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more
Listen to all your favourite podcasts with AI-powered features
Listen to the best highlights from the podcasts you love and dive into the full episode