
AI Safety Fundamentals

Latest episodes

Jan 4, 2025 • 12min

Introduction to Mechanistic Interpretability

This introduction covers common mech interp concepts to prepare you for the rest of this session's resources.
Original text: https://aisafetyfundamentals.com/blog/introduction-to-mechanistic-interpretability/
Author(s): Sarah Hastings-Woodhouse
A podcast by BlueDot Impact. Learn more on the AI Safety Fundamentals website.
Jan 4, 2025 • 20min

We Need a Science of Evals

This post lays out a number of open questions in what the author calls a 'Science of Evals'.
Original text: https://www.apolloresearch.ai/blog/we-need-a-science-of-evals
Author(s): Apollo Research blog
Jan 2, 2025 • 40min

If-Then Commitments for AI Risk Reduction

Holden Karnofsky, a visiting scholar at the Carnegie Endowment for International Peace, delves into his innovative 'If-Then' commitments for managing AI risks. He outlines how these structured responses can ensure proactive safety measures without stifling innovation. The discussion highlights the importance of timely interventions as AI technology evolves, ensuring developments stay safe and beneficial. Karnofsky also touches on the challenges of implementing these commitments and the necessity of regulatory compliance across sectors.
Jan 2, 2025 • 11min

This is How AI Will Transform How Science Gets Done

This article by Eric Schmidt, former CEO of Google, explains existing use cases for AI in the scientific community and outlines ways that sufficiently advanced, narrow AI models might transform scientific discovery in the near future. As you read, pay close attention to the existing case studies he describes.
Original text: https://www.technologyreview.com/2023/07/05/1075865/eric-schmidt-ai-will-transform-science/
Author(s): Eric Schmidt
Dec 30, 2024 • 41min

So You Want to be a Policy Entrepreneur?

Discover the dynamic world of policy entrepreneurs and their role in driving innovation. Uncover strategies like problem framing and coalition building that empower them to tackle global challenges, especially climate change. Learn how collaborative networks help share vital knowledge and foster legislative support. Historical examples illustrate the impact of these leaders in areas like California’s stem cell research. Dive into the ongoing fight against violence in conflict, showcasing the critical need for advocacy and transformative policies.
Dec 30, 2024 • 56min

Open-Sourcing Highly Capable Foundation Models: An Evaluation of Risks, Benefits, and Alternative Methods for Pursuing Open-Source Objectives

The discussion tackles the double-edged sword of releasing powerful foundation models. While transparency can fuel innovation, the potential for misuse by malicious actors is alarming. Cyberattacks, the development of biological weapons, and disinformation loom large. The conversation also emphasizes the necessity for responsible AI governance, advocating for structured access and staged releases as safer alternatives to total openness. Legal liabilities and the implications of regulation are scrutinized, highlighting the critical need for thorough risk assessments.
Dec 30, 2024 • 26min

Considerations for Governing Open Foundation Models

Discover the debate surrounding open foundation models and their potential to drive innovation and competition. The discussion highlights the ethical dilemmas of open versus closed models and the need for thoughtful policy design. Examining the risks, such as disinformation, the speakers argue that evidence for these dangers is limited. They advocate for policies that consider the unique characteristics of open models to prevent stifling development while promoting transparency and reducing monopolistic power in AI.
May 22, 2024 • 36min

Driving U.S. Innovation in Artificial Intelligence: A Roadmap for Artificial Intelligence Policy in the United States Senate

The podcast discusses the U.S. Senate's AI policy roadmap, including AI Ready Data, National AI Research Resource, AI safety, and election security. It explores funding and collaboration in AI applications for defense and national security, as well as addressing legal and regulatory gaps in AI systems. Policy recommendations cover export controls, security mechanisms, and collaboration with international partners.
May 20, 2024 • 40min

The AI Triad and What It Means for National Security Strategy

Ben Buchanan, author of the AI Triad framework, discusses the inputs powering machine learning: algorithms, data, and compute. The podcast explores the impact of these components on national security strategy, the disparities between machine learning and traditional programming, and the application of machine learning in national security, robotics, and AI advancements.
May 20, 2024 • 46min

Societal Adaptation to Advanced AI

Authors Jamie Bernardi and Gabriel discuss their paper on societal adaptation to advanced AI systems, emphasizing the need for adaptive strategies and resilience. Topics include managing AI risks, interventions, loss of control to AI decision-makers, and responses to AI threat models.
