Pragmatic AI Safety

A Framework for AI Safety Research

Book • 2023

Authors

Oliver Zhang

Mantas Mazeika

Pragmatic AI Safety (PAIS) is a research approach that emphasizes performing impactful AI safety research without simultaneously advancing AI capabilities.

It highlights the importance of sociotechnical factors, such as safety culture, and encourages a systems view of AI safety to address complex interactions within the AI research ecosystem.

Mentioned in 1 episode

Mentioned by Arthur Conmy in the context of mechanistic interpretability research.

E48: Mechanizing Mechanistic Interpretability with Arthur Conmy
