80k After Hours cover image

Highlights: #197 – Nick Joseph on whether Anthropic’s AI safety policy is up to the task

80k After Hours

00:00

Enhancing AI Safety through Collaboration and Evaluation

This chapter focuses on the importance of improving AI safety policies through collaborative efforts both within Anthropic and with external organizations. It emphasizes the need for conservative safety measures and discusses how individuals can help develop better threat models and safety evaluations.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app