4min snip

80,000 Hours Podcast cover image

#141 – Richard Ngo on large language models, OpenAI, and striving to make the future go well

80,000 Hours Podcast

ADVICE

Areas for AI Safety Research

Chip-Level Verification: Research chip-level verification mechanisms to prevent risky training. "doing research on chip level verification mechanisms such that we can try and produce chips that can't be used for certain types of particularly risky training." This could involve monitoring activity, verifying inactivity periods, or implementing restrictions like Nvidia's parallel processing limits. "Nvidia has some restrictions on the way some of their GPUs can be used so that you need to pay extra in order to unlock the ability to run many of them in parallel."

Automated Verification of Training: Develop automated verification methods for machine learning training runs. "automated verification of certain properties of machine learning training runs." This would help verify properties like parameters, training duration, and algorithms used, without revealing sensitive source code. "One way could be for me to like show auditors the actual source code that I'm using for my training run but you know that leaks all sorts of information there are privacy concerns [...]."

Bridging Governance and Technical Expertise: Focus on bridging the gap between governance needs and technical implementation. "the thing I'm trying to do is as I said like build these bridges." Translate governance requirements into concrete technical research directions, facilitating collaboration between governance experts and technical researchers. "like really I want to hand them over to the people who have like way more domain expertise and potentially serve as a bridge [...]."

The cause of increasing complexity and potential risks in AI training necessitates the effect of developing stronger verification and governance mechanisms.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode