LessWrong (Curated & Popular) cover image

"More information about the dangerous capability evaluations we did with GPT-4 and Claude." by Beth Barnes

LessWrong (Curated & Popular)

00:00

The Future of AI Is in the Models, Not the Machines

No prone to hallucinations were not fully effective at delegating large tasks between multiple copies. Current language models are also very much capable of convincing humans to do things for them. We think that for systems more capable than Claude and GPT-4, we need to check carefully that new models do not have sufficient capabilities to replicate autonomously or cause catastrophic harm.

Play episode from 11:17
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app