80,000 Hours Podcast cover image

#151 – Ajeya Cotra on accidentally teaching AI models to deceive us

80,000 Hours Podcast

NOTE

The Future of AI

The evolution of AI capabilities hinges on the challenge of increasing difficulty in obtaining further incremental improvements. The quest for achieving higher levels of intelligence may not necessarily get easier with each advancement, leading to a potential leveling off in progress. Despite uncertainties, the anticipation remains high for an explosive takeoff in growth rates leading to super exponential advancements in a relatively short period. Addressing the concern that powerful AI systems may become too goal-directed and agentic, there is a belief that it is possible to develop AI systems proficient in science without being excessively goal-directed or long-sighted, though this may require significant effort and resources.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner