AXRP - the AI X-risk Research Podcast cover image

22 - Shard Theory with Quintin Pope

AXRP - the AI X-risk Research Podcast

CHAPTER

The Convergence of Deep Learning and the Human Brain

The workhorse underlying the human learning process is basically loss minimization slash reward signals. And then there's a little bit of RL on top of that, which is like directing that cognitive VM off in useful directions using a relatively small amount of fine tuning data. There has been an advance out of anthropic in terms of pre-training that further narrows the gap between what the human brain does and how they train their language models.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner