AXRP - the AI X-risk Research Podcast cover image

22 - Shard Theory with Quintin Pope

AXRP - the AI X-risk Research Podcast

00:00

The Convergence of Deep Learning and the Human Brain

The workhorse underlying the human learning process is basically loss minimization slash reward signals. And then there's a little bit of RL on top of that, which is like directing that cognitive VM off in useful directions using a relatively small amount of fine tuning data. There has been an advance out of anthropic in terms of pre-training that further narrows the gap between what the human brain does and how they train their language models.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app