
Ep 58: Sam Bowman on ChatGPT & Controlling AI
Brave New World -- hosted by Vasant Dhar
00:00
What Is a Large Language Model and How Does It Work?
GPT-3 is a large language model, in sort of bite-sized pieces, and where does the magic come from? I mean, it's captured everyone's imagination; everyone out there is playing with GPT.

What makes it possible to do such an impressive thing from such simple code is that those linear algebraic operations have all of these sort of free parameters. You're defining this very flexible, under-determined computation. And then you do this training process where you show the model enormous amounts of data, which gradually, incrementally sets all of those free parameters.
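The idea of free parameters being set incrementally by data can be illustrated with a toy sketch. This is not how GPT-3 is implemented; it is a minimal example that assumes a one-dimensional linear model with two free parameters (`w` and `b`), a squared-error objective, and plain stochastic gradient descent. The function name `train`, the learning rate, and the synthetic data are all hypothetical choices made for illustration.

```python
import random


def train(data, steps=5000, lr=0.01, seed=0):
    """Fit y = w * x + b by showing the model one example at a time."""
    rng = random.Random(seed)
    # The free parameters start out arbitrary: the code alone does not
    # determine what the computation does.
    w, b = rng.uniform(-1, 1), rng.uniform(-1, 1)
    for _ in range(steps):
        x, y = rng.choice(data)      # show the model one example
        err = (w * x + b) - y        # how wrong the current parameters are
        # Nudge each parameter a little in the direction that reduces
        # the squared error on this example.
        w -= lr * err * x
        b -= lr * err
    return w, b


# Synthetic data generated by the rule y = 2x + 1 (an assumption for the demo).
data = [(x, 2.0 * x + 1.0) for x in [-2.0, -1.0, 0.0, 1.0, 2.0, 3.0]]
w, b = train(data)
print(w, b)
```

The same few lines of code would fit a completely different rule if given different data, which is the sense in which the computation is under-determined until training fixes the parameters. A large language model differs mainly in scale: billions of parameters rather than two, and text rather than number pairs.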