NEJM AI Grand Rounds cover image

Google’s Exploration of Large Language Models in Medicine

NEJM AI Grand Rounds

CHAPTER

The Complexity of the Large Language Model

GPT-3 training procedure is download a bunch of text from the internet and then make a model predict the next word. I think with this very simple next word prediction objective on like internet scale data what we have done is we have actually multitasked a bunch of different objectives. We're seeing all these interesting facets and phenomena emerge as we start to more optimally train these modelsScale them up understand better how to like you know train them.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner