4min chapter

Deep Papers cover image

Toolformer: Training LLMs To Use Tools

Deep Papers

CHAPTER

The Limitations of Self Supervised Learning

So you've generated this massive amount of API calls to annotate your data set. They're not very good. You filter them using this kind of clever mechanism. Now you have a data set that is annotated with these API calls. What do you do after that? Like, how do you actually use that to get the language model to use the tools? So there's two steps here. And then the final step is actually very, very simple. We just train the language model for a bit longer on this new data set with API calls. It won't like unlearn things that it learned during pre-training because it's still the exact same data distribution.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode