Deep Papers cover image

Toolformer: Training LLMs To Use Tools

Deep Papers

00:00

Using Pre-Trained Language Models to Generate API Calls

Teemo: One of the problems with this is getting a huge annotated dataset. And part of the problem is because the scale of these text datasets is so large, like common crawl, you can't just pay human labelers to do it. We wanted to avoid having huge costs from humans having to annotate data. So we use a language model to sample a massive amount of different API calls. Then we give it three examples of an input and the corresponding output would be the same text with API calls inserted. And then we ask the model to do the same for a fourth text.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app