Deep Papers cover image

Toolformer: Training LLMs To Use Tools

Deep Papers

00:00

Galak Tijar

The tool is based on a publicly available language model. It uses BM25 retrieval one time we ever, we have a calculator. We just wanted to see whether the same approach can work for different kinds of tools that are very diverse. Yeah. I think there's a direct connection. Once there's a future work we want, could we like to connect Galak Tijar as a tool to leverage an expert in a domain here, science? And so you could see it as a mixture of experts, the DR API calls.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app