The Bleeding Edge Benchmark

Right now the main user of this benchmark is just the big tech companies that can build these large language models. So using all these data sets can be a nice benchmark. The actual paper just came out two months ago, so it's kind of new to the community. It covers mostly like models from Google and OpenAI,. Like GPT and also the recent models Palm; another one I forgot its name.

Play episode from 33:08

Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app