Microsoft Research Podcast cover image

AI Frontiers: The Physics of AI with Sébastien Bubeck

Microsoft Research Podcast

00:00

How to Measure Intelligence of AI Systems

The prevailing way that we've gone about measuring the intelligence of AI systems is through this process of benchmarking. GPT-4 can pass mock interviews for software engineers position at Amazon, Google, like meta,. It passes all of those interviews very easily. Not only does it pass those interviews, but it also ranks in the very top of the human being. The ML community has grappled with this problem recently, because generative AI has been in the works for years now. But the answers are still very tentative.

Play episode from 11:10
Transcript

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app