The Data Exchange with Ben Lorica cover image

Evaluating Language Models

The Data Exchange with Ben Lorica

00:00

Introduction

Percy Liang, Professor of Computer Science and Statistics at Stanford where he also directs the new Center for Research on Foundation Models. We're going to talk mainly about your new framework for evaluating large language models called Helm,. Holistic Evaluation of Large Language Models. A language model in the HelmSense is any model that takes text and Outputs Text.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app