
Evaluating Language Models
The Data Exchange with Ben Lorica
Introduction
Percy Liang is a Professor of Computer Science and Statistics at Stanford, where he also directs the new Center for Research on Foundation Models. We're going to talk mainly about your new framework for evaluating large language models, called HELM: Holistic Evaluation of Language Models. A language model in the HELM sense is any model that takes text and outputs text.