Chapters
Transcript
Episode notes
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20
Introduction
00:00 • 2min
The Holistic Evaluation of Language Models
01:58 • 2min
How to Test a Language Model for General Purpose Tasks
04:23 • 3min
The Seven Metrics of Detoxification Toxicity
07:07 • 2min
How to Score Language Models in a Benchmark
08:54 • 2min
How to Use Helm to Narrow Down Your Use Cases
11:06 • 2min
The Future of Helm: A Community Initiative
13:01 • 2min
The Importance of Scaling Language Models
15:08 • 2min
The Complexity of Copyright
17:18 • 3min
The Challenges of Evaluating Language Models
19:54 • 1min
The Gap Between Open AI and Private Models
21:23 • 2min
How to Measure Efficiency in the API
22:57 • 1min
The Future of Language Models
24:26 • 3min
The Promise of Language Models
27:39 • 2min
Fine Tuning General Purpose Language Models
29:09 • 2min
The Paradigm Shift in Foundation Models
30:52 • 2min
The Future of Foundation Models
32:57 • 2min
The Importance of Data
35:23 • 2min
The Importance of Externalizing Knowledge
37:15 • 3min
Helm: A Center for Foundation Models
39:53 • 6min