The Inside View

[JUNE 2022] Aran Komatsuzaki on Scaling, GPT-J and Alignment

Jul 19, 2023
Ask episode
Chapters
Transcript
Episode notes
1
Introduction
00:00 • 2min
2
How to Read So Many Abstracts on Twitter
01:39 • 2min
3
The Importance of Scaling
03:39 • 2min
4
The Future of AI
05:11 • 4min
5
The Importance of AI in the Economy
09:21 • 2min
6
The Importance of Scaling in AI
11:32 • 4min
7
How to Scale Your Models to Maximize Performance
15:25 • 3min
8
GPT-2 and GPT-3: A Comparison of Scaling and Scaling in NLP
18:48 • 2min
9
One E-Pop Is All You Need
20:26 • 4min
10
How to Scale GPT and T0
24:54 • 4min
11
How to Scale a Multi Task Training Procedure
28:47 • 3min
12
The Importance of Scaling for GPT-like Models
31:22 • 4min
13
How to Scale a Scaling Law for Humans
35:14 • 2min
14
GPT J for GPT Jax
36:53 • 3min
15
How to Build a Wider Language Model
40:04 • 4min
16
How JPTJ Compares to JPT3 and JPTNEO
43:37 • 4min
17
How to Fine-Tune a TensorFlow Model
47:14 • 3min
18
The Negative Impact of Releasing a Big Language Model With the Current Trend
50:32 • 2min
19
How GPTJ Accelerated Research Timelines
52:12 • 3min
20
The Importance of Accelerating Open Source Timelines
55:00 • 2min
21
The Problem With AI Deception
56:50 • 2min
22
How to Write a Good Benchmark for Machine Learning Models
59:09 • 2min
23
NLP Benchmarks for Language Models
01:00:47 • 3min
24
AI's Alignment
01:03:22 • 3min
25
How to Go From Human Level Language Models to AGI
01:06:21 • 2min
26
How to Convince Humans to Build More GPU Centers
01:08:10 • 2min
27
The Future of Human Intelligence
01:09:43 • 2min
28
How to Regulate the Behavior of AGI
01:11:23 • 2min
29
The Future of AI and Alignment
01:13:06 • 4min