Monday Morning Data Chat

#135 - Dataframe Deep Dive w/ Devin Petersohn

Jul 18, 2023
Chapters
1
Introduction
00:00 • 2min
2
The Complexity of Data Frames
02:27 • 2min
3
The Complexity of Data Frames
04:29 • 5min
4
The Parallel Universe of Data Frames
09:40 • 3min
5
The Importance of Understanding Data Frame Order
12:17 • 4min
6
The Advantages of Ordering in Data Frames
16:20 • 6min
7
Relational Algebra and Transpose Operators
21:55 • 2min
8
The Controversy of the Transpose and the Same World
23:35 • 2min
9
The Data Frame Algebra: A More Flexible Way of Interacting With Data
25:07 • 3min
10
The Best API for Data
27:38 • 3min
11
How to Make Big Decisions for Your Software Analysis Team
30:43 • 2min
12
The Evolution of Pandas
32:37 • 2min
13
The Evolution of Data Frames
34:58 • 2min
14
Modin: A Drop-In Replacement for Pandas
36:43 • 2min
15
How to Create Order and Tracking in a Non Ordered Cloud Data Warehouse
38:42 • 2min
16
How to Avoid Loops in the Data Warehouse
40:59 • 2min
17
The Importance of Knowing the SDK and API
42:32 • 4min
18
How to Retrain Users to Use a More Scalable Product
46:31 • 3min
19
Who Should Be Using SQL and Interfacing With Data Frames?
49:58 • 2min
20
The Trade-Offs Between Different Approaches
52:14 • 2min
21
The Importance of Mental Models for Learning Pandas
54:04 • 5min
22
How to Transpose Machine Learning Data
58:43 • 2min
23
The Dark Matter of Data
01:00:29 • 2min