

Data Science at Spotify with Boxun Zhang
Dec 11, 2015
Chapters
Transcript
Episode notes
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23
Introduction
00:00 • 2min
Is There a Chronological Overlap Between Data Science and Distributed Systems?
02:07 • 2min
The Evolution of Big Data at Spotifi
03:43 • 3min
Do You Need to Understand the Underlying Technology?
07:01 • 3min
How to Reduce the Time Expended Cleaning Data?
10:23 • 3min
Log Archiver - Why Did It Fail?
13:36 • 2min
Koka at Spotifi - What Are the Benefits of Using Koka?
15:15 • 2min
Is There Any Open Source Alternative to Lu?
17:29 • 2min
Hadup a - What Are the Persistent Themes in the Hadup a Problems?
19:33 • 3min
How Do You Work on a Data Science Architecture?
22:04 • 3min
Gradien Boosting Models - A General Purpose Implementation
24:59 • 2min
Random Forest in Machine Learning?
27:15 • 2min
Using Random Forest for Classification and Regression Problems?
28:56 • 2min
The Most Popular Clustering Elgenno Used Out There
30:42 • 2min
Machine Learning and Data Science - Gradient Decadence
32:38 • 2min
The Collaborative Process Between Data Scientists and Engineers at Spotifi
34:36 • 4min
Sitting Together in a Data Science Environment?
38:07 • 3min
The Future of Streaming Systems in Spotify
41:08 • 2min
Tenser Flow
43:37 • 2min
Deploying Machine Learning Models to User's Devices
45:59 • 2min
What Is the Future of Data Science?
48:05 • 3min
Is There a Future for Data Science?
51:10 • 2min
Discover Weekly - Are You Using Machine Learning to Generate Music?
52:45 • 3min