
Datacast Episode 110: Wisdom in Building Data Infrastructure, Lessons From Open-Source Development, The Missing README, and The Future of Data Engineering with Chris Riccomini
Mar 14, 2023
Chris Riccomini, a data engineer with experience at PayPal and LinkedIn, discusses building data infrastructure, lessons from open-source development, and the future of data engineering. Topics include scaling Hadoop clusters, choosing big data solutions, strategies for early-stage startups, and the importance of running models and microservices on the same continuous delivery stack. Chris also talks about creating Apache Samza, evangelizing open-source projects, and the evolution of data infrastructure at WePay. The podcast explores mastering interviewing skills in data engineering hiring, building Apache SAMHSA, the evolution of Apache Airflow, designing principles of Apache Kafka, and the future of data engineering predictions and industry insights.
Chapters
Transcript
Episode notes
1 2 3 4 5 6 7 8 9 10
Intro
00:00 • 2min
Transition to Tech: Education and Specialization in Data
01:48 • 5min
Navigating Career Progression and Technological Evolution
06:22 • 25min
Mastering Interviewing Skills in Data Engineering Hiring
31:00 • 6min
Building Apache SAMHSA: Motivation, Design Philosophy, and Lessons Learned
37:30 • 23min
Evolution of Data Infrastructure at WePay
01:00:43 • 12min
Evolution and Developments in Apache Airflow
01:12:41 • 8min
Design Principles of Apache Kafka and Writing a Technical Book
01:20:14 • 18min
Future of Data Engineering Predictions and Industry Insights
01:38:05 • 26min
Exploring a Data Professional's Journey and Insights
02:04:06 • 2min
