The Data Scientist Show - Daliana Liu cover image

Becoming a deep learning researcher without a PhD, graph neural network(GNN), time series, recommender system with Kyle Kranen - The Data Scientist Show#028

The Data Scientist Show - Daliana Liu

00:00

Optimizing Pre-Processing Efficiency in Data Science

Explore strategies for enhancing pre-processing efficiency through optimized join orders in data science, emphasizing the impact on table size reduction and processing speed. Learn the significance of strategic planning in selecting join sequences for operations in frameworks like PySpark, highlighting the manual optimization compared to automatic processes in SQL engines.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app