AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
Data scientists and analysts are advised to spend time shadowing colleagues who handle data entry or business operations to gain context and understand the data-generating process. This direct involvement in the business side helps professionals grasp the meaning of the data columns they analyze, contributing to more insightful interpretations.
David Asport shares his career journey transitioning from software development to data science, highlighting the importance of understanding the value-generating aspect of business operations for data professionals. Through experiences in software development and data science education, he found a niche in data and education, emphasizing the value of following personal interests in career growth.
David addresses the disparity between data science education and practical industry demands, stressing the need for a shift towards applied and realistic training. He challenges the current focus on technical skills in foundational training, advocating for a more holistic approach that incorporates real-world challenges, business understanding, and value-driven problem-solving in data analytics.
Real-life data analysis often involves encountering data quality issues, such as miscategorized products in an e-commerce dataset. The speaker emphasizes the importance of approaching analysis realistically, acknowledging that data problems will arise throughout the process. They highlight the need to address quality issues actively rather than striving for unattainable perfect accuracy specifically when dealing with large datasets. Tools like statistical distributions aid in understanding data variance, while caution is advised against indiscriminately dumping all data into a data lake without considering the end goals.
The conversation emphasizes the critical role of critical thinking in data analysis, illustrated by the cautionary tale of misinterpreting survey data average response time due to outliers. Furthermore, the discourse delves into the integration of artificial intelligence tools in data analysis, cautioning against blind reliance on AI-generated outputs without proper domain understanding. Amidst discussions on the evolving landscape with AI's potential to aid in specific tasks like data extraction from PDFs, the importance of continual skill practice and having a clear purpose for data work are underscored for successful outcomes.
The conversation explores the evolving role of artificial intelligence in data analysis, particularly in accelerating specific processes like chart creation and data extraction from PDF files. The speakers reflect on the role of AI assistance in expediting tasks and the necessity for caution due to biases and limitations inherent in AI models. The episode also probes into the implications of AI proliferation in data analysis, emphasizing the need for analytical skills and a critical understanding of data outputs to counter potential inaccuracies and biases.
The podcast offers insights on cultivating curiosity, continuous skill practice, and anchoring data work with a clear purpose for data-driven success. The importance of applying knowledge through practice projects, including solving personal data challenges, is highlighted as a key learning strategy. Furthermore, the discussion emphasizes the necessity of defining a purpose for data work, underscoring the significance of aligning data analysis efforts with tangible business goals for meaningful outcomes.
The episode concludes with a focus on the role of AI in enhancing specific tasks within data analysis and the importance of understanding its limitations. The speaker directs interested listeners to engage further through LinkedIn for updates and insights, providing a valuable resource for those seeking ongoing learning opportunities in data analysis and AI integration. Additionally, the speaker shares information about their book and availability online, offering a comprehensive platform for accessing additional resources and insights related to data analysis and AI.
“All data scientists and analysts should spend more time in the business, outside the data sets, just to see how the actual business works. Because then you have the context, and then you understand the columns you’re seeing in the data."
David Asboth, author of “Solve Any Data Analysis Problem” and co-host of the “Half Stack Data Science” podcast, shares practical tips for solving real-world data analysis challenges. He highlights the gap between academic training and industry demands, emphasizing the importance of understanding the business problem and maintaining a results-driven approach.
David offers practical insights on data dictionary, data modeling, data cleaning, data lake, and prediction analysis. We also explore AI’s impact on data analysis and the importance of critical thinking when leveraging AI solutions. Tune in to level up your skills and become an indispensable, results-driven data analyst.
Listen out for:
_____
David Asboth’s Bio
David is a “data generalist”; currently a freelance data consultant and educator with an MSc. in Data Science and a background in software and web development. With over 6 years experience teaching, he has taught everyone from junior analysts up to C-level executives in industries like banking and management consulting about how to successfully apply data science, machine learning, and AI to their day-to-day roles. He co-hosts the Half Stack Data Science podcast about data science in the real world and is the author of Solve Any Data Analysis Problem, a book about the data skills that aspiring analysts actually need in their jobs, which will be published by Manning in 2024.
Follow David:
_____
Our Sponsors
Manning Publications is a premier publisher of technical books on computer and software development topics for both experienced developers and new learners alike. Manning prides itself on being independently owned and operated, and for paving the way for innovative initiatives, such as early access book content and protection-free PDF formats that are now industry standard.
Get a 45% discount for Tech Lead Journal listeners by using the code techlead45 for all products in all formats.
Like this episode?
Show notes & transcript: techleadjournal.dev/episodes/175. Follow @techleadjournal on LinkedIn, Twitter, and Instagram. Buy me a coffee or become a patron.
Listen to all your favourite podcasts with AI-powered features
Listen to the best highlights from the podcasts you love and dive into the full episode
Hear something you like? Tap your headphones to save it with AI-generated key takeaways
Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more
Listen to all your favourite podcasts with AI-powered features
Listen to the best highlights from the podcasts you love and dive into the full episode