DataFramed cover image

DataFramed

Latest episodes

undefined
Mar 20, 2023 • 35min

#131 How the Aviation Industry Leverages Data Science

Data leaders play a critical role in driving innovation and growth in various industries, and this is particularly true in highly regulated industries such as aviation. In such industries, data leaders face unique challenges and opportunities, working to balance the need for innovation with strict regulatory requirements. This week’s guest is Derek Cedillo, who has 27 years of experience working in Data and Analytics at GE Aerospace. Derek currently works as a Senior Manager for GE Aerospace’s Remote Monitoring and Diagnostics division, having previously worked as the Senior Director for Data Science and Analytics.In the episode, Derek shares the key components to successfully managing a Data Science program within a large and highly regulated organization. He also shares his insights on how to standardize data science planning across various projects and how to get a Data Scientists to think and work in an agile manner. We hear about ideal data team structures, how to approach hiring, and what skills to look for in new hires. The conversation also touches on what responsibility Data Leaders have within organizations, championing data-driven decisions and strategy, as well as the complexity Data Leaders face in highly regulated industries. When it comes to solving problems that provide value for the business, engagement and transparency are key aspects. Derek shares how to ensure that expectations are met through clear and frank conversations with executives that try to align expectations between management and Data Science teams. Finally, you'll learn about validation frameworks, best practices for teams in less regulated industries, what trends to look out for in 2023 and how ChatGPT is changing how executives define their expectations from Data Science teams. Links to mentioned in the show:The Checklist Manifesto by Atul GawandeTeam of Teams by General Stanley McChrystalThe Harvard Data Science Review PodcastRelevant Links from DataCamp:Article: Storytelling for More Impactful Data ScienceCourse: Data Communication ConceptsCourse: Data-Driven Decision-Making for Business
undefined
Mar 13, 2023 • 50min

#130 The Path to Becoming a Kaggle Grandmaster

Oftentimes, Kaggle competitions are looked at as an excellent way for data scientists to sharpen their machine learning skills and become technically excellent. This begs the question, what are the hallmarks of high-performing Kaggle competitors? What makes a Kaggle Grand Master?Today’s guest, Jean-Francois Puget PhD, distinguished engineer at NVIDIA, has achieved this impressive feat three times. Throughout the episode, Richie and Jean-Francois discuss his background and how he became a Kaggle Grandmaster. He shares his scientific approach to machine learning and how he uses this to consistently achieve high results in Kaggle competitions.Jean-Francois also discusses how NVIDIA employs nine Kaggle Grandmasters and how they use Kaggle experiments to breed innovation in solving their machine learning challenges. He expands on the toolkit he employs in solving Kaggle competitions, and how he has achieved 50X improvements in efficiencies using tools like RAPIDS. Richie and Jean-Francois also delve into the difference between competitive data science on Kaggle and machine learning work in a real-world setting. They deep dive into the challenges of real-world machine learning, and how to resolve the ambiguities of using machine learning in production that data scientists don’t encounter in Kaggle competitions.
undefined
Mar 6, 2023 • 35min

#129 Increasing Diverse Representation in Data Science

Studies have shown that companies lacking in racial diversity also have a corresponding lack in their ability to innovate as a whole, which makes it important for any organization to prioritize an inclusive workplace culture and welcome more women and underrepresented groups in data.This is why Nikiska Alcindor's work is so vital to the future of the data science industry. Nikisha is the President and Founder of the STEM Educational Institute (SEI), a nonprofit corporation that equips underrepresented high school students with the technological skills needed to build generational wealth and be effective in the workforce. Nikisha is a strategic management leader with expertise in organizational change, investing, and fundraising. She is a  recipient of the 2021 Dean Huss Teaching Award, a board member of the Upper Manhatten Empowerment Zone, and has taught a master class at Columbia Business School as well as several guest lectures at Columbia University.Throughout the episode, we discuss SEI’s three-pillar approach to education,  the rising importance of STEM-based careers, why financial literacy is crucial to a student’s success, SEI’s partnership with DataCamp, contextualizing educational and upskilling programs to your organization’s specific population, how data leaders can positively communicate upskilling initiatives, and much more.
undefined
Feb 27, 2023 • 44min

#128 Unlocking Scalable ROI for Data Teams

In order for any data team to move from reactive to proactive and drive revenue for the business, they must make sure the basics are in place and that the team and data culture is mature enough to allow for scalable return on investment. Without these elements, data teams find themselves unable to make meaningful progress because they are stuck reacting to problems and responding to rudimentary questions from stakeholders across the organization. This quickly takes up bandwidth and keeps them from achieving meaningful ROI.In today’s episode, we have invited Shane Murray to break down how to effectively structure a data team, how data leaders can lead efficient decentralization, and how teams can scale their ROI in 2023. Shane is the Field CTO at Monte Carlo, a data reliability company that created the industry's first end-to-end Data Observability platform. Shane’s career has taken him through a successful 9-year tenure at The New York Times, where he grew the data analytics team from 12 to 150 people and managed all core data products. Shane is an expert when it comes to data observability, enabling effective ROI for data initiatives, scaling high-impact data teams, and more.Throughout the episode we discuss how to structure a data team for maximum efficiency, how data leaders can balance long-term and short-term data initiatives, how data maturity correlates to a team’s forward-thinking ability, data democratization with data insights and reporting ROI, best practices for change management, and much more.
undefined
Feb 20, 2023 • 42min

#127 How Data Scientists Can Thrive in Consulting

The most common application for data science is to solve problems within your own organization, and as professionals become more data literate, they rely less and less on others to solve their problems and unlock professional growth and career advancement.But in the world of consulting, data science is used to solve other people’s problems, which adds an additional layer of complexity since consultants aren’t always given all of the tools they need to do the job right.Enter Pratik Agrawal, a Partner at Kearney Analytics leading the automotive and industrial transportation sector. In this episode, we are taking a look at how data science is applied in the consulting industry and what skills are critical to be a successful data science consultant. As a software engineer and data scientist with over a decade of experience in the consulting world at companies like Boston Consulting Group and IRI, Pratik has a deep understanding of how to navigate the industry and how data science can be leveraged in it, as well as expertise in digital transformation projects and strategy.Throughout the episode, we discuss common problems that consultants encounter, the skills needed to be successful as a consultant, the different approaches to analytics in consulting versus in an organization, how to handle context switching when juggling multiple projects, what makes consulting feel exciting and challenging, and much more.
undefined
Feb 13, 2023 • 50min

#126 Make Your A/B Testing More Effective and Efficient

One of the toughest parts of any data project is experimentation, not just because you need to choose the right testing method to confirm the project’s effectiveness, but because you also need to make sure you are testing the right hypothesis and measuring the right KPIs to ensure you receive accurate results.One of the most effective methods for data experimentation is A/B testing, and Anjali Mehra, Senior Director of Product Analytics, Data Science, Experimentation, and Instrumentation at DocuSign, is no stranger to how A/B testing can impact multiple parts of any organization. Throughout her career, she has also worked in marketing analytics and customer analytics at companies like Shutterfly, Wayfair, and Constant Contact.Throughout the episode, we discuss DocuSign’s analytics goals, how A/B testing works, how to gamify data experimentation, how A/B testing helps with new initiative validation, examples of A/B testing with data projects, how organizations can get started with data experimentation, and much more.
undefined
Feb 6, 2023 • 41min

#125 Building Trust in Data with Data Governance

Perhaps the biggest obstacle to establishing a data culture is building trust in the data itself, making it vital for organizations to have a robust approach to data governance to ensure data quality is as high as possible.Enter Laurent Dresse, Data Governance Evangelist and Director of Professional Services at DataGalaxy. Throughout his career, Laurent has served as a bridge between IT and the rest of the business as an expert in data governance, quality, data management, and more. Throughout the episode, we discuss the state of data governance today, how data leaders and organizations can start their data governance journey, how to evangelize for data governance and gain buy-in across your organization, data governance tooling, and much more.
undefined
Feb 3, 2023 • 2min

Special Announcement!

A special announcement from the DataFramed team. Join us for RADAR, a free two-day digital event curated to equip businesses and individuals with the insights to thrive in the era data, coming to you March 22-23, 2023! Register here to secure your spot!
undefined
Jan 30, 2023 • 41min

#124 Using AI to Improve Data Quality in Healthcare

Data quality can make or break any data initiative or product. If you aren’t able to collect data that is accurate, or you have data sets that have varying structures, or are filled with typos and other issues caused by human error, then the chances drop drastically that your data models will be accurate, or even helpful.When it comes to healthcare, data quality can be an absolute nightmare. With so many different data sources, high data churn rates, and a lack of standardization in many different healthcare categories, it can seem impossible to make quality healthcare more easily accessible to people when they need it.Ribbon Health seeks to change that by using AI to improve the quality of healthcare data and create a data platform with actionable provider information including insurance coverage, prices, and performance.Today’s guests are Nate Fox, the CTO, Co-Founder, and President of Ribbon Health, and Sunna Jo, a former pediatrician who is now a data scientist at Ribbon Health.Throughout the episode, we talk about why data quality in healthcare is messy, why having context around data is necessary to interpret and utilize it properly, how healthcare providers are improving their services because of platforms like Ribbon Health, how to tackle common data cleaning problems, and much more
undefined
Jan 23, 2023 • 44min

#123 Why We Need More Data Empathy

When working with data, it’s easy for us to think about it as a mechanistic process, where data comes in and products come out. But as we’ve explored throughout the show, succeeding in data, whether you’re a data leader looking to build a data culture, a data scientist ascending the ranks, or even a policy maker looking to have an impact with data, the human side is crucial.At the heart of the “human side” is empathy— whether it’s for your stakeholders if you’re a data scientist developing a dashboard for them, empathy for your workforce if you’re a data or learning leader, or empathy for the planet and your citizens if you’re a policy maker. So how can we all practice better empathy? Specifically, can we all practice better data empathy? Luckily, empathy is a muscle that can be built. It’s not a “you have it, or you don’t” type of skill. So how can individuals and organizations utilize data empathy to improve how they work with data and the success rate of their projects?  Enter Phil Harvey, an Industrial Metaverse Architect in the Industrial Metaverse Core group at Microsoft. He is an expert in Data & AI Technical and Business Strategy & Philosophy. Harvey is also co-author of the book Data: A Guide to Humans, which explores the concept of Data Empathy, and how it can power better use of data through better communication and understanding of stakeholders in the value chain of data. 

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode