DataFramed cover image

DataFramed

Latest episodes

undefined
Sep 2, 2022 • 2min

Announcing Data Literacy Month

Taking inspiration from International Literacy Day on September 8, DataCamp is dedicating the whole month of September to raising awareness about Data Literacy. Throughout the month, we are featuring thought leaders and subject matter experts in order to get you Data Literacy, and we can’t wait for you to hear the exceptional guests we have lined up for you right here on DataFramed. Check out the full lineup of events.
undefined
Aug 29, 2022 • 41min

#102 How an Always-Learning Culture Drives Innovation at Shopify

Many times, data scientists can fall into the trap of resume-driven development. As in, learning the shiniest, most advanced technique available to them in an attempt to solve a business problem. However, this is not what a learning mindset should look like for data teams. As it turns out, taking a step back and focusing on the fundamentals and step-by-step iteration can be the key to growing as a data scientist, because when data teams develop a strong understanding of the problems and solutions lying underneath the surface, they will be able to wield their tools with complete mastery. Ella Hilal joins the show to share why operating from an always-learning mindset will open up the path to a true mastery and innovation for data teams. Ella is the VP of Data Science and Engineering for Commercial and Service Lines at Shopify, a global commerce leader that helps businesses of all size grow, market, and manage their retail operations. Recognized as a leading woman in Data science, Internet of things and Machine Learning, Ella has over 15 years of experience spanning multiple countries, and is an advocate for responsible innovation, women in tech, and STEM. In this episode, we talk about the biggest mistakes data scientists make when solving business problems, how to create cohesion between data teams and the broader organization, how to be an effective data leader that prioritizes their team’s growth, and how developing an always-learning mindset based on iteration, experimentation, and deep understanding of the problems needing to be solved can accelerate the growth of data teams.
undefined
Aug 22, 2022 • 45min

#101 How Real-Time Data Accelerates Business Outcomes

Most companies experience the same pain point when working with data: it takes too long to get the right data to the right people. This creates a huge opportunity for data scientists to find innovative solutions to accelerate that process.One very effective method is to implement real-time data solutions that can increase business revenue and make it easier for anyone relying on the data to access the data they need, understand it, and make accurate decisions with it.George Trujillo joins the show to share how he believes real-time data has the potential to completely transform the way companies work with data. George is the Principal Data Strategist at DataStax, a tech company that helps businesses scale by mobilizing real-time data on a single, unified stack. With a career spanning 30 years and companies like Charles Schwab, Fidelity Investments, and Overstock.com, George is an expert in data-driven executive decision-making and tying data initiatives to tangible business value outcomes.In this episode, we talk about the real-world use cases of real-time analytics, why reducing data complexity is key to improving the customer experience, the common problems that slow data-driven decision-making, and how data practitioners can start implementing real-time data through small high-value analytical assets.
undefined
Aug 15, 2022 • 52min

#100 Embedded Machine Learning on Edge Devices

Machine learning models are often thought to be mainly utilized by large tech companies that run large and powerful models to accomplish a wide array of tasks. However, machine learning models are finding an increasing presence in edge devices such as smart watches. ML engineers are learning how to compress models and fit them into smaller and smaller devices while retaining accuracy, effectiveness, and efficiency. The goal is to empower domain experts in any industry around the world to effectively use machine learning models without having to become experts in the field themselves. Daniel Situnayake is the Founding TinyML Engineer and Head of Machine Learning at Edge Impulse, a leading development platform for embedded machine learning used by over 3,000 enterprises across more than 85,000 ML projects globally. Dan has over 10 years of experience as a software engineer, which includes companies like Google (where he worked on TensorFlow Lite) and Loopt, and co-founded Tiny Farms America’s first insect farming technology company. He wrote the book, "TinyML," and the forthcoming "AI at the Edge". Daniel joins the show to talk about his work with EdgeML, the biggest challenges facing the field of embedded machine learning, the potential use cases of machine learning models in edge devices, and the best tips for aspiring machine learning engineers and data science practitioners to get started with embedded machine learning.
undefined
Aug 8, 2022 • 34min

#99 Post-Deployment Data Science

Many machine learning practitioners dedicate most of their attention to creating and deploying models that solve business problems. However, what happens post-deployment? And how should data teams go about monitoring models in production? Hakim Elakhrass is the Co-Founder and CEO of NannyML, an open-source python library that allows users to estimate post-deployment model performance, detect data drift, and link data drift alerts back to model performance changes. Originally, Hakim started a machine learning consultancy with his NannyML co-founders, and the need for monitoring quickly arose, leading to the development of NannyML. Hakim joins the show to discuss post-deployment data science, the real-world use cases for tools like NannyML, the potentially catastrophic effects of unmonitored models in production, the most important skills for modern data scientists to cultivate, and more.
undefined
Aug 1, 2022 • 51min

#98 Interpretable Machine Learning

One of the biggest challenges facing the adoption of machine learning and AI in Data Science is understanding, interpreting, and explaining models and their outcomes to produce higher certainty, accountability, and fairness. Serg Masis is a Climate & Agronomic Data Scientist at Syngenta and the author of the book, Interpretable Machine Learning with Python. For the last two decades, Serg has been at the confluence of the internet, application development, and analytics. Serg is a true polymath. Before his current role, he co-founded a search engine startup incubated by Harvard Innovation Labs, was the proud owner of a Bubble Tea shop, and more. Throughout the episode, Serg spoke about the different challenges affecting model interpretability in machine learning, how bias can produce harmful outcomes in machine learning systems, the different types of technical and non-technical solutions to tackling bias, the future of machine learning interpretability, and much more.
undefined
Jul 25, 2022 • 44min

#97 How Salesforce Created a High-Impact Data Science Organization

Anjali Samani, Director of Data Science & Data Intelligence at Salesforce, joins the show to discuss what it takes to become a mature data organization and how to build an impactful, diverse data team. As a data leader with over 15 years of experience, Anjali is an expert at assessing and deriving maximum value out of data, implementing long-term and short-term strategies that directly enable positive business outcomes, and how you can do the same. You will learn the hallmarks of a mature data organization, how to measure ROI on data initiatives, how Salesforce implements its data science function, and how you can utilize strong relationships to develop trust with internal stakeholders and your data team.
undefined
Jul 18, 2022 • 1h 4min

#96 GPT-3 and our AI-Powered Future

In 2020, OpenAI launched GPT-3, a large language AI model that is demonstrating the potential to radically change how we interact with software, and open up a completely new paradigm for cognitive software applications. Today’s episode features Sandra Kublik and Shubham Saboo, authors of GPT-3: Building Innovative NLP Products Using Large Language Models. We discuss what makes GPT-3 unique, transformative use-cases it has ushered in, the technology powering GPT-3, its risks and limitations, whether scaling models is the path to “Artificial General Intelligence”, and more. Announcement For the next seven days, DataCamp Premium and DataCamp for Teams are free. Gain free access by following going here. 
undefined
Jul 11, 2022 • 39min

#95 How to Build a Data Science Team from Scratch

While leading a mature data science function is a challenge in its own right, building one from scratch at an organization can be just as, if not even more, difficult. As a data leader, you need to balance short-term goals with a long-term vision, translate technical expertise into business value, and develop strong communication skills and an internalized understanding of a business's values and goals in order to earn trust with key stakeholders and build the right team. Elettra Damaggio is no stranger to this process. Elettra is the Director for Global Data Science at StoneX, an institutional-grade financial services network that connects clients to the global markets ecosystem. Elettra has over 10 years of experience in machine learning, AI, and various roles within digital transformation and digital business growth. In this episode, she shares how data leaders can balance short-term wins with long-term goals, how to earn trust with stakeholders, major challenges when launching a data science function, and advice she has for new and aspiring data practitioners.
undefined
Jul 4, 2022 • 40min

#94 How Data Science Enables Better Decisions at Merck

In pharmaceuticals, wrong decisions can not only cost a company revenue, but they can also cost people their lives. With stakes so high, it’s vital that pharmaceutical companies have robust systems and processes in place to accurately gather, analyze, and interpret data and turn it into actionable steps to solving health issues. Suman Giri is the Global Head of Data Science of the Human Health Division at Merck, a biopharmaceutical research company that works to develop innovative health solutions for both people and animals. Suman joins the show today to share how Merck is using data to improve organizational decision-making, medical research outcomes, and how data science is transforming the pharmaceutical industry at scale. He also shares some of the biggest challenges facing the industry right now and what new trends are on the horizon.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode