In this podcast, Niels Bantilan, Chief Machine Learning Engineer at Union, discusses the role of infrastructure in ML leveraging open source. Topics covered include data quality, schema definition, integrating tools like Polars, tracking data quality over time, generative DevOps, reproducibility in MLOps, and navigating edge cases.
Podcast summary created with Snipd AI
Quick takeaways
Orchestration tools like Flyte can unify open-source ML tools into a single platform for scalable data processing, experiment tracking, versioning, and monitoring.
Maintaining data quality and reproducibility in ML workloads requires tracking data lineage, managing artifact reproducibility, and considering an organization's ML practice maturity.
Leveraging ML at the infrastructure layer can optimize resource allocation by predicting requirements based on data set size and model architecture.
Deep dives
Pandera: A Data Quality Tool and its Creation
Niels Bantilan, Chief Machine Learning Engineer at Union.ai, discusses the creation of Pandera, a data quality tool. Pandera was developed to address issues such as data type errors, formatting problems, and null values in data frames. It allows users to define and visualize data schemas, making it easier to understand and work with complex data transformations. Although initially a personal project, Pandera gained traction and has since merged with Union. Niels also discusses the integration of Pandera with Polars, highlighting the benefits of the lazy frame concept in Polars for efficient data transformations. The integration with Polars has been well received by the community.
Challenges in Data Quality and Maturity Levels in ML
Niels explores the challenges of maintaining data quality and ensuring reproducibility in machine learning workloads. He highlights the importance of tracking data lineage, managing artifact reproducibility, and considering the maturity level of an organization's ML practice. Niels emphasizes that while the vision of AGI and advanced LLMs is exciting, organizations must prioritize practicality and pragmatism, focusing on their current needs and technical limitations. He highlights the ongoing evolution in MLOps and the need for orchestration layers that address scalability, iteration speed, and developer experience while still allowing flexibility and customization.
The Intersection of LLMs and Infrastructure
Niels delves into the usage of LLMs at the infrastructure layer, beyond text-to-terraform applications. He discusses the potential of leveraging ML to predict resource requirements and optimize workload execution, considering parameters such as data set size and model architecture. Niels emphasizes the need for empirical testing and research to explore these possibilities, as well as the role of orchestrators in collecting and utilizing metadata to enhance ML infrastructure and enable better resource allocation.
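As an illustrative sketch of that idea (not something described in the episode): an orchestrator that records (dataset size, peak memory) pairs from past runs could fit even a simple model to right-size future resource requests. The history values and headroom factor below are hypothetical.

```python
# Illustrative sketch: predict a task's memory request from dataset size
# using an ordinary least-squares fit over hypothetical past-run metadata.

def fit_linear(xs: list, ys: list) -> tuple:
    """Ordinary least squares for y = a * x + b."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var = sum((x - mean_x) ** 2 for x in xs)
    a = cov / var
    b = mean_y - a * mean_x
    return a, b

# Hypothetical history: dataset size (GB) -> observed peak memory (GB)
history = [(1.0, 2.1), (2.0, 4.0), (4.0, 8.2), (8.0, 16.1)]
a, b = fit_linear([s for s, _ in history], [m for _, m in history])

def predict_memory_gb(dataset_gb: float, headroom: float = 1.2) -> float:
    """Predicted peak memory plus a safety margin for the resource request."""
    return (a * dataset_gb + b) * headroom
```

The same pattern extends to richer features (model architecture, batch size) and models, which is where the empirical testing Niels calls for comes in.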
OpenAI's Dev Day and its Relation to MLOps
Niels shares his thoughts on OpenAI's Dev Day and its implications for the MLOps space. While acknowledging the potential of LLMs and token-based architectures, Niels sees the current focus of OpenAI as orthogonal to the practical challenges faced by ML practitioners and organizations. He emphasizes the need for a balance between speculative visions of AGI and the immediate requirements of production-level ML operations, highlighting the fundamental role of reproducibility, resource isolation, and maturity in real-world ML practices.
The Value of Declarative Approaches and Balance of Imperative Reasoning
Niels discusses the value of declarative approaches in both ML and real-life interactions. He highlights the need for trust, autonomy, and collaboration in working with individuals who can understand and execute declarative instructions. Niels also reflects on the importance of striking a balance between declarative and imperative reasoning in order to address edge cases, exceptions, and adaptability. He draws parallels between this approach and the LLM space, where models navigate and reason based on declared instructions to achieve desired outcomes.
MLOps podcast #197 with Niels Bantilan, Chief Machine Learning Engineer at Union: The Role of Infrastructure in ML Leveraging Open Source, brought to us by Union.
// Abstract
When we start out building and deploying models in a new organization, life is simple: all I need to do is grab some data, then iterate on a model that fits the data and performs reasonably well on some held-out test set. Then, if you're fortunate enough to get to the point where you want to deploy it, it's fairly straightforward to wrap it in an app framework and host it on a cloud server. However, once you get past this stage, you're likely to find yourself needing:
A more scalable data processing framework
Experiment tracking for models
Heavier-duty CPU/GPU hardware
Versioning tools to link models, data, code, and resource requirements
Monitoring tools for tracking data and model quality
There’s a rich ecosystem of open-source tools that solve each of these problems and more: but how do you unify all of them into a single view? This is where orchestration tools like Flyte can help. Flyte not only allows you to compose data and ML pipelines, but it also serves as “infrastructure as code,” so that you can leverage the open-source ecosystem and unify purpose-built tools for different parts of the ML lifecycle on a single platform. ML systems are not just models: they are the models, data, and infrastructure combined.
// Bio
Niels is the Chief Machine Learning Engineer at Union.ai, a core maintainer of Flyte (an open-source workflow orchestration tool), the author of UnionML (an MLOps framework for machine learning microservices), and the creator of Pandera (a statistical typing and data testing tool for scientific data containers). His mission is to help data science and machine learning practitioners be more productive.
He has a Master's in Public Health with a specialization in sociomedical science and public health informatics, and prior to that a background in developmental biology and immunology. His research interests include reinforcement learning, AutoML, creative machine learning, and fairness, accountability, and transparency in automated systems.
// MLOps Jobs board
https://mlops.pallet.xyz/jobs
// MLOps Swag/Merch
https://mlops-community.myshopify.com/
// Related Links
Website: https://github.com/cosmicBboy, https://union.ai/
Flyte: https://flyte.org/
MLOps vs ML Orchestration // Ketan Umare // MLOps Podcast #183 - https://youtu.be/k2QRNJXyzFg
--------------- ✌️Connect With Us ✌️ -------------
Join our slack community: https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register
Catch all episodes, blogs, newsletters, and more: https://mlops.community/
Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Niels on LinkedIn: https://www.linkedin.com/in/nbantilan/
Timestamps:
[00:00] Niels' preferred coffee
[00:17] Takeaways
[03:45] Shout out to our Premium Brand Partner, Union!
[04:30] Pandera
[08:12] Creating a company
[14:22] Injecting ML for Data
[17:30] ML for Infrastructure Optimization
[22:17] AI Implementation Challenges
[24:25] Generative DevOps movement
[28:27] Pushing Limits: Code Responsibility
[29:46] Orchestration in OpenAI's Dev Day
[34:27] MLOps Stack: Layers & Challenges
[42:45] Mature Companies Embrace Kubernetes
[45:29] Horizon Challenges
[47:24] Flexible Integration for Resources
[49:10] MLOps Reproducibility Challenges
[53:14] MLOps Maturity Spectrum
[57:48] First-Class Citizens in Design
[1:00:16] Delegating for Efficient Collaboration
[1:04:55] Wrap up