Nathan Ryan Frank, former astrophysicist turned data scientist and machine learning engineer, discusses challenges and solutions when operationalizing machine learning systems. Topics include team dynamics, communication issues, approaching MLOps from a DevOps perspective, and practical guidance for newcomers to MLOps. The podcast also delves into the speaker's experience building telescopes, transitioning to industry and basketball analytics, and the importance of testing in software development. They also share thrilling hiking adventures.
Read more
AI Summary
AI Chapters
Episode notes
auto_awesome
Podcast summary created with Snipd AI
Quick takeaways
Close collaboration between data science and engineering teams, enabled by shared context and communication tools, is crucial for the success of ML Ops projects.
Early testing in the ML development process helps ensure system functionality and accuracy, facilitates collaboration, and identifies and addresses issues.
Building a fully integrated team with diverse skills, including data scientists, ML engineers, software engineers, and product owners, leads to more effective ML projects and bridges the gap between different roles.
Deep dives
Building a ML Ops Team
The development of an ML Ops team at Stats Perform was a success thanks to a fully integrated team that included data scientists, ML engineers, software engineers, a technical lead, an engineering manager, and a product owner. The team worked together closely, sharing context and collaborating on defining stories, prioritizing work, and daily stand-ups. This close collaboration reduced the gap between data science and engineering, fostering a shared language and understanding. The team's success was also attributed to the use of feature stores and other tools that facilitated communication and minimized translation between team members.
Importance of Testing in ML
Testing plays a crucial role in ML systems and can save significant time and effort in the long run. Writing tests early in the development process allows for more time to focus on building the models and algorithms. Tests provide a safety net and help ensure the functionality and accuracy of the system. They also facilitate collaboration between data scientists and engineers, providing a shared understanding of the codebase. Testing provides cognitive feedback, a sense of accomplishment when tests pass, and helps developers identify and address issues before they escalate.
Benefits of Fully Integrated Teams
Building a fully integrated team with data scientists, ML engineers, software engineers, and product owners can lead to successful ML projects. By breaking down silos and fostering collaboration, teams can develop a shared vocabulary and understanding. Having a mix of skills within the team allows for better communication and reduces the need for translation between roles. Integrated teams can also iterate quickly, improve knowledge sharing, and leverage each other's expertise. This approach helps bridge the gap between data science, ML engineering, and software engineering, leading to more effective and efficient ML projects.
Fostering Shared Language and Context
Creating a shared language and context among different teams and stakeholders in ML projects is essential. Fully integrated teams with clear roles and responsibilities help minimize the communication gap. Collaborative processes such as defining stories, prioritizing work, and daily stand-ups ensure alignment and shared understanding. Tools like feature stores can also facilitate communication and understanding between data scientists and engineers. Breaking down barriers and promoting shared language and context result in more effective collaboration, shared ownership, and smoother ML project outcomes.
Developing ML Systems with ML Engineers
ML engineers play a crucial role in developing ML systems by bridging the gap between data science and engineering. ML engineers bring their expertise to areas such as model deployment, model serving, and system architecture. Collaboration between data scientists and ML engineers helps in building robust ML systems that are scalable, maintainable, and reliable. By understanding the requirements, triggers, and implications of ML models, ML engineers can contribute valuable ideas and insights to the overall system design. This collaboration leads to improved communication, efficiency, and success in ML projects.
Nathan Ryan Frank is the Machine Learning Operations and platform Director of Grainger. Former Astrophysicist turned data scientist and machine learning engineer with a proven history of delivering results into production across a wide variety of domains while leading projects with international, cross-functional teams.
MLOps podcast #199 with Nathan Ryan Frank, Director, Machine Learning Platform & Operations at WW Grainger, Challenges Operationalizing Machine Learning (And Some Solutions).
// Abstract
This talk details some common challenges and pitfalls when attempting to operationalize machine learning systems and discusses some simple solutions. We dive into the machine learning development workflow and cover topics such as team dynamics, communication issues between roles that don't share a common language, and approaching MLOps from an SRE/DevOps perspective. Similarly, the talk highlights some challenges unique to operationalizing machine learning, drawing distinctions where necessary to highlight a large amount of similarity. Finally, the talk offers some simple and practical guidance for those new to MLOps who want to understand where to start and how to adopt best practices in an evolving field.
// Bio
Nathan Frank is currently the Director of Machine Learning Platform and Operations at Grainger where he is building a team to support the Technology Group's expanding machine learning efforts. Prior to joining Grainger, Nathan led machine learning engineering efforts at Strong Analytics, a boutique data science and machine learning consulting firm, as well as machine learning platform and development teams at Stats Perform, a leader in sports data and technology. Nathan holds bachelor's and master's degrees in Astrophysics from UC - Santa Cruz and UNC-Chapel Hill, respectively. When not building machine learning systems, Nathan spends as much time as possible with his favorite person in the world, his wife, as well as their four kids and two dogs, and enjoys getting outside to hike or garden and baking bread.
// MLOps Jobs board
https://mlops.pallet.xyz/jobs
// MLOps Swag/Merch
https://mlops-community.myshopify.com/
// Related Links
Website: https://nrfrank.github.io/
Bisi: https://bisi.gitbook.io/bisi/
--------------- ✌️Connect With Us ✌️ -------------
Join our slack community: https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register
Catch all episodes, blogs, newsletters, and more: https://mlops.community/
Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Nathan on LinkedIn: https://www.linkedin.com/in/nrfrank
Timestamps:
[00:00] Nathan's preferred coffee
[00:40] Takeaways
[02:00] Please leave a review in our comment sections! Please like, share, and subscribe to our MLOps channels!
[03:00] Telescope for gamma-ray burst
[07:31] Transition into ML
[11:23] Stats-heavy US sports commentary
[14:25] Building machine learning systems approach
[20:02] ML Workflow Must-Haves
[26:50] Love for tests
[33:10] Test Writing Importance
[34:37] Bridging Stakeholder Language Gap
[43:04] Shared Language, Team Collaboration
[47:28] Rapid fire questions
[51:20] Wrap up
Get the Snipd podcast app
Unlock the knowledge in podcasts with the podcast player of the future.
AI-powered podcast player
Listen to all your favourite podcasts with AI-powered features
Discover highlights
Listen to the best highlights from the podcasts you love and dive into the full episode
Save any moment
Hear something you like? Tap your headphones to save it with AI-generated key takeaways
Share & Export
Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more
AI-powered podcast player
Listen to all your favourite podcasts with AI-powered features
Discover highlights
Listen to the best highlights from the podcasts you love and dive into the full episode