MLOps.community

Demetrios

Relaxed Conversations around getting AI into production, whatever shape that may come in (agentic, traditional ML, LLMs, Vibes, etc)

Episodes

Mentioned books

May 21, 2021 • 52min

Scaling AI in production // Srivatsan Srinivasan // MLOps Coffee Sessions #40

Coffee Sessions #40 with Srivatsan Srinivasan of AIEngineering, Scaling AI in Production. //Abstract //Bio 20+ years of intense passion for building data-driven applications and products for top financial customers. Srivatsan has been a trusted advisor to a senior-level executive from business and technology, helping them with complex transformation in the data and analytics space. Srivatsan also run a YouTube Channel (AIEngineering) where he talks about data, AI and MLOps. //Takeaways Understand the role and need of MLOps Prioritize MLOps capability Model deployment Importance of K8s //Other Links AI and MLOps free courses - https://github.com/srivatsan88 Youtube channel: bit.ly/AIEngineering --------------- ✌️Connect With Us ✌️ ------------- Join our slack community: https://go.mlops.community/slack Follow us on Twitter: @mlopscommunity Sign up for the next meetup: https://go.mlops.community/register Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vrachakonda/ Connect with Srivatsan on LinkedIn: https://www.linkedin.com/in/srivatsan-srinivasan-b8131b/ Timestamps: [00:00] Introduction to Srivatsan Srinivasan [01:41] Background on Youtube AIEngineering [03:17] Tips on learning MLOps and start with the field [06:00] "Focus on your key challenges and that will drive your capability that you need to implement." [06:50] Tips on starting CI/CD [08:46] "Start with DevOps and see what additional capabilities you will require for the Machine Learning aspect of it." [09:24] Staying general in different environments [10:43] "Focus on the core concepts of it. The concepts are similar." [12:10] Testing systems robustly [20:00] Trends within MLOps space [20:31] "Everybody can fail fast but you need to fail smart because Machine Learning is a huge investment." [23:21] GCP Auto ML [26:54] Deployment [27:06] "It's not only the tools, but it's also the patterns." [29:34] Kubernetes perspective [31:21] Favorite model release strategy [36:22] Annotation, labeling, and concept of ground truth [38:10] Best practices in Architecture and systems design in the context of ML [41:29] "You learn a lot, at the same time the complexity also increases, so work with multiple teams in this process to learn it." [42:35] "Your speed increases based on the way you envision your architecture." [42:55] Software engineering lifecycle vs machine learning development life cycle [44:55] Youtube experience [45:50] "My focus has always been from intermediate to experts." [46:24] Content creation [47:17] "You cannot do everything in MLOps at one stretch. You have to see what is critical for you." [47:23] "For me, continuous training is not that critical because I don't want to take the freedom out of the data scientists." [48:31] New contents planned [48:40] IoT and Edge Analytics - Predictive maintenance [50:21] "It's a two-way process. I learn then I teach."

May 18, 2021 • 54min

MLOps: A leader's perspective // Stephen Galsworthy // MLOps Coffee Sessions #39

Coffee Sessions #39 with Stephen Galsworthy of Quby, MLOps: A leader's perspective. //Abstract //Bio Dr. Stephen Galsworthy is a data leader skilled at building high-performing teams and passionate about developing data-powered products with lasting impact on users, businesses, and society. Most recently he was the Chief Data and Product Officer at Quby, an Amsterdam-based tech company offering data-driven energy services. He oversaw its transformation from a hardware-based business to a digital organization with data and AI at its core. He put in place a central cloud-based data infrastructure and unified analytics platform to collect and take advantage of petabytes of IoT data. His team deployed real-time monitoring and energy insight services for 500k homes across Europe. Stephen has a Master’s degree and Ph.D. in Mathematics from Oxford University and has been leading data science teams since 2011. //Takeaways MLOps as a process, people, and technological problem. Experiences from a team working at the forefront of data and AI. --------------- ✌️Connect With Us ✌️ ------------- Join our slack community: https://go.mlops.community/slack Follow us on Twitter: @mlopscommunity Sign up for the next meetup: https://go.mlops.community/register Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vrachakonda/ Connect with Stephen on LinkedIn: https://www.linkedin.com/in/galsworthy/

May 14, 2021 • 50min

Learnings from Live Coding: An MLOps Project on Twitch // Felipe Campos Penha // MLOps Meetup #63

MLOps community meetup #63! Last Wednesday we talked to Felipe Campos Penha, Senior Data Scientist, Cargill. //Abstract Can one learn anything useful by creating content online? The usual answer is a sounding YES. But what about live coding an MLOps project on Twitch? Can anything good come out of it? //Bio Felipe Penha creates content about Data Science regularly on the Data Science Bits channel on YouTube and Twitch. He has 8+ years of experience with hands-on data-related work, starting with his doctorate in Astroparticle Physics. His career in the private sector has been devoted to bringing value to various segments of the Food and Beverages Industry through the use of Analytics and Machine Learning. ----------- Connect With Us ✌️------------- Join our Slack community: https://go.mlops.community/slack Follow us on Twitter: @mlopscommunity Sign up for the next meetup: https://go.mlops.community/register Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Felipe on LinkedIn: https://www.linkedin.com/in/fcpenha/ Timestamps: [00:00] Introduction to Felipe Campos Penha [01:30] Felipe's background [05:36] Developing models in physics vs developing models for companies [08:07] Felipe's transition from Jupyter Notebook to Operational ML [09:34] "The thought of business basically for customers, they always wanted to see the value and try to roll out more manual work like spreadsheets so they could try out that model on the field." [12:07] Felipe's software engineering development learning [14:10] Catalyst on Youtube and Twitch [18:06] Elements of Twitch [20:02] Non-polished versions of Twitch [21:16] "Twitch was not made for coding, it was for gamers." [26:17] Felipe's audience impact on Twitch [28:02] Logistical pieces [30:43] Words of wisdom on live streaming [30:56] "Don't be afraid to start. There are many streamers that are actually learning from scratch and they are showing the process of learning online. They are learning faster because the help is faster." [33:16] Blog post as other means to Twitch [33:50] "I'm a perfectionist when I'm writing. The shortest it is, the hardest it could get. You want to polish it a lot to make nice figures. I learned a lot but for me, I feel that process is too slow because you're thinking about one subject for a long time trying to polish it while in live streaming, it's very dynamic and fast." [34:25] Twitch affecting Felipe's career [36:36] "Exposing yourself, showing your mistakes, vocalizing your thoughts, I think all of this makes you a better programmer." [37:12] Getting through a problem [39:41] Recommended streamers that caught Felipe's interest [41:00] Community aspect and importance of Twitch [42:42] Role of community on Twitch [45:16] "Twitch is becoming such a trend that even companies are following."

May 10, 2021 • 56min

Law of Diminishing Returns for Running AI Proof-of-Concepts // Oguzhan Gencoglu // MLOps Meetup #62

MLOps community meetup #62! Last Wednesday we talked to Oguzhan Gencoglu, Co-founder & Head of AI, Top Data Science. //Abstract Starting the AI adoption with AI Proof-of-Concepts (PoCs) is the most common choice for most companies. Yet, a significant percentage of AI PoCs do not make it into production whether they were successful or not. Furthermore, running yet another AI PoC follows the law of diminishing returns in various aspects. This talk will revolve around this theme. //Bio Oguzhan "Ouz" Gencoglu is the Co-founder and Head of AI at Top Data Science, a Helsinki-based AI consultancy. With his team, he delivered more than 70 machine learning solutions in numerous industries for the past 5 years. Before that, he used to conduct machine learning research in several countries including the USA, Czech Republic, Turkey, Denmark, and Finland. Oguzhan has given more than 40 talks on machine learning to audiences of various backgrounds. ----------- Connect With Us ✌️------------- Join our Slack community: https://go.mlops.community/slack Follow us on Twitter: @mlopscommunity Sign up for the next meetup: https://go.mlops.community/register Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Oguzhan on LinkedIn: https://www.linkedin.com/in/ogencoglu/ Timestamps: [00:00] Introduction to Oguzhan Gencoglu [00:47] Ouz's background [01:47] Recurring/repetitive problem patterns [03:16] "When you solve a repetitive task in an automatic way, that's Scalability." [04:32] Evolution expected of Machine Learning [05:10] "People are quite confused about the titles and what's worst, those titles don't have a common definition in different companies. If you feel a little bit overwhelmed, that's normal." [08:04] Proof-of-Concepts [10:35] Successful PoCs but not Productionized [16:03] Productionize as soon as possible [16:47] "In your Proof-of-Concepts, it's not only technical, but it's also a mindset." [20:00] Framework of a successful PoCs [24:28] Taking too much on PoCs [28:05] Proof-of-Concepts after Proof-of-Concepts and Proof-of-Concepts hell [31:30] Wholistic view [34:00] Operationalizing PoCs [37:17] "The teams also need to adjust themselves to these new tools, new paradigms, and the different needs of the whole industry." [37:26] Horror stories [39:54] Open communication tips 43:31] "Open communication should not only be from the technical perspective but also down to the business and strategy perspective." [44:20] Translation tips [44:39] "I believe the most crucial part of today's ML scientists' role is not building a machine learning model but translating a real-life problem into a machine learning problem. It's crucial because it's a scarce talent and skill." [49:30] Realistic budget for small PoCs [50:18] "You need at least 1 month of work of proof of value but that doesn't mean things will go to production." [51:40] Understanding the questions fully [52:55] "That translation skill is the greatest skill to have in this industry because you can't auto ML that or whatever. It stands the test of time because that will be needed all the time."

May 7, 2021 • 55min

Organisational Challenges of MLOps // Adam Sroka // MLOps Coffee Sessions #38

Coffee Sessions #38 with Adam Sroka of Origami Energy, Organisational Challenges of MLOps. //Abstract Deploying data science solutions into production is challenging for both small and large organizations. From platform and tooling wars to architecture and design pattern trade-offs it can get overwhelming for inexperienced teams. Furthermore, many organizations will only go through the painful discovery process once. Adam will share some of his experiences from consulting and leading data teams to successfully deploying machine learning solutions, highlighting some of the more difficult challenges to overcome. You might not be surprised to hear it’s not all down to the tech. //Bio Dr. Adam Sroka, Head of Machine Learning Engineering at Origami Energy, is an experienced data and AI leader helping organizations unlock value from data by delivering enterprise-scale solutions and building high-performing data and analytics teams from the ground up. Adam shares his thoughts and ideas through public speaking, tech community events, on his blog, and in his podcast. --------------- ✌️Connect With Us ✌️ ------------- Join our slack community: https://go.mlops.community/slack Follow us on Twitter: @mlopscommunity Sign up for the next meetup: https://go.mlops.community/register Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vrachakonda/ Connect with Adam on LinkedIn: https://www.linkedin.com/in/aesroka/ Timestamps: [00:00] Introduction to Adam Sroka [01:53] Adam's background in tech [08:06] 2 blog posts of Adam: Why So Many Data Scientists Quit Good Jobs at Great Companies and Why You Shouldn’t Hire More Data Scientists [08:31] High turn rate Adam has in the data science role [13:50] Avoiding hiring talents with deficits and coaching people [16:05] "I can't teach you to care about the standard of your core of what you're doing. It's quite hard to teach people charisma. Everything else, you pick up." [16:45] Resume-driven development, the idea of not playing the game, and politics in the workplace. [17:57] "You have to realize, other people, don't have the same experience in the context that you do." [19:59] Exit, Voice, Loyalty and Neglect Model [22:35] You probably don't need a data scientist [23:40] "Data scientists can do everything slower and more expensively than everyone else but they can do everything and that's the important bit." [27:54] "My success is just driven by who I am as much as what I can do." Vishnu [28:24] Being Candor [30:37] Disconnect between the senior stakeholders and data scientists [32:30] "Before you come out to bring someone in some expensive talent search, engage with the consultancy. Do a four-week PRC, get them to tell you like." [34:18] Educational experiences as a consultant [37:35] Adam's journey into MLOps, productionize ML models when you are a data scientist and tips [43:16] "Beginners can help beginners. Your perspective is really important. The value is not in the content. The value is in your perspective of the content." [45:21] Educating clients on uncertainty [48:34] Decision making process [52:32] Organizational problems [53:43] "All models are wrong, but some are useful." George Box

May 3, 2021 • 53min

From Idea to Production ML // Lex Beattie - Michael Munn - Mike Moran // MLOps Meetup #61

MLOps community meetup #61! Last Wednesday we talked to Lex Beattie, Michael Munn, and Mike Moran. //Abstract We started out talking about some of the main bottlenecks they have encountered over the years of trying to push data products into production environments. Then things started to heat up as we dove into the topic of monitoring ML and inevitably the word explainability started being thrown around. Turns out Lex is currently doing a Ph.D. on the subject so there was much to talk about. We had to ask if explainability is now table-stakes when it comes to monitoring solutions on the market? The short answer from the team. Yes! Please excuse the bit of sound trouble we had with google mike at the beginning. //Bio Lex Beattie - ML Engagement Lead, Spotify In the last year, Lex has helped over 40 different teams across Spotify understand ML best practices, productionize ML workflows and implement impactful ML in their products. Lex is also a Ph.D. candidate at the University of Oklahoma, focusing on feature importance and interpretability in deep neural networks. Beyond her passion for all things ML, she enjoys exploring the great outdoors in Montana with her German Wirehaired Pointer, Bridger. Michael Munn - ML Solutions Engineer, Google Michael is an ML Solutions Engineer with Google Cloud and Google's Advanced Solutions Lab. In his role, he works with customers to build and deploy end-to-end ML solutions with Google Cloud. Within the Advanced Solutions Lab, he teaches these skills to customers. Mike Moran - Principal Engineer, Skyscanner Mike has worked across many dimensions; in large/tiny companies, back-end/front-end, with many languages, and as a sys-admin /engineer/manager. Mike has a healthy skepticism for most things and likes solving problems through applying System thinking. Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Lex on LinkedIn: https://www.linkedin.com/in/lexbeattie/ Connect with Michael on LinkedIn: https://www.linkedin.com/in/munnm/ Connect with Mike on LinkedIn: https://www.linkedin.com/in/mrmikemoran/ Timestamps: [00:00] Introduction to Lex, Michael, & Mike [02:46] Common roadblocks [05:25] Consolidating knowledge [07:02] Bottlenecks on failures [09:58] Don't go on a detour [12:22] Bringin on complexity signs [19:33] Explainable AI [21:34] "There are different ways to approach Explainable AI. It starts to get more complicated when you start working with more complicated models." Lex [24:43] "If there are a lot of disparate sources out there about Explainability, I'd found myself hunting down various resources to simplify it for customers I'd worked with." Michael [26:46] "Being clear about who you're explaining it to because in our context, sometimes the organization needs to explain it to a regulator." Mike [28:04] Monitoring solution [31:00] ML Canvas [33:24] Explainable AI Resources [34:48] Explainable Predictions by Michael [36:48] Purpose of Explainable Model [39:40] Work in the same language [42:46] Use of War Stories [49:11] Hot seat! [49:15] Mike - Skyscanner pricing [50:30] Lex - Spotify recommendation sudden stop [51:35] Michael - NLP models on emails

Apr 30, 2021 • 1h 1min

Deploying Machine Learning Models at Scale in Cloud // Vishnu Prathish // MLOps Meetup #60

MLOps community meetup #60! Last Wednesday we talked to Vishnu Prathish, Director Of Engineering, AI Products, Innovyze. //Abstract The way Data Science is done is changing. Notebook sharing and collaboration were messy and there was minimal visibility or QA into the model deployment process. Vishnu will talk about building an ops platform that deploys hundreds of models at-scale every month. A platform that supports typical features of MLOps (CI/CD, Separated QA, Dev and PROD environment, experiments tracking, Isolated retraining, model monitoring in real-time, Automatic Retraining with live data) and ensures quality and observability without compromising the collaborative nature of data science. //Bio With 10 years in building production-grade data-first software at BBM & HP Labs, I started building Emagin's AI platform about three years ago with the goal of optimizing operations for the water industry. At Innovyze post-acquisition, we are part of the org building world-leading water infrastructure data analytics product. //Takeaways Why is MLOps necessary for model building at scale? What are various cloud-based models for MLOps? Where can ops help in various points in the ML pipeline Data Prep, Feature Engineering, Model building, Training, Retraining, Evaluation and inference ----------- Connect With Us ✌️------------- Join our Slack community: https://go.mlops.community/slack Follow us on Twitter: @mlopscommunity Sign up for the next meetup: https://go.mlops.community/register Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/ Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vishnuprathish/ Timestamps: [00:00] Introduction to Vishnu Prathish [00:16] Vishnu's background [04:18] Use cases on wooden pipes for freshwater [04:55] Virtual representation of actual, physical, tangible assets [06:56] Platform built by Vishnu [08:30] Build a reliable representation of network [11:52] Pipeline architecture [16:17] "MLOps is still an evolving discipline. You need to try and fail many times before you figure out what's right for you." [17:11] Open-sourcing [18:17] Platform for virtual twin [20:02] Entirely Amazon Stagemaker [20:43] Data quality issues [23:21] Reproducibility [23:40] "Reproducibility is important for everybody. Most of the frameworks do that for you." [25:00] Reproducibility as Innovyze's core business. [26:38] Each model is individual to each customer [27:50] Solving reproducibility problems [28:24] "Reproducibility applies to the process of training pipelines. It starts with collected from historical raw data from customers. In real-time, there's also this data being collected directly from sensors coming from a certain pipeline." [31:55] "Reusable training is step one to attaining automated retraining." [32:17] Collaboration of Vishnu's team [36:23] War stories [41:36] Data prediction [44:24] "A data scientist is the most expensive hire you can make." [47:55] 3 Tiers [48:53] MLOps problems [52:25] Automatically retraining [52:34] "Because of the numbers of models that go through this pipeline, it's impossible for somebody to manually monitor and retrain as necessary. It's not easy, it takes a lot of time." [54:22] Metrics on retraining [56:42] "Retraining is a little less prevalent for our industry compared to a turned prediction model that changes a lot. There are external factors that depend on it but a pump is a pump."

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app