

Data Driven
Data Driven
Data Driven: the podcast where we explore the emerging field of Data Science. We bring the best minds in Data, Software Engineering, Machine Learning, and Artificial Intelligence right to you every Tuesday.
The field of data science mashes up the worlds of statistics, database architecture and software engineering. Data Scientist has been labelled by the Harvard Business Review, as "the sexiest job of the 21st century." A quick search of job search sites reveal that this field is in high demand.
In a world where Data is the new Oil, Data Science the new Refineries, consider this Car Talk for the Data Age. Every week we bring the best minds in this emerging field straight to you. Our goal is to educate and inspire our listeners so that they can be prepared to thrive in a Data Driven world.
The field of data science mashes up the worlds of statistics, database architecture and software engineering. Data Scientist has been labelled by the Harvard Business Review, as "the sexiest job of the 21st century." A quick search of job search sites reveal that this field is in high demand.
In a world where Data is the new Oil, Data Science the new Refineries, consider this Car Talk for the Data Age. Every week we bring the best minds in this emerging field straight to you. Our goal is to educate and inspire our listeners so that they can be prepared to thrive in a Data Driven world.
Episodes
Mentioned books

Feb 1, 2024 • 53min
Devvret Rishi on Powering Real-World AI with Declarative AI and Open Source
In this episode, Frank sits down and talks with Devvret Rishi on powering real-world AI projects with declarative ML and the importance of open source.Andy was not able to attend this recording, but will be back next week!Show Notes04:36 Build, train, serve, deploy; critical data engineering link.07:24 Model configuration for input output prediction summaries.11:05 Saw spike and heavy churn after rollout.16:21 Advancements in AI: use pre-trained deep learning models.19:38 Trends for Gen AI: creative use cases, specialized APIs.21:31 Questioning a sales tactic and legal concerns.25:58 People can introspect, edit, and change models.30:02 Early data science projects led to passion.31:24 Cybersecurity and AI partnership driving industry innovation.33:58 Understanding randomness as a valuable model feature.39:39 Technology provides accessible, shared experiences in AI.41:51 Technology as a companion for psychological support.44:06 Immigration experience from India to Silicon Valley.47:59 Unexpected culture shock from Bay Area to Boston.50:40 Easily learn with hands-on prediabase.com access.Speaker BioDevvret Rishi is a co-founder of Prediabase, a platform that helps engineers and developers productionize open source AI. The idea for Prediabase came from Rishi's co-founder Piero's experience at Uber, where he noticed that he was constantly reinventing the wheel with each new machine learning project. To streamline the process, he created a tool called Ludwig, which eventually became popular at Uber and was open sourced. Rishi's work with Prediabase has revolutionized the way AI is developed and implemented in engineering teams around the world.

Jan 16, 2024 • 36min
Blake Reichenbach on Marketing, Curiosity, and the Love of Books
In this episode, the Frank and Andy are joined by special guest Blake Reichenbach, a product manager at HubSpot and the owner of Howdy Curiosity, an online nonfiction bookstore and learning community. The conversation dives into the intersection of data, AI, and the love of books, as they discuss the next steps in managing and mitigating the hallucination part of AI technology, the importance of human interaction with AI tools, and finding the right balance in user experience. Blake shares his insights on integrating AI into HubSpot's platform, emphasizing the need for a balanced approach, and the pitfalls of solely relying on generative AI tools in marketing. Stay tuned as they also touch on personal matters, career transitions, and the rapid evolution of technology. This episode is packed with valuable insights and engaging conversations - you won't want to miss it!Show Notes00:00 HubSpot is a leading CRM platform.05:44 New AI features for CMS and websites.09:33 Gen AI tools need to prioritize meaningful data.11:34 Summary: Suggesting blending human and AI for success.15:34 ML models need precise training on nuanced datasets.17:13 Content marketing: human connection, AI balance, user experience.21:24 Approach content marketing like a multi-bandit test.26:56 Selling nonfiction books online and sharing recommendations.27:54 Rapid tech evolution creating excitement and challenges.30:56 Balancing work and entrepreneurship for personal growth.35:24 Thanks Frank, Andy, and Blake for amazing show.Speaker BioBlake Reichenbach is a proud employee of HubSpot, a leading customer relationship management platform for scaling companies. With a focus on the CMS aspect of the platform, Blake is passionate about helping businesses with their front office needs, including marketing, sales, service, and data operations. With a bias towards HubSpot, Blake believes in the product and the company, and recommends it highly for businesses looking to streamline their operations.

Jan 3, 2024 • 48min
Max Sklar on Exploring AI, Data Science, and Local Search
In today's episode, the hosts Frank La Vigne and Andy Leonard are joined by the expert in location data and machine learning, Max Sklar. Max shares insights from his decade-long tenure at Foursquare, delving into the company's evolution, gamification features, the challenges faced in the local search space, and his early interest in location data. The conversation explores the enduring relevance of foundational tech concepts, the cyclical nature of technology trends, and Max's personal journey into data and machine learning. Max also discusses his podcast, "The Local Maximum," and his diverse interests, including abstract math papers and a project rewriting the US Constitution. Join us as we dive into a thought-provoking discussion about AI, data science, and the ever-evolving world of technology with Max Sklar.Show Notes00:00 Foursquare split, confused but loved the concept.04:29 Rewards program failed due to lack of scalability.08:44 Early career in New York City's tech boom.13:05 Foursquare uses phone data to track locations.16:25 Models analyzed data to improve sentiment analysis.20:02 Data pipeline technology used for real-time deployment.20:54 Python written code, comparing different languages used.24:17 Navigating reinvention in a changing world.29:38 Joined wireless generation, now known as Amplify, as a software engineer.31:53 Machine learning brings data to life.34:26 Using OpenAI API to create interactive content.40:03 Technology enables limitless creativity and storytelling potential.42:12 Enjoys volunteering in underserved communities around the world.44:36 Extensive library and website featuring various projects.47:48 Please subscribe, rate, and review our podcast.

Dec 4, 2023 • 49min
Navigating the Complexity of Operationalizing ML Models
In this episode of Data Driven, our Andy Leonard and Frank La Vigne are joined by Chris McDermott, VP of Engineering at Wallaroo.AI. Together, they explore the challenges and advancements in the ever-evolving world of machine learning and artificial intelligence.From the importance of ongoing care for machine learning models to the rise of edge computing and decentralized networks, they touch on the critical need for flexibility and data privacy. Chris shares his insights on the technical challenges of AI and ML adoption, as well as his unique career journey. They also discuss the evolution of technology and the potential future impact of these innovations.Join us for a deep dive into the world of AI, technology, and the future of machine learning with Chris McDermott on this episode of Data Driven.Show Notes00:00 Exploring AI, data science, and data engineering.06:20 Training and inferring are different stages.08:12 Legacy AI doesn't require neural networks or GPUs.12:09 Machine learning models require consistent care and monitoring.15:10 MLOps merges skills, breaks down silos, collaborates.16:47 Prefer MLOps to avoid namespace collision. DevOps parallels original Star Wars plot.20:27 Internet-scale operations require automation and resilience.24:13 Challenges of integrating AI into business processes.28:03 New push for edge computing in technology industry.32:05 Edge technology critical, discussed in government tech symposium.34:50 Navigating from SendGrid to Twilio simplified processes.36:15 First foray into data, growing knowledge.39:33 Technology evolves, builds complexity over time.44:41 Book recommendation: "Seeing Like a State" by James C. Scott discusses legibility and centralization of power in society.46:28 Predictable tree farming fails due to ecosystem complexity.Speaker BioChris McDermott is a software engineer and entrepreneur who is passionate about creating products that make machine learning more accessible and manageable for users. His focus is on developing a platform that allows for easy deployment and management of machine learning models using any framework and on any architecture or hardware. He believes that current solutions in the market force users into a specific platform, and he aims to provide a more flexible and efficient alternative. With a strong belief in the potential of his product, Chris is dedicated to making machine learning more accessible and user-friendly for people across various industries.

Nov 29, 2023 • 42min
Advanced Fraud Prevention in the Age of Artificial Intelligence
Learn about the challenges of verifying identities remotely, the rise of deep fakes for fraud, and the use of AI to combat threats. Dive into the impact of technology on security measures and Pavel's journey in the field of AI. Discover insights into fraud detection, technology, and security in this fascinating episode.

Nov 28, 2023 • 2h 4min
Diving into Re:Invent 2023: Open Sourcing Dingo and Being in the Top 2.5 Percent
In this jam-packed episode, hosts Frank and Andy delve into a wide range of topics, from the chaos of podcast scheduling and the allure of Cyber Week deals, to the behind-the-scenes world of data engineering and AI professionals. Join us as we journey through the challenges of podcasting, the important roles of data engineers, and the potential open sourcing of Dingo, an innovative blogging automation tool. Along the way, the hosts share personal anecdotes, discuss legislative impacts, and even touch on cult-followed gas stations. You won't want to miss this delightful, informative, and always data-driven episode!Show Notes00:00 Glamorous world of podcasting and Microsoft Bookings.13:48 Privacy laws are spreading globally, impacting data sovereignty.27:14 Funny moment at Dunkin' Donuts sparks creativity.32:27 Importance of data engineering in AI projects.49:38 Struggling with hearing loss, amplifiers magnify all sounds.01:02:45 Emotions on camera, times sidetrack, sarcastic leadership.01:07:32 Excited to hang out at the mall.01:21:04 Considering discontinuing blog after reaching 100 posts.01:25:18 Wants to shift focus to new projects.01:37:09 Transition from long-form to short-form content.01:49:50 Drove up to Jersey for Christmas, reminisced.01:58:48 Concerns about coastal development and zoning enforcement.Links01:02:45 Here's an example of early FWTV where I am at the mall and not happy about it: https://www.youtube.com/watch?v=f8S7ha9fZWo

Nov 22, 2023 • 1h 5min
OpenAI Drama, Open Source, and Andy's New Venture
In this episode, your hosts Andy Leonard and Frank La Vigne dive headfirst into the world of open source, decision making, and the unfolding drama surrounding OpenAI. From sarcastic responses to holographic displays, we've got it all covered! Join us as we discuss the potential consequences of dependencies, community protests leading to change, and the recent issues with OpenAI. We'll also explore the importance of open source in AI and share some intriguing insights on Sam Altman's return to the company. With a sprinkle of tech industry gossip and even a potential Netflix adaptation, this episode is a must-listen. So sit back, relax, and get ready to be data driven!Show Notes02:42 OpenAI, Thanksgiving break, intense year, household name.10:35 3-day conference with nightly events, pre-conference presentations.14:09 NVIDIA, OpenAI, Elon Musk, open source.21:07 "Doubts arise about OpenAI's dependence and transparency."24:55 Regulations and transparency warranted for research.29:57 OpenAI lacked options to protest, unlike Node.36:52 Teams invite, alternative to costly Calendly.42:04 Product shelved, lack of promotion, open source alternatives.44:06 Insufficient hardware led to new AI venture.48:55 Artists use online art to fight scraping.55:37 Costs exceeded expectations, customers pulling back, database snapshot unavailable.01:03:42 Happy Thanksgiving from the Data Driven Podcast.

Nov 15, 2023 • 6min
Two Hosts on Two Coasts
Andy is speaking at PASS Summit in Seattle and Frank is speaking at the Red Hat Government Symposium in Washtington, DC.Two hosts. Two Coasts. One Podcast!

Nov 6, 2023 • 54min
Brennan Lamey on Entrepreneurship & Data Engineering in the Web 3 Era
Welcome back to another exciting episode of Data Driven! In this show, we delve into the fascinating world of Web 3 and decentralized databases. Join us as we explore the insights and experiences of our guest, Brennan Lamey, the founder of Kwil - a revolutionary company that builds decentralized databases for Web 3 applications.Throughout this episode, Brennan shares his journey and the inspiration behind Kwil, as well as the cutting-edge technology that powers their database solutions. From complex access control rules to collaboration between competitors, we uncover how Kwil is transforming the way companies approach data storage, privacy, and sharing.But it's not just about the technology - we also dive into Brennan's personal story, from their humble beginnings in Idaho to their entrepreneurial success and passion for data engineering. Plus, don't miss their recommendations for AI programming and an intriguing sci-fi audiobook they're currently enthralled by.So, whether you're a tech enthusiast, a data-driven professional, or simply curious about the future of the internet, this episode is a must-listen. Tune in as we unravel the intricacies of Web 3, decentralized databases, and the exciting possibilities they hold for a better, fairer online world. Let's get started on this illuminating journey with Brennan Lamey and Kwil in this data-driven episode of Data Driven!

Nov 5, 2023 • 49sec
BAILeY Celebrates Guy Fawkes Night
BAILeY recites the V laden introductory speed from V for Vendetta.Just for fun.TranscriptVoilà!In view, a humble vaudevillian veteran, cast vicariously as both victim and villain by the vicissitudes of Fate. This visage, no mere veneer of vanity, is a vestige of the vox populi, now vacant, vanished. However, this valorous visitation of a by-gone vexation stands vivified, and has vowed to vanquish these venal and virulent vermin vanguarding vice and vouchsafing the violently vicious and voracious violation of volition.The only verdict is vengeance; a vendetta, held as a votive, not in vain, for the value and veracity of such shall one day vindicate the vigilant and the virtuous.Verily, this vichyssoise of verbiage veers most verbose, so let me simply add that it is my very good honor to meet you and you may call me V.