AI-powered
podcast player
Listen to all your favourite podcasts with AI-powered features
MLOps Coffee Sessions #110 with David Bayliss, Chief Data Scientist of LexisNexis Risk Solutions, Just Fetch the Data and then... co-hosted by Vishnu Rachakonda.
// Abstract
Composing data to extract features can be a significant problem. Key factors are the data size, compliance restrictions, and real-time data. Ethics (and law) can drive extremely complex audit requirements. In the cloud, you can do anything - at a price.
// Bio
One of the creators of the world's first big data platform (HPCC); David has been tackling big data problems for two decades. A mathematician, compiler writer, and data sponge with more than five dozen patents spanning platforms linking, and search.
Most inventors think outside the box; David can't even remember where the box is. He leads the team that creates their core Data Science methods used by hundreds of data scientists.
// MLOps Jobs board
https://mlops.pallet.xyz/jobs
MLOps Swag/Merch
https://mlops-community.myshopify.com/
// Related Links
Interesting insight in this post. Would be cool to learn from David about his view on things
https://www.google.com/url?q=https://www.linkedin.com/posts/david-bayliss-426556a_datascience-platform-portability-activity-6913448643303759872-2dqq?utm_source%3Dlinkedin_share%26utm_medium%3Dmember_desktop_web&sa=D&source=calendar&ust=1649078059106132&usg=AOvVaw26wAevExeEfW_AdZSA8UhF
--------------- ✌️Connect With Us ✌️ -------------
Join our slack community: https://go.mlops.community/slack
Follow us on Twitter: @mlopscommunity
Sign up for the next meetup: https://go.mlops.community/register
Catch all episodes, blogs, newsletters, and more: https://mlops.community/
Connect with Demetrios on LinkedIn: https://www.linkedin.com/in/dpbrinkm/
Connect with Vishnu on LinkedIn: https://www.linkedin.com/in/vrachakonda/
Connect with David on LinkedIn: https://www.linkedin.com/in/david-bayliss-426556a/
Timestamps:
[00:00] Introduction to David Bayliss
[01:03] Takeaways
[04:56] LexisNexis and David's role
[07:15] Evolution of LexisNexis in 20 years with so many use cases
[08:51] Role of David in structuring data for working with data change
[14:32] Data management and data access
[17:45] Unique challenges of scale, use case, and diversity at LexisNexis
[24:47] Tardis Iron Box
[30:05] Iron Box translation
[32:56] JVM for data science
[34:24] Iron Box meaning
[36:52] Metadata with PII
[39:08] Detrimental privacy / Hairy Kneecap Theory
[40:57] Speeding things up and Anonymized linking
[46:47] What kept David working at LexisNexis?
[50:30] Wrap up
Listen to all your favourite podcasts with AI-powered features
Listen to the best highlights from the podcasts you love and dive into the full episode
Hear something you like? Tap your headphones to save it with AI-generated key takeaways
Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more
Listen to all your favourite podcasts with AI-powered features
Listen to the best highlights from the podcasts you love and dive into the full episode