Open||Source||Data

Latest episodes

Feb 16, 2022 • 37min

Trust, Automation, and Trade-Offs with Joseph Jacks

This episode features an interview with Joseph Jacks, Founder and General Partner of OSS Capital. OSS Capital is the first and only COSS (Commercial Open Source Software) company investor that focuses on supporting early-stage COSS founders. Joseph, also known as JJ, has worked at Mesosphere, TIBCO Software, and Talend in various sales, engineering, and strategy roles. In this episode, JJ and Sam weigh the trade-offs of open and closed core companies and discuss how each can go public. JJ also dives into the misconception that trust equates to privacy within tech.

Guest Quote [25:14]: “There’s a societal recognition that if you use technology to automate some part of your life and you use that regularly, you have to be able to trust it. And I think gradually, consumers are becoming more and more aware that one of the most effective ways of checking the trust box is answering the question, ‘Is the technology I'm using open source at the core, yes or no?’ And if the answer is no, I think it's very difficult and a lot harder to achieve the levels of trust that you can if the answer is yes.” – Joseph Jacks

Time Stamps
[12:59]: The difference between open and closed core companies
[17:23]: Understanding the trade-off between open and closed source
[18:23]: Trends within open source data companies
[20:21]: Is it possible to go public as a closed source database?
[22:35]: Leveraging the automation opportunity of open source systems
[23:47]: How can consumers trust the technology they’re using?
[34:01]: Advice for those starting open source projects

Links
LinkedIn - Connect with JJ
LinkedIn - Connect with OSS Capital
Twitter - Follow OSS Capital
Visit OSS Capital

See omnystudio.com/listener for privacy information.
Feb 2, 2022 • 47min

Open Source, Adoptability, and Name Changes with Martin Traverso

This episode features an interview with Martin Traverso, CTO at Starburst Data and Co-founder of Trino, a lightning-fast distributed SQL query engine. Martin was previously a software engineer at Facebook, where he led the Presto (now Trino) development team. Trino has gained worldwide adoption from companies like Netflix, Amazon, and LinkedIn. In this episode, Martin sits down with Sam to discuss the barriers, advantages, and complications of going open-source.

Guest Quote [33:55]: “What makes Trino powerful is the ecosystem around it. You have integrations with all sorts of data sources and that’s part of the power and magic of Trino. You can pull data from all these data sources using a single interface. On the other end is the integrations with all the tools that everyone uses. Once you put all those pieces together, that’s what gives Trino the power.”

Time Stamps
[8:38]: How Martin solved Facebook’s analytics problem
[13:00]: How the team adapted to customers’ needs
[17:07]: What makes Trino stand out among other query engines
[19:42]: Going open-source changes the game
[30:14]: Presto becomes Trino
[33:24]: What gives Trino its magic
[35:19]: What Trino’s community looks like today
[38:34]: Advice for those starting open-source projects

Links
Blog - Intro to Trino for the Trinewbie
Trino Community Broadcast - Subscribe
GitHub Trino repository - Give Trino a star
LinkedIn - Connect with Martin
Trino Meetup - Join
Play with Trino
Rebrand from Presto to Trino - Learn More
Slack - Join Trino
Trino: The Definitive Guide (Download a free copy)
Twitter - Follow Martin
Twitter - Follow Trino
Oct 29, 2021 • 24min

Season Two Finale and Recap with Open||Source||Data Producer Audra Montenegro

Join Open||Source||Data producer Audra Montenegro as she and Sam cover highlights and takeaways from the ten episodes of season two. And get a sneak peek of what's in store for season three!
Oct 14, 2021 • 31min

Embeddings, Feature Stores, and MLOps with Simba Khadder

Join Simba Khadder, CEO of Featureform, as he talks with Sam about how versioning, immutability, and sharing will accelerate ML workflows. Tune in for a look at state-of-the-art collaboration in data teams and the power of focusing on your north star.
Sep 30, 2021 • 29min

Abundance, Metadata, and Automation with Mark Grover

How can we make data 10X more accessible for data-driven people within data-driven companies? Tune in as Mark and Sam discuss probabilistic product management and the emerging metadata ecosystem.
Sep 16, 2021 • 36min

Metadata, Communities, and Architecture with Shirshanka Das

How can we evolve an expanding ecosystem of data technologies while making sense of the whole? Tune in as Shirshanka Das, founder of LinkedIn DataHub and Acryl Data, and Sam discuss keeping metadata at the center and specialization at the edge to sustainably scale data governance.
Sep 2, 2021 • 59min

Data Management Pain Points and Future Solutions for Data Discovery

Data discovery is one of the hardest problems to solve in data management and comes up as a major pain point in most data mesh discussions. Tune in to this all-star expert panel, recorded in collaboration with the Data Mesh community and hosted by a previous Open||Source||Data podcast guest, Paco Nathan of Derwen.ai. Paco engages panelists Shinji Kim (Select Star), Sophie Watson (Red Hat), Mark Grover (Stemma), and Shirshanka Das (Acryl Data) in a 60-minute discussion covering not only Data Mesh but also other data strategies and process needs for the future of data discovery.
Aug 19, 2021 • 27min

ModelOps, ML Monitoring, and Busy Humans with Elena Samuylova

It’s 2 AM - do you know what your models are doing? Listen to Elena Samuylova as she talks to us about how to bridge the critical gaps between data scientists, engineers, and business managers using tooling and empathy.
Aug 5, 2021 • 36min

Cloud-Native, Open-Source, and Collaborative with Eric Brewer and Melody Meckfessel

Google Fellow & VP of Infrastructure Eric Brewer, Observable CEO Melody Meckfessel, and DataStax Chief Strategy Officer Sam Ramji explore the state of the art, the near future, and grand challenges for the next decade in cloud-native data.
Jul 22, 2021 • 32min

MLOps, AIOps, and Data Startups with Jocelyn Goldfein

This episode covers dealing with data hyperabundance, solving economic problems for businesses, and changing lives for the better. Tune in as Jocelyn Goldfein, Managing Director at Zetta Venture Partners, and Sam discuss engineering leadership, organizational graph structures, and the productization of AI.