
Open||Source||Data
What can we learn from ai-native development through stimulating conversations with developers, regulators, academics and people like you that drive forward development, seek to understand impact, and are working to mitigate risk in this new world?
Join Charna Parkey and the community shaping the future of open source data, open source software, data in AI, and much more.
Latest episodes

Feb 16, 2022 • 37min
Trust, Automation, and Trade-Offs with Joseph Jacks
This episode features an interview with Joseph Jacks, Founder and General Partner of OSS Capital. OSS Capital is the first and only COSS (Commercial Open Source Software) company investor that focuses on supporting early-stage COSS founders. Joseph, also known as JJ, has worked at Mesosphere, TIBCO Software, and Talend in various sales, engineering, and strategy roles.
In this episode, JJ and Sam weigh the trade-offs of open and closed core companies and discuss how each can go public. JJ also dives into the misconception of trust equating privacy within tech.
Guest Quote
[25:14]: “There’s a societal recognition that if you use technology to automate some part of your life and you use that regularly, you have to be able to trust it. And I think gradually, consumers are becoming more and more aware that one of the most effective ways of checking the trust box is answering the question, ‘Is the technology I'm using open source at the core, yes or no?’ And if the answer is no, I think it's very difficult and a lot harder to achieve the levels of trust that you can if the answer is yes.” – Joseph Jacks
Time Stamps
[12:59]: The difference between open and closed core companies
[17:23]: Understanding the trade-off between open and closed source
[18:23]: Trends within open source data companies
[20:21]: Is it possible to go public as a closed source database?
[22:35]: Leveraging the automation opportunity of open source systems
[23:47]: How can consumers trust the technology they’re using?
[34:01]: Advice for those starting open source projects
Links
LinkedIn - Connect with JJ
LinkedIn - Connect with OSS Capital
Twitter - Follow OSS Capital
Visit OSS Capital
See omnystudio.com/listener for privacy information.

Feb 2, 2022 • 47min
Open Source, Adoptability, and Name Changes with Martin Traverso
This episode features an interview with Martin Traverso, CTO at Starburst Data and Co-founder of Trino, a lightning fast distributed SQL query engine. Martin was previously a software engineer at Facebook where he led the Presto (now Trino) development team. Trino has gained worldwide adoption from companies like Netflix, Amazon, and LinkedIn.
In this episode, Martin sits down with Sam to discuss the barriers, advantages, and complications of going open-source.
Episode Notes
-Guest Quote
[33:55]: “What makes Trino powerful is the ecosystem around it. You have integrations with all sorts of data sources and that’s part of the power and magic of Trino. You can pull data from all these data sources using a single interface. On the other end is the integrations with all the tools that everyone uses. Once you put all those pieces together, that’s what gives Trino the power.”
-Time Stamps
[8:38]: How Martin solved Facebook’s analytics problem
[13:00]: How the team adapted to customers’ needs
[17:07]: What makes Trino stand out among other query engines
[19:42]: Going open-source changes the game
[30:14]: Presto becomes Trino
[33:24]: What gives Trino its magic
[35:19]: What Trino’s community looks like today
[38:34]: Advice for those starting open-source projects
-Links
Blog - Intro to Trino for the Trinewbie
Trino Community Broadcast - Subscribe
GitHub Trino repository - Give Trino a star
LinkedIn - Connect with Martin
Trino Meetup - Join
Play with Trino
Rebrand from Presto to Trino - Learn More
Slack - Join Trino
Trino: The Definitive Guide (Download a free copy)
Twitter - Follow Martin
Twitter - Follow Trino
See omnystudio.com/listener for privacy information.

Oct 29, 2021 • 24min
Season Two Finale and Recap with Open||Source||Data Producer Audra Montenegro
Join Open||Source||Data producer Audra Montenegro as she and Sam cover highlights and takeaways from the ten episodes of season two. And get a sneak peak of what's in store for season three!See omnystudio.com/listener for privacy information.

Oct 14, 2021 • 31min
Embeddings, Feature stores, and MLOps with Simba Khadder
Join CEO of Featureform, Simba Khadder as he talks with Sam about how versioning, immutability, and sharing will accelerate ML workflows. Tune-in on state of the art collaboration in data teams, and the power of focusing on your north star.See omnystudio.com/listener for privacy information.

Sep 30, 2021 • 29min
Abundance, Metadata, and Automation with Mark Grover
How can we make data 10X more accessible for data-driven people within data-driven companies? Tune in to Mark and Sam discussing probabilistic product management, and the emerging metadata ecosystem.See omnystudio.com/listener for privacy information.

Sep 16, 2021 • 36min
Metadata, Communities, and Architecture with Shirshanka Das
How can we evolve an expanding ecosystem of data technologies while making sense of the whole? Tune in to LinkedIn DataHub, and Acryl Data founder, Shirshanka Das, as he and Sam have a discussion on metadata at the center and specialization at the edge to sustainably scale data governance.See omnystudio.com/listener for privacy information.

Sep 2, 2021 • 59min
Data Management Pain Points and Future Solutions for Data Discovery
Data discovery is one of the hardest problems to solve in data management in general and comes up as a major pain point in most data mesh discussions. Tune in to this all-star expert panel recorded in collaboration with the Data Mesh community, and hosted by a previous Open||Source||Data podcast guest, Paco Nathan of Derwen.ai. Paco engages panelists, Shinji Kim (Select Star), Sophie Watson (Red Hat), Mark Grover (Stemma), and Shirshanka Das (Acryl Data) in a 60-minute discussion on not only Data Mesh, but other data strategies and process needs for the data discovery future.See omnystudio.com/listener for privacy information.

Aug 19, 2021 • 27min
ModelOps, ML Monitoring, and Busy Humans with Elena Samuylova
It’s 2 AM - do you know what your models are doing? Listen to Elena Samuylova as she talks to us about how to bridge the critical gaps between data scientists, engineers, and business managers using tooling and empathy.See omnystudio.com/listener for privacy information.

Aug 5, 2021 • 36min
Cloud-Native, Open-Source, and Collaborative with Eric Brewer and Melody Meckfessel
Google Fellow & VP of Infrastructure Eric Brewer, Observable CEO Melody Meckfessel, and DataStax Chief Strategy Officer Sam Ramji explore the state of the art, the near future, and grand challenges for the next decade in cloud-native data.See omnystudio.com/listener for privacy information.

Jul 22, 2021 • 32min
MLOps, AIOps, and Data Startups with Jocelyn Goldfein
Dealing with data hyperabundance, solving economic problems for businesses and changing lives for the better. Tune-in to Managing Director at Zetta Venture Partners, Jocelyn Goldfein as she and Sam have a discussion around engineering leadership, organizational graph structures, and productization of AI.See omnystudio.com/listener for privacy information.