

Open Source Startup Podcast
Robby (MTF); Tim (Essence VC)
The leading podcast on how to build a successful open source company.
Learn from the founders of HashiCorp, Chronosphere, Vercel, MongoDB, DBT, mobile.dev and more!
Learn from the founders of HashiCorp, Chronosphere, Vercel, MongoDB, DBT, mobile.dev and more!
Episodes
Mentioned books

4 snips
Apr 17, 2023 • 38min
E82: Creating Apache Iceberg & Headless Data Warehouse Tabular
Ryan Blue is Co-Founder of data automation platform Tabular and Co-Creator of Apache Iceberg, the open source high-performance format for huge analytic tables.
Tabular most recently raised a Series A from a16z.
In this episode, we discuss the concept of a "headless data warehouse", being a problem-centric rather than solution-centric founder & more!

Apr 10, 2023 • 48min
E81: Open Source DataOps with Meltano
Douwe Maan is Founder & CEO of DataOps platform Meltano, the extract and load company behind the open source CLI & version control project meltano.
Meltano has raised $12M from investors including Venrock & Google Ventures.
In this episode, we dig into spinning a company out of GitLab, Meltano's cloud launch, making technical data engineers first-class citizens & more!

Apr 3, 2023 • 38min
E80: Securing Kubernetes With ARMO & Kubescape
Shauli Rozen is Founder & CEO of ARMO, the company behind Kubernetes open source security platform kubescape. The project has over 8K stars on GitHub and includes tools for risk analysis, security, compliance, and misconfiguration scanning.
ARMO has raised $35M from investors including Tiger Global and Pitango VC.
In this episode, we dig into the differences in building product for DevOps vs. security teams, how to use signals from discord / slack channels to drive product roadmap, bringing on a VP of Open Source & more!

Mar 27, 2023 • 40min
E79: Spin Up Production-Like Dev Environments With Okteto
Ramiro Berrelleza is Founder & CEO of Okteto, the Kubernetes development platform that allows developers to spin up production-like dev environments in the cloud. Okteto's open source project, also called Okteto, allows users to spin up a development container, which is configured like the user's production Kubernetes deployment. Today, it has 2.8K start on GitHub.
Okteto has raised $18M from investors including Root VC and Two Sigma.
In this episode, we discuss the challenges of building with kubernetes, figuring out market timing, how to position for your specific users & more!

Mar 20, 2023 • 42min
E78: The Fastest Path From Data To Insight With Starburst
Justin Borgman is CEO of Starburst, the “Analytics Everywhere” company based on the sequel query engine Trino (previously called Presto). Trino is a distributed SQL query engine for big data and is used by companies such as Salesforce, Robinhood, Lyft, LinkedIn, Goldman Sachs, and Netflix. Trino currently has 7.5K GitHub Stars.
Starburst has raised over $400M from investors including Index, Coatue, A16z, and Alkeon.
In this episode, we dig into the Presto to Trino transition, recruiting the Trino founders to Starburst, waiting to raise venture capital until there are strong signs of PMF, what PMF looks like (ie. multiple Fortune 500 users), getting competition to compete on your turf, and more!

Mar 15, 2023 • 41min
E77: Simplify Your ML Infrastructure With Aqueduct
Vikram Sreekanti & Joey Gonzalez are Co-Founders of Aqueduct, the open-source orchestration layer for machine learning infrastructure. Aqueduct's open source project, also called aqueduct, has over 400 stars on GitHub.
In this episode, we discuss what Vikram & Joey learned from interviewing 100s of data teams, building in the competitive MLOps space, how and why they invest in content & much more!

6 snips
Mar 7, 2023 • 38min
E76: How Cleanlab Can Help GPT-3, Bard, and Claude with Data Quality
Curtis Northcutt is Co-Founder & CEO of Cleanlab, the company that helps AI & ML teams automatically find and fix errors in their datasets. They have over 5K stars on GitHub and are already working with companies such as Wells Fargo and Google on ML data quality.
In this episode, we discuss the difference between data noise and model noise, the growing importance of ML data quality with the momentum around generative AI models and applications, how Curtis' focus as CEO has shifted over time & much more!

Feb 23, 2023 • 35min
E75: Payload, the React & TypeScript Headless CMS
James Mikrut is Founder of Payload CMS, the React & TypeScript headless CMS. Their open source project, payload, has over 9K stars on Github and provides a Headless CMS and Application Framework built with TypeScript, Node.js, React, and MongoDB.
Payload has raised over $5M from investors including Gradient Ventures and YC.
In this episode, we discuss Payload's early guerilla marketing tactics, listening to your community to inform your monetization model, what developer-first really means & more!

Feb 21, 2023 • 41min
E74: Dev-First Testing with AtomicJar & Testcontainers
Sergei Egorov is Co-Founder & CEO of AtomicJar, the developer-first testing platform built on top of open source testing framework Testcontainers. AtomicJar provides Testcontainers Cloud which allows users to run tests in the cloud with anything that can be containerized.
AtomicJar has raised almost $30M from investors including Insight Partners and Boldstart.
In this episode, we discuss user demand driving the creation of a company alongside an open source project, using a different name for the company to have the ability to work with other projects, learnings from early scaling & more!

Feb 16, 2023 • 38min
E73: Building Scalable Postgres with Serverless Database Platform Neon
Nikita Shamgunov is Co-Founder & CEO of Neon, the open-source serverless postgres database platform. Neon separates storage and compute to offer autoscaling, branching, and bottomless storage. Their open source project, also called Neon, has 6.5K stars on Github.
Neon has raised $30M from investors including GGV and Khosla.
In this episode, we dig into the Neon founding story of starting a scalable alternative to AWS Aurora, why it's important to separate storage and compute, Neon's partner strategy, Nikita's thoughts on the "DevCloud" movement & much more!