
Contributor
The origin story behind the best open source projects and communities.
Latest episodes

Aug 26, 2020 • 38min
Presto with Martin Traverso, Dain Sundstrom and David Phillips
Eric Anderson (@ericmander) talks to Martin Traverso (@mtraverso), Dain Sundstrom (@daindumb) and David Phillips (@electrum32) about their collaboration on Presto, an open-source distributed SQL query engine for big data. The three engineers worked together at three different companies before deciding to solve an efficiency problem for data analytics at Facebook in 2012. Listen to today’s episode to learn about the careful planning and technical philosophy behind the development and design of Presto.
In this episode we discuss:
Starting an open-source project at Facebook in the early 2010s
The importance of making Presto “dirt simple to install”
What is “documentation driven development”
Bootstrapping the growth of an open-source community
How a single query caused a brownout across Facebook infrastructure
Related Links:
Presto
Starburst
Ning
Netezza
ProofPoint
Hadoop
Postgres
Hive
OpenCompute
@Scale
Arm Treasure Data
Qubole
People mentioned:
Jay Parikh (@jayparikh)

Aug 12, 2020 • 28min
Xanadu with Nathan Killoran
Nathan Killoran (@co9olguy) guides Eric Anderson (@ericmander) through the cutting-edge world of quantum machine learning at Xanadu, a quantum computing company that is innovating with its use of photonics. Nathan is Xanadu’s Head of Software, Algorithms, & Quantum Machine Learning, and has detailed insight on their main open-source software projects, StrawberryFields and PennyLane. On today’s episode, Nathan explains how the barrier to contributing may be lower than you think, even if you don’t have a PhD in quantum physics.
In this episode we discuss:
Designing software for Xanadu’s unique approach to quantum computing
Machine learning, differentiable programming and more in the quantum domain
How even high school students can contribute to an open-source quantum computing project
Is there a road map for quantum machine learning?
Nathan’s “blue sky” interview questions
Links:
Xanadu
StrawberryFields
PennyLane
ProjectQ
TensorFlow Quantum
PyTorch
Qiskit
Pyquil
Cirq
Alpine Quantum Technologies
Quantum Open Source Foundation
Unitary Fund
People mentioned:
Christian Weedbrook, CEO of Xanadu (@_cweedbrook)

Jul 29, 2020 • 42min
Clickhouse with Alexey Milovidov and Ivan Blinkov
Eric Anderson (@ericmander) talks to Alexey Milovidov (@alexey-milovidov) and Ivan Blinkov (@blinkov) about their work on Clickhouse, an open source analytical database from the team at Yandex. Originally designed to support Yandex.Metrica, word of this powerful tool spread rapidly inside the company, and the idea was hatched to make Clickhouse into a truly open source project. Tune in to learn about how Alexey petitioned management to accept what initially seemed like a “crazy” idea - and how the risk paid off.
In this episode we discuss:
Differences between Clickhouse and similar products
Why some open source projects are more successful than others
The history of open source at Yandex
What makes a good open source developer
Building an international community
Links:
Clickhouse
Yandex.Metrica
Altinity
Postgres
Oracle
Infobright
InfinityDB
MongoDB
Vertica
Dremel: Interactive Analysis of Web-Scale Datasets (2010)
CatBoost
BEM
Presto
Druid
Greenplum
Apache Spark

Jul 15, 2020 • 32min
LinkerD with William Morgan
Eric Anderson (@ericmander) chats with William Morgan (@wm), CEO of Buoyant and a creator of the open source service mesh, LinkerD. As a former infrastructure engineer at Twitter, William leveraged his experience there to help develop what would become effectively the first service mesh. Listen to today’s episode to find out how the team at Buoyant originally coined the term, and are continuing to define the concept today.
In this episode we discuss:
Pioneering the very first service mesh
Why Buoyant rejected the open core model
How the industry is shifting away from the “nights and weekends” community
Rewriting LinkerD from scratch
Links:
LinkerD
Buoyant
Dive
Kubernetes
Docker
Finagle
HAProxy
NGINX
CNCF
Prometheus
Cisco Webex
Istio

Jul 1, 2020 • 41min
Chef with Adam Jacob
Full show notes and transcript.
Eric Anderson (@ericmander) welcomes Chef co-founder Adam Jacob (@adamhjk) to talk about the popular open source service. He and co-founder Nathan Haneysmith originally started the company as a way to sell automation services to startups, but wanted to expand their abilities to serve more clients. From naming the company to governance and engaging with contributors, Adam dives into why it was important to him to go the open source route and how the business model works.
In this episode we discuss:
How Chef got started
The decision to be open source
What the business model looks like
Contributors and community members
Where Chef is today and where it’s headed
Links
Chef
Puppet
The Apache Software Foundation
Docker
Perl

Jul 1, 2020 • 31min
Istio with Sven Mawson
Full show notes and transcript.
Eric Anderson (@ericmander) and Sven Mawson (@smawson) dive into the past, present and future of Istio, an open source service mesh born of collaboration between IBM and Google. Sven is a Senior Staff Engineer at Google and co-founder of the Istio project. In today’s episode, he shares the story of how two titans came together for a tool that anyone can use and contribute to.
In this episode we discuss:
How Google asked IBM to drop their Amalgam8 project
The involvement of Lyft, Envoy and Matt Klein (@mattklein123)
Making moves at QCon
A counter-intuitive marketing strategy
What work still needs to be done
Links
Istio
Google Cloud Endpoints
Kubernetes
Envoy
QCon
NGinX

Nov 14, 2019 • 39min
Envoy Proxy with Matt Klein
Full show notes and transcript.
Eric Anderson (@ericmander) and Matt Klein (@mattklein123) discuss the beginnings of Envoy Proxy, an open source proxy now governed by the CNCF. Matt is a software engineer at Lyft and creator of the Envoy. On today’s episode, Matt gives the inside scoop on the benefits and challenges of cultivating a self-sustaining open source community.
In this episode we discuss:
How Matt’s experience at Twitter informed development of Envoy
Working with Google
The role of marketing in Envoy’s success
Why building an open source community is like “total controlled anarchy”
Finding the right contributors and maintainers
Links:
Envoy Proxy
Finagle
Hystrix
NginX
HA Proxy
Istio
CNCF

Nov 1, 2019 • 33min
Alluxio with Haoyuan Li
Full show notes and transcript.
Eric Anderson (@ericmander) hosts Haoyuan Li (@haoyuan), also known as H.Y., creator of Spark Streaming as well as the open source data orchestration system, Alluxio. H.Y. founded Alluxio, Inc. to further develop the research project that he first created as a doctoral student at UC Berkeley’s AMPLab. Listen to today’s episode to learn more about how H.Y. identified an opportunity to disrupt cloud storage with an open source project as his Ph.D. thesis.
In this episode we discuss:
H.Y.’s analysis of the data storage industry’s cyclical history
How H.Y. balanced academics with the Alluxio community
The 3 types of Alluxio contributors
Use cases for Alluxio
Links:
Alluxio
Spark Streaming
Kubernetes
Presto
TensorFlow
AMPLab

Jul 4, 2019 • 45sec
Contributor Trailer
Learn about Contributor, a podcast about the best open source projects and the communities that build them.