The Data Stack Show cover image

The Data Stack Show

41: Doing MLOps on Top of Apache Pulsar and Trino with Joshua Odmark of Pandio

Jun 23, 2021
50:20

Highlights from this week’s episode:

  • Joshua started his first company at age 15 and then sold two more startups after that (2:15)
  • Embracing the open source movement and not reinventing the wheel if you don't have to (12:15)
  • Pulsar seemed built to address Kafka's weaknesses (17:23)
  • Using Redis as a coordinator for federated learning and taking advantage of its portability (23:05)
  • The pillars of Pandio and some practical use cases (31:24)
  • Feature stores and model versioning (38:23)
  • Seeing Pulsar as the future because of the ability to run tens of millions of topics (41:04)

The Data Stack Show is a weekly podcast powered by RudderStack. Each week we’ll talk to data engineers, analysts, and data scientists about their experience around building and maintaining data infrastructure, delivering data and data products, and driving better outcomes across their businesses with data.

RudderStack helps businesses make the most out of their customer data while ensuring data privacy and security. To learn more about RudderStack visit rudderstack.com.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode