Data Engineering Podcast

Tobias Macey

This show goes behind the scenes for the tools, techniques, and difficulties associated with the discipline of data engineering. Databases, workflows, automation, and data manipulation are just some of the topics that you will find here.

Episodes

Mentioned books

Sep 25, 2023 • 59min

Powering Vector Search With Real Time And Incremental Vector Indexes

This podcast discusses the growth of machine learning and the need for vector search capabilities. They explore the challenges of real-time indexes, the benefits of semantic search, and incorporating vector search into data flows. They also cover the considerations and limitations of vector search and share insights on working with vector databases.

103 snips

Sep 17, 2023 • 1h 2min

Building Linked Data Products With JSON-LD

In this podcast, Brian Platz discusses the concept and implications of linked data, the benefits of using JSON-LD for building semantic data products, the challenges faced in building linked data products, and the need for improved data management tools.

25 snips

Sep 10, 2023 • 1h 1min

An Overview Of The State Of Data Orchestration In An Increasingly Complex Data Ecosystem

Nick Schrock, creator of Dagster, discusses the state of data orchestration technology and its application. They explore the challenges and benefits of orchestrators, the balance between information and infrastructure, and the capabilities and challenges of data orchestration. They also discuss low code and no code solutions in data work, their integration into software engineering, and the role of data orchestration in ML workflows.

15 snips

Sep 4, 2023 • 42min

Eliminate The Overhead In Your Data Integration With The Open Source dlt Library

The podcast explores the dlt project, an open source Python library for data loading. It discusses the challenges in data integration, the benefits of dlt over other tools, and how to start building pipelines. Other topics include the journey of becoming a data engineer, performance considerations of using Python, collaboration in data integration, and integration with different runtimes. The hosts emphasize the need for better education in data management and practical solutions.

Aug 28, 2023 • 1h 1min

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!

Get the app

Data Engineering Podcast

Episodes

Mentioned books

Powering Vector Search With Real Time And Incremental Vector Indexes

Building Linked Data Products With JSON-LD

An Overview Of The State Of Data Orchestration In An Increasingly Complex Data Ecosystem

Eliminate The Overhead In Your Data Integration With The Open Source dlt Library

Building An Internal Database As A Service Platform At Cloudflare

Harnessing Generative AI For Creating Educational Content With Illumidesk

Unpacking The Seven Principles Of Modern Data Pipelines

Quantifying The Return On Investment For Your Data Team

Strategies For A Successful Data Platform Migration

Build Real Time Applications With Operational Simplicity Using Dozer

The AI-powered Podcast Player