DataNation - Podcast for Data Engineers, Analysts and Scientists cover image

51 – Open Data Standards (Apache Iceberg, Apache Parquet, Apache Arrow, Apache Ibis, Apach Substrait)

DataNation - Podcast for Data Engineers, Analysts and Scientists

00:00

Introduction

This chapter discusses the benefits of open data standards like Apache Arrow and Apache Iceberg in the data space by streamlining data serialization and deserialization processes, enhancing speed, and lowering costs. It specifically emphasizes the performance advantages of Apache Arrow's optimized columnar in-memory format when loading data from sources such as Parquet files.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app