
Small Data, Big Impact: The Story Behind DuckDB // Hannes Mühleisen & Jordan Tigani // #202

MLOps.community

NOTE

Rethinking the Notion of Big Data in Data Engineering

The prevailing notion that real data engineers work on huge systems is being challenged. The 'big data is dead' argument aims to validate the experience of data engineers who work with small data sets. Work on BigQuery revealed that even large customers and well-known companies mostly query small, summarized, cleaned-up data sets. The majority of queries were sub-terabyte, and 90% were under a hundred megabytes. These numbers suggest that most people in the real world don't deal with big data as traditionally perceived.
