The Data Stack Show

175: The Parts, Pieces, and Future of Composable Data Systems, Featuring Wes McKinney, Pedro Pedreira, Chris Riccomini, and Ryan Blue

17 snips
Jan 31, 2024
Data systems experts Wes McKinney, Pedro Pedreira, Chris Riccomini, and Ryan Blue discuss the concept of composable data systems, the challenges and incentives for composable components, specialization and modularity in data workloads, and the efficiency and common layers in data management systems. They also explore the evolution of data system composability, exciting new projects in data systems, and the challenges of standardizing APIs.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
INSIGHT

Push Policy Into Storage Layer

  • Storage-layer standards like table formats must carry policy and metadata to make multi-engine use safe.
  • Moving access controls and catalog decisions toward storage helps keep policies coherent across engines.
INSIGHT

Schema Friction Breaks Systems

  • Data model and schema description span from runtime to file formats and cause much coercion.
  • A shared, expressive schema standard would reduce repeated type conversions across systems.
INSIGHT

Type Systems Force Trade-offs

  • Different type-system goals cause fragmentation: expressive in-memory types versus minimal portable storage types.
  • Projects must balance implementability versus expressiveness when standardizing types.
Get the Snipd Podcast app to discover more snips from this episode
Get the app