Database School

The database for all your AI needs

Sep 16, 2025
Marcel Kornacker, the engineer behind Apache Impala and co-creator of Apache Parquet, discusses his current project, Pixeltable, a multimodal AI database that combines structured and unstructured data with Python workflows. He covers the evolution of SQL, the challenges of managing computer vision data, and how Pixeltable handles transformations and transactional guarantees, with examples that include processing video and audio.
ANECDOTE

Impala's Performance-First Origin

  • Marcel describes building Impala in C++ with LLVM codegen because JVM-based stacks were too slow for analytics.
  • That performance-first choice proved crucial as analytics workloads demanded speed.
INSIGHT

Data Generation Is The Core AI Problem

  • Multimodal AI work centers on data generation and pipeline orchestration, not just SQL queries.
  • Marcel Kornacker argues a tabular model with computed columns can capture transformations and asset semantics transactionally.
ADVICE

Use A Database To Eliminate Data Plumbing

  • Treat Pixeltable as a transactional, ACID-compliant tabular database that natively supports multimodal types and Python workflows.
  • Use it to avoid glue code by embedding transformations as computed columns and iterators.
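
The computed-column idea in this snip can be illustrated with a small sketch. The `ToyTable` class below is hypothetical plain Python written for illustration only; it is not Pixeltable's API, and it leaves out persistence, transactions, and multimodal types. It shows only the core notion that a transformation is declared once on the table and then applied to existing and future rows, rather than re-run by separate glue code.

```python
from typing import Any, Callable

Row = dict[str, Any]


class ToyTable:
    """Toy in-memory table with computed columns (illustrative, not Pixeltable)."""

    def __init__(self, columns: list[str]) -> None:
        self.columns = list(columns)                      # stored (input) columns
        self.computed: dict[str, Callable[[Row], Any]] = {}  # derived columns
        self.rows: list[Row] = []

    def add_computed_column(self, name: str, fn: Callable[[Row], Any]) -> None:
        """Register a transformation and backfill it for rows already stored."""
        self.computed[name] = fn
        for row in self.rows:
            row[name] = fn(row)

    def insert(self, **values: Any) -> None:
        """Store a row and evaluate every registered computed column for it."""
        missing = set(self.columns) - set(values)
        if missing:
            raise ValueError(f"missing columns: {sorted(missing)}")
        row: Row = dict(values)
        for name, fn in self.computed.items():
            row[name] = fn(row)
        self.rows.append(row)


# Hypothetical example data: file paths and durations are made up.
table = ToyTable(["path", "duration_s"])
table.insert(path="a.mp4", duration_s=90)

# Declared once; applies to the existing row and to later inserts.
table.add_computed_column("duration_min", lambda r: r["duration_s"] / 60)
table.insert(path="b.mp4", duration_s=30)
```

In this sketch the second insert picks up `duration_min` without any extra call, which is the "no glue code" property the snip describes, under the simplifying assumptions stated above.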