

The database for all your AI needs
13 snips Sep 16, 2025
Marcel Kornacker, the mastermind behind Apache Impala and co-creator of Apache Parquet, shares insights into his innovative project, Pixeltable. This multimodal AI database adeptly blends structured and unstructured data with seamless Python workflows. He discusses the evolution of SQL, the challenges in computer vision data management, and how Pixeltable tackles transformation and transactional guarantees. Listen for fascinating examples like processing video and audio, proving how this tool can revolutionize AI data handling.
AI Snips
Chapters
Transcript
Episode notes
Impala's Performance-First Origin
- Marcel describes building Impala in C++ with LLVM codegen because JVM-based stacks were too slow for analytics.
- That performance-first choice proved crucial as analytics workloads demanded speed.
Data Generation Is The Core AI Problem
- Multimodal AI work centers on data generation and pipeline orchestration, not just SQL queries.
- Marcel Kornacker argues a tabular model with computed columns can capture transformations and asset semantics transactionally.
Use A Database To Eliminate Data Plumbing
- Treat PixelTable as a transactional, ACID tabular database that natively supports multimodal types and Python workflows.
- Use it to avoid glue-code by embedding transformations as computed columns and iterators.