What's New In Data cover image

Scaling Databases in the AI Era: Insights from Andy Pavlo (Carnegie Mellon University)

What's New In Data

00:00

Evolving Data Formats and Their Challenges

This chapter explores the evolution of data formats like Parquet and ORC, addressing the shift from disk speed limitations to modern CPU bottlenecks. It emphasizes the need for extensibility and portability in data file specifications and the significance of clean data in AI applications. Additionally, the discussion highlights the complexities of database management and the integration of natural language processing in analytics, while cautioning against complacency with new technological claims.

Transcript
Play full episode

The AI-powered Podcast Player

Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
App store bannerPlay store banner
Get the app