What's New In Data cover image

Scaling Databases in the AI Era: Insights from Andy Pavlo (Carnegie Mellon University)

What's New In Data

CHAPTER

Evolving Data Formats and Their Challenges

This chapter explores the evolution of data formats like Parquet and ORC, addressing the shift from disk speed limitations to modern CPU bottlenecks. It emphasizes the need for extensibility and portability in data file specifications and the significance of clean data in AI applications. Additionally, the discussion highlights the complexities of database management and the integration of natural language processing in analytics, while cautioning against complacency with new technological claims.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner