The Data Stack Show cover image

The Data Stack Show

179: Time Series Data Management and Data Modeling with Tony Wang of Stanford University

Feb 28, 2024
Stanford University PhD student, Tony Wang, discusses his research focus on time series data management. Topics include challenges in academia and industry, academic lab structure, decision to move from hardware to data research, data modeling in time series, issues and potential solutions for parquet format, and the role of external indices in parquet files.
50:42

Episode guests

Podcast summary created with Snipd AI

Quick takeaways

  • Transition from hardware to data research emphasized practical applications over theoretical research.
  • Innovative approach to time series data management proposed solutions to enhance data indexing and storage efficiency.

Deep dives

Tony Wang's Background and PhD Research Focus

Tony Wang, a PhD student at Stanford University, discusses his background and research focus on data processing systems. He delves into his transition from hardware to data systems research, emphasizing the importance of practical applications over theoretical research. Wang's work centers on optimizing data processing in data lakes like Apache Iceberg and Delta Lake to efficiently analyze large-scale data.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner