
The Data Stack Show
179: Time Series Data Management and Data Modeling with Tony Wang of Stanford University
Feb 28, 2024
Stanford University PhD student, Tony Wang, discusses his research focus on time series data management. Topics include challenges in academia and industry, academic lab structure, decision to move from hardware to data research, data modeling in time series, issues and potential solutions for parquet format, and the role of external indices in parquet files.
50:42
Episode guests
AI Summary
AI Chapters
Episode notes
Podcast summary created with Snipd AI
Quick takeaways
- Transition from hardware to data research emphasized practical applications over theoretical research.
- Innovative approach to time series data management proposed solutions to enhance data indexing and storage efficiency.
Deep dives
Tony Wang's Background and PhD Research Focus
Tony Wang, a PhD student at Stanford University, discusses his background and research focus on data processing systems. He delves into his transition from hardware to data systems research, emphasizing the importance of practical applications over theoretical research. Wang's work centers on optimizing data processing in data lakes like Apache Iceberg and Delta Lake to efficiently analyze large-scale data.
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.