
How Apache Pinot Achieves 200,000 Queries per Second (with Tim Berglund)
Developer Voices
Optimizing Query Processing and Data Movement in a Server Cluster
The chapter explores the intricate process of server communication with controllers and brokers in handling metadata changes, emphasizing Zookeeper's role. It discusses query routing, segment distribution, query scatter, filtering, and reduction operations across servers, drawing parallels to map-reduce concepts. The conversation delves into single-stage vs. multi-stage query engines, data streaming, shuffling for efficiency, and the importance of optimizing data movement for query processing.
00:00
Transcript
Play full episode
Remember Everything You Learn from Podcasts
Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.