
How Apache Pinot Achieves 200,000 Queries per Second (with Tim Berglund)
Developer Voices
00:00
Optimizing Query Processing and Data Movement in a Server Cluster
The chapter explores the intricate process of server communication with controllers and brokers in handling metadata changes, emphasizing Zookeeper's role. It discusses query routing, segment distribution, query scatter, filtering, and reduction operations across servers, drawing parallels to map-reduce concepts. The conversation delves into single-stage vs. multi-stage query engines, data streaming, shuffling for efficiency, and the importance of optimizing data movement for query processing.
Transcript
Play full episode