Data Engineering Podcast cover image

Data Engineering Podcast

Tackling Real Time Streaming Data With SQL Using RisingWave

Feb 4, 2024
The podcast discusses the RisingWave database engine for stream processing on S3, its architecture, challenges faced, and potential integration with a data lakehouse. It explores the use of Kafka for buffering and converting data formats, enhancing Postgres with real-time processing, and the differences in change data capture handling. The episode also covers workflow, onboarding, integration, and unexpected use cases of RisingWave in the manufacturing industry.
56:55

Podcast summary created with Snipd AI

Quick takeaways

  • RisingWave is a distributed SQL stream database built on top of S3, aiming to make stream processing more accessible and cost-efficient.
  • RisingWave focuses on scaling up before scaling out and automating the user experience to make it easy for users to experiment with the database.

Deep dives

The Rising Wave Database: A Novel Cloud-Native Stream Processing Engine

Rising Wave Labs has built a distributed SQL stream database that is built on top of S3. The main goal of Rising Wave is to make stream processing more accessible and cost-efficient. They aim to achieve this by providing a familiar SQL programming model and taking advantage of the cloud-native architecture. The database is designed to handle real-time analytics over Kafka data and can also serve as a booster for existing databases like Postgres. The architecture of Rising Wave involves the decoupling of computing and storage, with S3 serving as the primary storage layer. They are also working on integrating with the iceberg ecosystem to provide efficient query processing.

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner