

The Great Data Debate
Nov 13, 2020
In this insightful discussion, George Fraser, founder of Fivetran, moderates a captivating debate featuring Bob Muglia, former CEO of Snowflake. They dive into the evolving roles of data lakes and warehouses, exploring how their functionalities are converging. The conversation touches on the integration of machine learning with analytics, the importance of low latency, and the future direction of the modern data stack. With perspectives on decentralization and the significance of collaborative tools, they tackle the complexities of technology in enterprise environments.
AI Snips
Chapters
Transcript
Episode notes
Data Lake's Future
- Data warehouses are optimized for analytics, a specific workflow and query pattern.
- Data lakes handle unstructured data, operational AI, and compute-intensive tasks, potentially consuming everything.
SQL Dominance
- SQL-based relational data warehouses will likely replace data lakes for structured and semi-structured data.
- Complex data like images and videos will require additional support in data warehouses.
AI/ML Growth
- Two primary data use cases exist: analytics (queries and dashboards) and operational AI (complex models in production).
- The operational AI use case, served by data lakes, is smaller but growing faster.