
Gnarly Data Waves by Dremio EP12 - How to Modernize Hive to the Data Lakehouse with Dremio and Apache Iceberg
Apr 12, 2023
Learn how to modernize Hive to the Data Lakehouse using Dremio and Apache Iceberg. Explore in-place migration, shadow migration, and moving tables between catalogs. Discover the benefits of Apache Iceberg, the different catalog options, and optimistic concurrency. The episode also covers automating compaction and optimizing tables, along with pricing, editions, and compatibility across Dremio, Kafka, Nessie, and Iceberg. It closes with insights on using the AWS Glue catalog with Iceberg and the upcoming Nessie connector for the Dremio Arctic catalog.
Chapters
Introduction
00:00 • 4min
Modernizing Hive with Apache Iceberg for Data Lakehouse
03:34 • 22min
Different Catalog Options and their Implementations
25:55 • 25min
Understanding Optimistic Concurrency in Data Processing Jobs
50:30 • 2min
Automating Compaction and Optimizing Tables in a Data Lakehouse
52:30 • 4min
Pricing, Editions, and Compatibility with Dremio, Kafka, Nessie, and Iceberg
56:25 • 2min
AWS Glue Catalog and Nessie Connector
58:26 • 5min
