

DataNation - Podcast for Data Engineers, Analysts and Scientists
Alex Merced Podcasts
Welcome to "Datanation," the podcast where your host, Alex Merced, takes you on a captivating journey through the fascinating world of data. In each episode, we explore a wide range of data topics, from data engineering and data analytics to the art and science of data-driven decision-making.
In the age of information, data is the currency that drives innovation and progress. "Datanation" is your passport to this ever-evolving landscape, where we unravel the mysteries, dissect the trends, and celebrate the breakthroughs shaping the data-driven future.
Join Alex Merced, a seasoned data enthusiast and educator, as he engages in enlightening discussions, informative interviews, and thought-provoking explorations of data concepts and practices. Whether you're a seasoned data professional, a curious tech enthusiast, or someone simply intrigued by the power of data, this podcast offers valuable insights and knowledge.
Find all episodes at: https://host.alexmercedpodcast.com/series/datanation/
Follow Alex on Twitter @amdatalakehouse
Find Alex's Blogs and Social Links at AlexMerced.com
In the age of information, data is the currency that drives innovation and progress. "Datanation" is your passport to this ever-evolving landscape, where we unravel the mysteries, dissect the trends, and celebrate the breakthroughs shaping the data-driven future.
Join Alex Merced, a seasoned data enthusiast and educator, as he engages in enlightening discussions, informative interviews, and thought-provoking explorations of data concepts and practices. Whether you're a seasoned data professional, a curious tech enthusiast, or someone simply intrigued by the power of data, this podcast offers valuable insights and knowledge.
Find all episodes at: https://host.alexmercedpodcast.com/series/datanation/
Follow Alex on Twitter @amdatalakehouse
Find Alex's Blogs and Social Links at AlexMerced.com
Episodes
Mentioned books

Mar 28, 2024 • 0sec
52 – Apache Iceberg, Dremio and PuppyGraph
Discussing the benefits of Apache Iceberg's open data ecosystem. Exploring Graph Data Processing with Dremio, Puppy Graph, and Apache Iceberg. Efficiency and Flexibility of Apache Iceberg for data lakes, overcoming data duplication challenges and enabling diverse data modeling possibilities.

Mar 25, 2024 • 0sec
#1 – intro to catalogs, manifests and metadata. Oh my!
Alex Merced introduces his new podcast exploring open-source data projects like Apache Iceberg. The episode discusses the importance of catalogs, manifests, and metadata in developing advanced data systems affordably. Listeners are encouraged to subscribe for future in-depth explorations of open source project architectures.

7 snips
Mar 18, 2024 • 0sec
51 – Open Data Standards (Apache Iceberg, Apache Parquet, Apache Arrow, Apache Ibis, Apach Substrait)
Explore the benefits of open data standards like Apache Arrow and Apache Iceberg in the data space, optimizing data transfer efficiency with Apache Arrow Flight and ADBC, enhancing scan planning in data catalogs with Apache Iceberg spec and Apache Ibis, standardizing data frameworks and SQL query processing with Apache Substrate, and the value of standardized open data formats and systems for innovation and efficiency.

Feb 21, 2024 • 0sec
50 – Thinking about the flow of Streaming/Real-Time Data
Alex thinks on the development of Real-time data pipelines.

Feb 2, 2024 • 0sec
48 – Understanding how Lakehouse Table Formats are Implemented in your Favorite Tools
Alex Merced discusses how Lakehouse Table Formats like Apache Iceberg, Apache Hudi, and Delta Lake are implemented in favorite tools. The podcast explores Java libraries, file structures, metadata tables, and file slices. It also covers implementing formats in different languages, query performance, and the differences between Apache Iceberg, Hoodie, and Delta formats.

5 snips
Jan 21, 2024 • 0sec
47 – Understanding your cloud costs (Storage, Egress, Compute, Serverless, etc.)
Exploring cloud costs, distributed file systems, object storage, and tiered storage models. Understanding cost-effective cloud service models and navigating cloud costs. Emphasizing the importance of optimizing data handling for cost efficiency.

Jan 20, 2024 • 0sec
Bonus: New Youtube Channel, State of the Data Lakehouse
Find all my data resources below:https://bio.alexmerced.com/data Listen to the State of the Data Lakehouse Podcast Here:https://em360tech.com/podcast/dremio-state-data-lakehouse?utm_source=podcasts&utm_medium=podcast&utm_content=content&utm_campaign=alexmercedcontent&utm_term=iceberg+lakehouse+nessie

Jan 9, 2024 • 0sec
2024 Preview – Data/Web Content
youtube.com/@alexmercedcoder youtube.com/@alexmerceddata twitter.com/alexmercedcoder twitter.com/amdatalakehouse

Dec 8, 2023 • 0sec
46 – Apache Iceberg vs Delta Lake: Understanding the Table Format Debate
Exploring the differences between Apache Iceberg and Delta Lake, including their requirements and structure. The podcast also dives into the importance of open source projects with a focus on Apache Iceberg. Additionally, robust tools for AI and ML workflows, such as Drimeo, are discussed.

Nov 1, 2023 • 0sec
45 – BI Dashboard Acceleration (Extracts, Cubes and Reflections)
Alex Merced discusses different techniques to speed up BI Dashboard performance.