DataNation - Podcast for Data Engineers, Analysts and Scientists

Alex Merced Podcasts
May 8, 2023

35 – Data Lakehouse Statistics (Understanding Parquet and Iceberg)

Alex Merced explains how statistics are collected and used when working with Parquet files and Apache Iceberg tables. Follow Alex on Twitter: @amdatalakehouse
Apr 12, 2023

BONUS: What is Object Storage like AWS S3, Minio and more!

Alex Merced discusses what object storage is and the history of file systems. Join the community at datanation.click
Apr 8, 2023

34 – What is a Vector Database?

Alex Merced explains what a vector database is. Join the community at DataNation.click
Mar 23, 2023

BONUS: The Big Picture at a Tech Company (Engineering, Product, Marketing, Sales)

Alex Merced discusses the different departments at a tech company and how they fit together to create success. Follow Alex on Twitter: Web -> @alexmercedcoder, Data -> @amdatalakehouse
Mar 21, 2023

33 – CI/CD on the Data Lakehouse

Alex Merced discusses what CI/CD is and how to build CI/CD pipelines on the data lakehouse.
Mar 10, 2023

32 – Data Versioning Solutions (Apache Iceberg, Project Nessie, LakeFS)

Alex Merced discusses the different data versioning solutions and the approach each one takes.
Feb 22, 2023

31 – Optimizing MPP Workloads

Alex Merced discusses how MPP tools plan tasks and how understanding that can help you plan your writes better. Register for Subsurface at dremio.com/subsurface
Feb 15, 2023

30 – The Subsurface Live! Data Lakehouse Conference

Register for Subsurface at Dremio.com/subsurface. Follow me on Twitter: @amdatalakehouse. Subscribe to this and my other podcasts: "Gnarly Data Waves", "Select * from Data.Lake;", "Web Dev 101", and "Web and Data: Interviews by Alex Merced".
Feb 7, 2023

29 – Optimizing Data Performance on Small Data and Big Data

Alex Merced discusses the considerations involved in optimizing data performance: no single tool makes every use case performant, so the key is understanding which tool solves which use case.
Feb 1, 2023

28 – Reduce Data Warehouse costs with Dremio and DuckDB

Alex Merced discusses how you can reduce data warehouse costs by using Dremio to unify and organize your data lake and DuckDB for local ad hoc queries on data pulled through Dremio. Join the Slack community at DataNation.click
