Data Engineering Podcast cover image

Data Engineering Podcast

Evolving Responsibilities in AI Data Management

Feb 16, 2025
Bartosz Mikulski, an MLOps engineer with a rich background in data engineering, dives deep into the realm of AI data management. He highlights the crucial role of data testing in AI applications, especially with the rise of generative AI. Bartosz discusses the need for specialized datasets and the skills required for data engineers to transition into AI. He also addresses challenges like frequent data reprocessing and unstructured data handling, showcasing the evolving responsibilities in this fast-paced field.
38:57

Podcast summary created with Snipd AI

Quick takeaways

  • Data engineers must adapt to evolving AI demands by mastering skills like data testing, reprocessing, and working with unstructured datasets.
  • The necessity for specific test datasets in AI applications emphasizes the importance of effective evaluation strategies over traditional training datasets.

Deep dives

Understanding Data Requirements for AI Applications

AI applications require specific types of data assets, primarily focusing on test and evaluation datasets. Unlike traditional data engineering, where training datasets are paramount, generative AI applications rely more on accurately configured evaluation datasets to assess whether functionalities operate correctly. As applications evolve, developers find that each stage of an AI workflow necessitates distinct test datasets to troubleshoot various individual processes. This multiplicative requirement highlights the importance of comprehensive data gathering and the ability to generate realistic testing scenarios, which can significantly streamline the development and deployment of AI applications.

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode