The Data Exchange with Ben Lorica cover image

ETL for LLMs

The Data Exchange with Ben Lorica

CHAPTER

Evolution and Challenges in Building ETL Pipelines for Large Language Models

The chapter delves into the evolution of ETL pipelines for LLMs, emphasizing the integration of software engineering practices like unit testing and data quality checks. It explores the importance of high-quality inferences at scale and the benefits of using open source projects over custom solutions for ETL and pre-processing data. The speakers provide insights into key considerations for deploying large language models, discussing applications of unstructured data processing and the impact of utilizing structured versus unstructured data.

00:00
Transcript
Play full episode

Remember Everything You Learn from Podcasts

Save insights instantly, chat with episodes, and build lasting knowledge - all powered by AI.
App store bannerPlay store banner