18min chapter

Catalog & Cocktails: The Honest, No-BS Data Podcast cover image

Bridging Innovation and Open Source to the Real World with Paco Nathan

Catalog & Cocktails: The Honest, No-BS Data Podcast

CHAPTER

Open Source, Reproducibility, and the Importance of Code Quality

This chapter explores the role of open source in bringing innovation from research into the real world, highlighting projects like Hadoop, Spark, Project Jupiter, Spacey Pipelines, and Ray. The discussion also covers the growing importance of open source machine learning, the impact of open source models like those on hugging face litter boards, and the challenges in reproducibility in machine learning research. Additionally, the chapter addresses the problem of over-emphasizing benchmarks in computer science research and provides recommendations for improving research in the field of machine learning.

00:00

Get the Snipd
podcast app

Unlock the knowledge in podcasts with the podcast player of the future.
App store bannerPlay store banner

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode

Save any
moment

Hear something you like? Tap your headphones to save it with AI-generated key takeaways

Share
& Export

Send highlights to Twitter, WhatsApp or export them to Notion, Readwise & more

AI-powered
podcast player

Listen to all your favourite podcasts with AI-powered features

Discover
highlights

Listen to the best highlights from the podcasts you love and dive into the full episode