OLMo: Everything You Need to Train an Open Source LLM with Akshita Bhagia - #674

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Enhancing Data Transparency in Large Datasets

This chapter explores efforts to make training datasets more transparent, focusing on tools such as datasheets, model cards, and the 'What's in My Big Data' toolkit. It highlights the rapid pace of progress in natural language processing and the importance of profiling large-scale datasets to understand what they contain.
