D. Sculley — Technical Debt, Trade-offs, and Kaggle

Gradient Dissent: Conversations on AI

Why Do We Assume IID Train and Test Sets From the Same Distribution?

Twenty years ago, machine learning was basically like: look, you've got your test set and your training set, and so long as they're from the same distribution, we're just going to assume that your test data has all the behaviors that you're going to need to worry about. Just make sure you've got good accuracy on your held-out test set. But when we go and deploy a model in the real world, it's pretty unlikely that the data that model encounters is going to be from exactly the same distribution that happened to be in our limited historical snapshot of data.
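As a rough illustration of the point above (not something from the episode), here is a minimal sketch of how held-out accuracy can look fine while accuracy on shifted data drops. Everything in it is an assumption made for the example: the two-Gaussian toy data, the midpoint-threshold classifier, the `shift=1.5` value standing in for "real world" drift, and the helper names `sample`, `fit_threshold`, and `accuracy`.

```python
import random


def sample(n, rng, shift=0.0):
    """Draw n labelled points: class 0 ~ N(0 + shift, 1), class 1 ~ N(2 + shift, 1)."""
    data = []
    for _ in range(n):
        y = rng.randint(0, 1)
        x = rng.gauss(2.0 * y + shift, 1.0)
        data.append((x, y))
    return data


def fit_threshold(train):
    """Toy 1-D classifier: the midpoint between the two class means."""
    xs0 = [x for x, y in train if y == 0]
    xs1 = [x for x, y in train if y == 1]
    return (sum(xs0) / len(xs0) + sum(xs1) / len(xs1)) / 2.0


def accuracy(threshold, data):
    """Fraction of points where 'x > threshold' matches the label."""
    correct = sum((x > threshold) == (y == 1) for x, y in data)
    return correct / len(data)


rng = random.Random(0)  # fixed seed so the run is repeatable

train = sample(2000, rng)                # historical snapshot
heldout = sample(2000, rng)              # same distribution as train (the IID assumption)
deployed = sample(2000, rng, shift=1.5)  # hypothetical drifted data seen after deployment

threshold = fit_threshold(train)
heldout_acc = accuracy(threshold, heldout)
deployed_acc = accuracy(threshold, deployed)

print(f"held-out accuracy: {heldout_acc:.3f}")
print(f"deployed accuracy: {deployed_acc:.3f}")
```

In this toy setup the held-out number says nothing about the drop on the shifted data, because the held-out set was drawn from the same snapshot as the training set.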
