Practical AI

Evaluating models without test data

14 snips
Sep 20, 2022
Charles Martin, an AI and data science consultant, created WeightWatcher, a unique tool for analyzing neural networks without needing test data. In this discussion, he dives into the challenges of evaluating machine learning models, especially in fields like retail and finance. He emphasizes the significance of human judgment and innovative methods for model assessment. Martin also highlights the integration of statistical methods from physics in AI, showcasing how they can enhance model training and optimization processes.
Ask episode
AI Snips
Chapters
Transcript
Episode notes
ANECDOTE

Weight Watcher in Practice

  • Daniel Whitenack analyzed his XLM Roberta-based question-answering model with Weight Watcher.
  • The tool revealed 10 under-trained layers, prompting further investigation and adjustments.
INSIGHT

Deep Learning and Multifractal Data

  • Deep learning excels with natural data like text and images due to its multifractal structure.
  • Neural networks effectively learn these multifractal patterns, which explains their success.
ADVICE

Troubleshooting Unconverged Layers

  • Check regularization, dropouts, learning rates, and data sufficiency if layers haven't converged.
  • Consider freezing earlier layers, extending training, or adjusting hyperparameters.
Get the Snipd Podcast app to discover more snips from this episode
Get the app