Get the app
Timothy Nguyen
DeepMind Research Scientist and MIT scholar, known for his research on transformers and n-gram statistics. His work includes a novel method for detecting overfitting in large language models without using holdout sets.
Best podcasts with Timothy Nguyen
Ranked by the Snipd community
12 snips
Aug 15, 2024
• 33min
Is ChatGPT an N-gram model on steroids?
chevron_right
In this discussion, Timothy Nguyen, a DeepMind Research Scientist and MIT scholar, shares insights from his innovative research on transformers and n-gram statistics. He reveals a method to analyze transformer predictions without tapping into internal mechanisms. The conversation covers how transformers evolve during training, particularly in curriculum learning, and how to detect overfitting without traditional holdout methods. Nguyen also dives into philosophical questions about AI understanding, highlighting the complexities of interpreting neural network behavior.
The AI-powered Podcast Player
Save insights by tapping your headphones, chat with episodes, discover the best highlights - and more!
Get the app