Machine Learning Street Talk (MLST)

#039 - Lena Voita - NLP

Jan 23, 2021
Lena Voita, a Ph.D. student and former research scientist at Yandex, shares her insights on NLP and machine translation. She discusses her research on the source and target contributions in neural translation models and explores information-theoretic probing using minimum description length. Lena also delves into the evolution of representations in Transformers and the complexities of language models, including challenges like hallucinations and exposure bias. Additionally, she highlights her comprehensive NLP course designed to foster deeper understanding in the field.
Ask episode
AI Snips
Chapters
Books
Transcript
Episode notes
ANECDOTE

Forgotten Recording

  • The hosts forgot to record the first 15 minutes of the conversation with Lena Voita.
  • They joked about building a taser into Tim's chair to prevent this in the future.
INSIGHT

Hallucination vs. Translation

  • Language models, especially in machine translation, face a dilemma between translating source information and maintaining target language grammar.
  • This can lead to hallucinations, where the model prioritizes grammatical correctness over accurate translation.
INSIGHT

Lena's Blog

  • Lena Voita's blog excels in visualizing complex NLP concepts, making them easier to understand.
  • She uses graphics and animations to explain topics like source-target contributions in machine translation.
Get the Snipd Podcast app to discover more snips from this episode
Get the app