The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Deep Learning for NLP: From the Trenches with Charlene Chambliss - #433

Dec 3, 2020

Charlene Chambliss, a Machine Learning Engineer at Primer AI with expertise in NLP, discusses her unique transition from psychology to data science. She shares insights on working with BERT models, detailing projects like her multilingual BERT initiative and a COVID-19 classifier. The conversation dives into challenges in data labeling, the use of innovative techniques for topic drift, and debugging NLP models. Charlene also offers advice for those looking to shift into tech from non-technical backgrounds, emphasizing the importance of mentorship.

Ask episode

AI Snips

Chapters

Transcript

Episode notes

ANECDOTE

Multilingual NER Project

Charlene Chambliss built multilingual NER models for IQT Labs' machine translation models.
These models highlighted entities in Russian and English translations for quality assessment.

INSIGHT

ML for ML Assessment

The NER models were used to assess the quality of machine translation models.
This involved highlighting entities like names to check for mistranslations.

ADVICE

Fast Tokenizers

Use Hugging Face Transformers' new Fast Tokenizers.
They simplify aligning text spans with tokens, eliminating boilerplate code.

Get the Snipd Podcast app to discover more snips from this episode

Get the app