Neural Search Talks — Zeta Alpha

Generating Training Data with Large Language Models w/ Special Guest Marzieh Fadaee

Dec 13, 2022
Marzieh Fadaee, NLP Research Lead at Zeta Alpha, discusses using large language models such as GPT-3 to generate domain-specific training data. The conversation covers two papers, 'InPars' and 'Promptagator', and their methods for high-quality data augmentation with minimal human intervention. Fadaee explores the challenges of applying language models to information retrieval, the intricacies of prompt engineering, and the potential pitfalls of synthetic data, and points to directions for future research on optimizing neural retrieval systems.