

Building domain specific natural language applications
Feb 6, 2020
33:09
In this episode of the Data Exchange, I speak with David Talby, co-creator of Spark NLP, an open source, highly scalable, production-grade natural language processing (NLP) library. Spark NLP has become one of the more popular NLP libraries and is available on PyPI, Conda, Maven, and Spark Packages. With recent research advances in large-scale natural language models, there is strong interest in domain-specific natural language applications. Besides their work on Spark NLP, David and his collaborators are building natural language models tuned specifically for healthcare applications.
Our conversation spanned many topics, including:
- Spark NLP: its current status and some common and surprising use cases.
- Recent developments in NLP research and their implications for companies.
- Spark NLP for Healthcare.
Detailed show notes can be found on The Data Exchange website.