Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 433 - Statistical Approaches in Text Analysis
Type: Topic-Contributed
Date/Time: Thursday, August 12, 2021 : 4:00 PM to 5:50 PM
Sponsor: Text Analysis Interest Group
Abstract #317445
Title: On the Need for More Statistics in Text Analysis, with Recent Advances
Author(s): Jordan Rodu* and Michael Baiocchi
Companies: University of Virginia and Stanford University
Keywords: text analysis; NLP
Abstract:

We argue for the need for more statistical thinking in text analysis. Recent advances in computer science have dominated the text analysis landscape (often called natural language processing (NLP)). But these exciting developments have left large gaps in their wake, particularly in places where our scientific colleagues most need robust approaches. We provide a theoretical justification for why NLP techniques are often not suitable, and encourage statisticians to work on principled methodologies to provide alternatives. Some current advances are highlighted.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program