Online Program Home
My Program

Abstract Details

Activity Number: 73 - Data Driven Digital and Social Media Marketing
Type: Contributed
Date/Time: Sunday, July 29, 2018 : 4:00 PM to 5:50 PM
Sponsor: Section on Statistics in Marketing
Abstract #330779
Title: Forecasting Accuracy of Topic Modeling Techniques for Online Reviews: a Benchmark Study
Author(s): Yuan Cheng* and Shawn Mankad
Companies: Cornell University and Cornell University
Keywords: topic modeling; online reviews; prediction; benchmark
Abstract:

Reviews from online markets are critical for companies to obtain feedback and develop strategies. One of the most common analysis approaches of the unstructured online reviews is to follow a two-stage procedure, where one first derives text features through topic modeling techniques and subsequently estimates linear models for prediction and inference. Yet, there is lack of a guidance in choosing different topic modeling techniques for forecasting in various text scenarios. In this paper, we perform the first ever benchmark study on both simulated texts and real online reviews to provide guidance for practitioners in selecting topic modeling methods depending on properties of the textual corpus. We argue that when the research design combines estimated topics as independent variables within linear models, as is common in applied economic and decision analyses, the best topic modeling method should balance capturing the underlying textual themes in addition to maximizing statistical and forecasting power of the regression model.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program