JSM 2015 Online Program

Online Program Home
My Program

Abstract Details

Activity Number: 190
Type: Contributed
Date/Time: Monday, August 10, 2015 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Learning and Data Mining
Abstract #317268
Title: A Coefficient of Determination for Topic Models
Author(s): Thomas Jones*
Companies: 3e Services LLC
Keywords: Topic Modeling ; Text Mining ; Multinomial Distribution ; R-squared ; Coefficient of Determination ; Latent Dirichlet Allocation

This document proposes a new (old) metric for evaluating goodness of fit in topic models, the coefficient of determination, or R-squared. Within the context of topic modeling, R-squared has the same interpretation that it does when used in a broader class of statistical models. The topic model R-squared uses a geometric interpretation of the standard R-squared statistic. Reporting R-squared with topic models addresses two current problems in topic modeling: a lack of standard evaluation metrics and ease of communication with lay audiences.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2015 program

For program information, contact the JSM Registration Department or phone (888) 231-3473.

For Professional Development information, contact the Education Department.

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

2015 JSM Online Program Home