JSM 2016 Online Program

Activity Number:	554
Type:	Topic Contributed
Date/Time:	Wednesday, August 3, 2016 : 10:30 AM to 12:20 PM
Sponsor:	Scientific and Public Affairs Advisory Committee
Abstract #318913
Title:	Text-Mining Using Discrete Optimization: An Application to Automate Conference Scheduling
Author(s):	Jason Pan and Kelly Zou* and Ching-Ray Yu and Franklin W. Sun and Martin O. Carlsson
Companies:	Pfizer and Pfizer and Pfizer and Pfizer and Pfizer
Keywords:	Text mining ; Structured database ; Document term matrix ; Discrete optimization ; Simulated annealing ; Stemming
Abstract:	Nowadays, unstructured text data are increasingly and readily available. For example, information may arise from conference abstracts, scientific publications, surveys, written notes, e-mails, blogs, and other sources including social media. In this research, we utilize text mining techniques and tools to first transform the unstructured texts into a structured database. Consequently, a document term matrix is extracted from structured data, with descriptive statistics generated for further exploration and analysis. Discrete optimization and simulated annealing will be applied to maximize an objective function based on the overall similarity of abstracts within a session. These methods are illustrated on a recent conference sponsored by the American Statistical Association. Statistical programming is conducted in R.

Authors who are presenting talks have a * after their name.