This is the program for the 2010 Joint Statistical Meetings in Vancouver, British Columbia.

Abstract Details

Activity Number: 532
Type: Contributed
Date/Time: Wednesday, August 4, 2010 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Learning and Data Mining
Abstract - #307490
Title: An Examination of Reliability of Text Mining
Author(s): Chong Ho Yu*+ and Angel Jannasch-Pennell and Samuel DiGangi
Companies: Arizona State University and Arizona State University and Arizona State University
Address: 1475 N Scottsdale Rd, Scottsdale, AZ, 85257
Keywords: text mining ; reliability ; data mining
Abstract:

Inter-rater reliability of textual analysis conducted with qualitative methods has been a concern for many researchers. Although text mining is claimed to be a reliable technique because of its sophisticated algorithms, research comparing the agreement among results from different text mining applications applied to the same dataset is virtually absent. To fill this gap, this project compares the results of several text mining packages. The same data source, consisting of student blogs at a university, was used to extract common threads. Because no single best solution exists to serve as a benchmark, the objective of this project is not to rank the accuracy of these packages. Rather, strategies for evaluating inter-coder reliability were employed. The text miners showed a high degree of inconsistency, and thus triangulation for text mining is strongly recommended.
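
The abstract does not say which inter-coder reliability statistic was used. As an illustration only, the sketch below treats two text mining packages as two coders and computes Cohen's kappa, one common agreement measure, over the theme each package assigns to the same blog posts. The function name, the theme labels, and the data are hypothetical and are not taken from the study.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two equal-length sequences of categorical labels."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label sequences of equal length")
    n = len(labels_a)
    # Observed agreement: share of items given the same label by both coders.
    p_observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement: derived from each coder's marginal label frequencies.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    p_expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if p_expected == 1.0:
        return 1.0  # both coders used one identical label throughout
    return (p_observed - p_expected) / (1.0 - p_expected)


# Hypothetical theme assignments for six blog posts (illustrative data only).
miner_a = ["courses", "housing", "courses", "social", "housing", "social"]
miner_b = ["courses", "social", "courses", "social", "housing", "courses"]
kappa = cohen_kappa(miner_a, miner_b)
print(f"kappa = {kappa:.2f}")
```

Kappa corrects raw percent agreement for the agreement expected by chance, so a value near 0 indicates chance-level consistency between packages and a value near 1 indicates near-perfect consistency. With more than two packages, a pairwise kappa matrix or a multi-rater statistic would be needed instead.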


The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

