This is the program for the 2010 Joint Statistical Meetings in Vancouver, British Columbia.
Abstract Details
Activity Number: 532
Type: Contributed
Date/Time: Wednesday, August 4, 2010, 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Learning and Data Mining
Abstract - #307490
Title: An Examination of Reliability of Text Mining
Author(s): Chong Ho Yu*+ and Angel Jannasch-Pennell and Samuel DiGangi
Companies: Arizona State University and Arizona State University and Arizona State University
Address: 1475 N Scottsdale Rd, Scottsdale, AZ, 85257
Keywords: text mining; reliability; data mining
Abstract:
Inter-rater reliability of textual analysis conducted with qualitative methods has long concerned researchers. Although text mining is often claimed to be reliable because of its sophisticated algorithms, research comparing the results of different text mining applications on the same dataset is virtually absent. To fill this gap, this project compares the results of several text mining packages. The same data source, a collection of student blogs at a university, is used to extract common threads. Because no single best solution exists to serve as a benchmark, the objective is not to rank the accuracy of these packages. Rather, strategies for evaluating inter-coder reliability were employed. The text miners were found to be highly inconsistent with one another, and thus triangulation for text mining is strongly recommended.
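The abstract does not say which inter-coder reliability measures were applied. As one illustration of the general idea, the sketch below treats each text mining package as a "coder" and computes Cohen's kappa for every pair of packages. It assumes each package's output has already been mapped onto a shared set of theme labels, one label per blog post; the package names, theme labels, and data are hypothetical and are not taken from the study.

```python
from collections import Counter
from itertools import combinations


def cohen_kappa(labels_a, labels_b):
    """Cohen's kappa for two coders who labeled the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("both coders must label the same non-empty set of items")
    n = len(labels_a)
    # Proportion of items on which the two coders agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Agreement expected by chance, from each coder's label frequencies.
    freq_a = Counter(labels_a)
    freq_b = Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:
        # Both coders used one and the same label for every item.
        return 1.0
    return (observed - expected) / (1.0 - expected)


def pairwise_kappa(assignments):
    """Kappa for every pair of coders in a {name: labels} mapping."""
    return {
        (x, y): cohen_kappa(assignments[x], assignments[y])
        for x, y in combinations(sorted(assignments), 2)
    }


# Hypothetical theme assignments for six blog posts (illustrative only).
assignments = {
    "package_a": ["exams", "housing", "exams", "sports", "housing", "exams"],
    "package_b": ["exams", "housing", "sports", "sports", "housing", "housing"],
    "package_c": ["exams", "exams", "exams", "sports", "housing", "exams"],
}

for pair, kappa in pairwise_kappa(assignments).items():
    print(pair, round(kappa, 3))
```

Low or widely varying pairwise values would point to the kind of inconsistency the abstract reports. Other agreement measures, such as Krippendorff's alpha or Fleiss' kappa for more than two coders, could be substituted under the same assumptions.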
The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.
For information, contact jsm@amstat.org or phone (888) 231-3473.