JSM Preliminary Online Program
This is the preliminary program for the 2006 Joint Statistical Meetings in Seattle, Washington.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2006 Program page




Legend: = Applied Session, = Theme Session, = Presenter, Sheraton Seattle Hotel & Towers = “S”
Washington State Convention & Trade Center = “CC”, Grand Hyatt Seattle = “H”

Add To My Program
CE_07C Sun, 8/6/06, 8:30 AM - 5:00 PM CC-309
Text Mining - Continuing Education - Course
The ASA, Section on Statistical Computing
Instructor(s): David Madigan, Rutgers University, David D. Lewis, David D. Lewis Consulting LLC
This course will provide an overview of the arsenal of statistical and computational advances available, examining both statistical methods that pertain to textual data and language processing techniques that play a vital role in successful text mining. The course will present examples and case studies while attempting to provide an overarching framework that focuses on the connection between textual artifacts and real-world entities. We will describe "mainstream" text mining applications, but will focus on concepts and algorithms, rather than specific commercial software. Prerequisites include knowledge of basic statistical techniques and elementary probability. Familiarity with Introduction to the Practice of Statistics by Moore and McCabe or a similar text would suffice.

Course attendees are expected to be familiar with the topic at the level of:
Principles of Data Mining by David J. Hand, Heikki Mannila and Padhraic Smyth, The MIT Press.

 

JSM 2006 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised April, 2006