JSM Preliminary Online Program
This is the preliminary program for the 2007 Joint Statistical Meetings in Salt Lake City, Utah.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.



Back to main JSM 2007 Program page




Activity Number: 544
Type: Contributed
Date/Time: Thursday, August 2, 2007 : 10:30 AM to 12:20 PM
Sponsor: Section on Physical and Engineering Sciences
Abstract - #310039
Title: Variable Selection for LDA When n < p
Author(s): Matthew Mitchell*+
Companies: Metabolon
Address: PO Box 110407, Research Triangle Park, NC, 27709,
Keywords: LDA ; variable selection
Abstract:

Linear Discriminant Analysis (LDA) is a classification technique used for multivariate data. Each discriminant is a linear combination of all the variables. However, when the number of variables (p) is larger than the sample size (n), all variables cannot be used because the matrices in the discriminant function become singular. There are several methods for overcoming this difficulty: performing a principal component analysis (PCA), and then performing the LDA on the principal components; using univariate significance tests to choose a subset of the variables for the LDA; forward, backward, or stepwise variable selection; or more complex techniques. We compare these techniques in terms of their predictive ability and their ability to select the important features. These comparisons will be performed over several simulated data sets of various complexities and a real dataset.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2007 program

JSM 2007 For information, contact jsm@amstat.org or phone (888) 231-3473. If you have questions about the Continuing Education program, please contact the Education Department.
Revised September, 2007