Abstract #300118


The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2002 Program page



JSM 2002 Abstract #300118
Activity Number: 192
Type: Invited
Date/Time: Tuesday, August 13, 2002 : 10:30 AM to 12:20 PM
Sponsor: ENAR
Abstract - #300118
Title: Subset Selection in Regression Models using Simulated Annealing: Software and Experiences
Author(s): Nitin Patel*+
Affiliation(s): Massachusetts Institute of Technology/Cytel Software Corp.
Address: 675 Massachusetts Avenue, Cambridge, Massachusetts, 02139-3309, USA
Keywords: data mining ; dimension reduction ; generlized linear regression models
Abstract:

An important dimension reduction technique in data mining is that of selecting subsets of covariates for multiple linear and generalized linear regression models. When the number of covariates to choose from is large, as is often the case with genetic and genomic data, computing best subsets is prohibitively slow. The standard methods used in such situations are variants of step-wise regression. We have adapted the simulated annealing heuristic method to the subset selection problem. Applications to a number of biological data sets has been very encouraging. In this talk we discuss this experience and the easy-to-use software, XL Miner, that we have used in our investigations. Joint work with: Pralay Senchaudhuri (Cytel Software) and Marsha Wilcox (Harvard Medical School).


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2002 program

JSM 2002

For information, contact meetings@amstat.org or phone (703) 684-1221.

If you have questions about the Continuing Education program, please contact the Education Department.

Revised March 2002