The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.
Abstract Details
Activity Number:
|
585
|
Type:
|
Contributed
|
Date/Time:
|
Wednesday, August 3, 2011 : 2:00 PM to 3:50 PM
|
Sponsor:
|
Section on Statistical Learning and Data Mining
|
Abstract - #303009 |
Title:
|
Enhancement of Clustering Results with Variable Selection and Resampling Methods
|
Author(s):
|
Wenzhu Bi*+ and George C. Tseng and Julie C. Price and Lisa A. Weissfeld
|
Companies:
|
University of Pittsburgh and University of Pittsburgh and University of Pittsburgh and University of Pittsburgh
|
Address:
|
Department of Biostatistics, Pittsburgh, PA, 15261,
|
Keywords:
|
clustering ;
variable selection ;
resampling ;
imaging ;
feature selection ;
PET
|
Abstract:
|
Clustering can be used to identify biologically distinct subgroups from an n×p dataset without knowledge of the true group membership. Since some variables are irrelevant to clustering and may only introduce noise, variable selection methods have recently been developed to exclude these variables and to yield more reliable and parsimonious clustering results. Recently in 2010, Witten and Tibshirani introduced a general framework for variable selection by applying a Lasso-type penalty and an L2 condition. We propose to combine Witten's method with resampling techniques, such as bootstrapping or leave-one-out resampling. The goal is to alleviate the effects of noise and outliers on the variable selection and clustering results and also to generate confidence intervals for the clustering results. The performance of the proposed method is demonstrated by simulation. We then apply the method to neuroimaging data. The focus is on the analysis of voxel-level data and the identification of a subset of voxels that can be used to classify subjects into groups. We present a PET imaging example using an unspecified radiotracer to identify groups of subjects with varying amounts of tracer.
|
The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.
Back to the full JSM 2011 program
|
2011 JSM Online Program Home
For information, contact jsm@amstat.org or phone (888) 231-3473.
If you have questions about the Continuing Education program, please contact the Education Department.