|
Activity Number:
|
354
|
|
Type:
|
Topic Contributed
|
|
Date/Time:
|
Tuesday, August 4, 2009 : 2:00 PM to 3:50 PM
|
|
Sponsor:
|
Section on Statistical Computing
|
| Abstract - #303741 |
|
Title:
|
Pairwise Variable Selection for High-Dimensional Model-Based Clustering
|
|
Author(s):
|
Frank Jian Guo*+ and Elizaveta Levina and George Michailidis and Ji Zhu
|
|
Companies:
|
University of Michigan and University of Michigan and University of Michigan and University of Michigan
|
|
Address:
|
Department of Statistics, Ann Arbor, MI, 48109,
|
|
Keywords:
|
Pairwise fusion ; Gaussian mixture models ; Variable selection ; EM algorithm ; Model-based clustering
|
|
Abstract:
|
Variable selection for clustering is an important and challenging problem in high-dimensional data analysis. Existing variable selection methods for model-based clustering select informative variables in a "one-in-all-out" manner; that is, a variable is selected if at least one pair of clusters is separable by this variable and removed if none of the clusters are separable by this variable. In many applications, however, it is of interest to further establish exactly which clusters are separable by each informative variable. To address this question, we propose a pairwise variable selection method for high-dimensional model-based clustering. The method is based on a new pairwise penalty. Results on simulated and real data show that the new method performs better than the L1-norm approach and offers better interpretation.
|