All Times EDT

Abstract Details

Activity Number: 163 - Biometrics Section Byar Award Student Paper Session II
Type: Topic Contributed
Date/Time: Tuesday, August 4, 2020, 10:00 AM to 11:50 AM
Sponsor: Biometrics Section
Abstract #312468
Title: Integrative Generalized Convex Clustering Optimization and Feature Selection for Mixed Multi-View Data
Author(s): Minjie Wang* and Genevera Allen
Companies: Rice University and Rice University
Keywords: Integrative clustering; convex clustering; feature selection; convex optimization; sparse clustering; GLM deviance

In mixed multi-view data, multiple sets of diverse features are measured on the same set of samples. By integrating all available data sources, we seek to discover common group structure among samples that may be hidden when each data view is clustered separately. We develop a convex formulation that inherits the strong statistical, mathematical and empirical properties of increasingly popular convex clustering methods. Specifically, our Integrative Generalized Convex Clustering Optimization (iGecco) method employs a different convex loss for each data view, together with a joint convex fusion penalty that leads to common groups. Integrating mixed multi-view data is often challenging when each data source is high-dimensional. To perform feature selection in this setting, we develop an adaptive shifted group-lasso penalty that selects features by shrinking them towards their loss-specific centers. The resulting iGecco+ approach selects the features from each data view that are most informative for determining groups. Through a series of numerical experiments and real data examples from genomics, we show that iGecco+ achieves superior empirical performance for high-dimensional mixed multi-view data.
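
As a rough illustration of the kind of objective the abstract describes, the sketch below evaluates a per-view convex loss plus a joint fusion penalty over pairs of samples, with an optional shifted group-lasso term on feature columns. It is not the authors' implementation: the function names (`objective`, `fusion_penalty`, `shifted_group_lasso`), the choice of example losses, the tuning parameters `gamma` and `alpha`, and the use of unit rather than adaptive weights are all assumptions made for illustration, and the code only evaluates the objective without optimizing it.

```python
import math

def squared_error(x, u):
    # Example loss for a continuous view (assumed choice).
    return 0.5 * (x - u) ** 2

def bernoulli_logit(x, u):
    # Example loss for a binary view: negative log-likelihood of x in {0, 1}
    # with natural parameter u (assumed choice).
    return math.log1p(math.exp(u)) - x * u

def fusion_penalty(centroids, weights=None):
    # Sum over sample pairs of the Euclidean distance between their
    # centroid rows, concatenated across all views. Fusing rows jointly
    # across views is what encourages a common grouping.
    n = len(centroids[0])
    total = 0.0
    for i in range(n):
        for j in range(i + 1, n):
            sq = sum((a - b) ** 2
                     for U in centroids
                     for a, b in zip(U[i], U[j]))
            w = 1.0 if weights is None else weights[i][j]
            total += w * math.sqrt(sq)
    return total

def shifted_group_lasso(U, centers):
    # Column-wise group penalty that shrinks each feature's centroid column
    # towards a given center (unit weights here, not adaptive ones).
    return sum(math.sqrt(sum((row[j] - c) ** 2 for row in U))
               for j, c in enumerate(centers))

def objective(views, centroids, losses, gamma, alpha=0.0,
              centers=None, weights=None):
    # views, centroids: one n-by-p_k matrix (list of rows) per data view.
    # losses: one elementwise loss function per view.
    fit = sum(loss(x, u)
              for X, U, loss in zip(views, centroids, losses)
              for x_row, u_row in zip(X, U)
              for x, u in zip(x_row, u_row))
    value = fit + gamma * fusion_penalty(centroids, weights)
    if alpha > 0.0 and centers is not None:
        value += alpha * sum(shifted_group_lasso(U, c)
                             for U, c in zip(centroids, centers))
    return value
```

With `gamma` near zero the fit term dominates and each sample keeps its own centroid; as `gamma` grows, the fusion term pulls centroid rows together across all views at once, so samples merge into shared groups. A feature whose centroid column sits exactly at its center contributes nothing to the group penalty, which is the sense in which such a penalty can deselect features.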

Authors who are presenting talks have a * after their name.

Back to the full JSM 2020 program