JSM 2017 Online Program

Activity Number:	653 - Optimization Methods with Biological Applications
Type:	Topic Contributed
Date/Time:	Thursday, August 3, 2017 : 10:30 AM to 12:15 PM
Sponsor:	Biometrics Section
Abstract #323422
Title:	Convex Triclustering
Author(s):	Brian Gaines* and Eric Chi and Will Wei Sun and Hua Zhou
Companies:	North Carolina State University and North Carolina State University and University of Miami and University of California, Los Angeles
Keywords:	Clustering ; Fused lasso ; Multiway data analysis ; Regularization path
Abstract:	Clustering is a fundamental unsupervised learning technique that aims to discover groups of objects in a dataset. Biclustering extends clustering to two dimensions, grouping observations and features simultaneously; for example, clustering both cancerous tumors and genes, or both documents and words. Triclustering is the natural extension of clustering to three dimensions, where the data are organized in a three-dimensional array, or tensor. We develop and study a convex formulation of the triclustering problem, which is guaranteed to have a unique global minimum. Convex triclustering generates an entire solution path of possible triclusters governed by a single tuning parameter, and thus alleviates the need to specify the number of clusters a priori. We investigate the application of our method to biological datasets.
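
Illustrative sketch (not from the abstract): the abstract does not state the objective function, so the Python snippet below assumes the form commonly used for convex clustering and biclustering, extended to three modes: a least-squares fit term plus a fused-lasso-style penalty, scaled by a single tuning parameter gamma, on the differences between every pair of slices along each mode. The function names, the uniform pair weights, and the toy tensor are all hypothetical. The code only evaluates such an objective at a candidate solution U; it does not solve the optimization problem.

```python
import itertools

import numpy as np


def fusion_penalty(U, mode):
    """Sum of Frobenius distances between all pairs of slices along one mode.

    Assumption: uniform weights on every pair of slices.
    """
    slices = np.moveaxis(U, mode, 0)  # bring the chosen mode to the front
    pairs = itertools.combinations(range(slices.shape[0]), 2)
    return sum(np.linalg.norm(slices[i] - slices[j]) for i, j in pairs)


def tricluster_objective(X, U, gamma):
    """Assumed objective: 0.5 * ||X - U||_F^2 + gamma * (penalties over 3 modes)."""
    fit = 0.5 * np.sum((X - U) ** 2)
    penalty = sum(fusion_penalty(U, mode) for mode in range(3))
    return fit + gamma * penalty


# Hypothetical toy data: a 4 x 3 x 2 tensor of Gaussian noise.
rng = np.random.default_rng(0)
X = rng.normal(size=(4, 3, 2))

# With gamma = 0 the penalty vanishes, so U = X gives an objective of 0.0.
no_fusion = tricluster_objective(X, X, gamma=0.0)

# A constant tensor has identical slices along every mode, so its penalty is
# zero: this is the fully fused end of the path, a single tricluster.
U_mean = np.full_like(X, X.mean())
fully_fused = tricluster_objective(X, U_mean, gamma=10.0)
```

Under these assumptions, small values of gamma favor U close to X (every entry is its own cluster), while large values favor solutions whose slices coincide, which is how a single tuning parameter can trace out a path from many triclusters to one.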

Authors who are presenting talks have a * after their name.