Online Program Home
My Program

Abstract Details

Activity Number: 304 - Clustering and Regression Analyzes
Type: Contributed
Date/Time: Tuesday, July 31, 2018 : 8:30 AM to 10:20 AM
Sponsor: International Statistical Institute
Abstract #328522 Presentation
Title: Assisted Gene Expression-Based Clustering with AWNCut
Author(s): Yang Li* and Ruofan Bie and Sebastian J Teran Hidalgo and Yichen Qin and Mengyunn Wu and Shuangge Ma
Companies: Renmin University of China and Renmin University of China and Yale University and University of Cincinnati and Yale University and Yale University
Keywords: Assisted analysis; Clustering; Gene expression data; NCut
Abstract:

Gene expression data have been extensively used for clustering samples. The clusters so generated can serve as the basis for disease subtype identification and risk stratification. With the small sample sizes of genetic profiling studies and noisy nature of gene expression data, clustering analysis results are often unsatisfactory. In the most recent studies, a prominent trend is to conduct multidimensional profiling, which collects data on gene expressions as well as their regulators on the same subjects. We develop a novel assisted clustering method, which effectively uses regulator information to improve clustering analysis using gene expression data. To account for the fact that not all gene expressions are informative, we propose a weighted strategy, where the weights are determined data-dependently and can discriminate informative gene expressions from noises. The proposed method is built on the NCut technique and effectively realized using a simulated annealing algorithm. Simulations demonstrate that it can well outperform multiple direct competitors. In the analysis of TCGA melanoma data, biologically sensible findings different from the alternatives are made.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program