Online Program Home
My Program

Abstract Details

Activity Number: 126 - SPEED: New Methods in Statistical Genomics and Genetics Part 1
Type: Contributed
Date/Time: Monday, July 29, 2019 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistics in Genomics and Genetics
Abstract #304698 Presentation
Title: Regularized Regression by Graph Propagation for Genomic Data Analysis
Author(s): Han Yu* and Rachael Hageman Blair
Companies: Roswell Park Comprehensive Cancer Center and the State University of New York at Buffalo
Keywords: linear regression; graph; prediction; variable selection; genomics

Regression analysis is a widely used tool for prediction in genomic studies. In the last decades, a number of databases of biological networks have become available. These networks document relevant functional information and associations for omics data, where the nodes represent the biological entities such as genes and the edges between them represent associations or interactions. Fully harnessing such information for predictive modeling remains a challenge. Most existing methods incorporate network information by extending group lasso penalty to connected nodes, or penalizing differences in their coefficients. However, these methods may fail in biological networks, where regulations can take effect at levels not represented by the data. We propose a Graph Propagation Regularized regression (GPR), which uses a propagation process simulating spread of activation between nodes, thus implicitly encodes the network structure beyond direct connectivity. GPR has shown superior performance in a range of synthetic scenarios where important genes tend to be clustered in modules. We will demonstrate its application to gene expression data from ovarian cancers.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2019 program