Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 209 - Statistical methods for genomic and epigenetic data analysis
Type: Contributed
Date/Time: Tuesday, August 10, 2021 : 1:30 PM to 3:20 PM
Sponsor: Section on Statistics in Genomics and Genetics
Abstract #317940
Title: Integrative Modeling of Massive Multiple-Domain Cancer Genomics Data Sets
Author(s): Dongyan Yan*
Companies: Eli Lilly and Company
Keywords: Integrative clustering; Bayesian nonparametric; Dirichlet process; Unsupervised learning; Bayesian variable selection
Abstract:

Rapid technological advances have allowed for molecular profiling across multiple omics domains from a single sample for clinical decision making in many diseases, especially cancer. As tumor development and progression are dynamic biological processes involving composite genomic aberrations, key challenges are to effectively assimilate information across these domains to identify genomic signatures and biological entities that are druggable, develop accurate risk prediction profiles for future patients, and identify novel patient subgroups for tailored therapy and monitoring. We invent integrative probabilistic frameworks for massive multiple-domain data that coherently incorporate dependence within and between domains to accurately detect tumor subtypes, thus providing a catalogue of genomic aberrations associated with cancer taxonomy. We describe an efficient variable selection procedure to identify relevant genomic aberrations that can potentially reveal underlying drivers of a disease. The proposed methodology is applied to lung cancer genomics samples publicly available from The Cancer Genome Atlas.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program