Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 446 - High-Dimensional Biomarkers in Drug Discovery and Early Drug Development Studies: An Integrated Data Analysis Approach
Type: Topic Contributed
Date/Time: Thursday, August 6, 2020 : 10:00 AM to 11:50 AM
Sponsor: Biopharmaceutical Section
Abstract #312229
Title: High-Dimensional Surrogacy and Biomarkers: Computational Aspects of an Upscaled Integrated Data Analysis
Author(s): Rudradev Sengupta*
Companies: Janssen Pharmaceutical Companies of Johnson and Johnson, Beerse, Belgium
Keywords: High Dimensional Data; Integrated Analysis; Surrogacy and Biomarkers; High Performance Computing; Parallelization with R
Abstract:

Identification of genomic biomarkers is a primary data analysis task in the context of drug discovery experiments. These experiments consist of several high dimensional datasets which contain information about a set of new drugs (compounds) under development. This data structure introduces the challenge of multi-source data integration which is needed to identify the biological pathways related to the new set of drugs under development. In order to process all these information, High performance Computing (HPC) techniques are required. Though R packages for parallel computing are available, they are not optimized for a specific setting or data structure. In the current study, we proposed a new “master-slave” framework, for data analysis using R in parallel, in a computer cluster. The proposed data analysis workflow is applied to a multi-source high dimensional drug discovery dataset and a performance comparison is made between the new framework and existing R packages for parallel computing. Different configuration settings, for parallel programming in R, are presented to show that the computation time, for the specific application under consideration, can be reduced significantly.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2020 program