Online Program Home
My Program

Abstract Details

Activity Number: 288 - Genomical Is the New Astronomical: Big Data Algorithms and Applications in Genomics
Type: Topic Contributed
Date/Time: Tuesday, July 31, 2018 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Computing
Abstract #330542
Title: Cloud Computing Approaches to Genomic Data Science
Author(s): Sean Davis*
Companies: National Cancer Institute
Keywords: genomics; cloud computing; big data; reproducible research; data sharing

Biomedical research is increasingly reliant on and driven by technologies that generate massive amounts of data to probe the mechanisms of disease and underpinnings of normal biology. The prime example of such a technology is high throughput sequencing applied to genomics research. In parallel with increased data generation capability, commercial cloud services have become commodity resources for flexible, scalable computing. Marrying cloud services, software tools for reproducible and portable workflows, and genomics data services and resources provides a powerful data ecosystem for large-scale genomic data science. Here, we present some key concepts, technologies, and software that allow for large-scale orchestration of data processing workflows and data science on commercial cloud resources. We also present some applications of these technologies to real-world genomics datasets, including how cloud computing can enhance portability, reproducibility, and data sharing. Finally, we highlight some of the challenges in adopting cloud technologies for biomedical data science.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program