Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 578 - SBSS Student Paper Competition II
Type: Topic Contributed
Date/Time: Thursday, August 6, 2020 : 3:00 PM to 4:50 PM
Sponsor: Section on Bayesian Statistical Science
Abstract #309850
Title: More for Less: Predicting and Maximizing Genetic Variant Discovery via Bayesian Nonparametrics
Author(s): Lorenzo Masoero* and Federico Camerlenghi and Stefano Favaro and Tamara Broderick
Companies: Massachusetts Institute of Technology and Universita di Milano and Universita di Torino and Massachusetts Institute of Technology
Keywords: Bayesian nonparametrics; Genomics; Optimal experimental design
Abstract:

While the cost of sequencing genomes has decreased dramatically in recent years, this expense often remains non-trivial. Under a fixed budget, scientists face a natural trade off between quantity and quality: they can spend resources to sequence either more individuals or more accurately. Optimizing resource allocation promises to reveal as many new variations in the genome as possible, and thus as many new scientific insights as possible. We consider the setting where scientists have conducted a pilot study to reveal genomic variants and are contemplating a follow-up study. We introduce a Bayesian nonparametric methodology to predict the number of new variants in the follow-up study based on the pilot study. When experimental conditions are kept constant between the pilot and follow up, we show on real data from the gnomAD project that our prediction is more accurate than three recent proposals, and competitive with a more classic proposal. Unlike other methods, though, our method allows practitioners to change experimental conditions between the pilot and the follow-up, allowing more realistic predictions and optimal allocation of a fixed budget between quality and quantity.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2020 program