JSM 2015 Preliminary Program

Online Program Home
My Program

Abstract Details

Activity Number: 638
Type: Topic Contributed
Date/Time: Thursday, August 13, 2015 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Computing
Abstract #316104
Title: Fast Multinomial Clustering with Applications to Genetic Population Structure
Author(s): Karin Dorman* and Arun Sethuraman and Wei-Chen Chen
Companies: Iowa State University and Temple University and FDA
Keywords: mixture model ; population genetics ; EM algorithm ; EM acceleration
Abstract:

Identifying population structure from multilocus genotype data is key to downstream population genetic analyses in a variety of fields, including conservation, evolutionary genetics, Genome Wide Association Studies (GWAS), and pedigree reconstruction for quantitative genetics. There are both Bayesian and maximum likelihood approaches for inference of this model, but neither has scaled well with large datasets. We extend recent improvements in accelerated optimization routines for independent Binomial models to the Multinomial situation. We demonstrate striking speed improvements that find the global maximum quicker and permit computationally intensive analyses such as those useful for estimating the number of clusters K. We demonstrate that methods to estimate K are far more reliable using the sped-up maximum likelihood approach. Genetics is our motivating problem, but the model is generally applicable to mixtures of coordinate-wise independent Multinomials.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2015 program





For program information, contact the JSM Registration Department or phone (888) 231-3473.

For Professional Development information, contact the Education Department.

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

2015 JSM Online Program Home