Conference Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 428 - Clustering and Dimension-Reduction Methods: From Omics to Single-Cell Data
Type: Contributed
Date/Time: Wednesday, August 10, 2022 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistics in Genomics and Genetics
Abstract #323448
Title: Improvement to GLASS/Maximum Tree Method of Species Tree Inference from Estimated Gene Trees Using Measurement Error Modified Single Linkage Clustering
Author(s): Sarah Alver* and James Degnan and Fletcher Christensen
Companies: University of New Mexico and University of New Mexico and University of New Mexico
Keywords: phylogenetics; species tree inference; measurement error; single-linkage clustering
Abstract:

The Global LAteSt Split/Maximum Tree method of species tree (ST) inference performs well and is statistically consistent when inferring the ST from known gene trees (GT). With estimated GTs, the inferred ST is the maximum likelihood tree and a consistent estimator of the ST under some conditions. The method as implemented in STEM (ST Estimation using Maximum likelihood) software has performed relatively poorly when the input is GT estimated from DNA sequences, and conditions for statistical consistency in this case can be unrealistic. We propose a modification to the STEM tree: an application of clustering in measurement error models as described by Su, Reedy and Carroll in 2018. The proposed method replaces estimated pairwise coalescence times used by STEM with randomly generated realizations from the distribution, estimated through measurement error modeling, of true pairwise coalescence times. As with STEM, the minimum of these realizations is taken over all loci for each pairwise distance to form a distance matrix, and single linkage clustering is used to infer the ST. In simulation studies, the new method outperforms STEM in terms of Robinson-Foulds distance from the true ST.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2022 program