Abstract:
|
Statistical phylogenetics is the inference of a tree structure representing evolutionary history using biological sequence data (e.g. DNA) under a likelihood model of sequence evolution. All such inferences perform either heuristic search or Markov chain Monte Carlo (MCMC) on a graph built with the various trees as vertices and edges representing tree modifications. Because this graph is connected with nonzero transition probabilities, MCMC is guaranteed to work in the large time limit, although inference using a finite number of steps is determined by mixing properties of MCMC on the graph. However, little is known about the large-scale structure of, or properties of MCMC on, the graphs actually used for inference. In this talk, I will first demonstrate significant graph effects on phylogenetic inference for the subtree-prune-regraft (rSPR) graph, which is a popular such graph involving reconnection of subtrees of a tree in a different location. I will then recap what is known about the rSPR graph and describe our work on Ricci-Ollivier curvature of the SPR graph, then describe consequences for random walks on the SPR graph and phylogenetic MCMC.
|
ASA Meetings Department
732 North Washington Street, Alexandria, VA 22314
(703) 684-1221 • meetings@amstat.org
Copyright © American Statistical Association.