JSM 2014 Online Program

Abstract Details

Activity Number: 176
Type: Contributed
Date/Time: Monday, August 4, 2014 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Computing
Abstract #312859
Title: Faster Exact Probabilities for Statistics of Overlapping Pattern Occurrences
Author(s): Donald Martin*+
Companies: North Carolina State University
Keywords: spaced seeds ; deterministic finite automaton ; distribution of pattern statistic ; statistical inference
Abstract:

When an auxiliary Markov chain is used to compute probabilities for pattern statistics, the computational complexity is directly related to the number of Markov chain states. Thus, in recent years, minimal deterministic finite automata have been used as data structures that facilitate computation while keeping the number of states to a minimum. For statistics in which overlapping and non-overlapping pattern occurrences are treated differently, one could form an extended automaton that includes prefixes of initial and overlapping word occurrences, and then minimize the extended automaton. However, there are situations in which forming a full extended automaton and then minimizing it is computationally expensive. We give a method that bypasses the formation of a full extended automaton before minimization and thereby facilitates efficient computation. The method is applied to the distribution of the number of sequence positions covered by spaced seed hits, a pattern-matching paradigm that has proven fruitful in similarity searches for DNA sequences.
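
For readers unfamiliar with the auxiliary Markov chain approach, the following is a minimal sketch of the general idea only. It is not the method of the talk: it handles a single contiguous word rather than spaced seeds, assumes independent and identically distributed letters, counts overlapping occurrences, and builds no extended or minimized automaton. The function names `build_kmp_automaton` and `overlapping_count_distribution` are hypothetical and chosen for illustration.

```python
from collections import defaultdict


def build_kmp_automaton(word, alphabet):
    """Return delta[state][symbol] -> next state.

    A state is the length of the longest prefix of `word` that is a
    suffix of the text read so far; state len(word) means a match.
    """
    m = len(word)
    delta = [dict() for _ in range(m + 1)]
    for state in range(m + 1):
        for symbol in alphabet:
            candidate = word[:state] + symbol
            # Longest suffix of candidate (length <= m) that is a prefix of word.
            k = min(m, len(candidate))
            while k > 0 and candidate[-k:] != word[:k]:
                k -= 1
            delta[state][symbol] = k
    return delta


def overlapping_count_distribution(word, probs, n):
    """Exact distribution of the number of overlapping occurrences of
    `word` in an i.i.d. sequence of length n (assumption: `probs` maps
    each letter to its probability and the values sum to 1)."""
    alphabet = sorted(probs)
    delta = build_kmp_automaton(word, alphabet)
    m = len(word)
    # Chain state = (automaton state, occurrences counted so far).
    mass = {(0, 0): 1.0}
    for _ in range(n):
        nxt = defaultdict(float)
        for (state, count), p in mass.items():
            for symbol in alphabet:
                t = delta[state][symbol]
                # Reaching state m completes one more occurrence.
                nxt[(t, count + (t == m))] += p * probs[symbol]
        mass = nxt
    dist = defaultdict(float)
    for (_, count), p in mass.items():
        dist[count] += p
    return dict(dist)


print(overlapping_count_distribution("aa", {"a": 0.5, "b": 0.5}, 3))
```

In this sketch each step loops over every reachable (automaton state, count) pair, so the work per sequence position grows with the number of automaton states. That is the cost the abstract refers to when it motivates keeping the automaton minimal.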


Authors who are presenting talks have a * after their name.

