The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.
Online Program Home
Abstract Details
Activity Number:
|
499
|
Type:
|
Topic Contributed
|
Date/Time:
|
Wednesday, August 1, 2012 : 10:30 AM to 12:20 PM
|
Sponsor:
|
WNAR
|
Abstract - #306285 |
Title:
|
Penalized Mixtures of Variable Order Markov Chains for Biological Sequence Data
|
Author(s):
|
Karin S Dorman*+
|
Companies:
|
Iowa State University
|
Address:
|
Iowa State University, Ames, IA, , USA
|
Keywords:
|
interpolated Markov model ;
mixture model
|
Abstract:
|
The interpolated Markov model (IMM) is a heuristic model for discrete sequences of random variables that has had success in speech recognition and biological sequence classification. The basic idea is to grow higher order Markov chains only as needed to predict the next nucleotide. If a substring is common, then the full substring or an even longer superstring is used for prediction. If it is uncommon, then a shorter suffix is used for prediction. The IMM is a type of mixture Markov chain, where each transition is generated from a mixture of chains of varying order. The IMM is actually a heuristic algorithm for estimating the mixing proportions. We present a penalized mixture model to estimate the weights and transition probabilities in a mathematical framework. We demonstrate its application in genetics, where it has extensive potential.
|
The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.
Back to the full JSM 2012 program
|
2012 JSM Online Program Home
For information, contact jsm@amstat.org or phone (888) 231-3473.
If you have questions about the Continuing Education program, please contact the Education Department.