Abstract #302312

This is the preliminary program for the 2003 Joint Statistical Meetings in San Francisco, California. Currently included in this program is the "technical" program, schedule of invited, topic contributed, regular contributed and poster sessions; Continuing Education courses (August 2-5, 2003); and Committee and Business Meetings. This on-line program will be updated frequently to reflect the most current revisions.

To View the Program:
You may choose to view all activities of the program or just parts of it at any one time. All activities are arranged by date and time.

The views expressed here are those of the individual authors
and not necessarily those of the ASA or its board, officers, or staff.


Back to main JSM 2003 Program page



JSM 2003 Abstract #302312
Activity Number: 125
Type: Contributed
Date/Time: Monday, August 4, 2003 : 10:30 AM to 12:20 PM
Sponsor: Section on Bayesian Stat. Sciences
Abstract - #302312
Title: Sequential Monte Carlo and Dirichlet Mixtures for Extracting Protein Alignment Models.
Author(s): Tanya Logvinenko*+ and Jun S. Liu
Companies: Stanford University and Harvard University
Address: 2 Carver St. #2, Somerville, MA, 02143,
Keywords: multiple sequence alignment ; Bayesian statistics ; sequential Monte Carlo ; hidden Markov model ; Gibbs Sampler
Abstract:

Multiple sequence alignment (MSA) can be viewed as computation of a posterior mean of a position specific profile matrix (PSPM), given a set of protein sequences to be aligned. Conditional distribution of a sequence given a PSPM, can be described by a Hidden Markov Model, with a "hidden" state being an unknown alignment path A which generates sequence S from (unknown) PSPM. We propose a novel querycentric Bayesian algorithm which applies Sequential Monte Carlo (SMC) framework to align the sequences and create the position specific profile matrix, regarding unknown missing data and imputing them sequentially. After information from all sequences is incorporated into the final profile, the sequences are realigned to the profile, and Gibbs Sampler is introduced to improve the multiple alignment. The use of SMC and Gibbs Sampler allows the procedure to avoid getting stuck in a local mode. To capture information contained in a column of aligned amino acids our method uses a Drichlet Mixture prior. Examples and comparisons of this MSA method show performance at least as good as the performance of the best existing methods available for MSA.


  • The address information is for the authors that have a + after their name.
  • Authors who are presenting talks have a * after their name.

Back to the full JSM 2003 program

JSM 2003 For information, contact meetings@amstat.org or phone (703) 684-1221. If you have questions about the Continuing Education program, please contact the Education Department.
Revised March 2003