JSM 2014 Home
Online Program Home
My Program

Abstract Details

Activity Number: 466
Type: Contributed
Date/Time: Wednesday, August 6, 2014 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Learning and Data Mining
Abstract #312392 View Presentation
Title: Statistical Modeling of Genomic Words and Motifs
Author(s): Guozhu Zhang*+ and Stephen Lee
Companies: and University of Idaho
Keywords: Segmentation ; Genomic words ; motifs
Abstract:

The arrangement of the four nucleotides A, C, G, and T along the genome is known to be non-random. Vast amount of information are built into the complex arrangements and compositions of genomic nucleotides. It can be viewed as a book of nucleotide text of instructions at the cellular level. Genome is decoded as a continuous stream of nucleotide alphabets message as one read the genomic text. We approach the reading of genomic text by segmentation - dividing the continuous stream into chunks according to some statistical measures of homogeneity. The goal would be to segment the genome into the most probable dictionary of motifs or words. Words are defined by our segmentation method as more homogeneous units within the boundaries than without. The core idea of this paper is to introduce the method of setting word boundaries. We applied the method to compare the yeast and worm genomes, to distinguish ordered and disordered protein sequences, and to characterize different English texts.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2014 program




2014 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Professional Development program, please contact the Education Department.

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

ASA Meetings Department  •  732 North Washington Street, Alexandria, VA 22314  •  (703) 684-1221  •  meetings@amstat.org
Copyright © American Statistical Association.