Online Program Home
My Program

Abstract Details

Activity Number: 422 - Statistical Learning for Functional Data
Type: Contributed
Date/Time: Tuesday, July 31, 2018 : 2:00 PM to 3:50 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #328747 Presentation
Title: Probabilistic K-Mean with Local Alignment for Functional Motif Discovery
Author(s): Marzia A Cremona* and Francesca Chiaromonte
Companies: The Pennsylvania State University and The Pennsylvania State University
Keywords: Functional Data Analysis; Clustering; Bioinformatics; Shape

The aim is to address the problem of discovering functional motifs, i.e. typical "shapes" that may recur several times in a set of (multidimensional) curves, capturing important local characteristics of these curves. We formulate probabilistic K-mean with local alignment, a novel algorithm that leverages ideas from Functional Data Analysis (joint clustering and alignment of curves), Bioinformatics (local alignment through the extension of high similarity "seeds") and fuzzy clustering (curves belonging to more than one cluster, if they contain more than one typical "shape"). Our algorithm identifies shared curve portions, which represent candidate functional motifs in a set of curves under consideration. It can employ various dissimilarity measures in order to capture different shape characteristics. After demonstrating the performance of the algorithm on simulated data, we apply it to discover functional motifs in "Omics" signals related to mutagenesis and genome dynamics, exploring high-resolution profiles of different mutation rates in regions of the human genome where these rates are globally elevated.

Authors who are presenting talks have a * after their name.

Back to the full JSM 2018 program