|
Activity Number:
|
101
|
|
Type:
|
Contributed
|
|
Date/Time:
|
Monday, August 4, 2008 : 8:30 AM to 10:20 AM
|
|
Sponsor:
|
Section on Statistical Computing
|
| Abstract - #300705 |
|
Title:
|
Large-Scale Clustering of Dependent Curves
|
|
Author(s):
|
Huijing Jiang*+ and Nicoleta Serban
|
|
Companies:
|
Georgia Institute of Technology and Georgia Institute of Technology
|
|
Address:
|
350448 Georgia Tech Station, Atlanta, GA, 30332,
|
|
Keywords:
|
clustering ; spatial dependence ; functional data analysis ; demographics ; Incomplete Choslesky Factorization ; kernel-based decomposition
|
|
Abstract:
|
In this paper, we introduce a model-based method for clustering multiple curves or functionals under spatial dependence specified up to a set of unknown parameters. The functionals are decomposed using a semiparametric model where the fixed effects account for the large-scale clustering association and the random effects for the small-scale spatial-dependence variability. The clustering model assumes the clustering membership as a realization from a Markov random field. Within our estimation framework, the emphasis is on a large number of functionals/spatial units with sparsely sampled time points. To overcome the computational cost resulting from large dependence matrix operations, the estimation algorithm includes a two-stage approximation: low-ranked kernel-based decomposition of the dependence matrix and Incomplete Choslesky Factorization of the kernel matrix.
|