JSM 2014 Home
Online Program Home
My Program

Abstract Details

Activity Number: 614
Type: Contributed
Date/Time: Thursday, August 7, 2014 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Learning and Data Mining
Abstract #312940
Title: Model-Based Biclustering of Clickstream Data
Author(s): Volodymyr Melnykov*+
Companies: University of Alabama
Keywords: clickstream ; finite mixture model ; model-based clustering ; biclustering ; model selection
Abstract:

Navigation patterns expressed by sequences of visited web-sites can characterize the behavior and habits of web-users. Such web-page routes taken by individuals are commonly called clickstreams. Clustering clickstream sequences is challenging since one needs to group categorical data sequences rather than vectors and the majority of traditional clustering algorithms are not applicable in this setting. We consider model-based clustering relying on the mixture of first order Markov models. Since the number of distinct web-pages, and therefore the number of states in a Markov process, can be high, such a mixture model involves a large number of parameters. We propose grouping states by their similarity to reduce the number of parameters in the model. The developed methodology is illustrated on synthetic and real datasets with good results.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2014 program




2014 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Professional Development program, please contact the Education Department.

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

ASA Meetings Department  •  732 North Washington Street, Alexandria, VA 22314  •  (703) 684-1221  •  meetings@amstat.org
Copyright © American Statistical Association.