Online Program Home
  My Program

Abstract Details

Activity Number: 460 - Clustering Methods for Big Data Problems
Type: Topic Contributed
Date/Time: Wednesday, August 2, 2017 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Learning and Data Science
Abstract #323326 View Presentation
Title: A Parallel EM Algorithm for Statistical Learning via Mixture Models
Author(s): Geoffrey McLachlan*
Companies: The University of Queensland
Keywords: mixture models ; EM algorithm ; skew distributions ; multithreaded programs
Abstract:

Finite mixtures of skew normal and skew t-distributions provide a flexible tool for modelling heterogeneous data with asymmetric distributional features. However, parameter estimation via the Expectation-Maximization (EM) algorithm can become very time-consuming due to the complicated expressions involved in the E-step that are numerically expensive to evaluate. We therefore propose a block implementation of the EM algorithm that facilitates the calculations in the E- and M-steps to be spread across a large number of threads. The approach can be easily implemented for use by multicore and multiprocessor machines. The improvement in time performance is illustrated on some real datasets.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2017 program

 
 
Copyright © American Statistical Association