Activity Number:
|
460
- Clustering Methods for Big Data Problems
|
Type:
|
Topic Contributed
|
Date/Time:
|
Wednesday, August 2, 2017 : 8:30 AM to 10:20 AM
|
Sponsor:
|
Section on Statistical Learning and Data Science
|
Abstract #323326
|
View Presentation
|
Title:
|
A Parallel EM Algorithm for Statistical Learning via Mixture Models
|
Author(s):
|
Geoffrey McLachlan*
|
Companies:
|
The University of Queensland
|
Keywords:
|
mixture models ;
EM algorithm ;
skew distributions ;
multithreaded programs
|
Abstract:
|
Finite mixtures of skew normal and skew t-distributions provide a flexible tool for modelling heterogeneous data with asymmetric distributional features. However, parameter estimation via the Expectation-Maximization (EM) algorithm can become very time-consuming due to the complicated expressions involved in the E-step that are numerically expensive to evaluate. We therefore propose a block implementation of the EM algorithm that facilitates the calculations in the E- and M-steps to be spread across a large number of threads. The approach can be easily implemented for use by multicore and multiprocessor machines. The improvement in time performance is illustrated on some real datasets.
|
Authors who are presenting talks have a * after their name.