| Activity Number: | 460 
                            	- Clustering Methods for Big Data Problems | 
                    
                        | Type: | Topic Contributed | 
                    
                        | Date/Time: | Wednesday, August 2, 2017 : 8:30 AM to 10:20 AM | 
                    
                        | Sponsor: | Section on Statistical Learning and Data Science | 
                
                    
                        | Abstract #323326 | View Presentation | 
                    
                        | Title: | A Parallel EM Algorithm for Statistical Learning via Mixture Models | 
                
                
                    | Author(s): | Geoffrey McLachlan* | 
                
                    | Companies: | The University of Queensland | 
                
                    | Keywords: | mixture models  ; 
                            EM algorithm ; 
                            skew distributions ; 
                            multithreaded programs | 
                
                    | Abstract: | 
                            Finite mixtures of skew normal and skew t-distributions provide a flexible tool for modelling heterogeneous data with asymmetric distributional features. However, parameter estimation via the Expectation-Maximization (EM) algorithm can become very time-consuming due to the complicated expressions involved in the E-step that are numerically expensive to evaluate. We therefore propose a block implementation of the EM algorithm that facilitates the calculations in the E- and M-steps to be spread across a large number of threads.  The approach can be easily implemented for use by multicore and multiprocessor machines. The improvement in time performance is illustrated on some real datasets.   
                         | 
                
                
                    Authors who are presenting talks have a * after their name.