JSM 2011 Online Program

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

Abstract Details

Activity Number: 288
Type: Topic Contributed
Date/Time: Tuesday, August 2, 2011 : 8:30 AM to 10:20 AM
Sponsor: Section on Statistical Computing
Abstract - #300853
Title: Statistical Learning in the Cloud with Graphlab
Author(s): Carlos Guestrin*+
Companies: Carnegie Mellon University
Address: , Pittsburgh, PA, 15213,
Keywords: machine learning ; statistical learning ; parallel algorithms ; distributed algorithms ; cloud computing ; large-scale data
Abstract:

Exponentially increasing dataset sizes have driven Statistical Learning experts to explore parallel and distributed computing. Furthermore, cloud computing resources such as Amazon EC2 have become available, providing cheap and scalable platforms for large scale computation. However, due to the complexities involved in distributed design, it can be difficult for researchers to take full advantage of cloud resources. Existing high-level parallel abstractions like MapReduce are insufficiently expressive while low-level tools like MPI and Pthreads leave learning experts repeatedly solving the same design challenges.

Targeting common patterns in learning , we developed GraphLab, which compactly expresses asynchronous iterative algorithms with sparse computational dependencies, while ensuring data consistency and achieving a high degree of parallel performance. We demonstrate the expressiveness of the framework by designing and implementing parallel versions for a variety of real-world tasks, including learning graphical models with approximate inference, Gibbs sampling, tensor factorization, Co-EM, Lasso and Compressed Sensing, evaluating on clouds of up to 256 processors.


The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

Back to the full JSM 2011 program




2011 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.