JSM 2011 Online Program

The views expressed here are those of the individual authors and not necessarily those of the JSM sponsors, their officers, or their staff.

Abstract Details

Activity Number: 144
Type: Invited
Date/Time: Monday, August 1, 2011 : 10:30 AM to 12:20 PM
Sponsor: Section on Statistical Learning and Data Mining
Abstract - #300384
Title: Large Scale Data at Facebook
Author(s): Eric Sun*+
Companies: Facebook, Inc.
Address: , , ,
Keywords: Facebook ; entities ; social networks ; crowdsourcing ; large-scale

In this talk we share lessons about user behavior gained from the analysis of aggregated status updates, check-ins, and Like data from the 750 million active users on Facebook. In particular, we focus on how user behavior has influenced the growth and curation of the graph of Facebook Community Pages. Community Pages represent concepts can be added to users' profiles on Facebook, allowing them to express their passions and share their interests with others. With hundreds of millions of Pages, problems like deduplication and disambiguation quickly become computationally difficult. We propose several solutions for these problems that can be applied at Facebook's scale, including cleaning up the graph via statistical algorithms and integrating user feedback via crowdsourcing.

The address information is for the authors that have a + after their name.
Authors who are presenting talks have a * after their name.

Back to the full JSM 2011 program

2011 JSM Online Program Home

For information, contact jsm@amstat.org or phone (888) 231-3473.

If you have questions about the Continuing Education program, please contact the Education Department.