All Times EDT

Abstract Details

Activity Number: 266 - Recent Advances in Statistical Network Analysis with Applications
Type: Invited
Date/Time: Wednesday, August 11, 2021 : 1:30 PM to 3:20 PM
Sponsor: Section on Statistical Graphics
Abstract #316919
Title: On Scalable Estimation for Overlapping Clustering Models
Author(s): Purnamrita Sarkar and Deepayan Chakrabarti*
Companies: University of Texas at Austin and University of Texas at Austin
Keywords: networks; overlapping; clustering; SVM
Abstract:

People belong to multiple communities, words belong to multiple topics, and books cover multiple genres; overlapping clusters are commonplace. Many existing overlapping clustering methods model each person (or word, or book) as a non-negative weighted combination of “exemplars” who belong solely to one community, up to some small noise. Geometrically, each person is a point in a cone whose corners are these exemplars. This basic form encompasses the widely used Mixed Membership Stochastic Blockmodel of networks and its degree-corrected variants, as well as topic models such as LDA. We show that a simple one-class SVM yields provably consistent parameter inference for all such models, and scales to large datasets. Experimental results on several simulated and real datasets show that our algorithm (called SVM-cone) is both accurate and scalable.
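
The geometric picture in the abstract can be illustrated with a small noiseless sketch. This is not the authors' SVM-cone implementation: it assumes hypothetical exemplars (`CORNERS`) and mixing weights (`MIXTURES`), rescales every point to unit length, and fits a simplified hard-margin one-class SVM (maximum-margin separation from the origin) by dual coordinate ascent instead of a library solver. All function and variable names are illustrative. Under these assumptions, the points sitting exactly on the margin are expected to be the pure exemplars.

```python
import math

def dot(a, b):
    return sum(x * y for x, y in zip(a, b))

def normalize(v):
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v]

def fit_hard_margin_one_class_svm(points, sweeps=500):
    """Minimize 0.5*||w||^2 subject to w.z >= 1 for every point z.

    Solved by coordinate ascent on dual variables beta >= 0,
    where w = sum_i beta_i * z_i. A simplified stand-in for a
    one-class SVM, chosen so the sketch needs no external library.
    """
    w = [0.0] * len(points[0])
    beta = [0.0] * len(points)
    for _ in range(sweeps):
        for i, z in enumerate(points):
            # Exact maximizer of the dual along coordinate i, clipped at zero.
            new_beta = max(0.0, beta[i] + (1.0 - dot(w, z)) / dot(z, z))
            step = new_beta - beta[i]
            w = [wk + step * zk for wk, zk in zip(w, z)]
            beta[i] = new_beta
    return w, beta

# Hypothetical exemplars: one "corner" of the cone per community.
CORNERS = [[1.0, 0.2, 0.0], [0.0, 1.0, 0.2], [0.2, 0.0, 1.0]]

# Hypothetical (scale, community weights) pairs. The scale plays the role
# of a degree-like factor; rows 1, 3 and 6 are pure exemplars.
MIXTURES = [
    (2.0, [0.5, 0.5, 0.0]),
    (1.0, [1.0, 0.0, 0.0]),
    (0.5, [0.2, 0.6, 0.2]),
    (3.0, [0.0, 1.0, 0.0]),
    (1.5, [0.7, 0.3, 0.0]),
    (0.8, [1 / 3, 1 / 3, 1 / 3]),
    (4.0, [0.0, 0.0, 1.0]),
    (1.2, [0.0, 0.4, 0.6]),
    (2.5, [0.3, 0.0, 0.7]),
]

def mix(scale, weights):
    # Non-negative weighted combination of the corners, times a positive scale.
    return [scale * sum(wt * c[k] for wt, c in zip(weights, CORNERS))
            for k in range(len(CORNERS[0]))]

# Rescaling to unit length removes the per-point scale, leaving only direction.
points = [normalize(mix(scale, weights)) for scale, weights in MIXTURES]

w, beta = fit_hard_margin_one_class_svm(points)
margins = [dot(w, z) for z in points]

# Support vectors: points lying on the separating hyperplane w.z = 1.
corner_idx = [i for i, m in enumerate(margins) if m <= 1.0 + 1e-6]
print(corner_idx)
```

In this toy setting every mixed point lands strictly beyond the hyperplane through the normalized exemplars, so only the pure rows should be flagged. With noisy data a soft-margin formulation and a tolerance on the margin would be needed, which this sketch does not attempt.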


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program