Online Program Home
  My Program

All Times EDT

Abstract Details

Activity Number: 49 - Recent Advances in Statistical Inference on Graphs
Type: Topic-Contributed
Date/Time: Sunday, August 8, 2021 : 3:30 PM to 5:20 PM
Sponsor: Section on Statistical Learning and Data Science
Abstract #317278
Title: A Deterministic Hitting-Time Moment Approach to Seed-Set Expansion Over a Graph
Author(s): Alexander Foss*
Companies: Sandia National Laboratories
Keywords: Graph; hitting-time; network; seed-set expansion; mixture model
Abstract:

We introduce HITMIX, a new technique for network seed-set expansion, i.e., the problem of identifying a set of graph vertices related to a given seed-set of vertices. We use the moments of the graph's hitting-time distribution to quantify the relationship of each non-seed vertex to the seed-set. This involves a deterministic calculation for the hitting-time moments that is scalable in the number of graph edges and so avoids directly sampling a Markov chain over the graph. The moments are used to fit a mixture model to estimate the probability that each non-seed vertex should be grouped with the seed set. This membership probability enables us to sort the non-seeds and threshold in a statistically-justified way. To the best of our knowledge, HITMIX is the first full statistical model for seed-set expansion that can give vertex-level membership probabilities. While HITMIX is a global method, its linear computation complexity in practice enables computations on large graphs. We have a high-performance implementation, and we present computational results on stochastic blockmodels and a small-world network from the SNAP repository.


Authors who are presenting talks have a * after their name.

Back to the full JSM 2021 program