Abstract:
|
While the study of a single network is well-established, technological advances now allow for the collection of multiple networks with relative ease. Increasingly, anywhere from several to thousands of networks can be created from brain imaging, gene co-expression data, or microbiome measurements. And these networks, in turn, are being looked to as potentially powerful features to be used in modeling. However, with networks being non-Euclidean in nature, how best to incorporate them into standard modeling tasks is not obvious. In this talk, we discuss a Gaussian process framework that provides a unified approach to binary classification, anomaly detection, and survival analysis with network inputs. We encode the networks in the kernel of a GP prior via their pairwise differences and show that our model achieves posterior consistency under several choices of graph distance that induce provably positive definite kernels. We provide an extensive simulation study to identify which distances perform the best for popular random graph models and to demonstrate that our model outperforms other state-of-the-art graph classifiers.
|