Abstract:
|
Networks are ubiquitous in our lives and play a crucial role in information transmission. The network position, usually captured by centrality, affects individual’s decision making and thus provides information for inference and prediction. Though with a complete network, it is often the case that we might not be able to observe all information of individuals in the network (e.g. Facebook observes the friendship networks, but some users’ information might be missing). We propose a semi-supervised method for a regression problem where a network is fully observed but covariates and outcome are missing at random. By incorporating the network information, the method provides better estimates and prediction, but also a better estimate of centrality. We illustrate our method via a real data example of inferring the performance of Chinese firms with a complete equity holding network.
It is a joint work with Junhui Cai, Haipeng Shen, Dan Yang and Wu Zhu.
|