Authors
Timothée Tabouy, Pierre Barbillon, Julien Chiquet
Publication date
2020/1/2
Journal
Journal of the American Statistical Association
Volume
115
Issue
529
Pages
455-466
Publisher
Taylor & Francis
Description
This article deals with nonobserved dyads during the sampling of a network and consecutive issues in the inference of the stochastic block model (SBM). We review sampling designs and recover missing at random (MAR) and not missing at random (NMAR) conditions for the SBM. We introduce variants of the variational EM algorithm for inferring the SBM under various sampling designs (MAR and NMAR) all available as an R package. Model selection criteria based on integrated classification likelihood are derived for selecting both the number of blocks and the sampling design. We investigate the accuracy and the range of applicability of these algorithms with simulations. We explore two real-world networks from ethnology (seed circulation network) and biology (protein–protein interaction network), where the interpretations considerably depend on the sampling designs considered. Supplementary materials for …
Total citations
2019202020212022202320243771475
Scholar articles
T Tabouy, P Barbillon, J Chiquet - Journal of the American Statistical Association, 2020