missSBM: An R Package for Handling Missing Values in the Stochastic Block Model
release_qx4ax7vg6reljkftjnepklng64
by
Pierre Barbillon, Julien Chiquet, Timothée Tabouy
2021
Abstract
The Stochastic Block Model (SBM) is a popular probabilistic model for random
graphs. It is commonly used for clustering network data by aggregating nodes
that share similar connectivity patterns into blocks. When fitting an SBM to a
network which is partially observed, it is important to take into account the
underlying process that generates the missing values, otherwise the inference
may be biased. This paper introduces missSBM, an R-package fitting the SBM when
the network is partially observed, i.e., the adjacency matrix contains not only
1's or 0's encoding presence or absence of edges but also NA's encoding missing
information between pairs of nodes. This package implements a set of algorithms
for fitting the binary SBM, possibly in the presence of external covariates, by
performing variational inference adapted to several observation processes. Our
implementation automatically explores different block numbers to select the
most relevant model according to the Integrated Classification Likelihood (ICL)
criterion. The ICL criterion can also help determine which observation process
better corresponds to a given dataset. Finally, missSBM can be used to perform
imputation of missing entries in the adjacency matrix. We illustrate the
package on a network data set consisting of interactions between political
blogs sampled during the French presidential election in 2007.
In text/plain
format
Archived Content
There are no accessible files associated with this release. You could check other releases for this work for an accessible version.
Know of a fulltext copy of on the public web? Submit a URL and we will archive it
1906.12201v3
access all versions, variants, and formats of this works (eg, pre-prints)