This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A Note on the Identifiability of the Degree-Corrected Stochastic Block Model

John Park Department of Mathematics, the University of Arizona Yunpeng Zhao Department of Statistics, Colorado State University Ning Hao Department of Mathematics, the University of Arizona Statistics & Data Science GIDP, the University of Arizona
Abstract

In this short note, we address the identifiability issues inherent in the Degree-Corrected Stochastic Block Model (DCSBM). We provide a rigorous proof demonstrating that the parameters of the DCSBM are identifiable up to a scaling factor and a permutation of the community labels, under a mild condition.

Keywords: Clustering, Community Detection, Networks.

1 Introduction

The Stochastic Block Model (SBM) is a foundational framework for modeling community structures in networks. It groups nodes into distinct communities, with the probability of an edge between two nodes determined solely by their community memberships (Holland and Leinhardt,, 1981; Nowicki and Snijders,, 2001; Bickel and Chen,, 2009; Choi et al.,, 2012). SBM extends the Erdős-Rényi model (Erdős and Rényi,, 1959), which assumes a uniform edge probability across all node pairs, thereby offering a simpler framework without community differentiation. In contrast, SBM captures both inter-community and intra-community connection patterns, making it a valuable tool for community detection (Abbe,, 2017; Zhao,, 2017). However, SBM assumes homogeneity within each community, limiting its applicability to real-world networks where significant degree variability exists among nodes within the same community. To address this limitation, the Degree-Corrected Stochastic Block Model (DCSBM) was introduced. DCSBM extends SBM by incorporating degree heterogeneity, allowing nodes within a community to exhibit varying degrees of connectivity (Karrer and Newman,, 2011; Ball et al.,, 2011). This enhancement improves the model’s flexibility, making DCSBM particularly suited for networks such as social, biological, and communication systems, where nodes often exhibit varying connectivity patterns even within the same community.

It is well known that parameters in an SBM, like those in other clustering models such as Gaussian mixture models, are not identifiable due to the possible permutation of community labels. For the DCSBM, this issue is further compounded by the non-identifiability of degree parameters due to scaling factors (Karrer and Newman,, 2011; Ball et al.,, 2011; Zhao et al.,, 2012). Despite these challenges, researchers generally agree that these identifiability issues pose only minor technical obstacles and that no additional identifiability problems exist for the DCSBM. However, formal references addressing the identifiability of the DCSBM are limited. In this note, we point out a small gap in a commonly used argument and provide a rigorous proof confirming that the parameter system of the DCSBM is identifiable up to a scaling factor and a permutation of the community labels, under a mild condition that each community has at least three members. To our best knowledge, this condition has never been explicitly stated in the literature. In addition, we show that this condition is necessary using a counterexample. Together, our results fill a critical gap in the theoretical understanding of the identifiability of the DCSBM.

2 Model Description

Consider an undirected graph G=(V,E)G=(V,E) with nn nodes. The adjacency matrix AA represents the presence or absence of edges between nodes. The DCSBM with nn nodes and KK communities is characterized by the following components with technical conditions.

  1. 1.

    Community Membership Matrix ZZ: ZZ is an n×Kn\times K binary matrix where Zik=1Z_{ik}=1 if node ii belongs to community kk, and 0 otherwise. We require that each community has at least 1 member. We use ziz_{i} to denote the community label of node ii. That is, zi=kz_{i}=k if and only if Zik=1Z_{ik}=1.

  2. 2.

    Degree Parameter Matrix Θ\Theta: Θ\Theta is an n×nn\times n diagonal matrix where Θii=θi>0\Theta_{ii}=\theta_{i}>0 represents the degree parameter for node ii.

  3. 3.

    Community Connection Matrix BB: BB is a symmetric full rank K×KK\times K matrix where BklB_{kl} represents the intensity of connection between nodes in community kk and community ll.

A triple {Z,Θ,B}\{Z,\Theta,B\} defines a DCSBM, where the off-diagonal elements of the adjacency matrix AA are generated independently with 𝔼[Aij]=Δij\mathbb{E}[A_{ij}]=\Delta_{ij}, and

Δ=ΘZBZΘ.\Delta=\Theta ZBZ^{\top}\Theta. (1)

We assume that all triples {Z,Θ,B}\{Z,\Theta,B\} satisfy the conditions listed above throughout this paper.

Now, we introduce useful notations and technical lemmas. We use [n][n] to denote the set of integers {1,,n}\{1,\dots,n\}. A partition of [n][n] is a collection of nonempty, mutually disjoint subsets of [n][n] whose union is [n][n]. A permutation matrix is a square binary matrix with exactly one entry of 1 in each row and each column, with all other entries being 0. A K×KK\times K permutation matrix PP corresponds to a permutation of a set with KK elements. The operator diag()\mathrm{diag}(\cdot) is defined as diag:nn×n\mathrm{diag}:\mathbb{R}^{n}\to\mathbb{R}^{n\times n}, mapping a vector to a diagonal matrix such that [diag(v)]ii=vi[\mathrm{diag}(v)]_{ii}=v_{i} for a vector v=(v1,,vn)v=(v_{1},\dots,v_{n})^{\top}. Finally, we use 1n1_{n} to denote an nn-dimensional vector where all entries are equal to 1.

Every equivalence relation on a set defines a partition of that set, and vice versa. For a matrix QQ, we define its ii-th row and jj-th row to be equivalent if they are identical. Using this equivalence relation, any matrix QQ with nn rows defines a partition of [n][n]. The following lemmas will be useful in our theoretical derivations.

Lemma 1.

An n×Kn\times K community membership matrix ZZ defines a partition of [n][n] based on row equivalence. Two community membership matrices ZZ and Z~\tilde{Z} define the same partition if and only if Z~=ZP\tilde{Z}=ZP for a permutation matrix PP.

Proof of Lemma 1: Define subset Ck[n]C_{k}\subset[n] as {i[n]Zik=1}\{i\in[n]\mid Z_{ik}=1\}. It is straightforward to verify that {C1,,CK}\{C_{1},\dots,C_{K}\} is a partition of [n][n]. Similarly, we can define another partition {C~1,,C~K}\{\tilde{C}_{1},\dots,\tilde{C}_{K}\} based on Z~\tilde{Z}. The partitions {C1,,CK}\{C_{1},\dots,C_{K}\} and {C~1,,C~K}\{\tilde{C}_{1},\dots,\tilde{C}_{K}\} are identical if and only if there exists a permutation π:[K][K]\pi:[K]\to[K] such that C~π(k)=Ck\tilde{C}_{\pi(k)}=C_{k}. This permutation π\pi induces a permutation matrix PP with Pk,π(k)=1P_{k,\pi(k)}=1, and it follows that Z~=ZP\tilde{Z}=ZP. \Box

Lemma 2.

Let ZZ and Z~\tilde{Z} be two n×Kn\times K community membership matrices, and RR and R~\tilde{R} be two full row-rank K×KK\times K^{\prime} matrices where KKK^{\prime}\geq K. The matrices ZRZR and Z~R~\tilde{Z}\tilde{R} define the same partition of [n][n] based on row equivalence if and only if Z~=ZP\tilde{Z}=ZP for a permutation matrix PP.

Proof of Lemma 2: All rows of RR are distinct, since RR has full row rank. The rows of ZRZR are replications of rows of RR, and the ii-th row of ZRZR is identical to the kk-th row of RR if and only if Zik=1Z_{ik}=1. As a result, ZRZR and ZZ define the same partition of [n][n] based on row equivalence. The same conclusion holds for Z~R~\tilde{Z}\tilde{R} and Z~\tilde{Z}.

Thus, ZRZR and Z~R~\tilde{Z}\tilde{R} define the same partition of [n][n] if and only if ZZ and Z~\tilde{Z} define the same partition, and by Lemma 1, if and only if Z~=ZP\tilde{Z}=ZP for a permutation matrix PP. \Box

For a matrix QQ, we define its ii-th row and jj-th row to be proportionally equivalent if they are proportional to each other up to a positive scaling factor. Any matrix QQ defines a partition via this proportional equivalence.

Lemma 3.

The matrices ZZ, Z~\tilde{Z}, RR and R~\tilde{R} are defined as in Lemma 2. Θ\Theta and Θ~\tilde{\Theta} are two n×nn\times n positive diagonal matrices. The matrices ΘZR\Theta ZR and Θ~Z~R~\tilde{\Theta}\tilde{Z}\tilde{R} define the same partition of [n][n] based on row proportional equivalence if and only if Z~=ZP\tilde{Z}=ZP for a permutation matrix PP. Moreover, if {C1,,CK}\{C_{1},\dots,C_{K}\} is the partition defined by ΘZR\Theta ZR and Θ~Z~R~\tilde{\Theta}\tilde{Z}\tilde{R}, Θii1Θ~ii=Θjj1Θ~jj\Theta_{ii}^{-1}\tilde{\Theta}_{ii}=\Theta_{jj}^{-1}\tilde{\Theta}_{jj} whenever i,jCki,j\in C_{k}.

Proof of Lemma 3: No two rows of RR are proportional to each other since RR has full row rank. Therefore, the partition defined by ZRZR via row equivalence and the partition defined by ΘZR\Theta ZR via row proportional equivalence are the same, and the first conclusion follows Lemma 2. To show the second statement, we write

ZR=Θ1Θ~Z~R~,ZR=\Theta^{-1}\tilde{\Theta}\tilde{Z}\tilde{R},

where Θ1Θ~\Theta^{-1}\tilde{\Theta} is a positive diagonal matrix. It is straightforward to check when i,jCki,j\in C_{k}, the ii-th and jj-th rows of ZRZR are identical, and the ii-th and jj-th rows of Z~R~\tilde{Z}\tilde{R} are identical, which implies Θii1Θ~ii=Θjj1Θ~jj\Theta_{ii}^{-1}\tilde{\Theta}_{ii}=\Theta_{jj}^{-1}\tilde{\Theta}_{jj}. \Box

3 The Identifiability Issue on the DCSBM

3.1 A folklore

In the literature on model identifiability, the expected values of the diagonal entries of AA are typically assumed to follow the same form as the off-diagonal entries, i.e., 𝔼[A]=Δ=ΘZBZΘ\mathbb{E}[A]=\Delta=\Theta ZBZ^{\top}\Theta. Under this assumption, the derivations in Karrer and Newman, (2011) suggest that the parameters are identifiable up to KK degrees of freedom, corresponding to KK scaling factors. The following proposition, though rarely stated explicitly in the literature, has been employed to provide a resolution to the identifiability issue of the DCSBM.

Proposition 1.

Two parameter systems {Z,Θ,B}\{Z,\Theta,B\} and {Z~,Θ~,B~}\{\tilde{Z},\tilde{\Theta},\tilde{B}\} satisfy

ΘZBZΘ=Θ~Z~B~Z~Θ~=Δ.\Theta ZBZ^{\top}\Theta=\tilde{\Theta}\tilde{Z}\tilde{B}\tilde{Z}^{\top}\tilde{\Theta}=\Delta. (2)

if and only if Z~=ZP\tilde{Z}=ZP, B~=PDBDP\tilde{B}=P^{\top}DBDP, and Θ~=Θdiag(ZD11K)\tilde{\Theta}=\Theta\mathrm{diag}\left(ZD^{-1}1_{K}\right) where PP is a K×KK\times K permutation matrix, and DD is a K×KK\times K positive diagonal matrix.

In this proposition, Z~=ZP\tilde{Z}=ZP indicates the membership matrices in two parametrization systems are up to a permutation PP. When two systems use the same labels, i.e., P=IKP=I_{K}, they represent the same DCSBM if and only if B~=DBD\tilde{B}=DBD, and Θ~=Θdiag(ZD11K)\tilde{\Theta}=\Theta\mathrm{diag}\left(ZD^{-1}1_{K}\right) for a diagonal matrix DD containing KK scaling factors, one for each community.

Proof of Proposition 1: To show the “if” part, we calculate

Θ~Z~B~Z~Θ~\displaystyle\tilde{\Theta}\tilde{Z}\tilde{B}\tilde{Z}^{\top}\tilde{\Theta} =Θdiag(ZD11K)ZPPDBDPPZdiag(ZD11K)Θ\displaystyle=\Theta\mathrm{diag}\left(ZD^{-1}1_{K}\right)ZPP^{\top}DBDPP^{\top}Z^{\top}\mathrm{diag}\left(ZD^{-1}1_{K}\right)\Theta
=Θdiag(ZD11K)ZDBDZdiag(ZD11K)Θ.\displaystyle=\Theta\mathrm{diag}\left(ZD^{-1}1_{K}\right)ZDBDZ^{\top}\mathrm{diag}\left(ZD^{-1}1_{K}\right)\Theta.

To obtain equation (2), it suffices to show

diag(ZD11K)ZD=Z.\mathrm{diag}\left(ZD^{-1}1_{K}\right)ZD=Z.

We verify it by checking each entry. As both diag(ZD11K)\mathrm{diag}\left(ZD^{-1}1_{K}\right) and DD are diagonal matrices, the (i,k)(i,k) entry on the left is

[diag(ZD11K)ZD]ik=[diag(ZD11K)]iiZikDkk.[\mathrm{diag}\left(ZD^{-1}1_{K}\right)ZD]_{ik}=[\mathrm{diag}\left(ZD^{-1}1_{K}\right)]_{ii}Z_{ik}D_{kk}.

When zikz_{i}\neq k, both [diag(ZD11K)ZD]ik[\mathrm{diag}\left(ZD^{-1}1_{K}\right)ZD]_{ik} and ZikZ_{ik} are zero. When zi=kz_{i}=k, Zik=1Z_{ik}=1, it is easy to check [diag(ZD11K)]ii=Dkk1[\mathrm{diag}\left(ZD^{-1}1_{K}\right)]_{ii}=D_{kk}^{-1}, so

[diag(ZD11K)]iiZikDkk=Dkk1ZikDkk=Zik.[\mathrm{diag}\left(ZD^{-1}1_{K}\right)]_{ii}Z_{ik}D_{kk}=D_{kk}^{-1}Z_{ik}D_{kk}=Z_{ik}.

Now we show the “only if” part. First, we calculate the rank of Δ=ΘZBZΘ\Delta=\Theta ZBZ^{\top}\Theta as

rank(Δ)=rank(ZBZ)=rank(B)=K.\mathrm{rank}(\Delta)=\mathrm{rank}(ZBZ^{\top})=\mathrm{rank}(B)=K.

The first equal sign follows that Θ\Theta is a positive definite diagonal matrix. The second equal sign holds because the membership matrix ZZ contains K×KK\times K identity matrix IKI_{K} as a submatrix (up to a permutation).

Now consider the eigenvalue decomposition Δ=UΛU\Delta=U\Lambda U^{\top}, where Λ\Lambda is a K×KK\times K diagonal matrix contains all nonzero eigenvalues of Δ\Delta, UU is a n×Kn\times K matrix whose columns are eigenvectors corresponding to nonzero eigenvalues. By the eigenvalue decomposition, we can write

U=ΔUΛ1=ΘZBZΘUΛ1=ΘZR,U=\Delta U\Lambda^{-1}=\Theta ZBZ^{\top}\Theta U\Lambda^{-1}=\Theta ZR, (3)

where R=BZΘUΛ1R=BZ^{\top}\Theta U\Lambda^{-1} is a full rank K×KK\times K matrix. We conduct similar operations with {Z~,Θ~,B~}\{\tilde{Z},\tilde{\Theta},\tilde{B}\}, and express UU via both parameter system

U=ΘZR=Θ~Z~R~,U=\Theta ZR=\tilde{\Theta}\tilde{Z}\tilde{R}, (4)

where R~=B~Z~Θ~UΛ1\tilde{R}=\tilde{B}\tilde{Z}^{\top}\tilde{\Theta}U\Lambda^{-1}. (4) implies that ΘZR\Theta ZR and Θ~Z~R~\tilde{\Theta}\tilde{Z}\tilde{R} define the same partition via row proportional equivalence. By Lemma 3, we have Z~=ZP\tilde{Z}=ZP for a permutation matrix PP. In addition, we denote the partition by {C1,,CK}\{C_{1},\dots,C_{K}\} where Ck={i[n]zi=k}C_{k}=\{i\in[n]\mid z_{i}=k\}. Lemma 3 also implies that Θii1Θ~ii=Θjj1Θ~jj\Theta_{ii}^{-1}\tilde{\Theta}_{ii}=\Theta_{jj}^{-1}\tilde{\Theta}_{jj}, when i,jCki,j\in C_{k}.

Let DD be a K×KK\times K diagonal matrix, where Dkk=Θ~ii1ΘiiD_{kk}=\tilde{\Theta}_{ii}^{-1}\Theta_{ii} for any ii with zi=kz_{i}=k, which is well-defined by the previous paragraph. It can be verified directly that Θ1Θ~=diag(ZD11K)\Theta^{-1}\tilde{\Theta}=\mathrm{diag}\left(ZD^{-1}1_{K}\right), and diag(ZD11K)Z=ZD1\mathrm{diag}\left(ZD^{-1}1_{K}\right)Z=ZD^{-1}.

Note that we have shown Z~=ZP\tilde{Z}=ZP. Equation (2) with Z~=ZP\tilde{Z}=ZP leads to

ZBZ\displaystyle ZBZ^{\top} =Θ1Θ~Z~B~Z~Θ~Θ1\displaystyle=\Theta^{-1}\tilde{\Theta}\tilde{Z}\tilde{B}\tilde{Z}^{\top}\tilde{\Theta}\Theta^{-1}
=Θ1Θ~ZPB~PZΘ~Θ1\displaystyle=\Theta^{-1}\tilde{\Theta}ZP\tilde{B}P^{\top}Z^{\top}\tilde{\Theta}\Theta^{-1}
=diag(ZD11K)ZPB~PZdiag(ZD11K)\displaystyle=\mathrm{diag}\left(ZD^{-1}1_{K}\right)ZP\tilde{B}P^{\top}Z^{\top}\mathrm{diag}\left(ZD^{-1}1_{K}\right)
=ZD1PB~PD1Z,\displaystyle=ZD^{-1}P\tilde{B}P^{\top}D^{-1}Z^{\top},

which indicate B=D1PB~PD1B=D^{-1}P\tilde{B}P^{\top}D^{-1}, or equivalently B~=PDBDP\tilde{B}=P^{\top}DBDP. We have thus shown that if two parameter sets produce the same expected adjacency matrices, then Z~=ZP\tilde{Z}=ZP, B~=PDBDP\tilde{B}=P^{\top}DBDP, and Θ~=Θdiag(ZD11K)\tilde{\Theta}=\Theta\mathrm{diag}\left(ZD^{-1}1_{K}\right). \Box

Proposition 1 states that the equation (2) implies the community structure of the DCSBM is well-defined, only up to a label permutation. However, this result is insufficient to address the identifiability issue of the DCSBM, where the diagonal elements in Δ\Delta are always assumed to be zero.

Let \mathbb{P} denote the natural projection from the linear space of n×nn\times n matrices to the subspace of all n×nn\times n matrices with zero diagonals. In the DCSBM, it is typically assumed that, instead if (1), the expected adjacency matrix satisfies

𝔼[A]=(ΘZBZΘ).\mathbb{E}[A]=\mathbb{P}(\Theta ZBZ^{\top}\Theta).

In other words, two systems {Z,Θ,B}\{Z,\Theta,B\} and {Z~,Θ~,B~}\{\tilde{Z},\tilde{\Theta},\tilde{B}\} define the same DCSBM if and only if

(ΘZBZΘ)=(Θ~Z~B~Z~Θ~).\mathbb{P}(\Theta ZBZ^{\top}\Theta)=\mathbb{P}(\tilde{\Theta}\tilde{Z}\tilde{B}\tilde{Z}^{\top}\tilde{\Theta}). (5)

It is important to note that equation (5) imposes a weaker constraint than (2). Consequently, Proposition 1 does not suffice to fully resolve the identifiability issue in the DCSBM. The counterexample provided below further illustrates this point.

3.2 A counterexample

The following counter example shows that, without additional conditions, two parameter systems with completely different community structures may define the same DCSBM.

Consider two parameter sets with different community structures:

Θ=[200020002],Z\displaystyle\Theta=\begin{bmatrix}2&0&0\\ 0&2&0\\ 0&0&2\end{bmatrix},\qquad Z =[100101],B=140[2112];\displaystyle=\begin{bmatrix}1&0\\ 0&1\\ 0&1\end{bmatrix},\qquad B=\frac{1}{40}\begin{bmatrix}2&1\\ 1&2\end{bmatrix};
Θ~=[100020004],Z~\displaystyle\tilde{\Theta}=\begin{bmatrix}1&0&0\\ 0&2&0\\ 0&0&4\end{bmatrix},\qquad\tilde{Z} =[101001],B~=140[2112].\displaystyle=\begin{bmatrix}1&0\\ 1&0\\ 0&1\end{bmatrix},\qquad\tilde{B}=\frac{1}{40}\begin{bmatrix}2&1\\ 1&2\end{bmatrix}.

Then two matrices

ΘZBZΘ=[0.20.10.10.10.20.20.10.20.2] and Θ~Z~B~Z~Θ~=[0.050.10.10.10.20.20.10.20.8]\displaystyle\Theta ZBZ^{\top}\Theta=\begin{bmatrix}0.2&0.1&0.1\\ 0.1&0.2&0.2\\ 0.1&0.2&0.2\end{bmatrix}\text{ and }\tilde{\Theta}\tilde{Z}\tilde{B}\tilde{Z}^{\top}\tilde{\Theta}=\begin{bmatrix}0.05&0.1&0.1\\ 0.1&0.2&0.2\\ 0.1&0.2&0.8\end{bmatrix}

share the same off-diagonal components. In this case, even if we observe infinitely many copies of AA and find 𝔼(A)\mathbb{E}(A), there is no way to decide whether the model is generated by {Z,Θ,B}\{Z,\Theta,B\} or {Z~,Θ~,B~}\{\tilde{Z},\tilde{\Theta},\tilde{B}\}.

3.3 Identifiability of the DCSBM

The identifiability issue of the DCSBM can be resolved under a mild condition on the size of the smallest community.

Condition 1.

For a DCSBM parametrized by {Z,Θ,B}\{Z,\Theta,B\}, assume that each community has at least three members. That is, Z1n31KZ^{\top}1_{n}\geq 3\cdot 1_{K}, where the inequality holds entry-wisely.

Theorem 1.

Assume that both parameter systems {Z,Θ,B}\{Z,\Theta,B\} and {Z~,Θ~,B~}\{\tilde{Z},\tilde{\Theta},\tilde{B}\} satisfy condition 1. These parameters define the same model, i.e., (5) holds, if and only if Z~=ZP\tilde{Z}=ZP, B~=PDBDP\tilde{B}=P^{\top}DBDP, and Θ~=Θdiag(ZD11K)\tilde{\Theta}=\Theta\mathrm{diag}\left(ZD^{-1}1_{K}\right) where PP is a K×KK\times K permutation matrix and DD is a K×KK\times K positive diagonal matrix.

Proof of Theorem 1: The “if” part of the statement follows directly from Proposition 1.

For the “only if” portion, it suffices to show that equation (5) and Condition 1 imply that ΘZBZΘ=Θ~Z~B~Z~Θ~.\Theta ZBZ^{\top}\Theta=\tilde{\Theta}\tilde{Z}\tilde{B}\tilde{Z}^{\top}\tilde{\Theta}. Then the conclusion follows Proposition 1.

Note that equation (5) implies that there exists a diagonal matrix CC such that

ΘZBZΘ=Θ~Z~B~Z~Θ~+C,\Theta ZBZ^{\top}\Theta=\tilde{\Theta}\tilde{Z}\tilde{B}\tilde{Z}^{\top}\tilde{\Theta}+C,

which leads to

Θ1CΘ1=ZBZΘ1Θ~Z~B~Z~Θ~Θ1.\Theta^{-1}C\Theta^{-1}=ZBZ^{\top}-\Theta^{-1}\tilde{\Theta}\tilde{Z}\tilde{B}\tilde{Z}^{\top}\tilde{\Theta}\Theta^{-1}. (6)

Let eie_{i} denote the ii-th coordinate vector in n\mathbb{R}^{n}, defined as the vector with a 1 in the ii-th position and 0 elsewhere. Let ii and jj be two nodes with zi=zjz_{i}=z_{j}. Multiplying eieje_{i}-e_{j} on both sides of (6), we have

Θ1CΘ1(eiej)\displaystyle\Theta^{-1}C\Theta^{-1}(e_{i}-e_{j}) =(ZBZΘ1Θ~Z~B~Z~Θ~Θ1)(eiej)\displaystyle=\left(ZBZ^{\top}-\Theta^{-1}\tilde{\Theta}\tilde{Z}\tilde{B}\tilde{Z}^{\top}\tilde{\Theta}\Theta^{-1}\right)(e_{i}-e_{j})
=Θ1Θ~Z~B~Z~Θ~Θ1(eiej),\displaystyle=-\Theta^{-1}\tilde{\Theta}\tilde{Z}\tilde{B}\tilde{Z}^{\top}\tilde{\Theta}\Theta^{-1}(e_{i}-e_{j}),

because Z(eiej)=0Z^{\top}(e_{i}-e_{j})=0. The last equation further implies

Θ~1CΘ1(eiej)=Z~B~Z~Θ~Θ1(eiej)=Z~w,\tilde{\Theta}^{-1}C\Theta^{-1}(e_{i}-e_{j})=-\tilde{Z}\tilde{B}\tilde{Z}^{\top}\tilde{\Theta}\Theta^{-1}(e_{i}-e_{j})=\tilde{Z}w, (7)

where w=B~Z~Θ~Θ1(eiej)w=-\tilde{B}\tilde{Z}^{\top}\tilde{\Theta}\Theta^{-1}(e_{i}-e_{j}).

If ww has a nonzero component, say wl0w_{l}\neq 0, then u=Z~wu=\tilde{Z}w has at least 3 nonzero components by Condition 1 since us=wlu_{s}=w_{l} for all nodes ss with z~s=l\tilde{z}_{s}=l. In contrast, the left hand side of equation (7) has at most 22 nonzero components. Thus, equation (7) indicates w=0w=0, and hence Cii=Cjj=0C_{ii}=C_{jj}=0. We can repeat the argument with arbitrary ii and jj from the same community and conclude C=0C=0. \Box

4 Conclusion

We have clarified identifiability issues on the DCSBM. Our result implies that the community structure of the DCSBM is well-defined under a mild condition on the size of the smallest community. It is worth mentioning that all the theoretical results discussed in this paper rely solely on the mean structure of the model. Therefore, these results are applicable not only to the DCSBM with a Bernoulli distribution on links, but also to networks with a Poisson distribution on links, or even networks with more flexible continuous link weights.

Acknowledgment

This work was partially supported by the National Science Foundation.

References

  • Abbe, (2017) Abbe, E. (2017). Community detection and stochastic block models: recent developments. Journal of Machine Learning Research, 18(1):6446–6531.
  • Ball et al., (2011) Ball, B., Karrer, B., and Newman, M. E. (2011). Efficient and principled method for detecting communities in networks. Physical Review E, 84(3):036103.
  • Bickel and Chen, (2009) Bickel, P. J. and Chen, A. (2009). A nonparametric view of network models and Newman-Girvan and other modularities. Proceedings of the National Academy of Sciences, 106:21068–21073.
  • Choi et al., (2012) Choi, D. S., Wolfe, P. J., and Airoldi, E. M. (2012). Stochastic blockmodels with a growing number of classes. Biometrika, 99(2):273–284.
  • Erdős and Rényi, (1959) Erdős, P. and Rényi, A. (1959). On random graphs i. Publ. math. debrecen, 6(290-297):18.
  • Holland and Leinhardt, (1981) Holland, P. and Leinhardt, S. (1981). An exponential family of probability distributions for directed graphs:. J. of Amer. Statist Assoc., 76(373):62–65.
  • Karrer and Newman, (2011) Karrer, B. and Newman, M. E. (2011). Stochastic blockmodels and community structure in networks. Physical review E, 83(1):016107.
  • Nowicki and Snijders, (2001) Nowicki, K. and Snijders, T. A. B. (2001). Estimation and prediction for stochastic blockstructures. Journal of the American Statistical Association, 96(455):1077–1087.
  • Zhao, (2017) Zhao, Y. (2017). A survey on theoretical advances of community detection in networks. Wiley Interdisciplinary Reviews: Computational Statistics, 9(5).
  • Zhao et al., (2012) Zhao, Y., Levina, E., and Zhu, J. (2012). Consistency of community detection in networks under degree-corrected stochastic block models. The Annals of Statistics, 40(4):2266 – 2292.