Differentially private analysis of networks with covariates via a generalized $\beta$ -model

Ting Yan Department of Statistics, Central China Normal University, Wuhan, 430079, China. Emails: [email protected]

Abstract

How to achieve the tradeoff between privacy and utility is one of fundamental problems in private data analysis. In this paper, we give a rigourous differential privacy analysis of networks in the appearance of covariates via a generalized $\beta$ -model, which has an $n$ -dimensional degree parameter $\beta$ and a $p$ -dimensional homophily parameter $\gamma$ . Under $(k_{n},\epsilon_{n})$ -edge differential privacy, we use the popular Laplace mechanism to release the network statistics. The method of moments is used to estimate the unknown model parameters. We establish the conditions guaranteeing consistency of the differentially private estimators $\widehat{\beta}$ and $\widehat{\gamma}$ as the number of nodes $n$ goes to infinity, which reveal an interesting tradeoff between a privacy parameter and model parameters. The consistency is shown by applying a two-stage Newton’s method to obtain the upper bound of the error between $(\widehat{\beta},\widehat{\gamma})$ and its true value $(\beta,\gamma)$ in terms of the $\ell_{\infty}$ distance, which has a convergence rate of rough order $1/n^{1/2}$ for $\widehat{\beta}$ and $1/n$ for $\widehat{\gamma}$ , respectively. Further, we derive the asymptotic normalities of $\widehat{\beta}$ and $\widehat{\gamma}$ , whose asymptotic variances are the same as those of the non-private estimators under some conditions. Our paper sheds light on how to explore asymptotic theory under differential privacy in a principled manner; these principled methods should be applicable to a class of network models with covariates beyond the generalized $\beta$ -model. Numerical studies and a real data analysis demonstrate our theoretical findings.

Keywords: Asymptotic normality; Consistency; Covariate; Differential privacy; Network data

1 Introduction

Social network data may contain sensitive information about relationships between individuals (e.g., friendship, email exchange, sexual interaction) and even individuals themselves (e.g., respondents in sexual partner networks). Undoubtedly, it will expose individual’s privacy if these data are directly released to the public for various research purposes. Even if individuals are anonymized by removing identifications before being made public, it is still easy to attack by applying some de-anonymization techniques [e.g., Narayanan and Shmatikov, (2009)]. A randomized data releasing mechanism that injects random noises to the original data (i.e., input perturbation) or their aggregate statistics to queries (i.e., output perturbation), provides an alternative to protect data privacy. To rigorously restrict privacy leakage, Dwork et al., (2006) developed a privacy notation–differential privacy that requires that the output of a query does not change too much if we add/remove any single individual’s record to/from a database in randomized data releasing mechanisms. Since then, it has been widely accepted as a privacy standard for releasing sensitive data.

Many differentially private algorithms have been proposed to release network data or their aggregate statistics, especially in computer and machine learning literature [e.g., Day et al., (2016); Macwan and Patel, (2018); Nguyen et al., (2016); Wang et al., (2022)]. On the other hand, denoising approaches have been developed to improve the estimation of network statistics [e.g, Hay et al., (2009); Karwa and Slavković, (2016); Yan, (2021)]. However, differentially private inference in network models is still in its infancy. This is partly because network data are nonstandard and asymptotic analysis is usually based on only one observed network. The increasing dimension of parameters and the appearance of noises poses additional challenge as well [Fienberg, (2012)]. Recently, Karwa and Slavković, (2016) derived consistency and asymptotic normality of the differentially private estimator of the parameter constructed from the denoised degree sequence in the $\beta$ -model, which is an exponential random graph model with the degree sequence as the sufficient statistic [Chatterjee et al., (2011)]. Yan, (2021) derived asymptotic properties of the differentially private estimators in the $p_{0}$ model for directed networks. In despite of these recent developments, to the best of our knowledge, the differentially private analyses of networks in the presence of covariates have not been explored, in that neither their releasing methods nor their theoretical properties are well understood.

The covariates of nodes could have important implications on the link formation. A commonly existing phenomenon in social and econometric network data is that individuals tend to form connections with those like themselves, which is referred to as homophily. Therefore, it is of interest to see how covariates influence differentially private estimation in network models. The aim of this paper is to give a rigorously differentially private analysis of networks in the presence of covariates via a generalized $\beta$ -model. It contains an $n$ -dimensional degree parameter $\beta$ characterizing the variation in the node degrees and a $p$ -dimensional regression parameter $\gamma$ of covariates measuring phomophic or heteriphic effects. A detailed description is given in Section 2.1. This model had been introduced in Graham, (2017) to model economic networks and a directed version was proposed in Yan et al., (2019). In this model, the degree sequence $d$ and the covariate term $y$ are the sufficient statistics. Therefore, it is sufficient to treat only these information as privacy contents in this model. Under $(k_{n},\epsilon_{n})$ -edge differential privacy, we propose to use a joint Laplace mechanism to release the network statistics $d$ and $y$ , which are added the discrete Laplacian noises and continuous Laplacian noises, respectively.

We construct estimating equations to infer model parameters based on the original maximum likelihood equations, in which the original network statistics are directly replaced by their noisy outputs. We develop new approaches to establish asymptotic theory of differentially private estimators. Owning to noises having zero mean, they are the same as the moment equations. The main contributions are as follows. First, we establish the conditions on a privacy parameter and model parameters that guarantee consistency of the differentially private estimator, which control the trade-off between privacy and utility. A key idea for the proof is that we use a two-stage Newton’s method that first obtains the upper bound of the error in terms of $\ell_{\infty}$ norm between $\widehat{\beta}_{\gamma}$ and $\beta$ with a given $\gamma$ , and then derives the upper bound of the error between $\widehat{\gamma}$ and $\gamma$ by using a profiled function, where $\widehat{\beta}$ and $\widehat{\gamma}$ are the differentially private estimators of $\beta$ and $\gamma$ , respectively. As a result, we obtain the convergence rates of $\widehat{\beta}$ and $\widehat{\gamma}$ having respective orders of $O_{p}(n^{-1/2})$ and $O_{p}(n^{-1})$ roughly, both up to a logarithm factor. Notably, the convergence rate for $\widehat{\beta}$ matches the minimax optimal upper bound $\|\widehat{\beta}-\beta\|_{\infty}=O_{p}((\log p/n)^{1/2})$ for the Lasso estimator in the linear model with $p=n$ -dimensional parameter $\beta$ and the sample size $n$ in Lounici, (2008). Second, we derive the asymptotic normal distributions of $\widehat{\beta}$ and $\widehat{\gamma}$ . This is proved by applying Taylor’s expansions to a series of functions constructed from estimating equations and showing that various remainder terms in the expansions are asymptotically neglect. The convergence rate $1/n$ of $\widehat{\gamma}$ makes that the asymptotic distribution of $\widehat{\beta}$ does not depend on $\widehat{\gamma}$ and therefore has no bias. The asymptotic distribution of $\widehat{\gamma}$ of the homophily parameter $\gamma$ contains a bias term in terms of a weighted sum of covariates. Finally, we provide simulation studies as well as a real data analysis to illustrate the theoretical results.

We note that Karwa and Slavković, (2016) obtained asymptotic results of the edge-differentially private estimator based on the denoising process while our asymptotic results do not require the denoising process. Another important difference from Karwa and Slavković, (2016) is that we characterize how errors of estimators depend on the privacy parameter and we do not make the assumption that all parameters are bounded above by a constant in asymptotic theories.

For the rest of the paper, we proceed as follows. In Section 2, we give a necessary background on the generalized $\beta$ -model and differential privacy. In Section 3, we present the estimation. In Section 4, we present the consistency and asymptotic normality of the differentially private estimator. We carry out simulations and illustrate our results by a real data analysis in Section 5. We give the summary and further discussion in Section 6. The proofs of the main results are regelated into Section 7. The proofs of supported lemmas are given in the supplementary material.

2 Model and differential privacy

In this section, we introduce the generalized $\beta$ -model with covariates and present the necessary background for differential privacy.

2.1 Generalized $\beta$ -model

Let $G_{n}$ be an undirected graph on $n\geq 2$ nodes labeled by “ $1,\ldots,n$ ”. Let $A=(a_{ij})_{n\times n}$ be the adjacency matrix of $G_{n}$ , where $a_{ij}$ is an indicator denoting whether node $i$ is connected to node $j$ . That is, $a_{ij}=1$ if there is a link between $i$ and $j$ ; $a_{ij}=0$ otherwise. We do not consider self-loops here, i.e., $a_{ii}=0$ . Let $d_{i}=\sum_{j\neq i}a_{ij}$ be the degree of node $i$ and $d=(d_{1},\ldots,d_{n})^{\top}$ be the degree sequence of the graph $G_{n}$ . We also observe a $p$ -dimensional vector $z_{ij}$ , the covariate information attached to the edge between nodes $i$ and $j$ . The covariate $z_{ij}$ can be formed according to the similarity or dissimilarity between nodal attributes $z_{i}$ and $z_{j}$ for nodes $i$ and $j$ . Specifically, $z_{ij}$ can be represented through a symmetric function $g(\cdot,\cdot)$ with $z_{i}$ and $z_{j}$ as its arguments. As an example if $z_{i}$ is an indicator of genders (e.g., $1$ for male and $-1$ for female), then we could use $z_{ij}=z_{i}\cdot z_{j}$ to denote the similarity or dissimilarity measurement between $i$ and $j$ .

The $\beta$ -model with covariates [Graham, (2017); Yan et al., (2019)] assumes that the edge $a_{ij}$ between $i$ and $j$ conditional on the unobserved degree effects and observed covariates has the following probability:

\mathbb{P}(a_{ij}=a|\beta,z_{ij})=\frac{e^{(\beta_{i}+\beta_{j}+z_{ij}^{\top}\gamma)a}}{1+e^{\beta_{i}+\beta_{j}+z_{ij}^{\top}\gamma}},~{}~{}a\in\{0,1\},

(1)

independent of other edges. The parameter $\beta_{i}$ is the intrinsic individual effect that reflects the node heterogeneity to participate in network connection. The common parameter $\gamma$ is exogenous, measuring the homophily or heterophilic effect. A larger homophily component $z_{ij}^{\top}\gamma$ means a larger homophily effect. We will refer to $\gamma$ as the homophily parameter hereafter although it could represent heterophilic measurement. Hereafter, we call model (1) the covariate-adjusted $\beta$ -model.

The log-likelihood function is

\ell(\beta,\gamma)=\sum_{i=1}^{n}\beta_{i}d_{i}+\sum_{i<j}a_{ij}z_{ij}^{\top}\gamma-\sum_{i<j}\log\left(1+\frac{e^{\beta_{i}+\beta_{j}+z_{ij}^{\top}\gamma}}{1+e^{\beta_{i}+\beta_{j}+z_{ij}^{\top}\gamma}}\right).

(2)

The maximum likelihood equations are

\begin{array}[]{rcl}d_{i}&=&\sum_{j\neq i}\frac{e^{\beta_{i}+\beta_{j}+z_{ij}^{\top}\gamma}}{1+e^{\beta_{i}+\beta_{j}+z_{ij}^{\top}\gamma}},\\ \sum_{i<j}a_{ij}z_{ij}&=&\sum_{i<j}\frac{z_{ij}e^{\beta_{i}+\beta_{j}+z_{ij}^{\top}\gamma}}{1+e^{\beta_{i}+\beta_{j}+z_{ij}^{\top}\gamma}}.\end{array}

(3)

The R language provides a standard package “glm” to solve (3), which implements an iteratively reweighted least squares method for generalized linear models [McCullagh and Nelder, (1989)].

2.2 Differential privacy

Given an original database $D$ with records of $n$ persons, we consider a randomized data releasing mechanism $Q$ that takes $D$ as input and outputs a sanitized database $S=(S_{1},\ldots,S_{\ell})$ for public use. As an illustrated example, the additive noise mechanism returns the answer $f(D)+z$ to the query $f(D)$ , where $z$ is a random noise. Let $\epsilon$ be a positive real number and $\mathcal{S}$ denote the sample space of $Q$ . The data releasing mechanism $Q$ is $\epsilon$ -differentially private if for any two neighboring databases $D_{1}$ and $D_{2}$ that differ on a single element (i.e., the data of one person), and all measurable subsets $B$ of $\mathcal{S}$ [Dwork et al., (2006)],

Q(S\in B|D_{1})\leq e^{\epsilon}\times Q(S\in B|D_{2}).

This says the probability of an output $S$ given the input $D_{1}$ is less than that given the input $D_{2}$ multiplied by a privacy factor $e^{\epsilon}$ . The privacy parameter $\epsilon$ is chosen according to the privacy policy, which controls the trade-off between privacy and utility. It is generally public. Smaller value of $\epsilon$ means more privacy protection.

Differential privacy requires that the distribution of the output is almost the same whether or not an individual’s record appears in the original database. We illustrate why it protects privacy with an example. Suppose a hospital wants to release some statistics on the medical records of their patients to the public. In response, a patient may wish to make his record omitted from the study due to a privacy concern that the published results will reveal something about him personally. Differential privacy alleviates this concern because whether or not the patient participates in the study, the probability of a possible output is almost the same. From a theoretical point, any test statistic has nearly no power for testing whether an individual’s data is in the original database or not [Wasserman and Zhou, (2010)].

What is being protected in the differential privacy is precisely the difference between two neighboring databases. Within network data, depending on the definition of the graph neighbor, differential privacy is divided into $k$ -node differential privacy [Hay et al., (2009)] and $k$ -edge differential privacy [Nissim et al., (2007)]. Two graphs are called neighbors if they differ in exactly $k$ edges, then differential privacy is $k$ -edge differential privacy. The special case with $k=1$ is generally referred to as edge differential privacy. Analogously, we can define $k$ -node differential privacy by letting graphs be neighbors if one can be obtained from the other by removing $k$ nodes and its adjacent edges. Edge differential privacy protects edges not to be detected, whereas node differential privacy protects nodes together with their adjacent edges, which is a stronger privacy policy. However, it may be infeasible to design algorithms that are both node differential privacy and have good utility since it generally needs a large noise [e.g., Hay et al., (2009)]. Following Hay et al., (2009) and Karwa and Slavković, (2016), we use edge differential privacy here.

Let $\delta(G,G^{\prime})$ be the harming distance between two graphs $G$ and $G^{\prime}$ , i.e., the number of edges on which $G$ and $G^{\prime}$ differ. The formal definition of $(\epsilon,k)$ -edge differential privacy is as follows.

Definition 1 (Edge differential privacy).

Let $\epsilon>0$ be a privacy parameter. A randomized mechanism $Q(\cdot|G)$ is $(k,\epsilon)$ -edge differentially private if

\sup_{G,G^{\prime}\in\mathcal{G},\delta(G,G^{\prime})=k}\sup_{S\in\mathcal{S}}\frac{Q(S|G)}{Q(S|G^{\prime})}\leq e^{\epsilon},

where $\mathcal{G}$ is the set of all graphs of interest on $n$ nodes and $\mathcal{S}$ is the set of all possible outputs.

Let $f:\mathcal{G}\rightarrow\mathbb{R}^{\ell}$ be a function. The global sensitivity [Dwork et al., (2006)] of the function $f$ , denoted $\Delta f$ , is defined below.

Definition 2.

(Global Sensitivity). Let $f:\mathcal{G}\to\mathbb{R}^{\ell}$ . The global sensitivity of $f$ is defined as

\Delta(f)=\max_{\delta(G,G^{\prime})=k}\|f(G)-f(G^{\prime})\|_{1}

where $\|\cdot\|_{1}$ is the $L_{1}$ norm.

The global sensitivity measures the largest change for the query function $f$ in terms of the $L_{1}$ -norm between any two neighboring graphs. The magnitude of noises added in the differentially private algorithm $Q$ crucially depends on the global sensitivity. If the outputs are the network statistics, then a simple algorithm to guarantee edge differential privacy is the Laplace Mechanism [e.g., Dwork et al., (2006)] that adds the Laplacian noise proportional to the global sensitivity of $f$ .

Lemma 1.

(Laplace Mechanism). Suppose that $f:\mathcal{G}\to\mathbb{R}^{\ell}$ is a output function in $\mathcal{G}$ . Let $z_{1},\ldots,z_{\ell}$ be independently and identically distributed Laplace random variables with density function $e^{-|z|/\lambda}/(2\lambda)$ . Then the Laplace mechanism outputs $f(G)+(z_{1},\ldots,z_{\ell})$ is $(\epsilon,k)$ -edge differentially private, where $\epsilon=\Delta(f)/\lambda$ .

When $f(G)$ is integer, one can use a discrete Laplace random variable as the noise, where it has the probability mass function:

\mathbb{P}(X=x)=\frac{1-\lambda}{1+\lambda}\lambda^{|x|},~{}~{}x\in\{0,\pm 1,\ldots\},\lambda\in(0,1).

(4)

Lemma 1 still holds if the continuous Laplace distribution is replaced by the discrete version and the privacy parameter is chosen by $\epsilon=-\Delta(f)\log\lambda$ ; see Karwa and Slavković, (2016).

We introduce a nice property on differential privacy: any function of a differentially private mechanism is also differentially private, as stated in the lemma below.

Lemma 2 (Dwork et al., (2006)).

Let $f$ be an output of an $\epsilon$ -differentially private mechanism and $g$ be any function. Then $g(f(G))$ is also $\epsilon$ -differentially private.

3 Releasing network statistics and estimation

3.1 Releasing

From the log-likelihood function (2), we know that $(d,\sum_{i<j}a_{ij}z_{ij})$ is the sufficient statistic. Thus, the private information in the covariates-adjusted $\beta$ –model is $(d,\sum_{i<j}a_{ij}z_{ij})$ . We use the continuous Laplace mechanism in Lemma 1 and its discrete version to release the covariate statistic $\sum_{i<j}z_{ij}a_{ij}$ and the degree sequence $d$ under $(\epsilon_{n}/2,k_{n})$ -edge differential privacy, respectively. The joint mechanism satisfies $(\epsilon_{n},k_{n})$ -edge differential privacy. The subscript $n$ means that $k_{n}$ and $\epsilon_{n}$ are allowed to depend on $n$ . If we add $k_{1}$ or remove $k_{2}=k_{n}-k_{1}$ edges in $G_{n}$ and denote the induced graph as $G_{n}^{\prime}$ , then

\|d-d^{\prime}\|_{1}=2k_{n},~{}~{}\|\sum_{i<j}(a_{ij}-a^{\prime}_{ij})z_{ij}\|_{1}\leq pk_{n}z_{*}

where $d^{\prime}$ is the degree sequence of $G_{n}^{\prime}$ , $a^{\prime}_{ij}$ is the value of edge $(i,j)$ in $G_{n}^{\prime}$ and $z_{*}=\max_{ijk}|z_{ijk}|$ . So the global sensitivity is $2k_{n}$ for $d$ and $pk_{n}z_{*}$ for $\sum_{i<j}a_{ij}z_{ij})$ . We release the sufficient statistics $d$ and $y:=\sum_{i<j}a_{ij}z_{ij}$ as follows:

\displaystyle\begin{array}[]{rcl}\tilde{d}_{i}&=&d_{i}+\xi_{i},~{}~{}i=1,\ldots,n,\\ \tilde{y}_{t}&=&\sum_{i<j}z_{ijt}a_{ij}+\eta_{t},~{}~{}t=1,\ldots,p,\end{array}

(7)

where $\xi_{i}$ , $i=1,\ldots,n$ , are independently generated from the discrete Laplace distribution with $\lambda_{n1}=e^{-\epsilon_{n}/(4k_{n})}$ , and $\eta_{t}$ , $t=1,\ldots,p$ , are independently generated from the Laplace distribution with $\lambda_{n2}=2pk_{n}z_{*}/\epsilon_{n}$ .

3.2 Estimation

Write $\mu(x)=e^{x}/(1+e^{x})$ . Define

\pi_{ij}:=z_{ij}^{\top}\gamma+\beta_{i}+\beta_{j}.

(8)

It is clear that $\mu(\pi_{ij})$ is the expectation of $a_{ij}$ . When we emphasize the arguments $\beta$ and $\gamma$ in $\mu(\cdot)$ , we write $\mu_{ij}(\beta,\gamma)$ instead of $\mu(\pi_{ij})$ . To estimate model parameters, we directly replace $d$ and $y$ in maximum likelihood equations (3) with their noisy observed values $\tilde{d}$ and $\tilde{y}$ :

\begin{array}[]{rcl}\tilde{d}_{i}&=&\sum_{j\neq i}\mu_{ij}(\beta,\gamma),~{}~{}i=1,\ldots,n,\\ \tilde{y}&=&\sum_{i=1}^{n}\sum_{j=1,j<i}^{n}z_{ij}\mu_{ij}(\beta,\gamma).\end{array}

(9)

Because the expectations of the noises are zero, the above equation are the same as the moment equations.

Let $(\widehat{\beta},\widehat{\gamma})$ be the solution to the equations (9). Since $(\tilde{d},\tilde{y})$ satisfies $(\epsilon_{n},k_{n})$ -edge differential privacy, $(\widehat{\beta},\widehat{\gamma})$ is also $(\epsilon_{n},k_{n})$ -edge differentially private according to Lemma 2. A two-step iterative algorithm by alternating between solving the first equation in (9) via the fixed point method in Chatterjee et al., (2011) for a given $\gamma$ and solving the second equation in (9) via the Newton method or the gradient descent method, can be employed to obtain the solution.

4 Asymptotic properties

In this section, we present consistency and asymptotic normality of the differentially private estimator $(\widehat{\beta},\widehat{\gamma})$ . We first introduce some notations. For a subset $C\subset\mathbb{R}^{n}$ , let $C^{0}$ and $\overline{C}$ denote the interior and closure of $C$ , respectively. For a vector $x=(x_{1},\ldots,x_{n})^{\top}\in\mathbb{R}^{n}$ , denote by $\|x\|$ for a general norm on vectors with the special cases $\|x\|_{\infty}=\max_{1\leq i\leq n}|x_{i}|$ and $\|x\|_{1}=\sum_{i}|x_{i}|$ for the $\ell_{\infty}$ - and $\ell_{1}$ -norm of $x$ respectively. Let $B(x,\epsilon)=\{y:\|x-y\|_{\infty}\leq\epsilon\}$ be an $\epsilon$ -neighborhood of $x$ . For an $n\times n$ matrix $J=(J_{ij})$ , $\|J\|_{\infty}$ denotes the matrix norm induced by the $\ell_{\infty}$ -norm on vectors in $\mathbb{R}^{n}$ , i.e.,

\|J\|_{\infty}=\max_{x\neq 0}\frac{\|Jx\|_{\infty}}{\|x\|_{\infty}}=\max_{1\leq i\leq n}\sum_{j=1}^{n}|J_{ij}|,

and $\|J\|$ be a general matrix norm. Define the matrix maximum norm: $\|J\|_{\max}=\max_{i,j}|J_{ij}|$ . We use the superscript “*” to denote the true parameter under which the data are generated. When there is no ambiguity, we omit the superscript “*”. Define

z_{*}:=\max_{i,j}\|z_{ij}\|_{\infty}.

The notation $\sum_{j<i}$ is a shorthand for $\sum_{i=1}^{n}\sum_{j=1,j<i}^{n}$ .

Recall that $\mu(x)=e^{x}/(1+e^{x})$ . Write $\mu^{\prime}$ , $\mu^{\prime\prime}$ and $\mu^{\prime\prime\prime}$ as the first, second and third derivative of $\mu(x)$ on $x$ , respectively. A direct calculation gives that

\mu^{\prime}(x)=\frac{e^{x}}{(1+e^{x})^{2}},~{}~{}\mu^{\prime\prime}(x)=\frac{e^{x}(1-e^{x})}{(1+e^{x})^{3}},~{}~{}\mu^{\prime\prime\prime}(x)=\frac{e^{x}(1-4e^{x}+e^{2x})}{(1+e^{x})^{4}}.

It is easily checked that

|\mu^{\prime}(x)|\leq\frac{1}{4},~{}~{}|\mu^{\prime\prime}(x)|\leq\frac{1}{4},~{}~{}|\mu^{\prime\prime\prime}(x)|\leq\frac{1}{4}.

(10)

Let $\epsilon_{n1}$ and $\epsilon_{n2}$ be two small positive numbers. Note that $f(x)=e^{x}(1+e^{x})^{-2}$ is a decreasing function of $x$ when $x\geq 0$ and $f(x)=f(-x)$ . Recall that $\pi_{ij}=\beta_{i}+\beta_{j}+z_{ij}^{\top}\gamma$ . Define

b_{n}:=\sup_{\beta\in B(\beta^{*},\epsilon_{n1}),\gamma\in B(\gamma^{*},\epsilon_{n2})}\max_{i,j}\frac{(1+e^{\pi_{ij}})^{2}}{e^{\pi_{ij}}}=O(e^{2\|\beta^{*}\|_{\infty}+\|\gamma^{*}\|_{\infty}}).

(11)

In other words, we have

\inf_{\beta\in B(\beta^{*},\epsilon_{n1}),\gamma\in B(\gamma^{*},\epsilon_{n2})}\min_{i,j}\frac{e^{\pi_{ij}}}{(1+e^{\pi_{ij}})^{2}}\geq\frac{1}{b_{n}}.

Note that $b_{n}\geq 1/4$ . When causing no confusion, we will simply write $\mu_{ij}$ stead of $\mu_{ij}(\beta,\gamma)$ for shorthand. We will use the notations $\mu(\pi_{ij})$ and $\mu_{ij}(\beta,\gamma)$ interchangeably. Hereafter, we assume that the dimension $p$ of $z_{ij}$ is fixed.

4.1 Consistency

To derive consistency of the differentially private estimator, let us first define a system of functions based on the estimating equations (9). Define

F_{i}(\beta,\gamma)=\sum\limits_{j=1,j\neq i}^{n}\mu_{ij}(\beta,\gamma)-\tilde{d}_{i},~{}~{}i=1,\ldots,n,

(12)

and $F(\beta,\gamma)=(F_{1}(\beta,\gamma),\ldots,F_{n}(\beta,\gamma))^{\top}$ . Further, we define $F_{\gamma,i}(\beta)$ as the value of $F_{i}(\beta,\gamma)$ for an arbitrarily fixed $\gamma$ and $F_{\gamma}(\beta)=(F_{\gamma,1}(\beta),\ldots,F_{\gamma,n}(\beta))^{\top}$ . Let $\widehat{\beta}_{\gamma}$ be a solution to $F_{\gamma}(\beta)=0$ . Correspondingly, we define two functions for exploring the asymptotic behaviors of the estimator of the homophily parameter:

	$\displaystyle Q(\beta,\gamma)=\sum_{j<i}z_{ij}\mu_{ij}(\beta,\gamma)-\tilde{y},$		(13)
	$\displaystyle Q_{c}(\gamma)=\sum_{j<i}z_{ij}\mu_{ij}(\widehat{\beta}_{\gamma},\gamma)-\tilde{y}.$		(14)

$Q_{c}(\gamma)$ could be viewed as the profile function of $Q(\beta,\gamma)$ in which the degree parameter $\beta$ is profiled out. It is clear that

F(\widehat{\beta},\widehat{\gamma})=0,~{}~{}F_{\gamma}(\widehat{\beta}_{\gamma})=0,~{}~{}Q(\widehat{\beta},\widehat{\gamma})=0,~{}~{}Q_{c}(\widehat{\gamma})=0.

By the compound function derivation law, we have

	$\displaystyle 0=\frac{\partial F_{\gamma}(\widehat{\beta}_{\gamma})}{\partial\gamma^{\top}}=\frac{\partial F(\widehat{\beta}_{\gamma},\gamma)}{\partial\beta^{\top}}\frac{\partial\widehat{\beta}_{\gamma}}{\gamma^{\top}}+\frac{\partial F(\widehat{\beta}_{\gamma},\gamma)}{\partial\gamma^{\top}},$		(15)
	$\displaystyle\frac{\partial Q_{c}(\gamma)}{\partial\gamma^{\top}}=\frac{\partial Q(\widehat{\beta}_{\gamma},\gamma)}{\partial\beta^{\top}}\frac{\partial\widehat{\beta}_{\gamma}}{\gamma^{\top}}+\frac{\partial Q(\widehat{\beta}_{\gamma},\gamma)}{\partial\gamma^{\top}}.$		(16)

By solving $\partial\widehat{\beta}_{\gamma}/\partial\gamma^{\top}$ in (15) and substituting it into (16), we get the Jacobian matrix $Q_{c}^{\prime}(\gamma)$ $(=\partial Q_{c}(\gamma)/\partial\gamma)$ :

\displaystyle\frac{\partial Q_{c}(\gamma)}{\partial\gamma^{\top}}=\frac{\partial Q(\widehat{\beta}_{\gamma},\gamma)}{\partial\gamma^{\top}}-\frac{\partial Q(\widehat{\beta}_{\gamma},\gamma)}{\partial\beta^{\top}}\left[\frac{\partial F(\widehat{\beta}_{\gamma},\gamma)}{\partial\beta^{\top}}\right]^{-1}\frac{\partial F(\widehat{\beta}_{\gamma},\gamma)}{\partial\gamma^{\top}},

(17)

where

\frac{\partial Q(\widehat{\beta}_{\gamma},\gamma)}{\partial\gamma^{\top}}:=\frac{\partial Q(\beta,\gamma)}{\partial\gamma^{\top}}{\Big{|}}_{\beta=\widehat{\beta}_{\gamma},\gamma=\gamma},~{}~{}\frac{\partial F(\widehat{\beta}_{\gamma},\gamma)}{\partial\beta^{\top}}:=\frac{\partial F(\beta,\gamma)}{\partial\beta^{\top}}{\Big{|}}_{\beta=\widehat{\beta}_{\gamma},\gamma=\gamma}.

The asymptotic behavior of $\widehat{\gamma}$ crucially depends on the Jacobian matrix $Q_{c}^{\prime}(\gamma)$ . Since $\widehat{\beta}_{\gamma}$ does not have a closed form, conditions that are directly imposed on $Q_{c}^{\prime}(\gamma)$ are not easily checked. To derive feasible conditions, we define

H(\beta,\gamma)=\frac{\partial Q(\beta,\gamma)}{\partial\gamma}-\frac{\partial Q(\beta,\gamma)}{\partial\beta}\left[\frac{\partial F(\beta,\gamma)}{\partial\beta}\right]^{-1}\frac{\partial F(\beta,\gamma)}{\partial\gamma},

(18)

which is a general form of $\partial Q_{c}(\gamma)/\partial\gamma$ . Note that $H(\widehat{\beta}_{\gamma},\gamma)$ is the Fisher information matrix of the concentrated likelihood function $\ell_{c}(\gamma)$ , where the degree parameter $\beta$ is profiled out. When $\beta\in B(\beta^{*},\epsilon_{n1})$ and $b_{n}^{2}\kappa_{n}^{2}\epsilon_{n1}=o(1)$ , we have the approximation:

\frac{1}{n^{2}}H(\beta,\gamma^{*})=\frac{1}{n^{2}}H(\beta^{*},\gamma^{*})+o(1),

whose proof is given in the supplementary material. We assume that there exists a number $\rho_{n}$ such that

\sup_{\beta\in B(\beta^{*},\epsilon_{n1})}\|H^{-1}(\beta,\gamma^{*})\|_{\infty}\leq\frac{\rho_{n}}{n^{2}}.

Note that the dimension of $H(\beta,\gamma)$ is fixed and every its entry is a sum $n(n-1)/2$ of terms. If $n^{-2}H(\beta,\gamma^{*})$ converges to a constant matrix, then $\rho_{n}$ is bounded. Moreover, if $n^{-2}H(\beta,\gamma^{*})$ is positively definite, then

\rho_{n}=\sqrt{p}\times\sup_{\beta\in B(\beta^{*},\epsilon_{n1})}1/\lambda_{\min}(\beta),

where $\lambda_{\min}(\beta)$ is the smallest eigenvalue of $n^{-2}H(\beta,\gamma^{*})$ .

We use a two-stage Newton iterative sequence to show consistency. In the first stage, we obtain an upper bound of the error between $\widehat{\beta}_{\gamma}$ and $\beta$ in terms of the $\ell_{\infty}$ norm for a given $\gamma$ . This is done by verifying the well-known Newton-Kantororich conditions, under which the optimal error bounds are established. Then we derive the upper bound of the error between $\widehat{\gamma}$ and $\gamma$ by using a profiled function $Q_{c}(\gamma)$ constructed from estimating equations. Now we formally state the consistency result.

Theorem 1.

Let $\tilde{\epsilon}_{n}=1+(4k_{n}/\epsilon_{n})(\log n/n)^{1/2}$ and $\tau_{n}=b_{n}^{3}+(k_{n}/\epsilon_{n})b_{n}^{3}+z_{*}/(\log n)^{1/2}$ . If

b_{n}^{2}\tilde{\epsilon}_{n}=o\left(\sqrt{\frac{n}{\log n}}\right),~{}~{}b_{n}^{3}\rho_{n}^{2}\tau_{n}z_{*}^{2}=o\left(\frac{n}{\log n}\right),

then the differentially private estimator $(\widehat{\beta},\widehat{\gamma})$ exists with probability approaching one and is consistent in the sense that

	$\displaystyle\\|\widehat{\gamma}-\gamma^{*}\\|_{\infty}$	$\displaystyle=O_{p}\left(\frac{\rho_{n}\tau_{n}\log n}{n}\right)=o_{p}(1),$
	$\displaystyle\\|\widehat{\beta}-\beta^{*}\\|_{\infty}$	$\displaystyle=O_{p}\left(b_{n}\tilde{\epsilon}_{n}\sqrt{\frac{\log n}{n}}\right)=o_{p}(1).$

The scalar factor $\tau_{n}$ appears due to that the magnitude of $\|Q_{c}(\gamma^{*})\|_{\infty}$ is $O(\tau_{n}n\log n)$ . Note that the error bound of the parameter estimator in Theorem 3 of Karwa and Slavković, (2016) does not depend on the privacy parameter. Our result here characterizes how the error bound varies on $\epsilon_{n}$ . In view of the above theorem, we present the consistency conditions and the error bounds under two special cases. The first case is that the parameters and covariates are bounded. The second is that $k_{n}/\epsilon_{n}$ goes to zero.

Corollary 1.

Assume that $\|\beta^{*}\|_{\infty}$ and $\|\gamma^{*}\|_{\infty}$ and $z_{*}$ are bounded above by a constant. If $k_{n}/\epsilon_{n}=o(n/\log n)$ , then

\|\widehat{\gamma}-\gamma^{*}\|_{\infty}=O_{p}\left(\frac{k_{n}}{\epsilon_{n}}\cdot\frac{\log n}{n}\right),~{}~{}\|\widehat{\beta}-\beta^{*}\|_{\infty}=O_{p}\left(\sqrt{\frac{\log n}{n}}\right).

Corollary 2.

Assume that $k_{n}/\epsilon_{n}\to 0$ . If $b_{n}=o((n/\log n)^{1/4})$ , $\rho_{n}b_{n}^{3}=o(n^{1/2}/\log n)$ and $z_{*}=o((\log n)^{1/2})$ , then

\|\widehat{\gamma}-\gamma^{*}\|_{\infty}=O_{p}\left(\frac{\rho_{n}b_{n}^{3}\log n}{n}\right),~{}~{}\|\widehat{\beta}-\beta^{*}\|_{\infty}=O_{p}\left(b_{n}\sqrt{\frac{\log n}{n}}\right).

Remark 1.

The condition $k_{n}/\epsilon_{n}\to 0$ means that the outputs are nearly the same as the original network statistics. From Corollary 2, we can see that the MLE of $\gamma$ has a convergence rate of order $\log n/n$ .

4.2 Asymptotic normality of $\widehat{\beta}$

The asymptotic distribution of $\widehat{\beta}$ depends crucially on the inverse of the Fisher information matrix $V$ of $\beta$ . Given $m,M>0$ , we say an $n\times n$ matrix $V=(v_{ij})$ belongs to the matrix class $\mathcal{L}_{n}(m,M)$ if $V$ is a diagonally balanced matrix with positive elements bounded by $m$ and $M$ , i.e.,

\begin{array}[]{l}v_{ii}=\sum_{j=1,j\neq i}^{n}v_{ij},~{}~{}i=1,\ldots,n,\\ 0<m\leq v_{ij}\leq M,~{}~{}i,j=1,\ldots,n;i\neq j.\end{array}

Clearly, $F^{\prime}_{\gamma}(\beta)$ belongs to the matrix class $\mathcal{L}(b_{n}^{-1},1/4)$ when $\beta\in B(\beta^{*},\epsilon_{n1})$ and $\gamma\in B(\gamma^{*},\epsilon_{n2})$ . We will obtain the asymptotic distribution of the estimator $\widehat{\beta}$ through obtaining its asymptotic expression, which depends on the inverse of $F^{\prime}_{\gamma}(\beta)$ . However, its inverse does not have a closed form. Yan et al., (2015) proposed to approximate the inverse $V^{-1}$ of $V$ by a diagonal matrix

S=\mathrm{diag}(1/v_{11},\ldots,1/v_{nn}),

(19)

and obtained the upper bound of the approximate error.

Note that $v_{ii}=\sum_{j\neq i}\mathrm{Var}(a_{ij})$ . By the central limit theorem for the bounded case, as in Loéve, (1977, p.289), if $b_{n}=o(n^{1/2})$ , then $v_{ii}^{-1/2}\{d_{i}-\mathbb{E}(d_{i})\}$ converges in distribution to the standard normal distribution. When considering the asymptotic behaviors of the vector $(d_{1},\ldots,d_{r})$ with a fixed $r$ , one could replace the degrees $d_{1},\ldots,d_{r}$ by the independent random variables $\tilde{d}_{i}=d_{i,r+1}+\ldots+d_{in}$ , $i=1,\ldots,r$ . Therefore, we have the following proposition.

Proposition 1.

If $b_{n}=o(n^{1/2})$ , then as $n\to\infty$ :
(1)For any fixed $r\geq 1$ , the components of $(d_{1}-\mathbb{E}(d_{1}),\ldots,d_{r}-\mathbb{E}(d_{r}))$ are asymptotically independent and normally distributed with variances $v_{11},\ldots,v_{rr}$ , respectively.
(2)More generally, $\sum_{i=1}^{n}c_{i}(d_{i}-\mathbb{E}(d_{i}))/\sqrt{v_{ii}}$ is asymptotically normally distributed with mean zero and variance $\sum_{i=1}^{\infty}c_{i}^{2}$ whenever $c_{1},c_{2},\ldots$ are fixed constants and the latter sum is finite.

Part (2) follows from part (1) and the fact that

\lim_{r\to\infty}\limsup_{t\to\infty}\mathrm{Var}\left(\sum_{k=r+1}^{n}c_{i}\frac{d_{i}-\mathbb{E}(d_{i})}{\sqrt{v_{ii}}}\right)=0

by Theorem 4.2 of Billingsley, (1995). To prove the above equation, it suffices to show that the eigenvalues of the covariance matrix of $(d_{i}-\mathbb{E}(d_{i}))/v_{ii}^{1/2}$ , $i=r+1,\ldots,n$ are bounded by 2 (for all $r<n$ ). This is true by the well-known Perron-Frobenius theory: if $A$ is a symmetric matrix with nonnegative elements, then its largest eigenvalue is less than the largest value of row sums.

We apply a second order Taylor expansion to $\sum_{j\neq i}\mu_{ij}(\widehat{\beta},\widehat{\gamma})$ to derive the asymptotic expression of $\widehat{\beta}$ . In the expansion, the first order term is the sum of $V(\widehat{\beta}-\beta)$ and $V_{\gamma\beta}(\widehat{\gamma}-\gamma)$ , where $V_{\gamma\beta}=\partial F(\beta,\gamma)/\partial\gamma^{\top}$ . Since $V^{-1}$ does not have a closed form, we work with $S$ defined at (19) to approximate it. By Theorem 1, $\widehat{\gamma}$ has a $n^{-1}$ convergence rate up to a logarithm factor. This makes that the term $V_{\gamma\beta}(\widehat{\gamma}-\gamma)$ is an remainder term. The second order term in the expansion is also asymptotically neglect. Then we represent $\widehat{\theta}-\theta$ as the sum of $S(d-\mathbb{E}d)$ and a remainder term. The central limit theorem is proved by establishing the asymptotic normality of $S(d-\mathbb{E}d)$ and showing the remainder is negligible. We formally state the central limit theorem as follows.

Theorem 2.

Assume that the conditions in Theorem 1 hold. If $b_{n}^{3}\tilde{\varepsilon}_{n}^{2}+z_{*}\rho_{n}\tau_{n}b_{n}=o(n^{1/2}/\log n)$ , then for fixed $k$ the vector $(v_{11}^{1/2}(\widehat{\beta}_{1}-\beta^{*}),\ldots,v_{kk}^{1/2}(\widehat{\beta}_{k}-\beta^{*}_{k}))$ converges in distribution to the $k$ -dimensional multivariate standard normal distribution.

Remark 2.

The asymptotic variance of $\widehat{\beta}_{i}$ is $v_{ii}^{-1/2}$ lying between $O(b_{n}/n)$ and $O(1/n)$ , which is the same as in the non-private estimator.

4.3 Asymptotic normality of $\widehat{\gamma}$

Let $T_{ij}$ be an $n$ -dimensional column vector with $i$ th and $j$ th elements ones and other elements zeros. Define

\begin{array}[]{c}V=\frac{\partial F(\beta^{*},\gamma^{*})}{\partial\beta^{\top}},~{}~{}V_{\gamma\beta}=\frac{\partial Q(\beta^{*},\gamma^{*})}{\partial\beta^{\top}},\\ s_{ij}(\beta,\gamma)=(a_{ij}-\mathbb{E}a_{ij})(z_{ij}-V_{\gamma\beta}V^{-1}T_{ij}).\end{array}

Note that $s_{ij}(\beta,\gamma)$ , $i<j$ , are independent vectors. By direct calculations,

(V_{\gamma\beta})_{kj}=\frac{\partial Q_{k}(\beta^{*},\gamma^{*})}{\partial\beta_{j}}=\sum_{i\neq j}z_{ijk}\mu^{\prime}(\pi^{*}_{ij}),

such that

\|V_{\gamma\beta}\|_{\infty}\leq\frac{z_{*}}{4}(n-1).

By Lemma 4, we have

\|V_{\gamma\beta}V^{-1}\|_{\infty}\leq\|V_{\gamma\beta}\|_{\infty}\|V^{-1}\|_{\infty}\leq\frac{z_{*}(n-1)}{4}\cdot\frac{b_{n}(3n-4)}{2(n-1)(n-2)}<b_{n}z_{*}.

Therefore, all $s_{ij}(\beta^{*},\gamma^{*})$ are bounded. Let $\bar{Q}=Q(\beta^{*},\gamma^{*})-\eta$ and $\bar{F}=F(\beta^{*},\gamma^{*})-\xi$ . Note that

\mathrm{Cov}(\sum\nolimits_{i<j}s_{ij}(\beta^{*},\gamma^{*}))=\mathrm{Cov}(\bar{Q}-V_{\beta\gamma}^{\top}V^{-1}\bar{F})=H(\beta^{*},\gamma^{*}),

where $H(\beta,\gamma)$ is defined at (18). By the central limit theorem for the bounded case, as in Loéve, (1977, p.289), we have the following proposition.

Proposition 2.

For any nonzero fixed vector $c=(c_{1},\ldots c_{p})^{\top}$ , if $(c^{\top}H(\beta^{*},\gamma^{*})c)$ diverges, then
$(c^{\top}H(\beta^{*},\gamma^{*})c)^{-1/2}c^{\top}\sum_{i<j}\tilde{s}_{ij}(\beta^{*},\gamma^{*})$ converges in distribution to the standard normal distribution.

Let $N=n(n-1)/2$ and

\bar{H}=\lim_{n\to\infty}\frac{1}{N}H(\beta^{*},\gamma^{*}).

We assume that the above limit exists. We briefly describe the idea of proving asymptotic normality of $\widehat{\gamma}$ . We use a mean-value expansion to derive the explicit expression of $\widehat{\gamma}-\gamma^{*}$ , which mainly contains a term $Q_{c}(\gamma^{*})$ multiplied by $\bar{H}^{-1}$ . Then by applying a third-order Taylor expansion to $Q_{c}(\gamma^{*})$ , we find that the first order term is asymptotically normal, the second is the asymptotic bias term and the third is a remainder term. The asymptotic distribution of $\widehat{\gamma}$ is stated below.

Theorem 3.

Assume that the conditions in Theorem 1 hold. If $b_{n}^{4}\tilde{\epsilon}_{n}^{3}z_{*}=o(n^{1/2}/(\log n)^{3/2})$ , then as $n$ goes to infinity, $\sqrt{N}c^{\top}(\widehat{\gamma}-\gamma)$ converges in distribution to the normal distribution with mean $-c^{\top}\bar{H}^{-1}B_{*}$ and variance $c^{\top}\bar{H}c$ , where

B_{*}=\frac{1}{\sqrt{N}}\sum_{k=1}^{n}\frac{\sum_{j\neq k}z_{kj}\mu^{\prime\prime}(\pi_{kj}^{*})}{\sum_{j\neq k}\mu^{\prime}(\pi_{kj}^{*})}.

(20)

Remark 3.

We now discuss the bias term $B_{*}$ in (20). We assume that $z_{ij}$ is centered and independently drawn from a $p$ -dimensional multivariate distribution with bounded supports. Then, by Hoeffding,’s (1963) inequality, $\sum_{j\neq k}z_{kj}\mu_{kj}^{\prime\prime}(\pi_{ij}^{*})=o_{p}((n\log n)^{1/2})$ such that $B_{*}=O_{p}((\log n/n)^{1/2})$ . In this case, there is no bias in the limiting distribution of $\widehat{\gamma}$ . When $B_{*}=O(1)$ , the confidence intervals and the p-values of hypothesis testing constructed from $\widehat{\gamma}$ cannot achieve the nominal level without bias-correction. This is referred to as the well-known incidental parameter problem in econometrics literature [Neyman and Scott, (1948); Graham, (2017)]. As in Dzemski, (2019), we could use a simple analytical bias correction formula: $\widehat{\gamma}_{bc}=\widehat{\gamma}-N^{-1/2}H^{-1}(\widehat{\beta},\widehat{\gamma})\hat{B}$ , where $\widehat{B}$ is the plug-in estimate of $B_{*}$ by replacing $\beta^{*}$ and $\gamma^{*}$ with their estimators $\widehat{\beta}$ and $\widehat{\gamma}$ , respectively.

5 Numerical studies

5.1 Simulations

In this section, we evaluate the performance of the asymptotic theories through finite sizes of networks. The parameters in the simulations are as follows. The setting of the parameter ${\beta}^{*}$ took a linear form. Specifically, we set $\beta_{i}^{*}=(i-1)c\log n/(n-1)$ for $i=1,\ldots,n$ , where we chose four different values for $c$ , i.e., $c=0.05,0.15,0.3,0.5$ . Since the conditions to guarantee asymptotic properties in theorems depend on the whole quantity $k_{n}/\epsilon_{n}$ , we set $k_{n}$ to be fixed (i.e., $k_{n}=1$ ) and let $\epsilon_{n}$ go to zero as $n$ increases. We considered two different values for $\epsilon_{n}$ : $\log n/n^{1/6}$ and $\log n/n^{1/4}$ . The edge covariates formed as follows. For each node $i$ , we generated two dichotomous random variables $x_{i1}$ and $x_{i2}$ from $\{1,-1\}$ with unequal probabilities $0.4$ and $0.6$ and equal probabilities, respectively. Then we set $z_{ij}=(x_{i1}*x_{j1},x_{i2}*x_{j2})^{\top}$ . For the parameter $\gamma^{*}$ , we let it be $(0.5,-0.5)^{\top}$ . Thus, the first positive value measures the homophily effect and the second negative value measures heterophilic effect. We carried out simulations under two different sizes of networks: $n=100$ and $n=200$ . Each simulation was repeated $10,000$ times.

By Theorem 2, $\hat{\xi}_{ij}=[\hat{\beta}_{i}-\hat{\beta}_{j}-(\beta_{i}^{*}-\beta_{j}^{*})]/(1/\hat{v}_{ii}+1/\hat{v}_{jj})^{1/2}$ converges in distribution to the standard normal distributions, where $\hat{v}_{ii}$ is the estimate of $v_{ii}$ by replacing $\beta^{*}$ with $\hat{\beta}$ . We choose five special pairs $(1,2),(n/2,n/2+1)$ , $(n-1,n)$ , $(1,n/2)$ and $(1,n)$ for $(i,j)$ . We record the coverage probability of the $95\%$ confidence interval, the length of the confidence interval, and the frequency that the estimates do not exist. These values for $\widehat{\gamma}$ are also reported.

Table 1: The reported values are the coverage frequency (

\times 100\%

) for

\alpha_{i}-\alpha_{j}

for a pair

(i,j)

/ the length of the confidence interval / the frequency (

\times 100\%

) that the estimate did not exist.

$\epsilon_{n}=\log n/n^{1/6}$
$n$	$(i,j)$	$c=0.05$	$c=0.15$	$c=0.3$	$c=0.5$
100	(1, 2)	$93.98/1.19/0$	$93.80/1.21/0$	$93.51/1.27/0.08$	$93.36/1.38/28.16$
	(49,50)	$93.91/1.20/0$	$93.80/1.25/0$	$93.57/1.43/0.08$	$93.26/1.84/28.16$
	(99, 100)	$93.62/1.21/0$	$93.69/1.32/0$	$93.51/1.73/0.08$	$96.60/2.89/28.16$
	(1,50)	$94.15/1.20/0$	$93.88/1.23/0$	$93.64/1.34/0.08$	$92.93/1.61/28.16$
	(1, 100)	$93.91/1.20/0$	$94.29/1.26/0$	$93.53/1.50/0.08$	$95.50/2.24/28.16$
200	(1, 2)	$94.37/0.84/0$	$94.57/0.85/0$	$94.44/0.90/0.01$	$94.23/1.00/16.65$
	(99,100)	$94.19/0.84/0$	$94.56/0.89/0$	$94.08/1.05/0.01$	$93.88/1.43/16.65$
	(199,200)	$94.80/0.85/0$	$94.11/0.95/0$	$94.76/1.34/0.01$	$96.05/2.46/16.65$
	(1,100)	$94.27/0.84/0$	$94.38/0.87/0$	$94.51/0.98/0.01$	$93.57/1.23/16.65$
	(1,200)	$94.34/0.84/0$	$94.60/0.90/0$	$94.69/1.14/0.01$	$95.76/1.81/16.65$
$\epsilon_{n}=\log n/n^{1/4}$
100	(1, 2)	$92.61/1.20/0$	$92.74/1.21/0$	$91.88/1.27/0.40$	$91.44/1.39/42.79$
	(49,50)	$92.48/1.20/0$	$92.45/1.25/0$	$91.53/1.43/0.40$	$90.04/1.85/42.79$
	(99, 100)	$92.69/1.21/0$	$92.40/1.32/0$	$90.98/1.73/0.40$	$95.30/2.89/42.79$
	(1,50)	$92.66/1.20/0$	$92.67/1.23/0$	$92.03/1.35/0.40$	$90.81/1.61/42.79$
	(1, 100)	$92.52/1.20/0$	$92.77/1.26/0$	$91.19/1.51/0.40$	$93.60/2.17/42.79$
200	(1, 2)	$93.74/0.84/0$	$93.87/0.85/0$	$93.59/0.90/0.04$	$93.15/1.00/33.72$
	(99,100)	$93.47/0.84/0$	$93.83/0.89/0$	$93.18/1.05/0.04$	$92.14/1.43/33.72$
	(199,200)	$93.95/0.85/0$	$93.26/0.95/0$	$92.98/1.34/0.04$	$93.84/2.47/33.72$
	(1,100)	$93.76/0.84/0$	$93.68/0.87/0$	$93.57/0.98/0.04$	$92.11/1.23/33.72$
	(1,200)	$93.55/0.84/0$	$93.92/0.90/0$	$93.09/1.13/0.04$	$94.16/1.81/33.72$

Table 1 reports the simulation results for $\beta_{i}^{*}-\beta_{j}^{*}$ . The reported frequencies and lengths were conditional on the event that the estimator exists. We found that empirical coverage frequencies are very close to the nominal $95\%$ level when $c\leq 0.3$ and $\epsilon_{n}=\log n/n^{1/6}$ , and they are a little less than the nominal level when $\epsilon_{n}=\log n/n^{1/4}$ and $n=100$ . As expected, the length of the confidence interval increases with $c$ . Conversely, it decreases as $n$ increases. When $c=0.5$ , the estimator failed to exist with a positive frequencies over $20\%$ . In other cases, estimates existed almost in every simulation.

Table 2 reports the median of $\widehat{\gamma}$ as well as those of the bias corrected estimator $\widehat{\gamma}_{bc}=\widehat{\gamma}-\bar{H}^{-1}\hat{B}$ . As we can see, the bias is small and the empirical coverage frequencies for the estimators $\widehat{\gamma}_{bc}$ are more closer to the target level $95\%$ than those values for bias-uncorrected $\widehat{\gamma}$ when $c\leq 0.15$ . When $c=0.3$ and $n=100$ , they are a little lower than the target level. On the other hand, when $n$ is fixed, the length of confidence interval of $\widehat{\gamma}$ increases as $c$ becomes larger.

Table 2: The reported values are the coverage frequency (

\times 100\%

) for

\gamma_{i}

for

i

with bias-correction (uncorrected)/ bias (

\times 100

) / length of confidence interval (

\times 10

) /the frequency (

\times 100\%

) that the MLE did not exist (

\boldsymbol{\gamma}^{*}=(0.5,-0.5)^{\top}

$n$	$c$	$\gamma$	$\epsilon_{n}=\log n/n^{1/6}$	$\epsilon_{n}=\log n/n^{1/4}$
$100$	$0.05$	$\gamma_{1}$	$95.14(93.69)/1.03/0.13/0$	$94.84(92.37)/1.02/0.13/0$
		$\gamma_{2}$	$95.31(93.62)/1.01/0.12/0$	$95.01(92.62)/1.00/0.12/0$
	$0.15$	$\gamma_{1}$	$94.65(93.40)/0.87/0.13/0$	$94.70(93.18)/0.87/0.13/0$
		$\gamma_{2}$	$94.71(92.96)/0.80/0.13/0$	$94.61(92.57)/0.80/0.13/0$
	$0.3$	$\gamma_{1}$	$94.10(93.74)/0.37/0.15/0.08$	$92.96(92.17)/0.37/0.15/0.40$
		$\gamma_{2}$	$93.87(93.47)/0.20/0.15/0.08$	$93.01(92.52)/0.20/0.15/0.40$
	$0.5$	$\gamma_{1}$	$91.08(92.72)/0.75/0.20/28.16$	$89.70(91.94)/0.76/0.20/42.79$
		$\gamma_{2}$	$90.41(93.19)/1.08/0.19/28.16$	$89.06(92.50)/1.08/0.19/42.79$
$200$	$0.05$	$\gamma_{1}$	$95.23(93.72)/0.51/0.06/0$	$95.11(93.50)/0.51/0.06/0$
		$\gamma_{2}$	$95.31(93.64)/0.50/0.06/0$	$95.29(93.46)/0.50/0.06/0$
	$0.15$	$\gamma_{1}$	$95.38(93.92)/0.41/0.07/0$	$95.14(93.70)/0.41/0.07/0$
		$\gamma_{2}$	$94.90(93.11)/0.37/0.06/0$	$94.72(92.85)/0.37/0.06/0$
	$0.3$	$\gamma_{1}$	$95.01(94.80)/0.08/0.08/0.01$	$94.60(94.30)/0.08/0.08/0.04$
		$\gamma_{2}$	$94.45(94.52)/0.02/0.08/0.01$	$94.10(94.22)/0.02/0.08/0.04$
	$0.5$	$\gamma_{1}$	$91.40(93.64)/0.64/0.11/16.65$	$90.43(93.00)/0.64/0.11/33.72$
		$\gamma_{2}$	$90.10(93.70)/0.86/0.10/16.65$	$89.56(93.29)/0.86/0.10/33.72$

5.2 A real data example

We use the Enron email dataset as an example analysis [Cohen, (2004)], available from https://www.cs.cmu.edu/~enron/. This dataset was released by William Cohen at Carnegie Mellon University and has been widely studied. The Enron email data was originally acquired and made public by the Federal Energy Regulatory Commission during its investigation into fraudulent accounting practices. Some of the emails have been deleted upon requests from affected employees. However, the raw data is messy and needs to be cleaned before any analysis is conducted. Zhou et al., (2007) applied data cleaning strategies to compile the Enron email dataset. We use their cleaned data for the subsequent analysis. The resulting data comprises $21,635$ messages sent between $156$ employees with their covariate information. Thus, the corresponding graph has multiple edges. We treat it as a simple undirected graph for our analysis, where each edge denotes that there are at least one message between the corresponding two nodes. We remove the isolated nodes “32” and “37” with zero degrees, where the estimators of the corresponding degree parameters do not exist. This leaves a connected network with $154$ nodes and $1843$ edges. The minimum, $1/4$ quantile, median, $3/4$ quantile and maximum values of $d$ are $1$ , $15$ , $21$ , $30$ and $88$ , respectively.

Refer to caption — Figure 1: Visualization of Enran email network among $154$ employees. The vertex sizes are proportional to nodal degrees. The positions of the vertices are the same in two graphs. For nodes with degrees less than $3$ , we set their sizes the same (as a node with degrees 3). In the left graph, the colors indicate different departments (red for legal and blue for trading and orange for other), while in the right graph, the colors represent different genders (blue for male and red for female).

Each employee has three categorical variables: departments of these employees (Trading, Legal, Other), the genders (Male, Female) and seniorities (Senior, Junior). We plot the network with individual departments and genders in Figure 1. We can see that the degrees exhibit a great variation across nodes and it is not easy to judge homophic or heteriphic effects that require quantitative analysis. The $3$ -dimensional covariate vector $z_{ij}$ of edge $(i,j)$ is formed by using a homophilic matching function between these three covariates of two employees $i$ and $j$ , i.e., if $x_{ik}$ and $x_{jk}$ are equal, then $z_{ijk}=1$ ; otherwise $z_{ijk}=-1$ .

We evaluate how close the estimator $(\hat{\alpha},\hat{\beta})$ is to the original MLE $(\tilde{\alpha},\tilde{\beta})$ fitted in the generalized $\beta$ –model. We chose two $\epsilon_{n}$ ( $\log n/n^{1/6}$ , $\log n/n^{1/4}$ ) and let $k_{n}=1$ as in simulations, and repeated to release $d$ and $y$ using according to (7) $1,000$ times for each $\epsilon_{n}$ . Then we computed the median private estimate and the upper ( $97.5^{th}$ ) and the lower ( $2.5^{th}$ ) quantiles. The private estimate existed for each output. The frequencies that the private estimate fails to exist are $33.7\%$ and $45.2\%$ for $\epsilon=\log n/n^{1/6}$ and $\epsilon=\log n/n^{1/4}$ , respectively. The results for the estimates $\widehat{\gamma}$ and $\widehat{\beta}$ are shown in Table 3 and Figure 2. From Table 3, we can see that the median value of $\widehat{\gamma}$ is the same as the MLE and the MLE lies in the $95\%$ confidence interval. The similar phenomenon can also be observed in Figure 2. From this figure, we can see that the length of confidence interval of private estimates under $\epsilon_{n}=\log n/n^{1/6}$ are shorter than those under $\epsilon_{n}=\log n/n^{1/4}$ .

Table 3: The nonprivate MLE

\widehat{\gamma}_{i}^{0}

\gamma_{i}

, the median of private estimates of

\gamma_{i}

, the length of confidence interval for Enron email data.

Covariate	$\hat{\gamma}_{i}^{0}$	$\hat{\gamma}_{i}$	$95\%$ confidence interval
	$\epsilon_{n}=\log n/n^{1/6}$
Department	$-0.016$	$-0.016$	$[-0.083,0.033]$
Gender	$0.063$	$0.063$	$[0.005,0.136]$
Seniority	$0.032$	$0.032$	$[-0.022,0.085]$
	$\epsilon_{n}=\log n/n^{1/4}$
Department	$-0.016$	$-0.016$	$[-0.083,0.033]$
Gender	$0.063$	$0.062$	$[0.004,0.135]$
Seniority	$0.032$	$0.032$	$[-0.021,0.085]$

6 Summary and discussion

We have present the $(k_{n},\epsilon_{n})$ -edge-differentially private estimation for inferring the degree parameter and homophily parameter in the generalized $\beta$ –model. We establish consistency of the estimator under several conditions and also derive its asymptotic normal distribution. It is worth noting that the conditions imposed on $b_{n}$ imply the network density going to zero with a very slow rate. When networks are very sparse, adding noises easily produces the outputs of negative degrees, which will lead to the non-existence of the maximum likelihood estimator in the covariate-adjusted $\beta$ -model. In addition, the conditions in Theorems 2 and 3 seem stronger than those needed for consistency. Note that the asymptotic behavior of the estimator depends not only on $b_{n}$ , but also on the configuration of the parameters. It is of interest to see whether conditions for guaranteeing theoretical properties could be relaxed.

We use a generalized $\beta$ –model to give a rigorous differential privacy analysis of networks with covariates. It is notable that the assumption of the logistic distribution of an edge is not essential in our strategies for proofs. Our these principled methods should be applicable to a class of network models beyond the covariate-adjusted $\beta$ -model [Wang et al., (2023)]. For instance, the developed two-stage Newton method to prove the consistency of the differentially private estimator still works if the logistic distribution is replaced with the probit distribution. Further, the edge independence assumption is not directly used in this method, which only plays a role to derive the upper bound of $d$ and $y$ . There are many tail probability inequalities in dependent random variables that could be applied to network statistics with edge dependence situations. We hope that the methods developed here can be further to be applied to edge-dependence network models.

7 Appendix

7.1 Preliminaries

In this section, we present three results that will be used in the proofs. The first is on the approximation error of using $S$ to approximate the inverse of $V$ belonging to the matrix class $\mathcal{L}_{n}(m,M)$ , where $V=(v_{ij})_{n\times n}$ and $S=\mathrm{diag}(1/v_{11},\ldots,1/v_{nn})$ . Yan et al., (2015) obtained the upper bound of the approximation error, which has an order $n^{-2}$ . The second is a tight bound of $\|V^{-1}\|_{\infty}$ in Hillar et al., (2012). These two results are stated below as lemmas.

Lemma 3 (Yan et al., (2015)).

If $V\in\mathcal{L}_{n}(m,M)$ , then the following holds:

\|V^{-1}-S\|_{\max}\leq\frac{1}{(n-1)^{2}}\left(\frac{M}{2m^{2}}+\frac{nM^{2}}{2(n-2)m^{3}}+\frac{3n-2}{2nm}\right)=O(\frac{M^{2}}{n^{2}m^{3}}),

where $\|A\|_{\max}=\max_{i,j}|A_{ij}|$ for a general matrix $A$ .

Lemma 4 (Hillar et al., (2012)).

For $V\in\mathcal{L}_{n}(m,M)$ , when $n\geq 3$ , we have

\frac{1}{2M(n-1)}\leq\|V^{-1}\|_{\infty}\leq\frac{3n-4}{2m(n-1)(n-2)}.

Let $F(x):\mathbb{R}^{n}\to\mathbb{R}^{n}$ be a function vector on $x\in\mathbb{R}^{n}$ . We say that a Jacobian matrix $F^{\prime}(x)$ with $x\in\mathbb{R}^{n}$ is Lipschitz continuous on a convex set $D\subset\mathbb{R}^{n}$ if for any $x,y\in D$ , there exists a constant $\lambda>0$ such that for any vector $v\in\mathbb{R}^{n}$ the inequality

\|[F^{\prime}(x)]v-[F^{\prime}(y)]v\|_{\infty}\leq\lambda\|x-y\|_{\infty}\|v\|_{\infty}

holds. We will use the Newton iterative sequence to establish the existence and consistency of the differentially private estimator. Gragg and Tapia, (1974) gave the optimal error bound for the Newton method under the Kantovorich conditions [Kantorovich, (1948)]. We only show partial results here that are enough for our applications.

Lemma 5 (Gragg and Tapia, (1974)).

Let $D$ be an open convex set of $\mathbb{R}^{n}$ and $F:D\to\mathbb{R}^{n}$ be Fréchet differentiable on $D$ with a Jacobian $F^{\prime}(x)$ that is Lipschitz continuous on $D$ with Lipschitz coefficient $\lambda$ . Assume that $x_{0}\in D$ is such that $[F^{\prime}(x_{0})]^{-1}$ exists,

\|[F^{\prime}(x_{0})]^{-1}\|\leq\aleph,~{}~{}\|[F^{\prime}(x_{0})]^{-1}F(x_{0})\|\leq\delta,~{}~{}h=2\aleph\lambda\delta\leq 1,

and

B(x_{0},t^{*})\subset D,~{}~{}t^{*}=\frac{2}{h}(1-\sqrt{1-h})\delta=\frac{2\delta}{1+\sqrt{1-h}}.

Then: (1) The Newton iterations $x_{k+1}=x_{k}-[F^{\prime}(x_{k})]^{-1}F(x_{k})$ exist and $x_{k}\in B(x_{0},t^{*})\subset D$ for $k\geq 0$ . (2) $x^{*}=\lim x_{k}$ exists, $x^{*}\in\overline{B(x_{0},t^{*})}\subset D$ and $F(x^{*})=0$ .

7.2 Error bound between $\widehat{\beta}_{\gamma}$ and $\beta^{*}$

We will use the Newton method to derive the error bound between $\widehat{\beta}_{\gamma}$ and $\beta^{*}$ through verifying the Kantororich conditions in Lemma 5. The Kantororich conditions require the Lipschitz continuous of $F^{\prime}_{\gamma}(\beta)$ and the upper bounds of $F_{\gamma}(\beta^{*})$ . We need three lemmas below, whose proofs are in supplementary material.

Lemma 6.

For any given $\gamma$ , the Jacobian matrix $F^{\prime}_{\gamma}(x)$ of $F_{\gamma}(x)$ on $x$ is Lipschitz continuous on $\mathbb{R}^{n}$ with the Lipschitz coefficient $(n-1)$ .

Lemma 7.

The tail probabilities of $\|d-\mathbb{E}d\|_{\infty}$ and $\|y-\mathbb{E}y\|_{\infty}$ are given below:

	$\displaystyle\mathbb{P}\Bigg{(}\\|d-\mathbb{E}d\\|_{\infty}\geq\sqrt{n\log n}\Bigg{)}$	$\displaystyle\leq$	$\displaystyle\frac{2}{n},$
	$\displaystyle\mathbb{P}\left(\\|y-\mathbb{E}y\\|_{\infty}\geq 2z_{*}\sqrt{n(n-1)\log n/2}\right)$	$\displaystyle\leq$	$\displaystyle\frac{2p}{n^{2}}.$

Lemma 8.

For any given $c>0$ , we have

\mathbb{P}(\max_{i=1,\ldots,n}|\xi_{i}|>c)<n\exp(-\frac{c\varepsilon_{n}}{4k_{n}}),~{}~{}\mathbb{P}(\max_{i=1,\ldots,p}|\eta_{i}|>c)<p\exp(-\frac{c\varepsilon_{n}}{pz_{*}k_{n}}).

Recall that

\tilde{\varepsilon}_{n}=1+\frac{4k_{n}}{\varepsilon_{n}}\sqrt{\frac{\log n}{n}}.

We state the error bound between $\widehat{\beta}_{\gamma}$ and $\beta^{*}$ below.

Lemma 9.

Assume $\epsilon_{n2}=O((\log n)^{1/2}n^{-1/2})$ and $\gamma\in B(\gamma^{*},\epsilon_{n2})$ . If $b_{n}^{2}\tilde{\varepsilon}_{n}=o((n/\log n)^{1/2})$ , then with probability approaching one, $\widehat{\beta}_{\gamma}$ exists and satisfies

\|\widehat{\beta}_{\gamma}-\beta^{*}\|_{\infty}=O_{p}\left(b_{n}\tilde{\varepsilon}_{n}\sqrt{\frac{\log n}{n}}\right)=o_{p}(1).

Proof of Lemma 9.

We will derive the error bound between $\widehat{\beta}_{\gamma}$ and $\beta^{*}$ through constructing the Newton iterative sequence $\beta^{(n+1)}=\beta^{(n)}-F_{\gamma}^{\prime}(\beta^{(n)})F_{\gamma}(\beta^{(n)})$ , where we choose $\beta^{*}$ as the starting point $\beta^{(0)}:=\beta^{*}$ . To this end, it is sufficient to verify the Kantovorich conditions in Lemma 5. Note that $F^{\prime}_{\gamma}(\beta)\in\mathcal{L}_{n}(b_{n}^{-1},1/4)$ when $\beta\in B(\beta^{*},\epsilon_{n1})$ and $\gamma\in B(\gamma^{*},\epsilon_{n2})$ , where $\epsilon_{n1}$ is a small positive number and $\epsilon_{n2}=O((\log n/n)^{1/2})$ . The following calculations are based on the event $E_{n}$ :

E_{n}=\{d:\|\tilde{d}-\mathbb{E}d\|_{\infty}\leq\tilde{\varepsilon}_{n}(n\log n)^{1/2}\}.

Let $V=(v_{ij})=\partial F_{\gamma}(\beta^{*})/\partial\beta^{\top}$ and $S=\mathrm{diag}(1/v_{11},\ldots,1/v_{nn})$ . By Lemma 4, we have $\aleph=\|V^{-1}\|_{\infty}=O(b_{n}/n)$ . Recall that $F_{\gamma^{*}}(\beta^{*})=\mathbb{E}d-\tilde{d}$ . Note that the dimension $p$ of $\gamma$ is a fixed constant. If $\epsilon_{n2}=O((\log n)^{1/2}n^{-1/2})$ , by the mean value theorem and the event $E_{n}$ , we have

$\displaystyle\\|F_{\gamma}(\beta^{*})\\|_{\infty}$	$\displaystyle\leq$	$\displaystyle\\|\tilde{d}-\mathbb{E}d\\|_{\infty}+\max_{i}\|\sum\nolimits_{j\neq i}[\mu_{ij}(\beta^{},\gamma)-\mu_{ij}(\beta^{},\gamma^{*})]\|$
	$\displaystyle\leq$	$\displaystyle O(\tilde{\varepsilon}_{n}(n\log n)^{1/2})+\max_{i}\sum_{j\neq i}\|\mu_{ij}^{\prime}(\beta^{},\bar{\gamma})\|\|z_{ij}^{\top}(\gamma-\gamma^{})\|$
	$\displaystyle\leq$	$\displaystyle O(\tilde{\varepsilon}_{n}(n\log n)^{1/2})+O(n\cdot(\log n)^{1/2}n^{-1/2})$
	$\displaystyle=$	$\displaystyle O(\tilde{\varepsilon}_{n}(n\log n)^{1/2}).$

Repeatedly utilizing Lemma 4, we have

\displaystyle\delta=\|[F^{\prime}_{\gamma}(\beta^{*})]^{-1}F_{\gamma}(\beta^{*})\|_{\infty}=\|[F^{\prime}_{\gamma}(\beta^{*})]^{-1}\|_{\infty}\|F_{\gamma}(\beta^{*})\|_{\infty}=O\left(\tilde{\varepsilon}_{n}b_{n}\sqrt{\frac{\log n}{n}}\right)

By Lemma 6, $F_{\gamma}(\beta)$ is Lipschitz continuous with Lipschitz coefficient $\lambda=n-1$ . Therefore, if $\tilde{\varepsilon}_{n}b_{n}^{2}=o((n/\log n)^{1/2})$ , then

	$\displaystyle h=2\aleph\lambda\delta$	$\displaystyle=$	$\displaystyle O(\frac{b_{n}}{n})\times O(n)\times O(\tilde{\varepsilon}_{n}b_{n}\sqrt{\frac{\log n}{n}})$
		$\displaystyle=$	$\displaystyle O\left(\tilde{\varepsilon}_{n}b_{n}^{2}\sqrt{\frac{\log n}{n}}\right)=o(1).$

The above arguments verify the Kantovorich conditions. By Lemma 5, it yields that

\|\widehat{\beta}_{\gamma}-\beta^{*}\|_{\infty}=O\left(\tilde{\varepsilon}_{n}b_{n}\sqrt{\frac{\log n}{n}}\right).

(21)

To finish the proof, it is left to show $\mathbb{P}(E_{n}^{c})\to 0$ . Note that $\tilde{d}_{i}=d_{i}+\xi_{i}$ . By Lemmas 7 and 8, it can be verified as follows:

$\displaystyle\mathbb{P}(E_{n}^{c})$	$\displaystyle=$	$\displaystyle\mathbb{P}(\\|\tilde{d}-\mathbb{E}d\\|_{\infty}>\tilde{\varepsilon}_{n}\sqrt{n\log n})$
	$\displaystyle\leq$	$\displaystyle\mathbb{P}(\\|d-\mathbb{E}d\\|_{\infty}>\sqrt{n\log n})+\mathbb{P}(\max_{i=1,\ldots,n}\xi_{i}>(\tilde{\varepsilon}_{n}-1)\sqrt{n\log n})$
	$\displaystyle\leq$	$\displaystyle\frac{2}{n}\to 0.$

It completes the proof. ∎

7.3 Proof of Theorem 1

To show Theorem 1, we need three lemmas below.

Lemma 10.

Let $D=B(\gamma^{*},\epsilon_{n2})(\subset\mathbb{R}^{p})$ be an open convex set containing the true point $\gamma^{*}$ . If $\|\tilde{d}-\mathbb{E}d\|_{\infty}=O(\tilde{\epsilon}_{n}(n\log n)^{1/2})$ , then $Q_{c}(\gamma)$ is Lipschitz continuous on $D$ with the Lipschitz coefficient $n^{2}b_{n}^{-3}$ .

Lemma 11.

Write $\widehat{\beta}^{*}$ as $\widehat{\beta}_{\gamma^{*}}$ and $V=\partial F(\beta^{*},\gamma^{*})/\partial\beta^{\top}$ . If $b_{n}^{2}\tilde{\varepsilon}_{n}=o((n/\log n)^{1/2})$ , then $\widehat{\beta}^{*}$ has the following expansion:

\widehat{\beta}^{*}-\beta^{*}=V^{-1}F(\beta^{*},\gamma^{*})+V^{-1}R,

(22)

where $R=(R_{1},\ldots,R_{n})^{\top}$ is the remainder term and

\left\|V^{-1}R\right\|_{\infty}=O_{p}(\frac{b_{n}^{3}\tilde{\varepsilon}_{n}^{2}\log n}{n}).

Lemma 12.

If $b_{n}^{2}\tilde{\varepsilon}_{n}=o((n/\log n)^{1/2})$ , for any $\beta\in B(\beta^{*},\epsilon_{n1})$ and $\gamma\in B(\gamma^{*},\epsilon_{n2})$ , then we have

\|\frac{\partial Q(\beta,\gamma)}{\partial\beta^{\top}}(\widehat{\beta}_{\gamma}-\beta^{*})\|_{\infty}=O_{p}(\tilde{\varepsilon}_{n}b_{n}^{3}n\log n).

Now we are ready to prove Theorem 1.

Proof of Theorem 1.

We construct the Newton iterative sequence to show the consistency. It is sufficient to verify the Kantovorich conditions in Lemma 5. In the Newton method, we set $\gamma^{*}$ as the initial point $\gamma^{(0)}$ and $\gamma^{(k+1)}=\gamma^{(k)}-[Q_{c}^{\prime}(\gamma^{(k)})]^{-1}Q_{c}(\gamma^{(k)})$ .

The following calculations are based on the event $E_{n}$ that for $\gamma\in B(\gamma^{*},\epsilon_{n2})$ , $\widehat{\beta}_{\gamma}$ exists and satisfies

\|\widehat{\beta}_{\gamma}-\beta^{*}\|_{\infty}=O\left(\tilde{\varepsilon}_{n}b_{n}\sqrt{\frac{\log n}{n}}\right).

This shows that $\widehat{\beta}_{\gamma^{(0)}}$ exists such that $Q_{c}(\gamma^{(0)})$ and $Q_{c}^{\prime}(\gamma^{(0)})$ are well defined. This in turn shows that in every iterative step, $\gamma^{(k+1)}$ exists as long as $\gamma^{(k)}$ exists.

Recall the definition of $Q_{c}(\gamma)$ and $Q(\beta,\gamma)$ in (13) and (14). By Lemmas 7 and 8, we have

$\displaystyle\\|Q(\beta^{},\gamma^{})\\|_{\infty}$	$\displaystyle=$	$\displaystyle\\|\mathbb{E}y-y\\|_{\infty}+\\|\eta\\|_{\infty}$
	$\displaystyle=$	$\displaystyle O_{p}(z_{*}n(\log n)^{1/2})+O_{p}((pk_{n}/\varepsilon_{n})\log n)$
	$\displaystyle=$	$\displaystyle O_{p}((z_{*}+\frac{pk_{n}(\log n)^{1/2}}{n\varepsilon_{n}})n(\log n)^{1/2}).$

By the mean value theorem and Lemma 12, we have

\|Q(\widehat{\beta}^{*},\gamma^{*})-Q(\beta^{*},\gamma^{*})\|_{\infty}=\|\frac{\partial Q(\bar{\beta},\gamma)}{\partial\beta^{\top}}(\widehat{\beta}_{\gamma}-\beta^{*})\|_{\infty}=O_{p}(\tilde{\varepsilon}_{n}b_{n}^{3}n\log n)).

Then it follows that

	$\displaystyle\\|Q_{c}(\gamma^{*})\\|_{\infty}$	$\displaystyle\leq$	$\displaystyle\\|Q(\beta^{},\gamma^{})\\|_{\infty}+\\|Q(\widehat{\beta}^{},\gamma^{})-Q(\beta^{},\gamma^{})\\|_{\infty}$
		$\displaystyle=$	$\displaystyle O_{p}\left((b_{n}^{3}+(k_{n}/\varepsilon_{n})b_{n}^{3}+z_{*}/(\log n)^{1/2})n\log n)\right):=O_{p}(\tau_{n}n\log n).$

By Lemma 10, $Q^{\prime}_{c}(\gamma)$ is Lipschitz continuous with $\lambda=n^{2}b_{n}^{3}$ . Note that $\aleph=\|[Q_{c}^{\prime}(\gamma^{*})]^{-1}\|_{\infty}=O(\rho_{n}n^{-2})$ . Thus,

\delta=\|[Q_{c}^{\prime}(\gamma^{*})]^{-1}Q_{c}(\gamma^{*})\|_{\infty}=O_{p}\left(\frac{\rho_{n}\tau_{n}\log n}{n}\right).

As a result, if $b_{n}^{3}\rho_{n}^{2}\tau_{n}=o(n/\log n)$ , then

h=2\aleph\lambda\delta=O_{p}(\frac{\rho_{n}}{n^{2}}\cdot n^{2}b_{n}^{3}\cdot\rho_{n}\tau_{n}\cdot\frac{\log n}{n})=o_{p}(1).

By Lemma 5, with probability approaching one, the limiting point of the sequence $\{\gamma^{(k)}\}_{k=1}^{\infty}$ exists denoted by $\widehat{\gamma}$ , and satisfies

\|\widehat{\gamma}-\gamma^{*}\|_{\infty}=O_{p}\left(\frac{\rho_{n}\tau_{n}\log n}{n}\right).

At the same time, by Lemma 9, $\widehat{\beta}_{\widehat{\gamma}}$ exists, denoted by $\widehat{\beta}$ . The limiting points $(\widehat{\beta},\widehat{\gamma})$ satisfies the equation (9). It completes the proof. ∎

7.4 Proofs for Theorem 2

Proof of Theorem 2.

To simplify notations, write $\widehat{\pi}_{ij}=\widehat{\beta}_{i}+\widehat{\beta}_{j}+z_{ij}^{\top}\widehat{\gamma}$ , $\pi_{ij}^{*}=\beta_{i}^{*}+\beta_{j}^{*}+z_{ij}^{\top}\gamma^{*}$ , $\mu_{ij}^{\prime}=\mu^{\prime}(\pi_{ij}^{*})$ and

V=\frac{\partial F(\beta^{*},\gamma^{*})}{\partial\beta^{\top}},~{}~{}V_{\beta\gamma}=\frac{\partial F(\beta^{*},\gamma^{*})}{\partial\gamma^{\top}}.

By a second order Taylor expansion, we have

\mu(\widehat{\pi}_{ij})-\mu(\pi_{ij}^{*})=\mu_{ij}^{\prime}(\widehat{\beta}_{i}-\beta_{i})+\mu_{ij}^{\prime}(\widehat{\beta}_{j}-\beta_{j})+\mu_{ij}^{\prime}z_{ij}^{\top}(\widehat{\gamma}-\gamma)+g_{ij},

(23)

where

g_{ij}=\frac{1}{2}\begin{pmatrix}\widehat{\beta}_{i}-\beta_{i}^{*}\\ \widehat{\beta}_{j}-\beta_{j}^{*}\\ \widehat{\gamma}-\gamma^{*}\end{pmatrix}^{\top}\begin{pmatrix}\mu^{\prime\prime}_{ij}(\tilde{\pi}_{ij})&-\mu^{\prime\prime}_{ij}(\tilde{\pi}_{ij})&\mu^{\prime\prime}_{ij}(\tilde{\pi}_{ij})z_{ij}^{\top}\\ -\mu^{\prime\prime}_{ij}(\tilde{\pi}_{ij})&\mu^{\prime\prime}_{ij}(\tilde{\pi}_{ij})&-\mu^{\prime\prime}_{ij}(\tilde{\pi}_{ij})z_{ij}^{\top}\\ \mu^{\prime\prime}_{ij}(\tilde{\pi}_{ij})z_{ij}^{\top}&-\mu^{\prime\prime}_{ij}(\tilde{\pi}_{ij})z_{ij}^{\top}&\mu^{\prime\prime}_{ij}(\tilde{\pi}_{ij})z_{ij}z_{ij}^{\top}\end{pmatrix}\begin{pmatrix}\widehat{\beta}_{i}-\beta_{i}^{*}\\ \widehat{\beta}_{j}-\beta_{j}^{*}\\ \widehat{\gamma}-\gamma^{*}\end{pmatrix},

and $\tilde{\pi}_{ij}$ lies between $\pi_{ij}^{*}$ and $\widehat{\pi}_{ij}$ . Recall that $z_{*}:=\max_{i,j}\|z_{ij}\|_{\infty}$ . Since $|\mu^{\prime\prime}(\pi_{ij})|\leq 1/4$ (see (10)), we have

\begin{array}[]{rcl}|g_{ij}|&\leq&\|\widehat{\beta}-\beta^{*}\|_{\infty}^{2}+\tfrac{1}{2}\|\widehat{\beta}-\beta^{*}\|_{\infty}\|\widehat{\gamma}-\gamma^{*}\|_{1}z_{*}+\tfrac{1}{4}\|\|\widehat{\gamma}-\gamma^{*}\|_{1}^{2}z_{*}^{2}\\ &\leq&\tfrac{1}{2}[4\|\widehat{\beta}-\beta^{*}\|_{\infty}^{2}+\|\widehat{\gamma}-\gamma^{*}\|_{1}^{2}z_{*}^{2}].\end{array}

Let $g_{i}=\sum_{j\neq i}g_{ij}$ and $g=(g_{1},\ldots,g_{n})^{\top}$ . If $\rho_{n}^{2}\tau_{n}^{2}=o(n/\log n)$ , by Theorem 1, we have

\max_{i=1,\ldots,n}|g_{i}|\leq n\max_{i,j}|g_{ij}|=O_{p}\left(\left(b_{n}^{2}\tilde{\varepsilon}_{n}^{2}+\frac{\rho_{n}^{2}\tau_{n}^{2}\log n}{n}\right)\log n\right)=O_{p}\left(b_{n}^{2}\tilde{\varepsilon}_{n}^{2}\log n\right).

(24)

By writing (23) into a matrix form, we have

\tilde{d}-\mathbb{E}d=V(\widehat{\beta}-\beta^{*})+V_{\beta\gamma}(\widehat{\gamma}-\gamma^{*})+g,

which is equivalent to

\widehat{\beta}-\beta^{*}=V^{-1}(d-\mathbb{E}d)+V^{-1}V_{\beta\gamma}(\widehat{\gamma}-\gamma^{*})+V^{-1}g+V^{-1}\xi.

(25)

We bound the last three remainder terms in the above equation as follows. Let $W=V^{-1}-S$ . Note that $(Sg)_{i}=g_{i}/v_{ii}$ and $(n-1)b_{n}^{-1}\leq v_{ii}\leq(n-1)/4$ . By Lemma 4 and inequality (24), we have

\displaystyle\|V^{-1}g\|_{\infty}\leq\|V^{-1}\|_{\infty}\|g\|_{\infty}=O(\frac{b_{n}}{n}\times b_{n}^{2}\tilde{\varepsilon}_{n}^{2}\log n)=O_{p}(\frac{b_{n}^{3}\tilde{\varepsilon}_{n}^{2}\log n}{n}).

(26)

Note that the $i$ th row of $V_{\beta\gamma}$ is $\sum_{j=1,j\neq i}^{n}\mu^{\prime}_{ij}z_{ij}^{\top}$ . By Theorem 1, we have

\|V_{\beta\gamma}(\widehat{\gamma}-\gamma^{*})\|_{\infty}\leq(n-1)z_{*}\|\widehat{\gamma}-\gamma^{*}\|_{1}=O_{p}(z_{*}\rho_{n}\tau_{n}\log n).

By Lemma 3, we have

\begin{array}[]{rcl}\|V^{-1}V_{\beta\gamma}(\widehat{\gamma}-\gamma^{*})\|_{\infty}\leq\|V^{-1}\|_{\infty}\|V_{\gamma\beta}(\widehat{\gamma}-\gamma^{*})\|_{\infty}=O_{p}(\frac{z_{*}\rho_{n}\tau_{n}b_{n}\log n}{n}).\end{array}

(27)

By Lemma 8,

P(\|\xi\|_{\infty}>8(k_{n}/\varepsilon_{n})\log n)\leq n\times\exp(-8(k_{n}/\varepsilon_{n})\log n\times 4(\varepsilon_{n}/k_{n}))=\frac{1}{n},

such that

\|\xi\|_{\infty}=O_{p}((k_{n}/\varepsilon_{n})\log n).

Thus, it yields

\|V^{-1}\xi\|_{\infty}\leq\|V^{-1}\|_{\infty}\|\xi\|_{\infty}=O_{p}\left(b_{n}(k_{n}/\varepsilon_{n})\frac{\log n}{n}\right).

(28)

By combining (25), (26), (27) and (28), it yields

\widehat{\beta}_{i}-\beta^{*}_{i}=[V^{-1}(d-\mathbb{E}d)]_{i}+O_{p}(\frac{(b_{n}^{3}\tilde{\varepsilon}_{n}^{2}+z_{*}\rho_{n}\tau_{n}b_{n})\log n}{n}).

Let $W=V^{-1}-S$ and $U=\mathrm{Cov}(W(d-\mathbb{E}d))$ . It is easy to verify that

U=V^{-1}-S-S(I_{n}-VS)

and

[S(I_{n}-VS)]_{ij}=\frac{(\delta_{ij}-1)v_{ij}}{v_{ii}v_{jj}}.

By Lemma 3, we have

\|U\|_{\max}=O(b_{n}^{3}n^{-2}).

Therefore,

[W(d-\mathbb{E}d)]_{i}=O_{p}(b_{n}^{3/2}n^{-1}).

(29)

Consequently, by combining (25), (26), (27) and (29), we have

\widehat{\beta}_{i}-\beta^{*}_{i}=\frac{d_{i}-\mathbb{E}d_{i}}{v_{ii}}++O_{p}(\frac{(b_{n}^{3}\tilde{\varepsilon}_{n}^{2}+z_{*}\rho_{n}\tau_{n}b_{n})\log n}{n}).

Therefore, Theorem 2 immediately follows from Proposition 1. ∎

7.5 Proof of Theorem 3

Proof of Theorem 3.

Assume that the conditions in Theorem 1 hold. A mean value expansion gives

Q_{c}(\widehat{\gamma})-Q_{c}(\gamma^{*})=\frac{\partial Q_{c}(\bar{\gamma})}{\partial\gamma^{\top}}(\widehat{\gamma}-\gamma^{*}),

where $\bar{\gamma}$ lies between $\gamma^{*}$ and $\widehat{\gamma}$ . By noting that $Q_{c}(\widehat{\gamma})=0$ , we have

\sqrt{N}(\widehat{\gamma}-\gamma^{*})=-\Big{[}\frac{1}{N}\frac{\partial Q_{c}(\bar{\gamma})}{\partial\gamma^{\top}}\Big{]}^{-1}\times\frac{1}{\sqrt{N}}¡¡Q_{c}(\gamma^{*}).

Note that the dimension of $\gamma$ is fixed. By Theorem 1 and (4.1), we have

\frac{1}{N}\frac{\partial Q_{c}(\bar{\gamma})}{\partial\gamma^{\top}}\stackrel{{\scriptstyle p}}{{\to}}\bar{H}=\lim_{N\to\infty}\frac{1}{N}H(\beta^{*},\gamma^{*}).

Write $\widehat{\beta}^{*}$ as $\widehat{\beta}(\gamma^{*})$ for convenience. Let $\bar{Q}(\beta,\gamma)=Q(\beta,\gamma)-\eta$ and $\bar{Q}_{c}(\beta,\gamma)=Q_{c}(\beta,\gamma)-\eta$ . Note that $\eta$ is a Laplace random vector. By Lemma 10, if $k_{n}/\epsilon_{n}=o(n/\log n)$ , then

\frac{\|\eta\|_{\infty}}{N^{1/2}}=O_{p}\left(\frac{p(k_{n}/\epsilon_{n})\log n}{n}\right)=o_{p}(1).

Therefore,

\sqrt{N}(\widehat{\gamma}-\gamma^{*})=-\bar{H}^{-1}\cdot\frac{1}{\sqrt{N}}\bar{Q}(\widehat{\beta}^{*},\gamma^{*})+o_{p}(1).

(30)

By applying a third order Taylor expansion to $\bar{Q}(\widehat{\beta}^{*},\gamma^{*})$ , it yields

\frac{1}{\sqrt{N}}\bar{Q}(\widehat{\beta}^{*},\gamma^{*})=S_{1}+S_{2}+S_{3},

(31)

where

\begin{array}[]{l}S_{1}=\frac{1}{\sqrt{N}}\bar{Q}(\beta^{*},\gamma^{*})+\frac{1}{\sqrt{N}}\Big{[}\frac{\partial\bar{Q}(\beta^{*},\gamma^{*})}{\partial\beta^{\top}}\Big{]}(\widehat{\beta}^{*}-\beta^{*}),\\ S_{2}=\frac{1}{2\sqrt{N}}\sum_{k=1}^{n}\Big{[}(\widehat{\beta}_{k}^{*}-\beta_{k}^{*})\frac{\partial^{2}\bar{Q}(\beta^{*},\gamma^{*})}{\partial\beta_{k}\partial\beta^{\top}}\times(\widehat{\beta}^{*}-\beta^{*})\Big{]},\\ S_{3}=\frac{1}{6\sqrt{N}}\sum_{k=1}^{n}\sum_{l=1}^{n}\{(\widehat{\beta}_{k}^{*}-\beta_{k}^{*})(\widehat{\beta}_{l}^{*}-\beta_{l}^{*})\Big{[}\frac{\partial^{3}\bar{Q}(\bar{\beta}^{*},\gamma^{*})}{\partial\beta_{k}\partial\beta_{l}\partial\beta^{\top}}\Big{]}(\widehat{\beta}^{*}-\beta^{*})\},\end{array}

and $\bar{\beta}^{*}=t\beta^{*}+(1-t)\widehat{\beta}^{*}$ for some $t\in(0,1)$ . We will show that (1) $S_{1}$ asymptotically follows a multivariate normal distribution; (2) $S_{2}$ is a bias term; (3) $S_{3}$ is an asymptotically negligible remainder term. Specifically, they are accurately characterized as follows:

$\displaystyle S_{1}$	$\displaystyle=$	$\displaystyle\frac{1}{\sqrt{N}}\sum_{j<i}s_{ij}(\beta^{},\gamma^{})+O_{p}(\frac{b_{n}^{3}\tilde{\varepsilon}_{n}^{3}z_{*}\log n}{n}),$
$\displaystyle S_{2}$	$\displaystyle=$	$\displaystyle\sum_{k=1}^{n}\frac{\sum_{j\neq k}\mu_{kj}^{\prime\prime}(\beta^{},\gamma^{})z_{kj}}{v_{kk}}+O_{p}\left(\frac{b_{n}^{4}\tilde{\varepsilon}_{n}^{3}z_{*}(\log n)^{1/2}}{n^{1/2}}\right),$
$\displaystyle\\|S_{3}\\|_{\infty}$	$\displaystyle=$	$\displaystyle O_{p}(\frac{(\log n)^{3/2}b_{n}^{3}\tilde{\varepsilon}_{n}^{3}}{n^{1/2}}).$

We defer the proofs of the above equations to supplementary material. Substituting the above equations into (30) then gives

\sqrt{N}(\widehat{\gamma}-\gamma^{*})=-\bar{H}^{-1}B_{*}+\bar{H}^{-1}\times\frac{1}{\sqrt{N}}\sum_{i<j}s_{ij}(\beta^{*},\gamma^{*})+O_{p}\left(\frac{b_{n}^{4}\tilde{\varepsilon}_{n}^{3}z_{*}(\log n)^{3/2}}{n^{1/2}}\right).

If $b_{n}^{4}\tilde{\varepsilon}_{n}^{3}z_{*}=o(n^{1/2}/(\log n)^{3/2})$ , then Theorem 3 immediately follows from Proposition 2.

∎

References

Billingsley, (1995) Billingsley, P. (1995). Probability and measure. 3rd edition. Wiley, New York.
Chatterjee et al., (2011) Chatterjee, S., Diaconis, P., and Sly, A. (2011). Random graphs with a given degree sequence. Annals of Applied Probability, 21(4):1400–1435.
Cohen, (2004) Cohen, W. W. (2004). Enron email dataset (retrieved march 12, 2005).
Day et al., (2016) Day, W.-Y., Li, N., and Lyu, M. (2016). Publishing graph degree distribution with node differential privacy. In Proceedings of the 2016 International Conference on Management of Data, pages 123–138, New York, NY, USA.
Dwork et al., (2006) Dwork, C., Mcsherry, F., Nissim, K., and Smith, A. (2006). Calibrating noise to sensitivity in private data analysis. Lecture Notes in Computer Science, pages 265–284.
Dzemski, (2019) Dzemski, A. (2019). An empirical model of dyadic link formation in a network with unobserved heterogeneity. The Review of Economics and Statistics, (To appear).
Fienberg, (2012) Fienberg, S. E. (2012). A brief history of statistical models for network analysis and open challenges. Journal of Computational and Graphical Statistics, 21(4):825–839.
Gragg and Tapia, (1974) Gragg, W. B. and Tapia, R. A. (1974). Optimal error bounds for the newtonï¿½ckantorovich theorem. SIAM Journal on Numerical Analysis, 11(1):10–13.
Graham, (2017) Graham, B. S. (2017). An econometric model of network formation with degree heterogeneity. Econometrica, 85(4):1033–1063.
Hay et al., (2009) Hay, M., Li, C., Miklau, G., and Jensen, D. (2009). Accurate estimation of the degree distribution of private networks. In 2009 Ninth IEEE International Conference on Data Mining, pages 169–178.
Hillar et al., (2012) Hillar, C. J., Lin, S., and Wibisono, A. (2012). Inverses of symmetric, diagonally dominant positive matrices and applications.
Hoeffding, (1963) Hoeffding, W. (1963). Probability inequalities for sums of bounded random variables. Journal of the American Statistical Association, 58(301):13–30.
Kantorovich, (1948) Kantorovich, L. V. (1948). Functional analysis and applied mathematics. Uspekhi Mat Nauk, pages 89–185.
Karwa and Slavković, (2016) Karwa, V. and Slavković, A. (2016). Inference using noisy degrees-differentially private beta model and synthetic graphs. The Annals of Statistics, 44:87–112.
Loéve, (1977) Loéve, M. (1977). Probability theory I. 4th ed. Springer, New York.
Lounici, (2008) Lounici, K. (2008). Sup-norm convergence rate and sign concentration property of lasso and dantzig estimators. Electronic Journal of Statistics, 2:90–102.
Macwan and Patel, (2018) Macwan, K. R. and Patel, S. J. (2018). Node differential privacy in social graph degree publishing. Procedia Computer Science, 143:786 – 793. 8th International Conference on Advances in Computing & Communications (ICACC-2018).
McCullagh and Nelder, (1989) McCullagh, P. and Nelder, J. (1989). Generalized Linear Models, Second Edition. Chapman and Hall.
Narayanan and Shmatikov, (2009) Narayanan, A. and Shmatikov, V. (2009). De-anonymizing social networks. In 2009 30th IEEE Symposium on Security and Privacy, pages 173–187.
Neyman and Scott, (1948) Neyman, J. and Scott, E. (1948). Consistent estimates based on partially consistent observations. Econometrica, (16):1–32.
Nguyen et al., (2016) Nguyen, H. H., Imine, A., and Rusinowitch, M. (2016). Detecting communities under differential privacy. In Proceedings of the 2016 ACM on Workshop on Privacy in the Electronic Society, WPES ¡¯16, page 83¨C93, New York, NY, USA. Association for Computing Machinery.
Nissim et al., (2007) Nissim, K., Raskhodnikova, S., and Smith, A. (2007). Smooth sensitivity and sampling in private data analysis. In Proceedings of the thirty-ninth annual ACM symposium on Theory of computing, pages 75–84.
Wang et al., (2022) Wang, Q., Yan, T., Jiang, B., and Leng, C. (2022). Two-mode networks: inference with as many parameters as actors and differential privacy. Journal of Machine Learning Research, 292(23):1–38.
Wang et al., (2023) Wang, Q., Zhang, Y., and Yan, T. (2023). Asymptotic theory in network models with covariates and a growing number of node parameters. Annals of the Institute of Statistical Mathematics, 75(2):369–392.
Wasserman and Zhou, (2010) Wasserman, L. and Zhou, S. (2010). A statistical framework for differential privacy. Journal of the American Statistical Association, 105(489):375–389.
Yan, (2021) Yan, T. (2021). Directed networks with a differentially private bi-degree sequence. Statistica Sinica, 31(4):pp. 2031–2050.
Yan et al., (2019) Yan, T., Jiang, B., Fienberg, S. E., and Leng, C. (2019). Statistical inference in a directed network model with covariates. Journal of the American Statistical Association, 114(526):857–868.
Yan et al., (2015) Yan, T., Zhao, Y., and Qin, H. (2015). Asymptotic normality in the maximum entropy models on graphs with an increasing number of parameters. Journal of Multivariate Analysis, 133:61 – 76.
Zhou et al., (2007) Zhou, Y., Goldberg, M., Magdon-Ismail, M., and Wallace, W. A. (2007). Strategies for cleaning organizational emails with an application to enron email dataset. In in 5th Conference of North American Association for Computatational Social Organization Science, Pittsburgh. North American Association for Computational Social and Organizational Science.

Differentially private analysis of networks with covariates via a generalized β\beta-model

Abstract

1 Introduction

2 Model and differential privacy

2.1 Generalized β\beta-model

2.2 Differential privacy

Definition 1 (Edge differential privacy).

Definition 2.

Lemma 1.

Lemma 2 (Dwork et al., (2006)).

3 Releasing network statistics and estimation

3.1 Releasing

3.2 Estimation

4 Asymptotic properties

4.1 Consistency

Theorem 1.

Corollary 1.

Corollary 2.

Remark 1.

4.2 Asymptotic normality of β^\widehat{\beta}

Proposition 1.

Theorem 2.

Remark 2.

4.3 Asymptotic normality of γ^\widehat{\gamma}

Proposition 2.

Theorem 3.

Remark 3.

5 Numerical studies

5.1 Simulations

5.2 A real data example

6 Summary and discussion

7 Appendix

7.1 Preliminaries

Lemma 3 (Yan et al., (2015)).

Lemma 4 (Hillar et al., (2012)).

Lemma 5 (Gragg and Tapia, (1974)).

7.2 Error bound between β^γ\widehat{\beta}_{\gamma} and β∗\beta^{*}

Lemma 6.

Lemma 7.

Lemma 8.

Lemma 9.

Proof of Lemma 9.

7.3 Proof of Theorem 1

Lemma 10.

Lemma 11.

Lemma 12.

Proof of Theorem 1.

7.4 Proofs for Theorem 2

Proof of Theorem 2.

7.5 Proof of Theorem 3

Proof of Theorem 3.

References

Differentially private analysis of networks with covariates via a generalized $\beta$ -model

2.1 Generalized $\beta$ -model

4.2 Asymptotic normality of $\widehat{\beta}$

4.3 Asymptotic normality of $\widehat{\gamma}$

7.2 Error bound between $\widehat{\beta}_{\gamma}$ and $\beta^{*}$