
Locally Differentially Private Analysis of Graph Statistics

Jacob Imola
UC San Diego
Takao Murakami
AIST
Kamalika Chaudhuri
UC San Diego

The first and second authors made equal contributions.
Abstract

Differentially private analysis of graphs is widely used for releasing statistics from sensitive graphs while still preserving user privacy. Most existing algorithms, however, are in a centralized privacy model, where a trusted data curator holds the entire graph. As this model raises a number of privacy and security issues – such as the trustworthiness of the curator and the possibility of data breaches – it is desirable to consider algorithms in a more decentralized local model where no server holds the entire graph.

In this work, we consider a local model, and present algorithms for counting subgraphs – a fundamental task for analyzing the connection patterns in a graph – with LDP (Local Differential Privacy). For triangle counts, we present algorithms that use one and two rounds of interaction, and show that an additional round can significantly improve the utility. For $k$-star counts, we present an algorithm that achieves an order optimal estimation error in the non-interactive local model. We provide new lower-bounds on the estimation error for general graph statistics including triangle counts and $k$-star counts. Finally, we perform extensive experiments on two real datasets, and show that it is indeed possible to accurately estimate subgraph counts in the local differential privacy model.

1 Introduction

Analysis of network statistics is a useful tool for finding meaningful patterns in graph data, such as social, e-mail, citation and epidemiological networks. For example, the average degree (i.e., number of edges connected to a node) in a social graph can reveal the average connectivity. Subgraph counts (e.g., the number of triangles, stars, or cliques) can be used to measure centrality properties such as the clustering coefficient, which represents the probability that two friends of an individual will also be friends of one another [41]. However, the vast majority of graph analytics is carried out on sensitive data, which could be leaked through the results of graph analysis. Thus, there is a need to develop solutions that can analyze these graph properties while still preserving the privacy of individuals in the network.

The standard way to analyze graphs with privacy is through differentially private graph analysis [49, 23, 22]. Differential privacy provides individual privacy against adversaries with arbitrary background knowledge, and has currently emerged as the gold standard for private analytics. However, a vast majority of differentially private graph analysis algorithms are in the centralized (or global) model [13, 15, 16, 27, 33, 35, 42, 48, 49, 52, 59, 58], where a single trusted data curator holds the entire graph and releases sanitized versions of the statistics. By assuming a trusted party that can access the entire graph, it is possible to release accurate graph statistics (e.g., subgraph counts [33, 35, 52], degree distribution [16, 27, 48], spectra [59]) and synthetic graphs [15, 58].

In many applications, however, a single trusted curator may not be practicable due to security or logistical reasons. A centralized data holder is vulnerable to security issues such as data breaches and leaks – a growing threat in recent years [38, 51]. Additionally, decentralized social networks [43, 50] (e.g., Diaspora [5]) have no central server that contains an entire social graph, and instead use many servers all over the world, each containing the data of users who have chosen to register there. Finally, a centralized solution is also not applicable to fully decentralized applications, where the server does not automatically hold information connecting users. An example of this is a mobile application that asks each user how many of her friends she has seen today, and sends noisy counts to a central server. In this application, the server does not hold any individual edge, but can still aggregate the responses to determine the average mobility in an area.

The standard privacy solution that does not assume a trusted third party is LDP (Local Differential Privacy) [20, 34]. This is a special case of DP (Differential Privacy) in the local model, where each user obfuscates her personal data by herself and sends the obfuscated data to a (possibly malicious) data collector. Since the data collector does not hold the original personal data, it does not suffer from data leakage issues. Therefore, LDP has recently attracted attention from both academia [8, 11, 10, 24, 31, 32, 39, 45, 57, 62] and industry [56, 17, 55]. However, the use of LDP has mostly been in the context of tabular data where each row corresponds to an individual, and little attention has been paid to LDP for more complex data such as graphs (see Section 2 for details).

Figure 1: Example of subgraph counts.

In this paper, we consider LDP for graph data, and provide algorithms and theoretical performance guarantees for calculating graph statistics in this model. In particular, we focus on counting triangles and $k$-stars – the most basic and useful subgraphs. A triangle is a set of three nodes with three edges (we exclude automorphisms; i.e., #closed triplets $=3\times$ #triangles). A $k$-star consists of a central node connected to $k$ other nodes. Figure 1 shows an example of triangles and $k$-stars. Counting them is a fundamental task of analyzing the connection patterns in a graph, as the clustering coefficient can be calculated from triangle and $2$-star counts as $\frac{3\times\#\text{triangles}}{\#2\text{-stars}}$ (in Figure 1, $\frac{3\times 5}{20}=0.75$).

When we look to protect privacy of relationship information modeled by edges in a graph, we need to pay attention to the fact that some relationship information could be leaked from subgraph counts. For example, suppose that user (node) $v_2$ in Figure 1 knows all edges connected to $v_2$ and all edges between $v_3,\ldots,v_7$ as background knowledge, and that $v_2$ wants to know who are friends with $v_1$. Then “#2-stars = 20” reveals the fact that $v_1$ has three friends, and “#triangles = 5” reveals the fact that the three friends of $v_1$ are $v_3$, $v_4$, and $v_6$. Moreover, a central server that holds all friendship information (i.e., all edges) may face data breaches, as explained above. Therefore, a private algorithm for counting subgraphs in the local model is highly beneficial to individual privacy.

The main challenge in counting subgraphs in the local model is that existing techniques and their analysis do not directly apply. The existing work on LDP for tabular data assumes that each person’s data is independently and identically drawn from an underlying distribution. In graphs, this is no longer the case; e.g., triangles are not independent of one another, because multiple triangles can involve the same edge; $k$-stars are not independent for the same reason. Moreover, complex inter-dependencies involving multiple people are possible in graphs. For example, each user cannot count triangles involving herself, because she cannot see edges between other users; e.g., user $v_1$ cannot see an edge between $v_3$ and $v_4$ in Figure 1.

We show that although this complex dependency among users introduces challenges, it also presents opportunities. Specifically, the interdependency implies that extra interaction between users and a data collector, in which later queries depend on prior responses, may be helpful. In this work, we investigate this issue and provide algorithms for accurately calculating subgraph counts under LDP.

Our contributions.  In this paper, we provide algorithms and corresponding performance guarantees for counting triangles and $k$-stars in graphs under edge Local Differential Privacy. Specifically, our contributions are as follows:

  • For triangles, we present two algorithms. The first is based on Warner’s RR (Randomized Response) [60] and empirical estimation [31, 39, 57]. We then present a more sophisticated algorithm that uses an additional round of interaction between users and the data collector. We provide upper-bounds on the estimation error for each algorithm, and show that the latter can significantly reduce the estimation error.

  • For $k$-stars, we present a simple algorithm using the Laplacian mechanism. We analyze the upper-bound on the estimation error for this algorithm, and show that it is order optimal in terms of the number of users among all LDP mechanisms that do not use additional interaction.

  • We provide lower-bounds on the estimation error for general graph functions including triangle counts and $k$-star counts in the local model. These lower-bounds exceed known upper-bounds in the centralized model, and illustrate the limitations of the local model compared to the centralized one.

  • Finally, we evaluate our algorithms on two real datasets, and show that it is indeed possible to accurately estimate subgraph counts in the local model. In particular, we show that the interactive algorithm for triangle counts and the Laplacian algorithm for $k$-star counts provide small estimation errors when the number of users is large.

We implemented our algorithms with C/C++, and published them as open-source software [1].

2 Related Work

Graph DP.  DP on graphs has been widely studied, with most prior work being in the centralized model [13, 15, 16, 27, 33, 35, 42, 48, 49, 52, 59, 58]. In this model, a number of algorithms have been proposed for releasing subgraph counts [33, 35, 52], degree distributions [16, 27, 48], eigenvalues and eigenvectors [59], and synthetic graphs [15, 58].

There has also been a handful of work on graph algorithms in the local DP model [46, 53, 63, 64, 65]. For example, Qin et al. [46] propose an algorithm for generating synthetic graphs. Zhang et al. [65] propose an algorithm for software usage analysis under LDP, where a node represents a software component (e.g., a function in code) and an edge represents a control-flow between components. Neither of these works focuses on subgraph counts.

Sun et al. [53] propose an algorithm for counting subgraphs in the local model under the assumption that each user allows her friends to see all her connections. However, this assumption does not hold in many practical scenarios; e.g., a Facebook user can change her setting so that friends cannot see her connections. Therefore, we assume that each user knows only her friends rather than all of her friends’ friends. The algorithms in [53] cannot be applied to this setting.

Ye et al. [63] propose a one-round algorithm for estimating graph metrics including the clustering coefficient. They apply Warner’s RR (Randomized Response) to an adjacency matrix. However, this approach introduces a very large bias into triangle counts. In [64], they propose a method for reducing the bias in the estimate of triangle counts. However, the method in [64] introduces some approximation, and it is unclear whether their estimate is unbiased. In this paper, we propose a one-round algorithm for triangles that uses empirical estimation as a post-processing step, and prove that our estimate is unbiased. We also show in Appendix A that our one-round algorithm significantly outperforms the one-round algorithm in [63]. Moreover, we show in Section 5 that our two-rounds algorithm significantly outperforms our one-round algorithm.

Our work also differs from [53, 63, 64] in that we provide lower-bounds on the estimation error.

LDP.  Apart from graphs, a number of works have looked at analyzing statistics (e.g., discrete distribution estimation [8, 24, 31, 32, 39, 57, 62], heavy hitters [11, 10, 45]) under LDP.

However, they use LDP in the context of tabular data, and do not consider the kind of complex interdependency in graph data (as described in Section 1). For example, the RR with empirical estimation is optimal in the low privacy regimes for estimating a distribution for tabular data [31, 32]. We apply the RR and empirical estimation to counting triangles, and show that it is suboptimal and significantly outperformed by a more sophisticated two-rounds algorithm.

Upper/lower-bounds.  Finally, we note that existing work on upper-bounds and lower-bounds cannot be directly applied to our setting. For example, there are upper-bounds [8, 31, 32, 62, 29, 28] and lower-bounds [7, 21, 19, 29, 18, 28, 30] on the estimation error (or sample complexity) in distribution estimation of tabular data. However, they assume that each original data value is independently sampled from an underlying distribution. They cannot be directly applied to our graph setting, because each triangle and each $k$-star involve multiple edges and are not independent (as described in Section 1). Rashtchian et al. [47] provide lower-bounds on the communication complexity (i.e., number of queries) of vector-matrix-vector queries for estimating subgraph counts. However, their lower-bounds are not on the estimation error, and cannot be applied to our problem.

3 Preliminaries

3.1 Graphs and Differential Privacy

Graphs.  Let $\mathbb{N}$, $\mathbb{Z}_{\geq 0}$, $\mathbb{R}$, and $\mathbb{R}_{\geq 0}$ be the sets of natural numbers, non-negative integers, real numbers, and non-negative real numbers, respectively. For $a\in\mathbb{N}$, let $[a]=\{1,2,\ldots,a\}$.

We consider an undirected graph $G=(V,E)$, where $V$ is a set of nodes (i.e., users) and $E$ is a set of edges. Let $n\in\mathbb{N}$ be the number of users in $V$, and let $v_i\in V$ be the $i$-th user; i.e., $V=\{v_1,\ldots,v_n\}$. An edge $(v_i,v_j)\in E$ represents a relationship between users $v_i\in V$ and $v_j\in V$. The number of edges connected to a node is called the degree of the node. Let $d_{max}\in\mathbb{N}$ be the maximum degree (i.e., maximum number of edges connected to a node) in graph $G$. Let $\mathcal{G}$ be the set of possible graphs $G$ on $n$ users. A graph $G\in\mathcal{G}$ can be represented as a symmetric adjacency matrix $\mathbf{A}=(a_{i,j})\in\{0,1\}^{n\times n}$, where $a_{i,j}=1$ if $(v_i,v_j)\in E$ and $a_{i,j}=0$ otherwise.

Types of DP.   DP (Differential Privacy) [23, 22] is known as a gold standard for data privacy. Depending on the underlying architecture, DP can be divided into two types: centralized DP and LDP (Local DP). Centralized DP assumes the centralized model, where a “trusted” data collector collects the original personal data from all users and obfuscates the answer to a query (e.g., a counting query, a histogram query) over the set of personal data. LDP assumes the local model, where each user does not trust even the data collector. In this model, each user obfuscates her personal data by herself and sends the obfuscated data to the data collector.

If the data are represented as a graph, we can consider two types of DP: edge DP and node DP [27, 49]. Edge DP considers two neighboring graphs $G,G'\in\mathcal{G}$ that differ in one edge. In contrast, node DP considers two neighboring graphs $G,G'\in\mathcal{G}$ in which $G'$ is obtained from $G$ by adding or removing one node along with its adjacent edges.

Although Zhang et al. [65] consider node DP in the local model where each node represents a software component, we consider a totally different problem where each node represents a user. In the latter case, node DP requires us to hide the existence of each user along with all her edges. However, many applications in the local model send the identity of each user to a server. For example, we can consider a mobile application that sends to a server how many friends a user met today along with her user ID. In this case, the user may not mind sending her user ID, but may want to hide her edge information (i.e., who she met today). Although we cannot use node DP in such applications, we can use edge DP to deny the presence/absence of each edge (friend). Thus we focus on edge DP in the same way as [46, 53, 63, 64].

Below we explain edge DP in the centralized model.

Centralized DP.  We call edge DP in the centralized model edge centralized DP. Formally, it is defined as follows:

Definition 1 ($\varepsilon$-edge centralized DP).

Let $\varepsilon\in\mathbb{R}_{\geq 0}$. A randomized algorithm $\mathcal{M}$ with domain $\mathcal{G}$ provides $\varepsilon$-edge centralized DP if for any two neighboring graphs $G,G'\in\mathcal{G}$ that differ in one edge and any $S\subseteq\mathrm{Range}(\mathcal{M})$,

$$\Pr[\mathcal{M}(G)\in S]\leq e^{\varepsilon}\Pr[\mathcal{M}(G')\in S]. \qquad (1)$$

Edge centralized DP guarantees that an adversary who has observed the output of $\mathcal{M}$ cannot determine whether it came from $G$ or $G'$ with a certain degree of confidence. The parameter $\varepsilon$ is called the privacy budget. If $\varepsilon$ is close to zero, then $G$ and $G'$ are almost equally likely, which means that an edge in $G$ is strongly protected.

We also note that edge DP can be used to protect $k\in\mathbb{N}$ edges by using the notion of group privacy [23]. Specifically, if $\mathcal{M}$ provides $\varepsilon$-edge centralized DP, then for any two graphs $G,G'\in\mathcal{G}$ that differ in $k$ edges and any $S\subseteq\mathrm{Range}(\mathcal{M})$, we obtain: $\Pr[\mathcal{M}(G)\in S]\leq e^{k\varepsilon}\Pr[\mathcal{M}(G')\in S]$; i.e., $k$ edges are protected with privacy budget $k\varepsilon$.

3.2 Local Differential Privacy

LDP (Local Differential Privacy) [34, 20] is a privacy metric to protect the personal data of each user in the local model. LDP was originally introduced to protect each user’s data record, which is independent of the other records. However, in a graph, each edge is shared by two users. Thus, when we define edge DP in the local model, we should consider what we want to protect. In this paper, we consider two definitions of edge DP in the local model: edge LDP in [46] and relationship DP introduced in this paper. Below, we explain these two definitions in detail.

Edge LDP.  Qin et al. [46] defined edge LDP based on a user’s neighbor list. Specifically, let $\mathbf{a}_i=(a_{i,1},\ldots,a_{i,n})\in\{0,1\}^n$ be the neighbor list of user $v_i$. Note that $\mathbf{a}_i$ is the $i$-th row of the adjacency matrix $\mathbf{A}$ of graph $G$. In other words, graph $G$ can be represented as neighbor lists $\mathbf{a}_1,\ldots,\mathbf{a}_n$.

Then edge LDP is defined as follows:

Definition 2 ($\varepsilon$-edge LDP [46]).

Let $\varepsilon\in\mathbb{R}_{\geq 0}$. For any $i\in[n]$, let $\mathcal{R}_i$ with domain $\{0,1\}^n$ be a randomized algorithm of user $v_i$. $\mathcal{R}_i$ provides $\varepsilon$-edge LDP if for any two neighbor lists $\mathbf{a}_i,\mathbf{a}'_i\in\{0,1\}^n$ that differ in one bit and any $S\subseteq\mathrm{Range}(\mathcal{R}_i)$,

$$\Pr[\mathcal{R}_i(\mathbf{a}_i)\in S]\leq e^{\varepsilon}\Pr[\mathcal{R}_i(\mathbf{a}'_i)\in S]. \qquad (2)$$

Edge LDP in Definition 2 protects a single bit in a neighbor list with privacy budget $\varepsilon$. As with edge centralized DP, edge LDP can also be used to protect $k\in\mathbb{N}$ bits in a neighbor list by using group privacy; i.e., $k$ bits in a neighbor list are protected with privacy budget $k\varepsilon$.

RR (Randomized Response).  As a simple example of a randomized algorithm $\mathcal{R}_i$ providing $\varepsilon$-edge LDP, we explain Warner’s RR (Randomized Response) [60] applied to a neighbor list, which is called the randomized neighbor list in [46].

Given a neighbor list $\mathbf{a}_i\in\{0,1\}^n$, this algorithm outputs a noisy neighbor list $\mathbf{b}=(b_1,\ldots,b_n)\in\{0,1\}^n$ by flipping each bit in $\mathbf{a}_i$ with probability $p=\frac{1}{e^{\varepsilon}+1}$; i.e., for each $j\in[n]$, $b_j\neq a_{i,j}$ with probability $p$ and $b_j=a_{i,j}$ with probability $1-p$. Since $\Pr[\mathcal{R}_i(\mathbf{a}_i)\in S]$ and $\Pr[\mathcal{R}_i(\mathbf{a}'_i)\in S]$ in (2) differ by a factor of at most $e^{\varepsilon}$ for $\mathbf{a}_i$ and $\mathbf{a}'_i$ that differ in one bit, this algorithm provides $\varepsilon$-edge LDP.
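As a concrete illustration, the following is a minimal Python sketch of the randomized neighbor list (not the paper's implementation); the function name and the representation of a neighbor list as a Python list of bits are our own assumptions.

```python
import math
import random

def randomized_neighbor_list(a_i, epsilon):
    """Warner's randomized response applied to a neighbor list a_i
    (a list of 0/1 bits), providing epsilon-edge LDP: each bit is
    flipped independently with probability p = 1 / (e^epsilon + 1)."""
    p = 1.0 / (math.exp(epsilon) + 1.0)
    return [1 - bit if random.random() < p else bit for bit in a_i]

# Example: obfuscate a neighbor list over 5 potential neighbors.
# noisy = randomized_neighbor_list([0, 1, 1, 0, 0], epsilon=1.0)
```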

Relationship DP.  In graphs such as social networks, it is usually the case that two users share knowledge of the presence of an edge between them. To hide their mutual edge, we must consider that both users’ outputs can leak information. We introduce a DP definition called relationship DP that hides one entire edge in graph $G$ during the whole process:

Definition 3 ($\varepsilon$-relationship DP).

Let $\varepsilon\in\mathbb{R}_{\geq 0}$. A tuple of randomized algorithms $(\mathcal{R}_1,\ldots,\mathcal{R}_n)$, each of which has domain $\{0,1\}^n$, provides $\varepsilon$-relationship DP if for any two neighboring graphs $G,G'\in\mathcal{G}$ that differ in one edge and any $S\subseteq\mathrm{Range}(\mathcal{R}_1)\times\ldots\times\mathrm{Range}(\mathcal{R}_n)$,

$$\Pr[(\mathcal{R}_1(\mathbf{a}_1),\ldots,\mathcal{R}_n(\mathbf{a}_n))\in S]\leq e^{\varepsilon}\Pr[(\mathcal{R}_1(\mathbf{a}'_1),\ldots,\mathcal{R}_n(\mathbf{a}'_n))\in S], \qquad (3)$$

where $\mathbf{a}_i$ (resp. $\mathbf{a}'_i$) $\in\{0,1\}^n$ is the $i$-th row of the adjacency matrix of graph $G$ (resp. $G'$).

Relationship DP is the same as decentralized DP in [53] except that the former (resp. latter) assumes that each user knows only her friends (resp. all of her friends’ friends).

Edge LDP assumes that user $v_i$’s edge connected to user $v_j$ and user $v_j$’s edge connected to user $v_i$ are different secrets, with user $v_i$ knowing the former and user $v_j$ knowing the latter. Relationship DP assumes that the two secrets are the same.

Note that the threat model of relationship DP is different from that of LDP – some amount of trust must be given to the other users in relationship DP. Specifically, user $v_i$ must trust user $v_j$ to not leak information about their shared edge. If $k\in\mathbb{N}$ users decide not to follow their protocols, then up to $k$ edges incident to user $v_i$ may be compromised. This trust model is stronger than LDP, which assumes nothing about what other users do, but is much weaker than centralized DP, in which all edges are in the hands of the central party.

Other than the differing threat models, relationship DP and edge LDP are quite closely related:

Proposition 1.

If randomized algorithms $\mathcal{R}_1,\ldots,\mathcal{R}_n$ provide $\varepsilon$-edge LDP, then $(\mathcal{R}_1,\ldots,\mathcal{R}_n)$ provides $2\varepsilon$-relationship DP.

Proof.

The existence of edge $(v_i,v_j)\in E$ affects two elements $a_{i,j},a_{j,i}\in\{0,1\}$ of the adjacency matrix $\mathbf{A}$. Then by group privacy [23], Proposition 1 holds. ∎

Proposition 1 states that when we want to protect one edge as a whole, the privacy budget is at most doubled. Note, however, that some randomized algorithms do not have this doubling issue. For example, we can apply the RR to the $i$-th neighbor list $\mathbf{a}_i$ so that $\mathcal{R}_i$ outputs noisy bits $(b_1,\ldots,b_{i-1})\in\{0,1\}^{i-1}$ for only users $v_1,\ldots,v_{i-1}$ with smaller user IDs; i.e., for each $j\in\{1,\ldots,i-1\}$, $b_j\neq a_{i,j}$ with probability $p=\frac{1}{e^{\varepsilon}+1}$ and $b_j=a_{i,j}$ with probability $1-p$. In other words, we can extend the RR for a neighbor list so that $(\mathcal{R}_1,\ldots,\mathcal{R}_n)$ outputs only the lower triangular part of the noisy adjacency matrix. Then all of $\mathcal{R}_1,\ldots,\mathcal{R}_n$ provide $\varepsilon$-edge LDP. In addition, the existence of edge $(v_i,v_j)\in E$ $(i>j)$ affects only one element $a_{i,j}$ in the lower triangular part of $\mathbf{A}$. Thus, $(\mathcal{R}_1,\ldots,\mathcal{R}_n)$ provides $\varepsilon$-relationship DP (not $2\varepsilon$).

Our proposed algorithm in Section 4.3 also has this property; i.e., it provides both $\varepsilon$-edge LDP and $\varepsilon$-relationship DP.

3.3 Global Sensitivity

In this paper, we use the notion of global sensitivity [23] to provide edge centralized DP or edge LDP.

Let $\mathcal{D}$ be the set of possible input data of a randomized algorithm. In edge centralized DP, $\mathcal{D}=\mathcal{G}$. In edge LDP, $\mathcal{D}=\{0,1\}^n$. Let $f:\mathcal{D}\rightarrow\mathbb{R}$ be a function that takes data $D\in\mathcal{D}$ as input and outputs some statistic $f(D)\in\mathbb{R}$ about the data. The most basic method for providing DP is to add Laplacian noise proportional to the global sensitivity [23].

Definition 4 (Global sensitivity).

The global sensitivity of a function $f:\mathcal{D}\rightarrow\mathbb{R}$ is given by:

$$GS_f=\max_{D,D'\in\mathcal{D}:D\sim D'}|f(D)-f(D')|,$$

where $D\sim D'$ represents that $D$ and $D'$ are neighbors; i.e., they differ in one edge in edge centralized DP, and differ in one bit in edge LDP.

In graphs, the global sensitivity $GS_f$ can be very large. For example, adding one edge may result in an increase of the triangle (resp. $k$-star) count by $n-2$ (resp. $\binom{n}{k-1}$).

One way to significantly reduce the global sensitivity is to use graph projection [16, 35, 48], which removes some neighbors from a neighbor list so that the maximum degree $d_{max}$ is upper-bounded by a predetermined value $\tilde{d}_{max}\in\mathbb{Z}_{\geq 0}$. By using graph projection with $\tilde{d}_{max}\ll n$, we can enforce a small global sensitivity; e.g., the global sensitivity of triangle (resp. $k$-star) counts is at most $\tilde{d}_{max}$ (resp. $\binom{\tilde{d}_{max}}{k-1}$) after the projection.
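The sketch below illustrates this projection step in Python; it is a simplified illustration under our own assumptions (a neighbor list given as a Python list of bits), not the exact implementation used in the paper.

```python
import random

def graph_projection(a_i, d_tilde_max):
    """If the user's degree exceeds d_tilde_max, keep a random subset of
    d_tilde_max neighbors so that the degree (and hence the global
    sensitivity of subgraph counts) is bounded by d_tilde_max."""
    neighbors = [j for j, bit in enumerate(a_i) if bit == 1]
    if len(neighbors) <= d_tilde_max:
        return list(a_i)  # degree already small enough; keep the list as is
    kept = set(random.sample(neighbors, d_tilde_max))
    return [1 if j in kept else 0 for j in range(len(a_i))]
```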

Ideally, we would like to set $\tilde{d}_{max}=d_{max}$ to avoid removing neighbors from a neighbor list (i.e., to avoid the loss of utility). However, the maximum degree $d_{max}$ can leak some information about the original graph $G$. In this paper, we address this issue by privately estimating $d_{max}$ with edge LDP and then using the private estimate of $d_{max}$ as $\tilde{d}_{max}$. This technique is also known as adaptive clipping in differentially private stochastic gradient descent (SGD) [44, 54].

3.4 Graph Statistics and Utility Metrics

Graph statistics.  We consider a graph function that takes a graph $G\in\mathcal{G}$ as input and outputs some graph statistics. Specifically, let $f_{\triangle}:\mathcal{G}\rightarrow\mathbb{Z}_{\geq 0}$ be a graph function that outputs the number of triangles in $G$. For $k\in\mathbb{N}$, let $f_{k\star}:\mathcal{G}\rightarrow\mathbb{Z}_{\geq 0}$ be a graph function that outputs the number of $k$-stars in $G$. For example, if a graph $G$ is as shown in Figure 1, then $f_{\triangle}(G)=5$, $f_{2\star}(G)=20$, and $f_{3\star}(G)=8$. The clustering coefficient can also be calculated from $f_{\triangle}(G)$ and $f_{2\star}(G)$ as $\frac{3f_{\triangle}(G)}{f_{2\star}(G)}=0.75$.

Table 1: Basic notations in this paper.

Symbol | Description
$n$ | Number of users.
$G=(V,E)$ | Graph with $n$ nodes (users) $V$ and edges $E$.
$v_i$ | $i$-th user in $V$.
$d_{max}$ | Maximum degree of $G$.
$\tilde{d}_{max}$ | Upper-bound on $d_{max}$ (used for projection).
$\mathcal{G}$ | Set of possible graphs on $n$ users.
$\mathbf{A}=(a_{i,j})$ | Adjacency matrix.
$\mathbf{a}_i$ | $i$-th row of $\mathbf{A}$ (i.e., neighbor list of $v_i$).
$\mathcal{R}_i$ | Randomized algorithm on $\mathbf{a}_i$.
$f_{\triangle}(G)$ | Number of triangles in $G$.
$f_{k\star}(G)$ | Number of $k$-stars in $G$.

Table 1 shows the basic notations used in this paper.

Utility metrics.  We use the $l_2$ loss (i.e., squared error) [31, 57, 39] and the relative error [12, 14, 61] as utility metrics.

Specifically, let $\hat{f}(G)\in\mathbb{R}$ be an estimate of the graph statistic $f(G)\in\mathbb{R}$. Here $f$ can be instantiated by $f_{\triangle}$ or $f_{k\star}$; i.e., $\hat{f}_{\triangle}(G)$ and $\hat{f}_{k\star}(G)$ are the estimates of $f_{\triangle}(G)$ and $f_{k\star}(G)$, respectively. Let $l_2^2$ be the $l_2$ loss function, which maps the estimate $\hat{f}(G)$ and the true value $f(G)$ to the $l_2$ loss; i.e., $l_2^2(\hat{f}(G),f(G))=(\hat{f}(G)-f(G))^2$. Note that when we use a randomized algorithm providing edge LDP (or edge centralized DP), $\hat{f}(G)$ depends on the randomness in the algorithm. In our theoretical analysis, we analyze the expectation of the $l_2$ loss over this randomness, as in [31, 57, 39].

When $f(G)$ is large, the $l_2$ loss can also be large. Thus in our experiments, we also evaluate the relative error, along with the $l_2$ loss. The relative error is defined as $\frac{|\hat{f}(G)-f(G)|}{\max\{f(G),\eta\}}$, where $\eta\in\mathbb{R}_{\geq 0}$ is a very small positive value. Following the convention [12, 14, 61], we set $\eta=0.001n$ for $f_{\triangle}$ and $f_{k\star}$.
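For concreteness, these two utility metrics can be computed as in the following minimal sketch (function names are ours):

```python
def l2_loss(estimate, true_value):
    """Squared error between the private estimate and the true count."""
    return (estimate - true_value) ** 2

def relative_error(estimate, true_value, n):
    """Relative error with the floor eta = 0.001 * n used for triangle
    and k-star counts."""
    eta = 0.001 * n
    return abs(estimate - true_value) / max(true_value, eta)
```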

4 Algorithms

In the local model, there are several ways to model how the data collector interacts with the users [20, 30, 46]. The simplest model would be to assume that the data collector sends a query $\mathcal{R}_i$ to each user $v_i$ once, and then each user $v_i$ independently sends an answer $\mathcal{R}_i(\mathbf{a}_i)$ to the data collector. In this model, there is one round of interaction between each user and the data collector. We call this the one-round LDP model. For example, the RR for a neighbor list in Section 3.2 assumes this model.

However, in certain cases it may be possible for the data collector to send a query to each user multiple times. This could allow for more powerful queries that result in more accurate subgraph counts [53] or more accurate synthetic graphs [46]. We call this the multiple-rounds LDP model.

In Sections 4.1 and 4.2, we consider the problems of computing $f_{k\star}(G)$ and $f_{\triangle}(G)$ for a graph $G\in\mathcal{G}$ in the one-round LDP model. Our algorithms and bounds highlight limitations of the one-round LDP model. Compared to the centralized graph DP model, the one-round LDP model cannot compute $f_{k\star}(G)$ as accurately. Furthermore, the algorithm for $f_{\triangle}(G)$ does not perform well. In Section 4.3, we propose a more sophisticated algorithm for computing $f_{\triangle}(G)$ in the two-rounds LDP model, and show that it provides a much smaller expected $l_2$ loss than the algorithm in the one-round LDP model. In Section 4.4, we show a general result about lower bounds on the expected $l_2$ loss of graph statistics in LDP. The proofs of all statements in Section 4 are given in Appendix D.

4.1 One-Round Algorithms for $k$-Stars

Algorithm.  We begin with the problem of computing $f_{k\star}(G)$ in the one-round LDP model. For this model, we introduce a simple algorithm using the Laplacian mechanism, and prove that this algorithm can achieve order optimal expected $l_2$ loss among all one-round LDP algorithms.

Data: Graph $G$ represented as neighbor lists $\mathbf{a}_1,\ldots,\mathbf{a}_n\in\{0,1\}^n$, privacy budget $\varepsilon\in\mathbb{R}_{\geq 0}$, $\tilde{d}_{max}\in\mathbb{Z}_{\geq 0}$.
Result: Private estimate of $f_{k\star}(G)$.
1  $\Delta\leftarrow\binom{\tilde{d}_{max}}{k-1}$;
2  for $i=1$ to $n$ do
3      $\mathbf{a}_i\leftarrow$ GraphProjection($\mathbf{a}_i,\tilde{d}_{max}$);
       /* $d_i$ is the degree of user $v_i$. */
4      $d_i\leftarrow\sum_{j=1}^{n}a_{i,j}$;
5      $r_i\leftarrow\binom{d_i}{k}$;
6      $\hat{r}_i\leftarrow r_i+\mathrm{Lap}\left(\frac{\Delta}{\varepsilon}\right)$;
7      release($\hat{r}_i$);
8  end for
9  return $\sum_{i=1}^{n}\hat{r}_i$
Algorithm 1 LocalLapk⋆

Algorithm 1 shows the one-round algorithm for $k$-stars. It takes as input a graph $G$ (represented as neighbor lists $\mathbf{a}_1,\ldots,\mathbf{a}_n\in\{0,1\}^n$), the privacy budget $\varepsilon$, and a non-negative integer $\tilde{d}_{max}\in\mathbb{Z}_{\geq 0}$.

The parameter $\tilde{d}_{max}$ plays a role as an upper-bound on the maximum degree $d_{max}$ of $G$. Specifically, let $d_i\in\mathbb{Z}_{\geq 0}$ be the degree of user $v_i$; i.e., the number of “1”s in her neighbor list $\mathbf{a}_i$. In line 3, user $v_i$ uses a function (denoted by GraphProjection) that performs graph projection [16, 35, 48] for $\mathbf{a}_i$ as follows. If $d_i$ exceeds $\tilde{d}_{max}$, it randomly selects $\tilde{d}_{max}$ neighbors out of the $d_i$ neighbors; otherwise, it uses $\mathbf{a}_i$ as is. This guarantees that each user’s degree never exceeds $\tilde{d}_{max}$; i.e., $d_i\leq\tilde{d}_{max}$ after line 4.

After the graph projection, user $v_i$ counts the number of $k$-stars $r_i\in\mathbb{Z}_{\geq 0}$ of which she is a center (line 5), and adds the Laplacian noise to $r_i$ (line 6). Here, since adding one edge results in an increase of at most $\binom{\tilde{d}_{max}}{k-1}$ $k$-stars, the sensitivity of the $k$-star count for user $v_i$ is at most $\binom{\tilde{d}_{max}}{k-1}$ (after graph projection). Therefore, we add $\mathrm{Lap}(\frac{\Delta}{\varepsilon})$ to $r_i$, where $\Delta=\binom{\tilde{d}_{max}}{k-1}$ and, for $b\in\mathbb{R}_{\geq 0}$, $\mathrm{Lap}(b)$ is a random variable representing Laplacian noise with mean 0 and scale $b$. The final answer of Algorithm 1 is simply the sum of all the noisy $k$-star counts. We denote this algorithm by LocalLapk⋆.
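The following Python sketch mirrors this per-user computation and the final aggregation; it assumes the neighbor lists have already been projected (e.g., with the projection sketch in Section 3.3) and is only an illustration of Algorithm 1, not the released C/C++ implementation.

```python
import numpy as np
from math import comb

def local_lap_kstar(neighbor_lists, epsilon, d_tilde_max, k):
    """Sketch of LocalLapk*: each user counts the k-stars centered at her,
    adds Laplace noise with scale C(d_tilde_max, k-1) / epsilon, and the
    data collector sums the noisy counts."""
    delta = comb(d_tilde_max, k - 1)     # sensitivity of each user's k-star count
    noisy_counts = []
    for a_i in neighbor_lists:           # assumed already projected: sum(a_i) <= d_tilde_max
        d_i = sum(a_i)                   # user's degree
        r_i = comb(d_i, k)               # k-stars of which the user is the center
        noisy_counts.append(r_i + np.random.laplace(0.0, delta / epsilon))
    return sum(noisy_counts)             # estimate of f_{k*}(G); unbiased if projection removed no edges
```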

The value of $\tilde{d}_{max}$ greatly affects the utility. If $\tilde{d}_{max}$ is too large, a large amount of Laplacian noise would be added. If $\tilde{d}_{max}$ is too small, a great number of neighbors would be removed by graph projection. When we have some prior knowledge about the maximum degree $d_{max}$, we can set $\tilde{d}_{max}$ to an appropriate value. For example, the maximum number of connections allowed on Facebook is 5000 [3]. In this case, we can set $\tilde{d}_{max}=5000$, and then graph projection does nothing. Given that the number of Facebook monthly active users is over 2.7 billion [6], $\tilde{d}_{max}=5000$ is much smaller than $n$. For another example, if we know that the degree is smaller than 1000 for most users, then we can set $\tilde{d}_{max}=1000$ and perform graph projection for users whose degrees exceed $\tilde{d}_{max}$.

In some applications, the data collector may not have such prior knowledge about $d_{max}$. In this case, we can privately estimate $d_{max}$ by allowing an additional round of interaction between each user and the data collector, and use the private estimate of $d_{max}$ as $\tilde{d}_{max}$. We describe how to privately estimate $d_{max}$ with edge LDP at the end of Section 4.1.

Theoretical properties.  LocalLapk⋆ has the following guarantees:

Theorem 1.

LocalLapk⋆ provides $\varepsilon$-edge LDP.

Theorem 2.

Let $\hat{f}_{k\star}(G,\varepsilon,\tilde{d}_{max})$ be the output of LocalLapk⋆. Then, for all $k\in\mathbb{N}$, $\varepsilon\in\mathbb{R}_{\geq 0}$, $\tilde{d}_{max}\in\mathbb{Z}_{\geq 0}$, and $G\in\mathcal{G}$ such that the maximum degree $d_{max}$ of $G$ is at most $\tilde{d}_{max}$, $\mathbb{E}[l_2^2(\hat{f}_{k\star}(G,\varepsilon,\tilde{d}_{max}),f_{k\star}(G))]=O\left(\frac{n\tilde{d}_{max}^{2k-2}}{\varepsilon^2}\right)$.

The factor of $n$ in the expected $l_2$ loss of LocalLapk⋆ comes from the fact that we are adding the Laplacian noise $n$ times. In the centralized model, this factor of $n$ is not there, because the central data collector sees all $k$-stars; i.e., the data collector knows $f_{k\star}(G)$. The sensitivity of $f_{k\star}$ is at most $2\binom{\tilde{d}_{max}}{k-1}$ (after graph projection) under edge centralized DP. Therefore, we can consider an algorithm that simply adds the Laplacian noise $\mathrm{Lap}(2\binom{\tilde{d}_{max}}{k-1}/\varepsilon)$ to $f_{k\star}(G)$, and outputs $f_{k\star}(G)+\mathrm{Lap}(2\binom{\tilde{d}_{max}}{k-1}/\varepsilon)$. We denote this algorithm by CentralLapk⋆. Since the bias of the Laplacian noise is 0, CentralLapk⋆ attains an expected $l_2$ loss (= variance) of $O\left(\frac{\tilde{d}_{max}^{2k-2}}{\varepsilon^2}\right)$.

It seems impossible to avoid this factor of $n$ in the one-round LDP model, as the data collector will be dealing with $n$ independent answers to queries. Indeed, this is the case – we prove that the expected $l_2$ error of LocalLapk⋆ is order optimal among all one-round LDP algorithms, and the one-round LDP model cannot improve the factor of $n$.

Corollary 1.

Let $\hat{f}_{k\star}(G,\tilde{d}_{max},\varepsilon)$ be any one-round LDP algorithm that computes $f_{k\star}(G)$ satisfying $\varepsilon$-edge LDP. Then, for all $k,n,\tilde{d}_{max}\in\mathbb{N}$ and $\varepsilon\in\mathbb{R}_{\geq 0}$ such that $n$ is even, there exists a set of graphs $\mathcal{A}\subseteq\mathcal{G}$ on $n$ nodes such that the maximum degree of each $G\in\mathcal{A}$ is at most $\tilde{d}_{max}$, and $\frac{1}{|\mathcal{A}|}\sum_{G\in\mathcal{A}}\mathbb{E}[l_2^2(\hat{f}_{k\star}(G,\tilde{d}_{max},\varepsilon),f_{k\star}(G))]\geq\Omega\left(\frac{e^{2\varepsilon}}{(e^{2\varepsilon}+1)^2}\tilde{d}_{max}^{2k-2}n\right)$.

This is a corollary of a more general result in Section 4.4. Thus, any algorithm computing $k$-stars cannot avoid the factor of $n$ in its $l_2^2$ loss. The $k$-star count is an example where the non-interactive graph LDP model is strictly weaker than the centralized DP model.

Nevertheless, we note that LocalLapk⋆ can accurately calculate $f_{k\star}(G)$ for a large number of users $n$. Specifically, the relative error decreases as $n$ increases because LocalLapk⋆ has a factor of $n$ (not $n^2$) in the expected $l_2$ error, i.e., $\mathbb{E}[(\hat{f}_{k\star}(G,\varepsilon,\tilde{d}_{max})-f_{k\star}(G))^2]=O(n)$ and $f_{k\star}(G)^2\geq\Omega(n^2)$ (when we ignore $\tilde{d}_{max}$ and $\varepsilon$). In our experiments, we show that the relative error of LocalLapk⋆ is small when $n$ is large.

Private calculation of $d_{max}$.  By allowing an additional round between each user and the data collector, we can privately estimate $d_{max}$ and use the private estimate of $d_{max}$ as $\tilde{d}_{max}$. Specifically, we divide the privacy budget $\varepsilon$ into $\varepsilon_0\in\mathbb{R}_{\geq 0}$ and $\varepsilon_1\in\mathbb{R}_{\geq 0}$; i.e., $\varepsilon=\varepsilon_0+\varepsilon_1$. We first estimate $d_{max}$ with $\varepsilon_0$-edge LDP and then run LocalLapk⋆ with the remaining privacy budget $\varepsilon_1$. Note that LocalLapk⋆ with the private calculation of $d_{max}$ results in a two-rounds LDP algorithm.

We consider the following simple algorithm. At the first round, each user $v_i$ adds the Laplacian noise $\mathrm{Lap}(\frac{1}{\varepsilon_0})$ to her degree $d_i$. Let $\hat{d}_i\in\mathbb{R}$ be the noisy degree of $v_i$; i.e., $\hat{d}_i=d_i+\mathrm{Lap}(\frac{1}{\varepsilon_0})$. Then user $v_i$ sends $\hat{d}_i$ to the data collector. Let $\hat{d}_{max}\in\mathbb{R}$ be the maximum value of the noisy degrees; i.e., $\hat{d}_{max}=\max\{\hat{d}_1,\ldots,\hat{d}_n\}$. We call $\hat{d}_{max}$ the noisy max degree. The data collector uses the noisy max degree $\hat{d}_{max}$ as an estimate of $d_{max}$, and sends $\hat{d}_{max}$ back to all users. At the second round, we run LocalLapk⋆ with input $G$, $\varepsilon_1$, and $\lfloor\hat{d}_{max}\rfloor$.
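A minimal sketch of this first-round estimate in Python (the function name is ours; in practice the noisy degrees are computed on the user side before being collected):

```python
import math
import numpy as np

def noisy_max_degree(degrees, epsilon0):
    """Each user adds Laplace(1/epsilon0) noise to her degree (sensitivity 1
    under edge LDP); the data collector takes the maximum of the noisy
    degrees and uses its floor as d_tilde_max."""
    noisy = [d + np.random.laplace(0.0, 1.0 / epsilon0) for d in degrees]
    return math.floor(max(noisy))
```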

At the first round, the calculation of $\hat{d}_{max}$ provides $\varepsilon_0$-edge LDP because each user’s degree has sensitivity 1 under edge LDP. At the second round, Theorem 1 guarantees that LocalLapk⋆ provides $\varepsilon_1$-edge LDP. Then by the composition theorem [23], this two-rounds algorithm provides $\varepsilon$-edge LDP in total ($\varepsilon=\varepsilon_0+\varepsilon_1$).

In our experiments, we show that this algorithm provides utility close to that of LocalLapk⋆ with the true max degree $d_{max}$.

4.2 One-Round Algorithms for Triangles

Algorithm.  Now, we focus our attention on the more challenging $f_{\triangle}$ query. This query is more challenging in the graph LDP model because no user is aware of any entire triangle; i.e., user $v_i$ is not aware of a triangle formed by $(v_i,v_j,v_k)$, because $v_i$ cannot see the edge $(v_j,v_k)\in E$ in graph $G$.

One way to count $f_{\triangle}(G)$ with edge LDP is to apply the RR (Randomized Response) to a neighbor list. For example, user $v_i$ applies the RR to $a_{i,1},\ldots,a_{i,i-1}$ (which correspond to users $v_1,\ldots,v_{i-1}$ with smaller user IDs) in her neighbor list $\mathbf{a}_i$; i.e., we apply the RR to the lower triangular part of the adjacency matrix $\mathbf{A}$, as described in Section 3.2. Then the data collector constructs a noisy graph $G'=(V,E')\in\mathcal{G}$ from the lower triangular part of the noisy adjacency matrix, and estimates the number of triangles from $G'$. However, simply counting the triangles in $G'$ can introduce a significant bias because $G'$ is denser than $G$, especially when $\varepsilon$ is small.

Figure 2: Four types of subgraphs with three nodes.

Through clever post-processing known as empirical estimation [31, 39, 57], we are able to obtain an unbiased estimate of $f_{\triangle}(G)$ from $G'$. Specifically, a subgraph with three nodes can be divided into four types depending on the number of edges. Three nodes with three edges form a triangle. We refer to three nodes with two edges, one edge, and no edges as 2-edges, 1-edge, and no-edges, respectively. Figure 2 shows their shapes. $f_{\triangle}(G)$ can be expressed using $m_3$, $m_2$, $m_1$, and $m_0$ as follows:

Proposition 2.

Let $G'=(V,E')$ be a noisy graph generated by applying the RR to the lower triangular part of $\mathbf{A}$. Let $m_3,m_2,m_1,m_0\in\mathbb{Z}_{\geq 0}$ be respectively the number of triangles, 2-edges, 1-edges, and no-edges in $G'$. Then

$$\mathbb{E}\left[\frac{e^{3\varepsilon}}{(e^{\varepsilon}-1)^3}m_3-\frac{e^{2\varepsilon}}{(e^{\varepsilon}-1)^3}m_2+\frac{e^{\varepsilon}}{(e^{\varepsilon}-1)^3}m_1-\frac{1}{(e^{\varepsilon}-1)^3}m_0\right]=f_{\triangle}(G). \qquad (4)$$

Therefore, the data collector can count $m_3$, $m_2$, $m_1$, and $m_0$ from $G'$, and calculate an unbiased estimate of $f_{\triangle}(G)$ by (4). In Appendix A, we show that the $l_2$ loss is significantly reduced by this empirical estimation.
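In code, this post-processing step of the data collector amounts to the following sketch (the counting of $m_3,\ldots,m_0$ from $G'$ is assumed to be done elsewhere):

```python
import math

def triangle_estimate_from_counts(m3, m2, m1, m0, epsilon):
    """Unbiased empirical estimate of the triangle count from the numbers of
    triangles (m3), 2-edges (m2), 1-edges (m1), and no-edges (m0) in the
    noisy graph G', following equation (4)."""
    e = math.exp(epsilon)
    return (e**3 * m3 - e**2 * m2 + e * m1 - m0) / (e - 1)**3
```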

Data: Graph $G$ represented as neighbor lists $\mathbf{a}_1,\ldots,\mathbf{a}_n\in\{0,1\}^n$, privacy budget $\varepsilon\in\mathbb{R}_{\geq 0}$.
Result: Private estimate of $f_{\triangle}(G)$.
1  for $i=1$ to $n$ do
2      $R_i\leftarrow(RR_{\varepsilon}(a_{i,1}),\ldots,RR_{\varepsilon}(a_{i,i-1}))$;
3      release($R_i$);
4  end for
5  $G'=(V,E')\leftarrow$ UndirectedGraph($R_1,\ldots,R_n$);
   /* Counts $m_3,m_2,m_1,m_0$ in $G'$. */
6  $(m_3,m_2,m_1,m_0)\leftarrow$ Count($G'$);
7  return $\frac{1}{(e^{\varepsilon}-1)^3}(e^{3\varepsilon}m_3-e^{2\varepsilon}m_2+e^{\varepsilon}m_1-m_0)$
Algorithm 2 LocalRR

Algorithm 2 shows this algorithm. In line 2, user $v_i$ applies the RR with privacy budget $\varepsilon$ (denoted by $RR_{\varepsilon}$) to $a_{i,1},\ldots,a_{i,i-1}$ in her neighbor list $\mathbf{a}_i$, and outputs $R_i=(RR_{\varepsilon}(a_{i,1}),\ldots,RR_{\varepsilon}(a_{i,i-1}))$. In other words, we apply the RR to the lower triangular part of $\mathbf{A}$, and there is no overlap between edges sent by users. In line 5, the data collector uses a function (denoted by UndirectedGraph) that converts the bits of $(R_1,\ldots,R_n)$ into an undirected graph $G'=(V,E')$ by adding edge $(v_i,v_j)$ with $i>j$ to $E'$ if and only if the $j$-th bit of $R_i$ is 1. Note that $G'$ is biased, as explained above. In line 6, the data collector uses a function (denoted by Count) that calculates $m_3$, $m_2$, $m_1$, and $m_0$ from $G'$. Finally, the data collector outputs the expression inside the expectation on the left-hand side of (4), which is an unbiased estimator for $f_{\triangle}(G)$ by Proposition 2. We denote this algorithm by LocalRR.

Theoretical properties.  LocalRR provides the following guarantee.

Theorem 3.

LocalRR provides $\varepsilon$-edge LDP and $\varepsilon$-relationship DP.

LocalRR does not have the doubling issue (i.e., it provides not $2\varepsilon$- but $\varepsilon$-relationship DP), because we apply the RR to the lower triangular part of $\mathbf{A}$, as explained in Section 3.2.

Unlike the RR and empirical estimation for tabular data [31], the expected $l_2$ loss of LocalRR is complicated. To simplify the utility analysis, we assume that $G$ is generated from the Erdös-Rényi graph distribution $\mathbf{G}(n,\alpha)$ with edge existence probability $\alpha$; i.e., each edge in $G$ with $n$ nodes is independently generated with probability $\alpha\in[0,1]$.

Theorem 4.

Let $\mathbf{G}(n,\alpha)$ be the Erdös-Rényi graph distribution with edge existence probability $\alpha\in[0,1]$. Let $p=\frac{1}{e^{\varepsilon}+1}$ and $\beta=\alpha(1-p)+(1-\alpha)p$. Let $\hat{f}_{\triangle}(G,\varepsilon)$ be the output of LocalRR. If $G\sim\mathbf{G}(n,\alpha)$, then for all $\varepsilon\in\mathbb{R}_{\geq 0}$, $\mathbb{E}[l_2^2(\hat{f}_{\triangle}(G,\varepsilon),f_{\triangle}(G))]=O\left(\frac{e^{6\varepsilon}}{(e^{\varepsilon}-1)^6}\beta n^4\right)$.

Note that we assume the Erdös-Rényi model only for the utility analysis of LocalRR, and do not assume this model for the other algorithms. The upper-bound of LocalRR in Theorem 4 is less ideal than the upper-bounds of the other algorithms in that it does not hold for all possible graphs $G\in\mathcal{G}$. Nevertheless, we also show that the $l_2$ loss of LocalRR is roughly consistent with Theorem 4 in our experiments using two real datasets (Section 5) and Barabási-Albert graphs [9], which have a power-law degree distribution (Appendix B).

The parameters $\alpha$ and $\beta$ are the edge existence probabilities in the original graph $G$ and noisy graph $G'$, respectively. Although $\alpha$ is very small in a sparse graph, $\beta$ can be large for small $\varepsilon$. For example, if $\alpha\approx 0$ and $\varepsilon=1$, then $\beta\approx\frac{1}{e+1}\approx 0.27$.

Theorem 4 states that for large $n$, the $l_2$ loss of LocalRR ($=O(n^4)$) is much larger than the $l_2$ loss of LocalLapk⋆ ($=O(n)$). This follows from the fact that user $v_i$ is not aware of any triangle formed by $(v_i,v_j,v_k)$, as explained above.

In contrast, counting $f_{\triangle}(G)$ in the centralized model is much easier because the data collector sees all triangles in $G$; i.e., the data collector knows $f_{\triangle}(G)$. The sensitivity of $f_{\triangle}$ is at most $\tilde{d}_{max}$ (after graph projection). Thus, we can consider a simple algorithm that outputs $f_{\triangle}(G)+\mathrm{Lap}(\tilde{d}_{max}/\varepsilon)$. We denote this algorithm by CentralLap△. CentralLap△ attains an expected $l_2$ loss (= variance) of $O\left(\frac{\tilde{d}_{max}^2}{\varepsilon^2}\right)$.

The large $l_2$ loss of LocalRR is caused by the fact that each edge is released independently with some probability of being flipped. In other words, there are three independent random variables that influence any triangle in $G'$. The next algorithm, using interaction, reduces the number of influencing variables from three to one by using the fact that a user knows the existence of two edges of any triangle that involves her.

4.3 Two-Rounds Algorithms for Triangles

Algorithm.  Allowing for two rounds of interaction, we are able to compute $f_{\triangle}$ with a significantly improved $l_2$ loss, albeit with a higher per-user communication overhead. As described in Section 4.2, it is impossible for user $v_i$ to see edge $(v_j,v_k)\in E$ in graph $G=(V,E)$ at the first round. However, if the data collector publishes a noisy graph $G'=(V,E')$ calculated by LocalRR at the first round, then user $v_i$ can see a noisy edge $(v_j,v_k)\in E'$ in the noisy graph $G'$ at the second round. Then user $v_i$ can count the number of noisy triangles formed by $(v_i,v_j,v_k)$ such that $(v_i,v_j)\in E$, $(v_i,v_k)\in E$, and $(v_j,v_k)\in E'$, and send the noisy triangle count with the Laplacian noise to the data collector in a manner analogous to LocalLapk⋆. Since user $v_i$ always knows that the two edges $(v_i,v_j)$ and $(v_i,v_k)$ exist in $G$, there is only one noisy edge in any noisy triangle (whereas all edges are noisy in LocalRR). This is the intuition behind our proposed two-rounds algorithm.

As with the RR in Section 4.2, simply counting the noisy triangles can introduce a bias. Therefore, we calculate an empirical estimate of $f_{\triangle}(G)$ from the noisy triangle counts. Specifically, the following gives the empirical estimate of $f_{\triangle}(G)$:

Proposition 3.

Let $G'=(V,E')$ be a noisy graph generated by applying the RR with privacy budget $\varepsilon_1\in\mathbb{R}_{\geq 0}$ to the lower triangular part of $\mathbf{A}$. Let $p_1=\frac{1}{e^{\varepsilon_1}+1}$. Let $t_i\in\mathbb{Z}_{\geq 0}$ be the number of triplets $(v_i,v_j,v_k)$ such that $j<k<i$, $(v_i,v_j)\in E$, $(v_i,v_k)\in E$, and $(v_j,v_k)\in E'$. Let $s_i\in\mathbb{Z}_{\geq 0}$ be the number of triplets $(v_i,v_j,v_k)$ such that $j<k<i$, $(v_i,v_j)\in E$, and $(v_i,v_k)\in E$. Let $w_i=t_i-p_1 s_i$. Then

$$\mathbb{E}\left[\frac{1}{1-2p_1}\sum_{i=1}^{n}w_i\right]=f_{\triangle}(G). \qquad (5)$$

Note that in Proposition 3, we count only triplets $(v_i,v_j,v_k)$ with $j<k<i$ to use only the lower triangular part of $\mathbf{A}$. $t_i$ is the number of noisy triangles user $v_i$ can see at the second round. $s_i$ is the number of 2-stars of which user $v_i$ is a center. Since $t_i$ and $s_i$ can reveal information about an edge in $G$, user $v_i$ adds the Laplacian noise to $w_i$ ($=t_i-p_1 s_i$) in (5), and sends it to the data collector. Then the data collector calculates an unbiased estimate of $f_{\triangle}(G)$ by (5).

Data: Graph $G$ represented as neighbor lists $\mathbf{a}_1,\ldots,\mathbf{a}_n\in\{0,1\}^n$, two privacy budgets $\varepsilon_1,\varepsilon_2>0$, $\tilde{d}_{max}\in\mathbb{Z}_{\geq 0}$.
Result: Private estimate of $f_{\triangle}(G)$.
1   $p_1\leftarrow\frac{1}{e^{\varepsilon_1}+1}$;
    /* First round. */
2   for $i=1$ to $n$ do
3       $R_i\leftarrow(RR_{\varepsilon_1}(a_{i,1}),\ldots,RR_{\varepsilon_1}(a_{i,i-1}))$;
4       release($R_i$);
5   end for
6   $G'=(V,E')\leftarrow$ UndirectedGraph($R_1,\ldots,R_n$);
    /* Second round. */
7   for $i=1$ to $n$ do
8       $\mathbf{a}_i\leftarrow$ GraphProjection($\mathbf{a}_i,\tilde{d}_{max}$);
9       $t_i\leftarrow|\{(v_i,v_j,v_k):j<k<i,a_{i,j}=a_{i,k}=1,(v_j,v_k)\in E'\}|$;
10      $s_i\leftarrow|\{(v_i,v_j,v_k):j<k<i,a_{i,j}=a_{i,k}=1\}|$;
11      $w_i\leftarrow t_i-p_1 s_i$;
12      $\hat{w}_i\leftarrow w_i+\mathrm{Lap}(\frac{\tilde{d}_{max}}{\varepsilon_2})$;
13      release($\hat{w}_i$);
14  end for
15  return $\frac{1}{1-2p_1}\sum_{i=1}^{n}\hat{w}_i$
Algorithm 3 Local2Rounds

Algorithm 3 contains the formal description of this process. It takes as input a graph $G$, the privacy budgets $\varepsilon_1,\varepsilon_2\in\mathbb{R}_{\geq 0}$ at the first and second rounds, respectively, and a non-negative integer $\tilde{d}_{max}\in\mathbb{Z}_{\geq 0}$. At the first round, we apply the RR to the lower triangular part of $\mathbf{A}$ (i.e., there is no overlap between edges sent by users) and use the UndirectedGraph function to obtain a noisy graph $G'=(V,E')$ in the same way as Algorithm 2. Note that $G'$ is biased. We calculate an unbiased estimate of $f_{\triangle}(G)$ from $G'$ at the second round.

At the second round, each user $v_i$ calculates $\hat{w}_i=w_i+\mathrm{Lap}(\frac{\tilde{d}_{max}}{\varepsilon_2})$ by adding the Laplacian noise to $w_i$ in Proposition 3, whose sensitivity is at most $\tilde{d}_{max}$ (as we will prove in Theorem 5). Finally, we output $\frac{1}{1-2p_1}\sum_{i=1}^{n}\hat{w}_i$, which is an unbiased estimate of $f_{\triangle}(G)$ by Proposition 3. We call this algorithm Local2Rounds.
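The per-user second-round computation can be sketched as follows; here the noisy graph $G'$ is assumed to be given as a set of edge tuples $(j,k)$ with $j>k$, and the function name and data layout are our own illustration rather than the paper's implementation.

```python
import math
import numpy as np

def second_round_response(i, a_i, noisy_edges, epsilon1, epsilon2, d_tilde_max):
    """User v_i's second-round message in Local2Rounds: count noisy
    triangles t_i and 2-stars s_i over pairs j < k < i of her (projected)
    neighbors, then release w_i = t_i - p1 * s_i with Laplace noise."""
    p1 = 1.0 / (math.exp(epsilon1) + 1.0)
    neighbors = [j for j, bit in enumerate(a_i) if bit == 1 and j < i]
    t_i, s_i = 0, 0
    for idx, k in enumerate(neighbors):
        for j in neighbors[:idx]:        # ensures j < k < i
            s_i += 1                     # (v_i, v_j, v_k) is a 2-star centered at v_i
            if (k, j) in noisy_edges:    # noisy edge (v_j, v_k) in G'
                t_i += 1                 # it also forms a noisy triangle
    w_i = t_i - p1 * s_i
    return w_i + np.random.laplace(0.0, d_tilde_max / epsilon2)

# The data collector then outputs sum of the noisy w_i over all users,
# divided by (1 - 2 * p1), an unbiased estimate of the triangle count by (5).
```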

Theoretical properties.  Local2Rounds has the following guarantee.

Theorem 5.

Local2Rounds provides $(\varepsilon_1+\varepsilon_2)$-edge LDP and $(\varepsilon_1+\varepsilon_2)$-relationship DP.

As with LocalRR, Local2Rounds does not have the doubling issue; i.e., it provides $(\varepsilon_1+\varepsilon_2)$-relationship DP rather than $2(\varepsilon_1+\varepsilon_2)$-relationship DP. This follows from the fact that we use only the lower triangular part of $\mathbf{A}$; i.e., we assume $j<k<i$ in counting $t_i$ and $s_i$.

Theorem 6.

Let $\hat{f}_{\triangle}(G,\varepsilon_1,\varepsilon_2,\tilde{d}_{max})$ be the output of Local2Rounds. Then, for all $\varepsilon_1,\varepsilon_2\in\mathbb{R}_{\geq 0}$, $\tilde{d}_{max}\in\mathbb{Z}_{\geq 0}$, and $G\in\mathcal{G}$ such that the maximum degree $d_{max}$ of $G$ is at most $\tilde{d}_{max}$, $\mathbb{E}[l_2^2(\hat{f}_{\triangle}(G,\varepsilon_1,\varepsilon_2,\tilde{d}_{max}),f_{\triangle}(G))]\leq O\left(\frac{e^{\varepsilon_1}}{(1-e^{\varepsilon_1})^2}\left(\tilde{d}_{max}^3 n+\frac{e^{\varepsilon_1}}{\varepsilon_2^2}\tilde{d}_{max}^2 n\right)\right)$.

Theorem 6 means that for triangles, the expected $l_2$ loss is reduced from $O(n^4)$ to $O(\tilde{d}_{max}^3 n)$ by introducing an additional round.

Private calculation of $d_{max}$.  As with $k$-stars, we can privately calculate $d_{max}$ by using the method described in Section 4.1. Furthermore, the private calculation of $d_{max}$ does not increase the number of rounds; i.e., we can run Local2Rounds with the private calculation of $d_{max}$ in two rounds.

Specifically, let $\varepsilon_0\in\mathbb{R}_{\geq 0}$ be the privacy budget for the private calculation of $d_{max}$. At the first round, each user $v_i$ adds $\mathrm{Lap}(\frac{1}{\varepsilon_0})$ to her degree $d_i$, and sends the noisy degree $\hat{d}_i$ ($=d_i+\mathrm{Lap}(\frac{1}{\varepsilon_0})$) to the data collector, along with the outputs $R_i=(RR_{\varepsilon_1}(a_{i,1}),\ldots,RR_{\varepsilon_1}(a_{i,i-1}))$ of the RR. The data collector calculates the noisy max degree $\hat{d}_{max}$ ($=\max\{\hat{d}_1,\ldots,\hat{d}_n\}$) as an estimate of $d_{max}$, and sends it back to all users. At the second round, we run Local2Rounds with input $G$ (represented as $\mathbf{a}_1,\ldots,\mathbf{a}_n$), $\varepsilon_1$, $\varepsilon_2$, and $\lfloor\hat{d}_{max}\rfloor$.

At the first round, the calculation of d^max provides ε0-edge LDP. Note that it provides 2ε0-relationship DP (i.e., it has the doubling issue) because one edge (vi,vj) ∈ E affects both of the degrees di and dj by 1. In addition, Local2Rounds provides (ε1+ε2)-edge LDP and (ε1+ε2)-relationship DP (Theorem 5). Then by the composition theorem [23], this two-rounds algorithm provides (ε0+ε1+ε2)-edge LDP and (2ε0+ε1+ε2)-relationship DP. Although the total privacy budget is larger for relationship DP, the difference (= ε0) can be very small. In fact, we set (ε0,ε1,ε2) = (0.1,0.45,0.45) or (0.2,0.9,0.9) in our experiments (i.e., the difference is 0.1 or 0.2), and show that this algorithm provides almost the same utility as Local2Rounds with the true max degree dmax.

Time complexity.  We also note that Local2Rounds has an advantage over LocalRR in terms of the time complexity.

Specifically, LocalRR is inefficient because the data collector has to count the number of triangles m3m_{3} in the noisy graph GG^{\prime}. Since the noisy graph GG^{\prime} is dense (especially when ε\varepsilon is small) and there are (n3)\binom{n}{3} subgraphs with three nodes in GG^{\prime}, the number of triangles is m3=O(n3)m_{3}=O(n^{3}). Then, the time complexity of LocalRR is also O(n3)O(n^{3}), which is not practical for a graph with a large number of users nn. In fact, we implemented LocalRR (ε=1\varepsilon=1) with C/C++ and measured its running time using one node of a supercomputer (ABCI: AI Bridging Cloud Infrastructure [4]). When n=5000n=5000, 1000010000, 2000020000, and 4000040000, the running time was 138138, 11071107, 93459345, and 9956199561 seconds, respectively; i.e., the running time was almost cubic in nn. We can also estimate the running time for larger nn. For example, when n=1000000n=1000000, LocalRR (ε=1\varepsilon=1) would require about 3535 years (=1107×1003/(3600×24×365))(=1107\times 100^{3}/(3600\times 24\times 365)).

In contrast, the time complexity of Local2Rounds is O(n^2 + n·dmax^2). (When we evaluate Local2Rounds in our experiments, we can apply the RR only to the edges that are required at the second round, i.e., (vj,vk) ∈ G′ in line 8 of Algorithm 3; the time complexity of Local2Rounds is then reduced to O(n·dmax^2) in total, and we confirmed that when n = 1000000, the running time of Local2Rounds was 311 seconds on one node of the ABCI. Note, however, that this shortcut does not protect individual privacy, because it reveals to the data collector the fact that users vj and vk are friends with vi.) The factor of n^2 comes from the fact that the size of the noisy graph G′ is O(n^2). This also causes a large communication overhead, as explained below.

Communication overhead.  In Local2Rounds, each user needs to see the noisy graph G′ of size O(n^2) to count ti and si. This results in a per-user communication overhead of O(n^2). Although we do not simulate the communication overhead in our experiments that use Local2Rounds, the O(n^2) overhead might limit its application to very large graphs. An interesting avenue of future work is how to compress the graph size (e.g., via graph projection or random projection) to reduce both the time complexity and the communication overhead.

  • f_k⋆: Centralized (upper bound) O(dmax^{2k-2}/ε^2); One-round local (lower bound) Ω((e^{2ε}/(e^{2ε}+1)^2)·dmax^{2k-2}·n); One-round local (upper bound) O((dmax^{2k-2}/ε^2)·n); Two-rounds local (upper bound) O((dmax^{2k-2}/ε^2)·n).
  • f_△: Centralized (upper bound) O(dmax^2/ε^2); One-round local (lower bound) Ω((e^{2ε}/(e^{2ε}+1)^2)·dmax^2·n); One-round local (upper bound) O((e^{6ε}/(e^ε-1)^6)·n^4) (when G ∼ G(n,α)); Two-rounds local (upper bound) O((e^ε/(e^ε-1)^2)·(dmax^3·n + (e^ε/ε^2)·dmax^2·n)).
Table 2: Bounds on l2 losses for privately estimating f_k⋆ and f_△ with ε-edge LDP. For upper bounds, we assume that d~max = dmax. For the centralized model, we use the Laplace mechanism. For the one-round f_△ algorithm, we apply Theorem 4 with constant α. For the two-rounds f_△ algorithm, we apply Theorem 6 with ε1 = ε2 = ε/2.

4.4 Lower Bounds

We show a general lower bound on the l2 loss of private estimators f^ of real-valued functions f in the one-round LDP model. Treating ε as a constant, we have shown that when d~max = dmax, the expected l2 loss of LocalLapk⋆ is O(n·dmax^{2k-2}) (Theorem 2). However, in the centralized model, we can use the Laplace mechanism with sensitivity 2\binom{dmax}{k-1} to obtain an l2^2 error of O(dmax^{2k-2}) for fk⋆. Thus, we ask if the factor of n is necessary in the one-round LDP model.

We answer this question affirmatively. We show that for many types of queries f, there is a lower bound on l2^2(f(G), f^(G)) for any private estimator f^ of the form

f^(G)=f~(1(𝐚1),,n(𝐚n)),\hat{f}(G)=\tilde{f}(\mathcal{R}_{1}(\mathbf{a}_{1}),\ldots,\mathcal{R}_{n}(\mathbf{a}_{n})), (6)

where ℛ1, …, ℛn satisfy ε-edge LDP or ε-relationship DP and f~ is an aggregate function that takes ℛ1(a1), …, ℛn(an) as input and outputs f^(G). Here we assume that ℛ1, …, ℛn are run independently, meaning that they are in the one-round setting. For our lower bound, we require that input edges to f be "independent" in the sense that adding an edge to an input graph G independently changes f by at least D ∈ ℝ. The specific structure of input graphs we require is as follows:

Definition 5.

[(n,D)-independent cube for f] Let D ∈ ℝ≥0. For κ ∈ ℕ, let G = (V,E) ∈ 𝒢 be a graph on n = 2κ nodes, and let M = {(v_{i_1}, v_{i_2}), (v_{i_3}, v_{i_4}), …, (v_{i_{2κ-1}}, v_{i_{2κ}})} for integers i_j ∈ [n] be a set of edges such that each of i_1, …, i_{2κ} is distinct (i.e., a perfect matching on the nodes). Suppose that M is disjoint from E; i.e., (v_{i_{2j-1}}, v_{i_{2j}}) ∉ E for any j ∈ [κ]. Let 𝒜 = {(V, E ∪ N) : N ⊆ M}. Note that 𝒜 is a set of 2^κ graphs. We say 𝒜 is an (n,D)-independent cube for f if for all G′ = (V,E′) ∈ 𝒜, we have

f(G)=f(G)+eEMCe,f(G^{\prime})=f(G)+\sum_{e\in E^{\prime}\cap M}C_{e},

where CeC_{e}\in\mathbb{R} satisfies |Ce|D|C_{e}|\geq D for any eMe\in M.

Figure 3: (4,2)(4,2)-independent cube 𝒜\mathcal{A} for ff. In this example, M={(v1,v2),(v3,v4)}M=\{(v_{1},v_{2}),(v_{3},v_{4})\}, G1=(V,E)G_{1}=(V,E), 𝒜={(V,EN):NM}\mathcal{A}=\{(V,E\cup N):N\subseteq M\}, C(v1,v2)=2C_{(v_{1},v_{2})}=2, and C(v3,v4)=3C_{(v_{3},v_{4})}=3. Adding (v1,v2)(v_{1},v_{2}) and (v3,v4)(v_{3},v_{4}) increase ff by 22 and 33, respectively.
Figure 4: Construction of an independent cube for a kk-star function (n=6n=6, dmax=4d_{max}=4). From a 33-regular graph G=(V,E)G=(V,E) and M={(v1,v3),(v2,v6),(v4,v5)}M=\{(v_{1},v_{3}),(v_{2},v_{6}),(v_{4},v_{5})\}, we make a graph G=(V,E)G^{\prime}=(V,E^{\prime}) such that E=EME^{\prime}=E\setminus M. Then 𝒜={(V,EN):NM}\mathcal{A}=\{(V,E^{\prime}\cup N):N\subseteq M\} forms an (n,2(dmax2k1))(n,2\binom{d_{max}-2}{k-1})-independent cube for fkf_{k\star}.

Such a set of inputs has an "independence" property because, regardless of which edges from M have been added before, adding edge e ∈ M always changes f by Ce. Figure 3 shows an example of a (4,2)-independent cube for f.

We can also construct an independent cube for a k-star function as follows. Assume that n is even. It is well known in graph theory that if n is even, then for any d ∈ [n−1] there exists a d-regular graph, i.e., a graph in which every node has degree d [25]. Therefore, there exists a (dmax−1)-regular graph G = (V,E) of size n. Pick an arbitrary perfect matching M on the nodes. Now, let G′ = (V,E′) such that E′ = E ∖ M. Every node in G′ has degree between dmax−2 and dmax−1. Adding an edge in M to G′ will produce at least 2\binom{dmax-2}{k-1} new k-stars. Thus, 𝒜 = {(V, E′ ∪ N) : N ⊆ M} forms an (n, 2\binom{dmax-2}{k-1})-independent cube for fk⋆. Note that the maximum degree of each graph in 𝒜 is at most dmax. Figure 4 shows how to construct an independent cube for a k-star function when n = 6 and dmax = 4.
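This construction can also be checked numerically. The following sketch (illustrative, using NetworkX's random_regular_graph) builds a (dmax−1)-regular graph, removes a perfect matching, and verifies that adding back any matching edge creates at least 2\binom{dmax-2}{k-1} new k-stars; the parameters follow the n = 6, dmax = 4 example of Figure 4.

```python
import math
import networkx as nx

def k_star_count(G, k):
    # f_k*(G): total number of k-stars, i.e., sum over nodes of C(deg, k)
    return sum(math.comb(d, k) for _, d in G.degree())

n, d_max, k = 6, 4, 2
G = nx.random_regular_graph(d_max - 1, n)            # (d_max - 1)-regular graph on n nodes
M = [(2 * i, 2 * i + 1) for i in range(n // 2)]      # an arbitrary perfect matching on the nodes
G_prime = G.copy()
G_prime.remove_edges_from(M)                         # E' = E \ M

base = k_star_count(G_prime, k)
for e in M:
    H = G_prime.copy()
    H.add_edge(*e)
    # each matching edge creates at least 2 * C(d_max - 2, k - 1) new k-stars
    assert k_star_count(H, k) - base >= 2 * math.comb(d_max - 2, k - 1)
```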

Using the structure that the (n,D)(n,D)-independent cube imposes on ff, we can prove a lower bound:

Theorem 7.

Let f^(G)\hat{f}(G) have the form of (6), where 1,,n\mathcal{R}_{1},\ldots,\mathcal{R}_{n} are independently run. Let 𝒜\cal{A} be an (n,D)(n,D)-independent cube for ff. If (1,,n)(\mathcal{R}_{1},\ldots,\mathcal{R}_{n}) provides ε\varepsilon-relationship DP, then we have

\frac{1}{|\mathcal{A}|}\sum_{G\in\mathcal{A}}\mathbb{E}[l_{2}^{2}(f(G),\hat{f}(G))]=\Omega\left(\frac{e^{\varepsilon}}{(e^{\varepsilon}+1)^{2}}nD^{2}\right).

A corollary of Theorem 7 is that if ℛ1, …, ℛn satisfy ε-edge LDP, then they satisfy 2ε-relationship DP, and thus for edge LDP we have a lower bound of Ω((e^{2ε}/(e^{2ε}+1)^2)·n·D^2).

Theorem 7, combined with the fact that there exists an (n, 2\binom{dmax-2}{k-1})-independent cube for a k-star function, implies Corollary 1. In Appendix C, we also construct an (n, dmax/2 − 2)-independent cube for f△ and establish a lower bound of Ω((e^{2ε}/(e^{2ε}+1)^2)·n·dmax^2) for f△.

The upper and lower bounds on the l2l_{2} losses shown in this section appear in Table 2.

5 Experiments

Based on our theoretical results in Section 4, we would like to pose the following questions:

  • For triangle counts, how much does the two-rounds interaction help over a single round in practice?

  • What is the privacy-utility trade-off of our LDP algorithms (i.e., how beneficial are our LDP algorithms)?

We conducted experiments to answer these questions.

5.1 Experimental Set-up

We used the following two large-scale datasets:

IMDB.  The Internet Movie Database (denoted by IMDB) [2] includes a bipartite graph between 896308 actors and 428440 movies. We regarded actors as users. From the bipartite graph, we extracted a graph G* with 896308 nodes (actors), where an edge between two actors indicates that they have appeared in the same movie. There are 57064358 edges in G*, and the average degree in G* is 63.7 (= 57064358/896308).

Orkut.  The Orkut online social network dataset (denoted by Orkut) [36] includes a graph G* with 3072441 users and 117185083 edges. The average degree in G* is 38.1 (= 117185083/3072441). Therefore, Orkut is sparser than IMDB (whose average degree in G* is 63.7).

For each dataset, we randomly selected nn users from the whole graph GG^{*}, and extracted a graph G=(V,E)G=(V,E) with nn users. Then we estimated the number of triangles f(G)f_{\triangle}(G), the number of kk-stars fk(G)f_{k\star}(G), and the clustering coefficient (=3f(G)f2(G)=\frac{3f_{\triangle}(G)}{f_{2\star}(G)}) using ε\varepsilon-edge LDP (or ε\varepsilon-edge centralized DP) algorithms in Section 4. Specifically, we used the following algorithms:

Algorithms for triangles.  For estimating f△(G), we used the following three algorithms: (1) the RR (Randomized Response) with the empirical estimation method in the local model (i.e., LocalRR in Section 4.2), (2) the two-rounds algorithm in the local model (i.e., Local2Rounds in Section 4.3), and (3) the Laplacian mechanism in the centralized model (i.e., CentralLap△ in Section 4.2).

Algorithms for k-stars.  For estimating fk⋆(G), we used the following two algorithms: (1) the Laplacian mechanism in the local model (i.e., LocalLapk⋆ in Section 4.1) and (2) the Laplacian mechanism in the centralized model (i.e., CentralLapk⋆ in Section 4.1).

For each algorithm, we evaluated the l2 loss and the relative error (as described in Section 3.4), while changing the values of n and ε. To stabilize the performance, we attempted γ ∈ ℕ ways of randomly selecting n users from G*, and averaged the utility value over the γ trials. When we changed n from 1000 to 10000, we set γ = 100 because the variance was large. For the other cases, we set γ = 10.
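As a concrete (non-private) reference point for this evaluation, the following sketch computes the ground-truth f△(G), fk⋆(G), and clustering coefficient of a sampled subgraph with NetworkX; it only illustrates the quantities being estimated, not our LDP algorithms, and the function name is illustrative.

```python
import math
import random
import networkx as nx

def ground_truth_stats(G_star, n, k=2, seed=0):
    """Sample n users from the whole graph G* and compute the non-private
    f_triangle(G), f_k*(G), and clustering coefficient 3 f_triangle(G) / f_2*(G)."""
    random.seed(seed)
    G = G_star.subgraph(random.sample(list(G_star.nodes()), n))
    f_tri = sum(nx.triangles(G).values()) // 3                 # each triangle is counted at 3 nodes
    f_kstar = sum(math.comb(d, k) for _, d in G.degree())
    f_2star = sum(math.comb(d, 2) for _, d in G.degree())
    cc = 3 * f_tri / f_2star if f_2star > 0 else 0.0
    return f_tri, f_kstar, cc
```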

In Appendix B, we also report experimental results using artificial graphs based on the Barabási-Albert model [9].

Figure 5: Relation between the number of users nn and the l2l_{2} loss in triangle counts when ε=1\varepsilon=1 (ε1=ε2=12\varepsilon_{1}=\varepsilon_{2}=\frac{1}{2}, d~max=dmax\tilde{d}_{max}=d_{max}). Here we do not evaluate LocalRR when n>10000n>10000, because it is inefficient (see Section 4.3 “Time complexity”).
Figure 6: Relation between the number of users nn and the l2l_{2} loss in kk-star counts when ε=1\varepsilon=1 (ε1=ε2=12\varepsilon_{1}=\varepsilon_{2}=\frac{1}{2}, d~max=dmax\tilde{d}_{max}=d_{max}).

5.2 Experimental Results

Relation between n and the l2 loss.  We first evaluated the l2 loss of the estimates of f△(G), f2⋆(G), and f3⋆(G) while changing the number of users n. Figures 5 and 6 show the results (ε = 1). Here we did not evaluate LocalRR when n was larger than 10000, because LocalRR was inefficient (as described in Section 4.3 "Time complexity"). In Local2Rounds, we set ε1 = ε2 = 1/2. As for d~max, we set d~max = dmax (i.e., we assumed that dmax is publicly available and did not perform graph projection) because we wanted to examine how well our theoretical results hold in our experiments. We also evaluate the effectiveness of the private calculation of dmax at the end of Section 5.2.

Figure 5 shows that Local2Rounds significantly outperforms LocalRR. Specifically, the l2l_{2} loss of Local2Rounds is smaller than that of LocalRR by a factor of about 10210^{2}. The difference between Local2Rounds and LocalRR is larger in Orkut. This is because Orkut is more sparse, as described in Section 5.1. For example, when n=10000n=10000, the maximum degree dmaxd_{max} in GG was 73.573.5 and 27.827.8 on average in IMDB and Orkut, respectively. Recall that for a fixed ε\varepsilon, the expected l2l_{2} loss of Local2Rounds and LocalRR can be expressed as O(ndmax3)O(nd_{max}^{3}) and O(n4)O(n^{4}), respectively. Thus Local2Rounds significantly outperforms LocalRR, especially in sparse graphs.

Figures 5 and 6 show that the l2l_{2} loss is roughly consistent with our upper-bounds in terms of nn. Specifically, LocalRR, Local2Rounds, CentralLap, LocalLapk⋆, and CentralLapk⋆ achieve the expected l2l_{2} loss of O(n4)O(n^{4}), O(ndmax3)O(nd_{max}^{3}), O(dmax2)O(d_{max}^{2}), O(ndmax2k2)O(nd_{max}^{2k-2}), and O(dmax2k2)O(d_{max}^{2k-2}), respectively. Here note that each user’s degree increases roughly in proportion to nn (though the degree is much smaller than nn), as we randomly select nn users from the whole graph GG^{*}. Assuming that dmax=O(n)d_{max}=O(n), Figures 5 and 6 are roughly consistent with the upper-bounds. The figures also show the limitations of the local model in terms of the utility when compared to the centralized model.

Relation between ε\varepsilon and the l2l_{2} loss.  Next we evaluated the l2l_{2} loss when we changed the privacy budget ε\varepsilon in edge LDP. Figure 7 shows the results for triangles and 22-stars (n=10000n=10000). Here we omit the result of 33-stars because it is similar to that of 22-stars. In Local2Rounds, we set ε1=ε2=ε2\varepsilon_{1}=\varepsilon_{2}=\frac{\varepsilon}{2}.

Figure 7 shows that the l2 loss is roughly consistent with our upper-bounds in terms of ε. For example, when we decrease ε from 0.4 to 0.1, the l2 loss increases by a factor of about 5000, 200, and 16 for both the datasets in LocalRR, Local2Rounds, and CentralLap△, respectively. They are roughly consistent with our theoretical results that for small ε, the expected l2 loss of LocalRR, Local2Rounds, and CentralLap△ is O(ε^{-6}), O(ε^{-4}), and O(ε^{-2}), respectively (we used e^ε ≈ ε + 1 to derive the upper-bound of LocalRR for small ε).

Figure 7: Relation between ε\varepsilon in edge LDP and the l2l_{2} loss when n=10000n=10000 (ε1=ε2=ε2\varepsilon_{1}=\varepsilon_{2}=\frac{\varepsilon}{2}, d~max=dmax\tilde{d}_{max}=d_{max}).
Figure 8: Relation between n and the relative error. In the local model, we used Local2Rounds (ε = 1 or 2) and LocalLapk⋆ (ε = 1 or 2) for estimating triangle counts f△(G) and k-star counts fk⋆(G), respectively (d~max = dmax).

Figure 7 also shows that Local2Rounds significantly outperforms LocalRR especially when ε is small, which is also consistent with our theoretical results. Conversely, the difference between LocalRR and Local2Rounds is small when ε is large. This is because when ε is large, the RR outputs the true value with high probability. For example, when ε ≥ 5, the RR outputs the true value with probability e^ε/(e^ε+1) > 0.993. However, LocalRR with such a large value of ε does not guarantee strong privacy, because it outputs the true value in most cases. Local2Rounds significantly outperforms LocalRR when we want to estimate f△(G) or fk⋆(G) with a strong privacy guarantee; e.g., ε ≤ 1 [37].

Relative error.  As the number of users n increases, the numbers of triangles f△(G) and k-stars fk⋆(G) increase, and so does the l2 loss. Therefore, we also evaluated the relative error, as described in Section 3.4.

Figure 8 shows the relation between n and the relative error (we omit the result of 3-stars because it is similar to that of 2-stars). In the local model, we used Local2Rounds and LocalLapk⋆ for estimating f△(G) and fk⋆(G), respectively (we did not use LocalRR, because it is both inaccurate and inefficient). For both algorithms, we set ε = 1 or 2 (ε1 = ε2 = ε/2 in Local2Rounds) and d~max = dmax. Then we estimated the clustering coefficient as 3f^△(G,ε1,ε2,dmax)/f^2⋆(G,ε,dmax), where f^△(G,ε1,ε2,dmax) and f^2⋆(G,ε,dmax) are the estimates of f△(G) and f2⋆(G), respectively. If the estimate of the clustering coefficient is smaller than 0 (resp. larger than 1), we set the estimate to 0 (resp. 1) because the clustering coefficient is always between 0 and 1. In the centralized model, we used CentralLap△ and CentralLapk⋆ (ε = 1 or 2, d~max = dmax) and calculated the clustering coefficient in the same way.

Figure 8 shows that for all cases, the relative error decreases as n increases. This is because both f△(G) and fk⋆(G) significantly increase as n increases. Specifically, let f△,vi(G) ∈ ℤ≥0 be the number of triangles that involve user vi, and fk⋆,vi(G) ∈ ℤ≥0 be the number of k-stars of which user vi is a center. Then f△(G) = (1/3) Σ_{i=1}^n f△,vi(G) and fk⋆(G) = Σ_{i=1}^n fk⋆,vi(G). Since both f△,vi(G) and fk⋆,vi(G) increase with n, both f△(G) and fk⋆(G) increase at least in proportion to n. Thus f△(G)^2 ≥ Ω(n^2) and fk⋆(G)^2 ≥ Ω(n^2). In contrast, Local2Rounds, LocalLapk⋆, CentralLap△, and CentralLapk⋆ achieve expected l2 losses of O(n), O(n), O(1), and O(1), respectively (when we ignore dmax and ε), all of which are smaller than O(n^2). Therefore, the relative error decreases as n increases.

This result demonstrates that we can accurately estimate graph statistics for large nn in the local model. In particular, the relative error is smaller in IMDB because IMDB is denser and includes a larger number of triangles and kk-stars; i.e., the denominator of the relative error is large. For example, when n=200000n=200000 and ε=1\varepsilon=1, the relative error is 0.300.30 and 0.00280.0028 for triangles and 22-stars, respectively. Note that the clustering coefficient requires 2ε2\varepsilon because we need to estimate both f(G)f_{\triangle}(G) and fk(G)f_{k\star}(G). Yet, we can still accurately calculate the clustering coefficient with a moderate privacy budget; e.g., the relative error of the clustering coefficient is 0.300.30 when the privacy budget is 22 (i.e., ε=1\varepsilon=1). If nn is larger, then ε\varepsilon would be smaller at the same value of the relative error.

Figure 9: Relative error when d~max = n (#users), dmax (max degree), or d^max (noisy max degree). We used Local2Rounds (ε = 1 or 2) and LocalLapk⋆ (ε = 1 or 2) for estimating triangle counts f△(G) and k-star counts fk⋆(G), respectively.

Private calculation of dmaxd_{max}.  We have so far assumed that d~max=dmax\tilde{d}_{max}=d_{max} (i.e., dmaxd_{max} is publicly available) in our experiments. We finally evaluate the methods to privately calculate dmaxd_{max} with ε0\varepsilon_{0}-edge LDP (described in Sections 4.1 and 4.3).

Specifically, we used Local2Rounds and LocalLapk⋆ for estimating f△(G) and fk⋆(G), respectively, and evaluated the following three methods for setting d~max: (i) d~max = n; (ii) d~max = dmax; (iii) d~max = d^max, where d^max is the private estimate of dmax (noisy max degree) in Sections 4.1 and 4.3.

We set n = 200000 in IMDB and n = 1600000 in Orkut. Regarding the total privacy budget ε in edge LDP for estimating f△(G) or fk⋆(G), we set ε = 1 or 2. We used ε/10 for privately calculating dmax (i.e., ε0 = ε/10), and the remaining privacy budget 9ε/10 as input to Local2Rounds or LocalLapk⋆. In Local2Rounds, we set ε1 = ε2; i.e., we set (ε0,ε1,ε2) = (0.1,0.45,0.45) or (0.2,0.9,0.9). Then we estimated the clustering coefficient in the same way as Figure 8.

Figure 9 shows the results. The algorithms with d~max = d^max (noisy max degree) achieve a relative error close to (sometimes almost the same as) that of the algorithms with d~max = dmax, and significantly outperform the algorithms with d~max = n. This means that we can privately estimate dmax without a significant loss of utility.

Summary of results.  In summary, our experimental results showed that the estimation error of triangle counts is significantly reduced by introducing the interaction between users and a data collector. The results also showed that we can achieve small relative errors (much smaller than 1) for subgraph counts with privacy budget ε=1\varepsilon=1 or 22 in edge LDP.

As described in Section 1, non-private subgraph counts may reveal some friendship information, and a central server may face data breaches. Our LDP algorithms are highly beneficial because they enable us to analyze the connection patterns in a graph (i.e., subgraph counts) and to understand how likely two friends of an individual are to also be friends of one another (i.e., the clustering coefficient), while strongly protecting individual privacy.

6 Conclusions

We presented a series of algorithms for counting triangles and kk-stars under LDP. We showed that an additional round can significantly reduce the estimation error in triangles, and the algorithm based on the Laplacian mechanism provides an order optimal error in the non-interactive local model. We also showed lower-bounds for general functions including triangles and kk-stars. We conducted experiments using two real datasets, and showed that our algorithms achieve small relative errors, especially when the number of users is large.

As future work, we would like to develop algorithms for other subgraph counts such as cliques and kk-triangles [33].

Acknowledgments

Kamalika Chaudhuri and Jacob Imola would like to thank ONR under N00014-20-1-2334 and UC Lab Fees under LFR 18-548554 for research support. Takao Murakami was supported in part by JSPS KAKENHI JP19H04113.

References

  • [1] Tool: LDP graph statistics. https://github.com/LDPGraphStatistics/LDPGraphStatistics.
  • [2] 12th Annual Graph Drawing Contest. http://mozart.diei.unipg.it/gdcontest/contest2005/index.html, 2005.
  • [3] What to Do When Your Facebook Profile is Maxed Out on Friends. https://authoritypublishing.com/social-media/what-to-do-when-your-facebook-profile-is-maxed-out-on-friends/, 2012.
  • [4] AI bridging cloud infrastructure (ABCI). https://abci.ai/, 2020.
  • [5] The diaspora* project. https://diasporafoundation.org/, 2020.
  • [6] Facebook Reports Third Quarter 2020 Results. https://investor.fb.com/investor-news/press-release-details/2020/Facebook-Reports-Third-Quarter-2020-Results/default.aspx, 2020.
  • [7] Jayadev Acharya, Clément L. Canonne, Yuhan Liu, Ziteng Sun, and Himanshu Tyagi. Interactive inference under information constraints. CoRR, 2007.10976, 2020.
  • [8] Jayadev Acharya, Ziteng Sun, and Huanyu Zhang. Hadamard response: Estimating distributions privately, efficiently, and with little communication. In Proc. AISTATS’19, pages 1120–1129, 2019.
  • [9] Albert-László Barabási. Network Science. Cambridge University Press, 2016.
  • [10] Raef Bassily, Kobbi Nissim, Uri Stemmer, and Abhradeep Thakurta. Practical locally private heavy hitters. In Proc. NIPS’17, pages 2285–2293, 2017.
  • [11] Raef Bassily and Adam Smith. Local, private, efficient protocols for succinct histograms. In Proc. STOC’15, pages 127–135, 2015.
  • [12] Vincent Bindschaedler and Reza Shokri. Synthesizing plausible privacy-preserving location traces. In Proc. S&P’16, pages 546–563, 2016.
  • [13] Jeremiah Blocki, Avrim Blum, Anupam Datta, and Or Sheffet. The johnson-lindenstrauss transform itself preserves differential privacy. In Proc. FOCS’12, pages 410–419, 2012.
  • [14] Rui Chen, Gergely Acs, and Claude Castelluccia. Differentially private sequential data publication via variable-length n-grams. In Proc. CCS’12, pages 638–649, 2012.
  • [15] Xihui Chen, Sjouke Mauw, and Yunior Ramírez-Cruz. Publishing community-preserving attributed social graphs with a differential privacy guarantee. Proceedings on Privacy Enhancing Technologies (PoPETs), (4):131–152, 2020.
  • [16] Wei-Yen Day, Ninghui Li, and Min Lyu. Publishing graph degree distribution with node differential privacy. In Proc. SIGMOD’16, pages 123–138, 2016.
  • [17] Bolin Ding, Janardhan Kulkarni, and Sergey Yekhanin. Collecting telemetry data privately. In Proc. NIPS’17, pages 3574–3583, 2017.
  • [18] John Duchi and Ryan Rogers. Lower bounds for locally private estimation via communication complexity. arXiv:1902.00582, 2019.
  • [19] John Duchi, Martin Wainwright, and Michael Jordan. Minimax optimal procedures for locally private estimation. arXiv:1604.02390, 2017.
  • [20] John C. Duchi, Michael I. Jordan, and Martin J. Wainwright. Local privacy and statistical minimax rates. In Proc. FOCS’13, pages 429–438, 2013.
  • [21] John C. Duchi, Michael I. Jordan, and Martin J. Wainwright. Local privacy, data processing inequalities, and minimax rates. CoRR, 1302.3203, 2014.
  • [22] Cynthia Dwork. Differential privacy. In Proc. ICALP’06, pages 1–12, 2006.
  • [23] Cynthia Dwork and Aaron Roth. The Algorithmic Foundations of Differential Privacy. Now Publishers, 2014.
  • [24] Giulia Fanti, Vasyl Pihur, and Ulfar Erlingsson. Building a RAPPOR with the unknown: Privacy-preserving learning of associations and data dictionaries. Proceedings on Privacy Enhancing Technologies (PoPETs), 2016(3):1–21, 2016.
  • [25] Ghurumuruhan Ganesan. Existence of connected regular and nearly regular graphs. CoRR, 1801.08345, 2018.
  • [26] Aric A. Hagberg, Daniel A. Schult, and Pieter J. Swart. Exploring network structure, dynamics, and function using networkx. In Proceedings of the 7th Python in Science Conference (SciPy’08), pages 11–15, 2008.
  • [27] Michael Hay, Chao Li, Gerome Miklau, and David Jensen. Accurate estimation of the degree distribution of private networks. In Proc. ICDM’09, pages 169–178, 2009.
  • [28] Matthew Joseph, Janardhan Kulkarni, Jieming Mao, and Zhiwei Steven Wu. Locally private Gaussian estimation. arXiv:1811.08382, 2019.
  • [29] Matthew Joseph, Jieming Mao, Seth Neel, and Aaron Roth. The role of interactivity in local differential privacy. arXiv:1904.03564, 2019.
  • [30] Matthew Joseph, Jieming Mao, and Aaron Roth. Exponential separations in local differential privacy. In Proc. SODA’20, pages 515–527, 2020.
  • [31] Peter Kairouz, Keith Bonawitz, and Daniel Ramage. Discrete distribution estimation under local privacy. In Proc. ICML’16, pages 2436–2444, 2016.
  • [32] Peter Kairouz, Sewoong Oh, and Pramod Viswanath. Extremal mechanisms for local differential privacy. Journal of Machine Learning Research, 17(1):492–542, 2016.
  • [33] Vishesh Karwa, Sofya Raskhodnikova, Adam Smith, and Grigory Yaroslavtsev. Private analysis of graph structure. Proceedings of the VLDB Endowment, 4(11):1146–1157, 2011.
  • [34] Shiva Prasad Kasiviswanathan, Homin K. Lee, Kobbi Nissim, and Sofya Raskhodnikova. What can we learn privately? In Proc. FOCS’08, pages 531–540, 2008.
  • [35] Shiva Prasad Kasiviswanathan, Kobbi Nissim, Sofya Raskhodnikova, and Adam Smith. Analyzing graphs with node differential privacy. In Proc. TCC’13, pages 457–476, 2013.
  • [36] Jure Leskovec and Andrej Krevl. SNAP Datasets: Stanford large network dataset collection. http://snap.stanford.edu/data, 2014.
  • [37] Ninghui Li, Min Lyu, and Dong Su. Differential Privacy: From Theory to Practice. Morgan & Claypool Publishers, 2016.
  • [38] Chris Morris. Hackers had a banner year in 2019. https://fortune.com/2020/01/28/2019-data-breach-increases-hackers/, 2020.
  • [39] Takao Murakami and Yusuke Kawamoto. Utility-optimized local differential privacy mechanisms for distribution estimation. In Proc. USENIX Security’19, pages 1877–1894, 2019.
  • [40] Kevin P. Murphy. Machine Learning: A Probabilistic Perspective. The MIT Press, 2012.
  • [41] M. E. J. Newman. Random graphs with clustering. Physical Review Letters, 103(5):058701, 2009.
  • [42] Kobbi Nissim, Sofya Raskhodnikova, and Adam Smith. Smooth sensitivity and sampling in private data analysis. In Proc. STOC’07, pages 75–84, 2007.
  • [43] Thomas Paul, Antonino Famulari, and Thorsten Strufe. A survey on decentralized online social networks. Computer Networks, 75:437–452, 2014.
  • [44] Venkatadheeraj Pichapati, Ananda Theertha Suresh, Felix X. Yu, Sashank J. Reddi, and Sanjiv Kumar. AdaCliP: Adaptive clipping for private SGD. CoRR, 1908.07643, 2019.
  • [45] Zhan Qin, Yin Yang, Ting Yu, Issa Khalil, Xiaokui Xiao, and Kui Ren. Heavy hitter estimation over set-valued data with local differential privacy. In Proc. CCS’16, pages 192–203, 2016.
  • [46] Zhan Qin, Ting Yu, Yin Yang, Issa Khalil, Xiaokui Xiao, and Kui Ren. Generating synthetic decentralized social graphs with local differential privacy. In Proc. CCS’17, pages 425–438, 2017.
  • [47] Cyrus Rashtchian, David P. Woodruff, and Hanlin Zhu. Vector-matrix-vector queries for solving linear algebra, statistics, and graph problems. CoRR, 2006.14015, 2020.
  • [48] Sofya Raskhodnikova and Adam Smith. Efficient lipschitz extensions for high-dimensional graph statistics and node private degree distributions. CoRR, 1504.07912, 2015.
  • [49] Sofya Raskhodnikova and Adam Smith. Differentially Private Analysis of Graphs, pages 543–547. Springer, 2016.
  • [50] Andrea De Salve, Paolo Mori, and Laura Ricci. A survey on privacy in decentralized online social networks. Computer Science Review, 27:154–176, 2018.
  • [51] Tara Seals. Data breaches increase 40% in 2016. https://www.infosecurity-magazine.com/news/data-breaches-increase-40-in-2016/, 2017.
  • [52] Shuang Song, Susan Little, Sanjay Mehta, Staal Vinterbo, and Kamalika Chaudhuri. Differentially private continual release of graph statistics. CoRR, 1809.02575, 2018.
  • [53] Haipei Sun, Xiaokui Xiao, Issa Khalil, Yin Yang, Zhan Qin, Hui (Wendy) Wang, and Ting Yu. Analyzing subgraph statistics from extended local views with decentralized differential privacy. In Proc. CCS’19, pages 703–717, 2019.
  • [54] Om Thakkar, Galen Andrew, and H. Brendan McMahan. Differentially private learning with adaptive clipping. CoRR, 1905.03871, 2019.
  • [55] Abhradeep Guha Thakurta, Andrew H. Vyrros, Umesh S. Vaishampayan, Gaurav Kapoor, Julien Freudiger, Vivek Rangarajan Sridhar, and Doug Davidson. Learning New Words, US Patent 9,594,741, Mar. 14 2017.
  • [56] Úlfar Erlingsson, Vasyl Pihur, and Aleksandra Korolova. RAPPOR: Randomized aggregatable privacy-preserving ordinal response. In Proc. CCS’14, pages 1054–1067, 2014.
  • [57] Tianhao Wang, Jeremiah Blocki, Ninghui Li, and Somesh Jha. Locally differentially private protocols for frequency estimation. In Proc. USENIX Security’17, pages 729–745, 2017.
  • [58] Yue Wang and Xintao Wu. Preserving differential privacy in degree-correlation based graph generation. Transactions on Data Privacy, 6(2), 2013.
  • [59] Yue Wang, Xintao Wu, and Leting Wu. Differential privacy preserving spectral graph analysis. In Proc. PAKDD’13, pages 329–340, 2013.
  • [60] Stanley L. Warner. Randomized response: A survey technique for eliminating evasive answer bias. Journal of the American Statistical Association, 60(309):63–69, 1965.
  • [61] Xiaokui Xiao, Gabriel Bender, Michael Hay, and Johannes Gehrke. ireduct: Differential privacy with reduced relative errors. In Proc. SIGMOD’11, pages 229–240, 2011.
  • [62] Min Ye and Alexander Barg. Optimal schemes for discrete distribution estimation under local differential privacy. In Proc. ISIT’17, pages 759–763, 2017.
  • [63] Qingqing Ye, Haibo Hu, Man Ho Au, Xiaofeng Meng, and Xiaokui Xiao. Towards locally differentially private generic graph metric estimation. In Proc. ICDE’20, pages 1922–1925, 2020.
  • [64] Qingqing Ye, Haibo Hu, Man Ho Au, Xiaofeng Meng, and Xiaokui Xiao. LF-GDPR: A framework for estimating graph metrics with local differential privacy. IEEE Transactions on Knowledge and Data Engineering (Early Access), pages 1–16, 2021.
  • [65] Hailong Zhang, Sufian Latif, Raef Bassily, and Atanas Rountev. Differentially-private control-flow node coverage for software usage analysis. In Proc. USENIX Security’20, pages 1021–1038, 2020.

Appendix A Effectiveness of empirical estimation in LocalRR

In Section 4.2, we presented LocalRR, which uses the empirical estimation method after the RR. Here we show the effectiveness of empirical estimation by comparing LocalRR with the RR without empirical estimation [46, 63].

As the RR without empirical estimation, we applied the RR to the lower triangular part of the adjacency matrix 𝐀\mathbf{A}; i.e., we ran lines 1 to 6 in Algorithm 2. Then we output the number of noisy triangles m3m_{3}. We denote this algorithm by RR w/o emp.
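To make the comparison concrete, the following sketch contrasts the two estimators given the subgraph counts (m3, m2, m1, m0) of the noisy graph G′; the empirical estimate follows expression (4) of Proposition 2 (derived in Appendix D.3), and the function name is illustrative.

```python
import numpy as np

def triangle_estimates(m3, m2, m1, m0, eps):
    """Naive estimate (RR w/o emp) vs. the empirical estimate used by LocalRR."""
    mu = np.exp(eps)
    naive = m3                                                           # just count noisy triangles
    empirical = (mu**3 * m3 - mu**2 * m2 + mu * m1 - m0) / (mu - 1)**3   # expression (4)
    return naive, empirical
```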

Figure 10 shows the l2l_{2} loss of LocalRR and RR w/o emp when we changed nn from 10001000 to 1000010000 or ε\varepsilon in edge LDP from 0.10.1 to 22. The experimental set-up is the same as Section 5.1. Figure 10 shows that LocalRR significantly outperforms RR w/o emp, which means that the l2l_{2} loss is significantly reduced by empirical estimation. As shown in Section 5, the l2l_{2} loss of LocalRR is also significantly reduced by an additional round of interaction.

Figure 10: l2l_{2} loss of LocalRR and the RR without empirical estimation (RR w/o emp).

Appendix B Experiments on Barabási-Albert Graphs

Experimental set-up.  In Section 5, we evaluated our algorithms using two real datasets: IMDB and Orkut. We also evaluated our algorithms using artificial graphs that have power-law degree distributions. We used the BA (Barabási-Albert) model [9] to generate such graphs.

In the BA model, an artificial graph (referred to as a BA graph) is grown by adding new nodes one at a time. Each new node is connected to λ\lambda\in\mathbb{N} existing nodes with probability proportional to the degree of the existing node. In our experiments, we used NetworkX [26], a Python package for graph analysis, to generate BA graphs.

We generated a BA graph G* with 1000000 nodes using NetworkX. For the attachment parameter λ, we set λ = 10 or 50. When λ = 10 (resp. 50), the average degree of G* was 10.0 (resp. 50.0). For each case, we randomly selected n users from the whole graph G*, and extracted a graph G = (V,E) with the n users. Then we estimated the number of triangles f△(G) and the number of 2-stars f2⋆(G). For triangles, we evaluated LocalRR, Local2Rounds, and CentralLap△. For 2-stars, we evaluated LocalLap2⋆ and CentralLap2⋆. In Local2Rounds, we set ε1 = ε2. For d~max, we set d~max = dmax.
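A minimal sketch of this graph generation with NetworkX (the seed is arbitrary and not taken from our experiments; a smaller number of nodes can be used for a quick test):

```python
import networkx as nx

lam = 10                                                     # attachment parameter; we also use 50
G_star = nx.barabasi_albert_graph(n=1000000, m=lam, seed=1)  # BA graph with 1000000 nodes
# the "average degree" reported above is #edges / #nodes, which is close to lam
print(G_star.number_of_edges() / G_star.number_of_nodes())
```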

We evaluated the l2l_{2} loss while changing nn and ε\varepsilon. We attempted γ\gamma\in\mathbb{N} ways to randomly select nn users from GG^{*}, and averaged the l2l_{2} loss over all the γ\gamma ways to randomly select nn users. As with Section 5, we set γ=100\gamma=100 and changed nn from 10001000 to 1000010000 while fixing ε=1\varepsilon=1. Then we set γ=10\gamma=10 and changed ε\varepsilon from 0.10.1 to 22 while fixing n=10000n=10000.

Experimental results.  Figure 11 shows the results. Overall, Figure 11 has a similar tendency to Figures 5, 6, and 7. For example, Local2Rounds significantly outperforms LocalRR, especially when the graph G is sparse; i.e., λ = 10. In Local2Rounds, CentralLap△, LocalLap2⋆, and CentralLap2⋆, the l2 loss increases with increase in λ. This is because the maximum degree dmax (= d~max) increases with increase in λ.

Figure 11: l2l_{2} loss in the Barabási-Albert graph datasets (left: ε=1\varepsilon=1, right: n=10000n=10000). We set the attachment parameter λ\lambda in the BA model to λ=10\lambda=10 or 5050, and d~max\tilde{d}_{max} to d~max=dmax\tilde{d}_{max}=d_{max}.

Figure 11 also shows that the l2l_{2} loss is roughly consistent with our upper-bounds in Section 4. For example, recall that LocalRR, Local2Rounds, CentralLap, LocalLap2⋆, and CentralLap2⋆ achieve the expected l2l_{2} loss of O(n4)O(n^{4}), O(ndmax3)O(nd_{max}^{3}), O(dmax2)O(d_{max}^{2}), O(ndmax2)O(nd_{max}^{2}), and O(dmax2)O(d_{max}^{2}), respectively. Assuming that dmax=O(n)d_{max}=O(n), the left panels of Figure 11 are roughly consistent with these upper-bounds. In addition, the right panels of Figure 11 show that when we set λ=10\lambda=10 and decrease ε\varepsilon from 0.40.4 to 0.10.1, the l2l_{2} loss increases by a factor of about 38003800, 250250, and 1616 in LocalRR, Local2Rounds, and CentralLap, respectively. They are roughly consistent with our upper-bounds – for small ε\varepsilon, the expected l2l_{2} loss of LocalRR, Local2Rounds, and CentralLap is O(ε6)O(\varepsilon^{-6}), O(ε4)O(\varepsilon^{-4}), and O(ε2)O(\varepsilon^{-2}), respectively.

In summary, for both the two real datasets and the BA graphs, our experimental results showed the following findings: (1) Local2Rounds significantly outperforms LocalRR, especially when the graph GG is sparse; (2) our experimental results are roughly consistent with our upper-bounds.

Appendix C Construction of an (n, dmax/2 − 2)-independent cube for f△

Suppose that n is even and dmax is divisible by 4. Since dmax < n, it is possible to write n = η1·(dmax/2) + η2 for integers η1, η2 such that η1 ≥ 1 and 1 ≤ η2 < dmax/2. Because η1·(dmax/2) and n are even, η2 must be even. Now, we can write n = (η1−1)·(dmax/2) + (η2 + dmax/2). Thus, we can define a graph G = (V,E) on n nodes consisting of (η1−1) cliques of even size dmax/2 and one final clique of even size η2 + dmax/2 ∈ (dmax/2, dmax), with all cliques disjoint.

Since G = (V,E) consists of even-sized cliques, it contains a perfect matching M. Figure 12 shows examples of G and M, where n = 14, dmax = 8, η1 = 3, and η2 = 2. Let G′ = (V,E′) such that E′ = E ∖ M. Let 𝒜 = {(V, E′ ∪ N) : N ⊆ M}. Each edge in G is part of at least dmax/2 − 2 triangles. For each pair of edges in M, the triangles of G of which they are part are disjoint. Thus, for any edge e ∈ M, removing e from a graph in 𝒜 will remove at least dmax/2 − 2 triangles. This implies that 𝒜 is an (n, dmax/2 − 2)-independent cube for f△.
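The following sketch checks this construction on the Figure 12 example (n = 14, dmax = 8, η1 = 3, η2 = 2): it builds the disjoint cliques, removes a perfect matching inside each clique, and verifies that adding back any matching edge creates at least dmax/2 − 2 triangles (illustrative code, not part of our algorithms).

```python
import networkx as nx

def triangle_count(G):
    return sum(nx.triangles(G).values()) // 3    # each triangle is counted at 3 nodes

clique_sizes = [4, 4, 6]     # two cliques of size d_max/2 = 4 and one of size eta_2 + d_max/2 = 6
d_max = 8
G, M, start = nx.Graph(), [], 0
for c in clique_sizes:
    nodes = list(range(start, start + c))
    G.add_edges_from(nx.complete_graph(nodes).edges())
    M += [(nodes[i], nodes[i + 1]) for i in range(0, c, 2)]   # a perfect matching inside the clique
    start += c

G_prime = G.copy()
G_prime.remove_edges_from(M)                     # E' = E \ M
base = triangle_count(G_prime)
for e in M:
    H = G_prime.copy()
    H.add_edge(*e)
    assert triangle_count(H) - base >= d_max // 2 - 2    # each matching edge adds >= d_max/2 - 2 triangles
```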

Figure 12: Examples of GG and MM for constructing an independent cube for ff_{\triangle} (n=14n=14, dmax=8d_{max}=8, η1=3\eta_{1}=3, η2=2\eta_{2}=2).

Appendix D Proof of Statements in Section 4

Here we prove the statements in Section 4. Our proofs will repeatedly use the well-known bias-variance decomposition [40], which we briefly explain below. We denote the variance of the random variable XX by 𝕍[X]\mathbb{V}[X]. If we are producing a private, randomized estimate f^(G)\hat{f}(G) of the graph function f(G)f(G), then the expected l2l_{2} loss (over the randomness in the algorithm) can be written as:

𝔼[l22(f^(G),f(G))]=(𝔼[f^(G)]f(G))2+𝕍[f^(G)].\operatorname{\mathbb{E}}[l_{2}^{2}(\hat{f}(G),f(G))]=\left(\operatorname{\mathbb{E}}[\hat{f}(G)]-f(G)\right)^{2}+\operatorname{\mathbb{V}}[\hat{f}(G)]. (7)

The first term is the bias, and the second term is the variance. If the estimate is unbiased (i.e., 𝔼[f^(G)]=f(G)\operatorname{\mathbb{E}}[\hat{f}(G)]=f(G)), then the expected l2l_{2} loss is equal to the variance.

D.1 Proof of Theorem 1

Let i\mathcal{R}_{i} be LocalLapk⋆. Let di,di0d_{i},d^{\prime}_{i}\in\mathbb{Z}_{\geq 0} be the number of “1”s in two neighbor lists 𝐚i,𝐚i{0,1}n\mathbf{a}_{i},\mathbf{a}^{\prime}_{i}\in\{0,1\}^{n} that differ in one bit. Let ri=(dik)r_{i}=\binom{d_{i}}{k} and ri=(dik)r^{\prime}_{i}=\binom{d^{\prime}_{i}}{k}. Below we consider two cases about did_{i}: when di<d~maxd_{i}<\tilde{d}_{max} and when did~maxd_{i}\geq\tilde{d}_{max}.

Case 1: di < d~max.  In this case, neither 𝐚i nor 𝐚′i changes after graph projection, since d′i ≤ di + 1 ≤ d~max. Then we obtain:

Pr[i(𝐚i)=r^i]\displaystyle\Pr[\mathcal{R}_{i}(\mathbf{a}_{i})=\hat{r}_{i}] =exp(ε|r^iri|Δ)\displaystyle=\exp\left(-\frac{\varepsilon|\hat{r}_{i}-r_{i}|}{\Delta}\right)
Pr[i(𝐚i)=r^i]\displaystyle\Pr[\mathcal{R}_{i}(\mathbf{a}^{\prime}_{i})=\hat{r}_{i}] =exp(ε|r^iri|Δ),\displaystyle=\exp\left(-\frac{\varepsilon|\hat{r}_{i}-r^{\prime}_{i}|}{\Delta}\right),

where Δ=(d~maxk1)\Delta=\binom{\tilde{d}_{max}}{k-1}. Therefore,

Pr[i(𝐚i)=r^i]Pr[i(𝐚i)=r^i]\displaystyle\frac{\Pr[\mathcal{R}_{i}(\mathbf{a}_{i})=\hat{r}_{i}]}{\Pr[\mathcal{R}_{i}(\mathbf{a}^{\prime}_{i})=\hat{r}_{i}]} =exp(ε|r^iri|Δε|r^iri|Δ)\displaystyle=\exp\left(\frac{\varepsilon|\hat{r}_{i}-r^{\prime}_{i}|}{\Delta}-\frac{\varepsilon|\hat{r}_{i}-r_{i}|}{\Delta}\right)
exp(ε|riri|Δ)\displaystyle\leq\exp\left(\frac{\varepsilon|r^{\prime}_{i}-r_{i}|}{\Delta}\right) (8)
(by the triangle inequality).\displaystyle\hskip 11.38109pt(\text{by the triangle inequality}).

If di=di+1d^{\prime}_{i}=d_{i}+1, then |riri||r^{\prime}_{i}-r_{i}| in (8) can be written as follows:

|riri|=(di+1k)(dik)=(dik1)<(d~maxk1)=Δ,\displaystyle|r^{\prime}_{i}-r_{i}|=\binom{d_{i}+1}{k}-\binom{d_{i}}{k}=\binom{d_{i}}{k-1}<\binom{\tilde{d}_{max}}{k-1}=\Delta,

Since we add Lap(Δε)\textrm{Lap}(\frac{\Delta}{\varepsilon}) to rir_{i}, we obtain:

Pr[i(𝐚i)=r^i]eεPr[i(𝐚i)=r^i].\displaystyle\Pr[\mathcal{R}_{i}(\mathbf{a}_{i})=\hat{r}_{i}]\leq e^{\varepsilon}\Pr[\mathcal{R}_{i}(\mathbf{a}^{\prime}_{i})=\hat{r}_{i}]. (9)

If di=di1d^{\prime}_{i}=d_{i}-1, then |riri|=(dik)(di1k)=(di1k1)<Δ|r^{\prime}_{i}-r_{i}|=\binom{d_{i}}{k}-\binom{d_{i}-1}{k}=\binom{d_{i}-1}{k-1}<\Delta and (9) holds. Therefore, LocalLapk⋆ provides ε\varepsilon-edge LDP.

Case 2: did~maxd_{i}\geq\tilde{d}_{max}.  Assume that di=di+1d^{\prime}_{i}=d_{i}+1. In this case, di>d~maxd^{\prime}_{i}>\tilde{d}_{max}. Therefore, did^{\prime}_{i} becomes d~max\tilde{d}_{max} after graph projection. In addition, did_{i} also becomes d~max\tilde{d}_{max} after graph projection. Therefore, we obtain di=di=d~maxd_{i}=d^{\prime}_{i}=\tilde{d}_{max} after graph projection. Thus Pr[i(𝐚i)=r^i]=Pr[i(𝐚i)=r^i]\Pr[\mathcal{R}_{i}(\mathbf{a}_{i})=\hat{r}_{i}]=\Pr[\mathcal{R}_{i}(\mathbf{a}^{\prime}_{i})=\hat{r}_{i}].

Assume that di=di1d^{\prime}_{i}=d_{i}-1. If di>d~maxd_{i}>\tilde{d}_{max}, then di=di=d~maxd_{i}=d^{\prime}_{i}=\tilde{d}_{max} after graph projection. Thus Pr[i(𝐚i)=r^i]=Pr[i(𝐚i)=r^i]\Pr[\mathcal{R}_{i}(\mathbf{a}_{i})=\hat{r}_{i}]=\Pr[\mathcal{R}_{i}(\mathbf{a}^{\prime}_{i})=\hat{r}_{i}]. If di=d~maxd_{i}=\tilde{d}_{max}, then (9) holds. Therefore, LocalLapk⋆ provides ε\varepsilon-edge LDP. ∎

D.2 Proof of Theorem 2

Assuming the maximum degree dmaxd_{max} of GG is at most d~max\tilde{d}_{max}, the only randomness in the algorithm will be the Laplace noise since graph projection will not occur. Since the Laplacian noise Lap(Δε)\textrm{Lap}(\frac{\Delta}{\varepsilon}) has mean 0, the estimate f^k(G,ε,d~max)\hat{f}_{k\star}(G,\varepsilon,\tilde{d}_{max}) is unbiased. Then by the bias-variance decomposition [40], the expected l2l_{2} loss 𝔼[l22(f^k(G,ε,d~max),fk(G))]\mathbb{E}[l_{2}^{2}(\hat{f}_{k\star}(G,\varepsilon,\tilde{d}_{max}),\allowbreak f_{k\star}(G))] is equal to the variance of f^k(G,ε,d~max)\hat{f}_{k\star}(G,\varepsilon,\tilde{d}_{max}). The variance of f^k(G,ε,d~max)\hat{f}_{k\star}(G,\varepsilon,\tilde{d}_{max}) can be written as follows:

𝕍[f^k(G,ε,d~max)]\displaystyle\mathbb{V}[\hat{f}_{k\star}(G,\varepsilon,\tilde{d}_{max})] =𝕍[i=1nLap(Δε)]\displaystyle=\mathbb{V}\left[\sum_{i=1}^{n}\textrm{Lap}\left(\frac{\Delta}{\varepsilon}\right)\right]
=nΔ2ε2.\displaystyle=\frac{n\Delta^{2}}{\varepsilon^{2}}.

Since Δ=(d~maxk1)=O(d~maxk1)\Delta=\binom{\tilde{d}_{max}}{k-1}=O(\tilde{d}_{max}^{k-1}), we obtain:

𝔼[l22(f^k(G,ε,d~max),fk(G))]\displaystyle\mathbb{E}[l_{2}^{2}(\hat{f}_{k\star}(G,\varepsilon,\tilde{d}_{max}),f_{k\star}(G))] =𝕍[f^k(G,ε,d~max)]\displaystyle=\mathbb{V}[\hat{f}_{k\star}(G,\varepsilon,\tilde{d}_{max})]
=O(nd~max2k2ε2).\displaystyle=O\left(\frac{n\tilde{d}_{max}^{2k-2}}{\varepsilon^{2}}\right).

D.3 Proof of Proposition 2

Let μ=eε\mu=e^{\varepsilon} and 𝐐[0,1]4×4\mathbf{Q}\in[0,1]^{4\times 4} be a 4×44\times 4 matrix such that:

𝐐=1(μ+1)3(μ33μ23μ1μ2μ3+2μ2μ2+1μμ2μ2+1μ3+2μμ213μ3μ2μ3).\displaystyle\mathbf{Q}=\frac{1}{(\mu+1)^{3}}\left(\begin{array}[]{cccc}\mu^{3}&3\mu^{2}&3\mu&1\\ \mu^{2}&\mu^{3}+2\mu&2\mu^{2}+1&\mu\\ \mu&2\mu^{2}+1&\mu^{3}+2\mu&\mu^{2}\\ 1&3\mu&3\mu^{2}&\mu^{3}\end{array}\right). (14)

Let c3, c2, c1, c0 ∈ ℤ≥0 be respectively the numbers of triangles, 2-edges, 1-edges, and no-edges in G. Then we obtain:

(𝔼[m3],𝔼[m2],𝔼[m1],𝔼[m0])=(c3,c2,c1,c0)𝐐.\displaystyle(\mathbb{E}[m_{3}],\mathbb{E}[m_{2}],\mathbb{E}[m_{1}],\mathbb{E}[m_{0}])=(c_{3},c_{2},c_{1},c_{0})\mathbf{Q}. (15)

In other words, 𝐐\mathbf{Q} is a transition matrix from a type of subgraph (i.e., triangle, 2-edges, 1-edge, or no-edge) in GG to a type of subgraph in GG^{\prime}.

Let c^3,c^2,c^1,c^0\hat{c}_{3},\hat{c}_{2},\hat{c}_{1},\hat{c}_{0}\in\mathbb{R} be the empirical estimate of (c3,c2,c1,c0)(c_{3},c_{2},c_{1},c_{0}). By (15), they can be written as follows:

(c^3,c^2,c^1,c^0)=(m3,m2,m1,m0)𝐐1.\displaystyle(\hat{c}_{3},\hat{c}_{2},\hat{c}_{1},\hat{c}_{0})=(m_{3},m_{2},m_{1},m_{0})\mathbf{Q}^{-1}. (16)

Let 𝐐i,j1\mathbf{Q}_{i,j}^{-1} be the (i,ji,j)-th element of 𝐐1\mathbf{Q}^{-1}. By using Cramer’s rule, we obtain:

𝐐1,11\displaystyle\mathbf{Q}_{1,1}^{-1} =μ3(μ1)3,𝐐2,11=μ2(μ1)3,\displaystyle=\textstyle{\frac{\mu^{3}}{(\mu-1)^{3}}},~{}\mathbf{Q}_{2,1}^{-1}=\textstyle{-\frac{\mu^{2}}{(\mu-1)^{3}}}, (17)
𝐐3,11\displaystyle\mathbf{Q}_{3,1}^{-1} =μ(μ1)3,𝐐4,11=1(μ1)3.\displaystyle=\textstyle{\frac{\mu}{(\mu-1)^{3}}},~{}\mathbf{Q}_{4,1}^{-1}=\textstyle{-\frac{1}{(\mu-1)^{3}}}. (18)

By (16), (17), and (18), we obtain:

c^3=μ3(μ1)3m3μ2(μ1)3m2+μ(μ1)3m11(μ1)3m0.\displaystyle\textstyle{\hat{c}_{3}=\frac{\mu^{3}}{(\mu-1)^{3}}m_{3}-\frac{\mu^{2}}{(\mu-1)^{3}}m_{2}+\frac{\mu}{(\mu-1)^{3}}m_{1}-\frac{1}{(\mu-1)^{3}}m_{0}.}

Since μ=eε\mu=e^{\varepsilon} and the empirical estimate is unbiased [31, 57], we obtain (4) in Proposition 2. ∎
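As a quick sanity check of (17) and (18), the following SymPy snippet verifies symbolically that (μ^3, −μ^2, μ, −1)/(μ−1)^3 is the first column of 𝐐^{−1}, i.e., that 𝐐 times this vector equals the first standard basis vector (this is only a verification aid, not part of the proof).

```python
import sympy as sp

mu = sp.symbols('mu', positive=True)
Q = sp.Matrix([
    [mu**3, 3*mu**2,      3*mu,         1],
    [mu**2, mu**3 + 2*mu, 2*mu**2 + 1,  mu],
    [mu,    2*mu**2 + 1,  mu**3 + 2*mu, mu**2],
    [1,     3*mu,         3*mu**2,      mu**3],
]) / (mu + 1)**3

first_col = sp.Matrix([mu**3, -mu**2, mu, -1]) / (mu - 1)**3   # claimed first column of Q^{-1}
e1 = sp.Matrix([1, 0, 0, 0])
assert all(sp.simplify(entry) == 0 for entry in (Q * first_col - e1))
```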

D.4 Proof of Theorem 3

Since LocalRR applies the RR to the lower triangular part of the adjacency matrix 𝐀\mathbf{A}, it provides ε\varepsilon-edge LDP for (R1,,Rn)(R_{1},\ldots,R_{n}). Lines 5 to 8 in Algorithm 2 are post-processing of (R1,,Rn)(R_{1},\ldots,R_{n}). Thus, by the immunity to post-processing [23], LocalRR provides ε\varepsilon-edge LDP for the output 1(μ1)3(μ3m3μ2m2+μm1m0)\frac{1}{(\mu-1)^{3}}(\mu^{3}m_{3}-\mu^{2}m_{2}+\mu m_{1}-m_{0}).

In addition, the existence of edge (vi,vj)E(v_{i},v_{j})\in E (i>j)(i>j) affects only one element ai,ja_{i,j} in the lower triangular part of 𝐀\mathbf{A}. Therefore, LocalRR provides ε\varepsilon-relationship DP.

D.5 Proof of Theorem 4

By Proposition 2, the estimate f^(G,ε)\hat{f}_{\triangle}(G,\varepsilon) by LocalRR is unbiased. Then by the bias-variance decomposition [40], the expected l2l_{2} loss 𝔼[l22(f^(G,ε),f(G))]\mathbb{E}[l_{2}^{2}(\hat{f}_{\triangle}(G,\varepsilon),f_{\triangle}(G))] is equal to the variance of f^(G,ε)\hat{f}_{\triangle}(G,\varepsilon). Let a3=μ3(μ1)3a_{3}=\frac{\mu^{3}}{(\mu-1)^{3}}, a2=μ2(μ1)3a_{2}=-\frac{\mu^{2}}{(\mu-1)^{3}}, a1=μ(μ1)3a_{1}=\frac{\mu}{(\mu-1)^{3}}, and a0=1(μ1)3a_{0}=-\frac{1}{(\mu-1)^{3}}. Then the variance of f^(G,ε)\hat{f}_{\triangle}(G,\varepsilon) can be written as follows:

𝕍[f^(G,ε)]\displaystyle\operatorname{\mathbb{V}}[\hat{f}_{\triangle}(G,\varepsilon)] =𝕍[a3m3+a2m2+a1m1+a0m0]\displaystyle=\operatorname{\mathbb{V}}[a_{3}m_{3}+a_{2}m_{2}+a_{1}m_{1}+a_{0}m_{0}]
= a_3^2 𝕍[m_3] + a_2^2 𝕍[m_2] + a_1^2 𝕍[m_1] + a_0^2 𝕍[m_0]
+i=03j=0,ji32aiajcov(mi,mj),\displaystyle\hskip 9.95845pt+\sum_{i=0}^{3}\sum_{j=0,j\neq i}^{3}2a_{i}a_{j}\text{cov}(m_{i},m_{j}), (19)

where cov(mi,mj)\text{cov}(m_{i},m_{j}) represents the covariance of mim_{i} and mjm_{j}. The covariance cov(mi,mj)\text{cov}(m_{i},m_{j}) can be written as follows:

cov(mi,mj)\displaystyle\text{cov}(m_{i},m_{j}) 𝕍[mi]𝕍[mj]\displaystyle\leq\sqrt{\operatorname{\mathbb{V}}[m_{i}]\operatorname{\mathbb{V}}[m_{j}]}
(by Cauchy-Schwarz inequality)\displaystyle\hskip 11.95013pt(\text{by Cauchy-Schwarz inequality})
max{𝕍[mi],𝕍[mj]}\displaystyle\leq\max\{\operatorname{\mathbb{V}}[m_{i}],\operatorname{\mathbb{V}}[m_{j}]\}
𝕍[mi]+𝕍[mj].\displaystyle\leq\operatorname{\mathbb{V}}[m_{i}]+\operatorname{\mathbb{V}}[m_{j}]. (20)

By (19) and (20), we obtain:

𝕍[f^(G,ε)]\displaystyle\operatorname{\mathbb{V}}[\hat{f}_{\triangle}(G,\varepsilon)]
(a32+4a3(a2+a1+a0))𝕍[m3]\displaystyle\leq(a_{3}^{2}+4a_{3}(a_{2}+a_{1}+a_{0}))\operatorname{\mathbb{V}}[m_{3}]
+(a22+4a2(a3+a1+a0))𝕍[m2]\displaystyle\hskip 12.80373pt+(a_{2}^{2}+4a_{2}(a_{3}+a_{1}+a_{0}))\operatorname{\mathbb{V}}[m_{2}]
+(a12+4a1(a3+a2+a0))𝕍[m1]\displaystyle\hskip 12.80373pt+(a_{1}^{2}+4a_{1}(a_{3}+a_{2}+a_{0}))\operatorname{\mathbb{V}}[m_{1}]
+(a02+4a0(a3+a2+a1))𝕍[m0]\displaystyle\hskip 12.80373pt+(a_{0}^{2}+4a_{0}(a_{3}+a_{2}+a_{1}))\operatorname{\mathbb{V}}[m_{0}]
=O(e6ε(eε1)6(𝕍[m3]+𝕍[m2]+𝕍[m1]+𝕍[m0])).\displaystyle=O\left(\frac{e^{6\varepsilon}}{(e^{\varepsilon}-1)^{6}}(\operatorname{\mathbb{V}}[m_{3}]+\operatorname{\mathbb{V}}[m_{2}]+\operatorname{\mathbb{V}}[m_{1}]+\operatorname{\mathbb{V}}[m_{0}])\right). (21)

Below we calculate 𝕍[m3]\operatorname{\mathbb{V}}[m_{3}], 𝕍[m2]\operatorname{\mathbb{V}}[m_{2}], 𝕍[m1]\operatorname{\mathbb{V}}[m_{1}], and 𝕍[m0]\operatorname{\mathbb{V}}[m_{0}] by assuming the Erdös-Rényi model 𝐆(n,α)\mathbf{G}(n,\alpha) for GG:

Lemma 1.

Let GG(n,α)G\sim\textbf{G}(n,\alpha). Let p=1eε+1p=\frac{1}{e^{\varepsilon}+1} and β=α(1p)+(1α)p\beta=\alpha(1-p)+(1-\alpha)p. Then 𝕍[m3]=O(β5n4+β3n3)\operatorname{\mathbb{V}}[m_{3}]=O(\beta^{5}n^{4}+\beta^{3}n^{3}), 𝕍[m2]=O(β3n4+β2n3)\operatorname{\mathbb{V}}[m_{2}]=O(\beta^{3}n^{4}+\beta^{2}n^{3}), and 𝕍[m1]=𝕍[m0]=O(βn4)\operatorname{\mathbb{V}}[m_{1}]=\operatorname{\mathbb{V}}[m_{0}]=O(\beta n^{4}).

Before going into the proof of Lemma 1, we prove Theorem 4 using Lemma 1. By (21) and Lemma 1, we obtain:

𝕍[f^(G,ε)]=O(e6ε(eε1)6βn4),\displaystyle\operatorname{\mathbb{V}}[\hat{f}_{\triangle}(G,\varepsilon)]=O\left(\frac{e^{6\varepsilon}}{(e^{\varepsilon}-1)^{6}}\beta n^{4}\right),

which proves Theorem 4. ∎

We now prove Lemma 1:

Proof of Lemma 1.

First we show the variance of m3 and m0. Then we show the variance of m2 and m1.

Variance of m3m_{3} and m0m_{0}.  Since each edge in the original graph GG is independently generated with probability α[0,1]\alpha\in[0,1], each edge in the noisy graph GG^{\prime} is independently generated with probability β=α(1p)+(1α)p[0,1]\beta=\alpha(1-p)+(1-\alpha)p\in[0,1], where p=1eε+1p=\frac{1}{e^{\varepsilon}+1}. Thus m3m_{3} is the number of triangles in graph GG(n,β)G^{\prime}\sim\textbf{G}(n,\beta).

For i,j,k[n]i,j,k\in[n], let yi,j,k{0,1}y_{i,j,k}\in\{0,1\} be a variable that takes 11 if and only if (vi,vj,vk)(v_{i},v_{j},v_{k}) forms a triangle in the noisy graph GG^{\prime}. Then 𝔼[m32]\mathbb{E}[m_{3}^{2}] can be written as follows:

𝔼[m32]=i<j<ki<j<k𝔼[yi,j,kyi,j,k]\displaystyle\mathbb{E}[m_{3}^{2}]=\sum_{i<j<k}~{}\sum_{i^{\prime}<j^{\prime}<k^{\prime}}\mathbb{E}[y_{i,j,k}y_{i^{\prime},j^{\prime},k^{\prime}}] (22)

The term 𝔼[yi,j,kyi,j,k]\mathbb{E}[y_{i,j,k}y_{i^{\prime},j^{\prime},k^{\prime}}] in (22) is the probability that both (vi,vj,vk)(v_{i},v_{j},v_{k}) and (vi,vj,vk)(v_{i^{\prime}},v_{j^{\prime}},v_{k^{\prime}}) form triangles in GG^{\prime}. This event can be divided into the following four types:

  1. (i,j,k)=(i,j,k)(i,j,k)=(i^{\prime},j^{\prime},k^{\prime}). There are (n3)\binom{n}{3} such terms in (22). For each term, 𝔼[yi,j,kyi,j,k]=β3\mathbb{E}[y_{i,j,k}y_{i^{\prime},j^{\prime},k^{\prime}}]=\beta^{3}.

  2. (i,j,k)(i,j,k) and (i,j,k)(i^{\prime},j^{\prime},k^{\prime}) have two elements in common. There are (n2)(n2)(n3)=12(n4)\binom{n}{2}(n-2)(n-3)=12\binom{n}{4} such terms in (22). For each term, 𝔼[yi,j,kyi,j,k]=β5\mathbb{E}[y_{i,j,k}y_{i^{\prime},j^{\prime},k^{\prime}}]=\beta^{5}.

  3. (i,j,k)(i,j,k) and (i,j,k)(i^{\prime},j^{\prime},k^{\prime}) have one element in common. There are n(n12)(n32)=30(n5)n\binom{n-1}{2}\binom{n-3}{2}=30\binom{n}{5} such terms in (22). For each term, 𝔼[yi,j,kyi,j,k]=β6\mathbb{E}[y_{i,j,k}y_{i^{\prime},j^{\prime},k^{\prime}}]=\beta^{6}.

  4. (i,j,k)(i,j,k) and (i,j,k)(i^{\prime},j^{\prime},k^{\prime}) have no common elements. There are (n3)(n33)=20(n6)\binom{n}{3}\binom{n-3}{3}=20\binom{n}{6} such terms in (22). For each term, 𝔼[yi,j,kyi,j,k]=β6\mathbb{E}[y_{i,j,k}y_{i^{\prime},j^{\prime},k^{\prime}}]=\beta^{6}.

Moreover, 𝔼[m3]2=(n3)2β6\mathbb{E}[m_{3}]^{2}=\binom{n}{3}^{2}\beta^{6}. Therefore, the variance of m3m_{3} can be written as follows:

𝕍[m3]\displaystyle\operatorname{\mathbb{V}}[m_{3}] =(n3)β3+12(n4)β5+30(n5)β6+20(n6)β6(n3)2β6\displaystyle=\textstyle{\binom{n}{3}\beta^{3}+12\binom{n}{4}\beta^{5}+30\binom{n}{5}\beta^{6}+20\binom{n}{6}\beta^{6}-\binom{n}{3}^{2}\beta^{6}}
=(n3)β3(1β3)+12(n4)β5(1β)\displaystyle=\textstyle{\binom{n}{3}\beta^{3}(1-\beta^{3})+12\binom{n}{4}\beta^{5}(1-\beta)}
=O(β5n4+β3n3).\displaystyle=O(\beta^{5}n^{4}+\beta^{3}n^{3}).
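The simplification in the last two lines uses the identity \binom{n}{3}^{2}=\binom{n}{3}+12\binom{n}{4}+30\binom{n}{5}+20\binom{n}{6}: every ordered pair of 3-subsets of [n][n] falls into exactly one of the four cases above. A quick numeric check (a minimal Python sketch; the range of nn is illustrative):

from math import comb

# Every ordered pair of 3-subsets of [n] shares exactly 3, 2, 1, or 0 elements,
# so the four case counts must sum to comb(n, 3)**2.
for n in range(3, 60):
    total = comb(n, 3) + 12 * comb(n, 4) + 30 * comb(n, 5) + 20 * comb(n, 6)
    assert total == comb(n, 3) ** 2, n
print("pair-counting identity holds for n = 3, ..., 59")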

By changing β\beta to 1β1-\beta and counting triangles, we get a random variable with the same distribution as m0m_{0}. Thus,

𝕍[m0]\displaystyle\operatorname{\mathbb{V}}[m_{0}] =(n3)(1β)3(1(1β)3)+12(n4)(1β)5β\displaystyle=\textstyle{\binom{n}{3}(1-\beta)^{3}(1-(1-\beta)^{3})+12\binom{n}{4}(1-\beta)^{5}\beta}
=O(βn4).\displaystyle=O(\beta n^{4}).

Variance of m2m_{2} and m1m_{1}.  For i,j,k[n]i,j,k\in[n], let zi,j,k{0,1}z_{i,j,k}\in\{0,1\} be a variable that takes 11 if and only if (vi,vj,vk)(v_{i},v_{j},v_{k}) forms 22-edges in GG^{\prime} (i.e., exactly one of the three possible edges is missing). Then 𝔼[m22]\mathbb{E}[m_{2}^{2}] can be written as follows:

𝔼[m22]=i<j<ki<j<k𝔼[zi,j,kzi,j,k]\displaystyle\mathbb{E}[m_{2}^{2}]=\sum_{i<j<k}\sum_{i^{\prime}<j^{\prime}<k^{\prime}}\mathbb{E}[z_{i,j,k}z_{i^{\prime},j^{\prime},k^{\prime}}] (23)

The term 𝔼[zi,j,kzi,j,k]\mathbb{E}[z_{i,j,k}z_{i^{\prime},j^{\prime},k^{\prime}}] in (23) is the probability that both (vi,vj,vk)(v_{i},v_{j},v_{k}) and (vi,vj,vk)(v_{i^{\prime}},v_{j^{\prime}},v_{k^{\prime}}) form 22-edges in GG^{\prime}. This event can be divided into the following four types:

  1. (i,j,k)=(i,j,k)(i,j,k)=(i^{\prime},j^{\prime},k^{\prime}). There are (n3)\binom{n}{3} such terms in (23). For each term, 𝔼[zi,j,kzi,j,k]=3β2(1β)\mathbb{E}[z_{i,j,k}z_{i^{\prime},j^{\prime},k^{\prime}}]=3\beta^{2}(1-\beta).

  2. (i,j,k)(i,j,k) and (i,j,k)(i^{\prime},j^{\prime},k^{\prime}) have two elements in common. There are (n2)(n2)(n3)=12(n4)\binom{n}{2}(n-2)(n-3)=12\binom{n}{4} such terms in (23). For example, consider a term in which i=i=1i=i^{\prime}=1, j=j=2j=j^{\prime}=2, k=3k=3, and k=4k^{\prime}=4. Both (v1,v2,v3)(v_{1},v_{2},v_{3}) and (v1,v2,v4)(v_{1},v_{2},v_{4}) form 2-edges if:
    (a) (v1,v2),(v1,v3),(v1,v4)E(v_{1},v_{2}),(v_{1},v_{3}),(v_{1},v_{4})\in E^{\prime}, (v2,v3),(v2,v4)E(v_{2},v_{3}),(v_{2},v_{4})\notin E^{\prime},
    (b) (v1,v2),(v1,v3),(v2,v4)E(v_{1},v_{2}),(v_{1},v_{3}),(v_{2},v_{4})\in E^{\prime}, (v2,v3),(v1,v4)E(v_{2},v_{3}),(v_{1},v_{4})\notin E^{\prime},
    (c) (v1,v2),(v2,v3),(v1,v4)E(v_{1},v_{2}),(v_{2},v_{3}),(v_{1},v_{4})\in E^{\prime}, (v1,v3),(v2,v4)E(v_{1},v_{3}),(v_{2},v_{4})\notin E^{\prime},
    (d) (v1,v2),(v2,v3),(v2,v4)E(v_{1},v_{2}),(v_{2},v_{3}),(v_{2},v_{4})\in E^{\prime}, (v1,v3),(v1,v4)E(v_{1},v_{3}),(v_{1},v_{4})\notin E^{\prime}, or
    (e) (v1,v3),(v1,v4),(v2,v3),(v2,v4)E(v_{1},v_{3}),(v_{1},v_{4}),(v_{2},v_{3}),(v_{2},v_{4})\in E^{\prime}, (v1,v2)E(v_{1},v_{2})\notin E^{\prime}.
    Thus, 𝔼[zi,j,kzi,j,k]=4β3(1β)2+β4(1β)\mathbb{E}[z_{i,j,k}z_{i^{\prime},j^{\prime},k^{\prime}}]=4\beta^{3}(1-\beta)^{2}+\beta^{4}(1-\beta) for this term. Similarly, 𝔼[zi,j,kzi,j,k]=4β3(1β)2+β4(1β)\mathbb{E}[z_{i,j,k}z_{i^{\prime},j^{\prime},k^{\prime}}]=4\beta^{3}(1-\beta)^{2}+\beta^{4}(1-\beta) for the other terms.

  3. (i,j,k)(i,j,k) and (i,j,k)(i^{\prime},j^{\prime},k^{\prime}) have one element in common. There are n(n12)(n32)=30(n5)n\binom{n-1}{2}\binom{n-3}{2}=30\binom{n}{5} such terms in (23). For each term, 𝔼[zi,j,kzi,j,k]=(3β2(1β))2=9β4(1β)2\mathbb{E}[z_{i,j,k}z_{i^{\prime},j^{\prime},k^{\prime}}]=(3\beta^{2}(1-\beta))^{2}=9\beta^{4}(1-\beta)^{2}.

  4. (i,j,k)(i,j,k) and (i,j,k)(i^{\prime},j^{\prime},k^{\prime}) have no common elements. There are (n3)(n33)=20(n6)\binom{n}{3}\binom{n-3}{3}=20\binom{n}{6} such terms in (23). For each term, 𝔼[zi,j,kzi,j,k]=(3β2(1β))2=9β4(1β)2\mathbb{E}[z_{i,j,k}z_{i^{\prime},j^{\prime},k^{\prime}}]=(3\beta^{2}(1-\beta))^{2}=9\beta^{4}(1-\beta)^{2}.

Moreover, 𝔼[m2]2=(3(n3)β2(1β))2=9(n3)2β4(1β)2\mathbb{E}[m_{2}]^{2}=(3\binom{n}{3}\beta^{2}(1-\beta))^{2}=9\binom{n}{3}^{2}\beta^{4}(1-\beta)^{2}. Therefore, the variance of m2m_{2} can be written as follows:

𝕍[m2]\displaystyle\mathbb{V}[m_{2}] =𝔼[m22]𝔼[m2]2\displaystyle=\mathbb{E}[m_{2}^{2}]-\mathbb{E}[m_{2}]^{2}
=3(n3)β2(1β)+12(n4)(4β3(1β)2+β4(1β))\displaystyle=\textstyle{3\binom{n}{3}\beta^{2}(1-\beta)+12\binom{n}{4}\left(4\beta^{3}(1-\beta)^{2}+\beta^{4}(1-\beta)\right)}
+270(n5)β4(1β)2+180(n6)β4(1β)2\displaystyle\hskip 9.95845pt\textstyle{+270\binom{n}{5}\beta^{4}(1-\beta)^{2}+180\binom{n}{6}\beta^{4}(1-\beta)^{2}}
9(n3)2β4(1β)2.\displaystyle\hskip 9.95845pt\textstyle{-9\binom{n}{3}^{2}\beta^{4}(1-\beta)^{2}.}

By simple calculations,

270(n5)+180(n6)9(n3)2=108(n4)9(n3).\displaystyle\textstyle{270\binom{n}{5}+180\binom{n}{6}-9\binom{n}{3}^{2}=-108\binom{n}{4}-9\binom{n}{3}.}

Thus we obtain:

𝕍[m2]\displaystyle\mathbb{V}[m_{2}] =3(n3)β2(1β)(13β2(1β))\displaystyle=\textstyle{3\binom{n}{3}\beta^{2}(1-\beta)\left(1-3\beta^{2}(1-\beta)\right)}
+12(n4)β3(1β)(4(1β)+β9β(1β))\displaystyle\hskip 9.95845pt\textstyle{+12\binom{n}{4}\beta^{3}(1-\beta)\left(4(1-\beta)+\beta-9\beta(1-\beta)\right)}
=O(β3n4+β2n3).\displaystyle=O(\beta^{3}n^{4}+\beta^{2}n^{3}).

Similarly, the variance of m1m_{1} can be written as follows:

𝕍[m1]\displaystyle\mathbb{V}[m_{1}] =3(n3)β(1β)2(13β(1β)2)\displaystyle=\textstyle{3\binom{n}{3}\beta(1-\beta)^{2}\left(1-3\beta(1-\beta)^{2}\right)}
+12(n4)β(1β)3(4β+(1β)9β(1β))\displaystyle\hskip 9.95845pt\textstyle{+12\binom{n}{4}\beta(1-\beta)^{3}\left(4\beta+(1-\beta)-9\beta(1-\beta)\right)}
=O(βn4).\displaystyle=O(\beta n^{4}).
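As a sanity check on these closed forms (not part of the proof), the following Python sketch samples G(n, β) repeatedly with small illustrative values of n, β, and the number of trials, counts the triples with exactly 3, 2, 1, and 0 edges, and compares the empirical variances with the expressions derived above.

import numpy as np
from itertools import combinations
from math import comb

def triple_counts(adj, n):
    # Return (m3, m2, m1, m0): numbers of triples with exactly 3, 2, 1, 0 edges.
    m = [0, 0, 0, 0]
    for i, j, k in combinations(range(n), 3):
        m[adj[i, j] + adj[i, k] + adj[j, k]] += 1
    return m[3], m[2], m[1], m[0]

rng = np.random.default_rng(0)
n, beta, trials = 20, 0.2, 2000          # illustrative sizes
samples = []
for _ in range(trials):
    upper = np.triu(rng.random((n, n)) < beta, 1).astype(int)
    samples.append(triple_counts(upper + upper.T, n))   # one sample of G(n, beta)
emp_var = np.var(np.array(samples), axis=0)

c3, c4, b = comb(n, 3), comb(n, 4), beta
exact = [
    c3*b**3*(1-b**3) + 12*c4*b**5*(1-b),                                  # V[m3]
    3*c3*b**2*(1-b)*(1-3*b**2*(1-b))
        + 12*c4*b**3*(1-b)*(4*(1-b)+b-9*b*(1-b)),                         # V[m2]
    3*c3*b*(1-b)**2*(1-3*b*(1-b)**2)
        + 12*c4*b*(1-b)**3*(4*b+(1-b)-9*b*(1-b)),                         # V[m1]
    c3*(1-b)**3*(1-(1-b)**3) + 12*c4*(1-b)**5*b,                          # V[m0]
]
print(np.round(emp_var, 1))    # empirical variances of (m3, m2, m1, m0)
print(np.round(exact, 1))      # closed forms; the two lines should roughly agree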

D.6 Proof of Proposition 3

Let t=i=1ntit_{*}=\sum_{i=1}^{n}t_{i} and s=i=1nsis_{*}=\sum_{i=1}^{n}s_{i}. Let ss_{*}^{\wedge} be the number of triplets (vi,vj,vk)(v_{i},v_{j},v_{k}) such that j<k<ij<k<i, ai,j=ai,k=1a_{i,j}=a_{i,k}=1, and aj,k=0a_{j,k}=0. Let ss_{*}^{\triangle} be the number of triplets (vi,vj,vk)(v_{i},v_{j},v_{k}) such that j<k<ij<k<i, ai,j=ai,k=aj,k=1a_{i,j}=a_{i,k}=a_{j,k}=1. Note that s=s+ss_{*}=s_{*}^{\wedge}+s_{*}^{\triangle} and s=f(G)s_{*}^{\triangle}=f_{\triangle}(G).

Consider a triangle (vi,vj,vk)G(v_{i},v_{j},v_{k})\in G with j<k<ij<k<i. Since ai,j=ai,k=1a_{i,j}=a_{i,k}=1, this triangle is counted in tt_{*} if and only if (vj,vk)E(v_{j},v_{k})\in E^{\prime}, which happens with probability 1p11-p_{1} (=eε1eε1+1=\frac{e^{\varepsilon_{1}}}{e^{\varepsilon_{1}}+1}). Similarly, each triplet counted in ss_{*}^{\wedge} (i.e., j<k<ij<k<i, ai,j=ai,k=1a_{i,j}=a_{i,k}=1, and aj,k=0a_{j,k}=0) is counted in tt_{*} with probability p1p_{1} (=1eε1+1=\frac{1}{e^{\varepsilon_{1}}+1}). No other triplets contribute to tt_{*}. Therefore, we obtain:

𝔼[t]=(1p1)s+p1s.\displaystyle\mathbb{E}[t_{*}]=(1-p_{1})s_{*}^{\triangle}+p_{1}s_{*}^{\wedge}.

By s=s+ss_{*}=s_{*}^{\wedge}+s_{*}^{\triangle} and s=f(G)s_{*}^{\triangle}=f_{\triangle}(G), we obtain:

𝔼[i=1nwi]\displaystyle\mathbb{E}\left[\sum_{i=1}^{n}w_{i}\right] =𝔼[i=1n(tip1si)]\displaystyle=\mathbb{E}\left[\sum_{i=1}^{n}(t_{i}-p_{1}s_{i})\right]
=𝔼[tp1s]\displaystyle=\mathbb{E}[t_{*}-p_{1}s_{*}]
=𝔼[t]p1𝔼[s+s]\displaystyle=\mathbb{E}[t_{*}]-p_{1}\mathbb{E}[s_{*}^{\wedge}+s_{*}^{\triangle}]
=(1p1)s+p1sp1(s+s)\displaystyle=(1-p_{1})s_{*}^{\triangle}+p_{1}s_{*}^{\wedge}-p_{1}(s_{*}^{\wedge}+s_{*}^{\triangle})
=(12p1)f(G),\displaystyle=(1-2p_{1})f_{\triangle}(G),

hence

𝔼[112p1i=1nwi]=f(G).\displaystyle\textstyle{\mathbb{E}\left[\frac{1}{1-2p_{1}}\sum_{i=1}^{n}w_{i}\right]=f_{\triangle}(G).}
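As an illustration (not part of the proof), the following Python sketch averages the estimate over many runs of the first-round randomized response on a fixed graph, ignoring the mean-zero Laplace noise of the second round and assuming no graph projection; the parameters n, α, and ε1 and all variable names are illustrative. The average should be close to the true triangle count, reflecting unbiasedness.

import numpy as np
from itertools import combinations

rng = np.random.default_rng(1)
n, alpha, eps1, trials = 20, 0.3, 1.0, 3000
p1 = 1.0 / (np.exp(eps1) + 1.0)

# A fixed original graph G, drawn once from G(n, alpha).
A = np.triu(rng.random((n, n)) < alpha, 1).astype(int)
A = A + A.T
true_triangles = sum(A[i, j] * A[i, k] * A[j, k]
                     for i, j, k in combinations(range(n), 3))

estimates = []
for _ in range(trials):
    # Round 1: apply randomized response to each edge bit once and symmetrize -> E'.
    flip = np.triu(rng.random((n, n)) < p1, 1).astype(int)
    Ap = np.triu(A, 1) ^ flip
    Ap = Ap + Ap.T
    # Round 2: user i accumulates w_i = t_i - p1 * s_i over pairs j < k < i.
    w_sum = 0.0
    for i in range(n):
        for j, k in combinations(range(i), 2):
            if A[i, j] and A[i, k]:
                w_sum += Ap[j, k] - p1
    estimates.append(w_sum / (1.0 - 2.0 * p1))
print(true_triangles, np.mean(estimates))   # the mean estimate should be close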

D.7 Proof of Theorem 5

Let i\mathcal{R}_{i} be Local2Rounds. Consider two neighbor lists 𝐚i,𝐚i{0,1}n\mathbf{a}_{i},\mathbf{a}^{\prime}_{i}\in\{0,1\}^{n} that differ in one bit. Let did_{i} (resp. did^{\prime}_{i}) 0\in\mathbb{Z}_{\geq 0} be the number of “1”s in 𝐚i\mathbf{a}_{i} (resp. 𝐚i\mathbf{a}^{\prime}_{i}). Let 𝐚¯i\bar{\mathbf{a}}_{i} (resp. 𝐚¯i\bar{\mathbf{a}}^{\prime}_{i}) {0,1}n\in\{0,1\}^{n} be neighbor lists obtained by setting all of the ii-th to the nn-th elements in 𝐚i\mathbf{a}_{i} (resp. 𝐚i\mathbf{a}^{\prime}_{i}) to 0. Let d¯i\bar{d}_{i} (resp. d¯i\bar{d}^{\prime}_{i}) 0\in\mathbb{Z}_{\geq 0} be the number of “1”s in 𝐚¯i\bar{\mathbf{a}}_{i} (resp. 𝐚¯i\bar{\mathbf{a}}^{\prime}_{i}). For example, if n=6n=6, 𝐚4=(1,0,1,0,1,1)\mathbf{a}_{4}=(1,0,1,0,1,1), and 𝐚4=(1,1,1,0,1,1)\mathbf{a}^{\prime}_{4}=(1,1,1,0,1,1), then d4=4d_{4}=4, d4=5d^{\prime}_{4}=5, 𝐚¯4=(1,0,1,0,0,0)\bar{\mathbf{a}}_{4}=(1,0,1,0,0,0), 𝐚¯4=(1,1,1,0,0,0)\bar{\mathbf{a}}^{\prime}_{4}=(1,1,1,0,0,0), d¯4=2\bar{d}_{4}=2, and d¯4=3\bar{d}^{\prime}_{4}=3.

Furthermore, let tit_{i} (resp. tit^{\prime}_{i}) 0\in\mathbb{Z}_{\geq 0} be the number of triplets (vi,vj,vk)(v_{i},v_{j},v_{k}) such that j<k<ij<k<i, (vi,vj)E(v_{i},v_{j})\in E, (vi,vk)E(v_{i},v_{k})\in E, and (vj,vk)E(v_{j},v_{k})\in E^{\prime} in 𝐚i\mathbf{a}_{i} (resp. 𝐚i\mathbf{a}^{\prime}_{i}). Let sis_{i} (resp. sis^{\prime}_{i}) 0\in\mathbb{Z}_{\geq 0} be the number of triplets (vi,vj,vk)(v_{i},v_{j},v_{k}) such that j<k<ij<k<i, (vi,vj)E(v_{i},v_{j})\in E, and (vi,vk)E(v_{i},v_{k})\in E in 𝐚i\mathbf{a}_{i} (resp. 𝐚i\mathbf{a}^{\prime}_{i}). Let wi=tip1siw_{i}=t_{i}-p_{1}s_{i} and wi=tip1siw^{\prime}_{i}=t^{\prime}_{i}-p_{1}s^{\prime}_{i}. Below we consider two cases about did_{i}: when di<d~maxd_{i}<\tilde{d}_{max} and when did~maxd_{i}\geq\tilde{d}_{max}.

Case 1: di<d~maxd_{i}<\tilde{d}_{max}.  Assume that di=di+1d^{\prime}_{i}=d_{i}+1. In this case, we have either 𝐚¯i=𝐚¯i\bar{\mathbf{a}}^{\prime}_{i}=\bar{\mathbf{a}}_{i} or d¯i=d¯i+1\bar{d}^{\prime}_{i}=\bar{d}_{i}+1. If 𝐚¯i=𝐚¯i\bar{\mathbf{a}}^{\prime}_{i}=\bar{\mathbf{a}}_{i}, then si=sis_{i}=s^{\prime}_{i}, ti=tit_{i}=t^{\prime}_{i}, and wi=wiw_{i}=w^{\prime}_{i}, hence Pr[i(𝐚i)=w^i]=Pr[i(𝐚i)=w^i]\Pr[\mathcal{R}_{i}(\mathbf{a}_{i})=\hat{w}_{i}]=\Pr[\mathcal{R}_{i}(\mathbf{a}^{\prime}_{i})=\hat{w}_{i}]. If d¯i=d¯i+1\bar{d}^{\prime}_{i}=\bar{d}_{i}+1, then sis_{i} and sis^{\prime}_{i} can be expressed as si=(d¯i2)s_{i}=\binom{\bar{d}_{i}}{2} and si=(d¯i2)=(d¯i+12)s^{\prime}_{i}=\binom{\bar{d}^{\prime}_{i}}{2}=\binom{\bar{d}_{i}+1}{2}, respectively. Then we obtain:

sisi=(d¯i+12)(d¯i2)=d¯i.\displaystyle s^{\prime}_{i}-s_{i}=\binom{\bar{d}_{i}+1}{2}-\binom{\bar{d}_{i}}{2}=\bar{d}_{i}.

In addition, since tit_{i} and tit^{\prime}_{i} count the same pairs as sis_{i} and sis^{\prime}_{i} under the additional constraint “(vj,vk)E(v_{j},v_{k})\in E^{\prime}”, we have 0titisisi0\leq t^{\prime}_{i}-t_{i}\leq s^{\prime}_{i}-s_{i}. Therefore,

|wiwi|\displaystyle|w^{\prime}_{i}-w_{i}| =|titip1(sisi)|\displaystyle=|t^{\prime}_{i}-t_{i}-p_{1}(s^{\prime}_{i}-s_{i})|
(1p1)d¯i\displaystyle\leq(1-p_{1})\bar{d}_{i}
(1p1)di\displaystyle\leq(1-p_{1})d_{i}
<d~max(by p1>0 and di<d~max).\displaystyle<\tilde{d}_{max}\hskip 14.22636pt\text{(by $p_{1}>0$ and $d_{i}<\tilde{d}_{max}$)}.

Since we add Lap(d~maxε2)\textrm{Lap}(\frac{\tilde{d}_{max}}{\varepsilon_{2}}) to wiw_{i}, we obtain:

Pr[i(𝐚i)=w^i]eε2Pr[i(𝐚i)=w^i].\displaystyle\Pr[\mathcal{R}_{i}(\mathbf{a}_{i})=\hat{w}_{i}]\leq e^{\varepsilon_{2}}\Pr[\mathcal{R}_{i}(\mathbf{a}^{\prime}_{i})=\hat{w}_{i}]. (24)

Assume that di=di1d^{\prime}_{i}=d_{i}-1. In this case, we have either 𝐚¯i=𝐚¯i\bar{\mathbf{a}}^{\prime}_{i}=\bar{\mathbf{a}}_{i} or d¯i=d¯i1\bar{d}^{\prime}_{i}=\bar{d}_{i}-1. If 𝐚¯i=𝐚¯i\bar{\mathbf{a}}^{\prime}_{i}=\bar{\mathbf{a}}_{i}, then Pr[i(𝐚i)=w^i]=Pr[i(𝐚i)=w^i]\Pr[\mathcal{R}_{i}(\mathbf{a}_{i})=\hat{w}_{i}]=\Pr[\mathcal{R}_{i}(\mathbf{a}^{\prime}_{i})=\hat{w}_{i}]. If d¯i=d¯i1\bar{d}^{\prime}_{i}=\bar{d}_{i}-1, then we obtain sisi=d¯i1s_{i}-s^{\prime}_{i}=\bar{d}_{i}-1 and titisisit_{i}-t^{\prime}_{i}\leq s_{i}-s^{\prime}_{i}. Thus |wiwi|(1p1)(d~i1)<d~max|w^{\prime}_{i}-w_{i}|\leq(1-p_{1})(\tilde{d}_{i}-1)<\tilde{d}_{max} and (24) holds. Therefore, Local2Rounds provides ε2\varepsilon_{2}-edge LDP at the second round. Since Local2Rounds provides ε1\varepsilon_{1}-edge LDP at the first round (by Theorem 3), it provides (ε1+ε2)(\varepsilon_{1}+\varepsilon_{2})-edge LDP in total by the composition theorem [23].

Case 2: did~maxd_{i}\geq\tilde{d}_{max}.  Assume that di=di+1d^{\prime}_{i}=d_{i}+1. In this case, we obtain di=di=d~maxd_{i}=d^{\prime}_{i}=\tilde{d}_{max} after graph projection.

Note that 𝐚i\mathbf{a}_{i} and 𝐚i\mathbf{a}^{\prime}_{i} can differ in zero or two bits after graph projection. For example, consider the case where n=8n=8, 𝐚5=(1,1,0,1,0,1,1,1)\mathbf{a}_{5}=(1,1,0,1,0,1,1,1), 𝐚5=(1,1,1,1,0,1,1,1)\mathbf{a}^{\prime}_{5}=(1,1,1,1,0,1,1,1), and d~max=4\tilde{d}_{max}=4. If the permutation is 1,4,6,8,2,7,5,3, then 𝐚5=𝐚5=(1,0,0,1,0,1,0,1)\mathbf{a}_{5}=\mathbf{a}^{\prime}_{5}=(1,0,0,1,0,1,0,1) after graph projection. However, if the permutation is 3,1,4,6,8,2,7,5, then 𝐚5\mathbf{a}_{5} and 𝐚5\mathbf{a}^{\prime}_{5} become 𝐚5=(1,0,0,1,0,1,0,1)\mathbf{a}_{5}=(1,0,0,1,0,1,0,1) and 𝐚5=(1,0,1,1,0,1,0,0)\mathbf{a}^{\prime}_{5}=(1,0,1,1,0,1,0,0), respectively; i.e., they differ in the third and eighth elements.
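To make the example concrete, the following Python sketch reproduces it, assuming (as the example suggests) that graph projection keeps, for each user, only the first d̃max neighbors in the order given by the random permutation; the helper name project is illustrative.

def project(neigh, perm, d_max):
    # Keep only the first d_max neighbors of `neigh`, scanned in the order of `perm`.
    kept, out = 0, [0] * len(neigh)
    for v in perm:                        # node labels are 1-based, as in the example
        if neigh[v - 1] == 1 and kept < d_max:
            out[v - 1] = 1
            kept += 1
    return out

a5  = [1, 1, 0, 1, 0, 1, 1, 1]            # a_5 in the example
a5p = [1, 1, 1, 1, 0, 1, 1, 1]            # a'_5, differing in the third bit
for perm in ([1, 4, 6, 8, 2, 7, 5, 3], [3, 1, 4, 6, 8, 2, 7, 5]):
    print(perm, project(a5, perm, 4), project(a5p, perm, 4))
# First permutation: the two projected lists coincide.
# Second permutation: they differ in the 3rd and 8th bits, as stated above.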

If 𝐚i=𝐚i\mathbf{a}_{i}=\mathbf{a}^{\prime}_{i}, then Pr[i(𝐚i)=w^i]=Pr[i(𝐚i)=w^i]\Pr[\mathcal{R}_{i}(\mathbf{a}_{i})=\hat{w}_{i}]=\Pr[\mathcal{R}_{i}(\mathbf{a}^{\prime}_{i})=\hat{w}_{i}]. If 𝐚i\mathbf{a}_{i} and 𝐚i\mathbf{a}^{\prime}_{i} differ in two bits, 𝐚¯i\bar{\mathbf{a}}_{i} and 𝐚¯i\bar{\mathbf{a}}^{\prime}_{i} differ in at most two bits (because we set all of the ii-th to the nn-th elements in 𝐚i\mathbf{a}_{i} and 𝐚i\mathbf{a}^{\prime}_{i} to 0). For example, we can consider the following three cases:

  • If 𝐚5=(1,0,0,1,0,1,0,1)\mathbf{a}_{5}=(1,0,0,1,0,1,0,1) and 𝐚5=(1,0,0,1,0,1,1,0)\mathbf{a}^{\prime}_{5}=(1,0,0,1,0,1,1,0), then 𝐚¯5=𝐚¯5=(1,0,0,1,0,0,0,0)\bar{\mathbf{a}}_{5}=\bar{\mathbf{a}}^{\prime}_{5}=(1,0,0,1,0,0,0,0).

  • If 𝐚5=(1,0,0,1,0,1,0,1)\mathbf{a}_{5}=(1,0,0,1,0,1,0,1) and 𝐚5=(1,0,1,1,0,1,0,0)\mathbf{a}^{\prime}_{5}=(1,0,1,1,0,1,0,0), then 𝐚¯5=(1,0,0,1,0,0,0,0)\bar{\mathbf{a}}_{5}=(1,0,0,1,0,0,0,0) and 𝐚¯5=(1,0,1,1,0,0,0,0)\bar{\mathbf{a}}^{\prime}_{5}=(1,0,1,1,0,0,\allowbreak 0,0); i.e., they differ in one bit.

  • If 𝐚5=(1,1,0,1,0,1,0,0)\mathbf{a}_{5}=(1,1,0,1,0,1,0,0) and 𝐚5=(1,0,1,1,0,1,0,0)\mathbf{a}^{\prime}_{5}=(1,0,1,1,0,1,0,0), then 𝐚¯5=(1,1,0,1,0,0,0,0)\bar{\mathbf{a}}_{5}=(1,1,0,1,0,0,0,0) and 𝐚¯5=(1,0,1,1,0,0,0,0)\bar{\mathbf{a}}^{\prime}_{5}=(1,0,1,1,0,0,\allowbreak 0,0); i.e., they differ in two bits.

If 𝐚¯i=𝐚¯i\bar{\mathbf{a}}_{i}=\bar{\mathbf{a}}^{\prime}_{i}, then Pr[i(𝐚i)=w^i]=Pr[i(𝐚i)=w^i]\Pr[\mathcal{R}_{i}(\mathbf{a}_{i})=\hat{w}_{i}]=\Pr[\mathcal{R}_{i}(\mathbf{a}^{\prime}_{i})=\hat{w}_{i}]. If 𝐚¯i\bar{\mathbf{a}}_{i} and 𝐚¯i\bar{\mathbf{a}}^{\prime}_{i} differ in one bit, then d¯i=d¯i+1\bar{d}^{\prime}_{i}=\bar{d}_{i}+1. In this case, we obtain (24) in the same way as Case 1.

We need to be careful when 𝐚¯i\bar{\mathbf{a}}_{i} and 𝐚¯i\bar{\mathbf{a}}^{\prime}_{i} differ in two bits. In this case, d¯i=d¯i\bar{d}^{\prime}_{i}=\bar{d}_{i} (because di=di=d~maxd_{i}=d^{\prime}_{i}=\tilde{d}_{max} after graph projection). Then we obtain si=si=(d~max2)s_{i}=s^{\prime}_{i}=\binom{\tilde{d}_{max}}{2}. Since the number of 22-stars that involve a particular user in 𝐚¯i\bar{\mathbf{a}}_{i} is d¯i1\bar{d}_{i}-1, we obtain titid¯i1t^{\prime}_{i}-t_{i}\leq\bar{d}_{i}-1. Therefore,

|wiwi|=|titi|d¯i1<d~max,\displaystyle|w^{\prime}_{i}-w_{i}|=|t^{\prime}_{i}-t_{i}|\leq\bar{d}_{i}-1<\tilde{d}_{max},

and (24) holds. Therefore, if di=di+1d^{\prime}_{i}=d_{i}+1, then Local2Rounds provides (ε1+ε2)(\varepsilon_{1}+\varepsilon_{2})-edge LDP in total.

Assume that di=di1d^{\prime}_{i}=d_{i}-1. If di>d~maxd_{i}>\tilde{d}_{max}, then di=di=d~maxd_{i}=d^{\prime}_{i}=\tilde{d}_{max} after graph projection. Thus Local2Rounds provides (ε1+ε2)(\varepsilon_{1}+\varepsilon_{2})-edge LDP in total in the same way as above. If di=d~maxd_{i}=\tilde{d}_{max}, then we obtain (24) in the same way as Case 1, and therefore Local2Rounds provides (ε1+ε2)(\varepsilon_{1}+\varepsilon_{2})-edge LDP in total.

In summary, Local2Rounds provides (ε1+ε2)(\varepsilon_{1}+\varepsilon_{2})-edge LDP in both Case 1 and Case 2. Local2Rounds also provides (ε1+ε2)(\varepsilon_{1}+\varepsilon_{2})-relationship DP because it uses only the lower triangular part of the adjacency matrix 𝐀\mathbf{A}. ∎

D.8 Proof of Theorem 6

When the maximum degree dmaxd_{max} of GG is at most d~max\tilde{d}_{max}, no graph projection occurs. By Proposition 3, the estimate f^(G,ε)\hat{f}_{\triangle}(G,\varepsilon) output by Local2Rounds is unbiased.

By bias-variance decomposition (7), the expected l2l_{2} loss 𝔼[l22(f^(G,ε),f(G))]\operatorname{\mathbb{E}}[l_{2}^{2}(\hat{f}_{\triangle}(G,\varepsilon),f_{\triangle}(G))] is equal to 𝕍[f^(G,ε)]\operatorname{\mathbb{V}}[\hat{f}_{\triangle}(G,\varepsilon)]. Recall that p1=11+eε1p_{1}=\frac{1}{1+e^{\varepsilon_{1}}}. 𝕍[f^(G,ε)]\operatorname{\mathbb{V}}[\hat{f}_{\triangle}(G,\varepsilon)] can be written as follows:

𝕍[f^(G,ε)]\displaystyle\operatorname{\mathbb{V}}[\hat{f}_{\triangle}(G,\varepsilon)] (25)
=1(12p1)2𝕍[i=1nw^i]\displaystyle=\textstyle{\frac{1}{(1-2p_{1})^{2}}\operatorname{\mathbb{V}}\left[\sum_{i=1}^{n}\hat{w}_{i}\right]}
=1(12p1)2𝕍[i=1ntip1si+Lap(d~max(1p1)ε2)]\displaystyle=\textstyle{\frac{1}{(1-2p_{1})^{2}}\operatorname{\mathbb{V}}\left[\sum_{i=1}^{n}t_{i}-p_{1}s_{i}+\textrm{Lap}(\frac{\tilde{d}_{max}(1-p_{1})}{\varepsilon_{2}})\right]}
=1(12p1)2(𝕍[i=1ntip1si]+𝕍[i=1nLap(d~max(1p1)ε2)])\displaystyle=\textstyle{\frac{1}{(1-2p_{1})^{2}}\left(\operatorname{\mathbb{V}}\left[\sum_{i=1}^{n}t_{i}-p_{1}s_{i}\right]+\operatorname{\mathbb{V}}\left[\sum_{i=1}^{n}\textrm{Lap}(\frac{\tilde{d}_{max}(1-p_{1})}{\varepsilon_{2}})\right]\right)}
=1(12p1)2𝕍[i=1nti]+n(12p1)22d~max2(1p1)2ε22.\displaystyle=\textstyle{\frac{1}{(1-2p_{1})^{2}}\operatorname{\mathbb{V}}\left[\sum_{i=1}^{n}t_{i}\right]+\frac{n}{(1-2p_{1})^{2}}2\frac{\tilde{d}_{max}^{2}(1-p_{1})^{2}}{\varepsilon_{2}^{2}}}. (26)

In the last line, we can drop the sis_{i}’s because they are deterministic. We can also sum the variances of the Lap random variables because they are independent; the tit_{i}’s, in contrast, are not independent, so we keep 𝕍[i=1nti]\operatorname{\mathbb{V}}\left[\sum_{i=1}^{n}t_{i}\right] as it is.

Recall the definition of EE^{\prime} computed by the first round of Local2Rounds—the noisy edges released by randomized response. Now,

ti\displaystyle t_{i} =ai,j=ai,k=1,j<k<i1((vj,vk)E).\displaystyle=\sum_{a_{i,j}=a_{i,k}=1,j<k<i}\textbf{1}((v_{j},v_{k})\in E^{\prime}).

This gives

i=1nti\displaystyle\sum_{i=1}^{n}t_{i} =i=1nai,j=ai,k=1j<k<i1((vj,vk)E)\displaystyle=\sum_{i=1}^{n}\sum_{\begin{subarray}{c}a_{i,j}=a_{i,k}=1\\ j<k<i\end{subarray}}\textbf{1}((v_{j},v_{k})\in E^{\prime})
=1j<kni>kai,j=ai,k=11((vj,vk)E)\displaystyle=\sum_{1\leq j<k\leq n}\sum_{\begin{subarray}{c}i>k\\ a_{i,j}=a_{i,k}=1\end{subarray}}\textbf{1}((v_{j},v_{k})\in E^{\prime})
=1j<kn|{i:i>k,ai,j=ai,k=1}|1((vj,vk)E).\displaystyle=\sum_{1\leq j<k\leq n}|\{i:i>k,a_{i,j}=a_{i,k}=1\}|\textbf{1}((v_{j},v_{k})\in E^{\prime}).

Let cjk=|{i:i>k,ai,j=ai,k=1}|c_{jk}=|\{i:i>k,a_{i,j}=a_{i,k}=1\}|. Notice that the indicators 1((vj,vk)E)\textbf{1}((v_{j},v_{k})\in E^{\prime}) for distinct pairs are independent, and each has variance p1(1p1)p_{1}(1-p_{1}) because randomized response flips the corresponding bit with probability p1p_{1}. Thus, the variance of the above expression is

𝕍[i=1nti]\displaystyle\operatorname{\mathbb{V}}\left[\sum_{i=1}^{n}t_{i}\right] =𝕍[1j<kncjk1((vj,vk)E)]\displaystyle=\operatorname{\mathbb{V}}\left[\sum_{1\leq j<k\leq n}c_{jk}\textbf{1}((v_{j},v_{k})\in E^{\prime})\right]
=1j<kncjk2𝕍[1((vj,vk)E)]\displaystyle=\sum_{1\leq j<k\leq n}c_{jk}^{2}\operatorname{\mathbb{V}}[\textbf{1}((v_{j},v_{k})\in E^{\prime})]
=p1(1p1)1j<kncjk2.\displaystyle=p_{1}(1-p_{1})\sum_{1\leq j<k\leq n}c_{jk}^{2}. (27)
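As a quick check of the rearrangement of Σ ti into the cjk form (not part of the proof), the following Python sketch computes both sides on a small random graph; here E′ is an arbitrary random edge set standing in for the noisy edges, since only the algebraic identity is being checked.

import numpy as np
from itertools import combinations

rng = np.random.default_rng(2)
n = 15
A  = np.triu(rng.random((n, n)) < 0.4, 1).astype(int); A  = A + A.T     # original graph G
Ep = np.triu(rng.random((n, n)) < 0.3, 1).astype(int); Ep = Ep + Ep.T   # stand-in for E'

# Left-hand side: sum over users i of t_i.
lhs = sum(Ep[j, k]
          for i in range(n)
          for j, k in combinations(range(i), 2)
          if A[i, j] and A[i, k])

# Right-hand side: sum over pairs j < k of c_jk * 1((v_j, v_k) in E').
rhs = sum(sum(1 for i in range(k + 1, n) if A[i, j] and A[i, k]) * Ep[j, k]
          for j, k in combinations(range(n), 2))

print(lhs, rhs)   # the two sums coincide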

cjkc_{jk} is the number of 2-paths from vjv_{j} to vkv_{k} in GG whose middle node viv_{i} satisfies i>ki>k. Because the degree of user vjv_{j} is at most d~max\tilde{d}_{max}, 0cjkd~max0\leq c_{jk}\leq\tilde{d}_{max}. Moreover, there are at most nd~max2n\tilde{d}_{max}^{2} such 2-paths in GG in total, since each of the nn possible middle nodes has at most d~max2\tilde{d}_{max}^{2} pairs of neighbors. Thus, 1j<kncjknd~max2\sum_{1\leq j<k\leq n}c_{jk}\leq n\tilde{d}_{max}^{2}. Since cjk2c_{jk}^{2} is convex in cjkc_{jk} (a Jensen-style argument), the sum in (27) is maximized under these constraints by pushing every cjkc_{jk} to 0 or d~max\tilde{d}_{max}. At most nd~maxn\tilde{d}_{max} of the cjkc_{jk} can equal d~max\tilde{d}_{max}, and the rest are zero. Thus,

𝕍[i=1nti]\displaystyle\operatorname{\mathbb{V}}\left[\sum_{i=1}^{n}t_{i}\right] =p1(1p1)1j<kncjk2\displaystyle=p_{1}(1-p_{1})\sum_{1\leq j<k\leq n}c_{jk}^{2}
p1(1p1)nd~max×d~max2.\displaystyle\leq p_{1}(1-p_{1})n\tilde{d}_{max}\times\tilde{d}_{max}^{2}.

Plugging this into (26), we obtain:

𝕍[f^(G,ε)]\displaystyle\operatorname{\mathbb{V}}[\hat{f}_{\triangle}(G,\varepsilon)] p1(1p1)nd~max3(12p1)2+2nd~max2(1p1)2(12p1)2ε22\displaystyle\leq\frac{p_{1}(1-p_{1})n\tilde{d}_{max}^{3}}{(1-2p_{1})^{2}}+\frac{2n\tilde{d}_{max}^{2}(1-p_{1})^{2}}{(1-2p_{1})^{2}\varepsilon_{2}^{2}}
O(p1nd~max3+nd~max2/ε22(12p1)2)\displaystyle\leq O\left(\frac{p_{1}n\tilde{d}_{max}^{3}+n\tilde{d}_{max}^{2}/\varepsilon_{2}^{2}}{(1-2p_{1})^{2}}\right)
O(eε1(1eε1)2(nd~max3+eε1ε22nd~max2)).\displaystyle\leq O\left(\frac{e^{\varepsilon_{1}}}{(1-e^{\varepsilon_{1}})^{2}}\left(n\tilde{d}_{max}^{3}+\frac{e^{\varepsilon_{1}}}{\varepsilon_{2}^{2}}n\tilde{d}_{max}^{2}\right)\right).

D.9 Proof of Theorem 7

Preliminaries.

We begin by defining a Boolean version of the independent cube in Definition 5, which we call the Boolean independent cube. The Boolean independent cube works for functions g:{0,1}κg:\{0,1\}^{\kappa}\rightarrow\mathbb{R} in the local DP model, where each of κ\kappa\in\mathbb{N} users has a single bit and obfuscates the bit to provide ε\varepsilon-DP. As shown later, there is a one-to-one correspondence between the independent cube in Definition 5 and the Boolean independent cube. Based on this, we show a lower-bound for the Boolean independent cube, and use the lower-bound to prove Theorem 7.

Below we define the Boolean independent cube. For i[κ]i\in[\kappa], let xi{0,1}x_{i}\in\{0,1\} be a bit of user viv_{i}. Let X=(x1,,xκ)X=(x_{1},\ldots,x_{\kappa}). We assume user viv_{i} obfuscates xix_{i} using a randomizer 𝒮i:{0,1}𝒵i\mathcal{S}_{i}:\{0,1\}\rightarrow\mathcal{Z}_{i}, where 𝒮i\mathcal{S}_{i} satisfies ε\varepsilon-DP and 𝒵i\mathcal{Z}_{i} is a range of 𝒮i\mathcal{S}_{i}. Examples of 𝒮i\mathcal{S}_{i} include Warner’s RR. Furthermore, we assume the one-round setting, where each 𝒮i\mathcal{S}_{i} is independent, and where the estimator g^\hat{g} for gg has the form

g^(X)=g~(𝒮1(x1),,𝒮κ(xκ)).\hat{g}(X)=\tilde{g}(\mathcal{S}_{1}(x_{1}),\ldots,\mathcal{S}_{\kappa}(x_{\kappa})). (28)

g~\tilde{g} is an aggregate function that takes 𝒮1(x1),,𝒮κ(xκ)\mathcal{S}_{1}(x_{1}),\ldots,\mathcal{S}_{\kappa}(x_{\kappa}) as input and outputs g^(X)\hat{g}(X).

We will prove a lower bound which uses the following stripped-down form of an independent cube (Definition 5).

Definition 6.

[Boolean (κ,D)(\kappa,D)-independent cube] Let g:{0,1}κg:\{0,1\}^{\kappa}\rightarrow\mathbb{R}, and DD\in\mathbb{R}. We say gg has a Boolean (κ,D)(\kappa,D)-independent cube if for all (x1,,xκ){0,1}κ(x_{1},\ldots,x_{\kappa})\in\{0,1\}^{\kappa} we have

g(x1,,xκ)=g(0,0,,0)+i=1κxiCi,g(x_{1},\ldots,x_{\kappa})=g(0,0,\ldots,0)+\sum_{i=1}^{\kappa}x_{i}C_{i},

where CiC_{i}\in\mathbb{R} satisfies |Ci|D|C_{i}|\geq D for any i[κ]i\in[\kappa].

The following theorem applies to the Boolean independent cube and will help us establish Theorem 7. We prove this theorem in Section D.10.

Theorem 8.

Let g:{0,1}κg:\{0,1\}^{\kappa}\rightarrow\mathbb{R} be a function that has a Boolean (κ,D)(\kappa,D)-independent cube. Let g^(X)\hat{g}(X) be an estimator having the form of (28), where the randomizers 𝒮i\mathcal{S}_{i} are mutually independent and each provides ε\varepsilon-DP. Let XX be drawn uniformly from {0,1}κ\{0,1\}^{\kappa}. Over the randomness both in selecting XX and in 𝒮1,,𝒮κ\mathcal{S}_{1},\ldots,\mathcal{S}_{\kappa}, 𝔼X,𝒮1,,𝒮κ[l22(g(X),g^(X))]=Ω(eε(eε+1)2κD2)\operatorname{\mathbb{E}}_{X,\mathcal{S}_{1},\ldots,\mathcal{S}_{\kappa}}[l_{2}^{2}(g(X),\hat{g}(X))]=\Omega\left(\frac{e^{\varepsilon}}{(e^{\varepsilon}+1)^{2}}\kappa D^{2}\right).

Proof of Theorem 7 using Theorem 8.

To prove Theorem 7, let 𝒜\mathcal{A} be the (n,D)(n,D)-independent cube (Definition 5) for ff given in the statement of Theorem 7. Let GG be the graph, and 𝐀\mathbf{A} be the corresponding symmetric adjacency matrix. Below we sometimes write ff as a function on neighbor lists 𝐚1,,𝐚n\mathbf{a}_{1},\ldots,\mathbf{a}_{n} (rather than GG) because there is a one-to-one correspondence between GG and 𝐚1,,𝐚n\mathbf{a}_{1},\ldots,\mathbf{a}_{n}. Let MM be the perfect matching that defines 𝒜\mathcal{A}. Let n=2κn=2\kappa.

The idea is to pair up users that MM matches to make a new function gg that has a Boolean (κ,D)(\kappa,D)-independent cube and new randomizers 𝒮1,,𝒮κ\mathcal{S}_{1},\ldots,\mathcal{S}_{\kappa} that satisfy ε\varepsilon-DP. In other words, we regard a pair of users in MM as a virtual user (since n=2κn=2\kappa, there are κ\kappa virtual users in total). Then we apply Theorem 8.

Assume that M={(v1,v2),(v3,v4),,(v2κ1,v2κ)}M=\{(v_{1},v_{2}),(v_{3},v_{4}),\ldots,(v_{2\kappa-1},v_{2\kappa})\} without loss of generality (we can construct gg and 𝒮1,,𝒮κ\mathcal{S}_{1},\ldots,\mathcal{S}_{\kappa} for arbitrary MM in the same way). For x1,,xκ{0,1}x_{1},\ldots,x_{\kappa}\in\{0,1\}, define

g(x1,,xκ)=\displaystyle g(x_{1},\ldots,x_{\kappa})= f(𝐚1+x1𝐞2,𝐚2+x1𝐞1,,\displaystyle f(\mathbf{a}_{1}+x_{1}\mathbf{e}_{2},~{}\mathbf{a}_{2}+x_{1}\mathbf{e}_{1},~{}\ldots,
𝐚2κ1+xκ𝐞2κ,𝐚2κ+xκ𝐞2κ1),\displaystyle\hskip 9.3894pt\mathbf{a}_{2\kappa-1}+x_{\kappa}\mathbf{e}_{2\kappa},~{}\mathbf{a}_{2\kappa}+x_{\kappa}\mathbf{e}_{2\kappa-1}),

where 𝐞i{0,1}n\mathbf{e}_{i}\in\{0,1\}^{n} is the ii-th standard basis vector that has 11 in the ii-th coordinate and 0 elsewhere. In other words, xi{0,1}x_{i}\in\{0,1\} indicates whether the ii-th edge in MM should be added to GG. Thus, gg has a Boolean (κ,D)(\kappa,D)-independent cube, and there is a one-to-one correspondence between an (n,D)(n,D)-independent cube 𝒜\mathcal{A} in Definition 5 and (κ,D)(\kappa,D)-Boolean independent cube {0,1}κ\{0,1\}^{\kappa} in Definition 6. Figure 13 shows a (2,2)(2,2)-Boolean independent cube for gg corresponding to the (4,2)(4,2)-independent cube for ff in Figure 3.

Figure 13: (2,2)(2,2)-Boolean independent cube for gg corresponding to the (4,2)(4,2)-independent cube for ff in Figure 3.

Now, for i[κ]i\in[\kappa], define 𝒮i(xi)\mathcal{S}_{i}(x_{i}) for xi{0,1}x_{i}\in\{0,1\} by

𝒮i(xi)=(2i1(𝐚2i1+xi𝐞2i),2i(𝐚2i+xi𝐞2i1)).\displaystyle\mathcal{S}_{i}(x_{i})=(\mathcal{R}_{2i-1}(\mathbf{a}_{2i-1}+x_{i}\mathbf{e}_{2i}),\mathcal{R}_{2i}(\mathbf{a}_{2i}+x_{i}\mathbf{e}_{2i-1})). (29)

In other words, 𝒮i(xi)\mathcal{S}_{i}(x_{i}) is simply the pair of the outputs of users (v2i1,v2i)(v_{2i-1},v_{2i}), with xix_{i} indicating whether to add the edge in MM between them.

Assume that each i\mathcal{R}_{i} is mutually independent and that (1,,n)(\mathcal{R}_{1},\ldots,\mathcal{R}_{n}) provides ε\varepsilon-relationship DP in Definition 3. Then by (3) and (29), each 𝒮i\mathcal{S}_{i} provides ε\varepsilon-DP and is mutually independent.

Define the estimator g^\hat{g} by

g^(x1,,xκ)\displaystyle\hat{g}(x_{1},\ldots,x_{\kappa}) =f~(𝒮1(x1),,𝒮κ(xκ)).\displaystyle=\tilde{f}(\mathcal{S}_{1}(x_{1}),\ldots,\mathcal{S}_{\kappa}(x_{\kappa})).

Then by Theorem 8, for X=(x1,,xκ)X=(x_{1},\ldots,x_{\kappa}),

𝔼X,𝒮1,,𝒮κ[l22(g(X),g^(X))]Ω(eε(eε+1)2κD2).\operatorname{\mathbb{E}}_{X,\mathcal{S}_{1},\ldots,\mathcal{S}_{\kappa}}[l_{2}^{2}(g(X),\hat{g}(X))]\geq\Omega\left(\frac{e^{\varepsilon}}{(e^{\varepsilon}+1)^{2}}\kappa D^{2}\right).

Since there is a one-to-one correspondence between the (n,Dn,D)-independent cube 𝒜\mathcal{A} and the (κ,D\kappa,D)-Boolean independent cube {0,1}κ\{0,1\}^{\kappa}, we also have

𝔼G,1,,n[l22(f(G),f^(G))]Ω(eε(eε+1)2nD2),\operatorname{\mathbb{E}}_{G,\mathcal{R}_{1},\ldots,\mathcal{R}_{n}}[l_{2}^{2}(f(G),\hat{f}(G))]\geq\Omega\left(\frac{e^{\varepsilon}}{(e^{\varepsilon}+1)^{2}}nD^{2}\right),

where GG is drawn uniformly from 𝒜\mathcal{A}, which proves Theorem 7. ∎

D.10 Proof of Theorem 8

Assume that 𝒮i:{0,1}𝒵i\mathcal{S}_{i}:\{0,1\}\rightarrow\mathcal{Z}_{i}. For X=(x1,,xκ){0,1}κX=(x_{1},\ldots,x_{\kappa})\in\{0,1\}^{\kappa}, let S(X)=(𝒮1(x1),,𝒮κ(xκ))S(X)=(\mathcal{S}_{1}(x_{1}),\ldots,\mathcal{S}_{\kappa}(x_{\kappa})) and Z=(z1,,zκ)Z=(z_{1},\ldots,z_{\kappa}) with zi𝒵iz_{i}\in\mathcal{Z}_{i}. We rewrite the quantity of interest as

𝔼X,S(X)[l22(g(X),g^(X))]=𝔼X,S(X)[(g(X)g~(S(X)))2].\operatorname{\mathbb{E}}_{X,S(X)}[l_{2}^{2}(g(X),\hat{g}(X))]=\operatorname{\mathbb{E}}_{X,S(X)}[(g(X)-\tilde{g}(S(X)))^{2}].

By the law of total expectation, this quantity is the same as the expected value of the conditional expected value of (g(X)g~(S(X)))2(g(X)-\tilde{g}(S(X)))^{2} given S(X)=ZS(X)=Z:

𝔼X,S(X)[(g(X)g~(S(X)))2]\displaystyle\operatorname{\mathbb{E}}_{X,S(X)}[(g(X)-\tilde{g}(S(X)))^{2}]
=𝔼S(X)𝔼X[(g(X)g~(Z))2|S(X)=Z].\displaystyle=\operatorname{\mathbb{E}}_{S(X)}\operatorname{\mathbb{E}}_{X}[(g(X)-\tilde{g}(Z))^{2}|S(X)=Z]. (30)

Let μZ=𝔼X[g(X)|S(X)=Z]\mu_{Z}=\operatorname{\mathbb{E}}_{X}[g(X)|S(X)=Z]. Then the inner expectation in (30) can be written as follows:

𝔼X[(g(X)g~(Z))2|S(X)=Z]\displaystyle\;\operatorname{\mathbb{E}}_{X}[(g(X)-\tilde{g}(Z))^{2}|S(X)=Z]
=𝔼X[((g(X)μZ)+(μZg~(Z)))2|S(X)=Z]\displaystyle=\operatorname{\mathbb{E}}_{X}[((g(X)-\mu_{Z})+(\mu_{Z}-\tilde{g}(Z)))^{2}|S(X)=Z]
=𝔼X[(g(X)μZ)2|S(X)=Z]\displaystyle=\operatorname{\mathbb{E}}_{X}[(g(X)-\mu_{Z})^{2}|S(X)=Z]
+2(μZg~(Z))𝔼X[(g(X)μZ)|S(X)=Z]\displaystyle\hskip 9.95845pt+2(\mu_{Z}-\tilde{g}(Z))\operatorname{\mathbb{E}}_{X}[(g(X)-\mu_{Z})|S(X)=Z]
+(μZg~(Z))2\displaystyle\hskip 9.95845pt+(\mu_{Z}-\tilde{g}(Z))^{2}
=𝔼X[(g(X)μZ)2|S(X)=Z]+(μZg~(Z))2\displaystyle=\operatorname{\mathbb{E}}_{X}[(g(X)-\mu_{Z})^{2}|S(X)=Z]+(\mu_{Z}-\tilde{g}(Z))^{2}
=𝕍X[g(X)|S(X)=Z]+(μZg~(Z))2.\displaystyle=\operatorname{\mathbb{V}}_{X}[g(X)|S(X)=Z]+(\mu_{Z}-\tilde{g}(Z))^{2}.

Thus, it suffices to show that 𝕍X[g(X)|S(X)=Z]Ω(eε(1+eε)2κD2)\operatorname{\mathbb{V}}_{X}[g(X)|S(X)=Z]\geq\Omega\left(\frac{e^{\varepsilon}}{(1+e^{\varepsilon})^{2}}\kappa D^{2}\right). For B=(b1,,bκ){0,1}κB=(b_{1},\ldots,b_{\kappa})\in\{0,1\}^{\kappa}, we have

Pr[X=B|S(X)=Z]=Pr[X=B]Pr[S(X)=Z|X=B]Pr[S(X)=Z].\displaystyle\Pr[X=B|S(X)=Z]=\frac{\Pr[X=B]\Pr[S(X)=Z|X=B]}{\Pr[S(X)=Z]}.

Since Pr[S(X)=Z]\Pr[S(X)=Z] does not depend on BB and Pr[X=B]=12κ\Pr[X=B]=\frac{1}{2^{\kappa}}, Pr[X=B|S(X)=Z]\Pr[X=B|S(X)=Z] can also be expressed as

Pr[X=B|S(X)=Z]\displaystyle\Pr[X=B|S(X)=Z] Pr[S(X)=Z|X=B].\displaystyle\propto\Pr[S(X)=Z|X=B]. (31)

Since 𝒮1,,𝒮κ\mathcal{S}_{1},\ldots,\mathcal{S}_{\kappa} are run independently, we have

Pr[S(X)=Z|X=B]\displaystyle\Pr[S(X)=Z|X=B] =Pr[𝒮1(b1)=z1,,𝒮κ(bκ)=zκ]\displaystyle=\Pr[\mathcal{S}_{1}(b_{1})=z_{1},\ldots,\mathcal{S}_{\kappa}(b_{\kappa})=z_{\kappa}]
=i=1κPr[𝒮i(bi)=zi].\displaystyle=\prod_{i=1}^{\kappa}\Pr[\mathcal{S}_{i}(b_{i})=z_{i}].

Define

pi=Pr[𝒮i(1)=zi]Pr[𝒮i(0)=zi]+Pr[𝒮i(1)=zi].\displaystyle p_{i}=\frac{\Pr[\mathcal{S}_{i}(1)=z_{i}]}{\Pr[\mathcal{S}_{i}(0)=z_{i}]+\Pr[\mathcal{S}_{i}(1)=z_{i}]}.

Because each 𝒮i\mathcal{S}_{i} satisfies ε\varepsilon-DP, we have 11+eεpieε1+eε\frac{1}{1+e^{\varepsilon}}\leq p_{i}\leq\frac{e^{\varepsilon}}{1+e^{\varepsilon}}. By (31) and B{0,1}κPr[X=B|S(X)=Z]=1\sum_{B\in\{0,1\}^{\kappa}}\Pr[X=B|S(X)=Z]=1, we have

Pr[X=B|S(X)=Z]=i=1κ(pi)bi(1pi)1bi.\displaystyle\Pr[X=B|S(X)=Z]=\prod_{i=1}^{\kappa}(p_{i})^{b_{i}}(1-p_{i})^{1-b_{i}}. (32)

This means that, conditioned on S(X)=ZS(X)=Z, the bits x1,,xκx_{1},\ldots,x_{\kappa} are independent, with xix_{i} distributed as Bernoulli(pi)Bernoulli(p_{i}) for i[κ]i\in[\kappa].

Now, because gg has a Boolean (κ,D)(\kappa,D)-independent cube, there are C1,,CκC_{1},\ldots,C_{\kappa}\in\mathbb{R} with |Ci|D|C_{i}|\geq D such that

g(X)\displaystyle g(X) =g(0,,0)+i=1κxiCi.\displaystyle=g(0,\ldots,0)+\sum_{i=1}^{\kappa}x_{i}C_{i}.

By (32), xix_{i} is an independent draw from Bernoulli(pi)Bernoulli(p_{i}) given S(X)=ZS(X)=Z. Thus, the variance of g(X)g(X) given S(X)=ZS(X)=Z is

𝕍X[g(X)|S(X)=Z]\displaystyle\operatorname{\mathbb{V}}_{X}[g(X)|S(X)=Z] =i=1κ𝕍[xi|S(X)=Z]Ci2\displaystyle=\sum_{i=1}^{\kappa}\operatorname{\mathbb{V}}[x_{i}|S(X)=Z]C_{i}^{2}
i=1κpi(1pi)D2\displaystyle\geq\sum_{i=1}^{\kappa}p_{i}(1-p_{i})D^{2}
i=1κeε(1+eε)2D2\displaystyle\geq\sum_{i=1}^{\kappa}\frac{e^{\varepsilon}}{(1+e^{\varepsilon})^{2}}D^{2}
κeε(1+eε)2D2.\displaystyle\geq\kappa\frac{e^{\varepsilon}}{(1+e^{\varepsilon})^{2}}D^{2}.
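The last two steps use the fact that p(1−p) is minimized over the interval [1/(1+eε), eε/(1+eε)] at its endpoints, where it equals eε/(1+eε)². A tiny numeric check (a minimal Python sketch, not part of the proof):

import numpy as np

for eps in (0.1, 0.5, 1.0, 2.0):
    lo, hi = 1 / (1 + np.exp(eps)), np.exp(eps) / (1 + np.exp(eps))
    p = np.linspace(lo, hi, 10_001)
    # p*(1-p) is concave and symmetric about 1/2, so its minimum over [lo, hi]
    # is attained at the endpoints, where it equals e^eps / (1 + e^eps)^2.
    assert np.all(p * (1 - p) >= np.exp(eps) / (1 + np.exp(eps)) ** 2 - 1e-12)
print("p(1-p) is minimized at the interval endpoints")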