
Crossings in Randomly Embedded Graphs

Santiago Arenas-Velilla
Centro de Investigación en Matemáticas,
Guanajuato, Gto. 36000, Mexico
[email protected]
   Octavio Arizmendi
Centro de Investigación en Matemáticas,
Guanajuato, Gto. 36000, Mexico
[email protected]
Abstract

We consider the number of crossings in a graph which is embedded randomly on a set of points in convex position. We give a bound in Kolmogorov distance to the normal distribution, which implies a convergence rate of order $n^{-1/2}$ for various families of graphs, including random chord diagrams and full cycles.

keywords: crossings, random graphs, normal approximation.

1 Introduction

Let $G=(V,E)$ be a graph with vertex set $V$ and edge set $E\subset V\times V$, which is embedded randomly in a convex set of points. We are interested in the random variable counting the number of crossings under this embedding.

Formally, for a graph $G=(V,E)$ with vertex set $[n]=\{1,\ldots,n\}$, the embedding given by a permutation $\pi:[n]\to[n]$ is the graph isomorphism induced by $\pi$. The crossings of such an embedding are given by the set $\{(a,b,c,d)\mid\{a,b\},\{c,d\}\in E,\ \pi(a)>\pi(c)>\pi(b)>\pi(d)\}$. Figure 1 shows graphical representations of two embeddings of the path graph $P_{20}$, the first with $40$ crossings and the second with $60$ crossings.
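To make the definition concrete, the following sketch (our own illustration, not part of the formal development) counts the crossings of an embedding, given an edge list and the permutation of vertex positions. It relies only on the observation that two vertex-disjoint edges cross if and only if their endpoints alternate around the convex point set.

```python
from itertools import combinations

def count_crossings(edges, pos):
    """Count crossings of a graph embedded on points in convex position.

    edges: list of pairs (u, v), with vertices labelled 0..n-1
    pos:   pos[v] is the position of vertex v on the convex polygon
           (the permutation pi of the text)
    """
    total = 0
    for (a, b), (c, d) in combinations(edges, 2):
        if len({a, b, c, d}) < 4:
            continue  # edges sharing a vertex cannot cross
        lo, hi = sorted((pos[a], pos[b]))
        # alternation test: exactly one endpoint of {c, d} lies
        # strictly between the endpoints of {a, b}
        total += (lo < pos[c] < hi) != (lo < pos[d] < hi)
    return total
```

For instance, a $2$-matching embedded in alternating positions yields one crossing, while nested or disjoint positions yield none.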

Figure 1: Examples of embeddings of a path with 20 vertices: (a) $P_{20}$ with 40 crossings; (b) $P_{20}$ with 60 crossings.

To the best of our knowledge, there is not much work on general graphs. The paper by Flajolet and Noy [4] considers the case where $G$ is a union of disjoint edges (also called a matching, a pairing, or a chord diagram) and proves a central limit theorem. This result is also proved, using weighted dependency graphs, in [3]. More importantly for us, the recent paper by Paguyo [5] gives a rate of convergence in that case. Another related paper is [1], where the authors consider a uniform random tree.

In this paper, we show that, under certain asymptotic behaviour of very precise combinatorial quantities of the graph, the random variable counting the number of crossings in a random embedding is approximately normal, with mean $\mu$ and variance $\sigma^{2}$ which can be calculated precisely (see Lemmas 1 and 2). Moreover, we give a convergence rate in this limit theorem.

Theorem 1.

Let $G$ be a graph with maximum degree $\Delta$, and let $X$ be the number of crossings of a random uniform embedding of $G$. Let $\mu$ and $\sigma^{2}$ be the mean and the variance of $X$. Then, with $W=(X-\mu)/\sigma$,

d_{Kol}(W,Z)\leq\frac{4m_{2}(G)\Delta m}{3\sigma^{2}}\left(\frac{6\Delta m}{\sigma}+\sqrt{1-\frac{6m_{4}(G)}{m_{2}(G)^{2}}+\frac{m(\Delta-1)^{2}}{2m_{2}(G)}}\right), (1.1)

where $m$ is the number of edges of $G$, $m_{r}(G)$ is the number of $r$-matchings of $G$, and $Z$ is a standard Gaussian random variable.

Examples of families of graphs that satisfy such a normal approximation, with a rate proportional to $1/\sqrt{n}$, are pairings, cycle graphs, path graphs, and unions of triangles, among others. We explain these examples in detail in Section 5.

We should mention that our method of proof resembles the one used by Paguyo in [5] for the case of pairings. The main idea is to write the number of crossings as a sum of indicator variables and then consider the size bias transform in the case of sums of Bernoulli variables. However, there is a crucial difference between Paguyo's way of writing such variables and ours, which in our opinion is more flexible. To be precise, Paguyo considers, for each four points $a<b<c<d$ on the circle, the indicator that there is a crossing formed by the edges $\{a,c\}$ and $\{b,d\}$. This random variable is easy to handle for the case of a pairing, but for a general graph even calculating the probability that such an indicator equals $1$ can be very complicated. Our approach instead looks at a given $2$-matching in the graph $G$ and considers the indicator random variable that this $2$-matching, when embedded randomly, forms a crossing.

2 Preliminaries

In this section we establish some notation for graphs and recall the main tool that we will use to quantify convergence to a normal distribution: the size bias transform.

2.1 Notation and Definitions on Graphs

A graph is a pair $G=(V,E)$, where $E\subset\{\{v,w\}\mid v,w\in V\}$. Elements of $V$ are called vertices and elements of $E$ are called edges. An edge $\{v,w\}$ is sometimes written as $v\sim_{G}w$, or $v\sim w$ if the underlying graph $G$ is clear. The number of vertices, $|V|$, will be denoted by $n$, while the number of edges, $|E|$, will be denoted by $m$.

For a vertex $v$, we say that $w$ is a neighbour of $v$ if $\{v,w\}\in E$. The number of neighbours of $v$ is called the degree of $v$, denoted by $\deg(v)$. The largest degree among all vertices in a graph will be denoted by $\Delta$.

A subgraph of $G$ is a graph $H=(W,F)$ such that $W\subset V$ and $F\subset E$. We say that $G=(V,E)$ and $G'=(V',E')$ are isomorphic if there exists a bijection $\varphi:V\to V'$ such that $\{u,v\}\in E$ if and only if $\{\varphi(u),\varphi(v)\}\in E'$, for all $u,v\in V$.

An $r$-matching in a graph $G$ is a set of $r$ edges of $G$, no two of which have a vertex in common. We denote by $M_{r}(G)$ the set of $r$-matchings of $G$ and by $m_{r}(G)$ its cardinality. Note that $m_{1}=m$ is the number of edges of the graph $G$.
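For small graphs the quantities $m_{r}(G)$ can be computed by brute force; the following sketch (ours, for illustration only) enumerates all $r$-matchings directly from the definition.

```python
from itertools import combinations

def r_matchings(edges, r):
    """All r-matchings of a graph: sets of r pairwise vertex-disjoint edges."""
    return [m for m in combinations(edges, r)
            if len({v for e in m for v in e}) == 2 * r]
```

For example, for the path on $6$ vertices this recovers $m_{1}=5$, $m_{2}=6$ and $m_{3}=1$, matching the formula $m_{r}(P_{n})=\binom{n-r}{r}$ used later in Example 2.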

2.2 Size Bias Transform

Let $X$ be a positive random variable with finite mean $\mu$. We say that the random variable $X^{s}$ has the size bias distribution with respect to $X$ if for all $f$ such that $\mathbb{E}[Xf(X)]<\infty$, we have

\mathbb{E}[Xf(X)]=\mu\mathbb{E}[f(X^{s})].

In the case $X=\sum_{i=1}^{n}X_{i}$, with the $X_{i}$'s positive random variables with finite means $\mu_{i}$, there is a recipe (Proposition 3.21 from [6]) to construct $X^{s}$ from the individual size bias distributions of the summands $X_{i}$:

  1. For each $i=1,\ldots,n$, let $X_{i}^{s}$ have the size bias distribution with respect to $X_{i}$, independent of the vectors $(X_{j})_{j\neq i}$ and $(X_{j}^{s})_{j\neq i}$. Given $X_{i}^{s}=x$, define the vector $(X_{j}^{(i)})_{j\neq i}$ to have the distribution of $(X_{j})_{j\neq i}$ conditional on $X_{i}=x$.

  2. Choose a random index $I$ with $\mathbb{P}(I=i)=\mu_{i}/\mu$, where $\mu=\sum\mu_{i}$, independent of all else.

  3. Define $X^{s}=\sum_{j\neq I}X_{j}^{(I)}+X_{I}^{s}$.

It is important to notice that the random variables need not be independent nor identically distributed. Also, $X$ can be an infinite sum (see Proposition 2.2 from [2]).

If $X$ is a Bernoulli random variable, then $X^{s}=1$. Indeed, if $\mathbb{P}(X=1)=p$, then $\mathbb{E}(X)=p=\mu$ and

\mathbb{E}[Xf(X)]=(1-p)\cdot 0\cdot f(0)+p\cdot 1\cdot f(1)=pf(1)=\mu f(1)=\mu\mathbb{E}[f(1)].

Therefore, by specializing the above recipe, we obtain the following corollary (Corollary 3.24 from [6]).

Corollary 1.

Let $X_{1},X_{2},\ldots,X_{n}$ be Bernoulli random variables with parameters $p_{i}$. For each $i=1,\ldots,n$, let $(X_{j}^{(i)})_{j\neq i}$ have the distribution of $(X_{j})_{j\neq i}$ conditional on $X_{i}=1$. If $X=\sum_{i=1}^{n}X_{i}$, $\mu=\mathbb{E}[X]$, and $I$ is chosen independently of all else with $\mathbb{P}(I=i)=p_{i}/\mu$, then $X^{s}=1+\sum_{j\neq I}X_{j}^{(I)}$ has the size bias distribution of $X$.

The following result (Theorem 5.3 from [2]) gives bounds for the Kolmogorov distance in the case that a bounded size bias coupling exists. This distance is given by

d_{Kol}(X,Y):=\sup_{z\in\mathbb{R}}|F_{X}(z)-F_{Y}(z)|,

where $F_{X}$ and $F_{Y}$ are the distribution functions of the random variables $X$ and $Y$.

Theorem 2.

Let $X$ be a nonnegative random variable with finite mean $\mu$ and finite, positive variance $\sigma^{2}$, and suppose that $X^{s}$, having the size bias distribution of $X$, may be coupled to $X$ so that $|X^{s}-X|\leq A$ for some constant $A$. Then, with $W=(X-\mu)/\sigma$,

d_{Kol}(W,Z)\leq\frac{6\mu A^{2}}{\sigma^{3}}+\frac{2\mu\Psi}{\sigma^{2}}, (2.1)

where $Z$ is a standard Gaussian random variable, and $\Psi$ is given by

\Psi=\sqrt{\mathrm{Var}(\mathbb{E}[X^{s}-X|X])}. (2.2)

3 Mean and Variance

Let $S_{n}$ be the set of permutations of $n$ elements. For a permutation $\pi\in S_{n}$ and a graph $G$, let $G_{\pi}$ be the graph whose edges are given by

v\sim_{G}w\Longleftrightarrow\pi(v)\sim_{G_{\pi}}\pi(w),\qquad\forall v,w\in V.

For a random uniform permutation $\pi$, let $X:=X(G_{\pi})$ be the random variable that counts the number of crossings of $G_{\pi}$, that is,

X=\sum_{i\in M_{2}(G)}\mathbb{I}_{\{i\text{ is a crossing in }G_{\pi}\}}=\sum_{i\in M_{2}(G)}Y_{i}, (3.1)

where $M_{2}(G)$ is the set of $2$-matchings of $G$.

In this section we give formulas for the mean and variance of the random variable $X$ in terms of the number of subgraphs of certain types.

Lemma 1.

For a graph $G$, if $X$ denotes the number of crossings in a random embedding on a set of $n$ points in convex position, then its expectation is given by

\mu:=\mu(G)=\mathbb{E}(X)=\frac{1}{3}m_{2}(G),

where $m_{2}(G)$ denotes the number of $2$-matchings of $G$.

Proof.

For each $i\in M_{2}(G)$, we notice that $Y_{i}\sim\mathrm{Bernoulli}(1/3)$. Indeed, if $i$ consists of the two edges $\{v_{1},v_{2}\}$ and $\{v_{3},v_{4}\}$, the probability of obtaining a crossing depends only on the cyclic order in which $v_{1},v_{2},v_{3}$ and $v_{4}$ are embedded in $\{1,\dots,n\}$, not on their precise positions. Of the $6$ possible cyclic orders, exactly $2$ (that is, $1/3$ of them) yield a crossing. See Figure 2 for the $6$ possible cyclic orders of $v_{1},v_{2},v_{3}$ and $v_{4}$.

Figure 2: Possible cyclic orders for a $2$-matching.

Summing over all $i$, the expected value of $X$ is

\mathbb{E}X=\sum_{i\in M_{2}(G)}\mathbb{P}(i\text{ is a crossing})=\frac{1}{3}m_{2}(G),

as desired. ∎
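Lemma 1 can be confirmed numerically for small graphs by averaging over all $n!$ embeddings. The sketch below (our own verification code, with the crossing counter reimplemented inline) returns $\mathbb{E}X$ as an exact fraction.

```python
from fractions import Fraction
from itertools import combinations, permutations
from math import factorial

def count_crossings(edges, pos):
    # two vertex-disjoint edges cross iff their endpoints alternate
    total = 0
    for (a, b), (c, d) in combinations(edges, 2):
        if len({a, b, c, d}) < 4:
            continue
        lo, hi = sorted((pos[a], pos[b]))
        total += (lo < pos[c] < hi) != (lo < pos[d] < hi)
    return total

def exact_mean(edges, n):
    """Average number of crossings over all n! embeddings."""
    s = sum(count_crossings(edges, p) for p in permutations(range(n)))
    return Fraction(s, factorial(n))
```

For instance, the cycle $C_{4}$ has $m_{2}=2$, and the exact mean is $2/3=m_{2}/3$, as the lemma predicts.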

For the second moment it is necessary to expand $X^{2}$ to get

\mathbb{E}X^{2}=\mathbb{E}\sum_{i,j\in M_{2}(G)}\mathbb{I}_{\{i,j\text{ are crossings}\}}=\sum_{i,j\in M_{2}(G)}\mathbb{P}(i,j\text{ are crossings}).

The analysis of $\mathbb{E}X^{2}$ is significantly more complicated, since $\mathbb{P}(i,j\text{ are crossings})$ depends on how many edges and vertices the two $2$-matchings $i$ and $j$ share. Thus the previous sum can be divided into $8$ types, depending on how the $2$-matchings $i$ and $j$ share edges and vertices, as shown in Figure 3. We call these different configurations the "kinds of pairs of $2$-matchings".

Figure 3: Kinds of pairs of $2$-matchings in the sum for the second moment of $X$.

The probabilities that both $i$ and $j$ are crossings, for each kind of pair of $2$-matchings, are the following (with the obvious abuse of notation):

\mathbb{P}(C_{1})=\frac{1}{9},\quad\mathbb{P}(C_{2})=\frac{1}{9},\quad\mathbb{P}(C_{3})=\frac{2}{15},\quad\mathbb{P}(C_{4})=\frac{7}{60},
\mathbb{P}(C_{5})=\frac{1}{10},\quad\mathbb{P}(C_{6})=\frac{1}{12},\quad\mathbb{P}(C_{7})=\frac{1}{6},\quad\mathbb{P}(C_{8})=\frac{1}{3}.

Thus, we arrive to the following.

Lemma 2.

The second moment of $X$ is given by the formula

\mathbb{E}X^{2}=\frac{6}{9}m_{4}(G)+\frac{4}{5}m_{3}(G)+\frac{1}{3}m_{2}(G)+\frac{4}{9}S_{2}+\frac{7}{15}S_{4}+\frac{1}{5}S_{5}+\frac{1}{6}S_{6}+\frac{1}{3}S_{7}, (3.2)

where $S_{i}$ is the number of subgraphs of $G$ of type $C_{i}$.

Before proving our main result we apply the above lemmas to a few examples.

Example 1 (Pairing).

Consider $G$ to be a disjoint union of $n$ copies of $K_{2}$. The expectation is given by

\mathbb{E}X=\frac{1}{3}m_{2}(G)=\frac{n(n-1)}{6}.

For the variance we only need to consider $m_{2}(G)$, $m_{3}(G)$ and $m_{4}(G)$, since the other types of subgraphs are not present in $G$. The number of $r$-matchings is given by $m_{r}(G)=\binom{n}{r}$, since any choice of $r$ different edges is an $r$-matching, so we obtain

\mathbb{E}X^{2}=\frac{6}{9}\binom{n}{4}+\frac{12}{15}\binom{n}{3}+\frac{1}{3}\binom{n}{2}=\frac{n^{4}}{36}-\frac{n^{3}}{30}+\frac{13n^{2}}{180}-\frac{n}{15},

and thus the variance is

\mathrm{Var}(X)=\mathbb{E}X^{2}-(\mathbb{E}X)^{2}=\frac{n(n-1)(n+3)}{45}.
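These formulas can be checked by exhaustive enumeration for small $n$; the sketch below (ours) computes the exact mean and variance for the pairing with $n=3$ edges on $6$ points, which the formulas above predict to be $1$ and $4/5$.

```python
from fractions import Fraction
from itertools import combinations, permutations
from math import factorial

def count_crossings(edges, pos):
    total = 0
    for (a, b), (c, d) in combinations(edges, 2):
        if len({a, b, c, d}) < 4:
            continue
        lo, hi = sorted((pos[a], pos[b]))
        total += (lo < pos[c] < hi) != (lo < pos[d] < hi)
    return total

def exact_mean_var(edges, n):
    """Exact mean and variance over all n! embeddings."""
    vals = [count_crossings(edges, p) for p in permutations(range(n))]
    mean = Fraction(sum(vals), factorial(n))
    second = Fraction(sum(v * v for v in vals), factorial(n))
    return mean, second - mean ** 2

# pairing with n = 3 disjoint edges: mean n(n-1)/6, variance n(n-1)(n+3)/45
mean, var = exact_mean_var([(0, 1), (2, 3), (4, 5)], 6)
```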
Example 2 (Path).

Let $P_{n}$ be the path graph on $n$ vertices. In this case, the number of $r$-matchings is $m_{r}(P_{n})=\binom{n-r}{r}$, so we obtain that

m_{4}=\binom{n-4}{4},\quad m_{3}=\binom{n-3}{3},\quad m_{2}=\binom{n-2}{2},\quad S_{2}=3\binom{n-4}{3},
S_{4}=\binom{n-4}{2},\quad S_{5}=2\binom{n-4}{2},\quad S_{6}=n-4,\quad S_{7}=2\binom{n-3}{2}.

Then,

\mathbb{E}X^{2}=\frac{n^{4}}{36}-\frac{23n^{3}}{90}+\frac{35n^{2}}{36}-\frac{86n}{45}+\frac{5}{3},

and thus the variance is given by

\mathrm{Var}(X)=\mathbb{E}X^{2}-(\mathbb{E}X)^{2}=\frac{n^{3}}{45}-\frac{n^{2}}{18}-\frac{11n}{45}+\frac{2}{3}.
Example 3 (Cycle).

Let $C_{n}$ be the cycle graph on $n$ vertices. The number of $r$-matchings is given by $m_{r}(C_{n})=\frac{n}{r}\binom{n-r-1}{r-1}$, so

\mathbb{E}X=\frac{m_{2}}{3}=\frac{n(n-3)}{6}.

On the other hand, the numbers of subgraphs are given by

m_{4}=\frac{n}{4}\binom{n-5}{3},\quad m_{3}=\frac{n}{3}\binom{n-4}{2},\quad m_{2}=\frac{n(n-3)}{2},\quad S_{2}=n\binom{n-5}{2},
S_{4}=\frac{n(n-5)}{2},\quad S_{5}=n(n-5),\quad S_{6}=n,\quad S_{7}=n(n-4),

from where the second moment is

\mathbb{E}X^{2}=\frac{n^{4}}{36}-\frac{13n^{3}}{90}+\frac{47n^{2}}{180}-\frac{n}{3},

and thus the variance is given by

\mathrm{Var}(X)=\mathbb{E}X^{2}-(\mathbb{E}X)^{2}=\frac{n^{3}}{45}+\frac{n^{2}}{90}-\frac{n}{3}.
Example 4 (Triangles).

Let $G$ be the disjoint union of $n$ copies of $K_{3}$. In this case, subgraphs of the types $C_{5}$ and $C_{6}$ are not present in $G$. In order to obtain an $r$-matching for $r\geq 2$, we need to choose $r$ triangles, and for each one we have $3$ options to form the matching, so the number of $r$-matchings is $m_{r}(G)=3^{r}\binom{n}{r}$. Similarly, the numbers of the other types of subgraphs are given by

m_{4}=3^{4}\binom{n}{4},\quad m_{3}=3^{3}\binom{n}{3},\quad m_{2}=3^{2}\binom{n}{2},
S_{2}=3^{4}\binom{n}{3},\quad S_{4}=3^{2}\binom{n}{2},\quad S_{7}=2\cdot 3^{2}\binom{n}{2}.

Then, the expectation and the second moment are

\mathbb{E}X=\frac{1}{3}m_{2}(G)=\frac{3n(n-1)}{2},\qquad\mathbb{E}X^{2}=\frac{9n^{4}}{4}-\frac{39n^{3}}{10}+\frac{51n^{2}}{20}-\frac{9n}{10},

and thus the variance is

\mathrm{Var}(X)=\mathbb{E}X^{2}-(\mathbb{E}X)^{2}=\frac{3n^{3}}{5}+\frac{3n^{2}}{10}-\frac{9n}{10}.
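As with the previous examples, these values can be verified exhaustively for small $n$; for $n=2$ triangles the formulas give $\mathbb{E}X=3$ and $\mathrm{Var}(X)=21/5$, which the sketch below (ours) reproduces by enumerating all $6!$ embeddings.

```python
from fractions import Fraction
from itertools import combinations, permutations
from math import factorial

def count_crossings(edges, pos):
    total = 0
    for (a, b), (c, d) in combinations(edges, 2):
        if len({a, b, c, d}) < 4:
            continue
        lo, hi = sorted((pos[a], pos[b]))
        total += (lo < pos[c] < hi) != (lo < pos[d] < hi)
    return total

# two disjoint triangles on 6 points
triangles = [(0, 1), (1, 2), (0, 2), (3, 4), (4, 5), (3, 5)]
vals = [count_crossings(triangles, p) for p in permutations(range(6))]
mean = Fraction(sum(vals), factorial(6))
var = Fraction(sum(v * v for v in vals), factorial(6)) - mean ** 2
```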

4 Proof of the Main Theorem

4.1 Construction of Size Bias Transform

Let $\pi$ be a fixed permutation and let $I=(e,f)$ be a random index chosen with probability $\mathbb{P}(I=i)=1/m_{2}(G)$, corresponding to a $2$-matching (so that $e,f$ are edges), independent of $\pi$. For this fixed $\pi$ we have an embedding $G_{\pi}$. We construct $\pi^{s}$ as follows.

  • If $\pi$ is such that $G_{\pi}$ has a crossing at $I$, we define $\pi^{s}=\pi$.

  • Otherwise, we perform the following process to obtain a permutation with a crossing at $I$. Suppose $e=\{u_{1},u_{2}\}$ and $f=\{v_{1},v_{2}\}$; under $\pi$ these edges are $\pi(e)=\{\pi(u_{1}),\pi(u_{2})\}$ and $\pi(f)=\{\pi(v_{1}),\pi(v_{2})\}$. Since they do not cross, without loss of generality we can assume that $\pi(u_{1})<\pi(v_{1})<\pi(v_{2})<\pi(u_{2})$. Now, we choose a vertex uniformly at random among the vertices of the edges of $I$. If the chosen vertex is $u_{1}$ or $v_{1}$, we leave these fixed, swap the positions $\pi(v_{2})$ and $\pi(u_{2})$, and define $\pi^{s}$ as the resulting permutation. If we choose $u_{2}$ or $v_{2}$, we do the same process, leaving these vertices fixed and exchanging $\pi(v_{1})$ and $\pi(u_{1})$. In this way we obtain a permutation $\pi^{s}$ that has a crossing at $I$.

Note that $\pi^{s}$ is a uniform random permutation conditioned on the event that the $\pi(u_{i})$'s and $\pi(v_{i})$'s alternate in the cyclic order. This in turn means that $G_{\pi^{s}}$ is a uniform random embedding conditioned on the event that $I$ forms a crossing.

In summary, we obtain that

X(G_{\pi^{s}})=\sum_{i\in M_{2}(G)}\mathbb{I}_{\{\pi^{s}(i)\text{ is a crossing in }G_{\pi^{s}}\}}=\sum_{i\in M_{2}(G)}Y_{\pi^{s}(i)},

satisfies that $\{Y_{\pi^{s}(j)}\}_{j\neq I}$ has the distribution of $\{Y_{\pi(j)}\}_{j\neq I}$ conditional on $Y_{\pi(I)}=1$. Then, by Corollary 1, we get that $X^{s}=X(G_{\pi^{s}})$ has the size bias distribution of $X$.

Define $X^{s}=X(G_{\pi^{s}})$ as the size bias transform of $X(G_{\pi})$. By construction, $|X^{s}-X|\leq 2\Delta(m-1)$, since in the worst case each of the at most $2\Delta$ edges incident to the two swapped vertices can gain or lose a crossing with each of the remaining $m-1$ edges.

4.2 Bounding the Variance of $\mathbb{E}[X^{s}-X|X]$

In order to use Theorem 2 we need a bound for the variance of a conditional expectation, which depends on $X$ and its size bias transform $X^{s}$. A bound for that variance is given in the next lemma, which is one of the main results of this paper.

Lemma 3.

Let $G=(V,E)$ be a graph with maximum degree $\Delta$, and let $X$ be the number of crossings of a random uniform embedding of $G$. Then

\mathrm{Var}(\mathbb{E}[X^{s}-X|X])\leq 4\Delta^{2}(m-1)^{2}\left(1-\frac{6m_{4}(G)}{m_{2}(G)^{2}}+\frac{(\Delta-1)^{2}(m-4)}{2m_{2}(G)}\right), (4.1)

where $X^{s}$ is the size bias transform of $X$, $m$ is the number of edges of the graph $G$ and $m_{r}(G)$ is the number of $r$-matchings of the graph $G$.

Proof.

Note that

\mathbb{E}[X^{s}-X|X]=\sum_{i\in M_{2}(G)}\mathbb{E}[X^{s}-X|X,I=i]\,\mathbb{P}(I=i)=\frac{1}{m_{2}(G)}\sum_{i\in M_{2}(G)}(X^{(i)}-X),

where $X^{(i)}$ denotes $X^{s}$ conditioned to have a crossing at the index $i$. This gives that

\mathrm{Var}(\mathbb{E}[X^{s}-X|X])=\frac{1}{m_{2}(G)^{2}}\sum_{i,j\in M_{2}(G)}\mathrm{Cov}(X^{(i)}-X,X^{(j)}-X). (4.2)

We identify two kinds of terms in the covariance sum: those whose indices satisfy $V(i)\cap V(j)\neq\emptyset$ and those satisfying $V(i)\cap V(j)=\emptyset$, where $V(i)$ denotes the set of vertices of the $2$-matching $i$.

Case $V(i)\cap V(j)\neq\emptyset$: In this case, we have that

|\{i,j\in M_{2}(G):V(i)\cap V(j)\neq\emptyset\}|=m_{2}(G)^{2}-\binom{4}{2}m_{4}(G)=m_{2}(G)^{2}-6m_{4}(G).

From the construction of the size bias transform, we have that $|X^{(i)}-X|\leq 2\Delta(m-1)$. Indeed, if $X^{s}$ has a crossing at the matching $i$, there are two options for the matching $i$ in $X$: either it is a crossing or it is not. If $i$ is a crossing, then $X^{(i)}=X$, because no edges need to be moved. On the other hand, if $i$ is not a crossing of $X$, then to obtain $X^{(i)}$ it is necessary to make the edges of the matching $i$ cross, which can generate at most a new crossing for each of the edges incident to the four vertices of $i$. Then, an upper bound for the variance is given by

\mathrm{Var}(X^{(i)}-X)\leq\mathbb{E}[(X^{(i)}-X)^{2}]\leq 4\Delta^{2}(m-1)^{2}. (4.3)

Then, the contribution to the sum (4.2) of the $2$-matchings such that $V(i)\cap V(j)\neq\emptyset$ is bounded by

\frac{m_{2}(G)^{2}-6m_{4}(G)}{m_{2}(G)^{2}}4\Delta^{2}(m-1)^{2}=\left(1-\frac{6m_{4}(G)}{m_{2}(G)^{2}}\right)4\Delta^{2}(m-1)^{2}.

Case $V(i)\cap V(j)=\emptyset$: In this case, we have that

|\{i,j\in M_{2}(G):V(i)\cap V(j)=\emptyset\}|=\binom{4}{2}m_{4}(G)=6m_{4}(G).

Let $N(i)$ be the set of edges incident to the vertices of the $2$-matching $i$. We further divide the sum over the $2$-matchings with $V(i)\cap V(j)=\emptyset$ according to whether $N(i)\cap N(j)$ is empty. In the case $N(i)\cap N(j)=\emptyset$, the random variables $X^{(i)}-X$ and $X^{(j)}-X$ are independent. Indeed, from the construction, $X^{(i)}-X$ only depends on the edges incident to the $2$-matching $i$, and similarly $X^{(j)}-X$ only depends on the edges incident to the $2$-matching $j$. Hence, in this case we obtain that $\mathrm{Cov}(X^{(i)}-X,X^{(j)}-X)=0$.

So we are interested in the pairs of $2$-matchings such that $V(i)\cap V(j)=\emptyset$ but $N(i)\cap N(j)\neq\emptyset$. An upper bound for the number of such pairs of $2$-matchings is given by $m_{2}(G)(\Delta-1)^{2}(m-4)/2$.

Indeed, in this case there exists at least one edge between $V(i)$ and $V(j)$. So, to obtain such a configuration one may proceed as follows. First, one chooses a $2$-matching $i$, considers one of the $4$ vertices of $i$, say $v$, and looks for a neighbour of $v$, say $w$, which should be in the $2$-matching $j$. There are at most $\Delta-1$ choices for $w$. Now, to construct $j$, we need to find a neighbour of $w$ other than $v$ for one of the edges forming $j$, giving at most $\Delta-1$ possibilities, and another edge which can neither be in $i$ nor contain $w$, giving at most $m-4$ possibilities. Putting this together and accounting for double counting, we obtain the desired $m_{2}(G)(\Delta-1)^{2}(m-4)/2$. See Figure 4 for a diagram explaining this counting.

Figure 4: A generic pair of $2$-matchings $i$ and $j$ joined by an edge through the vertex $w$.

Finally, using the upper bound for the variance given in (4.3), we obtain

\frac{1}{m_{2}(G)^{2}}\sum_{\substack{i,j\in M_{2}(G)\\V(i)\cap V(j)=\emptyset}}\mathrm{Cov}(X^{(i)}-X,X^{(j)}-X)=\frac{1}{m_{2}(G)^{2}}\sum_{\substack{i,j\in M_{2}(G)\\N(i)\cap N(j)\neq\emptyset}}\mathrm{Cov}(X^{(i)}-X,X^{(j)}-X)
\leq\frac{1}{m_{2}(G)^{2}}\sum_{\substack{i,j\in M_{2}(G)\\N(i)\cap N(j)\neq\emptyset}}4\Delta^{2}(m-1)^{2}\leq\frac{1}{m_{2}(G)}2\Delta^{2}(\Delta-1)^{2}(m-1)^{2}(m-4).

Thus, the contribution of the pairs of $2$-matchings with $V(i)\cap V(j)=\emptyset$ to the covariance sum (4.2) is bounded by

\frac{2\Delta^{2}(\Delta-1)^{2}(m-1)^{2}(m-4)}{m_{2}(G)}.

Therefore,

\mathrm{Var}(\mathbb{E}[X^{s}-X|X])\leq 4\Delta^{2}(m-1)^{2}\left(1-\frac{6m_{4}(G)}{m_{2}(G)^{2}}+\frac{(\Delta-1)^{2}(m-4)}{2m_{2}(G)}\right). ∎

4.3 Kolmogorov Distance

Using the previous results, we are in a position to apply Theorem 2 and obtain a bound for the Kolmogorov distance between the (normalized) number of crossings and a standard Gaussian random variable.

Theorem 3.

Let $G$ be a graph with maximum degree $\Delta$, and let $X$ be the number of crossings of a random uniform embedding of $G$. Let $\mu$ and $\sigma^{2}$ be the mean and the variance of $X$. Then, with $W=(X-\mu)/\sigma$,

d_{Kol}(W,Z)\leq\frac{4m_{2}(G)\Delta m}{3\sigma^{2}}\left[\frac{6\Delta m}{\sigma}+\sqrt{1-\frac{6m_{4}(G)}{m_{2}(G)^{2}}+\frac{(\Delta-1)^{2}m}{2m_{2}(G)}}\right],

where $m$ is the number of edges of $G$, $m_{r}(G)$ is the number of $r$-matchings of $G$, and $Z$ is a standard Gaussian random variable.

Proof.

By Lemma 1 we have that $\mu=m_{2}(G)/3$; also, by Lemma 3, $\Psi$ defined in (2.2) is bounded as follows:

\Psi\leq 2\Delta m\sqrt{1-\frac{6m_{4}(G)}{m_{2}(G)^{2}}+\frac{(\Delta-1)^{2}m}{2m_{2}(G)}}.

Then, using Theorem 2 and the fact that $|X^{s}-X|\leq A=2\Delta m$, we obtain

d_{Kol}(W,Z)\leq\frac{6\mu A^{2}}{\sigma^{3}}+\frac{2\mu\Psi}{\sigma^{2}}\leq\frac{8\Delta^{2}m_{2}(G)m^{2}}{\sigma^{3}}+\frac{4m_{2}(G)\Delta m}{3\sigma^{2}}\sqrt{1-\frac{6m_{4}(G)}{m_{2}(G)^{2}}+\frac{(\Delta-1)^{2}m}{2m_{2}(G)}}=\frac{4m_{2}(G)\Delta m}{3\sigma^{2}}\left(\frac{6\Delta m}{\sigma}+\sqrt{1-\frac{6m_{4}(G)}{m_{2}(G)^{2}}+\frac{(\Delta-1)^{2}m}{2m_{2}(G)}}\right). ∎

5 Some Examples

In this section we provide various examples to which Theorem 3 can be applied directly. To illustrate its applicability, we give explicit bounds on the quantities appearing in (1.1).

5.1 Pairing

Let $G$ be a pairing or matching graph on $2n$ vertices, that is, a disjoint union of $n$ copies of $K_{2}$, as in Example 1. In particular, $m=n$, $m_{2}(G)=\binom{n}{2}$ and $m_{4}(G)=\binom{n}{4}$, and the variance is given by

\sigma^{2}=\frac{n(n-1)(n+3)}{45},

which is bigger than $n^{3}/45$ for $n>3$. On the other hand, since $\Delta=1$, we see that, for $n>3$,

1-\frac{6m_{4}(G)}{m_{2}(G)^{2}}+\frac{m(\Delta-1)^{2}}{2m_{2}(G)}=1-\frac{6m_{4}(G)}{m_{2}(G)^{2}}=1-\frac{6\binom{n}{4}}{\binom{n}{2}^{2}}=\frac{4n-6}{n^{2}-n}=\frac{4}{n}-\frac{2}{n^{2}-n}<\frac{4}{n}.

Thus,

d_{Kol}(W,Z)\leq\frac{4\cdot 45n^{3}}{3\cdot 2n^{3}}\left(\frac{6\sqrt{45}\,n}{n^{3/2}}+\frac{2}{\sqrt{n}}\right)\leq\frac{1268}{\sqrt{n}}.
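The numerical constant in the last display can be checked directly (a quick sanity check of the arithmetic, not part of the proof):

```python
import math

# prefactor 4*45/(3*2) = 30, multiplied by (6*sqrt(45) + 2)
constant = (4 * 45) / (3 * 2) * (6 * math.sqrt(45) + 2)
# the product lies just below 1268, so 1268/sqrt(n) is a valid bound
```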

5.2 Path Graph

Let $P_{n}$ be the path graph on $n$ vertices. From Example 2, $m_{2}(P_{n})=\binom{n-2}{2}$ and $m_{4}(P_{n})=\binom{n-4}{4}$, and we obtain that

1-\frac{6m_{4}(P_{n})}{m_{2}(P_{n})^{2}}=1-\frac{6\binom{n-4}{4}}{\binom{n-2}{2}^{2}}=\frac{2(6n^{3}-71n^{2}+289n-402)}{n^{4}-10n^{3}+37n^{2}-60n+36}=\frac{12}{n}+o(n^{-1}).

On the other hand, $\Delta=2$, and then one easily sees that

4m_{2}(P_{n})\Delta m\leq 4n^{3},\qquad 6\Delta m\leq 12n,\quad\text{ and }\quad\frac{m(\Delta-1)^{2}}{2m_{2}(P_{n})}=\frac{n-1}{(n-2)(n-3)}.

Finally, since the variance is $\sigma^{2}=n^{3}/45-n^{2}/18-11n/45+2/3>n^{3}/60$ for $n\geq 14$, we get

d_{Kol}(W,Z)\leq\frac{4\cdot 60n^{3}}{3n^{3}}\left(\frac{12\sqrt{60}\,n}{n^{3/2}}+\frac{4}{\sqrt{n}}\right)\leq\frac{7757}{\sqrt{n}}.

5.3 Cycle Graph

Let $C_{n}$ be the cycle graph on $n$ vertices. In this case $m=n$ and $\Delta=2$, and from Example 3, $m_{2}(C_{n})=\frac{n(n-3)}{2}$ and $m_{4}(C_{n})=\frac{n}{4}\binom{n-5}{3}$, so

1-\frac{6m_{4}(C_{n})}{m_{2}(C_{n})^{2}}+\frac{(\Delta-1)^{2}m}{2m_{2}(C_{n})}=1-\frac{\frac{6n}{4}\binom{n-5}{3}}{\left(\frac{n(n-3)}{2}\right)^{2}}+\frac{n}{n(n-3)}=\frac{13n^{2}-101n+210}{n(n-3)^{2}}\leq\frac{13}{n},\quad\text{for }n\geq 5.

Also, $4m_{2}(C_{n})\Delta m\leq 4n^{3}$ and $6\Delta m=12n$. Since the variance is $\sigma^{2}=n^{3}/45+n^{2}/90-n/3>n^{3}/50$ for $n\geq 15$, we obtain

d_{Kol}(W,Z)\leq\frac{4\cdot 50n^{3}}{3n^{3}}\left(\frac{12\sqrt{50}\,n}{n^{3/2}}+\frac{\sqrt{13}}{\sqrt{n}}\right)\leq\frac{5898}{\sqrt{n}}.

5.4 Disjoint Union of Triangles

Consider $n$ copies of $K_{3}$ and let $G$ be their disjoint union. From Example 4, $G$ is a graph with $3n$ vertices, $m=3n$ edges, maximum degree $\Delta=2$, $m_{2}(G)=3^{2}\binom{n}{2}$ and $m_{4}(G)=3^{4}\binom{n}{4}$, so

1-\frac{6m_{4}(G)}{m_{2}(G)^{2}}+\frac{(\Delta-1)^{2}m}{2m_{2}(G)}=1-\frac{6\cdot 3^{4}\binom{n}{4}}{\left(3^{2}\binom{n}{2}\right)^{2}}+\frac{3n}{2\cdot 3^{2}\binom{n}{2}}=\frac{13n-18}{3n(n-1)}\leq\frac{13}{3n}.

On the other hand, $4m_{2}(G)\Delta m\leq 108n^{3}$ and $6\Delta m=36n$. Finally, since the variance is $\sigma^{2}=3n^{3}/5+3n^{2}/10-9n/10\geq 3n^{3}/5$ for $n\geq 3$, we get

d_{Kol}(W,Z)\leq\frac{108\cdot 5n^{3}}{9n^{3}}\left(\frac{36\sqrt{5}\,n}{\sqrt{3}\,n^{3/2}}+\frac{\sqrt{13}}{\sqrt{2n}}\right)\leq\frac{2942}{\sqrt{n}}.

6 Another Possible Limit

The following example shows that not every sequence of graphs satisfies a central limit theorem for the number of crossings, even if the variance does not vanish, and that having $m_{2}$ going to infinity is not enough. Moreover, it shows that other types of limits are possible for the number of crossings.

Consider the graph $G_{n}$ consisting of a star graph with $n-1$ vertices to which an edge is attached at one of the leaves, as in Figure 5a.

Note that in this case $m_{2}=n-3$ and $m_{4}=0$, and the only other term appearing in (3.2) is $S_{7}$, which for this graph equals $\binom{n-3}{2}$. This implies that

\mathbb{E}(X)=\frac{n-3}{3},\quad\mathbb{E}(X^{2})=\frac{(n-2)(n-3)}{6},

from where $\sigma^{2}=n(n-3)/18$, $1-6m_{4}/m_{2}^{2}=1$ and

\frac{(\Delta-1)^{2}m}{2m_{2}}=\frac{(n-3)^{2}(n-1)}{2(n-3)}=\frac{(n-3)(n-1)}{2}\approx\frac{n^{2}}{2}.

Thus the right-hand side of (1.1) does not approach $0$ as $n\to\infty$.

One can calculate explicitly the probability of having $k$ crossings. Indeed, let us denote by $v_{0}$ the center, by $v_{n}$ the tail (the only vertex at distance $2$ from $v_{0}$), and by $v_{n-1}$ the vertex adjacent to both $v_{0}$ and $v_{n}$. The number of crossings in an embedding of $G_{n}$ depends only on the positions of these three vertices. More precisely, there will be exactly $k$ crossings if the following two conditions are satisfied (see Figure 5b for an example):

  1. There are exactly $k$ and $n-2-k$ vertices in the two arcs that remain when removing $v_{n}$ and $v_{n-1}$.

  2. $v_{0}$ is in the arc with $n-2-k$ vertices.

Figure 5: a) $G_{12}$ drawn without crossings (left). b) $G_{10}$ drawn in convex position with $3$ crossings, formed by the edge $(v_{n-1},v_{n})$ and the edges $(v_{0},v_{i})$, where the $v_{i}$ lie between $v_{n-1}$ and $v_{n}$ in the cyclic order.

By a simple counting argument, since all permutations have the same probability of occurrence, one sees that these conditions are satisfied with probability

\mathbb{P}(X_{n}=k)=\frac{2(n-2-k)}{(n-1)(n-2)},\qquad k=0,\ldots,n-2.
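This distribution can be verified by exhaustive enumeration for small $n$; the sketch below (ours) does so for $n=5$, where $G_{5}$ has center $v_{0}$ joined to $v_{1},v_{2},v_{3}$ and the tail edge $\{v_{3},v_{4}\}$, and the formula predicts $\mathbb{P}(X_{5}=k)=2(3-k)/12$, that is $1/2$, $1/3$, $1/6$ for $k=0,1,2$.

```python
from fractions import Fraction
from itertools import combinations, permutations
from math import factorial

def count_crossings(edges, pos):
    total = 0
    for (a, b), (c, d) in combinations(edges, 2):
        if len({a, b, c, d}) < 4:
            continue
        lo, hi = sorted((pos[a], pos[b]))
        total += (lo < pos[c] < hi) != (lo < pos[d] < hi)
    return total

# G_5: star with center 0 and leaves 1, 2, 3, plus the tail edge {3, 4}
edges = [(0, 1), (0, 2), (0, 3), (3, 4)]
counts = {}
for p in permutations(range(5)):
    k = count_crossings(edges, p)
    counts[k] = counts.get(k, 0) + 1
dist = {k: Fraction(c, factorial(5)) for k, c in counts.items()}
```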

Finally, dividing by $n$, the random variable $Y_{n}=X_{n}/n$ satisfies

\mathbb{P}\left(Y_{n}=\frac{k}{n}\right)=\frac{2(n-2-k)}{(n-1)(n-2)}\approx\frac{2}{n}\left(1-\frac{k}{n}\right),\qquad k=0,\ldots,n-2,

which implies that $Y_{n}\to Y$ weakly, where $Y$ is a random variable supported on $(0,1)$ with density $f_{Y}(x)=2(1-x)$.

Acknowledgement

We would like to thank Professor Goldstein for pointing out the paper [5] and for various communications during the preparation of this paper. OA would like to thank Clemens Huemer for initial discussions on the topic that led to work on this problem.

Santiago Arenas-Velilla was supported by a scholarship from CONACYT.

Octavio Arizmendi was supported by CONACYT Grant CB-2017-2018-A1-S-9764.
This project has received funding from the European Union's Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 734922.

References

  • [1] Octavio Arizmendi, Pilar Cano, and Clemens Huemer. On the number of crossings in a random labelled tree with vertices in convex position. arXiv preprint arXiv:1902.05223, 2019.
  • [2] Louis H.Y. Chen, Larry Goldstein, and Qi-Man Shao. Normal Approximation by Stein’s Method. Springer Berlin Heidelberg, 2011.
  • [3] Valentin Féray. Weighted dependency graphs. Electronic Journal of Probability, 23:1 – 65, 2018.
  • [4] Philippe Flajolet and Marc Noy. Analytic combinatorics of chord diagrams. In Formal Power Series and Algebraic Combinatorics, pages 191–201, Berlin, Heidelberg, 2000. Springer Berlin Heidelberg.
  • [5] J.E. Paguyo. Convergence rates of limit theorems in random chord diagrams. arXiv preprint arXiv:2104.01134, 2021.
  • [6] Nathan Ross. Fundamentals of Stein’s method. Probability Surveys, 8:210 – 293, 2011.