Crossings in Randomly Embedded Graphs
Abstract
We consider the number of crossings in a graph which is embedded randomly on a convex set of points. We give an estimate to the normal distribution in Kolmogorov distance which implies a convergence rate of order for various families of graphs, including random chord diagrams or full cycles.
keywords: crossings, random graphs, normal approximation.
1 Introduction
Let be a graph with vertex set and edge set , which is embedded randomly in a convex set of points. We are interested in the random variable counting the number of crossings under this embedding.
Formally, for a graph with vertex set , an embedding given by the permutation , is the graph isomorphism induced by the permutation . The crossings of such embedding are given by the set . Figure 1 shows graphical representation of a couple embeddings of a path graph . The first one having crossing and the second one having crossings.


To our best of our knowledge there is not much work about general graphs. The paper by Flajolet and Noy [4] considers the case where is a union of disjoint edges (is called a matching, a pairing or a chord diagram) and proves a central limit theorem. This result is also proved with the use of weighted dependency graphs, in [3]. More important to us the recent paper by Paguyo [5] gives a rate of convergence in that case. Another related paper is [1], where the authors consider a uniform random tree.
In this paper, we will show that under some asymptotic behaviour of very precise combinatorial quantities of the graph, the random variable counting the number of crossings in a random embedding approximates a normal distribution with mean and variance which can be calculated precisely (see Lemmas 1 and 2). Moreover, we give a convergence rate in this limit theorem.
Theorem 1.
Let be a graph with maximum degree , and let be the number of crossings of a random uniform embedding of . Let and the mean and the variance of . Then, with ,
(1.1) |
where is the number of edges of , is the number of -matching of , and is a standard Gaussian random variable.
Examples of families of graphs that satisfy such a normal approximation, with a rate proportional to are pairings, cycle graphs, path graphs, union of triangles, among others. We explain in detail these examples in Section 4.
We should mention that our method of proof resembles the one used by Paguyo in [5], for the case of pairings. The main idea is to write the number of crossing as a sum of indicators variables and then consider the size biased transform in the case of sums of Bernoulli variables. However, there is a crucial difference between Paguyo’s way to write such variables and how we do it, which in our opinion is more flexible. To be precise, Paguyo considers for each four points in the circle the indicator that there is a crossing formed by the edges and . This random variable is easy to handle for the case of a pairing but for a general given graph, even calculating the probability of such indicator to be can be very complicated. Our approach instead looks at a given -matching in the graph and consider the indicator random variable of this -matching, when embedded randomly, to form a crossing.
2 Preliminaries
In this section we establish some notations for graphs and remind the reader about the main tool that we will use to quantify convergence to a normal distribution: the size bias transform.
2.1 Notation and Definitions on Graphs
A graph is pair , where . Elements in are called vertices and elements in are called edges. An edge is sometimes written as or if the underlying graph is clear. The number or vertices , will be denoted by , while the number of edges, , will be denoted be .
For a vertex , we say that is a neighbour of , if . The number of neighbours of is called the degree of , denoted by . The largest degree among all vertices in a graph will be denoted by .
A subgraph of , is a graph, , such that and . We say that and are isomorphic if there exist a bijection such that if and only if , for all .
An -matching in a graph is a set of edges in , no two of which have a vertex in common. We denote by the set -matchings of and by their cardinality. Note the corresponds to the number of edges of the graph .
2.2 Size Bias Transform
Let be a positive random variable with mean finite. We say that the random variable has the size bias distribution with respect to if for all such that , we have
In the case of , with ’s positive random variables with finite mean , there is a recipe to construct (Proposition 3.21 from [6]) from the individual size bias distributions of the summands :
-
1.
For each , let having the size bias distribution with respect to , independent of the vector and . Given , define the vector to have the distribution of conditional to .
-
2.
Choose a random index with , where , independent of all else.
-
3.
Define .
It is important to notice that the random variables are not necessarily independent or have the same distribution. Also, can be an infinite sum (See Proposition 2.2 from [2]).
If is a Bernoulli random variable, we have that . Indeed, if , and then
Therefore, we have the following corollary (Corollary 3.24 from [6]) by specializing the above recipe.
Corollary 1.
Let be Bernoulli random variables with parameter . For each let having the distribution of conditional on . If , , and is chosen independent of all else with , then has the size bias distribution of .
The following result (Theorem 5.3 from [2]) gives us bounds for the Kolmogorov distance, in the case that a bounded size bias coupling exists. This distance is given by
where and are the distribution functions of the random variables and .
Theorem 2.
Let be a non negative random variable with finite mean and finite, positive variance , and suppose , have the size bias distribution of , may be coupled to so that , for some . Then with ,
(2.1) |
where is a standard Gaussian random variable, and is given by
(2.2) |
3 Mean and Variance
Let be the set of permutation of elements. For a permutation , and a graph , let be the graph whose edges are given by
For a random uniform permutation , let be the random variable that counts the number of crossings of , that is
(3.1) |
where is the set of 2-matching of .
In this section we give a formula for the mean and variance of the random variable in terms of the number of subgraphs of certain type.
Lemma 1.
For a graph , if denote the number of crossings in a random embedding on a set of points in convex position, then its expectation is given by
where denotes the number of 2-matching of .
Proof.
For each , we notice that . Indeed, if consist of two edges and , the probability to obtain a crossing only depends on the cyclic orders in which and are embedded in , not in the precise position of them. From the possible orders, only of them yield a crossing. See Figure 2, for the possible cyclic orders of and .
Summing over all , the expected value of is
as desired. ∎
For the the second moment it is necessary to expand to get
The analysis for is significantly more complicated, since depends of how many edges and vertices the two -matchings, and , share. Thus the previous sum can be divided in types, depending of how the -matchings, and , share edges and vertices as is shown in Figure 3. We call such different configuration the “kind of pair of -matching” .
The probabilities of that both and are crossing for each type of double -matching are the following (with the obvious abuse of notation):
Thus, we arrive to the following.
Lemma 2.
The second moment of is given by the formula,
(3.2) |
where is the number of subgraphs of of type .
Before proving our main result we will apply the above lemmas for a few examples.
Example 1 (Pairing).
Consider to be a disjoint union of copies of graphs. The expectation is given by
For the variance, we only need to consider, , and , since the other types of subgraphs are not present in . The number of -matchings is given by , since any choice of different edges is an -matching, then we obtain
and thus the variance is
Example 2 (Path).
Let be the path graph on vertices. In this case, the number of -matchings is , so we obtain that
Then,
and thus the variance is given by
Example 3 (Cycle).
Let be the cycle graph on vertices. The number of -matchings is given by , then
On the other hand, the number of subgraphs is given by
from where the second moment is
and thus the variance is given by
Example 4 (Triangles).
Let be the disjoint union of copies of . In this case, the subgraphs of type and are not present in . In order to obtain an -matching for , we need to choose triangles and for each one we have 3 options to form the matching, so the number of -matching is . Similarly, the number of other types of subgraphs is given by
Then, the expectation and the second moment are
and thus the variance is
4 Proof of the Main Theorem
4.1 Construction of Size Bias Transform
Let be a fixed permutation and let be a random index chosen with probability , which corresponds to a 2-matching (in this way are edges), and which is independent of . For said fixed , we have a fit . We construct as follows
-
•
If is such that has a crossing at , we define .
-
•
Otherwise, we perform the following process to obtain a permutation with a cross on . Suppose and , then under these edges are and . Since they do not cross, without loss of generality we can assume that satisfies. Now, we choose a random vertex uniformly among the vertices of the edges of . In case the vertex is or , we leave these fixed and swap the positions between and and define as the resulting permutation. In case of choosing or , we do the same process leaving these vertices fixed and exchanging and . In this way we obtain a permutation such that it has a crossing at
Note that is a uniform random permutation conditional on the event that ’s and ’s are in alternating in the cyclic order. This in turn means that , is a uniform random embedding conditioned on the event that the event has a crossing.
In summary, we obtain that
satisfies has the distribution of conditional on . Then, by Corollary 1, we get that , has the size bias distribution of .
Define as the size bias transform of . By construction, . Indeed, because in the worst case each of the edges incident to each vertex creates a new crossing.
4.2 Bounding the Variance of
In order to use Theorem 2 we need a bound for the variance of a conditional expectation, which depends of and its size bias transform . A bound for that variance is given in the next lemma, which is one of the main results in this paper.
Lemma 3.
Let a graph with maximum degree , and let be the number of crossings of a random uniform embedding of . Then
(4.1) |
where is the size bias transform of , is the number of edges of the graph and is the number of -matchings of the graph .
Proof.
Note that
where denote conditioned to have a crossing in the index . This gives that
(4.2) |
We identify two kinds of terms in the summation of the covariance, when the indices satisfy or when they satisfy , where denote the set of vertices of the 2-matching .
Case : In this case, we have that
From the construction of the size bias transform, we have that . Indeed, if have a crossing in the matching , there are two options for the matching in , that is, it is a crossing or not. If is a crossing, then , because we don’t need to crossing any edges. On the other hand, if is not a crossing of , then to obtain it is necessary crossing the edges of the matching , which can be generate at least a new crossing for each of the edges incidents to the four vertices of . Then, an upper bound for the variance given by
Case : In this case, we have that
Let be the set of edges incidents to the vertices of the 2-matching . We can divide the sum over the 2-matching with . In the case of , we obtain that the random variables and are independent. Indeed, from the construction only depends on the edges incident to the -matching , and similarly only depends on the edges incident to the -matching . Hence, in this case we obtain that .
So we are interested in the pairs of - matchings such that , but . An upper bound for the number of such pairs of -matchings is given by .
Indeed, in this case, there exists at least one edge between and . So, to obtain such configuration one may proceed as follows. First, one chooses a -matching, , and one considers one of the vertices in , say , and looks for a neighbour of , say , which should be in the -matching . There are at most choices for . Now, to construct , we need to find a neighbour of which is not for one of the edges forming , giving at most possibilities and another edge which cannot be in , or contain , giving at most possibilities. Putting this together and considering double counting, we obtain the desired . See Figure 4 for a diagram explaining this counting.
Finally, using the upper bound for the variance given in (4.3), we obtain
Thus, the contribution of the pairs of -matchings such that in the covariance sum (4.2) is bounded by
Therefore,
∎
4.3 Kolmogorov Distance
Using the previous results, we are in position to apply Theorem 2. Therefore, we can obtain a bound for the Kolmogorov distance of the (normalized) number of crossings and a standard Gaussian random variable.
Theorem 3.
Let be a graph with maximum degree , and let be the number of crossings of a random uniform embedding of . Let and the mean and the variance of . Then, with ,
where is the number of edges of , is the number of -matchings of , and is a standard Gaussian random variable.
5 Some Examples
In this section we provide various examples for which Theorem 3 can be applied directly. To show its easy applicability, we give explicit bounds on the quantities appearing in (1.1).
5.1 Pairing
Let be a pairing or matching graph on vertices, that is, a disjoint union of copies of , as in Example 1. In particular, , and and the variance is given by
which is bigger that for . On the other hand, since , we see that, for ,
Thus,
5.2 Path Graph
Let be the path graph on vertices. From Example 2, and , and we obtain that
On the other hand, , and then, one easily sees that,
Finally, since the variance is given by , for , we get
5.3 Cycle Graph
Let be the cycle graph on vertices. In this case and , and from Example 3, and , then
Also, , and . Since the variance is for , we obtain
5.4 Disjoint Union of Triangles
Consider copies of and let be the disjoint union of them. From Example 4 we have that is a graph with vertices, edges, maximum degree , and , then
On the other hand, we can obtain that, and . Finally, since the variance is for , then we get
6 Another Possible Limit
The following shows that not every sequence of graphs satisfies a central limit theorem for the number of crossings, even if the variance is not always and that having going to infinity is not enough. Moreover, it shows that we can have another type of limit for the number of crossings.
Consider the graph which consists of a star graph with vertices for which an edge is attached at one of the leaves, as in Figure 5a.
Note that in this case and and the only other term appearing in (3.2) is , which for this graph equals . This implies that
from where , and
Thus the right hand of (1.1) does not approximate as .
One can calculate explicitly the probability of having crossings. Indeed, lets us denote by is the center and by , the tail (the only vertex at distance 2 from ) and by the vertex which has and . The number of crossings in an embedding of depends only on the position of this three vertices. More precisely, there will be exactly crossing if the following two conditions are satisfied (see Figure 5b for an example):
-
1.
There are exactly and vertices in the two arcs that remain when removing and .
-
2.
is in the arc with vertices.
By a simple counting argument, since all permutations have the same probability of occurrence, one sees that such conditions are be satisfied with probability
Finally, dividing by , the random variable satisfies that
which implies that , weakly, where is a random variable supported on with density
Acknowledgement
We would like to thank Professor Goldstein for pointing out the paper [5] and for various communications during the preparation of this paper. OA would like to thank Clemens Huemer for initial discussions on the topic that led to work in this problem.
Santiago Arenas-Velilla was supported by a scholarship from CONACYT.
Octavio Arizmendi was supported by CONACYT Grant CB-2017-2018-A1-S-9764.
This project has received funding from the European Union’s Horizon 2020 research and innovation programme under the Marie Skłodowska-Curie grant agreement No 734922.
References
- [1] Octavio Arizmendi, Pilar Cano, and Clemens Huemer. On the number of crossings in a random labelled tree with vertices in convex position. arXiv preprint arXiv:1902.05223, 2019.
- [2] Louis H.Y. Chen, Larry Goldstein, and Qi-Man Shao. Normal Approximation by Stein’s Method. Springer Berlin Heidelberg, 2011.
- [3] Valentin Féray. Weighted dependency graphs. Electronic Journal of Probability, 23:1 – 65, 2018.
- [4] Philippe Flajolet and Marc Noy. Analytic combinatorics of chord diagrams. In Formal Power Series and Algebraic Combinatorics, pages 191–201, Berlin, Heidelberg, 2000. Springer Berlin Heidelberg.
- [5] J.E. Paguyo. Convergence rates of limit theorems in random chord diagrams. arXiv preprint arXiv:2104.01134, 2021.
- [6] Nathan Ross. Fundamentals of Stein’s method. Probability Surveys, 8:210 – 293, 2011.