Cycles of given lengths in unicyclic components in sparse random graphs
Abstract.
Let be subset of and let be the number of cycles belonging to unicyclic components whose length is in in the random graph .
We find the limiting distribution of in the subcritical regime with and the critical regime with . Depending on the regime and a condition involving the series
, we obtain in the limit either a Poisson or a normal distribution as .
MSC: Primary 05A16, 01C80; Secondary 05C10
Keywords: Random graphs, random variables, analytic combinatorics.
1. Introduction
A graph is unicyclic if it is connected and has a unique cycle. We say that a cycle in a graph is isolated if it is the unique cycle in a unicyclic connected component. Let be the random graph with vertices and exactly edges drawn uniformly at random from the set of possible edges. This is the model introduced in the seminal paper of Erdős and Rényi [3], in which each graph has the same probability
We are interested in the number of isolated cycles in whose lengths are restricted to take certain values. More precisely, let and a subset of . We denote by the random variable equal to the number of isolated cycles in whose lengths lie in . Our main result gives the limiting distribution of for various values of , corresponding to the so-called subcritical and critical regimes. Depending on the regime and a condition involving the generating function , we obtain in the limit as either a Poisson or a normal distribution.
The number of cycles in has been studied since the appearance of [3]. When , Erdős and Rényi showed [3, Theorem 3b] that the number of cycles of length converges to a Poisson law with parameter . Let be the random variable equal to the number of isolated cycles in . When and , asymptotically almost surely (that is, with probability tending to 1 as ) all cycles are isolated. As a consequence we have
We next recall the different regimes for sparse random graphs (see for instance [7, 1]). The following results hold asymptotically almost surely (shortened to a.a.s.).
-
•
Subcritical regime. When with , the connected components of are either trees or unicyclic graphs.
-
•
Barely subcritical regime. When with and ,
-
•
Critical regime. This is when and . In this regime the connected components of are trees, unicyclic graphs, and complex components. A complex component is obtained from a connected cubic multigraph by performing the following operations: first replace edges in by induced paths of any length so that to obtain a simple graph , and then attach rooted trees to the vertices of .
-
•
Supercritical regime. When with , there exists a unique component of linear size and the remaining components are either trees or unicyclic graphs. The ‘Symmetry principle’ (see [7, Section 5.6]) says that in this case in some sense ‘looks like’ a subcritical random graph with suitable parameters.
In the barely subcritical regime Kolchin showed that if , then the normalized random variable tends in distribution to a Gaussian law (see [8, Theorem 1.1.15]). In the critical regime, Flajolet, Knuth and Pittel [4, Corollary 6] showed that . By the so-called symmetry property [7, Theorem 5.24], properly normalized should also be Gaussian when and with .
Some results have been obtained fixing a set of positive integers as possible cycle lengths. Following [4], define an -cycle as an isolated cycle whose length is in . Let be the number of -cycles in . It is shown in [4, Corollary 7] that if , then the probability that a graph (or multigraph) with vertices and edges has no -cycle is equal to
(1) |
Our results concern the distribution of the random variables . In particular, we obtain full limiting distributions both in the subcritical and the critical regimes.
Theorem 1.1.
Let and set , considered as a function of one complex variable in the unit disk . Let be the random variable equal to the number of -cycles in . Then the following holds:
-
(A)
(Subcritical regime). Let be such that and . Then
(2) -
(B)
(Barely subcritical regime). Let with and . Then two situations may happen: if , then
(3) Otherwise, if , then
(4) -
(C)
(Critical regime). Let , with . Let be the unique positive solution of . Then two situations may happen: if , then
(5) Otherwise, if , then
(6)
Points (A), (B) and (C) in Theorem 1.1 are the contents of Theorems 3.1, 3.2 and 3.4 given in the Section 3. We remark that in the previous statement there is no discontinuity between equations (2)–(3)–(5) and equations (4)–(6): the Taylor expansion of the term in the statement for the critical regime is equal to , which coincides with the term in the barely subcritical region.
The proofs are based on estimating coefficients of generating functions by means of Cauchy integrals along suitable contours and applying the saddle-point method.
Remarks. Observe that (1) follows directly from (2). Let us mention that technical refinements of our techniques would provide similar results for the region just before the supercritical regime, namely when , . We do not include the analysis of this region because the computations become too involved.
Finally, one may wonder why in the previous theorem we do not have a corresponding result for the supercritical regime. The reason is that in this case our techniques, based on the detailed structure of together with saddle-point estimates for the associated generating functions, do not apply in this situation. Given the Symmetry principle mentioned above, one should expect the number of -cycles in the supercritical regime follows a limit Poisson law as in the subcritical regime, but the tools provided by the Symmetry principle do no seem precise enough to prove such a statement.
2. Preliminaries and notation
All graphs considered in this paper are labelled. The size of a graph is the number of vertices. The excess of a graph is the number of vertices minus the number of edges. In the excess is .
2.1. Analytic combinatorics of graphs
We use the language of analytic combinatorics as in [5]. Given a generating function , we write . If and , we write if there exists such that for . All the generating functions that appear in this work are exponential generating functions of the form , or EGF for short (see [5, Chapter 2]).
We denote by and the EGF of rooted and unrooted labelled trees, respectively. It is well known that
(7) |
The EGF of unicyclic graphs (connected graphs with vertices and edges) is given by (see for instance [6, Equation (3.5)])
(8) |
We write , so that .
2.2. From Poisson parametrizations to central limit theorems
We include the following result by Kolchin that provides an approximation to a normal law by a Poisson parametrization.
Theorem 2.1 ([8, Theorem 1.1.15]).
Let . If as then
3. Proof of Theorem 1.1
We present separately the proof for each regime in Theorem 1.1. The main idea in all proofs is to encode the typical structure of random graphs in the regime under consideration using generating functions and then obtain large power estimates by means of saddle point bounds.
3.1. Subcritical regime
In this regime, the connected components of are a.a.s. a set of acyclic graphs (a forest) together with a set of unicyclic graphs. We exploit this property in order to get the following result which refines the first statement in Theorem 1.1:
Theorem 3.1.
Let such that , and . Let and . Then the random variable equal to the number of -cycles satisfies
Moreover, if as then
Proof.
It suffices to consider graphs whose connected components are trees and unicyclic graphs. Using the symbolic method we obtain that the probability that contains exactly unicyclic components containing an -cycle is equal to
(9) |
The term encodes the components containing an -cycle, while the term encodes the rest of unicyclic components (whose lengths do not belong to ). Using Cauchy integral’s formula we get
(10) | |||
After the change of variables , it becomes
(11) |
where
(12) | |||||
(13) |
Note that the function given by (13) is exactly the same as [2, Equation (30)], which satisfies the conditions . In the range with , we can apply saddle-point methods by choosing a circular path as the contour of integration. As shown in [4], we split the integral in (11) into three parts, namely . It suffices to integrate from to , for a convenient value of , because the remaining integrals can be bounded by the magnitude of the central integrand. Following the proof of [2, Theorem 3.2] and choosing (so that but as ) we have
(14) |
and for all choices of in we have
(15) |
As in the vicinity of , we have
(16) |
and
(17) |
for fixed . Using expansions (14), (16), (17) and the bound (15) we have
where . If we set the integral in the above equation becomes
(18) |
Observe that and the estimate (18) is a real number (because (10) is a real number). Hence (18) is equal to
It follows that
That is
(19) |
Using Stirling’s formula for the corresponding range of , we have
(20) |
Multiplying (19) and (20), after cancellations we obtain
This proves the first part of the theorem.
Now, suppose that as . The previous arguments work in a similar way. Instead of using the estimate (17), which is only valid when is small enough, we exploit the fact that has non-negative Taylor coefficients. Hence, Equation (17) can be replaced by the relation
which is valid for each choice of . Applying the same arguments as before and that for large , we conclude that
for a suitable constant . The second result in the theorem follows from the fact that , and hence is bounded. ∎
3.2. Barely subcritical regime
In the barely subcritical regime the asymptotic structure of is the same as in the subcritical regime. However, the integration countour we use is slightly more complicated in order to encode cycles of arbitrary length.
Theorem 3.2.
Let with tending to infinity with . Let and . Then the random variable equal to the number of -cycles satisfies
(21) |
Assume moreover that . Then for fixed real numbers
(22) |
Proof.
The arguments and notation are similar to the ones in the proof of Theorem 3.1. As mentioned in the proof of Theorem 3.1, a.a.s. in this regime contains only trees and unicyclic graphs as components. We need estimates for (9) in this new range of . We use again the same methods as in the proof of [2, Theorem 3.2]. Let
Then and as . The expansion of in the vicinity of is
(23) |
For , , the expansion of in the vicinity of is
(24) |
The integrand can be bounded on because
(25) |
Combining (23), (24) and (25), we have
We set and the integral becomes
After simple algebraic manipulations as in the proof of Theorem 3.1 we obtain
This proves the first part of the theorem.
We assume now that . Set with . We can apply Theorem 2.1 and obtain
The central limit theorem for follows, that is, for fixed real we have
∎
3.3. Critical regime
In this regime we have to take into account the appearance of complex components. Let be the probability that has a total excess with exactly unicyclic components containing an -cycle. The following lemma gives an estimate for .
Lemma 3.3.
Let with . Let be the positive solution to . Let
which satisfies .
Then for fixed we have
where
Moreover, for large enough there exist absolute constants and such that
(26) |
Proof.
The proof is based on analytic techniques introduced in [4] and [6]; see also [9]. The probability is given by
(27) |
where is the EGF of complex components with total excess given by [6, Equation (6.8)]. As shown in [6], when , the series can be approximated [6, Equation (6.8)] by , where
and the error term is of order . In order to evaluate (27) we have to compute the expression
(28) |
where
(29) | |||||
(30) |
We remark the difference between and the function defined in Equation (13). Note also that is exactly the same as in [6, Equation (10.12)], which satisfies and also if . We now follow the method of the proof of [6, Lemma 3] in order to compute our integral by choosing as path of integration
(31) |
where is the unique positive solution of , and belongs to the interval
Given that
(32) |
as long as , our choice ensures that the terms in (32) can be moved out of the integral. By following the proof of [6, Equation (10.1) of Lemma 3] we obtain that, for fixed values of
(33) |
Since in the terms above we have
This proves the first statement of the theorem.
Next let us assume that . We know that (see for instance [6, Lemma 4]). From (27) we have
Then, we obtain
(34) |
with is defined by (30). In this case we take as contour of integration the circle with . On this circle, since , for some constant and function with , we have
Note that . Then for large enough
(35) |
Using Stirling’s formula we find that
and for , we have
(36) |
Combining (35) and (36) in (34), we deduce that
for some constant . Since , when we deduce
∎
We can now proof the main result in the critical regime.
Theorem 3.4.
Let where . Let be the positive solution to . Let and let . Then the random variable equal -cycles in satisfies
(37) |
Moreover, assume that . Then, for each choice of real
(38) |
Proof.
By Lemma 3.3 and the dominated convergence theorem, is equal to
For Janson, Knuth, Łuczak and Pittel [6, Equation (13.17) and Corollary p. 61] have shown that the probability that has total excess is asymptotically , and that the -th moment of the excess satisfies . Hence . This shows relation (37).
4. Acknowledgments
Part of this research was done when the J.R. was visiting IRIF – Université Paris Denis Diderot in July 2018. J.R. thanks the hospitality of the institution during his research stay. V.R. was partially supported by grants ANR 2010 BLAN 0204 (Magnum). V.R. and J.R. were supported by the Project PHC Procope ID-57134837 ‘Analytic, probabilistic and geometric Methods for Random Constrained graphs’. J.R. was partially supported by the FP7-PEOPLE-2013-CIG project CountGraph (ref. 630749). Finally, M.N. and J.R. were supported by the Spanish project MTM2017-82166-P and by the H2020-MSCA-RISE-2020 project RandNET (ref. 101007705).
References
- [1] N. Alon and J. H. Spencer. The probabilistic method. Wiley-Interscience Series in Discrete Mathematics and Optimization. Third edition.
- [2] H. Daudé and V. Ravelomanana. Random 2-xorsat phase transition. Algorithmica, Special issue of LATIN 2008:1–18, 2009.
- [3] P. Erdős and A. Rényi. On the evolution of random graphs. Publ. Math. Inst. Hung. Acad. Sci, 5:17–61, 1960.
- [4] P. Flajolet, D. E. Knuth, and B. Pittel. The first cycles in an evolving graph. Discrete Mathematics, 75(1-3):167–215, 1989.
- [5] P. Flajolet and B. Sedgewick. Analytic Combinatorics. Cambridge University Press, 2009.
- [6] S. Janson, D. E. Knuth, T. Łuczak, and B. Pittel. The Birth of the Giant Component. Random Structures and Algorithms, 4(3):233–358, 1993.
- [7] S. Janson, T. Łuczak, and A. Ruciński. Random Graphs. Wiley-Interscience, 2000.
- [8] V. F. Kolchin. Random Graphs. Cambridge University Press, 1998.
- [9] Marc Noy, Vlady Ravelomanana, and Juanjo Rué. On the probability of planarity of a random graph near the critical point. Proc. Amer. Math. Soc., 143(3):925–936, 2015.