Rainbow spanning trees in randomly coloured $G_{k-out}$

Deepak Bal, Alan Frieze, and Paweł Prałat

Department of Mathematics, Montclair State University, Montclair NJ 07043, USA; e-mail: [email protected]

Department of Mathematical Sciences, Carnegie Mellon University, Pittsburgh PA 15213, USA; e-mail: [email protected]; research supported in part by NSF grant DMS1952285.

Department of Mathematics, Toronto Metropolitan University, Toronto, ON, Canada; e-mail: [email protected]; research supported in part by NSERC grant; part of this work was done while the author was visiting the Simons Institute for the Theory of Computing.

Abstract

Given a graph $G=(V,E)$ on $n$ vertices and an assignment of colours to its edges, a set of edges $S\subseteq E$ is said to be rainbow if edges from $S$ have pairwise different colours assigned to them. In this paper, we investigate rainbow spanning trees in randomly coloured random $G_{k-out}$ graphs.

1 Introduction

Let $G=(V,E)$ be a graph in which the edges are coloured. A set $S\subseteq E$ is said to be rainbow coloured if each edge of $S$ is in a different colour. There is by now a large body of research on the existence of rainbow structures in randomly coloured random graphs. Let us highlight a few selected results. Frieze and McKay [14] and Bal, Bennett, Frieze and Prałat [1] studied the existence of rainbow spanning trees in $G_{n,m}$ , the classical Erdős–Rényi random graph process. Cooper and Frieze [6], Frieze and Loh [13] and Ferber and Krivelevich [10] studied the existence of rainbow Hamilton cycles in $G_{n,m}$ . Janson and Wormald [15] studied the existence of rainbow Hamilton cycles in random regular graphs. Finally, Bal, Bennett, Pérez-Giménez and Prałat [2] investigated rainbow perfect matchings and Hamilton cycles in random geometric graphs. Of the most popular random graph models, what is missing here is the random multigraph $G_{k-out}$ . The aim of this paper is to initiate the study of these problems in the context of randomly coloured $G_{k-out}$ graphs.

All asymptotics throughout are as $n\to\infty$ (we emphasize that the notations $o(\cdot)$ and $O(\cdot)$ refer to functions of $n$ , not necessarily positive, whose growth is bounded). We say that an event in a probability space holds with high probability (or w.h.p.) if the probability that it holds tends to one as $n\to\infty$ . We often write $G_{k-out}$ when we mean a graph drawn from the distribution $G_{k-out}$ .

The random graph $G_{k-out}=G_{k-out}(n)$ is defined as follows. It has vertex set $[n]:=\{1,\ldots,n\}$ and each vertex $i\in[n]$ independently chooses $k$ random distinct neighbours from $[n]\setminus\{i\}$ , so that each of the $\binom{n-1}{k}$ sets is equally likely to be chosen. It was shown by Fenner and Frieze [8] that $G_{k-out}$ is $k$ -connected w.h.p. for $k\geq 2$ . It was shown by Frieze [11] that $G_{2-out}$ has a perfect matching w.h.p., and by Bohman and Frieze [3] that $G_{3-out}$ is Hamiltonian w.h.p. All of the above results are sharp. For more details we direct the reader to Chapter 18 in [12].
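The definition above can be illustrated with a short sampler. This is only a minimal sketch: the function name `sample_k_out` and the representation of the graph as a list of ordered pairs (chooser, chosen) are assumptions made for this illustration and do not come from the paper.

```python
import random


def sample_k_out(n, k, rng=None):
    """Sample the choices of G_{k-out} on vertex set {1, ..., n}.

    Returns a list of kn ordered pairs (i, j), meaning that vertex i chose j.
    Hypothetical helper for illustration only.
    """
    rng = rng or random.Random()
    edges = []
    for i in range(1, n + 1):
        others = [j for j in range(1, n + 1) if j != i]
        # random.sample draws k distinct elements, uniformly over all k-subsets.
        for j in rng.sample(others, k):
            edges.append((i, j))
    return edges
```

Two vertices may choose each other, so the pairs (i, j) and (j, i) can both be present; this is why the model is treated as a multigraph.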

We define the randomly coloured graph $G_{k,q}=G_{k,q}(n)$ (not to be confused with $G_{n,m}$ ) as follows: the underlying graph on $n$ vertices is $G_{k-out}$ and (i) there is a set $Q$ of $q$ colours, (ii) each colour appears $\rho:={\left\lfloor kn/q\right\rfloor}$ or $\rho+1$ times (there are $kn-q\rho$ popular colours that appear $\rho+1$ times, the remaining colours are unpopular and appear $\rho$ times; note that if $q$ divides $kn$ , then all colours are unpopular), (iii) $kn$ colours, including repetitions, are randomly assigned to the $kn$ edges of $G_{k-out}$ . Finally, let us note that, without loss of generality, we may assume that $q\leq kn$ . Indeed, if the number of colours is more than $kn$ , then some colours are not used at all and the problem is equivalent to the one with $q=kn$ .
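As a sanity check on conditions (i)–(iii), the balanced colouring can be sketched as follows. The helper names `balanced_colour_multiset` and `random_colouring`, and the convention that colours are the integers 0, ..., q-1 with the popular colours listed first, are assumptions of this sketch.

```python
import random


def balanced_colour_multiset(num_edges, q):
    """List of num_edges colours in which each colour appears rho or rho + 1 times."""
    # rho = floor(num_edges / q); the remainder num_edges - q * rho counts popular colours.
    rho, popular = divmod(num_edges, q)
    colours = []
    for c in range(q):
        copies = rho + 1 if c < popular else rho
        colours.extend([c] * copies)
    return colours


def random_colouring(num_edges, q, rng=None):
    """Assign the balanced multiset of colours to edges 0..num_edges-1 uniformly at random."""
    rng = rng or random.Random()
    colours = balanced_colour_multiset(num_edges, q)
    rng.shuffle(colours)  # position e of the list holds the colour of edge e
    return colours
```

For example, with $n=10$, $k=2$ and $q=n-1=9$ there are 20 edges, $\rho=2$, two popular colours used three times, and seven unpopular colours used twice.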

In this paper we investigate spanning trees. We will prove the following theorem.

Theorem 1.

If $k\geq 2$ and $q\geq n-1$ , then $G_{k,q}$ has a Rainbow Spanning Tree (RST) w.h.p.

The result is best possible. Trivially, if $q\leq n-2$ , then there are not enough colours to create a rainbow tree. If $k=1$ , then $G_{k,q}$ is disconnected w.h.p. [8].

2 Preliminaries

2.1 Colour Monotonicity

In our problem, we randomly colour $kn$ edges of $G_{k,q}$ with $q$ colours and we aim to create a rainbow structure. Recall that there are $q_{2}=kn-q\rho$ popular colours that are present $\rho+1$ times and $q_{1}=q-q_{2}$ unpopular colours that are present $\rho$ times.

It is natural to expect that the more colours are available, the easier it is to achieve our goal. We prove this monotonicity property in the following, slightly broader, context. Suppose that we are given a finite set $X$ and a set of colours $C$ where $|C|=|X|$ . (In our application, $X$ is the set of $kn$ edges of $G_{k,q}$ and $C$ is the set of $kn$ colours: $q$ colours from set $Q$ , including repetitions.) We also have two distinct partitions of $C$ : $\mathcal{C}=\{C_{1},\ldots,C_{q}\}$ and $\mathcal{\widehat{C}}=\{\widehat{C}_{1},\ldots,\widehat{C}_{q+1}\}$ , for some positive integer $q\leq|X|-1$ . (In our application, each part corresponds to a colour from set $Q$ . Partitions $\mathcal{C}$ and $\mathcal{\widehat{C}}$ correspond to colourings with $q$ and $q+1$ colours respectively.) Let $\rho={\left\lfloor|X|/q\right\rfloor}$ , $\widehat{\rho}={\left\lfloor|X|/(q+1)\right\rfloor}$ , $q_{2}=|X|-q\rho$ , and $q_{1}=q-q_{2}$ . Suppose that $|C_{i}|=\rho$ for $1\leq i\leq q_{1}$ and $|C_{i}|=\rho+1$ for $q_{1}+1\leq i\leq q$ , that is, there are $q_{1}$ parts in $\mathcal{C}$ of size $\rho$ and $q_{2}$ parts of size $\rho+1$ .

We are given a collection of sets $X_{1},X_{2},\ldots$ , each set $X_{i}$ being a subset of $X$ . (In our application, the $X_{i}$ are the edges of spanning trees, perfect matchings, Hamilton cycles, etc.) Our goal is to create at least one rainbow set from this collection. Let us consider a random colouring of $X$ via a random bijection from $C$ to $X$ . To show that the probability that some $X_{i}$ is rainbow in a random colouring with $q+1$ colours is at least the corresponding probability when elements of $X$ are coloured with $q$ colours, we couple the two partitions. To do so, we consider two cases.

Case 1: $\widehat{\rho}=\rho$ . Partition $\mathcal{\widehat{C}}$ is obtained from $\mathcal{C}$ by choosing $\rho$ parts in $\mathcal{C}$ of size $\rho+1$ and replacing them with $\rho+1$ parts of size $\rho$ (see Figure 1). We couple the two colourings by first randomly mapping the $|C|-\rho(\rho+1)$ colours from the parts that are the same in both partitions, and conditioning on the result. If some rainbow $X_{i}$ is created, then it is clearly present in both colourings. Otherwise, it is easy to see that partition $\mathcal{\widehat{C}}$ is at least as likely to complete a rainbow colouring. Indeed, suppose that some $X_{i}$ has $s$ elements that are not coloured yet; we may assume that $1\leq s\leq\rho$ as, otherwise, such a set cannot be rainbow via the first partition (it could be rainbow via the second partition if $s=\rho+1$ ). The probability that partition ${\mathcal{C}}$ completes a rainbow colouring is equal to $\prod_{i=1}^{s-1}\frac{\rho(\rho+1)-i(\rho+1)}{\rho(\rho+1)-i}$ that is at most the corresponding probability for partition $\mathcal{\widehat{C}}$ , namely, $\prod_{i=1}^{s-1}\frac{\rho(\rho+1)-i\rho}{\rho(\rho+1)-i}$ .

Figure 1: The coupling of the two colourings in Case 1.
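The comparison of the two products in Case 1 can be checked with exact rational arithmetic. The following is a small sketch; the function name `completion_probabilities` is hypothetical, and the code only evaluates the two products stated above for given $\rho$ and $s$.

```python
from fractions import Fraction


def completion_probabilities(rho, s):
    """Return (p_C, p_C_hat) for s uncoloured elements, 1 <= s <= rho.

    p_C:     rho parts of size rho + 1 (partition C).
    p_C_hat: rho + 1 parts of size rho (partition C-hat).
    """
    total = rho * (rho + 1)  # number of colour copies still to be assigned
    p_c = Fraction(1)
    p_c_hat = Fraction(1)
    for i in range(1, s):
        # After i elements received distinct colours, total - i copies remain;
        # the allowed ones avoid the i parts that were already used.
        p_c *= Fraction(total - i * (rho + 1), total - i)
        p_c_hat *= Fraction(total - i * rho, total - i)
    return p_c, p_c_hat
```

For instance, with $\rho=2$ and $s=2$ the two values are $3/5$ and $4/5$, in line with the claimed inequality.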

Case 2: $\widehat{\rho}=\rho-1$ . As before, we start with partition $\mathcal{C}$ but this time we take all $q_{2}$ parts of size $\rho+1$ (possibly $q_{2}=0$ so we might not have any) and we choose $\rho-1-q_{2}$ parts of size $\rho$ . To create partition $\mathcal{\widehat{C}}$ , we replace them with $q_{2}$ parts of size $\rho$ and $\rho-q_{2}$ parts of size $\rho-1$ . As in the other case, either we create some rainbow $X_{i}$ or partition $\mathcal{\widehat{C}}$ is at least as likely to complete a rainbow colouring of some set $X_{i}$ .
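The bookkeeping in Case 2 can be verified mechanically: the parts that are removed and the parts that replace them must contain the same number of colour copies, and the number of parts must grow by one. The sketch below assumes a hypothetical helper `case2_swap` that takes $|X|$ and $q$ and returns the two lists of part sizes.

```python
def case2_swap(size_x, q):
    """Part sizes removed from C and added to form C-hat when rho_hat = rho - 1."""
    rho = size_x // q
    rho_hat = size_x // (q + 1)
    if rho_hat != rho - 1:
        raise ValueError("not in Case 2: rho_hat differs from rho - 1")
    q2 = size_x - q * rho  # number of parts of size rho + 1 in C
    removed = [rho + 1] * q2 + [rho] * (rho - 1 - q2)
    added = [rho] * q2 + [rho - 1] * (rho - q2)
    return removed, added
```

With $|X|=20$ and $q=6$ the partition $\mathcal{C}$ has part sizes $4,4,3,3,3,3$; the two parts of size 4 are replaced by parts of sizes $3,3,2$, which gives the balanced partition into 7 parts.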

The above coupling allows us to concentrate on the minimum number of colours. In particular, in the proof of Theorem 1, without loss of generality, we may assume that $q=n-1$ .

2.2 Degree Monotonicity

Based on Section 2.1, in order to prove Theorem 1 we may assume that $q=|Q|=n-1$ . In this setup, there are $k$ popular colours present $k+1$ times in $G_{k,q}$ and $q-k=n-1-k$ unpopular colours present $k$ times in $G_{k,q}$ . Exactly one out of $k+1$ copies of each popular colour is called special. Similarly, there are $k+1$ popular colours present $k+2$ times in $G_{k+1,q}$ , $q-(k+1)=n-2-k$ unpopular colours present $k+1$ times, and there are $k+1$ special copies of popular colours (one special copy of each).

It seems reasonable to expect that if $G_{k,q}$ has a particular rainbow structure w.h.p., then so does $G_{k+1,q}$ . Let $\Gamma_{k+1}$ be the bipartite graph with vertex sets $[n]$ and $Q^{\prime}=Q\cup\left\{q\right\}$ , where $q$ is a “dummy” vertex that will be associated with special copies of popular colours. For each of the $(k+1)$ edges chosen by $v\in[n]$ in $G_{k+1,q}$ , we observe a colour $c$ of that edge without exposing its other endpoint. If a copy of colour $c$ is non-special, then we add an edge between $v$ and $c\in Q$ in $\Gamma_{k+1}$ ; otherwise, we add an edge between $v$ and the “dummy” vertex $q$ . Note that we need the extra vertex $q$ to make $\Gamma_{k+1}$ regular.

Recall that $(k+1)n$ colours, including repetitions, are randomly assigned to the $(k+1)n$ edges of $G_{k+1,q}$ . Hence, $\Gamma_{k+1}$ is distributed as a random $(k+1)$ -regular bipartite (multi)graph (where multiple edges are allowed but not loops). Indeed, it fits the bipartite configuration model of Bollobás [4]. Each vertex could be replaced by a distinct set of $(k+1)$ points, each colour $c\in Q$ naturally corresponds to $(k+1)$ points associated with non-special copies of that colour, and the “dummy” vertex $q$ corresponds to $(k+1)$ points associated with special copies of popular colours. Then, we randomly pair these points to get the colouring of the edges of $G_{k+1,q}$ . Note that $\Gamma_{k+1}$ contains no information about the actual vertex choices of edges in $G_{k+1,q}$ , only their colour. Informally, we can think of each edge of $\Gamma_{k+1}$ being associated with a box containing a random vertex from $[n]$ . We do not need to open these boxes for what is next.

We will use the fact that $\Gamma_{k+1}$ is contiguous to the sum of $(k+1)$ independent random perfect matchings—see the survey on random regular graphs by Wormald [18]. If we delete one of these matchings, then we obtain a random $k$ -regular bipartite graph contiguous to $\Gamma_{k}$ . Arguing as before, we observe that $\Gamma_{k}$ may be viewed as a random assignment of $kn$ colours, including repetitions, to the $kn$ edges of $G_{k,q}$ . “Opening” the boxes on each edge of $\Gamma_{k}$ gives us $G_{k,q}$ , which by assumption has a required rainbow structure w.h.p. So, using the above coupling we conclude that $G_{k+1,q}$ also has a rainbow structure w.h.p.

3 Rainbow Spanning Trees

In view of the results in Sections 2.1 and 2.2, we will assume that $k=2$ and $q=n-1$ for this section.

To prove Theorem 1, that is, to establish the existence of an RST, we will use the result of Edmonds [7] on the matroid intersection problem. A finite matroid $M$ is a pair $(E,\mathcal{I})$ , where $E$ is a finite set (called the ground set) and $\mathcal{I}$ is a family of subsets of $E$ (called the independent sets) with the following properties:

• $\emptyset\in\mathcal{I}$ ,

• for each $A^{\prime}\subseteq A\subseteq E$ , if $A\in\mathcal{I}$ , then $A^{\prime}\in\mathcal{I}$ (hereditary property),

• if $A$ and $B$ are two independent sets of $\mathcal{I}$ and $A$ has more elements than $B$ , then there exists an element in $A$ that when added to $B$ gives a larger independent set than $B$ (augmentation property).

A maximal independent set (that is, an independent set which becomes dependent on adding any element of $E$ ) is called a basis for the matroid. An observation, directly analogous to the one of bases in linear algebra, is that any two bases of a matroid $M$ have the same number of elements. This number is called the rank of $M$ . For more details on matroids see, for example, [16].

In this scenario, $M_{1},M_{2}$ are matroids over a common ground set $E$ with rank functions $r_{1},r_{2}$ , respectively. Edmonds’ general theorem shows that

$$\max(|I|:I\mbox{ is independent in both matroids})=\min_{{E_{1}\cup E_{2}=E\atop E_{1}\cap E_{2}=\emptyset}}(r_{1}(E_{1})+r_{2}(E_{2})),\tag{1}$$

where $r_{i}(E_{i})$ is the rank of the matroid induced by $E_{i}$ .

In our application, the common ground set $E$ is the set of coloured multi-edges of $G_{k,q}$ . $M_{1}$ is the cycle matroid of the graph $G_{k-out}$ ; that is, $S\subseteq E$ is independent in $M_{1}$ if $S$ induces a graph with no cycle (colours are ignored, two parallel edges are considered to be a cycle of length 2). Hence, for every $S\subseteq E$ we have $r_{1}(S)=n-\kappa(S)$ , where $\kappa(S)$ is the number of components of the graph $G_{S}=([n],S)$ induced by $S$ . $M_{2}$ is the partition matroid associated with the colours; that is, $S\subseteq E$ is independent in $M_{2}$ if $S$ has no two edges in the same colour. This time, for every $S\subseteq E$ we have that $r_{2}(S)$ is the number of distinct colours occurring in $S$ . We use Edmonds’ theorem to get the following useful observation that has been used a number of times in related contexts. In temporal order, it was used by Fenner and Frieze [9], Frieze and McKay [14] and by Suzuki [17].

Lemma 2.

Let $G=(V,E)$ be a multigraph in which each edge is coloured with a colour from a set $Q$ . A necessary and sufficient condition for the existence of a RST is that

$$\kappa(C_{I})\leq|Q|+1-|I|\qquad\mbox{for all }I\subseteq Q,\tag{2}$$

where $C_{I}\subseteq E$ is the set of edges of colour from set $I$ .

Proof.

Clearly, $G$ has a rainbow spanning tree if and only if $G$ contains a set $S$ of coloured edges of size $|V|-1$ such that $S$ is independent both in $M_{1}$ ( $S$ induces a spanning tree) and in $M_{2}$ ( $S$ is rainbow). Since no set of size at least $|V|$ is independent in $M_{1}$ , the necessary and sufficient condition is that the right side of (1) is at least $|V|-1$ . Hence, the desired condition is that for every partition of the edge set $E$ into $E_{1}$ and $E_{2}$ we have $r_{1}(E_{1})+r_{2}(E_{2})\geq|V|-1$ .

Let us fix a partition of $E$ into $E_{1}$ and $E_{2}$ . Let $J\subseteq Q$ be the set of colours occurring in $E_{2}$ , $E^{\prime}_{2}\subseteq E$ be the set of edges coloured with a colour from $J$ , and $E^{\prime}_{1}=E\setminus E^{\prime}_{2}$ . Clearly, $(E^{\prime}_{1},E^{\prime}_{2})$ is also a partition of $E$ , $E_{2}\subseteq E^{\prime}_{2}$ and so $E^{\prime}_{1}\subseteq E_{1}$ , and $r_{2}(E_{2})=r_{2}(E^{\prime}_{2})=|J|$ . Moreover, since $E^{\prime}_{1}\subseteq E_{1}$ , $r_{1}(E^{\prime}_{1})\leq r_{1}(E_{1})$ and so

$$r_{1}(E_{1})+r_{2}(E_{2})\geq r_{1}(E^{\prime}_{1})+r_{2}(E^{\prime}_{2}).$$

Therefore, without loss of generality, we may restrict ourselves to sets $E_{2}$ containing all edges of colour from some set $J\subseteq Q$ and then take $I=Q\setminus J$ (that is, $E_{2}=C_{J}$ and $E_{1}=C_{I}$ ). The condition to verify is the following:

$$|V|-1\leq r_{1}(E_{1})+r_{2}(E_{2})=(|V|-\kappa(C_{I}))+(|Q|-|I|)$$

which is equivalent to (2). The proof of the lemma is finished. ∎
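Condition (2) is easy to test by brute force on tiny instances, which gives a useful sanity check of Lemma 2. The sketch below is illustrative only and runs in exponential time; the function names, the encoding of a multigraph as a list of triples `(u, v, colour)` on vertices `0..n-1`, and the direct enumeration of candidate trees are all assumptions of this example and are not the method used in the paper.

```python
from itertools import combinations


def components(n, edges):
    """Number of connected components of ([n], edges), via union-find."""
    parent = list(range(n))

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]  # path halving
            v = parent[v]
        return v

    count = n
    for u, v, _ in edges:
        ru, rv = find(u), find(v)
        if ru != rv:
            parent[ru] = rv
            count -= 1
    return count


def satisfies_condition(n, edges, colours):
    """Check kappa(C_I) <= |Q| + 1 - |I| for every subset I of the colour set Q."""
    colours = list(colours)
    for size in range(len(colours) + 1):
        for subset in combinations(colours, size):
            chosen = set(subset)
            c_i = [e for e in edges if e[2] in chosen]
            if components(n, c_i) > len(colours) + 1 - size:
                return False
    return True


def has_rainbow_spanning_tree(n, edges):
    """Exhaustively search for n - 1 edges that are rainbow and connect all vertices."""
    for tree in combinations(edges, n - 1):
        rainbow = len({c for _, _, c in tree}) == n - 1
        if rainbow and components(n, tree) == 1:
            return True
    return False
```

For example, a triangle coloured with colours 0, 0, 1 satisfies (2) and has a rainbow spanning tree, whereas a path on three vertices with both edges in the same colour fails (2) already for $I=\emptyset$.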

Recall that $k=2$ , $q=n-1$ , and so $\rho=k=2$ . For a given $\ell\in[q]\cup\{0\}$ , we define the event

$${\cal A}_{\ell}=\{\exists I\subseteq Q,|I|=\ell:\kappa(C_{I})\geq q-|I|+2\}.$$

Trivially, ${\cal A}_{0}$ cannot occur. Since each colour of $Q$ is used at least once, ${\cal A}_{1}$ cannot occur either: if $|I|=1$ , then $\kappa(C_{I})\leq n-1$ but $q-|I|+2\geq n$ . With some additional work we can eliminate ${\cal A}_{2}$ and ${\cal A}_{3}$ as well. Note that in $G_{k-out}$ , the probability that a vertex $v$ chooses $u$ as a neighbour and vice versa is $O(1/n^{2})$ . Hence, by linearity of expectation, the expected number of multiple edges in $G_{k-out}$ is $O(1)$ . The probability that both edges in a multiple edge receive the same (fixed) colour in $G_{k,q}$ is $O(1/q^{2})$ and so the expected number of monochromatic multiple edges in $G_{k,q}$ is $O(1/q)=o(1)$ . It follows from the first moment method that w.h.p. there are no monochromatic multiple edges. Since we aim for a statement that holds w.h.p., we may assume that this property is satisfied. We get that if $|I|=2$ , then $\kappa(C_{I})\leq n-2$ but $q-|I|+2\geq n-1$ . Similarly, the expected number of triples of colours such that there are two multiple edges that use only these colours is $O(1/q)=o(1)$ . Hence, we may assume that for any $I$ of size 3, $|C_{I}|\geq 5$ and so $\kappa(C_{I})\leq n-3$ whereas $q-|I|+2\geq n-2$ . Finally, note that w.h.p. ${\cal A}_{q}$ does not occur since $G_{k-out}$ is connected w.h.p.

We know that if there is no RST, then ${\cal A}_{\ell}$ occurs for some $\ell\in[4,q-1]$ . Let us concentrate on a minimal $\ell$ , corresponding set $I$ , and let $S=C_{I}$ . Let us start with the following simple but useful observation.

Claim 1.

$G_{S}$ has no bridges.

Proof.

If there is a bridge in $G_{S}$ , then we simply remove it and all edges of the same colour. The number of components increases by at least one and the number of colours decreases by one. Clearly, ${\cal A}_{\ell-1}$ occurs, contradicting the minimality of $\ell$ . ∎

Recall that $\kappa(S)\geq q-\ell+2=n-\ell+1$ . Suppose then that $G_{S}$ has $i$ isolated vertices and $n-\ell+x-i$ non-trivial components for some integer $x\geq 1$ . Let $T$ be the set of vertices in the non-trivial components of $G_{S}$ . By Claim 1, $G_{S}$ has no bridges. Since non-trivial components without bridges have at least three vertices,

$$i+3(n-\ell+x-i)\leq n\tag{3}$$

which gives us that

$$|T|=n-i\leq\frac{3}{2}(\ell-x)\leq\frac{3}{2}(\ell-1).$$

Let ${\cal B}_{\ell}$ denote the event

$$\begin{array}{ll}\{\exists I\subseteq Q,|I|=\ell,T\subseteq[n]:&t=|T|\leq 3(\ell-1)/2,\\ &\mbox{all edges coloured with $I$ are contained in $T$},\\ &\mbox{there are $u\geq\max\{\rho\ell,t\}$ $I$-coloured edges}\}.\end{array}$$

(Here $u\geq t$ because we are dealing with bridgeless components and $u\geq\rho\ell$ as each colour appears at least $\rho$ times.) Clearly,

$${\cal A}_{\ell}\subseteq{\cal B}_{\ell}\qquad\mbox{for }\ell\geq 4,\tag{4}$$

and we will bound the probability of ${\cal B}_{\ell}$ to deal with small values of $\ell$ , that is, for $\ell\leq n/20$ .

3.1 $4\leq\ell\leq n/20$

Recall that $k=2$ , $q=n-1$ , and $\rho={\left\lfloor kn/q\right\rfloor}={\left\lfloor kn/(n-1)\right\rfloor}=2$ . There are $q_{1}=q-k=n-1-k=n-3$ unpopular colours that appear $k=2$ times and $q_{2}=k=2$ popular colours that appear $k+1=3$ times.

Note that

$$\begin{aligned}
\mathbb{P}({\cal B}_{\ell})&\leq\sum_{t=1}^{3(\ell-1)/2}\binom{n}{t}\sum_{\ell_{1}+\ell_{2}=\ell}\binom{q_{1}}{\ell_{1}}\binom{q_{2}}{\ell_{2}}\binom{tk}{k\ell+\ell_{2}}\\
&\qquad\times\left(\frac{t}{n}\right)^{k\ell+\ell_{2}}\frac{(k\ell+\ell_{2})!}{(kn)(kn-1)\cdots(kn-(k\ell+\ell_{2}-1))},
\end{aligned}$$

where the second sum is taken over all integers $0\leq\ell_{1}\leq q_{1}$ , $0\leq\ell_{2}\leq q_{2}$ such that $\ell_{1}+\ell_{2}=\ell$ . Indeed, in order to estimate $\mathbb{P}({\cal B}_{\ell})$ , we consider all possibilities for $t=|T|\leq 3(\ell-1)/2$ (the first sum) and all sets of size $t$ (the term ${n\choose t}$ ). We independently consider sets of colours $I\subseteq Q$ with $\ell_{1}$ unpopular colours and $\ell_{2}$ popular ones (the second sum). Then, we need to select specific colours (the term $\binom{q_{1}}{\ell_{1}}\binom{q_{2}}{\ell_{2}}$ ). Since all edges coloured with $I$ are contained in $T$ , we need to select which of the $tk$ edges of $G_{k-out}$ generated by vertices of $T$ are coloured with $I$ (the term $\binom{tk}{k\ell+\ell_{2}}$ ). The selected edges need to stay within $T$ (the term $\left(\frac{t}{n}\right)^{k\ell+\ell_{2}}$ ) and be coloured with $I$ (the last term).

Clearly, $\binom{tk}{k\ell+\ell_{2}}\leq(tk)^{k\ell+\ell_{2}}/(k\ell+\ell_{2})!$ . Moreover, since $\ell\leq n/20$ and $\ell_{2}\leq q_{2}=k$ ,

$$\begin{aligned}
(kn)(kn-1)\cdots(kn-(k\ell+\ell_{2}-1))&\geq(kn)^{k\ell+\ell_{2}}\left(1-\frac{k\ell+\ell_{2}}{kn}\right)^{k\ell+\ell_{2}}\\
&\geq(kn)^{k\ell+\ell_{2}}\left(\frac{19}{20}+o(1)\right)^{k\ell+\ell_{2}}\\
&=(kn)^{k\ell+\ell_{2}}\left(\sqrt{\frac{20}{19}}+o(1)\right)^{-2(k\ell+\ell_{2})}\\
&\geq(kn)^{k\ell+\ell_{2}}\ 1.03^{-2(k\ell+\ell_{2})}.
\end{aligned}$$

We get that

$$\begin{aligned}
\mathbb{P}({\cal B}_{\ell})&\leq\sum_{t=1}^{3(\ell-1)/2}\binom{n}{t}\sum_{\ell_{1}+\ell_{2}=\ell}\binom{q_{1}}{\ell_{1}}\binom{q_{2}}{\ell_{2}}(tk)^{k\ell+\ell_{2}}\left(\frac{t}{n}\right)^{k\ell+\ell_{2}}\frac{1.03^{\,2(k\ell+\ell_{2})}}{(kn)^{k\ell+\ell_{2}}}\\
&=\sum_{t=1}^{3(\ell-1)/2}\binom{n}{t}\sum_{\ell_{1}+\ell_{2}=\ell}\binom{q_{1}}{\ell_{1}}\binom{q_{2}}{\ell_{2}}\left(\frac{1.03t}{n}\right)^{2(k\ell+\ell_{2})}.
\end{aligned}$$

Since $\binom{q_{1}}{\ell_{1}}\binom{q_{2}}{\ell_{2}}\leq\binom{q}{\ell}\leq\binom{n}{\ell}\leq(ne/\ell)^{\ell}$ , $\binom{n}{t}\leq(ne/t)^{t}$ , and $2(k\ell+\ell_{2})\geq 2k\ell=4\ell$ , we get

$$\mathbb{P}({\cal B}_{\ell})\leq(\ell+1)\sum_{t=1}^{3(\ell-1)/2}\left(\frac{ne}{t}\right)^{t}\left(\frac{ne}{\ell}\right)^{\ell}\left(\frac{1.03t}{n}\right)^{4\ell}.$$

Note that since the ratio between the $(t+1)$ -st and the $t$ -th term is

$$\frac{ne}{t+1}\left(\frac{t+1}{t}\right)^{4\ell-t}\geq\frac{ne}{t+1}\geq 2,$$

the above sum is of the order of its last term. We get that

$$\begin{aligned}
\mathbb{P}({\cal B}_{\ell})&=O\left(\ell\left(\frac{ne}{3\ell/2}\right)^{3\ell/2}\left(\frac{ne}{\ell}\right)^{\ell}\left(\frac{1.03\cdot 3\ell/2}{n}\right)^{4\ell}\right)\\
&=O\left(\ell\left(\frac{2}{3}\cdot e^{5/3}\cdot 1.545^{8/3}\ \frac{\ell}{n}\right)^{3\ell/2}\right)\\
&=O\left(\ell\left(\frac{12\,\ell}{n}\right)^{3\ell/2}\right).
\end{aligned}$$

Clearly, $\sum_{\ell=4}^{n/20}\mathbb{P}({\cal B}_{\ell})=O(n^{-6})=o(1)$ , since the sum is dominated by its first term.
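The numerical constants used in this subsection are easy to confirm. The sketch below evaluates $\sqrt{20/19}$, the constant $\frac{2}{3}e^{5/3}\cdot 1.545^{8/3}$, and the final bound $\ell(12\ell/n)^{3\ell/2}$ summed over $4\leq\ell\leq n/20$ for a sample value of $n$. The function names are hypothetical, and the hidden constant in the $O(\cdot)$ notation is taken to be 1 purely for illustration.

```python
import math


def small_ell_constant():
    """The constant (2/3) * e^(5/3) * 1.545^(8/3), which the text bounds by 12."""
    return (2 / 3) * math.exp(5 / 3) * 1.545 ** (8 / 3)


def bound_term(ell, n):
    """The quantity ell * (12 * ell / n)^(3 * ell / 2) for a single value of ell."""
    return ell * (12 * ell / n) ** (1.5 * ell)


def tail_sum(n):
    """Sum of bound_term over 4 <= ell <= n / 20."""
    return sum(bound_term(ell, n) for ell in range(4, n // 20 + 1))
```

The constant evaluates to roughly 11.26, which justifies replacing it by 12, and for $n=10^{4}$ the sum is within a factor of two of its first term $4(48/n)^{6}$.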

3.2 $n/20<\ell\leq n-\frac{5001n\log\log n}{\log n}$

We first bound the number of pairwise vertex disjoint cycles in $G_{k-out}$ , including loops (cycles of length 1) and parallel edges (cycles of length 2). In our application, we concentrate on $k=2$ but the following bound holds in general.

Lemma 3.

Fix any integer $k\geq 2$ . W.h.p. no family of pairwise vertex disjoint cycles in $G_{k-out}$ consists of more than $3n\log(2k)/\log n$ cycles.

Proof.

Let $\omega=\frac{\log n}{2\log(2k)}$ . Let $Z$ denote the number of cycles of length at most $\omega$ in $G_{k-out}$ (possibly overlapping). There are $\binom{n}{s}\frac{s!}{2s}=\binom{n}{s}\frac{(s-1)!}{2}$ cycles of length $s$ that one can form out of $n$ labelled vertices. For a given pair of vertices, the probability that there is an edge between them is clearly at most $\frac{2k}{n}$ . Hence,

$$\mathbb{E}\left(Z\right)\leq\sum_{s=1}^{\omega}\binom{n}{s}\frac{(s-1)!}{2}\left(\frac{2k}{n}\right)^{s}\leq\sum_{s=1}^{\omega}\frac{(2k)^{s}}{2s}.$$

Since the ratio between the two consecutive terms ( $(s+1)$ st and $s$ th) is $(2k)s/(s+1)\geq k\geq 2$ ,

$$\mathbb{E}\left(Z\right)\leq\frac{(2k)^{\omega}}{\omega}=o\left(\exp\left(\frac{\log n}{2\log(2k)}\log(2k)\right)\right)=o(\sqrt{n}).$$

The Markov inequality implies that $Z\leq n\log(2k)/\log n$ w.h.p.

On the other hand, there are trivially at most $n/\omega=2n\log(2k)/\log n$ pairwise vertex disjoint cycles of length greater than $\omega$ , and the lemma follows. ∎
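To get a feeling for the quantities in the proof of Lemma 3, one can evaluate the first-moment bound numerically. The sketch below computes $\omega$ and the sum $\sum_{s\leq\omega}(2k)^{s}/(2s)$; the function name is hypothetical, and truncating the sum at $\lfloor\omega\rfloor$ is an assumption of this illustration.

```python
import math


def short_cycle_expectation_bound(n, k):
    """Return (omega, sum_{s=1}^{floor(omega)} (2k)^s / (2s))."""
    omega = math.log(n) / (2 * math.log(2 * k))
    s_max = math.floor(omega)
    total = sum((2 * k) ** s / (2 * s) for s in range(1, s_max + 1))
    return omega, total
```

For $n=10^{6}$ and $k=2$ this gives $\omega\approx 4.98$ and a sum of about $48.7$, well below both $(2k)^{\omega}/\omega=\sqrt{n}/\omega\approx 200$ and the threshold $n\log(2k)/\log n$ used in the application of Markov's inequality.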

For the next property we need, we assume that $k=2$ . As in Section 2.2, let $\Gamma_{2}$ be the bipartite (multi)graph with vertex sets $[n]$ and $Q^{\prime}=Q\cup\{q\}$ , where $q$ is a “dummy” vertex associated with special copies of popular colours. There is an edge $vc$ in $\Gamma_{2}$ if one of $v$ ’s choices has non-special copy of colour $c$ ; as before, an edge $vq$ occurs in $\Gamma_{2}$ if one of $v$ ’s choices is one of the two special copies of popular colours. $\Gamma_{2}$ is a random 2-regular bipartite graph. Any 2-regular graph is a collection of cycles. We will use a well-known and easy to prove fact that w.h.p. there are not too many cycles in $\Gamma_{2}$ . For completeness, we provide an elementary proof.

Lemma 4.

$\Gamma_{2}$ contains at most $(\log n)^{2}$ cycles w.h.p.

Proof.

Let $Z$ denote the number of cycles in $\Gamma_{2}$ . We will use the configuration model of Bollobás [4] to estimate $\mathbb{E}(Z)$ ; see also Chapter 11 of Frieze and Karoński [12]. Let us select any vertex $v$ and any of the two points in it. We expose the other endpoint of that edge associated with vertex $y$ , and move on to the other point associated with $y$ . We continue the process until the first cycle is discovered. Then, we select any other point that is not matched yet and continue from there. The probability of closing a cycle at the $i$ th step of this process is precisely $1/(2n-2i+1)$ . Indeed, $2(i-1)$ points are matched before step $i$ and one point is considered at the $i$ th step, so there are $2n-2i+1$ points available to be matched with the considered point to form an edge; only one of these points (namely, the one associated with vertex $v$ we start with) closes the cycle. We get that

$$\mathbb{E}(Z)=\sum_{i=1}^{n}\frac{1}{2n-2i+1}=\sum_{j=1}^{2n}\frac{1}{j}-\sum_{j=1}^{n}\frac{1}{2j}=\ln(2n)-\frac{1}{2}\ln(n)+O(1)=\frac{1}{2}\ln n+O(1).$$

Showing concentration of $Z$ around its expectation is easy but, since we aim for a slightly weaker result, we may trivially use the Markov inequality to get the desired bound that holds w.h.p. ∎
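The formula for $\mathbb{E}(Z)$ can be confirmed exactly for very small $n$ by enumerating all pairings in the configuration model. The following sketch is illustrative only: the function names, the labelling of points (points $2v$ and $2v+1$ belong to vertex $v$ on each side), and the exhaustive enumeration are assumptions of this example, and the enumeration is feasible only for tiny $n$.

```python
from fractions import Fraction
from itertools import permutations


def expected_cycles(n):
    """The sum 1/(2n-1) + 1/(2n-3) + ... + 1/1 from the proof of Lemma 4."""
    return sum(Fraction(1, 2 * n - 2 * i + 1) for i in range(1, n + 1))


def count_cycles(n, matching):
    """Number of cycles of the 2-regular bipartite multigraph given by a pairing.

    matching[p] is the right point paired with left point p.
    """
    parent = list(range(2 * n))  # left vertices 0..n-1, right vertices n..2n-1

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]
            v = parent[v]
        return v

    comps = 2 * n
    for left_point, right_point in enumerate(matching):
        a, b = find(left_point // 2), find(n + right_point // 2)
        if a != b:
            parent[a] = b
            comps -= 1
    # Every vertex has degree 2, so each connected component is a single cycle.
    return comps


def exact_mean_cycles(n):
    """Average number of cycles over all (2n)! pairings of left and right points."""
    total = 0
    count = 0
    for matching in permutations(range(2 * n)):
        total += count_cycles(n, matching)
        count += 1
    return Fraction(total, count)
```

For $n=2$ both computations give $4/3$: there is always at least one cycle, and a second one appears exactly when the two points of a left vertex are paired with the two points of the same right vertex, which happens with probability $1/3$.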

We continue concentrating on a minimal $\ell$ (in the range for $\ell$ considered in this section) for which ${\cal A}_{\ell}$ occurs and the corresponding set $I\subseteq Q$ . We let $S=C_{I}$ and we expose $G_{S}=([n],S)$ , the sub-graph induced by $S$ . In particular, $\kappa(S)\geq n-\ell+1$ and we may assume that $G_{S}$ is bridgeless by Claim 1. Let $X_{0}$ denote the number of isolated vertices of $G_{S}$ . Since $G_{S}$ is bridgeless, each non-trivial component has a cycle. Hence, the number of non-trivial components, $\eta$ , is bounded by the number of pairwise disjoint cycles. By Lemma 3, we may assume that $\eta\leq\frac{3n\log(2k)}{\log n}\leq\frac{5n}{\log n}$ . It follows that

$$n-\ell+1\leq\kappa(S)=X_{0}+\eta\text{ where }\eta\leq\frac{5n}{\log n}.\tag{5}$$

Almost all sets of colours $I$ yield a graph $G_{S}$ with many isolated vertices ensuring that (5) does not hold. Only a small fraction of possible sets of colours need special attention. We will use $\Gamma_{2}$ to estimate how many different configurations we need to take care of.

Let us concentrate on the set of edges incident with colours $I$ in $\Gamma_{2}$ , ignoring the two edges incident with a “dummy” vertex $q$ . (Note that removing at most two edges from $G_{S}$ may only increase the number of isolated vertices, so $X_{0}\leq X_{0}^{\prime}$ , where $X_{0}^{\prime}$ is the number of isolated vertices in $G_{S^{\prime}}$ and $S^{\prime}$ is the set obtained from $S$ after removing the edges incident with special copies of popular colours; there are at most two such edges.) By Lemma 4, this set of edges in $\Gamma_{2}$ induces a graph $\Gamma$ consisting of $r\leq(\log n)^{2}$ cycles and $s$ paths. The $2s$ endpoints of paths in $\Gamma$ must be in $[n]$ and so each path corresponds to two distinct non-isolated vertices of $G_{S^{\prime}}$ . Let $x$ be the number of vertices in $[n]$ in $\Gamma$ that are of degree 2 (vertices on cycles or internal vertices on paths); they also correspond to non-isolated vertices of $G_{S^{\prime}}$ . The remaining vertices in $\Gamma$ are isolated and the corresponding vertices in $G_{S^{\prime}}$ might be isolated. Note that we have not yet exposed the edges; we have only assigned colours. It is possible that after exposing edges such vertices will be chosen by other ones and become non-isolated. In any case, $X^{\prime}_{0}\leq n-2s-x$ . By comparing the number of edges in $\Gamma$ incident to $[n]$ with the number of edges incident to $I\subseteq Q$ , we get $2\cdot x+1\cdot(2s)=2\ell$ so $x=\ell-s$ . Combining the two observations together, we get $X^{\prime}_{0}\leq n-2s-x=n-\ell-s$ .

If $s\geq\frac{5n}{\log n}$ , then

$$\kappa(G_{S})=X_{0}+\eta\leq X^{\prime}_{0}+\eta\leq n-\ell-s+\eta\leq n-\ell-\frac{5n}{\log n}+\eta\leq n-\ell,$$

contradicting (5).

Let us assume then that $s<\frac{5n}{\log n}$ . Let $V_{0}\subseteq[n]$ denote the set of vertices in $\Gamma$ that are covered by paths and cycles, and let $V_{1}=[n]\setminus V_{0}$ . As argued above, $|V_{1}|=n-\ell-s$ and so $|V_{1}|\geq\frac{5000n\log\log n}{\log n}$ since $\ell\leq n-\frac{5001n\log\log n}{\log n}$ and $s<\frac{5n}{\log n}<\frac{n\log\log n}{\log n}$ . Vertices in $V_{0}$ correspond to non-isolated vertices in $G_{S^{\prime}}$ . Vertices in $G_{S^{\prime}}$ corresponding to vertices in $V_{1}$ (which are isolated in $\Gamma$ ) do not generate any $I$ -coloured random edges, but this does not mean that they are isolated in $G_{S^{\prime}}$ . Let $V_{2}\subseteq V_{1}$ denote the set of vertices that are incident with an $I$ -coloured edge in $G_{S^{\prime}}$ because they are chosen by some vertex in $V_{0}$ . It is important to notice that we have not conditioned on the other endpoints of the choices in $\Gamma$ (endpoints of random edges generated by vertices in $G_{S^{\prime}}$ corresponding to vertices in $V_{0}$ in $\Gamma$ ). We may expose these $2\ell\geq n/10$ edges now, one at a time, each time updating set $V_{2}$ . Provided that $|V_{2}|\leq|V_{1}|/50$ , we add a new vertex to $V_{2}$ with probability at least $(49|V_{1}|/50)/n\geq|V_{1}|/(2n)$ . Let $X\in\text{Bin}\left(n/10,|V_{1}|/(2n)\right)$ with $\mathbb{E}(X)=|V_{1}|/20\geq\frac{250n\log\log n}{\log n}$ . The established coupling implies that

$$\mathbb{P}\left(|V_{2}|\leq\frac{|V_{1}|}{50}\right)\leq\mathbb{P}\left(X\leq\frac{|V_{1}|}{50}\right)\leq\mathbb{P}\left(X\leq\frac{\mathbb{E}(X)}{2}\right),$$

and so the Chernoff bound implies that

$$\mathbb{P}\left(|V_{2}|\leq\frac{100n\log\log n}{\log n}\right)\leq\exp\left(-\frac{\mathbb{E}(X)}{12}\right)\leq\exp\left(-\frac{20n\log\log n}{\log n}\right).$$

Now, we need to estimate the number of configurations we need to investigate. After orienting (arbitrarily) cycles in $\Gamma_{2}$ , there are at most $\binom{n}{s}$ choices for the beginnings and at most $\binom{n}{s}$ choices for the endings of paths. Such choices yield paths in $\Gamma$ . By Lemma 4, there are at most $2^{(\log n)^{2}}$ choices to determine which cycles from $\Gamma_{2}$ should stay in $\Gamma$ . Hence, the number of configurations (different bipartite graphs $\Gamma$ ) we need to deal with is at most

$$\begin{aligned}
\sum_{s<5n/\log n}\binom{n}{s}^{2}2^{(\log n)^{2}}&\leq 2\cdot\binom{n}{5n/\log n}^{2}2^{(\log n)^{2}}\leq 2\cdot\left(\frac{en}{5n/\log n}\right)^{10n/\log n}2^{(\log n)^{2}}\\
&\leq\exp\left(\frac{10n\log\log n}{\log n}+O((\log n)^{2})\right)\leq\exp\left(\frac{15n\log\log n}{\log n}\right).
\end{aligned}$$

Comparing it with an upper bound for the failure probability for each configuration, we get that if $s<\frac{5n}{\log n}$ and $|V_{1}|\geq\frac{5000n\log\log n}{\log n}$ , then w.h.p. $|V_{2}|>\frac{100n\log\log n}{\log n}$ . Since $X^{\prime}_{0}\leq n-\ell-s-|V_{2}|\leq n-\ell-|V_{2}|$ , we get that

\kappa(G_{S})\leq X^{\prime}_{0}+\eta\leq n-\ell-\frac{100n\log\log n}{\log n}+\eta<n-\ell,

contradicting (5).

3.3 $n-\frac{5001n\log\log n}{\log n}<\ell\leq n-1$

The two special copies of the two popular colours would cause unnecessary technical complications in the following calculations. So let us delete the two edges, $e_{1},e_{2}$ , associated with the special copies so that each colour is used exactly twice. The graph $G^{*}$ obtained this way has $2n-2$ edges. Deleting edges can only increase the number of connected components, so it suffices to prove that this quantity is small enough to satisfy (2) after the deletion. It is straightforward to show that $G^{*}$ remains connected w.h.p., but we will not need this fact in our argument.

For the range of $\ell$ considered in this section, it is easier to concentrate on the largest set of colours $I\subseteq Q$ and the associated set of edges $S=C_{I}\cup\{e_{1},e_{2}\}$ for which $G_{S}$ has too many components i.e. $\kappa(G_{S})\geq n-\ell+1$ , in violation of (2). Note that for every colour $c$ not in $I$ , the two edges of colour $c$ in $G^{*}$ join distinct components of $G_{S}$ ; otherwise, $I\cup\left\{c\right\}$ also yields a graph with too many components. This means that there are no edges of colour belonging to $Q\setminus I$ in $G^{*}$ joining vertices in the same component of $G_{S}$ . Put another way, let ${\mathcal{C}}_{m}$ be the event in $G_{2,q}$ that we can find $m$ colours and the associated $m$ unique pairs of edges $M$ such that

(i) $M\cap\{e_{1},e_{2}\}=\emptyset$ ,
(ii) each pair in $M$ has the same colour (that is distinct from the colours of other pairs in $M$ ),
(iii) $G_{\bar{M}}=G^{*}-M=G_{2-out}-(M\cup\{e_{1},e_{2}\})$ has at least $m+2$ components, and
(iv) no edge of $M$ joins two vertices of the same component of $G_{\bar{M}}$ .

We have to show that ${\mathcal{C}}_{m}$ is unlikely, for $1\leq m\leq m_{0}:=\frac{5001n\log\log n}{\log n}$ .

Suppose that $V_{1},V_{2},\ldots,V_{p}$ are the components of $G_{\bar{M}}$ . Let $n_{i}=|V_{i}|$ . We will use $\delta_{v}$ for the number of choices of $v$ in $G_{2-out}$ outside its component in $G_{\bar{M}}$ , and let $\Delta_{i}=\sum_{v\in V_{i}}\delta_{v}$ . Note that the number of edges in component $V_{i}$ is $2n_{i}-\Delta_{i}$ . Hence,

\Delta_{i}\leq n_{i}+1,

(6)

as otherwise $V_{i}$ would contain fewer than $n_{i}-1$ edges, which is not enough for it to be connected. It follows that

\begin{aligned}
\mathbb{P}({\mathcal{C}}_{m}) &\leq\binom{n-1}{m}\sum_{p=m+2}^{2m+3}\sum_{\begin{subarray}{c}n_{1}+\cdots+n_{p}=n\\ n_{1}\geq n_{2}\geq\cdots\geq n_{p}\geq 1\end{subarray}}\frac{1}{\Psi(n_{1},\ldots,n_{p})}\binom{n}{n_{1},n_{2},\ldots,n_{p}}\\
&\qquad\times\sum_{\begin{subarray}{c}\Delta_{1}+\cdots+\Delta_{p}=2m\\ \Delta_{1},\ldots,\Delta_{p}\geq 0\end{subarray}}\prod_{i=1}^{p}\left(\frac{n_{i}}{n}\right)^{2n_{i}-\Delta_{i}}\left(1-\frac{n_{i}}{n}\right)^{\Delta_{i}}\binom{2n_{i}}{\Delta_{i}}\frac{1}{\binom{2n}{2m}},
\end{aligned}

where

\Psi(n_{1},\ldots,n_{p})=\prod_{i=1}^{n}\ell_{i}!\quad\text{ with }\quad\ell_{i}=|\left\{j\in[p]:n_{j}=i\right\}|.

Indeed, we first need to choose the colours $M$ which can be done in $\binom{n-1}{m}$ ways. There are at least $m+2$ components in $G_{\bar{M}}$ but, since we removed exactly $2m+2$ edges from $G_{2-out}$ to get $G_{\bar{M}}$ , the number of them is at most $2m+3$ . We then need to choose the component sizes and the components in $\sum_{\begin{subarray}{c}n_{1}+\cdots+n_{p}=n\\ n_{1}\geq n_{2}\geq\cdots\geq n_{p}\geq 1\end{subarray}}\binom{n}{n_{1},n_{2},\ldots,n_{p}}/\Psi(n_{1},\ldots,n_{p})$ ways. Function $\Psi(n_{1},\ldots,n_{p})$ removes an implicit ordering of the components in the multinomial coefficient. We then consider all possibilities for the number of $M$ coloured edges (the $2m$ edges present in $G^{*}$ ) leaving each component in $\sum_{\begin{subarray}{c}\Delta_{1}+\cdots+\Delta_{p}=2m\\ \Delta_{1},\ldots,\Delta_{p}\geq 0\end{subarray}}$ ways. The factor $\binom{2n_{i}}{\Delta_{i}}$ accounts for choosing which of the $2n_{i}$ edges generated by vertices in $V_{i}$ have colour in $M$ and are not one of the two special edges, $e_{1},e_{2}$ . The factor $n_{i}/n$ (respectively, $1-n_{i}/n$ ) is the probability that the edge choice of a vertex in $V_{i}$ is in $V_{i}$ (respectively, not in $V_{i}$ ). Finally, we need to make sure that the $2m$ edges we identified received precisely the $2m$ non-special copies of the $m$ colours we selected. This happens with probability $1/\binom{2n}{2m}$ as any set of $2m$ colours from the set of $2n$ available colours (including repetitions and including special copies) is assigned to these edges with uniform probability; only one of them has colours that are exactly the ones we selected at the very beginning (note that special colours were excluded then).
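The role of $\Psi$ in the count of components can be illustrated on small cases. The following sketch is not part of the proof; the helper names (`psi`, `multinomial`, `count_unordered_partitions`, `brute_force`) are hypothetical. It checks by brute force that $\binom{n}{n_{1},\ldots,n_{p}}/\Psi(n_{1},\ldots,n_{p})$ counts the partitions of an $n$-element set into unlabelled blocks of the prescribed sizes.

```python
from collections import Counter
from itertools import permutations
from math import factorial, prod


def psi(sizes):
    """Psi(n_1,...,n_p): product over sizes i of (number of parts of size i)!."""
    return prod(factorial(mult) for mult in Counter(sizes).values())


def multinomial(sizes):
    """n! / (n_1! ... n_p!) with n = sum(sizes)."""
    return factorial(sum(sizes)) // prod(factorial(s) for s in sizes)


def count_unordered_partitions(sizes):
    """Multinomial coefficient with the ordering of equal-size blocks removed."""
    return multinomial(sizes) // psi(sizes)


def brute_force(sizes):
    """Enumerate partitions of {0,...,n-1} into blocks with the given sizes."""
    n = sum(sizes)
    seen = set()
    for perm in permutations(range(n)):
        blocks, start = [], 0
        for s in sizes:
            blocks.append(frozenset(perm[start:start + s]))
            start += s
        seen.add(frozenset(blocks))
    return len(seen)


for sizes in [(2, 1, 1), (2, 2), (3, 1, 1)]:
    print(sizes, count_unordered_partitions(sizes), brute_force(sizes))
```

For instance, for sizes $(2,1,1)$ the multinomial coefficient is $12$ and $\Psi=2!\cdot 1!=2$, giving the $6=\binom{4}{2}$ partitions of a $4$-element set into one pair and two singletons.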

Continuing,

\begin{aligned}
\mathbb{P}({\mathcal{C}}_{m}) &\leq\frac{\binom{n}{m}}{\binom{2n}{2m}}\sum_{p=m+2}^{2m+3}\sum_{\begin{subarray}{c}n_{1}+\cdots+n_{p}=n\\ n_{1}\geq n_{2}\geq\cdots\geq n_{p}\geq 1\end{subarray}}\frac{1}{\Psi(n_{1},\ldots,n_{p})}\binom{n}{n_{1},n_{2},\ldots,n_{p}}\\
&\qquad\times\sum_{\Delta_{1}+\cdots+\Delta_{p}=2m}\prod_{i=1}^{p}\left(\frac{n_{i}}{n}\right)^{2n_{i}-\Delta_{i}}\left(1-\frac{n_{i}}{n}\right)^{\Delta_{i}}\binom{2n_{i}}{\Delta_{i}}\\
&\leq\frac{\binom{n}{m}}{\binom{2n}{2m}}\sum_{p=m+2}^{2m+3}\sum_{\begin{subarray}{c}n_{1}+\cdots+n_{p}=n\\ n_{1}\geq n_{2}\geq\cdots\geq n_{p}\geq 1\end{subarray}}\frac{1}{\Psi(n_{1},\ldots,n_{p})}\binom{n}{n_{1},n_{2},\ldots,n_{p}}\\
&\qquad\times\sum_{\Delta_{1}+\cdots+\Delta_{p}=2m}\prod_{i=1}^{p}\left(\frac{n_{i}}{n}\right)^{2n_{i}-\Delta_{i}}\binom{2n_{i}}{\Delta_{i}}\\
&\leq\frac{\binom{n}{m}}{\binom{2n}{2m}}\frac{n!}{n^{2n-2m}}\sum_{\begin{subarray}{c}p\geq m+2\\ n_{1}+\cdots+n_{p}=n\\ n_{1}\geq n_{2}\geq\cdots\geq n_{p}\geq 1\\ \Delta_{1}+\cdots+\Delta_{p}=2m\end{subarray}}\frac{1}{\Psi(n_{1},\ldots,n_{p})}\prod_{i=1}^{p}\frac{n_{i}^{2n_{i}-\Delta_{i}}}{n_{i}!}\binom{2n_{i}}{\Delta_{i}}.
\end{aligned}

(7)

Let us prove the following simple structural property of $G_{2-out}$ . It will imply that the largest component (of size $n_{1}$ ) has size asymptotic to $n$ .

Lemma 5.

For $S\subseteq[n]$ , let $e^{+}(S)$ denote the number of choices by vertices in $S$ that are not in $S$ , and let $e(S,[n]\setminus S)=e^{+}(S)+e^{+}([n]\setminus S)$ denote the number of edges in $G_{2-out}$ that are between $S$ and its complement. Then, w.h.p. the following property holds for any $m$ such that $1\leq m\leq m_{0}:=\frac{5001n\log\log n}{\log n}$ :

for all $S\subseteq[n]$ with $9m\leq|S|\leq n/2$ , we have $e(S,[n]\setminus S)>2m+2$ .
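
The quantities $e^{+}(S)$ and $e(S,[n]\setminus S)$ in the lemma can be made concrete with a short sketch. It is an illustration only and rests on an assumption: each vertex makes $k$ independent uniform choices from the whole vertex set (so loops and repeated choices are allowed), which matches the probabilities $n_{i}/n$ used above but may differ from other conventions for $G_{k-out}$. The function names are hypothetical.

```python
import random


def sample_k_out(n, k=2, seed=None):
    """choices[v] lists the k vertices chosen by v, uniform over range(n) (assumption)."""
    rng = random.Random(seed)
    return [[rng.randrange(n) for _ in range(k)] for _ in range(n)]


def e_plus(choices, subset):
    """Number of choices made by vertices of `subset` that land outside `subset`."""
    inside = set(subset)
    return sum(1 for v in inside for w in choices[v] if w not in inside)


def cut_size(choices, subset):
    """e(S, complement) = e_plus(S) + e_plus(complement)."""
    inside = set(subset)
    outside = set(range(len(choices))) - inside
    return e_plus(choices, inside) + e_plus(choices, outside)


# Small hand-made instance on vertices {0, 1, 2}.
toy = [[1, 2], [0, 0], [2, 1]]
print(e_plus(toy, {0}), cut_size(toy, {0}))

# A random instance; the subset is an arbitrary fixed set, not a worst case.
choices = sample_k_out(2000, k=2, seed=1)
print(cut_size(choices, range(200)))
```

Note that the lemma quantifies over all sets $S$ in the given size range, which this sketch does not attempt to enumerate; it only evaluates the cut for one chosen set.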

Proof.

We will independently deal with small and large sets by proving the following statement. For any $m$ such that $1\leq m\leq m_{0}:=\frac{5001n\log\log n}{\log n}=o(n)$ , the following two properties hold with probability $1-O(n^{-1})$ :

(a) for all $S\subseteq[n]$ with $9m\leq|S|=s\leq n/200$ , we have $e^{+}(S)\geq s/2>2m+2$ . (8)
(b) for all $S\subseteq[n]$ with $n/200\leq|S|=s\leq n/2$ , we have $e(S,[n]\setminus S)>2m+2$ . (9)

Note that, since $16e^{2}<200$ ,

\begin{aligned}
\mathbb{P}(\neg(8)) &\leq\sum_{s=9m}^{n/200}\binom{n}{s}\binom{2s}{3s/2}\left(\frac{s}{n}\right)^{3s/2}\leq\sum_{s=9m}^{n/200}\left(\frac{en}{s}\right)^{s}2^{2s}\left(\frac{s}{n}\right)^{3s/2}\\
&\leq\sum_{s=9m}^{n/200}\left(\frac{s}{n}\cdot 16e^{2}\right)^{s/2}=O(n^{-1}),
\end{aligned}

so property (a) holds.

To see that property (b) holds too, note that

\begin{aligned}
\mathbb{P}(\neg(9)) &\leq\sum_{s=n/200}^{n/2}\sum_{i=0}^{2m+2}\sum_{j=0}^{i}\binom{n}{s}\binom{2s}{j}\left(\frac{s}{n}\right)^{2s-j}\left(1-\frac{s}{n}\right)^{j}\binom{2(n-s)}{i-j}\left(\frac{s}{n}\right)^{i-j}\left(1-\frac{s}{n}\right)^{2(n-s)-i+j}\\
&=\sum_{s=n/200}^{n/2}\sum_{i=0}^{2m+2}\sum_{j=0}^{i}\binom{n}{s}\binom{2s}{j}\binom{2(n-s)}{i-j}\left(\frac{s}{n}\right)^{2s+i-2j}\left(1-\frac{s}{n}\right)^{2(n-s)-i+2j}\\
&=O(m^{2})\sum_{s=\sigma n=n/200}^{n/2}\left(\frac{1}{\sigma^{\sigma}(1-\sigma)^{1-\sigma}}\right)^{n}\binom{2n}{2m+2}^{2}\sigma^{2\sigma n-(2m+2)}(1-\sigma)^{2(1-\sigma)n-(2m+2)}\\
&=O(m^{2})\sum_{s=\sigma n=n/200}^{n/2}(\sigma^{\sigma}(1-\sigma)^{1-\sigma})^{n}\binom{2n}{2m+2}^{2}\left(\sigma(1-\sigma)\right)^{-(2m+2)}\\
&=O(m^{2}n)\ c^{n}\exp\left(O\left(\frac{n(\log\log n)^{2}}{\log n}\right)\right)=c^{n}e^{o(n)}=O(n^{-1}),
\end{aligned}

where $c=(1/200)^{1/200}(199/200)^{199/200}<0.97$ . (In the above computation, $i$ corresponds to $e(S,[n]\setminus S)$ and $j$ corresponds to $e^{+}(S)$ .) ∎

The lemma implies that we may assume that

n_{1}\geq n-9m\text{ in (7)};

(10)

otherwise, the number of edges joining distinct components in $G_{\bar{M}}$ would be greater than $2m+2$ . Hence, we may rewrite (7) as follows:

\mathbb{P}({\mathcal{C}}_{m})\leq\frac{\binom{n}{m}}{\binom{2n}{2m}}\frac{n!}{n^{2n-2m}}\sum_{p=m+2}^{2m+3}\sum_{s=p-1}^{9m}\sum_{\begin{subarray}{c}n_{1}+\cdots+n_{p}=n\\ n-s=n_{1}\geq n_{2}\geq\cdots\geq n_{p}\geq 1\\ \Delta_{1}+\cdots+\Delta_{p}=2m\end{subarray}}\frac{1}{\Psi(n_{1},\ldots,n_{p})}\prod_{i=1}^{p}\frac{n_{i}^{2n_{i}-\Delta_{i}}}{n_{i}!}\binom{2n_{i}}{\Delta_{i}}.

Define

f(a,x)=a^{2a-x}\binom{2a}{x}.

Note that if $x\geq 1$ , then

\frac{f(a,x)}{f(a,x-1)}=\frac{2a-x+1}{ax}\leq\frac{2}{x},

and so $f(a,x)\leq\frac{2^{x}}{x!}f(a,0)$ . Using this observation, we get that
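
The ratio identity and the resulting bound on $f$ can be checked exactly for small values. The sketch below is an illustration only (the helper names `f` and `check` are ours); it uses exact rational arithmetic so that no rounding is involved.

```python
from fractions import Fraction
from math import comb, factorial


def f(a, x):
    """f(a, x) = a^(2a - x) * C(2a, x), kept exact as a Fraction."""
    return Fraction(a) ** (2 * a - x) * comb(2 * a, x)


def check(a):
    """Verify the ratio identity and the bound f(a,x) <= 2^x/x! * f(a,0) for 1 <= x <= 2a."""
    for x in range(1, 2 * a + 1):
        ratio = f(a, x) / f(a, x - 1)
        assert ratio == Fraction(2 * a - x + 1, a * x)
        assert ratio <= Fraction(2, x)
        assert f(a, x) <= Fraction(2**x, factorial(x)) * f(a, 0)
    return True


print(all(check(a) for a in range(1, 30)))
```

The bound on $f(a,x)$ follows by multiplying the ratio bounds $2/1,2/2,\ldots,2/x$, which is what the last assertion in `check` confirms.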

\begin{aligned}
&\sum_{\Delta_{1}+\cdots+\Delta_{p}=2m}\prod_{i=1}^{p}n_{i}^{2n_{i}-\Delta_{i}}\binom{2n_{i}}{\Delta_{i}}\\
&\leq\sum_{\Delta_{1},\ldots,\Delta_{p}\leq 2m}\prod_{i=1}^{p}n_{i}^{2n_{i}-\Delta_{i}}\binom{2n_{i}}{\Delta_{i}}\\
&=\sum_{\Delta_{1},\ldots,\Delta_{p-1}\leq 2m}\left(\prod_{i=1}^{p-1}n_{i}^{2n_{i}-\Delta_{i}}\binom{2n_{i}}{\Delta_{i}}\right)\sum_{\Delta_{p}\leq 2m}n_{p}^{2n_{p}-\Delta_{p}}\binom{2n_{p}}{\Delta_{p}}\\
&\leq\sum_{\Delta_{1},\ldots,\Delta_{p-1}\leq 2m}\left(\prod_{i=1}^{p-1}n_{i}^{2n_{i}-\Delta_{i}}\binom{2n_{i}}{\Delta_{i}}\right)n_{p}^{2n_{p}-0}\binom{2n_{p}}{0}\left(1+\frac{2}{1}+\frac{2}{1}\cdot\frac{2}{2}+\frac{2}{1}\cdot\frac{2}{2}\cdot\frac{2}{3}+\cdots\right)\\
&\leq\sum_{\Delta_{1},\ldots,\Delta_{p-1}\leq 2m}\left(\prod_{i=1}^{p-1}n_{i}^{2n_{i}-\Delta_{i}}\binom{2n_{i}}{\Delta_{i}}\right)n_{p}^{2n_{p}}\sum_{k\geq 0}\frac{2^{k}}{k!}\\
&=e^{2}n_{p}^{2n_{p}}\sum_{\Delta_{1},\ldots,\Delta_{p-1}\leq 2m}\prod_{i=1}^{p-1}n_{i}^{2n_{i}-\Delta_{i}}\binom{2n_{i}}{\Delta_{i}}\\
&\leq\ldots\leq(e^{2})^{p}\prod_{i=1}^{p}n_{i}^{2n_{i}}.
\end{aligned}

Unfortunately, the constant

e^{2}=\sum_{k\geq 0}\frac{2^{k}}{k!}

associated with the sum over all possible values of $\Delta_{i}$ is too large for the final argument to follow. Fortunately, any constant smaller than $e^{2}$ would work. We may squeeze a bit more by using the following observation. Since $\sum_{i=2}^{p}n_{i}=s\leq 9m$ (see (10)) and $p-1\geq m+1$ , there are at most $p/2$ values of $n_{i}$ , $i\geq 2$ , that are at least $18$ ( $n_{1}\sim n$ certainly is at least 18); the remaining ones are at most 17. It is important to notice that the sequence of $n_{i}$ ’s is non-increasing so we conclude that at least the last $p/2-1$ values of $n_{i}$ are at most 17. As a result, since $\Delta_{i}\leq n_{i}+1$ (see (6)), the corresponding values of $\Delta_{i}$ ’s are at most 18 (we will refer to them as small). The contribution from small $\Delta_{i}$ ’s is $A$ , where

A=\sum_{k\leq 18}\frac{2^{k}}{k!}\leq e^{2}-10^{-12}.

We get that

\begin{aligned}
\sum_{\Delta_{1}+\cdots+\Delta_{p}=2m}\prod_{i=1}^{p}n_{i}^{2n_{i}-\Delta_{i}}\binom{2n_{i}}{\Delta_{i}} &\leq(e^{2})^{p/2+1}A^{p/2-1}\prod_{i=1}^{p}n_{i}^{2n_{i}}\\
&=\frac{e^{2}}{A}(e^{2}A)^{p/2}\prod_{i=1}^{p}n_{i}^{2n_{i}}\leq 2B^{p}\prod_{i=1}^{p}n_{i}^{2n_{i}},
\end{aligned}

where

B=e\sqrt{A}\leq e^{2}-10^{-12}.
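
Since the margins here are tiny, it may be reassuring to confirm the constants with high-precision arithmetic. The sketch below is an illustration only: it evaluates $A$ and $B$ with 60 significant digits using Python's `decimal` module, and separately bounds the tail $e^{2}-A=\sum_{k\geq 19}2^{k}/k!$ from below by its first term using exact fractions. The variable names are ours.

```python
from decimal import Decimal, getcontext
from fractions import Fraction
from math import factorial

getcontext().prec = 60  # working precision far beyond the 1e-12 margin

# Exact lower bound on the tail: e^2 - A = sum_{k>=19} 2^k/k! >= 2^19/19!.
first_tail_term = Fraction(2**19, factorial(19))

e_squared = Decimal(2).exp()
A = sum(Decimal(2**k) / Decimal(factorial(k)) for k in range(19))  # k = 0..18
B = Decimal(1).exp() * A.sqrt()                                    # B = e * sqrt(A)
gap = Decimal(10) ** -12

print(float(first_tail_term))
print(e_squared - A)
print(e_squared - B)
print(B / e_squared)
```

Both $e^{2}-A$ and $e^{2}-B$ should come out larger than $10^{-12}$, and $B/e^{2}$ should be smaller than $1-10^{-13}$, which is the form in which the constant is used at the end of the argument.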

It follows that

\mathbb{P}({\mathcal{C}}_{m})\leq 2\frac{\binom{n}{m}}{\binom{2n}{2m}}\frac{n!}{n^{2n-2m}}\sum_{p=m+2}^{2m+3}B^{p}\sum_{s=p-1}^{9m}\sum_{\begin{subarray}{c}n_{1}+\cdots+n_{p}=n\\ n-s=n_{1}\geq n_{2}\geq\cdots\geq n_{p}\geq 1\end{subarray}}\frac{1}{\Psi(n_{1},\ldots,n_{p})}\prod_{i=1}^{p}\frac{n_{i}^{2n_{i}}}{n_{i}!}.

Now, for a given sequence $n_{1},\ldots,n_{p}$ and an integer $i$ , $2\leq i\leq p$ , let us define

g(a,b)=\frac{a^{2a}b^{2b}}{a!b!\Psi(a,n_{2},\ldots,n_{i-1},b,n_{i+1},\ldots,n_{p})}

and suppose that $n\sim a\gg 9m\geq b>1$ . Then, since $(1+1/x)^{x}$ is an increasing function of $x$ (tending to $e$ but we do not need this fact) and

\frac{\Psi(a,n_{2},\ldots,n_{i-1},b,n_{i+1},\ldots,n_{p})}{\Psi(a+1,n_{2},\ldots,n_{i-1},b-1,n_{i+1},\ldots,n_{p})}\leq 2,

we get that

\begin{aligned}
\frac{g(a,b)}{g(a+1,b-1)} &\leq\frac{a^{2a}}{(a+1)^{2a+2}}\cdot\frac{b^{2b}}{(b-1)^{2b-2}}\cdot\frac{a+1}{b}\cdot 2\\
&=\frac{\left(1+\frac{1}{b-1}\right)^{2(b-1)}}{\left(1+\frac{1}{a}\right)^{2a}}\cdot\frac{b}{a+1}\cdot 2\leq\frac{2b}{a}\leq\frac{20m}{n}.
\end{aligned}

It implies that the terms corresponding to sequences of $n_{i}$ ’s with larger values of $n_{1}$ (smaller values of $s$ in our bound) are much larger. On the other hand, there are more sequences with smaller values of $n_{1}$ (larger values of $s$ ) to consider. However, since $n_{2}+\cdots+n_{p}=s$ and $n_{i}\geq 1$ , there are only $\binom{s-1}{p-2}$ choices for the sequence of $n_{i}$ ’s to consider. Combining the two observations together, we get that the term corresponding to $s=p-1$ (and the associated unique sequence of $n_{i}$ ’s) is a dominating term:
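
The count $\binom{s-1}{p-2}$ is the number of ordered ways to write $s$ as a sum of $p-1$ positive integers; since the sequences $n_{2}\geq\cdots\geq n_{p}$ considered here are non-increasing, it serves as an upper bound. The following brute-force sketch (an illustration only, with a hypothetical helper name) confirms the composition count on small cases, writing `parts` for $p-1$.

```python
from itertools import product
from math import comb


def count_compositions(total, parts):
    """Brute-force count of ordered tuples of `parts` positive integers summing to `total`."""
    return sum(
        1
        for candidate in product(range(1, total + 1), repeat=parts)
        if sum(candidate) == total
    )


for total in range(1, 7):
    for parts in range(1, total + 1):
        # parts = p - 1, so comb(total - 1, parts - 1) is binom(s-1, p-2) with s = total.
        print(total, parts, count_compositions(total, parts), comb(total - 1, parts - 1))
```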

\begin{aligned}
\mathbb{P}({\mathcal{C}}_{m}) &\leq 2\frac{\binom{n}{m}}{\binom{2n}{2m}}\frac{n!}{n^{2n-2m}}\sum_{p=m+2}^{2m+3}B^{p}\frac{(n-p+1)^{2(n-p+1)}}{(n-p+1)!}\frac{1}{\Psi(n-p+1,1,\ldots,1)}\\
&\qquad\cdot\left(1+\sum_{s=p}^{9m}\binom{s-1}{p-2}\left(\frac{20m}{n}\right)^{s-(p-1)}\right)\\
&\leq 3\frac{\binom{n}{m}}{\binom{2n}{2m}}\frac{n!}{n^{2n-2m}}\sum_{p=m+2}^{2m+3}B^{p}\frac{(n-p+1)^{2(n-p+1)}}{(n-p+1)!(p-1)!}.
\end{aligned}

Note that the ratio between the $(p+1)$ st term and the $p$ th one is

\begin{aligned}
&B\cdot\frac{(n-p)^{2(n-p)}}{(n-p+1)^{2(n-p+1)}}\cdot\frac{(n-p+1)!}{(n-p)!}\cdot\frac{p-1}{p}\\
&=B\cdot\left(1-\frac{1}{n-p+1}\right)^{2(n-p+1)}\cdot\frac{n-p+1}{(n-p)^{2}}\cdot\frac{p-1}{p}=\Theta(1/n).
\end{aligned}

Hence,

\mathbb{P}({\mathcal{C}}_{m})\leq 4\frac{\binom{n}{m}}{\binom{2n}{2m}}\frac{n!B^{m+2}}{n^{2n-2m}}\frac{(n-m-1)^{2(n-m-1)}}{(n-m-1)!(m+1)!}.

(11)

If $m$ is a constant, then $\mathbb{P}({\mathcal{C}}_{m})=O(1/n)$ . In order to investigate larger values of $m$ , note that the ratio between the $(m+1)$ st term and the $m$ th one is

\begin{aligned}
&\frac{\binom{n}{m+1}}{\binom{n}{m}}\cdot\frac{\binom{2n}{2m}}{\binom{2n}{2m+2}}\cdot Bn^{2}\cdot\frac{(n-m-2)^{2(n-m-2)}}{(n-m-1)^{2(n-m-1)}}\cdot\frac{n-m-1}{m+2}\\
&=\frac{n-m}{m+1}\cdot\frac{(2m+1)(2m+2)}{(2n-2m)(2n-2m-1)}\cdot Bn^{2}\cdot\left(1-\frac{1}{n-m-1}\right)^{2(n-m-1)}\\
&\qquad\cdot\frac{n-m-1}{(n-m-2)^{2}}\cdot\frac{1}{m+2}\to(B)(e^{-2})<1-10^{-13}\qquad\text{ as }m,n\to\infty.
\end{aligned}

Hence, if $m$ is sufficiently large (say, $m\geq m^{\prime}$ ) the ratio is at most, say, $1-10^{-14}$ . Combining the two observations together, we get that

\sum_{1\leq m\leq m_{0}}\mathbb{P}({\mathcal{C}}_{m})=\sum_{1\leq m<m^{\prime}}\mathbb{P}({\mathcal{C}}_{m})+\sum_{m^{\prime}\leq m\leq m_{0}}\mathbb{P}({\mathcal{C}}_{m})=O\Big{(}\mathbb{P}({\mathcal{C}}_{1})+\mathbb{P}({\mathcal{C}}_{m^{\prime}})\Big{)}=O\left(\frac{1}{n}\right)=o(1).

That finishes the proof of Theorem 1.

4 Matchings and Hamilton Cycles

In this paper, we dealt with rainbow spanning trees proving the strongest possible result, both in terms of $q$ , the number of colours, and $k$ , the degree of the associated random graph. We leave investigating other rainbow structures for future research.

Recall that it was shown by Frieze [11] that $G_{2-out}$ has a perfect matching w.h.p., and by Bohman and Frieze [3] that $G_{3-out}$ is Hamiltonian w.h.p. Both results are sharp. Hence, based on our observation in Section 2.1, it is natural to investigate the following questions.

• What is the smallest value of $q$ such that $G_{2,q}$ has a Rainbow Perfect Matching (RPM) w.h.p.? (Trivially, $q\geq n/2$ and $q\leq 2n$ as $G_{2,2n}$ is rainbow.)
• What is the smallest value of $q$ such that $G_{3,q}$ has a Rainbow Hamilton Cycle (RHC) w.h.p.? (Trivially, $q\geq n$ and $q\leq 3n$ as $G_{3,3n}$ is rainbow.)

References

[1] D. Bal, P. Bennett, A. Frieze, and P. Prałat, Power of $k$ choices and rainbow spanning trees in random graphs, Electronic Journal of Combinatorics 22(1) (2015), #P1.29.
[2] D. Bal, P. Bennett, X. Pérez-Giménez, and P. Prałat, Rainbow perfect matchings and Hamilton cycles in the random geometric graph, Random Structures and Algorithms 51(4) (2017), 587–606.
[3] T. Bohman and A.M. Frieze, Hamilton cycles in 3-out, Random Structures and Algorithms 35 (2009), 393–417.
[4] B. Bollobás, A probabilistic proof of an asymptotic formula for the number of labelled graphs, European Journal of Combinatorics 1 (1980), 311–316.
[5] B. Bollobás, Random graphs, Academic Press, 1985.
[6] C. Cooper and A.M. Frieze, Multi-coloured Hamilton cycles in random edge-coloured graphs, Combinatorics, Probability and Computing 11 (2002), 129–134.
[7] J. Edmonds, Submodular functions, matroids and certain polyhedra, in Combinatorial Structures and their Applications, R.Guy et al, eds., Gordon and Breach, 1970, 69–87.
[8] T.I. Fenner and A.M. Frieze, On the connectivity of random $m$ -orientable graphs and digraphs, Combinatorica 2 (1982), 347–359.
[9] T.I. Fenner and A.M. Frieze, On the existence of polychromatic sets of edges in graphs and digraphs, Progress in Graph Theory, J.A. Bondy and U.S.R. Murty, eds., Academic Press, 1984, 219–232.
[10] A. Ferber and M. Krivelevich, Rainbow Hamilton cycles in random graphs and hypergraphs, Recent trends in combinatorics, IMA Volumes in Mathematics and its applications, A Beveridge et al, Eds., Springer 2016, 167–189.
[11] A.M. Frieze, Maximum matchings in a class of random graphs, Journal of Combinatorial Theory B 40 (1986), 196–212.
[12] A.M. Frieze and M. Karoński, Introduction to Random Graphs, Cambridge University Press, 2015.
[13] A.M. Frieze and P. Loh, Rainbow Hamilton cycles in random graphs, Random Structures and Algorithms 44 (2014), 328–354.
[14] A. Frieze and B.D. McKay, Multicoloured trees in random graphs, Random Structures and Algorithms 5 (1994), 45–56.
[15] S. Janson and N. Wormald, Rainbow Hamilton cycles in random regular graphs, Random Structures and Algorithms 30(1-2) (2007), 35–49.
[16] J. Oxley, Matroid Theory, Oxford: Oxford University Press, 1992.
[17] K. Suzuki, A necessary and sufficient condition for the existence of a heterochromatic spanning tree in a graph, Graphs and Combinatorics 22 (2006), 261–269.
[18] N.C. Wormald, Models of random regular graphs, in J.D. Lamb and D.A. Preece, eds., Surveys in Combinatorics, 1999, volume 267 of London Mathematical Society Lecture Note Series, Cambridge University Press, 1999, 239–298.
