This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Square percolation and the threshold for quadratic divergence in random right-angled Coxeter groups

Jason Behrstock Lehman College and The Graduate Center, CUNY, New York, New York, USA [email protected] Victor Falgas-Ravry Umeå Universitet, Umeå, Sweden [email protected]  and  Tim Susse Bard College at Simon’s Rock, Great Barrington, Mass., USA [email protected]
Abstract.

Given a graph Γ\Gamma, its auxiliary square-graph (Γ)\square(\Gamma) is the graph whose vertices are the non-edges of Γ\Gamma and whose edges are the pairs of non-edges which induce a square (i.e., a 44-cycle) in Γ\Gamma. We determine the threshold edge-probability p=pc(n)p=p_{c}(n) at which the Erdős–Rényi random graph Γ=Γn,p\Gamma=\Gamma_{n,p} begins to asymptotically almost surely have a square-graph with a connected component whose squares together cover all the vertices of Γn,p\Gamma_{n,p}. We show pc(n)=62/np_{c}(n)=\sqrt{\sqrt{6}-2}/\sqrt{n}, a polylogarithmic improvement on earlier bounds on pc(n)p_{c}(n) due to Hagen and the authors. As a corollary, we determine the threshold p=pc(n)p=p_{c}(n) at which the random right-angled Coxeter group WΓn,pW_{\Gamma_{n,p}} asymptotically almost surely becomes strongly algebraically thick of order 11 and has quadratic divergence.

2010 Mathematics Subject Classification:
05C80, 20F65, 57M15, 60B99, 20F55, 20F69

1. Introduction

In this paper we investigate the phase transition for a variant of “square percolation”, with motivation coming from both previous work on clique percolation and from questions in geometric group theory.

Clique percolation was introduced by Derényi, Palla and Vicsek [9] as a simple model for community detection, and quickly became well-studied in network science, from computational, empirical, and theoretical perspectives, see e.g. [7, 9, 18, 20, 21, 22]. In (k,)(k,\ell)–clique percolation, to investigate the “community structure” of a graph or network Γ\Gamma one studies the auxiliary (k,)(k,\ell)-clique graph whose vertices are the kk–cliques of Γ\Gamma and whose edges are those pairs of kk–cliques having at least \ell vertices in common.

One of the main research questions in the area was determining the threshold p=p(n)p=p(n) for the emergence of a “giant component” in the auxiliary (k,)(k,\ell)-clique graph when the original graph Γ𝒢(n,p)\Gamma\in\mathcal{G}(n,p) is an Erdős–Rényi random graph on nn vertices with edge-probability pp. This was completely resolved in 2009 by Bollobás and Riordan [7], in a highly impressive paper making sophisticated use of branching processes. In the concluding remarks of their paper, Bollobás and Riordan suggested a study of “square percolation” as a natural extension of their work. More precisely, given a graph Γ\Gamma they suggested studying the component structure of the auxiliary graph whose vertices are the not necessarily induced 44-cycles in Γ\Gamma, and whose edges are pairs of 44–cycles with a diagonal111Note that given a 44–cycle in a graph, we use the term “diagonal” to refer to the pair of vertices of a diagonal, even though they they may not span an edge in the graph; indeed most of this paper concerns induced 44–cycles so that the edge spanned by the diagonal is not in the graph. in common. For Γ𝒢(n,p)\Gamma\in\mathcal{G}(n,p), they stated that they believed the threshold for the associated auxiliary graph to contain a giant component containing a positive proportion of all squares of Γ\Gamma should be λc/n\lambda_{c}/\sqrt{n}, where λc=62\lambda_{c}=\sqrt{\sqrt{6}-2} (see the discussion around equation (19) in Section 2.3 of [7]).

A related (but slightly different) notion of “square percolation” arose independently in joint work of the authors with Hagen [5] on the divergence of the random right-angled Coxeter group, providing motivation from geometric group theory for understanding the phase transition in an auxiliary graph formed from the induced 44-cycles of an Erdős–Rényi random graph Γ𝒢(n,p)\Gamma\in\mathcal{G}(n,p). To make this more precise, we make the following definition.

Definition 1.1.

To any graph Γ\Gamma, we associate an auxiliary square-graph, (Γ)\square\left({\Gamma}\right), whose vertices are the non-edges of Γ\Gamma, and whose edges are the pairs of non-edges of Γ\Gamma that together induce a 44–cycle (a.k.a. square) in Γ\Gamma.

Thus for vertices a,b,c,da,b,c,d in a graph Γ\Gamma, the pair {ac,bd}\{ac,bd\} is an edge of (Γ)\square(\Gamma) if and only if (i) acac and bdbd are non-edges of Γ\Gamma (and thus vertices of (Γ)\square(\Gamma)), and (ii) abab, bcbc, cdcd and dada are all edges of Γ\Gamma.

Remark 1.2.

This definition of the auxiliary square-graph (Γ)\square(\Gamma) differs slightly from the one used in the related papers [5, 8]. In those papers, the auxiliary graph had the induced 44-cycles as its vertices, and its edges were those pairs of induced 44-cycles having a diagonal in common. These two variants of auxiliary square-graphs encode essentially the same information, but the formulation above is more natural from a combinatorial perspective and more convenient for the exploration processes we shall consider in this paper.

We investigate the component structure of (Γ)\square(\Gamma), albeit with an unusual twist. With a view to applications in geometric group theory, we will be interested in the question of whether or not (Γ)\square(\Gamma) has a component that “covers” all of the vertex-set of the original graph Γ\Gamma.

Definition 1.3.

We refer to connected components of (Γ)\square\left({\Gamma}\right) as square-components of Γ\Gamma. Given a square-component CC we define its support to be the collection of vertices of Γ\Gamma given by:

supp(C)=vwC{v,w},\operatorname{\mathrm{supp}}(C)=\bigcup_{vw\in C}\{v,w\},

and say that the component CC of (Γ)\square\left({\Gamma}\right) covers the vertex set supp(C)V(Γ)\operatorname{\mathrm{supp}}(C)\subseteq V(\Gamma). If CC covers all of V(Γ)V(\Gamma), we say it is a square-component with full support

Write Γ𝒢(n,p)\Gamma\in\mathcal{G}(n,p) to denote that Γ\Gamma is an instance of the Erdős–Rényi random graph model with parameters nn and pp, i.e., that Γ\Gamma is a graph on nn vertices obtained by including each edge at random with probability pp, independently of all the others.

Our main combinatorial result in this paper is pinpointing the precise threshold pc(n)p_{c}(n) at which Γ𝒢(n,p)\Gamma\in\mathcal{G}(n,p) asymptotically almost surely222As usual, asymptotically almost surely or a.a.s. is shorthand for “with probability tending to 11 as nn\rightarrow\infty.” experiences a phase transition from having only square-components with support of logarithmic order to having a square-component with full support. Throughout this paper we set λc=62\lambda_{c}=\sqrt{\sqrt{6}-2}. The following two results establish that the critical threshold probability is p(n)=λcn1/2p(n)=\lambda_{c}n^{-1/2} by showing highly disparate behavior on either side of this threshold as given by the following two contrasting results.

Theorem 1.4 (Subcritical Behavior).

Let λ<λc\lambda<\lambda_{c} be fixed. Suppose that p(n)λn1/2p(n)\leq\lambda n^{-1/2}. Then for Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p), a.a.s. every square-component of Γ\Gamma covers at most O((logn)232)O((\log n)^{2^{32}}) vertices.

Theorem 1.5 (Supercritical Behavior).

Let λ>λc\lambda>\lambda_{c} be fixed, and let f:+f\colon\mathbb{N}\rightarrow\mathbb{R}^{+} be a function with f(n)0f(n)\rightarrow 0 and f(n)n2f(n)n^{2}\rightarrow\infty as nn\rightarrow\infty. Let p=p(n)p=p(n) be an edge-probability with

λn1/2p(n)1f(n).\lambda n^{-1/2}\leq p(n)\leq 1-f(n).

Then for Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p), a.a.s. there is a square-component of Γ\Gamma covering all vertices of Γ\Gamma.

Our proofs of Theorems 1.4 and 1.5 confirm the conjecture of Bollobás and Riordan regarding the location of the phase transition for their version of (non-induced) square percolation. Further, Theorems 1.4 and 1.5 have a direct application to the study of the geometric properties of random right-angled Coxeter group, which we now describe.

Given a graph Γ\Gamma, we define the associated right-angled Coxeter group (RACG) WΓW_{\Gamma} by taking the free group on V(Γ)V(\Gamma) and adding the relations a2=1a^{2}=1 and ab=baab=ba for all aV(Γ)a\in V(\Gamma), abE(Γ)ab\in E(\Gamma). In this way, the graph Γ\Gamma encodes a finite presentation for the right-angled Coxeter group WΓW_{\Gamma}. Given graphs Γ\Gamma and Λ\Lambda, it is well-known that the associated groups WΓW_{\Gamma} and WΛW_{\Lambda} are isomorphic if and only if the graphs Γ\Gamma and Λ\Lambda are isomorphic, see [19]. Thus algebraic and geometric properties of WΓW_{\Gamma} can be studied via purely graph-theoretic means, as we do in this paper. Indeed, a number of geometric properties of a right-angled Coxeter group WΓW_{\Gamma} admit encodings as graph-theoretic properties of the presentation graph Γ\Gamma. Such properties include thickness and having quadratic divergence, which are both important in geometric group theory (see Section 3 below for a formal definition of these notions). An investigation of right-angled Coxeter groups with quadratic divergence was the main motivation for the work undertaken in this paper.

The correspondence between right-angled Coxeter groups and graphs allows one to define models of random groups based on random graph models. In particular, in this paper we consider the random right-angled Coxeter group, WΓW_{\Gamma} where the presentation graph Γ𝒢n,p\Gamma\in\mathcal{G}_{n,p} is an Erdős–Rényi random graph. Using Theorems 1.4 and 1.5 above on square-components in Erdős–Rényi random graphs, we prove the following.

Theorem 1.6 (Criticality for quadratic divergence of RACGs).

Let ϵ>0\epsilon>0. If

λc+ϵnp(n)1(1+ϵ)lognn\frac{\lambda_{c}+\epsilon}{\sqrt{n}}\leq p(n)\leq 1-\frac{(1+\epsilon)\log{n}}{n}

and Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p). Then, a.a.s. the right-angled Coxeter group WΓW_{\Gamma} has quadratic divergence and is strongly algebraically thick of order exactly 11.

On the other hand, if p(n)p(n) satisfies

0p(n)λϵn0\leq p(n)\leq\frac{\lambda-\epsilon}{\sqrt{n}}

then the right-angled Coxeter group WΓW_{\Gamma} a.a.s has at least cubic divergence and is not strongly algebraically thick of order 0 or 11.

The geometric properties of WΓW_{\Gamma} when Γ𝒢n,p\Gamma\in{\mathcal{G}}_{n,p} and p=1θ(n2)p=1-\theta(n^{-2}) was described in detail by Behrstock, Hagen and Sisto in [6, Theorem V]. Together with their work, our results give an essentially complete picture of quadratic and linear divergence in random right-angled Coxeter groups.

Organization of the paper

In Section 3 we provide additional background material on the geometry of random groups and derive Theorem 1.6 from Theorems 1.41.5. In Section 4, we recall some basic facts about branching processes and give an outline of the proof strategy we follow for our main results, and of the ways in which it departs from the framework used by Bolloás and Riordan in their study of clique percolation in random graphs. Theorem 1.4 is proved in Section 5, while Theorem 1.5 is derived in Section 6. We end the paper in Section 7 with some discussion of the results and of further work and related problems.

Acknowledgments

The authors thank Mark Hagen for discussions during the early stages of this work. Behrstock was supported by NSF grant DMS-1710890. Falgas-Ravry was supported by Swedish Research Council grant VR 2016-03488. The authors thank Ela Behrstock for her skillful rendering of Figures 2 and 2. Aspects of this work were motivated by output from software written by the authors (available upon request from the authors, some available online at http://comet.lehman.cuny.edu/behrstock/random.html). Accordingly, this research was supported, in part, by a grant of computer time from the City University of New York High Performance Computing Center under NSF Grants CNS-0855217, CNS-0958379, and ACI-1126113.

2. Graph-theoretic notation and standard notions

Given a set AA and rr\in\mathbb{N}, let A(r)A^{(r)} denote the collection of all subsets of AA of cardinality rr. So for example A(2)A^{(2)} is the collection unordered distinct pairs of elements of AA. As a notational convenience, we set [n]:={1,2,,n}[n]:=\{1,2,\ldots,n\}, and we often denote the unordered set {u,v}\{u,v\} by uvuv.

A graph is a pair Γ=(V,E)\Gamma=(V,E), where V=V(Γ)V=V(\Gamma) is a set of vertices and E=E(Γ)E=E(\Gamma) is a collection of pairs of vertices referred to as the edges of Γ\Gamma. A subgraph of Γ\Gamma is a graph GG with V(G)V(Γ)V(G)\subseteq V(\Gamma) and E(G)E(Γ)E(G)\subseteq E(\Gamma). If V(G)=XV(G)=X and E(G)=V(Γ)X(2)E(G)=V(\Gamma)\cap X^{(2)}, then we say GG is the subgraph of Γ\Gamma induced by XX and denote this fact by G=Γ[X]G=\Gamma[X]. When there is no risk of confusion, we may abuse notation and use XX to refer to both the subset of V(Γ)V(\Gamma) and the associated induced subgraph Γ[X]\Gamma[X]. The complement of a graph Γ=(V,E)\Gamma=(V,E) is the graph Γc=(V,V(2)E)\Gamma^{c}=(V,V^{(2)}\setminus E).

A path of length \ell in a graph Γ\Gamma is an ordered sequence of +1\ell+1 distinct vertices v0,v1,,vv_{0},v_{1},\ldots,v_{\ell} together with a set of \ell edges {vi1vi:i[l]}E(Γ)\{v_{i-1}v_{i}:\ i\in[l]\}\subseteq E(\Gamma). Such a path is said to join v0v_{0} to vv_{\ell}. Two vertices are said to be connected in Γ\Gamma if there is a path joining them. Being connected is an equivalence relation on the vertices of Γ\Gamma. A (connected) component of Γ\Gamma is then a nonempty set of vertices from V(Γ)V(\Gamma) that forms an equivalence class under this relation.

In this paper we study squares in graphs. An square, or 44–cycle, in Γ\Gamma is a copy of the graph C4=({a,b,c,d},{ab,bc,cd,da})C_{4}=(\{a,b,c,d\},\{ab,bc,cd,da\}) as an subgraph of Γ\Gamma. In an abuse of notation, we will denote such a C4C_{4} by abcdabcd. In other words, if we say “abcdabcd is a copy of C4C_{4}/a square in Γ\Gamma”, we mean “ab,bc,cd,daE(Γ)ab,bc,cd,da\in E(\Gamma)”. Further if we say “abcdabcd is an induced C4C_{4}/square in Γ\Gamma”, we mean that abcdabcd is a square in Γ\Gamma and that in addition ac,bdE(Γ)ac,bd\notin E(\Gamma). A useful notion for studying squares in graphs is that of a link graph: given a vertex xV(Γ)x\in V(\Gamma), the link graph Γx\Gamma_{x} of xx is the collection of neighbors of xx in Γ\Gamma, i.e., Γx={yV(Γ):xyE(Γ)}\Gamma_{x}=\{y\in V(\Gamma):\ xy\in E(\Gamma)\}.

By Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p) we mean that Γ\Gamma is a random graph on the vertex set [n][n] obtained by including each edge uvuv in E(Γ)E(\Gamma) with probability pp, independently of all the others. This is known as the Erdős–Rényi random graph model. Given a sequence of edge probabilities p=p(n)p=p(n) and a graph property 𝒫\mathcal{P}, we say that a typical instance of Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p) has property 𝒫\mathcal{P}, or, equivalently, that Γ𝒫\Gamma\in\mathcal{P} holds asymptotically almost surely (a.a.s.) if

limn(Γ𝒫)=1.\lim_{n\rightarrow\infty}\mathbb{P}(\Gamma\in\mathcal{P})=1.

Throughout the paper, we use standard Landau notation: given functions f,g:+f,g\colon\mathbb{N}\rightarrow\mathbb{R}^{+}, we write f=o(g)f=o(g) for limnf(n)/g(n)=0\lim_{n\rightarrow\infty}f(n)/g(n)=0 and f=O(g)f=O(g) if there exists a constant C>0C>0 such that lim supnf(n)/g(n)C\limsup_{n\rightarrow\infty}f(n)/g(n)\leq C. Further we write f=ω(g)f=\omega(g) for g=o(f)g=o(f), f=Ω(g)f=\Omega(g) for g=O(f)g=O(f). Finally if f=O(g)f=O(g) and f=Ω(g)f=\Omega(g) both hold, we denote this fact by f=Θ(g)f=\Theta(g).

3. Geometric group theory and the 𝒞𝒮\mathcal{CFS} property

Our main result in this paper establishes that p(n)=λc/np(n)=\lambda_{c}/\sqrt{n} is the threshold for a typical instance Γ\Gamma of the Erdős–Rényi random graph model 𝒢(n,p)\mathcal{G}(n,p) to have a square-graph with a component covering all of V(Γ)V(\Gamma). This property is a.a.s. equivalent to possessing the 𝒞𝒮\mathcal{CFS}–property, defined below.

3.1. Background

Recall that the graph joint Γ1Γ2\Gamma_{1}\ast\Gamma_{2} of two graphs Γ1\Gamma_{1} and Γ2\Gamma_{2} is the graph obtained by taking disjoint unions of Γ1\Gamma_{1} and Γ2\Gamma_{2}, and adding in all edges from Γ1\Gamma_{1} to Γ2\Gamma_{2}.

Definition 3.1.

A finite graph Γ\Gamma is defined to be 𝒞𝒮\mathcal{CFS} (constructed from squares) if Γ\Gamma has induced subgraphs KK and Γ\Gamma^{\prime} with KK a (possibly empty) clique so that:

  • Γ=ΓK\Gamma=\Gamma^{\prime}\ast K, and

  • (Γ)\square(\Gamma^{\prime}) has a component CC with supp(C)=V(Γ)\operatorname{\mathrm{supp}}(C)=V(\Gamma^{\prime}).

Dani–Thomas were the first to introduce a special case of the 𝒞𝒮\mathcal{CFS} property for triangle-free graphs in [8]. The 𝒞𝒮\mathcal{CFS} property for arbitrary graphs was then studied by Hagen and the authors in [5], with an eye towards establishing when this property holds a.a.s. in random graphs, while in [15] Levcovitz studied the geometric properties of right-angled Coxeter groups whose presentation graphs do not possess the 𝒞𝒮\mathcal{CFS} property.

With Hagen, the authors determined in  [5] the threshold for the 𝒞𝒮\mathcal{CFS} property to hold a.a.s. in Erdős–Rényi random graphs up to a polylogarithmic factor.

Theorem 3.2 (Theorems 5.1 and 5.7 in [5]).

If p(n)(logn)1/np(n)\leq\left(\log n\right)^{-1}/\sqrt{n}, then a.a.s. a graph Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p) does not have the 𝒞𝒮\mathcal{CFS} property. On the other hand if p(n)5logn/np(n)\geq 5\sqrt{\log n}/\sqrt{n} and (1p)n2(1-p)n^{2}\to\infty, then a.a.s. Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p) does have the 𝒞𝒮\mathcal{CFS} property.

Our contribution in this paper is to eliminate the polylogarithmic gap in Theorem 3.2 and thus to determine the precise threshold for the 𝒞𝒮\mathcal{CFS} property in random graphs.

The 𝒞𝒮\mathcal{CFS} property is closely linked to the large scale geometry of right-angled Coxeter groups, connected to divergence and (strong algebraic) thickness. Divergence is a quasi-isometry invariant of groups introduced by Gersten [12] and further developed by Druţu, Mozes and Sapir [10], while thickness was introduced by Behrstock–Druţu–Mosher in [4] and then further refined by Behrstock–Druţu in [3]. We define these notions and explain how they are related below.

Definition 3.3.

Let (X,d)(X,d) be a geodesic metric space, let oXo\in X and let ρ(0,1]\rho\in(0,1]. Given x,yXx,y\in X with d(x,o)=d(y,o)=rd(x,o)=d(y,o)=r, we define dρr(x,y)d_{\rho r}(x,y) to be the infimum of the lengths of paths in XB(o,ρr)X\setminus B(o,\rho r) between xx and yy, if such a path exists, and \infty otherwise; here B(o,ρr)B(o,\rho r) denotes the ball of radius ρr\rho r about oo. We then set

δρ(r)=supoXsupx,ydρr(x,y).\delta_{\rho}(r)=\sup_{o\in X}\sup_{x,y}d_{\rho r}(x,y).

The divergence of XX is defined to be the collection of functions δρ:rδρ(r)\delta_{\rho}:\ r\mapsto\delta_{\rho}(r),

Div(X):={δρ:ρ(0,1]}.\text{Div}(X):=\{\delta_{\rho}:\rho\in(0,1]\}.

Given two non-decreasing functions f,g:+f,g\colon\mathbb{N}\to\mathbb{R}^{+}, we say that fgf\lesssim g if there exists C1C\geq 1 so that:

f(r)Cg(Cr+C)+Cr+C,f(r)\leq C\cdot g(Cr+C)+Cr+C,

and we say fgf\sim g if fgf\lesssim g and gfg\lesssim f. Importantly, two polynomials that are non decreasing +\mathbb{N}\to\mathbb{R}^{+} and have the same degree are equivalent under this relation, and further for a,ba,b\in\mathbb{N} we have xaxbx^{a}\sim x^{b} if and only if a=ba=b.

When XX is the Cayley graph of a right-angled Coxeter group, it is straightforward to see that δρ(r)δ1(r)\delta_{\rho}(r)\sim\delta_{1}(r). Therefore when we are referring to the divergence function of WΓW_{\Gamma}, we will mean δ1(r)\delta_{1}(r). We say that a RACG WΓW_{\Gamma} has quadratic divergence if δ1(r)r2\delta_{1}(r)\sim r^{2} and linear divergence if δ1(r)r\delta_{1}(r)\sim r.

Definition 3.4.

Let GG be a finitely generated group.

  • We say that GG is strongly algebraically thick of order 0 if it has linear divergence.

  • We say that GG is strongly algebraically thick of order at most nn if GG has a collection of subgroups ={Hα}\mathcal{H}=\{H_{\alpha}\} so that:

    • αHα\left\langle\bigcup_{\alpha}H_{\alpha}\right\rangle has finite index in GG

    • for Hα,HβH_{\alpha},H_{\beta}\in\mathcal{H} there exists a sequence H0=Hα,H1,Hk1,Hk=HβH_{0}=H_{\alpha},H_{1},\ldots H_{k-1},H_{k}=H_{\beta} of elements of \mathcal{H} so that Hi1HiH_{i-1}\cap H_{i} is infinite for each 1ik1\leq i\leq k

    • there exists a constant M>0M>0 so that each HαH_{\alpha}\in\mathcal{H} is MM–quasiconvex, that is to say, every pair of points in HαH_{\alpha} can be connected by an (M,M)(M,M)–quasigeodesic contained in Hα.H_{\alpha}.

    • each HαH_{\alpha}\in\mathcal{H} is strongly algebraically thick of order at most n1n-1.

Further, we say that GG is strongly algebraically thick of order exactly nn if it is strongly algebraically thick of order at most nn and not strongly algebraically thick of order at most n1n-1. We also usually write “thick” as a shorthand for “strongly algebraically thick”.

Behrstock and Druţu discovered that the order of thickness provides upper bounds on the divergence of a metric space. In particular they proved:

Proposition 3.5 ([3, Corollary 4.17]).

Let GG be a finitely generated group which is strongly algebraically thick of order at most nn. Then for every ρ(0,1]\rho\in(0,1], δρ(r)rn+1\delta_{\rho}(r)\lesssim r^{n+1}.

The group theoretic motivation for studying the 𝒞𝒮\mathcal{CFS} property is that it provides a graph theoretical proxy for certain geometric properties of right-angled Coxeter groups, such as their divergence. To see that WΓW_{\Gamma} has quadratic divergence when Γ\Gamma has the 𝒞𝒮\mathcal{CFS} property is straightforward, since interpreting the definition of 𝒞𝒮\mathcal{CFS} in the Cayley graph yields a chain of linearly many spaces with linear divergence with each intersecting the next in an infinite diameter set. Indeed, it is an immediate consequence of the definitions that if if GG is the direct product of two infinite groups then GG has linear divergence, just as a path avoiding a linear-sized ball in the plane has linear length. Hence every finitely generated abelian group of rank at least 22 has linear divergence (and is thick of order 0).

Now, if Γ=KΓ\Gamma=K\star\Gamma^{\prime} where KK is a clique, then WΓ2|K|×WΓW_{\Gamma}\cong\mathbb{Z}_{2}^{|K|}\times W_{\Gamma^{\prime}}. In such a case WΓW_{\Gamma^{\prime}} is a finite-index subgroup of WΓW_{\Gamma^{\prime}} and thus, up to finite index, we can assume that Γ\Gamma does not contain a vertex sending an edge to all other vertices of Γ\Gamma.

Now, WΓW_{\Gamma} contains a network of convex subgroups generated by the induced square in the full-support component of (Γ)\square(\Gamma). Each of these groups is virtually 2\mathbb{Z}^{2}, that is to say has a finite index subgroup which is a copy of 2\mathbb{Z}^{2}. Further, two induced squares in Γ\Gamma correspond to incident edges in (Γ)\square(\Gamma) if and only if the intersection of the associated virtual 2\mathbb{Z}^{2} subgroups is virtually \mathbb{Z}, that is to say has a finite index subgroup which is a copy of \mathbb{Z}.

Thus, paths in the full-support component of (Γ)\square(\Gamma) give the connecting sequences needed in Definition 3.4. Hence if Γ\Gamma has the 𝒞𝒮\mathcal{CFS} property, WΓW_{\Gamma} is thick of order at most 11 and has at most quadratic divergence.

As shown by Dani–Thomas in the triangle-free case [8, Theorem 1.1 and Remark 4,8], and by the present authors with Hagen in the general case (as above) [5, Proposition 3.1], if Γ\Gamma has the 𝒞𝒮\mathcal{CFS} property then the associated right-angled Coxeter group WΓW_{\Gamma} has thickness of order at most 11, and hence has at most quadratic divergence. Further, in [6], Behrstock, Hagen and Sisto show that a right-angled Coxeter group WΓW_{\Gamma} has linear divergence (and is thick of order 0) if and only if Γ\Gamma is the join of two non-complete graphs. Finally Levcovitz proved that any graph without 𝒞𝒮\mathcal{CFS} has at least cubic divergence [15], and so we see that WΓW_{\Gamma} has exactly quadratic divergence if and only if Γ\Gamma is not the join of two non-complete graphs and has the 𝒞𝒮\mathcal{CFS} property.

3.2. Proof of threshold for quadratic divergence in random RACGs

Assuming our main theorems about square percolation, Theorems 1.4 and 1.5, we are now in a position to provide a proof of Theorem 1.6 on the threshold for quadratic divergence in RACGs:

Proof of Theorem 1.6 from Theorems 1.4 and 1.5.

Let ϵ>0\epsilon>0, and suppose that

λc+ϵnp(n)1(1+ϵ)lognn.\frac{\lambda_{c}+\epsilon}{\sqrt{n}}\leq p(n)\leq 1-\frac{(1+\epsilon)\log{n}}{n}.

By Theorem 1.5, the graph Γ\Gamma a.a.s. has the 𝒞𝒮\mathcal{CFS} property. Thus, by [5, Proposition 3.1], WΓW_{\Gamma} a.a.s. has at most quadratic divergence and is thick of order at most 1. Further, since 1p(n)(1+ϵ)lognn1-p(n)\geq\frac{(1+\epsilon)\log{n}}{n}, standard results on the connectivity of Erdős–Rényi random graphs tell us that a.a.s. the complement of Γ\Gamma is connected, and thus that Γ\Gamma itself is a.a.s.  not the join of two non-trivial graphs. Thus, [6] implies that WΓW_{\Gamma} is not thick of order 0 and hence is thick of order exactly one and has precisely quadratic divergence.

On the other hand, if

λcϵnp(n),\frac{\lambda_{c}-\epsilon}{\sqrt{n}}\leq p(n),

then by Theorem 1.4 no component of the square graph can have full support, and thus the graph Γ\Gamma is not 𝒞𝒮\mathcal{CFS}. It then follows from [15], that WΓW_{\Gamma} has at least cubic divergence, and thus by Proposition 3.5 that it is not thick of order 1. ∎

4. Branching processes and proof strategy

4.1. Branching processes

We recall here some basic facts and definitions from the theory of branching processes that we will use in our argument; for a more general treatment of such processes, see e.g. [2].

Definition 4.1.

A Galton–Watson branching process 𝐖=(Wt)t0\mathbf{W}=(W_{t})_{t\in\mathbb{Z}_{\geq 0}} with offspring distribution XX is a sequence of non-negative integer-valued random variables with W0=1W_{0}=1 and for all t1t\geq 1, Wt=i=1Wt1Xi,tW_{t}=\sum_{i=1}^{W_{t-1}}X_{i,t}, where the Xi,tX_{i,t}: i,ti,t\in\mathbb{N} are independent, identically-distributed random variables with Xi,tXX_{i,t}\sim X for all i,ti,t.

A Galton–Watson branching process can be viewed as a random rooted tree: in the zero-th generation there is a root or ancestor, who begets a random number X1,1XX_{1,1}\sim X of children that form the first generation. In every subsequent generation, each child independently begets a random number of children, with the ii-th member of generation tt begetting Xi,tXX_{i,t}\sim X children.

Galton–Watson branching processes are a widely studied family of random processes and are the subject of much probabilistic research; see e.g. [2] and the references therein. Here we introduce only some fairly standard elements of the theory that are needed for our argument. A Galton–Watson process 𝐖\mathbf{W} is said to become extinct if Wt=0W_{t}=0 for some tt\in\mathbb{N}. The total progeny of 𝐖\mathbf{W} is the total number of vertices in the associated tree, which we denote by W=t=0WtW=\sum_{t=0}^{\infty}W_{t}; this quantity is finite if and only if 𝐖\mathbf{W} becomes extinct.

A key tool in the study of 𝐖\mathbf{W} is the generating function of its offspring distribution, fX(t)=𝔼tXf_{X}(t)=\mathbb{E}t^{X}. The following standard results from the theory of branching processes relate the probability of extinction for 𝐖\mathbf{W} to the mean and generating function of its offspring distribution XX.

Proposition 4.2 (See e.g. [2]).

Let μ=𝔼X\mu=\mathbb{E}X and f(t)=fX(t)f(t)=f_{X}(t). Let 𝐖\mathbf{W} be a Galton–Watson branching process with offspring distribution XX. Then the following hold:

  1. (i)

    (subcritical regime) if μ<1\mu<1, then almost surely 𝐖\mathbf{W} becomes extinct, and what is more

    (𝐖 has not become extinct by generation k)=(Wk0)μk.\mathbb{P}\left(\mathbf{W}\textrm{ has not become extinct by generation }k\right)=\mathbb{P}(W_{k}\neq 0)\leq\mu^{k}.
  2. (ii)

    (supercritical regime) if μ>1\mu>1, then the probability θe\theta_{e} that 𝐖\mathbf{W} becomes extinct is the smallest solution θ[0,1]\theta\in[0,1] to the equation

    f(θ)=θ,f(\theta)=\theta,

    and satisfies θe<1\theta_{e}<1.

We shall also need the following result on the distribution of the total progeny WW of 𝐖\mathbf{W}.

Proposition 4.3 (Dwass’s formula [11]).

Let 𝐖\mathbf{W} be a Galton–Watson branching process with offspring distribution XX. Then the total progeny WW satisfies

(W=k)=1k(X1+X2+Xk=k1),\mathbb{P}\left(W=k\right)=\frac{1}{k}\mathbb{P}\left(X_{1}+X_{2}+\cdots X_{k}=k-1\right),

where X1,X2,,XkX_{1},X_{2},\ldots,X_{k} are independent, identically distributed random variables with XiXX_{i}\sim X for all i[k]i\in[k].

4.2. Departures from the Bollobás–Riordan framework

Bollobás and Riordan in [7] developed a powerful branching process framework for the study of clique percolation. Much of that framework can be adapted to the study of the non-induced square percolation we are concerned with in this paper. However there remains a number of significant hurdles which need to be overcome in order to extend their techniques to the present setting.

In the subcritical regime, the structure of squares makes the analysis of exceptional edges and offspring distributions (which are the crux of the argument) differ significantly from the Bollobás–Riordan paper; care is needed to handle the resulting complications correctly. Indeed, Bollobás and Riordan are able to model clique percolation using a Galton–Watson branching process whose offspring distribution is roughly Poisson; however, for square percolation, the offspring distribution is more heavy-tailed, forcing us to resort to somewhat delicate technical arguments.

Further, in the supercritical regime, because of our motivation from geometric group theory, we are interested in the study of induced square percolation. In particular, adding new edges to a graph could destroy some induced squares and hence split apart square-components even as we are trying to build a giant square-component. This situation is quite unlike that in clique percolation, and we have to use a completely different sprinkling argument to obtain our results (inter alia sprinkling vertices rather than edges). Thus here again there are significant complications and major departures from Bollobás and Riordan’s framework in [7].

4.3. Proof strategy

Our results rely on the analysis of a branching process exploration of the square-components of a graph Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p) for some fixed p=λn1/2p=\lambda n^{-1/2} where λ>0\lambda>0.

We begin with an arbitrary induced square S1=abcdS_{1}=abcd in Γ\Gamma. Its diagonals acac and bdbd give us two pairs of non-edges which can be used to discover further non-edges of Γ\Gamma belonging to the same square-component. The size of the set (ΓaΓc){b,d}\left(\Gamma_{a}\cap\Gamma_{c}\right)\setminus\{b,d\} of common neighbors of aa and cc in V(Γ){b,d}V(\Gamma)\setminus\{b,d\} is a binomially distributed random variable ZBinom(n4,p2)Z\sim\mathrm{Binom}(n-4,p^{2}). Assuming that ΓaΓc\Gamma_{a}\cap\Gamma_{c} is an independent set (i.e., contains no edge of Γ\Gamma) these common neighbors together with b,db,d give rise to (Z+22)\binom{Z+2}{2} non-edges that lie in the same square-component as acac; however, since we already knew about the pair bdbd, only X=(Z+22)1X=\binom{Z+2}{2}-1 of these are new. We then pursue our exploration of the square-component of acac by iterating this procedure: for each as-yet untested non-edge xyxy in our square-component, we can first find the common neighbors of xyxy, and add as “children” of xyxy all the new non-edges discovered in this way.

This can be viewed as a Galton–Watson branching process 𝐖\mathbf{W} with offspring distribution XX in a natural way. Assuming the past exploration does not greatly interfere with the distribution of the number of children in our process, the expected number of children at each step is roughly equal to

𝔼X=𝔼((Z+22)1)=𝔼(Z2+3Z2).\mathbb{E}X=\mathbb{E}\left(\binom{Z+2}{2}-1\right)=\mathbb{E}\left(\frac{Z^{2}+3Z}{2}\right).

The expected value of XX is readily computed from the first and second moments of the binomial distribution of ZZ, yielding

𝔼X=12λ4+2λ2+o(1).\mathbb{E}X=\frac{1}{2}\lambda^{4}+2\lambda^{2}+o(1).

The Galton–Watson process 𝐖\mathbf{W} becomes critical when the expectation of its offspring distribution is 11. Solving

12λ4+2λ2=1\frac{1}{2}\lambda^{4}+2\lambda^{2}=1

and selecting the non-negative root λc=62=0.6704\lambda_{c}=\sqrt{\sqrt{6}-2}=0.6704\ldots, we thus see that for for any fixed λ<λc\lambda<\lambda_{c}, our branching process 𝐖\mathbf{W} is subcritical. We thus expect it to terminate a.a.s. after a fairly small number of steps, from which one can hope to, in turn, deduce that a.a.s. all square-components are small. On the other hand, for any fixed λ>λc\lambda>\lambda_{c}, 𝐖\mathbf{W} is supercritical, and with probability strictly bounded away from zero it does not terminate before we have discovered a reasonably large number of non-edges. A second-moment argument can then be used to show that a strictly positive proportion of non-edges must lie in reasonably large square-components. With a little glueing work, we can then hope to show that in fact a strictly positive proportion of non-edges lie in a giant square-component that covers all the vertices of Γ\Gamma.

The above is however a simplification of what is actually required to make the arguments go through, and the situation turns out to be considerably more nuanced than what we described above. A first issue is our assumption that the vertices in ZZ form an independent set: in the subcritical regime, we need to consider what happens if the set ZZ of common neighbors of some non-edge xyxy which we are testing interacts with some other previously discovered vertices, or with vertices in ZZ. In particular, any “exceptional” edge from ZZ to previously discovered vertices other than xx, yy could potentially create many additional squares, and hence add many new pairs to our square-component which are not accounted for by our branching process. Bollobás and Riordan faced a similar problem in their work on clique percolation. However, as stated in the previous subsection, the way they dealt with “exceptional edges” does not quite work for us in the square percolation setting. One issue is that in a copy of C4C_{4}, vertices on opposite sides of a diagonal are not adjacent, so that the number of 44-cycles created by an exceptional edge cannot always be bounded by the degree of a newly discovered vertex. In addition, we note that if it is not dealt with properly, the presence of exceptional edges could significantly affect the future distribution of the number of children in our branching process: if in the example above a,ca,c had three common neighbors among the already discovered vertices rather than two, then the correct number of children for acac in the exploration process would be (Z+32)3=(Z+22)1+Z\binom{Z+3}{2}-3=\binom{Z+2}{2}-1+Z, which has expectation equal to 1+λc2>11+\lambda_{c}^{2}>1 when λ=λc\lambda=\lambda_{c}. Finally, for the argument to work, we need not only for a Galton–Watson branching process with offspring distribution XX to become a.a.s. extinct within a few generations (which is an easy first moment argument): we also need its total progeny to be a.a.s. small. Here the fact that XX is a quadratic function of the binomial random variable ZZ (and thus rather heavy-tailed) causes complicated issues, which were not faced in [7] (where the offspring distribution was essentially Poisson with mean <1<1). Overcoming these problems is the main work done in Section 5.

Secondly, in the supercritical argument, after establishing the a.a.s. existence of many non-edges in reasonably large square-components, we must prove the a.a.s. existence of a giant square-component covering all vertices and a strictly positive proportion of non-edges of Γ\Gamma. Here the crucial point is that, because of the applications in geometric group theory motivating our work, we are considering induced square percolation. The size of a largest square-component in Γ\Gamma is not monotone with respect to the addition of edges to the graph — adding an edge could very well destroy an induced square, thus potentially breaking a large square-component in to several smaller pieces. So we have to use a completely new sprinkling argument to be able to agglomerate all “reasonably large” square-components into a single giant square-component. To do this we reserve some vertices for sprinkling, rather than edges. We use these vertices to build bridges between reasonably large square-components in a sequence of rounds until all such components are joined into one. Finally once we have established the a.a.s. existence of a giant square-component, some care is needed to ensure this square-component covers every vertex of Γ\Gamma. Assembling a giant square-component and ensuring it has full support in this way involves overcoming a number of interesting obstacles, and is the main work done in Section 6.

5. The subcritical regime: proof of Theorem 1.4

Theorem 1.4 will be established as an immediate consequence of a stronger result, Theorem 5.1, which we state and prove below after providing a few preliminary definitions.

Given a graph Γ\Gamma, in addition to the square-graph (Γ)\square(\Gamma) from Definition 1.1 we shall consider a different but closely related auxiliary graph (Γ)(Γ)\boxtimes(\Gamma)(\Gamma) that includes information about all squares in Γ\Gamma (rather than just the induced squares). Explicitly we let (Γ):=(V(Γ)(2),{{ac,bd}:ab,bc,cd,adE(Γ)})\boxtimes(\Gamma):=(V(\Gamma)^{(2)},\{\{ac,bd\}:\ ab,bc,cd,ad\in E(\Gamma)\}) be the graph whose vertices are pairs of vertices from V(Γ)V(\Gamma) and whose edges correspond to (not necessarily induced) copies of C4C_{4}. The support supp(C)\mathrm{supp}(C) of a component CC of (Γ)\boxtimes(\Gamma) is defined as in Definition 1.3, mutatis mutandis.

Note that the square graph (Γ)\square(\Gamma) is exactly the subgraph of (Γ)\boxtimes(\Gamma) induced by the set {abV(Γ)(2):abE(Γ)}\{ab\in V(\Gamma)^{(2)}:\ ab\notin E(\Gamma)\} of non-edges of Γ\Gamma. In particular, for every square-component CC in (Γ)\square(\Gamma), there is a component CC^{\prime} in (Γ)\boxtimes(\Gamma) with CCC\subseteq C^{\prime} and thus supp(C)supp(C)\operatorname{\mathrm{supp}}(C)\subseteq\operatorname{\mathrm{supp}}(C^{\prime}). To establish Theorem 1.4, it is thus enough to prove the following stronger theorem that bounds the size of the support in Γ\Gamma of components of (Γ)\boxtimes(\Gamma).

Theorem 5.1.

Let λ<λc\lambda<\lambda_{c} be fixed. Suppose that p(n)λn1/2p(n)\leq\lambda n^{-1/2}. Then for Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p), a.a.s. every component of (Γ)\boxtimes(\Gamma) has a support of size O((logn)231)O((\log n)^{2^{31}}).

Since the order of the support of the largest component in (Γ)\boxtimes(\Gamma) is monotone non-decreasing with respect to the addition of edges to Γ\Gamma, we may assume in the remainder of this section that p(n)=λn1/2p(n)=\lambda n^{-1/2}. Further, since λ<λc\lambda<\lambda_{c} is fixed, there exists a constant ε>0\varepsilon>0 such that for a binomially distributed random variable ZBinom(n,p2)Z\sim\mathrm{Binom}(n,p^{2}) we have

(5.1) 𝔼((Z+22)1)=1ε.\displaystyle\mathbb{E}\left(\binom{Z+2}{2}-1\right)=1-\varepsilon.

With this last equality in hand, we are now ready to present and analyse the exploration process that lies at the heart of our proof of Theorem 5.1.

We shall discover a superset of the component of (Γ)\boxtimes(\Gamma) which contains sine fixed pair v1v2VΓ(2)v_{1}v_{2}\in V_{\Gamma}^{(2)}. We begin our exploration by finding common neighbors of v1v_{1} and v2v_{2}, then adding all pairs of such newly discovered vertices to a set of unexplored pairs. These new pairs obviously lie in the same component of (Γ)\boxtimes(\Gamma) as v1v2v_{1}v_{2}.

After this initial step in the exploration, we proceed as follows. First, we choose a new pair ata_{t} from our set of unexplored pairs. By assumption, there exists a previously explored pair btb_{t} such that all 44 edges from ata_{t} to btb_{t} are present. We continue our exploration by finding the set ZtZ_{t} of common neighbors of the vertices in ata_{t} among the previously undiscovered vertices of Γ\Gamma. We then add all pairs (Ztbt)(2){bt}\left(Z_{t}\cup b_{t}\right)^{(2)}\setminus\{b_{t}\} to our set of active pairs — these obviously lie in the same component of (Γ)\boxtimes(\Gamma) as ata_{t} — and delete ata_{t} from that set. We then repeat the procedure, choosing a new unexplored pair at+1a_{t+1}, finding its common neighbors among undiscovered vertices, etc.

This, however, is not enough to discover the totality of the component of v1v2v_{1}v_{2} in (Γ)\boxtimes(\Gamma). Indeed, it is possible that the pair ata_{t} has additional common neighbors among already discovered vertices (in addition to the two neighbors in btb_{t}), which could give rise to additional, unexplored pairs that lie in the same component of (Γ)\boxtimes(\Gamma) as ata_{t}. To deal with this possibility, we have to add in an exceptional phase in our exploration, which takes care of potential additional edges among the vertices we have discovered. In this exceptional phase, we generously overestimate how many new unexplored pairs could be discovered, and for each of these new pairs we start new and essentially independent versions of our exploration process, which we think of as children processes of our original exploration process.

The key is that, with the exceptional phase factored in, we do discover a superset of the collection of all pairs in the same component of (Γ)\boxtimes(\Gamma) as our starting pair v1v2v_{1}v_{2}. We compare the non-exceptional phase of our exploration to a subcritical branching process and give upper bound on its total progeny, which is small. Obtaining this bound is somewhat tricky (due to the nature of our offspring distribution) and relies on a rather technical application of Dwass’s formula (Proposition 4.3). The remainder of the proof is provided in Lemma 5.8 which shows that, with high probability, we do not run through more than five exceptional phases. In particular, we do not start too many child processes, so that with high probability our overall exploration process stops before we have discovered a large number of vertices.

5.1. An exploration process

Our exploration process will proceed by considering the following for each time t0t\geq 0:

  • (Discovered vertices.) An ordered set of vertices: Dt={v1,v2,,vdt}D_{t}=\{v_{1},v_{2},\ldots,v_{d_{t}}\}.

  • (Active pairs.) A set of pairs of vertices (ordered lexicographically with respect to the ordering on DtD_{t}): At={x1y1,x2y2,,xatyat}Dt(2)A_{t}=\{x_{1}y_{1},x_{2}y_{2},\ldots,x_{a_{t}}y_{a_{t}}\}\subseteq D_{t}^{(2)}.

  • (Discovered pairs.) A set of pairs of vertices: StDt(2)S_{t}\subseteq D_{t}^{(2)}.

  • (Explored edge set.) A set of edges: EtDt(2)E(Γ)E_{t}\subseteq D_{t}^{(2)}\cap E(\Gamma).

  • (Epoch.) An integer: et{0,1,2,3,4,5}e_{t}\in\{0,1,2,3,4,5\}.

These sets will satisfy:

  • (\star)

    for all t0t\geq 0 and for every active pair xiyiAtx_{i}y_{i}\in A_{t}, the vertices xix_{i} and yiy_{i} have either 0 or 22 common neighbors in the graph (Dt,Et)(D_{t},E_{t}).

The initial state of the exploration consists of the following data, which is seeded by a choice of v1v2v_{1}v_{2}, an arbitrary pair of vertices from V(Γ)V(\Gamma) (note that this pair can, alternatively, be thought of as a vertex of (Γ)\boxtimes(\Gamma)). We set D0={v1,v2}D_{0}=\{v_{1},v_{2}\}, A0=S0={v1v2}A_{0}=S_{0}=\{v_{1}v_{2}\}, E0=E_{0}=\emptyset and e0=0e_{0}=0.

At each time step tt our exploration proceeds as follows, with ε\varepsilon as given in equation (5.1):

  1. 1.

    If |Dt|<2210ε210(logn)231|D_{t}|<2^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}} and AtA_{t}\neq\emptyset, let a=xya={x}{y} be the first pair in AtA_{t}. For each zV(Γ)Dtz\in V(\Gamma)\setminus D_{t} we test whether or not zz sends an edge in Γ\Gamma to both vertices of aa. Set Zt:={zV(Γ)Dt:zx,zyE(Γ)}Z_{t}:=\{z\in V(\Gamma)\setminus D_{t}:\ {z}{x},{z}{y}\in E(\Gamma)\} and FtF_{t} to be the collection of joint neighbors of xx and yy in (Dt,Et)(D_{t},E_{t}). (Note, by property ()(\star), the set FtF_{t} is either empty or consists of a pair of discovered vertices.)

    We arbitrarily order the vertices in ZtZ_{t} as {vdt+1,vdt+2,,vdt+1}\{v_{d_{t}+1},v_{d_{t}+2},\ldots,v_{d_{t+1}}\}, and add them to DtD_{t} to form Dt+1D_{t+1}. We then set

    At+1=(At(FtZt)(2))(aFt(2))A_{t+1}=\left(A_{t}\cup\left(F_{t}\cup Z_{t}\right)^{(2)}\right)\setminus\left(a\cup F_{t}^{(2)}\right)

    to be the new collection of active pairs (which again is ordered lexicographically with respect to the ordering on DtD_{t}), set Et+1=Et{zx,zy:zZt}E_{t+1}=E_{t}\cup\{zx,zy:\ z\in Z_{t}\}, set et+1=ete_{t+1}=e_{t}, St+1=StAt+1S_{t+1}=S_{t}\cup A_{t+1} and then proceed to the next time step t+1t+1 of the process. Note that since the only new edges being added are ones connecting a new vertex to xx and yy and since xyAt+1xy\notin A_{t+1} each pair in At+1A_{t+1} has either 0 or 22 common neighbors in (Dt+1,Et+1)(D_{t+1},E_{t+1}), i.e., property (\star) is still satisfied in the next time-step.

  2. 2.

    If |Dt|2210ε210(logn)231|D_{t}|\geq 2^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}}, then we terminate the process and declare large stop.

  3. 3.

    If |Dt|<2210ε210(logn)231|D_{t}|<2^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}} and At=A_{t}=\emptyset, then we consider i=|E(Γ[Dt])Et|i=|E(\Gamma[D_{t}])\setminus E_{t}|.

    If i=0i=0 or et+i5e_{t}+i\geq 5, then we terminate our exploration process and declare extinction stop or exceptional stop, respectively.
    Otherwise we set et+1=et+ie_{t+1}=e_{t}+i, update EtE_{t} by setting Et=E(Γ[Dt])E_{t}=E(\Gamma[D_{t}]), and set i1=ii_{1}=i (which by assumption is >0>0). We then run the following subroutines:

    1. 3A.

      Set

      Zt1:={zV(Γ)Dt:z sends at least three edges into Dt}.Z^{1}_{t}:=\{z\in V(\Gamma)\setminus D_{t}:\ z\textrm{ sends at least three edges into }D_{t}\}.

      We then update our value of i1i_{1}, setting i1=|Zt1|i_{1}=|Z^{1}_{t}|.

      • If i1>0i_{1}>0 and et+1+i1>5e_{t+1}+i_{1}>5, then we terminate the whole exploration process and declare exceptional stop.

      • Else if i1>0i_{1}>0 and et+1+i15e_{t+1}+i_{1}\leq 5, we add Zt1Z^{1}_{t} to DtD_{t}, update EtE_{t} by setting Et=E(Γ[Dt])E_{t}=E(\Gamma[D_{t}]), update StS_{t} by setting St=Dt(2)S_{t}=D_{t}^{(2)}. We then update et+1e_{t+1} to et+1+i1e_{t+1}+i_{1} and run through subroutine 3.3A. again.

      • Otherwise i1=0i_{1}=0 and we proceed to subroutine 3.3B.

    2. 3B.

      Let

      Zt2:={zV(Γ)Dt:z sends at least two edges into Dt},Z^{2}_{t}:=\{z\in V(\Gamma)\setminus D_{t}:\ z\textrm{ sends at least two edges into }D_{t}\},

      Since subroutine 3A. terminated with i1=0i_{1}=0 each vertex in Zt2Z^{2}_{t} sends exactly two edges into DtD_{t}. We set Dt+1=DtZt2D_{t+1}=D_{t}\cup Z^{2}_{t}, and let At+1A_{t+1} consist of all pairs of vertices in Dt+1D_{t+1} containing at least one vertex of Zt2Z^{2}_{t}. Further, we set St+1=StAt+1S_{t+1}=S_{t}\cup A_{t+1}.

      Once this is done, we proceed to the next time-step t+1t+1 in the overall exploration process, observing that property ()(\star) has been preserved (since by construction every vertex in Zt2Z^{2}_{t} has degree exactly two in (Dt+1,Et+1)(D_{t+1},E_{t+1})).

\labellist
\hair

2pt \pinlabelDtD_{t} [ ] at 7 8 \pinlabelFtF_{t} [ ] at 12 40 \pinlabelxx [ ] at 36 91 \pinlabelyy [ ] at 36 9 \pinlabelf1f_{1} [ ] at 27 51 \pinlabelf2f_{2} [ ] at 46 51 \pinlabelz1z_{1} [ ] at 136 40 \pinlabelz2z_{2} [ ] at 153 40 \endlabellist\includegraphics[scale=1.0]Figure_Process_1

Figure 1. An illustration of Stage 1 of the Exploration Process, with exploration from the active pair a=xya=xy at time tt. In this example, Ft={f1,f2}F_{t}=\{f_{1},f_{2}\} and the set of newly discovered vertices is Zt={z1,z2}Z_{t}=\{z_{1},z_{2}\}. We thus have At+1At={f1z1,f2z1,f1z2,f2z2,z1z2}A_{t+1}\setminus A_{t}=\{f_{1}z_{1},f_{2}z_{1},f_{1}z_{2},f_{2}z_{2},z_{1}z_{2}\} and AtAt+1={xy}A_{t}\setminus A_{t+1}=\{xy\}.
\labellist
\hair

2pt \pinlabelv1v_{1} [ ] at 119 99 \pinlabelv2v_{2} [ ] at 119 18 \pinlabelw1w_{1} [ ] at 51 64 \pinlabelw2w_{2} [ ] at 154 64 \pinlabelz1z_{1} [ ] at 87 65 \pinlabelz2z_{2} [ ] at 124 3 \endlabellist\includegraphics[scale=1.0]Figure_Process_2

Figure 2. Illustration of Stage 3 of the Exploration Process. Suppose the pairs w1w2,w1z1,w2z1Stw_{1}w_{2},w_{1}z_{1},w_{2}z_{1}\in S_{t} were discovered while v1v2v_{1}v_{2} was active. When w1w2w_{1}w_{2} is active in Stage 1 the pairs {v1z2,v2z2}\{v_{1}z_{2},v_{2}z_{2}\} will be discovered. However the pair z1z2z_{1}z_{2} will only be discovered in an instance of Stage 3, if the two dashed edges are revealed to be present (thus making w1z1w2z2w_{1}z_{1}w_{2}z_{2} a square).

5.2. Analysing the process

The exploration process defined in the previous subsection can terminate for one of three reasons:

  1. (1)

    |Dt|2210ε210(logn)231|D_{t}|\geq 2^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}} (large stop);

  2. (2)

    et5e_{t}\geq 5 (exceptional stop);

  3. (3)

    At=A_{t}=\emptyset and Et=Γ[Dt]E_{t}=\Gamma[D_{t}] (extinction stop).

It follows from the above that the process must in fact terminate within O((logn)232)O\left((\log n)^{2^{32}}\right) time-steps. We begin our analysis by noting that, given our aim of proving Theorem 1.4, extinction stops are good for us:

Lemma 5.2.

Suppose that the exploration from v1v2{v_{1}}{v_{2}} terminates at time TT with an extinction stop. Let CC be the component of (Γ)\boxtimes(\Gamma) containing v1v2{v_{1}}{v_{2}}. Then CSTC\subseteq S_{T}. Furthermore, the number of vertices in the support of CC is at most 2210ϵ210(logn)2312^{2^{10}}\epsilon^{-2^{10}}\left(\log{n}\right)^{2^{31}} and |C|2211ϵ211(logn)232|C|\leq 2^{2^{11}}\epsilon^{-2^{11}}\left(\log{n}\right)^{2^{32}}.

Proof.

We perform our exploration process from the pair v1v2v_{1}v_{2}, and assume it terminates with an extinction stop at time TT. It is enough to show that given a=u1u2CSta=u_{1}u_{2}\in C\cap S_{t}, for every neighbor b=w1w2b=w_{1}w_{2} of aa in (Γ)\boxtimes(\Gamma), there is some tTt^{\prime}\leq T such that bStb\in S_{t^{\prime}}.

If aa was discovered at a time-step where 1. applied or if aA0a\in A_{0}, then aAta\in A_{t} and was an active pair at some time t0t\geq 0. Thus, at some later time-step tt^{\prime} where 1. applies, our exploration process selects aa as its “exploration pair” and discover all neighbors z1z2z_{1}z_{2} of aa in (Γ)\boxtimes(\Gamma) with z1z2(ZtFt)(2)z_{1}z_{2}\in\left(Z_{t^{\prime}}\cup F_{t^{\prime}}\right)^{(2)} — where FtF_{t^{\prime}} is the pair we used to discover aa, and ZtZ_{t^{\prime}} is the collection of joint neighbors of u1u_{1} and u2u_{2} that lie in V(Γ)DtV(\Gamma)\setminus D_{t^{\prime}}. If bb is in this set, then bStb\in S_{t^{\prime}}.

Otherwise, if we failed to find b=w1w2b=w_{1}w_{2} at this time tt^{\prime}, bb must contain at least one vertex from DtFtD_{t^{\prime}}\setminus F_{t^{\prime}}. By property ()(\star) of our exploration, at least one of the edges uiwju_{i}w_{j}, i,j{1,2}i,j\in\{1,2\} lies outside EtE_{t^{\prime}}.

In particular, since we do not end with a large or exceptional stop, this edge will be uncovered at later time step t′′t^{\prime\prime} where 3.3A. applies. But by the end of 3.3B., all vertices sending at least two edges into Dt′′D_{t^{\prime\prime}} have been added to Dt′′D_{t^{\prime\prime}}. Thus all common neighbors of aa will be found (since aDt′′a\subseteq D_{t^{\prime\prime}}, and a common neighbor of aa has at least two neighbors in Dt′′D_{t^{\prime\prime}}). Hence after the updates bDt′′b\subseteq D_{t^{\prime\prime}}. There are two options: if both vertices of bb are present after 3.3A., then bSt′′b\in S_{t^{\prime\prime}}, since after 3.3A. all possible pairs of discovered vertices (not already tested) are added to St′′S_{t^{\prime\prime}}. Otherwise, both vertices of bb are present after 3.3B., and since at least one of them was discovered in 3.3B., bb is added to At′′+1St′′+1A_{t^{\prime\prime}+1}\subseteq S_{t^{\prime\prime}+1}, and we are done again.

If on the other hand aa was discovered at a step tt where 3.3B. applies, then aa is added to At+1A_{t+1}, and the above applies. Finally, if aa was discovered at a time step tt where 3.3A. applies, then in 3.3A. and 3.3B., all common neighbors of aa are added to DtD_{t}, and all such pairs are added to StS_{t} or St+1S_{t+1}.

Either way, since StSTS_{t}\subseteq S_{T} for all tTt\leq T, we see that bSTb\in S_{T}. Thus every neighbor of aa in (Γ)\boxtimes(\Gamma) is eventually discovered by our exploration process, and CSTC\subseteq S_{T} as claimed. Further, since STDT(2)S_{T}\subseteq D_{T}^{(2)} by construction, and since our exploration ends with an extinction stop by the hypothesis of the lemma, we have |C|12|DT|2<2211ε211(logn)232|C|\leq\frac{1}{2}|D_{T}|^{2}<2^{2^{11}}\varepsilon^{-2^{11}}(\log n)^{2^{32}} as claimed. ∎

We now turn to the technical crux of the analysis.

Lemma 5.3.

If we are at a time-step tt of the process where 1. applies, then given the past history of the process, the random variable |At+1At||A_{t+1}\setminus A_{t}| counting the number of new active pairs discovered by a1=x1y1a_{1}=x_{1}y_{1} is stochastically dominated by a random variable X=Z2+3Z2X=\frac{Z^{2}+3Z}{2}, where ZZ is a binomial random variable with parameters nn and p2p^{2}.

Proof.

As observed by Bollobás and Riordan [7, Inequality (3)], the past of the exploration is the intersection of the principal increasing event 𝒰t={EtE(Γ)}\mathcal{U}_{t}=\{E_{t}\subseteq E(\Gamma)\} (corresponding to the edges that we have discovered are present in Γ\Gamma) and a decreasing event 𝒟\mathcal{D} (corresponding to the intersection of a number of events of the form “at least one of zx,zyzx,zy is not in E(Γ)E(\Gamma)”). In particular appealing to Harris’s Lemma [13], the conditional probability that xx and yy both send an edge to zV(Γ)Dtz\in V(\Gamma)\setminus D_{t} in Γ\Gamma given the history is at most the unconditional probability p2p^{2}. (We note here that the fact zDtz\notin D_{t} is essential — we have no control over the conditional probabilities of edges inside the set of discovered vertices DtD_{t}.)

Thus conditional on the past history of the exploration process, the distribution of |Zt||Z_{t}| is stochastically dominated by a random variable ZBinom(n,p2)Z\sim\mathrm{Binom}(n,p^{2}). The Lemma then immediately follows from the definition of our exploration process. (Note that here we are using the fact property (\star) is maintained throughout our exploration.) ∎

Lemma 5.4.

Let ZBinom(n,p2)Z\sim\mathrm{Binom}(n,p^{2}) and kk\in\mathbb{N}. Then (Z9logn+9logk)n5k6\mathbb{P}(Z\geq 9\log n+9\log k)\leq n^{-5}k^{-6}.

Proof.

Recall that in this section, p=λn1/2p=\lambda n^{-1/2}, for some constant λ<λc\lambda<\lambda_{c}. Since ZBinom(n,p2)Z\sim\mathrm{Binom}(n,p^{2}), we have:

(Z9(logn+logk))\displaystyle\mathbb{P}\Bigl{(}Z\geq 9(\log n+\log k)\Bigr{)} =r=9(logn+logk)n(nr)p2r(1p2)nr<r=9(logn+logk)nnr(λcn1/2)2r\displaystyle=\sum_{r=\lceil 9(\log n+\log k)\rceil}^{n}\binom{n}{r}p^{2r}(1-p^{2})^{n-r}<\sum_{r=\lceil 9(\log n+\log k)\rceil}^{n}n^{r}\left(\lambda_{c}n^{-1/2}\right)^{2r}
=r=9(logn+logk)n(λc)2r<nλc29(logn+logk)nexp(18logλc(logn+logk)),\displaystyle=\sum_{r=\lceil 9(\log n+\log k)\rceil}^{n}\left(\lambda_{c}\right)^{2r}<n\lambda_{c}^{2\lceil 9(\log n+\log k)\rceil}\leq n\exp\left(18\log\lambda_{c}(\log n+\log k)\right),

where in the last two inequalities we used the fact (λc)2=62<1(\lambda_{c})^{2}=\sqrt{6}-2<1. Since 18logλc<618\log\lambda_{c}<-6, this immediately gives us the desired bound

(Z9logn+9logk)<ne6logn6logk=n5k6.\mathbb{P}\left(Z\geq 9\log n+9\log k\right)<ne^{-6\log n-6\log k}=n^{-5}k^{-6}.

Corollary 5.5.

(xyV(Γ)(2) : |ΓxΓy|9logn)n3\mathbb{P}\left(\exists\ {x}{y}\in V(\Gamma)^{(2)}\textrm{ : }|\Gamma_{x}\cap\Gamma_{y}|\geq 9\log n\right)\leq n^{-3}.

Proof.

Fix xyV(Γ)(2){x}{y}\in V(\Gamma)^{(2)}. By Lemma 5.4 with k=1k=1, we have

(|ΓxΓy|9logn)=r=9lognn2(n2r)p2r(1p2)n2r<(Z9logn)n5.\displaystyle\mathbb{P}\left(|\Gamma_{x}\cap\Gamma_{y}|\geq 9\log n\right)=\sum_{r=\lceil 9\log n\rceil}^{n-2}\binom{n-2}{r}p^{2r}(1-p^{2})^{n-2r}<\mathbb{P}\left(Z\geq 9\log n\right)\leq n^{-5}.

Taking a union bound over all (n2)<n2\binom{n}{2}<n^{2} possible choices of the pair xyxy, the lemma follows. ∎

We now analyse the total progeny of the Galton–Watson branching process with offspring distribution given by the random variable XX from the statement of Lemma 5.3. By (5.1), 𝔼X=1ϵ\mathbb{E}X=1-\epsilon and thus the branching process is subcritical. Unfortunately, combining the Markovian bound on the extinction time from Proposition 4.2(i), with the bounds on the maximum degree in (Γ)\boxtimes(\Gamma) from Corollary 5.5 does not give us sufficiently good control on the extinction time and total progeny of our Galton–Watson process. Thus we turn to an application of Dwass’s formula to obtain the tighter bounds needed for the proof of Theorem 1.4.

Lemma 5.6.

Let 𝐖=(Wt)t0\mathbf{W}=\left(W_{t}\right)_{t\in\mathbb{Z}_{\geq 0}} be a Galton–Watson branching process with an offspring distribution XX as in Lemma 5.3. Set k0=226ε2(logn)5k_{0}=2^{26}\varepsilon^{-2}(\log n)^{5}. Then 𝐖\mathbf{W} is subcritical, and its total progeny W=t=0WtW=\sum_{t=0}^{\infty}W_{t} satisfies

(Wk0)=O(n5).\mathbb{P}\left(W\geq k_{0}\right)=O\left(n^{-5}\right).
Proof.

Let {Xk,j:k,j[k]}\{X_{k,j}:\ k\in\mathbb{N},\ j\in[k]\} be an infinite family of independent, identically distributed copies of XX. For each kk\in\mathbb{N} and every j[k]j\in[k], write Xk,jX_{k,j} as Xk,j=Xak,j+Xbk,jX_{k,j}={X^{a}}_{k,j}+{X^{b}}_{k,j}, where Xak,j=min(Xk,j,28(logn+logk)2){X^{a}}_{k,j}=\min\left(X_{k,j},2^{8}(\log n+\log k)^{2}\right). Set μka=𝔼(Xak,1)\mu_{k}^{a}=\mathbb{E}({X^{a}}_{k,1}). By construction, μka𝔼[X]=1ε\mu_{k}^{a}\leq\mathbb{E}[X]=1-\varepsilon. Since X=(Z2+3Z)/22Z2X=(Z^{2}+3Z)/2\leq 2Z^{2}, where ZBinom(n,p2)Z\sim\mathrm{Binom}(n,p^{2}), and since 2(9(logn+logk))2<28(logn+logk)22\left(9(\log n+\log k)\right)^{2}<2^{8}(\log n+\log k)^{2}, Lemma 5.4 implies that for every kk\in\mathbb{N} and j[k]j\in[k],

(5.2) (Xbk,j>0)(2Z228(logn+logk)2)(Z9(logn+logk))n5k6.\displaystyle\mathbb{P}\left({X^{b}}_{k,j}>0\right)\leq\mathbb{P}\left(2Z^{2}\geq 2^{8}(\log n+\log k)^{2}\right)\leq\mathbb{P}\left(Z\geq 9\left(\log n+\log k\right)\right)\leq n^{-5}k^{-6}.

Applying Dwass’s formula, Proposition 4.3, to our branching process 𝐖\mathbf{W}, we have that for any kk\in\mathbb{N},

(W=k)\displaystyle\mathbb{P}(W=k) =1k(j=1kXk,j=k1)\displaystyle=\frac{1}{k}\mathbb{P}\left(\sum_{j=1}^{k}X_{k,j}=k-1\right)
(5.3) 1k((j=1kXak,jμka28(logn+logk)2k(1μka)128(logn+logk)2)+(j=1kXbk,j>0)).\displaystyle\leq\frac{1}{k}\left(\mathbb{P}\left(\sum_{j=1}^{k}\frac{{X^{a}}_{k,j}-\mu_{k}^{a}}{2^{8}(\log n+\log k)^{2}}\geq\frac{k(1-\mu_{k}^{a})-1}{2^{8}(\log n+\log k)^{2}}\right)+\mathbb{P}\left(\sum_{j=1}^{k}{X^{b}}_{k,j}>0\right)\right).

Since μka1ε\mu_{k}^{a}\leq 1-\varepsilon, for k2ε1k\geq 2\varepsilon^{-1} we have that 2(1μka)2k>2(1μka)ε>ε2(1-\mu_{k}^{a})-\frac{2}{k}>2(1-\mu_{k}^{a})-\varepsilon>\varepsilon and hence k(1μka)1>εk2k(1-\mu_{k}^{a})-1>\frac{\varepsilon k}{2}. Thus we have

s:=k(1μka)128(logn+logk)2>εk29(logn+logk)2:=s.s:=\frac{k(1-\mu_{k}^{a})-1}{2^{8}(\log n+\log k)^{2}}>\frac{\varepsilon k}{2^{9}(\log n+\log k)^{2}}:=s^{\prime}.

Since the random variables (Xak,jμka)/(28(logn+logk)2)({X^{a}}_{k,j}-\mu_{k}^{a})/(2^{8}(\log n+\log k)^{2}) are by construction independent random variables with mean zero and absolute value at most 11, we can apply a standard Chernoff bound [1, Theorem A.1.16] in (5.2) to obtain:

(j=1kXak,jμ1a28(logn)2s)e(s)22k=eε2k219(logn+logk)4.\mathbb{P}\left(\sum_{j=1}^{k}\frac{{X^{a}}_{k,j}-\mu_{1}^{a}}{2^{8}(\log n)^{2}}\geq s\right)\leq e^{-\frac{(s^{\prime})^{2}}{2k}}=e^{-\frac{\varepsilon^{2}k}{2^{19}(\log n+\log k)^{4}}}.

Letting, k0=k0(n)=226ε2(logn)5k_{0}=k_{0}(n)=\lceil 2^{26}\varepsilon^{-2}(\log n)^{5}\rceil we have that for nn sufficiently large, all kk0(n)k\geq k_{0}(n) satisfy k5ε2219(logn+logk)5k\geq 5\varepsilon^{-2}\cdot 2^{19}(\log{n}+\log{k})^{5}. In particular for such kk we have

(5.4) (j=1kXak,jμka28(logn)2s)eϵ2k219(logn+logk)4e5(logn+logk)n5k5.\displaystyle\mathbb{P}\left(\sum_{j=1}^{k}\frac{{X^{a}}_{k,j}-\mu_{k}^{a}}{2^{8}(\log n)^{2}}\geq s\right)\leq e^{-\frac{\epsilon^{2}k}{2^{19}\left(\log n+\log k\right)^{4}}}\leq e^{-5\left(\log n+\log k\right)}\leq n^{-5}k^{-5}.

Further, applying inequality (5.2) and a union bound, we get that

(5.5) (j=1kXbk,j>0)k(Xbk,1>0)n5k5.\displaystyle\mathbb{P}\left(\sum_{j=1}^{k}{X^{b}}_{k,j}>0\right)\leq k\mathbb{P}({X^{b}}_{k,1}>0)\leq n^{-5}k^{-5}.

Combining (5.2), (5.4) and (5.5), we finally have

(Wk0)\displaystyle\mathbb{P}(W\geq k_{0}) k=k0(W=k)k=k01k2n5k5=O(n5).\displaystyle\leq\sum_{k=k_{0}}^{\infty}\mathbb{P}(W=k)\leq\sum_{k=k_{0}}^{\infty}\frac{1}{k}2n^{-5}k^{-5}=O\left(n^{-5}\right).

We now use Lemma 5.6 to estimate the probability that our exploration ends with a large stop. The key is to view our exploration as a branching process of branching processes: we begin with a subcritical branching process, corresponding to step 1. in our exploration process. Call this the ancestral branching process. When this process becomes extinct, we run through step 3. of our exploration process, potentially adding new active pairs to our otherwise empty set of active pairs AtA_{t}. For each of these new pairs, we start an independent child branching process. For each of these we repeat the same procedure as for the ancestral branching process (so the child processes can generate their own child processes, and so forth). Thus to bound the total number of pairs discovered over the entire course of our exploration, we must control the growth of this branching process of branching processes.

Lemma 5.7.

The probability that the exploration from v1v2v_{1}v_{2} terminates with a large stop is O(n3)O(n^{-3}).

Proof.

We view our exploration process as a kind of branching process of branching processes, with parent processes begetting children processes as described at the start of this section. Beginning from the single pair v1v2v_{1}v_{2}, the exploration of its component in (Γ)\boxtimes(\Gamma) undertaken at time-steps tt when 1. applies is dominated by a subcritical branching process 𝐖\mathbf{W} as in the statement of Lemma 5.6. When that process terminates, we have discovered a certain set Dt0D_{t_{0}} of vertices of Γ\Gamma. If 3. applies, then we add a certain number of vertices to Dt0D_{t_{0}} to form Dt0+1D_{t_{0}+1} and then add a subset of the pairs from Dt0+1(2)D_{t_{0}+1}^{(2)} to form At0+1A_{t_{0}+1}. We may view each of the pairs aia_{i} added to At0+1A_{t_{0}+1} at this time as the root of a subcritical independent branching process 𝐖i\mathbf{W}_{i}. There are at most |Dt0+1|2|D_{t_{0}+1}|^{2} of these “child-processes”, and they are stochastically dominated by independent copies 𝐖i\mathbf{W}_{i} of the subcritical branching process 𝐖\mathbf{W} from Lemma 5.6. When all these branching processes have become extinct, we now have a (larger) set Dt1D_{t_{1}} of discovered vertices and we may be back at a time step where 3. applies. We repeat our procedure — adding vertices to form Dt+1D_{t^{\prime}+1}, adding new pairs to form At1+1A_{t_{1}+1}, etc. The whole procedure can begin again at most 55 times (for otherwise an exceptional stop must have occurred).

We run our exploration ignoring large stops and only applying 1. and 3. until the process terminates (with an exceptional stop or an extinction stop), and show that if an exceptional stop does not occur then with probability O(n3)O(n^{-3}) the final size of the discovered set of vertices is at most 2210ε210(logn)2312^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}}.

Since we never can consider more than (n2)\binom{n}{2} different active pairs, it follows that we never start more than n2n^{2} branching processes 𝐖i\mathbf{W}_{i} in the course of our exploration. By Lemma 5.6, and a union bound, the probability that one of our at most n2n^{2} branching processes 𝐖i\mathbf{W}_{i} has a total progeny of more than k0=226ε2(logn)5k_{0}=2^{26}\varepsilon^{-2}(\log n)^{5} is O(n3)O(n^{-3}). Assume from now on this does not happen. Since the progeny of our processes correspond to pairs xyV((Γ))=V(Γ)(2)xy\in V(\boxtimes(\Gamma))=V(\Gamma)^{(2)}, none of our processes can add more than 2k02k_{0} vertices to the set of discovered vertices DtD_{t}.

Further, by Corollary 5.5, the probability that there is any pair xyV((Γ))=V(Γ)(2)xy\in V(\boxtimes(\Gamma))=V(\Gamma)^{(2)} such that |ΓxΓy|9logn|\Gamma_{x}\cap\Gamma_{y}|\geq 9\log n is at most n3n^{-3}. Assume from now on this does not happen. Then in the first time-step t0t_{0} where 3. applies, we can bound the number of vertices added to Dt0D_{t_{0}}: each pair xyxy can contribute at most 9logn9\log n vertices to a Zt1Z^{1}_{t} or Zt2Z^{2}_{t}, and as we do not have an exceptional stop we can repeat the addition procedure at most 55 times. In particular we have

|Dt0+1|\displaystyle|D_{t_{0}+1}| ((((((|Dt0|)29logn)29logn)29logn))29logn)29logn)29logn\displaystyle\leq\left(\left(\left(\left(\left(\left(|D_{t_{0}}|\right)^{2}9\log n\right)^{2}9\log n\right)^{2}9\log n)\right)^{2}9\log n\right)^{2}9\log n\right)^{2}9\log n
(|Dt0|9(logn))26(18(logn)k0)26.\displaystyle\leq\left(|D_{t_{0}}|9\left(\log n\right)\right)^{2^{6}}\leq\left(18(\log n)k_{0}\right)^{2^{6}}\ .

The number of child processes started at that time step is at most |Dt0+1|2|D_{t_{0}+1}|^{2}; by our assumption, each of these discovers at most 2k02k_{0} vertices in total, so that by the next time-step t1t_{1} when 3. applies, we have

|Dt1||Dt0+1|22k0(18(logn)k0)272k0:=2k1.|D_{t_{1}}|\leq|D_{t_{0}+1}|^{2}2k_{0}\leq\left(18(\log n)k_{0}\right)^{2^{7}}2k_{0}:=2k_{1}.

Repeating the analysis above, we obtain

|Dt1+1|(18(logn)k1)26,|D_{t_{1}+1}|\leq\left(18(\log n)k_{1}\right)^{2^{6}},

and in the next time-step t2t_{2} where 3. applies we have

|Dt2||Dt1+1|22k0(18(logn)k1)272k0=:2k2.|D_{t_{2}}|\leq|D_{t_{1}+1}|^{2}2k_{0}\leq\left(18(\log n)k_{1}\right)^{2^{7}}2k_{0}=:2k_{2}.

and we can keep going in this way, defining k3k_{3}, k4k_{4} mutatis mutandis. If we avoid an exceptional stop, then we must terminate by the fifth time-step t4t_{4} when 3. applies. Iterating our analysis, we see that the size of the final set of discovered vertices Dt4D_{t_{4}} is as most

|Dt4|\displaystyle|D_{t_{4}}| (18(logn)k3)272k0\displaystyle\leq\left(18(\log n)k_{3}\right)^{2^{7}}2k_{0}
=(9(logn)(9(logn)(9(logn)(18(logn)k0)272k0)272k0)272k0)272k0\displaystyle=\left(9(\log n)\left(9(\log n)\left(9(\log n)\left(18(\log n)k_{0}\right)^{2^{7}}2k_{0}\right)^{2^{7}}2k_{0}\right)^{2^{7}}2k_{0}\right)^{2^{7}}2k_{0}
<(18(logn)k0)229<2210ε210(logn)231.\displaystyle<\left(18(\log n)k_{0}\right)^{2^{29}}<2^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}}.

This shows that the probability our process terminates with a large stop is O(n3)O(n^{-3}). ∎

Finally, we compute the probability that an exploration ends with an exceptional stop.

Lemma 5.8.

The probability that the exploration from v1v2v_{1}v_{2} terminates with an exceptional stop is O((logn)30231n5/2)O((\log n)^{30\cdot 2^{31}}n^{-5/2}).

Proof.

Suppose that the exploration from v1v2v_{1}v_{2} terminates at time TT with an exceptional stop. Then we must have discovered at least 55 exceptional edges at time-steps tTt\leq T when 3. applied, where we call an edge exceptional if it appeared in E(Γ[Dt])EtE(\Gamma[D_{t}])\setminus E_{t} (type 1) or as the third or above edge from some vertex zZt1z\in Z_{t}^{1} to DtD_{t} (type 2), and where such edges are ordered according to the ordering of the vertices of DtD_{t}.

Since the exploration did not terminate with a large stop, at each of these time steps we had |Dt|2210ε210(logn)231:=Δ|D_{t}|\leq 2^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}}:=\Delta at the start of each time-step tt. Also, since we did not terminate with a large stop, the time TT at which the process terminated must satisfy TΔ2T\leq\Delta^{2} (this is an upper bound on the number of pairs we could have tested at time steps where 1. or 3. applied).

In any time-step tt where 3. applies and we are testing for membership in one of the (at most five) sets Zt1Z^{1}_{t} considered in that turn, the probability that a vertex in V(Γ)DtV(\Gamma)\setminus D_{t} sends at least three edges of Γ\Gamma to the set DtD_{t} is at most

i=3|Dt|(|Dt|i)pi(1p)|Dt|ii=3ΔΔipi=O(Δ4p3).\sum_{i=3}^{|D_{t}|}\binom{|D_{t}|}{i}p^{i}(1-p)^{|D_{t}|-i}\leq\sum_{i=3}^{\Delta}\Delta^{i}p^{i}=O\left(\Delta^{4}p^{3}\right).

Since we have at most nn vertices in V(Γ)DtV(\Gamma)\setminus D_{t} and at most TΔ2T\leq\Delta^{2} time-steps to choose from, the probability of having found at least jj type 2 exceptional edges for some 1j51\leq j\leq 5 is

(5.6) O(Tj(nΔ4p3)j)=O(Δ6jnj/2).\displaystyle O\Bigl{(}T^{j}\left(n\Delta^{4}p^{3}\right)^{j}\Bigr{)}=O\left(\Delta^{6j}n^{-j/2}\right).

A similar (but simpler) calculation yields that the probability of having found 5j5-j edges of type 1 is:

(5.7) O(T5j(Δ2p)5j)=O(Δ4(5j)n(5j)/2).\displaystyle O\Bigl{(}T^{5-j}{\left(\Delta^{2}p\right)}^{5-j}\Bigr{)}=O\left(\Delta^{4(5-j)}n^{-(5-j)/2}\right).

Adding the bounds (5.6) and (5.7) together and substituting in the value of Δ\Delta, the Lemma follows. ∎

We are now in a position to prove Theorem 5.1, which, as previously noted, immediately implies Theorem 1.4.

Proof of Theorem 5.1.

Let v1v2v_{1}v_{2} be an arbitrary pair of vertices from V(Γ)V(\Gamma). By Lemmas 5.7 and 5.8, with probability 1O((logn)30231n5/2)=1o(n2)1-O((\log n)^{30\cdot 2^{31}}n^{-5/2})=1-o(n^{-2}) the exploration from v1v2v_{1}v_{2} terminates with an extinction stop. By Lemma 5.2 we obtain a bound on the size of each component of v1v2v_{1}v_{2} in (Γ)\boxtimes(\Gamma) found by this exploration, and a bound on its support as well. By a simple union bound, with probability 1o(1)1-o(1), all pairs v1v2v_{1}v_{2} lie in components of (Γ)\boxtimes(\Gamma) supported on sets of size at most 2210ε210(logn)2312^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}} in V(Γ)V(\Gamma). ∎

6. The supercritical regime: proof of Theorem 1.5

Fix λ>λc\lambda>\lambda_{c}. Suppose λn1/2p(n)(1f(n))\lambda n^{-1/2}\leq p(n)\leq(1-f(n)), where f(n)f(n) is a function with f(n)=o(1)f(n)=o(1) and f(n)=ω(n2)f(n)=\omega(n^{-2}). Let Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p). By [5, Theorem 5.1], we know that if p(n)5logn/np(n)\geq 5\sqrt{\log{n}/n}, then a.a.s. there is a square-component covering all of V(Γ)V(\Gamma). We may thus restrict our attention in the proof of Theorem 1.5 to the range λn1/2p(n)5logn/n\lambda n^{-1/2}\leq p(n)\leq 5\sqrt{\log{n}/n}. For such p(n)p(n), there exists ε>0\varepsilon>0 such that if ZBinom(n,p2)Z\sim\mathrm{Binom}(n,p^{2}), then

(6.1) 𝔼(Z2+3Z2)1+ε.\displaystyle\mathbb{E}\left(\frac{Z^{2}+3Z}{2}\right)\geq 1+\varepsilon.

We shall prove the existence of a giant square-component with full support in four stages: first, we define an exploration process in (Γ)\square(\Gamma). In a second stage, we analyse the process to show that a.a.s. a large proportion of non-edges of Γ\Gamma lie in “somewhat large” square-components of (Γ)\square(\Gamma). Next, in the (more involved) third stage of the argument, we perform vertex-sprinkling to show a.a.s. a large proportion of non-edges of Γ\Gamma lie in a giant square-component. Finally, we show there a.a.s. a giant square-component covering all of V(Γ)V(\Gamma).

6.1. An exploration process

We consider an exploration process consisting of the following data at each time t0t\geq 0:

  • (Discovered vertices.) An ordered set of vertices: Dt={v1,v2,,vdt}V(Γ)D_{t}=\{v_{1},v_{2},\ldots,v_{d_{t}}\}\subseteq V(\Gamma).

  • (Active pairs.) A set of pairs of vertices (ordered lexicographically with respect to the ordering on DtD_{t}): At={x1y1,x2y2,,xatyat}Dt(2)E(Γ)A_{t}=\{x_{1}y_{1},x_{2}y_{2},\ldots,x_{a_{t}}y_{a_{t}}\}\subseteq D_{t}^{(2)}\setminus E(\Gamma).

  • (Reached pairs.) A set of pairs of vertices: RtDt(2)E(Γ)R_{t}\subseteq D_{t}^{(2)}\setminus E(\Gamma).

These sets will satisfy:

  • (\star)

    for every active pair xiyiAtx_{i}y_{i}\in A_{t}, the vertices xix_{i} and yiy_{i} have at least 22 common neighbors in the subgraph of Γ\Gamma induced by DtD_{t}.

The initial state t=0t=0 of the exploration consists of an arbitrary induced C4C_{4} of Γ\Gamma, denoted v1v2v3v4v_{1}v_{2}v_{3}v_{4}, and the sets: D0={v1,v2,v3,v4}D_{0}=\{v_{1},v_{2},v_{3},v_{4}\}, A0={v1v3,v2v4}A_{0}=\{v_{1}v_{3},v_{2}v_{4}\}, and R0=R_{0}=\emptyset.

Our exploration then proceeds as follows:
1. If |Rt|+|At|>(logn)4|R_{t}|+|A_{t}|>(\log n)^{4}, then we terminate the process.
2. If |Rt|+|At|(logn)4|R_{t}|+|A_{t}|\leq(\log n)^{4} and AtA_{t}\neq\emptyset, then for each zV(Γ)Dtz\in V(\Gamma)\setminus D_{t} we test whether or not zz sends an edge in Γ\Gamma to both of {x1,y1}\{x_{1},y_{1}\} which are the vertices of the first pair a1=x1y1a_{1}=x_{1}y_{1} in the ordered set AtA_{t}. We then set Zt:={zV(Γ)Dt:zx1,zy1E(Γt)}Z_{t}:=\{z\in V(\Gamma)\setminus D_{t}:\ zx_{1},zy_{1}\in E(\Gamma_{t})\}. Denote by FtF_{t} the set of common neighbors of x1x_{1} and y1y_{1} in DtD_{t} (which by property ()(\star) has size at least 22).

We then set At+1=(At{x1y1})((FtZt)(2)(Ft(2)E(Γ)))A_{t+1}=\left(A_{t}\setminus\{x_{1}y_{1}\}\right)\cup\left(\left(F_{t}\cup Z_{t}\right)^{(2)}\setminus\left(F_{t}^{(2)}\cup E(\Gamma)\right)\right), Rt+1=Rt{x1y1}R_{t+1}=R_{t}\cup\{x_{1}y_{1}\} and Dt+1=DtZtD_{t+1}=D_{t}\cup Z_{t}. We then proceed to the next time-step in the exploration process, noting that property ()(\star) is maintained.
3. If |Rt|(logn)4|R_{t}|\leq(\log n)^{4} and At=A_{t}=\emptyset, then we terminate the process.

6.2. Many non-edges in somewhat large components

The exploration process defined in the previous subsection can terminate for one of two reasons:

  1. (1)

    (Large stop.) |Rt|+|At|>(logn)4|R_{t}|+|A_{t}|>(\log n)^{4}, or

  2. (2)

    (Extinction stop.) At=A_{t}=\emptyset.

The process always terminates after some number T(logn)4T\leq\left(\log n\right)^{4} of time-steps. By construction, at all times t0t\geq 0 the collection of pairs AtRtA_{t}\cup R_{t} is a subset of the square-component of v1v3v_{1}v_{3} and v2v4v_{2}v_{4} in (Γ)\square(\Gamma). Our aim is to show that with somewhat large probability |AT|+|RT|>(logn)4|A_{T}|+|R_{T}|>(\log n)^{4}.

Lemma 6.1.

At any time-step t0t\geq 0, the distribution conditional on the past history of the process of the random variable Xt=|At+1At|X_{t}=|A_{t+1}\setminus A_{t}| counting the number of new active pairs discovered by a1=x1y1a_{1}=x_{1}y_{1} stochastically dominates a random variable XX^{\prime} with mean 𝔼(X)1+ε+o(1)\mathbb{E}(X^{\prime})\geq 1+\varepsilon+o(1).

Proof.

Write EtE_{t} for the set of edges discovered by the process. As observed by Bollobás and Riordan [7, Inequality (3)], the past of the process is the intersection of the principal increasing event 𝒰={EtE(Γ)}\mathcal{U}=\{E_{t}\subseteq E(\Gamma)\} and a decreasing event 𝒟\mathcal{D} (corresponding to the intersection of a number of events of the form “at least one of zx,zyzx,zy is not in Γ\Gamma” for some previously tested pair xyxy). In particular, for xyAtxy\in A_{t} and zV(Γ)Dtz\in V(\Gamma)\setminus D_{t},

(xz,yzE(Γ)|𝒟𝒰)=(xz,yzE(Γ)|𝒟).\mathbb{P}\left(xz,yz\in E(\Gamma)|\mathcal{D}\cap\mathcal{U}\right)=\mathbb{P}\left(xz,yz\in E(\Gamma)|\mathcal{D}\right).

Let 𝒟(z)\mathcal{D}^{\prime}(z) denote the decreasing event

𝒟(z)=xyDt(2){xy}{at least one of zx,zy is not in Γ}.\mathcal{D}^{\prime}(z)=\bigcap_{x^{\prime}y^{\prime}\in D_{t}^{(2)}\setminus\{xy\}}\{\textrm{at least one of $zx^{\prime},zy^{\prime}$ is not in $\Gamma$}\}.

Note that the event {xz,yzE(Γ)}\{xz,yz\in E(\Gamma)\} is independent of 𝒟𝒟(z)\mathcal{D}\setminus\mathcal{D}^{\prime}(z). Using this fact and appealing to Harris’s Lemma [13], we have

(xz,yzE(Γ)|𝒟)=(xz,yzE(Γ)|𝒟𝒟(z))(xz,yzE(Γ)|𝒟(z)).\mathbb{P}\left(xz,yz\in E(\Gamma)|\mathcal{D}\right)=\mathbb{P}\left(xz,yz\in E(\Gamma)|\mathcal{D}\cap\mathcal{D}^{\prime}(z)\right)\geq\mathbb{P}\left(xz,yz\in E(\Gamma)|\mathcal{D}^{\prime}(z)\right).

Now conditional on 𝒟\mathcal{D}^{\prime}, the probability that both xzxz and yzyz are in E(Γ)E(\Gamma) is readily computed: it is equal to

p2(1p)|Dt|2p2(1p)|Dt|2+2p(1p)(1p)|Dt|2+(1p)2((1p)|Dt|2+(|Dt|2)p(1p)|Dt|3)\displaystyle\frac{p^{2}(1-p)^{|D_{t}|-2}}{p^{2}(1-p)^{|D_{t}|-2}+2p(1-p)(1-p)^{|D_{t}|-2}+(1-p)^{2}\left((1-p)^{|D_{t}|-2}+(|D_{t}|-2)p(1-p)^{|D_{t}|-3}\right)}

which is equal to p2(1+O(|Dt|p))=p2+o(p2)p^{2}\left(1+O(|D_{t}|p)\right)=p^{2}+o(p^{2}) (since|Dt|2(|Rt|+|At|)2(logn)4|D_{t}|\leq 2\left(|R_{t}|+|A_{t}|\right)\leq 2(\log n)^{4}).

The indicator function of the event Yz={zZt}Y_{z}=\{z\in Z_{t}\} thus stochastically dominates a Bernoulli random variable with mean p2+o(p2)p^{2}+o(p^{2}). Further the (Yz)zV(Γ)Dt(Y_{z})_{z\in V(\Gamma)\setminus D_{t}} are independent events given our conditioning (since each such event is only affected by the state of edges from DtD_{t} to zz). The random variable |Zt||Z_{t}| thus stochastically dominates a random variable ZBinom(n|Dt|,p2+o(p2))Z^{\prime}\sim\mathrm{Binom}(n-|D_{t}|,p^{2}+o(p^{2})). Let XX^{\prime} denote the sum of (Z)2+3Z2\frac{(Z^{\prime})^{2}+3Z^{\prime}}{2} independent Bernoulli random variables with parameter 1p1-p. For |Dt|2(logn)4|D_{t}|\leq 2(\log n)^{4}, we have

𝔼|At+1At|\displaystyle\mathbb{E}|A_{t+1}\setminus A_{t}| =𝔼(|Zt|2+3|Zt|2|E(Γ)(FtZt)(2)|)\displaystyle=\mathbb{E}\left(\frac{|Z_{t}|^{2}+3|Z_{t}|}{2}-|E(\Gamma)\cap(F_{t}\cup Z_{t})^{(2)}|\right)
𝔼X=(1p)𝔼(|Z|2+3|Z|2)1+ε+o(1),\displaystyle\geq\mathbb{E}X^{\prime}=(1-p)\mathbb{E}\left(\frac{|Z^{\prime}|^{2}+3|Z^{\prime}|}{2}\right)\geq 1+\varepsilon+o(1),

where the inequality follows from the stochastic domination of ZZ^{\prime} by ZtZ_{t}, and where in the last line we have used (6.1) and the fact that a binomial distribution with parameters n|Dt|n-|D_{t}| and p2+o(p2)p^{2}+o(p^{2}) is close to Binom(n,p2)\mathrm{Binom}(n,p^{2}). ∎

Let θe=θe(n,p)\theta_{e}=\theta_{e}(n,p) denote the extinction probability of the supercritical branching process 𝐖\mathbf{W} with the offspring distribution XX^{\prime} given in the proof of Lemma 6.1. Note that by Proposition 4.2(b), θe\theta_{e} is bounded away from 11.

Up to the time when it terminates, our exploration process on the square-component of v1v2v3v4v_{1}v_{2}v_{3}v_{4} stochastically dominates 𝐖\mathbf{W}. We now use this fact to show many non-edges of Γ\Gamma lie in “somewhat large” square-components.

Lemma 6.2 (Many squares in large square-components).

Fix λ>λc\lambda>\lambda_{c}, p(n)p(n) satisfying λn1/2p(n)5n1/2logn\lambda n^{-1/2}\leq p(n)\leq 5n^{-1/2}\sqrt{\log n}, and θe=θe(n,p)\theta_{e}=\theta_{e}(n,p) as above. Then, with probability 1o(n2)1-o(n^{-2}) the number NN of induced C4C_{4}s in Γ\Gamma which are part of square-components of order at least (logn)4(\log n)^{4} satisfies

N=(1+o(1))𝔼N3p4(1p)2(n4)(1θe)(1+o(1)).N=(1+o(1))\mathbb{E}N\geq 3p^{4}(1-p)^{2}\binom{n}{4}(1-\theta_{e})(1+o(1)).
Proof.

Given a collection of 44 vertices SV(Γ)(4)S\in V(\Gamma)^{(4)}, let ESE_{S} be the indicator function of the event that Γ[S]C4\Gamma[S]\cong C_{4} and that our exploration process from SS terminates with a large stop (which is equivalent to SS being part of a square-component of order at least (logn)4(\log n)^{4}). Conditional on Γ[S]C4\Gamma[S]\cong C_{4}, Lemma 6.1 implies that (ES=1)1θe\mathbb{P}(E_{S}=1)\geq 1-\theta_{e} (which is the probability that the branching process 𝐖\mathbf{W} does not become extinct). Applying Wald’s identity, the expectation μN\mu_{N} of NN thus satisfies

μN=𝔼N\displaystyle\mu_{N}=\mathbb{E}N =𝔼SV(Γ)(4)ES=SV(Γ)(4)(Γ[S]C4)(ES=1|Γ[S]C4)\displaystyle=\mathbb{E}\sum_{S\in V(\Gamma)^{(4)}}E_{S}=\sum_{S\in V(\Gamma)^{(4)}}\mathbb{P}(\Gamma[S]\cong C_{4})\mathbb{P}\left(E_{S}=1|\Gamma[S]\cong C_{4}\right)
(6.2) 3p4(1p)2(n4)(1θe).\displaystyle\geq 3p^{4}(1-p)^{2}\binom{n}{4}(1-\theta_{e}).

We now use Chebyshev’s inequality to show NN is concentrated around its mean. To do this, we must bound 𝔼N2=S,SV(Γ)(4)𝔼ESES\mathbb{E}N^{2}=\sum_{S,S^{\prime}\in V(\Gamma)^{(4)}}\mathbb{E}E_{S}E_{S^{\prime}}. Consider two collections of 44 vertices S,SV(Γ)(4)S,S^{\prime}\in V(\Gamma)^{(4)}.

Claim 1.

If SS=S\cap S^{\prime}=\emptyset, then ESE_{S} and ESE_{S^{\prime}} satisfy

𝔼(ESES)=𝔼(ES)𝔼(ES)+O((logn)9n1𝔼(ES)).\mathbb{E}\left(E_{S}E_{S^{\prime}}\right)=\mathbb{E}\left(E_{S}\right)\mathbb{E}\left(E_{S^{\prime}}\right)+O\left((\log n)^{9}n^{-1}\mathbb{E}(E_{S})\right).
Proof.

Our claim is that ESE_{S} and ESE_{S^{\prime}} are essentially independent. Indeed, let us first perform our exploration process from SS (stopping immediately if Γ[S]\Gamma[S] does not induce a copy of C4C_{4}). For ZBinom(n,p2)Z\sim\mathrm{Binom}(n,p^{2}) (and λn1/2p5n1/2(log)1/2\lambda n^{-1/2}\leq p\leq 5n^{-1/2}(\log)^{-1/2} as everywhere in this section), we have

(6.3) (Z27logn)=r=27lognn(nr)p2r(1p2)nrr27logn(enr25lognn)r=o(n5).\displaystyle\mathbb{P}(Z\geq 2^{7}\log n)=\sum_{r=\lceil 2^{7}\log n\rceil}^{n}\binom{n}{r}p^{2r}(1-p^{2})^{n-r}\leq\sum_{r\geq 2^{7}\log n}\left(\frac{en}{r}\cdot 25\frac{\log n}{n}\right)^{r}=o(n^{-5}).

Thus with probability 1o(n3)1-o(n^{-3}), the number of vertices added to DtD_{t} in the last stage of the exploration process from SS is at most 27logn2^{7}\log n, implying that the set DSD_{S} of vertices discovered by the process from SS has size at most 2(logn)4+27logn2(\log n)^{4}+2^{7}\log n. Further, the exploration process from SS tests at most (logn)4\left(\log n\right)^{4} pairs in total. This allows us to bound the probability that the exploration process from SS^{\prime} interacts with the exploration process from SS.

First of all, by Markov’s inequality the probability that a vertex in SS^{\prime} is discovered by the process from SS is at most

4p2(logn)4=O((logn)5n1).4p^{2}\left(\log n\right)^{4}=O\left((\log n)^{5}n^{-1}\right).

Secondly, the exploration process from SS^{\prime} tests at most (logn)4(\log n)^{4} pairs xyxy. By Markov’s inequality again, the probability that some zDSz\in D_{S} sends an edge to both vertices in such a pair is at most

(logn)4|DS|p2=O((log)9n1).(\log n)^{4}|D_{S}|p^{2}=O\left((\log)^{9}n^{-1}\right).

In particular, the probability of ES=1E_{S^{\prime}}=1 given ES=1E_{S}=1 differs from 𝔼ES\mathbb{E}E_{S^{\prime}} by at most O((logn)9n1O((\log n)^{9}n^{-1}, as claimed. ∎

Claim 2.

For any SV(Γ)(4)S\in V(\Gamma)^{(4)}, we have

SV(Γ)(4):SS𝔼(ESES)=O(n3p4𝔼(ES)).\displaystyle\sum_{S^{\prime}\in V(\Gamma)^{(4)}:\ S\cap S^{\prime}\neq\emptyset}\mathbb{E}\left(E_{S}E_{S^{\prime}}\right)=O\left(n^{3}p^{4}\mathbb{E}(E_{S})\right). ()\displaystyle(\dagger)
Proof.

Fix SS and consider the various ways in which SS and SS^{\prime} could intersect non-trivially.

  • There is one choice of SS^{\prime} with S=SS=S^{\prime}, for which we have 𝔼(ESES)=𝔼(S)\mathbb{E}(E_{S}E_{S^{\prime}})=\mathbb{E}(S).

  • Next, we have at most 4n4n choices of SS^{\prime} with |SS|=3|S\cap S^{\prime}|=3. Write S={a,b,c,d}S=\{a,b,c,d\} and S={a,b,c,d}S^{\prime}=\{a,b,c,d^{\prime}\}. For ESESE_{S}E_{S^{\prime}} to be non-zero, both SS and SS^{\prime} must induce copies of C4C_{4} in Γ\Gamma and moreover ESE_{S} must occur. This is only possible if ES=1E_{S}=1 and dd^{\prime} sends edges to the two neighbors of dd in {a,b,c}\{a,b,c\}. Arguing as in Claim 1, these two events are almost independent and occur with probability (1+o(1))p2𝔼ES(1+o(1))p^{2}\mathbb{E}E_{S}. Thus the contribution of SS^{\prime} with |SS|=3|S\cap S^{\prime}|=3 to the left-hand side of ()(\dagger) is at most O(np2𝔼(ES))O(np^{2}\mathbb{E}(E_{S})).

  • There are at most 4n34n^{3} choices of SS^{\prime} with |SS|=1|S\cap S^{\prime}|=1. For ESESE_{S}E_{S^{\prime}} to be non-zero, it is necessary for SS^{\prime} to induce a copy of C4C_{4} in Γ\Gamma and for ES=1E_{S}=1. Arguing as in Claim 1, these two events are almost independent and occur with probability (1+o(1))p4𝔼ES(1+o(1))p^{4}\mathbb{E}E_{S}. Thus the contribution of SS^{\prime} with |SS|=1|S\cap S^{\prime}|=1 to the left-hand side of ()(\dagger) is at most O(n3p4𝔼(ES))O(n^{3}p^{4}\mathbb{E}(E_{S})).

  • Finally, there are at most 6n26n^{2} choices of SS with |SS|=2|S\cap S^{\prime}|=2. For ESESE_{S}E_{S^{\prime}} to be non-zero, it is necessary for the vertices in SSS^{\prime}\setminus S to be incident at least three edges in Γ[S]\Gamma[S^{\prime}] and for ES=1E_{S}=1. Arguing as in Claim 1, these two events are almost independent and occur with probability at most (1+o(1))p3𝔼ES(1+o(1))p^{3}\mathbb{E}E_{S}. Thus the contribution of SS^{\prime} with |SS|=2|S\cap S^{\prime}|=2 to the left-hand side of ()(\dagger) is at most O(n2p3𝔼(ES))O(n^{2}p^{3}\mathbb{E}(E_{S})).

Since np=ω(1)np=\omega(1), we have O(1+np2+n2p3+n3p4)=O(n3p4)O(1+np^{2}+n^{2}p^{3}+n^{3}p^{4})=O(n^{3}p^{4}), and the analysis above shows the left-hand side of ()(\dagger) is at most O(n2p3𝔼(ES))O(n^{2}p^{3}\mathbb{E}(E_{S})), as claimed. ∎

Together, Claims 1 and 2 imply 𝔼N2(𝔼N)2+O(n3p4𝔼N)\mathbb{E}N^{2}\leq\left(\mathbb{E}N\right)^{2}+O\left(n^{3}p^{4}\mathbb{E}N\right). By inequality 6.2, we know μN=𝔼N=Ω(n4p4)\mu_{N}=\mathbb{E}N=\Omega(n^{4}p^{4}). Since p=Ω(n1/2)p=\Omega(n^{-1/2}), it follows that

Var(N)=O(n2p3𝔼(N))=O(μNn)=O((μN)2n3).\mathrm{Var}(N)=O\left(n^{2}p^{3}\mathbb{E}(N)\right)=O\left(\frac{\mu_{N}}{n}\right)=O\left(\frac{(\mu_{N})^{2}}{n^{3}}\right).

Applying Chebyshev’s inequality yields that with probability at least 1o(n2)1-o(n^{-2}),

N=(1o(1))μN3p4(1p)2(n4)(1θe)(1o(1)),N=(1-o(1))\mu_{N}\geq 3p^{4}(1-p)^{2}\binom{n}{4}(1-\theta_{e})(1-o(1)),

as desired. ∎

Corollary 6.3.

Let λ>λc\lambda>\lambda_{c} be fixed, and let p=p(n)p=p(n) be an edge-probability satisfying λn1/2p5n1/2logn\lambda n^{-1/2}\leq p\leq 5n^{-1/2}\sqrt{\log n}. Then for all ε1>0\varepsilon_{1}>0 sufficiently small, there exist a constant ε2>0\varepsilon_{2}>0 such that if Γ1𝒢((1ε1)n,p(n))\Gamma_{1}\in{\mathcal{G}}((1-\varepsilon_{1})n,p(n)), then with probability 1O(n1)1-O(n^{-1}) the number NvN_{v} of non-edges of Γ1\Gamma_{1} that lie in square-components of (Γ1)\square(\Gamma_{1}) of order at least (logn)4(\log n)^{4}, satisfies

Nv=(1+o(1))𝔼(Nv)ε2n2.N_{v}=(1+o(1))\mathbb{E}(N_{v})\geq\varepsilon_{2}n^{2}.
Proof.

Let λ=(λ+λc)/2\lambda^{\prime}=(\lambda+\lambda_{c})/2. For ε1\varepsilon_{1} sufficiently small, we have p(n)λ(n(1ε1))1/2p(n)\geq\lambda^{\prime}(n(1-\varepsilon_{1}))^{-1/2}. We now consider Γ1𝒢(n(1ε1),p)\Gamma_{1}\in{\mathcal{G}}(n(1-\varepsilon_{1}),p).

Ideally, we would now like to directly apply Lemma 6.2 in Γ1\Gamma_{1}. However, to ensure the stochastic domination in Lemma 6.1, we started our exploration process from an induced C4C_{4} rather than a non-edge — so we know that Ω(n4p4)\Omega(n^{4}p^{4}) induced C4C_{4}s are part of square-components of order at least (logn)4(1+o(1))(\log n)^{4}(1+o(1)) whereas we want to show Ω(n2)\Omega(n^{2}) non-edges lie in such components. Since some non-edges could have as many as Ω(logn)\Omega(\log n) common neighbors in Γ1\Gamma_{1}, it would in principle be possible for pp of order n1/2n^{-1/2} that, for example, the collection of the diagonals of the induced C4C_{4}s contained in such “large” components consists of a set of only O(n2/(logn)2)O(n^{2}/(\log n)^{2}) non-edges. We must thus rule out situation.

The simplest way to do this is to run through our proof of Lemma 6.2 again, but this time for the variant of our exploration process from Section 6.1 where we begin with an arbitrary non-edge v1v2v_{1}v_{2} of Γ1\Gamma_{1}, set D0={v1,v2}D_{0}=\{v_{1},v_{2}\}, A0={v1v2}A_{0}=\{v_{1}v_{2}\} and R0=R_{0}=\emptyset. We say such an exploration survives infancy if at the first time-step the pair v1v2v_{1}v_{2} discovers a set Z1Z_{1} of joint neighbors that spans at least one non-edge v3v4v_{3}v_{4} of Γ\Gamma.

For pp in the range we are considering the random graph Γ\Gamma a.a.s. does not contain a complete graph on 66 vertices, and we can use this to give a constant order lower bound on θS\theta_{S}, the probability the process survives infancy:

θS\displaystyle\theta_{S} (|Γv1Γv2|6|v1v2E(Γ))(Γ contain a clique on 6 vertices)\displaystyle\geq\mathbb{P}\left(|\Gamma_{v_{1}}\cap\Gamma_{v_{2}}|\geq 6\ |v_{1}v_{2}\notin E(\Gamma)\right)-\mathbb{P}\left(\Gamma\textrm{ contain a clique on $6$ vertices}\right)
(n26)p12(1p2)n8(n6)p(62)=Θ((np2)6)=Θ(1).\displaystyle\geq\binom{n-2}{6}p^{12}(1-p^{2})^{n-8}-\binom{n}{6}p^{\binom{6}{2}}=\Theta((np^{2})^{6})=\Theta(1).

Conditional on surviving infancy, by Lemma 6.1 the exploration process from v3v4v_{3}v_{4} stochastically dominates a supercritical branching process 𝐖\mathbf{W} with extinction probability θe=θe((1ε1)n,p)\theta_{e}=\theta_{e}((1-\varepsilon_{1})n,p). Applying Wald’s identity, this implies that the number NvN_{v} of non-edges of Γ1\Gamma_{1} that belong to square-components of order at least (logn)4(\log n)^{4} satisfies

𝔼(Nv)((1ε1)n2)(1p)θS(1θe)=Ω(n2).\mathbb{E}(N_{v})\geq\binom{(1-\varepsilon_{1})n}{2}(1-p)\theta_{S}(1-\theta_{e})=\Omega(n^{2}).

We now bound 𝔼(NV)2\mathbb{E}(N_{V})^{2} much as we did in Lemma 6.2. Given a pair xyV(Γ)(2)xy\in V(\Gamma)^{(2)}, write ExyE_{xy} for the event that xyxy is a non-edge and that our exploration process from xyxy terminates with a large stop. Claim 1 from the proof of Lemma 6.2 shows mutatis mutandis that if {x,y}{x,y}=\{x,y\}\cap\{x^{\prime},y^{\prime}\}=\emptyset then

𝔼(ExyExy)=𝔼(Exy)𝔼(Exy)+O((logn)9n1).\mathbb{E}\left(E_{xy}E_{x^{\prime}y^{\prime}}\right)=\mathbb{E}(E_{xy})\mathbb{E}(E_{x^{\prime}y^{\prime}})+O\left((\log n)^{9}n^{-1}\right).

For non-disjoint pairs {x,y}\{x,y\} and {x,y}\{x^{\prime},y^{\prime}\}, the situation is actually easier than it was in Claim 2: such pairs contribute at most 2n𝔼Nv2n\mathbb{E}N_{v} to 𝔼((Nv)2)\mathbb{E}\left((N_{v})^{2}\right). Thus

Var(Nv)=O(n𝔼(Nv))=O(n1(𝔼(Nv))2),\mathrm{Var}(N_{v})=O\left(n\mathbb{E}(N_{v})\right)=O\left(n^{-1}(\mathbb{E}(N_{v}))^{2}\right),

and we conclude that with probability 1O(n1)1-O(n^{-1}) there are (1+o(1))𝔼(Nv)=Ω(n2)(1+o(1))\mathbb{E}(N_{v})=\Omega(n^{2}) non-edges contained in square-components of order at least (logn)4(\log n)^{4}. The Corollary then follows from a suitable choice of the constant ε2\varepsilon_{2}. ∎

6.3. A connecting lemma

The key to our sprinkling argument is the following, which we use to connect the somewhat large square-components into even larger square-components. We connect square-components by sprinkling in vertices, and looking for complete bipartite graphs with bipartition {x1,x2,y1,y2}{z1,z2}\{x_{1},x_{2},y_{1},y_{2}\}\sqcup\{z_{1},z_{2}\}, where x1x2x_{1}x_{2}, y1y2y_{1}y_{2} are non-edges in distinct square-components, and z1z2z_{1}z_{2} is a non-edge inside the set of newly sprinkled vertices — see Figure 3 below.

Recall that a pp-random bipartite graph with partition VWV\sqcup W is a graph on the vertex set VWV\sqcup W obtained by including each pair {v,w}\{v,w\} with vV,wWv\in V,w\in W as an edge independently at random with probability pp.

Lemma 6.4 (Connecting Lemma).

Let λ>λc\lambda>\lambda_{c}, δ(0,12)\delta\in(0,\frac{1}{2}) and ε1,ε2>0\varepsilon_{1},\varepsilon_{2}>0 be fixed. Let VV be a set of (1δ)n(1-\delta)n vertices, and WW be a set of ε1n2log2n\frac{\varepsilon_{1}n}{2\log_{2}n} vertices disjoint from VV. Suppose we are given disjoint subsets C1,C2,CrC_{1},C_{2},\ldots C_{r} of V(2)V^{(2)} and a subset SW(2)S\subseteq W^{(2)} with the following properties:

  1. (1)

    |S|(ε1)2n28(log2n)2|S|\geq\frac{(\varepsilon_{1})^{2}n^{2}}{8(\log_{2}n)^{2}};

  2. (2)

    |Ci|M|C_{i}|\geq M for every ii: 1ir1\leq i\leq r, and some MM satisfying: (logn)4Mε24n2(\log n)^{4}\leq M\leq\frac{\varepsilon_{2}}{4}n^{2};

  3. (3)

    i|Ci|ε2n2\sum_{i}|C_{i}|\geq\varepsilon_{2}n^{2}.

Let p=p(n)p=p(n) be an edge probability with

λ1n<p(n)<5lognn.\lambda\frac{1}{\sqrt{n}}<p(n)<5\frac{\sqrt{\log n}}{\sqrt{n}}.

Consider the pp-random bipartite graph Bp(V,W)B_{p}(V,W) with bipartition VWV\sqcup W. Let Boost be the event that for every CiC_{i} with |Ci|2M|C_{i}|\leq 2M there exists CjCiC_{j}\neq C_{i} and a triple (x1x2,y1y2,z1z2)Ci×Cj×S(x_{1}x_{2},y_{1}y_{2},z_{1}z_{2})\in C_{i}\times C_{j}\times S such that the restriction Bp({x1,x2,y1,y2},{z1,z2})B_{p}(\{x_{1},x_{2},y_{1},y_{2}\},\{z_{1},z_{2}\}) of Bp(V,W)B_{p}(V,W) to {x1,x2,y1,y2}{z1,z2}\{x_{1},x_{2},y_{1},y_{2}\}\sqcup\{z_{1},z_{2}\} is complete. Then for all nn sufficiently large we have

(Boost)1exp(ε2(ε1)2216(logn)2).\mathbb{P}(\textrm{Boost})\geq 1-\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right).

The proof of the connecting lemma relies on a celebrated inequality of Janson and some careful book-keeping.

Proposition 6.5 (The extended Janson inequality [14]).

Let UU be a finite set and UqU_{q} a qq-random subset of UU for some q[0,1]q\in[0,1]. Let \mathcal{F} be a family of subsets of UU, and for every FF\in\mathcal{F} let IFI_{F} be the indicator function of the event {FUq}\{F\subseteq U_{q}\}. Set I=FIFI_{\mathcal{F}}=\sum_{F\in\mathcal{F}}I_{F}, and let μ=𝔼I\mu=\mathbb{E}I_{\mathcal{F}} and Δ=F,F:FF𝔼(IFIF)\Delta=\sum_{F,F^{\prime}\in\mathcal{F}:\ F\cap F\ \neq\emptyset}\mathbb{E}(I_{F}I_{F}^{\prime}). Then

(I=0)exp(μ22Δ).\mathbb{P}\left(I_{\mathcal{F}}=0\right)\leq\exp\left(-\frac{\mu^{2}}{2\Delta}\right).
Proof of Lemma 6.4.

Fix CiC_{i} with M|Ci|2MM\leq|C_{i}|\leq 2M. Set M=min(2M,n)M^{\prime}=\min(2M,n). Let 0\mathcal{F}_{0} denote the collection of connecting triples (x1x2,y1y2,z1z2)Ci×jiCj×S(x_{1}x_{2},y_{1}y_{2},z_{1}z_{2})\in C_{i}\times\bigcup_{j\neq i}C_{j}\times S. Further let

={{xizj:i,j[2]}{yizj:i,j[2]}:(x1x2,y1y2,z1z2)0}.\mathcal{F}=\{\{x_{i}z_{j}:\ i,j\in[2]\}\cup\{y_{i}z_{j}:\ i,j\in[2]\}:\ (x_{1}x_{2},y_{1}y_{2},z_{1}z_{2})\in\mathcal{F}_{0}\}.

Observe that the elements of \mathcal{F} are subsets of either 66 or 88 edges (depending on whether the pairs x1x2x_{1}x_{2} and y1y2y_{1}y_{2} overlap or not) of the complete bipartite graph B(V,W)B(V,W) with bipartition VWV\sqcup W. We shall apply Janson’s inequality to \mathcal{F} to give an upper bound on the probability that CiC_{i} does not connect to jiCj\bigcup_{j\neq i}C_{j} via a pair of squares of Bp(V,W)B_{p}(V,W). To this end, we must compute and bound the μ\mu and Δ\Delta parameters for \mathcal{F}. The first of these is straightforward:

(6.4) μ:\displaystyle\mu: =𝔼I=F𝔼IF|Ci|.|jiCj|.|S|p8Mε2(ε1)28n4(logn)2p8\displaystyle=\mathbb{E}I_{\mathcal{F}}=\sum_{F\in\mathcal{F}}\mathbb{E}I_{F}\geq|C_{i}|.\bigl{|}\bigcup_{j\neq i}C_{j}\bigr{|}.|S|p^{8}\geq M\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{8}\frac{n^{4}}{(\log n)^{2}}p^{8}

To bound the Δ\Delta parameter, fix a connecting triple t=(x1x2,y1y2,z1z2)Ci×jiCj×St=(x_{1}x_{2},y_{1}y_{2},z_{1}z_{2})\in C_{i}\times\bigcup_{j\neq i}C_{j}\times S, and consider the contribution to Δ\Delta made by pairs (t,t)(t,t^{\prime}) of connecting triples that share at least one edge of B(V,W)B(V,W); call such pairs of connecting triples dependent.

Write L(t)L(t) for the set {x1,x2,y1,y2}\{x_{1},x_{2},y_{1},y_{2}\} (which can have size either 44 or 33 — the latter if one of the xix_{i} is equal to one of the yjy_{j}) and R(t)R(t) for the pair {z1,z2}\{z_{1},z_{2}\}. Also let FtF_{t}\in\mathcal{F} be the collection of edges of B(V,W)B(V,W) from L(t)L(t) to R(t)R(t). Clearly if L(t)L(t)=L(t)\cap L(t^{\prime})=\emptyset or R(t)R(t)=R(t)\cap R(t^{\prime})=\emptyset, then (t,t)(t,t^{\prime}) do not form a pair of dependent connecting triples.

Fix a connecting triple tt. For (i,j)[4]×[2](i,j)\in[4]\times[2], let Di,j(t)D_{i,j}(t) denote the collection of connecting triples tt^{\prime} with |L(t)L(t)|=i|L(t)\cap L(t^{\prime})|=i and |R(t)R(t)|=j|R(t)\cap R(t^{\prime})|=j. Further let Di,ja(t)D_{i,j}^{a}(t) and Di,jb(t)D_{i,j}^{b}(t) denote the collection of tt^{\prime} in Di,j(t)D_{i,j}(t) with |L(t)|=4|L(t)|=4 and |L(t)|=3|L(t^{\prime})|=3 respectively. We shall bound the sizes of the sets Di,ja(t)D_{i,j}^{a}(t) and Di,jb(t)D_{i,j}^{b}(t). Note to begin with that there are at most 2ε1nlogn\frac{2\varepsilon_{1}n}{\log n} ways of deleting a vertex in R(t)R(t) and replacing it by a different vertex in WW. In particular, for all connecting triples tt and all i[|L(t)|]i\in[|L(t)|], we have

|Di,1a(t)||Di,2a(t)|2ε1nlogn\displaystyle|D_{i,1}^{a}(t)|\leq|D_{i,2}^{a}(t)|\cdot\frac{2\varepsilon_{1}n}{\log n} and |Di,1b(t)||Di,2b(t)|2ε1nlogn.\displaystyle|D_{i,1}^{b}(t)|\leq|D_{i,2}^{b}(t)|\cdot\frac{2\varepsilon_{1}n}{\log n}.

Thus we may focus on bounding the sizes of Di,ja(t)D_{i,j}^{a}(t) and Di,jb(t)D_{i,j}^{b}(t) in the case where j=2j=2.

Case 1, |L(t)|=4|L(t)|=4:

  • there are at most 66 ways of splitting L(t)L(t) into a pair from CiC_{i} and a pair from jiCj\bigcup_{j\neq i}C_{j}, and at most 2424 ways of deleting a vertex from L(t)L(t) and viewing the remaining 33 vertices as the union of a pair from CiC_{i} and an (overlapping) pair from jiCj\bigcup_{j\neq i}C_{j}, whence |D4,2a(t)|6|D_{4,2}^{a}(t)|\leq 6 and |D3,2b(t)|24|D_{3,2}^{b}(t)|\leq 24;

  • there are at most 4n4n ways of deleting one vertex from L(t)L(t) and replacing it by another vertex from VV. As noted above, there are most 66 ways of splitting the resulting 44-set into a pair from CiC_{i} and a pair from jiCj\bigcup_{j\neq i}C_{j}, whence |D3,2a(t)|24n|D_{3,2}^{a}(t)|\leq 24n;

  • there are at most 6(n2/2)=3n26(n^{2}/2)=3n^{2} ways of deleting two vertices from L(t)L(t) and replacing them by two other vertices from VV, whence (similarly to the above) we have |D2,2a(t)|18n2|D_{2,2}^{a}(t)|\leq 18n^{2}; further, there are at most 6n6n ways of deleting a pair of vertices from VV and replacing them by a single vertex from VV, whence (similarly to the above, since there are most 66 ways of viewing three vertices of a pair from CiC_{i} and an (overlapping) pair from jiCj\bigcup_{j\neq i}C_{j}) we get |D2,2b(t)|36n|D_{2,2}^{b}(t)|\leq 36n;

  • since CiC_{i} contains at most 2M2M pairs, there are at most 4(n2/2)M=2n2M4(n^{2}/2)M^{\prime}=2n^{2}M^{\prime} ways of deleting three vertices in L(t)L(t) and replacing them by another three vertices from VV in such a way that the resulting set can still be viewed as the union of a pair from CiC_{i} and a pair from jiCj\bigcup_{j\neq i}C_{j}, whence (similarly to the above) we have |D1,2a(t)|12n2M|D_{1,2}^{a}(t)|\leq 12n^{2}M^{\prime}; further and similarly there are at most 4M+4Mn8Mn4M+4M^{\prime}n\leq 8M^{\prime}n ways of deleting three vertices in L(t)L(t) and replacing them by a pair of vertices from VV, whence (as before) we have |D1,2b(t)|48Mn|D_{1,2}^{b}(t)|\leq 48M^{\prime}n;

Case 2, |L(t)|=3|L(t)|=3:

  • there are at most 66 ways of splitting L(t)L(t) into a pair from from CiC_{i} and a pair from jiCj\bigcup_{j\neq i}C_{j}, whence |D3,2b(t)|6|D_{3,2}^{b}(t)|\leq 6; further there are at most 6n6n ways of adding a vertex to L(t)L(t) and splitting the resulting 44-set into two disjoint pairs, whence |D3,2a(t)|6n|D_{3,2}^{a}(t)|\leq 6n;

  • there are at most 3n3n ways of deleting one vertex from L(t)L(t) and replacing it by another vertex from VV, whence (as above) |D2,2b(t)|18n|D_{2,2}^{b}(t)|\leq 18n; further, there are at most 3(n2/2)3(n^{2}/2) ways of deleting one vertex from L(t)L(t) and replacing it by a pair from VV, whence (as above) |D2,2a(t)|9n2|D_{2,2}^{a}(t)|\leq 9n^{2};

  • since CiC_{i} contains at most 2M2M pairs, there are at most 3Mn3M^{\prime}n ways of deleting two vertices in L(t)L(t) and replacing them by another two vertices from VV in such a way that the resulting set can still be viewed as the union of a pair from CiC_{i} and a pair from jiCj\bigcup_{j\neq i}C_{j}, whence |D1,2b(t)|18Mn|D_{1,2}^{b}(t)|\leq 18M^{\prime}n; similarly, there are at most 3M(n2/2)3M^{\prime}(n^{2}/2) ways of deleting two vertices in L(t)L(t) and replacing them by a triple of vertices from VV in such a way that the resulting set can be viewed as the disjoint union of a pair from CiC_{i} and a pair from jiCj\bigcup_{j\neq i}C_{j}, whence |D1,2a(t)|9Mn2|D_{1,2}^{a}(t)|\leq 9M^{\prime}n^{2}. .

Given tDi,ja(t)t^{\prime}\in D_{i,j}^{a}(t) and considering the edges between L(t)L(t)L(t)\cup L(t^{\prime}) and R(t)R(t)R(t)\cup R(t^{\prime}), we see that

(6.5) 𝔼IFtIFt=𝔼IFtp2(4i)+i(2j)=𝔼IFtp8ij.\displaystyle\mathbb{E}I_{F_{t}}I_{F_{t^{\prime}}}=\mathbb{E}I_{F_{t}}p^{2(4-i)+i(2-j)}=\mathbb{E}I_{F_{t}}p^{8-ij}.

Similarly, for tDi,jb(t)t^{\prime}\in D^{b}_{i,j}(t) we have

(6.6) 𝔼IFtIFt=𝔼IFtp2(3i)+i(2j)=𝔼IFtp6ij.\displaystyle\mathbb{E}I_{F_{t}}I_{F_{t^{\prime}}}=\mathbb{E}I_{F_{t}}p^{2(3-i)+i(2-j)}=\mathbb{E}I_{F_{t}}p^{6-ij}.

With the bounds on the size of Di,ja(t)D_{i,j}^{a}(t) and Di,jb(t)D_{i,j}^{b}(t) derived above and equalities (6.5) and (6.6) in hand, we are now ready to bound the contribution to Δ\Delta from a connecting triple tt.

Case 1: |L(t)|=4|L(t)|=4.

{𝔼IFtIFt.\displaystyle\sum\Bigl{\{}\mathbb{E}I_{F_{t}}I_{F_{t^{\prime}}}\Bigr{.} .:(t,t) form a pair of dependent connecting triples}\displaystyle\Bigl{.}:\ (t,t^{\prime})\textrm{ form a pair of dependent connecting triples}\Bigr{\}}
=\displaystyle= 𝔼IFt(i=14(|Di,2a|p82i+|Di,2|bp62i)+i=14(|Di,1a|p8i+|Di,1b|p6i))\displaystyle\ \mathbb{E}I_{F_{t}}\left(\sum_{i=1}^{4}\left(|D_{i,2}^{a}|p^{8-2i}+|D_{i,2}|^{b}p^{6-2i}\right)+\sum_{i=1}^{4}\left(|D_{i,1}^{a}|p^{8-i}+|D_{i,1}^{b}|p^{6-i}\right)\right)
\displaystyle\leq 𝔼IFt(((12n2Mp6+48Mnp4)+(18n2p4+36np2)+(24np2+24)+(6)).\displaystyle\ \mathbb{E}I_{F_{t}}\Bigl{(}\left((12n^{2}M^{\prime}p^{6}+48M^{\prime}np^{4})+(18n^{2}p^{4}+36np^{2})+(24np^{2}+24)+(6)\right)\Bigr{.}
.+2ε1nlogn((12n2Mp7+48Mnp5)+(18n2p6+36np4)+(24np5+24p3)+(6p4)))\displaystyle\Bigl{.}\ \ +\frac{2\varepsilon_{1}n}{\log n}\left((12n^{2}M^{\prime}p^{7}+48M^{\prime}np^{5})+(18n^{2}p^{6}+36np^{4})+(24np^{5}+24p^{3})+(6p^{4})\right)\Bigr{)}
(6.7) \displaystyle\leq (𝔼IFt)210(np2)3max(1,ε1lognMp).\displaystyle\ \left(\mathbb{E}I_{F_{t}}\right)2^{10}(np^{2})^{3}\max\left(1,\frac{\varepsilon_{1}}{\log n}M^{\prime}p\right).

(Note in the last line we use the fact that np2(λc)2>1/4np^{2}\geq(\lambda_{c})^{2}>1/4, whence (np)14p(np)^{-1}\leq 4p.)

Case 2: |L(t)|=3|L(t)|=3.

{𝔼IFtIFt.\displaystyle\sum\Bigl{\{}\mathbb{E}I_{F_{t}}I_{F_{t^{\prime}}}\Bigr{.} .:(t,t) form a pair of dependent connecting triples}\displaystyle\Bigl{.}:\ (t,t^{\prime})\textrm{ form a pair of dependent connecting triples}\Bigr{\}}
=\displaystyle= 𝔼IFt(i=13(|Di,2a|p82i+|Di,2b|p62i)+i=13(|Di,1a|p8i+|Di,1b|p6i))\displaystyle\ \mathbb{E}I_{F_{t}}\left(\sum_{i=1}^{3}\left(|D_{i,2}^{a}|p^{8-2i}+|D_{i,2}^{b}|p^{6-2i}\right)+\sum_{i=1}^{3}\left(|D_{i,1}^{a}|p^{8-i}+|D_{i,1}^{b}|p^{6-i}\right)\right)
\displaystyle\leq 𝔼IFt(((9n2Mp6+18nMp4)+(9n2p4+18np2)+(6np2+6)).\displaystyle\ \mathbb{E}I_{F_{t}}\Bigl{(}\left((9n^{2}M^{\prime}p^{6}+18nM^{\prime}p^{4})+(9n^{2}p^{4}+18np^{2})+(6np^{2}+6)\right)\Bigr{.}
.+2ε1nlogn((9n2Mp7+18nMp5)+(9n2p6+18np4)+(6np5+6p3)))\displaystyle\Bigl{.}\ \ +\frac{2\varepsilon_{1}n}{\log n}\left((9n^{2}M^{\prime}p^{7}+18nM^{\prime}p^{5})+(9n^{2}p^{6}+18np^{4})+(6np^{5}+6p^{3})\right)\Bigr{)}
(6.8) \displaystyle\leq (𝔼IFt)210(np2)3max(1,ε1lognMp).\displaystyle\ \left(\mathbb{E}I_{F_{t}}\right)2^{10}(np^{2})^{3}\max\left(1,\frac{\varepsilon_{1}}{\log n}M^{\prime}p\right).

Together, inequalities (6.7) and (6.8) yield that

(6.9) Δ12μ.210(np2)3max(1,ε1lognMp).\displaystyle\Delta\leq\frac{1}{2}\mu.2^{10}(np^{2})^{3}\max\left(1,\frac{\varepsilon_{1}}{\log n}M^{\prime}p\right).

Applying the Extended Janson Inequality, Proposition 6.5, together with the bounds (6.4) and (6.9) on μ\mu and Δ\Delta, we get:

(I=0)\displaystyle\mathbb{P}(I_{\mathcal{F}}=0) exp(μ22Δ)exp(μ210(np2)3max(1,ε1lognMp))\displaystyle\leq\exp\left(-\frac{\mu^{2}}{2\Delta}\right)\leq\exp\left(-\frac{\mu}{2^{10}(np^{2})^{3}\max\left(1,\frac{\varepsilon_{1}}{\log n}M^{\prime}p\right)}\right)
exp(ε2(ε1)2213M(np2)(logn)2max(1,ε1lognMp))\displaystyle\leq\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{13}}\frac{M(np^{2})}{(\log n)^{2}\max\left(1,\frac{\varepsilon_{1}}{\log n}M^{\prime}p\right)}\right)
(6.13) {exp(ε2(ε1)2215M(logn)2)if 2Mp1lognε1,exp(ε2ε1215p1logn)if p1lognε12Mn,exp(ε2ε1215p1Mnlogn)if n2M.\displaystyle\leq\left\{\begin{array}[]{ll}\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{15}}\frac{M}{(\log n)^{2}}\right)&\textrm{if }2M\leq\frac{p^{-1}\log n}{\varepsilon_{1}},\\ \exp\left(-\frac{\varepsilon_{2}\varepsilon_{1}}{2^{15}}\frac{p^{-1}}{\log n}\right)&\textrm{if }\frac{p^{-1}\log n}{\varepsilon_{1}}\leq 2M\leq n,\\ \exp\left(-\frac{\varepsilon_{2}\varepsilon_{1}}{2^{15}}\frac{p^{-1}M}{n\log n}\right)&\textrm{if }n\leq 2M.\end{array}\right.

Now, the probability that CiC_{i} fails to connect to {Cj:ji}\cup\{C_{j}:\ j\neq i\} via a connecting triple is exactly the probability that I=0I_{\mathcal{F}}=0. Applying Markov’s inequality together with (6.3) and using our assumptions that M(logn)4M\geq(\log n)^{4} and that C1,,CrC_{1},\ldots,C_{r} are disjoint, we have for nn sufficiently large that

(Boost)\displaystyle\mathbb{P}(\textrm{Boost}) 1r(I=0)1n2M(I=0)1exp(ε2(ε1)2216(logn)2),\displaystyle\geq 1-r\mathbb{P}(I_{\mathcal{F}}=0)\geq 1-\frac{n^{2}}{M}\mathbb{P}(I_{\mathcal{F}}=0)\geq 1-\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right),

provided nn is sufficiently large. This concludes the proof of the connecting lemma. ∎

6.4. Sprinkling vertices

With Lemma 6.4 in hand, we can return to the proof of Theorem 1.5. To complete the proof, we shall use a multiple-round vertex-sprinkling argument. We partition Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p) into the union of

  1. (i)

    Γ1𝒢((1ε1)n,p)\Gamma_{1}\in{\mathcal{G}}((1-\varepsilon_{1})n,p) on V1=[(1ε1)n]V_{1}=[(1-\varepsilon_{1})n],

  2. (ii)

    Γ2𝒢(ε1n,p)\Gamma_{2}\in{\mathcal{G}}(\varepsilon_{1}n,p) on V2=[n]V1V_{2}=[n]\setminus V_{1}, and

  3. (iii)

    a pp-random bipartite graph B=Bp(V1,V2)B=B_{p}(V_{1},V_{2}) with bipartition V1V2V_{1}\sqcup V_{2}.

We further partition V2V_{2} into 2log2n2\log_{2}n sets of size ε1n2log2n\frac{\varepsilon_{1}n}{2\log_{2}n}, V2=i=12log2nV2,iV_{2}=\sqcup_{i=1}^{2\log_{2}n}V_{2,i} (and ignore rounding errors). We say that Γ1\Gamma_{1} is a good configuration if it satisfies the conclusion of Corollary 6.3, i.e., if at least ε2n2\varepsilon_{2}n^{2} non-edges of Γ1\Gamma_{1} lie in square-components in (Γ1)\square(\Gamma_{1}) of order at least M0:=(logn)4M_{0}:=(\log n)^{4} (this is actually slightly weaker than what Corollary 6.3 gives us, but is all we need here).

We shall condition on Γ1\Gamma_{1} being a good configuration when we perform our vertex-sprinkling. By Corollary 6.3, this occurs with probability 1O(n1)1-O(n^{-1}). A key observation is that the state of the edges in Γ2\Gamma_{2} and BB are independent of our conditioning. Our strategy is then to reveal the 2log2n2\log_{2}n sprinkling sets V2,kV_{2,k} one by one, and use them to create bridges between “somewhat large” square-components and thereby increase the minimum order of all “somewhat large” square-components.

More precisely, before stage k1k\geq 1 we have revealed all the edges inside

V1,k1:=V1(i=1k1V2,i).V_{1,k-1}:=V_{1}\cup\left(\cup_{i=1}^{k-1}V_{2,i}\right).

At this stage, we deem a square-component “large” if it contains at least Mk1M_{k-1} non-edges of Γ\Gamma, and “very large” if it contains at least 2Mk12M_{k-1} non-edges of Γ\Gamma (which constitute, as we recall, the vertices of the square-graph). Now in stage kk, we reveal the set SkS_{k} of non-edges of Γ\Gamma that lie inside V2,kV_{2,k} and the edges between V1,kV_{1,k} and V2,kV_{2,k}. We then merge components as follows: given two square-components CiC_{i} and CjC_{j}, a connecting triple is a triple (x1x2,y1y2,z1z2)Ci×Cj×Sk(x_{1}x_{2},y_{1}y_{2},z_{1}z_{2})\in C_{i}\times C_{j}\times S_{k}. Such a connecting triple is active if all edges between the sets {x1,x2,y1,y2}\{x_{1},x_{2},y_{1},y_{2}\} and {z1,z2}\{z_{1},z_{2}\} are in Γ\Gamma; in this case the components CiC_{i} and CjC_{j} lie inside the same square-component CC in (Γ[V1,k])\square(\Gamma[V_{1,k}]) (see Figure 3). In particular, if both CiC_{i} and CjC_{j} contained at least Mk1M_{k-1} non-edges, then CC must contain at least Mk=2Mk1M_{k}=2M_{k-1} non-edges.

\labellist
\pinlabel

y1y_{1} [ ] at 0 107 \pinlabely2y_{2} [ ] at 0 26 \pinlabelx1x_{1} [ ] at 0 270 \pinlabelx2x_{2} [ ] at 0 189 \pinlabelz1z_{1} [ ] at 210 189 \pinlabelz2z_{2} [ ] at 210 107 \endlabellist\includegraphics[scale=0.5]Connecting_triple

Figure 3. In this connecting triple, both x1z1x2z2x_{1}z_{1}x_{2}z_{2} and y1z1y2z2y_{1}z_{1}y_{2}z_{2} form induced copies of C4C_{4}, joining up the square-components containing the non-edges x1x2x_{1}x_{2} and y1y2y_{1}y_{2} via the non-edge of sprinkled vertices z1z2z_{1}z_{2}.

The connecting lemma we proved in the previous subsection immediately implies that with high probability at each stage kk, all components which are “large” but not “very large” must join up with at least one other “large” component. We make this explicit with a lemma below. Recall that throughout this section, λ>λc\lambda>\lambda_{c} is fixed and the edge-probability p=p(n)p=p(n) satisfies λ1n<p(n)<5lognn\lambda\frac{1}{\sqrt{n}}<p(n)<5\frac{\sqrt{\log n}}{\sqrt{n}}. Let ε1,ε2>0\varepsilon_{1},\varepsilon_{2}>0 be the constants whose existence is guaranteed by Corollary 6.3.

Lemma 6.6 (Sprinkling lemma).

Suppose that before stage kk, at least ε2n2\varepsilon_{2}n^{2} non-edges of Γ[V1,k1]\Gamma[V_{1,k-1}] lie in square-components of order at least Mk1=2k1M0M_{k-1}=2^{k-1}M_{0} in (Γ[V1,k1])\square(\Gamma[V_{1,k-1}]). Suppose Mk1ε24n2M_{k-1}\leq\frac{\varepsilon_{2}}{4}n^{2}. Then with probability at least

12exp(ε2(ε1)2216(logn)2)1-2\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right)

when we have revealed the edges from V2,kV_{2,k} to V1,k1V2,k:=V1,kV_{1,k-1}\cup V_{2,k}:=V_{1,k} at least ε2n2\varepsilon_{2}n^{2} non-edges of Γ[V1,k]\Gamma[V_{1,k}] lie in square-components of order at least Mk=2Mk1M_{k}=2M_{k-1} in (Γ[V1,k])\square(\Gamma[V_{1,k}]).

In particular, with probability at least

14log2nexp(ε2(ε1)2216(logn)2)1-4\log_{2}n\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right)

we have that starting from a good configuration Γ[V1]\Gamma[V_{1}], the sprinkling process described above has discovered within 2log2n4log2logn2\log_{2}n-4\log_{2}\log n steps a giant square-component containing at least ε2n2\varepsilon_{2}n^{2} non-edges, and all non-edges from Γ[V1]\Gamma[V_{1}] that lie inside components of (Γ[V1])\square(\Gamma[V_{1}]) of size at least (logn)4.(\log n)^{4}.

Proof.

Let SkS_{k} denote the set of non-edges of V2,kV_{2,k}. We have 𝔼|Sk|=(1p)(ε1lognn2)=(1o(1))ε12(logn)2n2\mathbb{E}|S_{k}|=(1-p)\binom{\frac{\varepsilon_{1}}{\log n}n}{2}=(1-o(1))\frac{\varepsilon_{1}^{2}}{(\log n)^{2}}n^{2} By a standard Chernoff bound,

(|Sk|ε1n24(logn)2)\displaystyle\mathbb{P}\left(|S_{k}|\leq\frac{\varepsilon_{1}n^{2}}{4(\log n)^{2}}\right) exp((ε1)216n2(logn)2)\displaystyle\leq\exp\left(-\frac{(\varepsilon_{1})^{2}}{16}\frac{n^{2}}{(\log n)^{2}}\right)

If |Sk|ε1n2(logn)2|S_{k}|\geq\frac{\varepsilon_{1}n^{2}}{(\log n)^{2}} holds, we can apply Lemma 6.4, concluding that every component of size Mk1M_{k-1} is joined with at least one other, resulting in a component of size Mk=2Mk1M_{k}=2M_{k-1}. Thus the desired conclusion for the first part of the lemma holds with probability at least

1exp((ε1)216n2(logn)2)exp(ε2(ε1)2216(logn)2)12exp(ε2(ε1)2216(logn)2),1-\exp\left(-\frac{(\varepsilon_{1})^{2}}{16}\frac{n^{2}}{(\log n)^{2}}\right)-\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right)\geq 1-2\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right),

as desired.

For the “in particular” part, we first apply a simple union bound to the first 2log2n4log2logn2\log_{2}n-4\log_{2}\log n steps of the process, to show that with probability at least

(6.14) 12(2log2n4log2logn)exp(ε2(ε1)2216(logn)2)\displaystyle 1-2(2\log_{2}n-4\log_{2}\log n)\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right)

our sprinkling process has uncovered a collection of square-components, each of which contains at least ε2n22\frac{\varepsilon_{2}n^{2}}{2} non-edges, and, whose union contains at least ε2n2\varepsilon_{2}n^{2} non-edges and includes all non-edges of Γ[V1]\Gamma[V_{1}] coming from components of (Γ[V1])\square(\Gamma[V_{1}]) of size at least (logn)4(\log n)^{4}. There can be at most 12(ε2)1\frac{1}{2}(\varepsilon_{2})^{-1} such components. By (6.3), the probability that a fixed pair of such components fails to join up in the next round of sprinkling is at most

exp((ε2)2ε1216p1nlogn)exp((ε2)2ε12165n32(logn)32).\exp\left(-\frac{(\varepsilon_{2})^{2}\varepsilon_{1}}{2^{16}}\frac{p^{-1}n}{\log n}\right)\leq\exp\left(-\frac{(\varepsilon_{2})^{2}\varepsilon_{1}}{2^{16}}\frac{5n^{\frac{3}{2}}}{(\log n)^{\frac{3}{2}}}\right).

Taking the union bound over the at most 18(ε2)2\frac{1}{8}(\varepsilon_{2})^{-2} pairs of components, we have that the probability any pair of these components fail to join up is, for large nn, a lot less than the last term in equation (6.14):

8log2lognexp(ε2(ε1)2216(logn)2).8\log_{2}\log n\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right).

Combining this with (6.14), we get the claimed bound on the probability of having discovered a giant square-component containing at least ε2n2\varepsilon_{2}n^{2} non-edges and all non-edges of Γ[V1]\Gamma[V_{1}] contained in components of (Γ[V1])\square(\Gamma[V_{1}]) of size at least (logn)4(\log n)^{4}. ∎

6.5. Covering the whole world

All that now remains to complete the proof of Theorem 1.5 is to show that a.a.s. there is a square-component covering all vertices of (Γ)\square(\Gamma). Note that while in Lemma 6.6 we showed that (Γ)\square(\Gamma) has a giant component, we have not quite shown it is unique: in principle, one could stitch together a rival giant component at the last stage of sprinkling by building numerous bridges between small components. This is of course highly unlikely, and one could show uniqueness of the giant by exploiting the fact that the number of non-edges of Γ\Gamma lying in square-components of order at least (logn)4(\log n)^{4} is a.a.s. concentrated around its expectation (as shown in Corollary 6.3). However we do not have a nice form for this expectation, so a little care would be needed to show it changes continuously with nn to make the argument above fully rigorous. As this paper is already sufficiently long and as the uniqueness of the giant is not our main concern here, we eschew this and focus instead on the problem of ensuring we have a giant component whose support covers all the vertices. We sidestep the issue of the uniqueness of the giant by considering a partition of [n][n] which allows us to both build a preferred giant and, crucially, to ensure this preferred giant has full support. We begin by establishing a useful corollary of the work in the previous subsections.

Corollary 6.7.

Let λ>λc\lambda>\lambda_{c} be fixed, and let p=p(n)p=p(n) be an edge-probability satisfying λn1/2p5n1/2logn\lambda n^{-1/2}\leq p\leq 5n^{-1/2}\sqrt{\log n}. Let Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p). Then there for every ε3>0\varepsilon_{3}>0 sufficiently small, there exists a constant ε4>0\varepsilon_{4}>0 such that given fixed sets UU[n]U\subseteq U^{\prime}\subseteq[n] with |U|=(12ε3)n|U|=\lfloor(1-2\varepsilon_{3})n\rfloor, |U|=(1ε3)n|U^{\prime}|=\lfloor(1-\varepsilon_{3})n\rfloor all of the following hold with probability 1O(n1)1-O(n^{-1}):

  1. (i)

    there are at least ε4n2\varepsilon_{4}n^{2} non-edges in Γ[U]\Gamma[U] contained in square-components of (Γ[U])\square(\Gamma[U]) of order at least (logn)4(\log n)^{4};

  2. (ii)

    there is a unique square-component in (Γ[U])\square(\Gamma[U^{\prime}]) containing all non-edges in Γ[U]\Gamma[U] contained in square-components of (Γ[U])\square(\Gamma[U]) of order at least (logn)4(\log n)^{4};

  3. (iii)

    there is a unique square-component in (Γ)\square(\Gamma) containing all non-edges contained in square-components of (Γ[U])\square(\Gamma[U]) or (Γ[U])\square(\Gamma[U^{\prime}]) of order at least (logn)4(\log n)^{4}.∎

Proof.

The corollary is immediate for sufficiently small ε3>0\varepsilon_{3}>0 from an application of Corollary 6.3 inside UU (for part (i)) and two applications of Lemma 6.6 (for parts (ii) and (iii) respectively), together with a suitable choice of the constant ε4\varepsilon_{4}. ∎

We now apply this Corollary to prove Theorem 1.5.

Proof of Theorem 1.5.

First note that if f(n)f(n) is any function with f(n)=o(1)f(n)=o(1) and f(n)=Ω(n2)f(n)=\Omega(n^{-2}), and 5n1/2lognp(n)1f(n)5n^{-1/2}\sqrt{\log{n}}\leq p(n)\leq 1-f(n), then Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p) has the 𝒞𝒮\mathcal{CFS} property, by [5, Theorem 5.1]. Now assume that λ>λc\lambda>\lambda_{c} and λn1/2p(n)5n1/2logn\lambda n^{-1/2}\leq p(n)\leq 5n^{-1/2}\sqrt{\log{n}}. Pick ε3>0\varepsilon_{3}>0 sufficiently small, and let ε4>0\varepsilon_{4}>0 be the constant whose existence is guaranteed by Corollary 6.7. Partition [n][n] into K=2(ε3)1K=\lceil 2(\varepsilon_{3})^{-1}\rceil sets

Ui={x[n]:(i1)ε3n<xiε3n}.U_{i}=\{x\in[n]:\ (i-1)\varepsilon_{3}n<x\leq i\varepsilon_{3}n\}.

For each pair (i,j)(i,j) of distinct elements of [K][K], we apply Corollary 6.7 to the sets U=[n](UiUj)U=[n]\setminus(U_{i}\cup U_{j}) and U=[n]UiU^{\prime}=[n]\setminus U_{i}; taking a union bound over all such pairs (i,j)(i,j), we see that with probability 1O(K2n1)=1O(n1)1-O(K^{2}n^{-1})=1-O(n^{-1}), for every pair of distinct elements i,j[K]i,j\in[K] the following hold:

  1. (1)

    at least ε4n2\varepsilon_{4}n^{2} non-edges in Γ[[n](UiUj)]\Gamma[[n]\setminus(U_{i}\cup U_{j})] are contained in components of (Γ[[n](UiUj)])\square(\Gamma[[n]\setminus(U_{i}\cup U_{j})]) of order at least (logn)4(\log n)^{4};

  2. (2)

    there is a unique component CijC^{\prime}_{ij} of (Γ[[n]Ui])\square(\Gamma[[n]\setminus U_{i}]) containing all non-edges of Γ[[n](UiUj)]\Gamma[[n]\setminus(U_{i}\cup U_{j})] contained in components of (Γ[[n](UiUj)])\square(\Gamma[[n]\setminus(U_{i}\cup U_{j})]) of order at least (logn)4(\log n)^{4};

  3. (3)

    there is a unique component CiC_{i} of (Γ)\square(\Gamma) containing CijC^{\prime}_{ij} as well as all non-edges of Γ[[n]Ui]\Gamma[[n]\setminus U_{i}] contained in components of (Γ[[n]Ui])\square(\Gamma[[n]\setminus U_{i}]) of order at least (logn)4(\log n)^{4}.

We claim that i,j[K]\forall i,j\in[K] we have Ci=CjC_{i}=C_{j}. Indeed for iji\neq j, note that CiCijC_{i}\supseteq C^{\prime}_{ij} and CjCjiC_{j}\supseteq C^{\prime}_{ji}. Since both CijC^{\prime}_{ij} and CjiC^{\prime}_{ji} contain all of the at least ε4n2\varepsilon_{4}n^{2} non-edges contained in components of (Γ[[n](UiUj)])\square(\Gamma[[n]\setminus(U_{i}\cup U_{j})]) of order at least (logn)4(\log n)^{4}, it follows that CiCjCijCjiC_{i}\cap C_{j}\supseteq C^{\prime}_{ij}\supseteq C^{\prime}_{ji}\neq\emptyset. Since their intersection is non-empty, CiC_{i} and CjC_{j} are the same component of (Γ)\square(\Gamma), as claimed. We may thus let CC_{\star} denote the a.a.s. unique square-component with C=CiC_{\star}=C_{i} for all i[K]i\in[K].

We now show that a.a.s. the support of this component CC_{\star} is the whole vertex set [n][n]. Pick i[K]i\in[K] and condition on the event that there is a square-component CiC_{i}^{\prime} in (Γ[[n]Ui])\square(\Gamma[[n]\setminus U_{i}]) of order at least ε4n2\varepsilon_{4}n^{2} (an event which occurs with probability 1O(n1)1-O(n^{-1}), as we saw in (2) above). If two or more such components exist, pick a largest one. Further, condition on each vertex xUix\in U_{i} having at least ε32n\frac{\varepsilon_{3}}{2}n non-neighbors in Γ[Ui]\Gamma[U_{i}]. By a standard application of Chernoff bounds and a union-bound, this event occurs with probability 1O(n1)1-O(n^{-1}).

Having thus conditioned on the state of pairs in Γ[Ui]\Gamma[U_{i}] and Γ[[n]Ui]\Gamma[[n]\setminus U_{i}], we now show that a.a.s. for every vertex xUix\in U_{i}, there exist yUiy\in U_{i} and uvCiuv\in C_{i}^{\prime} such that xyuvxyuv induces a copy of C4C_{4} — so that that xyxy belongs to the component CC^{\prime} of (Γ)\square(\Gamma) containing CiC_{i}^{\prime}. Combining this with (3) above (which implie that a.a.s. C=CC^{\prime}=C_{\star}) and a simple union bound will then yield Theorem 1.5.

Given non-edges xyUi(2)E(Γ[Ui])xy\in U_{i}^{(2)}\setminus E(\Gamma[U_{i}]) and uvCiuv\in C_{i}^{\prime}, let Xxy,uvX_{xy,uv} be the indicator function of the event that all of uxux, uyuy, vxvx and vyvy are edges of Γ\Gamma. Observe that this event is independent of our conditioning. For xUix\in U_{i}, set

Xx=yUi:xyE[Ui]uvCiXxy,uvX_{x}=\sum_{y\in U_{i}:\ xy\notin E[U_{i}]}\sum_{uv\in C_{i}^{\prime}}X_{xy,uv}

to be the number of induced C4C_{4}’s xyuvxyuv of Γ\Gamma with yUiy\in U_{i} and uvCiuv\in C_{i}^{\prime}. We shall again apply the Extended Janson Inequality ((Proposition 6.5) to bound (Xx=0)\mathbb{P}(X_{x}=0). Given our conditioning, the expectation of XxX_{x} is

μ:=|{yUi:xyE(Γ)}||Ci|p4ε3ε4(λc)42n=Ω(n).\mu:=|\{y\in U_{i}:\ xy\notin E(\Gamma)\}|\cdot|C_{i}^{\prime}|p^{4}\geq\frac{\varepsilon_{3}\varepsilon_{4}(\lambda_{c})^{4}}{2}n=\Omega(n).

We now compute the corresponding Δ\Delta-parameter in Janson’s inequality. For y,yUiΓx[Ui]y,y^{\prime}\in U_{i}\setminus\Gamma_{x}[U_{i}] and uv,uvCiuv,u^{\prime}v^{\prime}\in C^{\prime}_{i}, write {xy,uv}{xy,uv}\{xy,uv\}\sim\{xy^{\prime},u^{\prime}v^{\prime}\} if {ux,uy,vx,vy}{ux,uy,vx,vy}\{ux,uy,vx,vy\}\cap\{u^{\prime}x,u^{\prime}y^{\prime},v^{\prime}x,v^{\prime}y\}\neq\emptyset. Note that the random variables Xxy,uvX_{xy,uv} and Xxy,uvX_{xy^{\prime},u^{\prime}v^{\prime}} are independent unless {xy,uv}{xy,uv}\{xy,uv\}\sim\{xy^{\prime},u^{\prime}v^{\prime}\}, and further that {xy,uv}{xy,uv}\{xy,uv\}\sim\{xy^{\prime},u^{\prime}v^{\prime}\} implies {u,v}{u,v}\{u,v\}\cap\{u^{\prime},v^{\prime}\}\neq\emptyset.

Pick and fix yUiΓx[Ui]y\in U_{i}\setminus\Gamma_{x}[U_{i}] and uvCiuv\in C^{\prime}_{i}, which can be done in at most |Ui||Ci||U_{i}|\cdot|C_{i}^{\prime}| ways. We perform a case analysis to determine the contributions to the Δ\Delta-parameter from terms of the form 𝔼Xxy,uvXxy,uv\mathbb{E}X_{xy,uv}X_{xy^{\prime},u^{\prime}v^{\prime}} with {xy,uv}{xy,uv}\{xy,uv\}\sim\{xy^{\prime},u^{\prime}v^{\prime}\}:

  • there are at most |Ui||U_{i}| choices of yUiΓx[Ui]y^{\prime}\in U_{i}\setminus\Gamma_{x}[U_{i}] with yyy^{\prime}\neq y such that {xy,uv}{xy,uv}\{xy,uv\}\sim\{xy^{\prime},uv\}, and for such yy^{\prime} we have 𝔼[Xxy,uvXxy,uv]=p6\mathbb{E}[X_{xy,uv}X_{xy^{\prime},uv}]=p^{6};

  • there are at most nn choices of v[n]Uiv^{\prime}\in[n]\setminus U_{i} with vvv^{\prime}\neq v such that {xy,uv}{xy,uv}\{xy,uv\}\sim\{xy,uv^{\prime}\}, and for such vv^{\prime} we have 𝔼[Xxy,uvXxy,uv]=p6\mathbb{E}[X_{xy,uv}X_{xy,uv^{\prime}}]=p^{6} (with the contribution from the symmetric cases {xy,uv}{xy,uv}\{xy,uv\}\sim\{xy,u^{\prime}v\} analysed similarly);

  • finally, there are at most n|Ui|n|U_{i}| choices of (v,y)(v^{\prime},y^{\prime}) with v[n]Uiv^{\prime}\in[n]\setminus U_{i}, yUiΓx[Ui]y^{\prime}\in U_{i}\setminus\Gamma_{x}[U_{i}], vvv^{\prime}\neq v and yyy\neq y^{\prime} such that {xy,uv}{xy,uv}\{xy,uv\}\sim\{xy^{\prime},uv^{\prime}\}, and for such (v,y)(v^{\prime},y^{\prime}) we have 𝔼[Xxy,uvXxy,uv]=p7\mathbb{E}[X_{xy,uv}X_{xy^{\prime},uv^{\prime}}]=p^{7} (with the contribution from the symmetric cases {xy,uv}{xy,uv}\{xy,uv\}\sim\{xy^{\prime},u^{\prime}v\} analysed similarly).

Putting it all together, we see

Δ={xy,uv}{xy,uv}𝔼[Xxy,uvXxy,uv]|Ui||Ci|(|Ui|p6+2np6+2n|Ui|p7).\displaystyle\Delta=\sum_{\{xy,uv\}\sim\{xy^{\prime},u^{\prime}v^{\prime}\}}\mathbb{E}[X_{xy,uv}X_{xy^{\prime},uv}]\leq|U_{i}|\cdot|C_{i}^{\prime}|\Bigl{(}|U_{i}|p^{6}+2np^{6}+2n|U_{i}|p^{7}\Bigr{)}.

Since |Ui|ε3n|U_{i}|\leq\varepsilon_{3}n, |Ci|n2/2|C^{\prime}_{i}|\leq n^{2}/2 and p5lognnp\leq 5\sqrt{\frac{\log{n}}{n}}, the above implies

Δ\displaystyle\Delta 57(ε3)2nn(logn)7/2(1+o(1)),\displaystyle\leq 5^{7}(\varepsilon_{3})^{2}n\sqrt{n}(\log n)^{7/2}(1+o(1)),

which for nn large enough is at most 220ϵ3ϵ4(λc)4μn1/2(logn)7/2\displaystyle\frac{2^{20}\epsilon_{3}}{\epsilon_{4}(\lambda_{c})^{4}}\mu n^{1/2}(\log n)^{7/2}. Applying Janson’s inequality,

(Xx=0)exp(μ22Δ)exp(ϵ4λc4μ220(logn)3/2n)=o(n1).\displaystyle\mathbb{P}(X_{x}=0)\leq\exp\left(-\frac{\mu^{2}}{2\Delta}\right)\leq\exp\left(-\frac{\epsilon_{4}\lambda_{c}^{4}\mu}{2^{20}(\log n)^{3/2}\sqrt{n}}\right)=o(n^{-1}).

Thus with probability 1o(n1)1-o(n^{-1}), Xx>0X_{x}>0 and the vertex xx is covered by the square-component in (Γ)\square(\Gamma) that contains CiC_{i}^{\prime}. Taking a union bound over xUix\in U_{i}, with probability 1o(1)1-o(1), every vertex xUix\in U_{i} is covered by this square-component (which as we showed is the a.a.s  uniquely determined giant component CC_{\star}). Taking a union bound over i[K]i\in[K] and combining this with (1)–(3) above, we see that with probability 1o(1)1-o(1) the square-component CC_{\star} covers all of [n][n]. This concludes the proof of Theorem 1.5. ∎

7. Concluding remarks

There are several natural questions arising from our work. To begin with, one could ask for more information about the component structure in (Γ)\square(\Gamma): in the subcritical regime, can one get a better upper bound on the order of square-components? In the supercritical regime, can one give good bounds on the order of the second-largest square-component? In particular, can one give better bounds than just o(n2)o(n^{2}), and can one show its support has size o(n)o(n)? This may be feasible albeit technically challenging.

Another question on the probabilistic side is determining the range of p=p(n)p=p(n) for which the square graph (Γ)\square(\Gamma) of Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p) is a.a.s. connected, a very different question from the ones considered in this paper. Investigating other parameters such as the diameter of (Γ)\square(\Gamma) may also lead in interesting directions.

Further afield, one could consider percolation problems for similar structures. We could for example consider the graph on all triples of independent vertices in Γ\Gamma obtained by setting two such triples to be adjacent if they induce a copy of the 66-cycle C6C_{6} or of the complete balanced bipartite graph K3,3K_{3,3} in Γ\Gamma. We would guess the techniques in this paper would adapt well to the latter problem, as Bollobás and Riordan also suggest, but that new ideas would be needed for the former.

One could also consider a similar problem for a pp-random rr-uniform hypergraph Γ\Gamma, by setting a set of rr vertex-disjoint non-edges F1,F2,,FrE(Γ)F_{1},F_{2},\ldots,F_{r}\notin E(\Gamma) to be adjacent if all of the r2r^{2} edges meeting each FiF_{i} in exactly one vertex are present in Γ\Gamma (in other words, if and only if the union of the FiF_{i}s induces a copy of the complete balanced rr-partite rr-uniform hypergraph on r2r^{2} vertices). Note the case r=2r=2 corresponds exactly to square percolation. Levcovitz [16] has provided a quasi-isometry invariant for right-angled Coxeter groups by associating a hypergraph to any such group, so analysis of suitable variants of square percolation for hypergraphs may yield interesting applications in geometric group theory (besides constituting a challenging and rather natural problem in combinatorial probability).

Finally it would be interesting to study other properties of the right-angled Coxeter group WΓW_{\Gamma} when Γ𝒢(n,p)\Gamma\in{\mathcal{G}}(n,p) using tools from random graph theory. In particular, determining the threshold for algebraic thickness of every order, or the exact rate of divergence of WΓW_{\Gamma} for all pp would be of great interest (see [6, Question 1]). Doing so will require new group theoretic ideas to translate these properties into graph theoretic language, and the identification of suitably tractable graph theoretic proxies for these in 𝒢(n,p){\mathcal{G}}(n,p). Work of Levcovitz [16] provides promising progress towards finding combinatorial properties to encode higher rates of polynomial divergence in right-angled Coxeter groups; indeed, as we finalized this paper, Levcovitz released a new preprint [17] that provides such a translation, which we expect will be of use in future work on this problem. As Levcovitz’s work involves hypergraphs, developing new techniques for generalizations of square percolation to hypergraphs will likely be key to further progress.

Finally, one could study thickness and relative hyperbolicity in random right-angled Coxeter groups with presentation graphs drawn from other distributions than the Erdős–Rényi random graph model, such as random regular graphs. We do not know of any work which has been done in this direction at the present time.

References

  • [1] Alon, N., and Spencer, J. H. The probabilistic method, fourth ed. Wiley-Interscience Series in Discrete Mathematics and Optimization. John Wiley & Sons, Inc., Hoboken, NJ, 2015.
  • [2] Balister, P. Branching processes. Lecture Note Series, IMS, NUS (2006).
  • [3] Behrstock, J., and Druţu, C. Divergence, thick groups, and short conjugators. Illinois J. Math. 58, 4 (2014), 939–980.
  • [4] Behrstock, J., Druţu, C., and Mosher, L. Thick metric spaces, relative hyperbolicity, and quasi-isometric rigidity. Mathematische Annalen 344, 3 (2009), 543–595.
  • [5] Behrstock, J., Falgas-Ravry, V., Hagen, M. F., and Susse, T. Global structural properties of random graphs. Int. Math. Res. Not. 2018, 5 (2018), 1411–1441.
  • [6] Behrstock, J., Hagen, M., and Sisto, A. Thickness, relative hyperbolicity, and randomness in Coxeter groups. Algebr. Geom. Topol. 17, 2 (2017), 705–740.
  • [7] Bollobás, B., and Riordan, O. Clique percolation. Random Structures Algorithms 35, 3 (2009), 294–322.
  • [8] Dani, P., and Thomas, A. Divergence in right-angled Coxeter groups. Trans. Amer. Math. Soc. 367, 5 (2015), 3549–3577.
  • [9] Derényi, I., Palla, G., and Vicsek, T. Clique percolation in random networks. Phys. Rev. Lett. 94 (Apr 2005), 160202.
  • [10] Druţu, C., Mozes, S., and Sapir, M. Divergence in lattices in semisimple Lie groups and graphs of groups. Trans. Amer. Math. Soc. 362, 5 (2010), 2451–2505.
  • [11] Dwass, M. The total progeny in a branching process and a related random walk. Journal of Applied Probability 6, 3 (1969), 682–686.
  • [12] Gersten, S. M. Divergence in 33-manifold groups. Geom. Funct. Anal. 4, 6 (1994), 633–647.
  • [13] Harris, T. E. A lower bound for the critical probability in a certain percolation process. In Mathematical Proceedings of the Cambridge Philosophical Society (1960), vol. 56, Cambridge University Press, pp. 13–20.
  • [14] Janson, S., Łuczak, T., and Ruciński, A. Random graphs, vol. 45. John Wiley & Sons, 2011.
  • [15] Levcovitz, I. Divergence of CAT(0)\rm CAT(0) cube complexes and Coxeter groups. Algebr. Geom. Topol. 18, 3 (2018), 1633–1673.
  • [16] Levcovitz, I. A quasi-isometry invariant and thickness bounds for right-angled Coxeter groups. Groups Geom. Dyn. 13, 1 (2019), 349–378.
  • [17] Levcovitz, I. Characterizing divergence and thickness in right-angled Coxeter groups. arXiv:2007.13796, 2020.
  • [18] Li, M., Deng, Y., and Wang, B.-H. Clique percolation in random graphs. Phys. Rev. E (3) 92, 4 (2015), 042116–042122.
  • [19] Mühlherr, B. Automorphisms of graph-universal coxeter groups. Journal of Algebra 200, 2 (1998), 629–649.
  • [20] Palla, G., Derényi, I., Farkas, I., and Vicsek, T. Uncovering the overlapping community structure of complex networks in nature and society. Nature 435 (June 2005), 814–818.
  • [21] Tóth, B., Vicsek, T., and Palla, G. Overlapping modularity at the critical point of kk-clique percolation. J. Stat. Phys. 151, 3-4 (2013), 689–706.
  • [22] Wang, B., Cao, L., Suzuki, H., and Aihara, K. Impacts of clustering on interacting epidemics. J. Theoret. Biol. 304 (2012), 121–130.