Square percolation and the threshold for quadratic divergence in random right-angled Coxeter groups

Jason Behrstock Lehman College and The Graduate Center, CUNY, New York, New York, USA [email protected] , Victor Falgas-Ravry Umeå Universitet, Umeå, Sweden [email protected] and Tim Susse Bard College at Simon’s Rock, Great Barrington, Mass., USA [email protected]

Abstract.

Given a graph $\Gamma$ , its auxiliary square-graph $\square(\Gamma)$ is the graph whose vertices are the non-edges of $\Gamma$ and whose edges are the pairs of non-edges which induce a square (i.e., a $4$ -cycle) in $\Gamma$ . We determine the threshold edge-probability $p=p_{c}(n)$ at which the Erdős–Rényi random graph $\Gamma=\Gamma_{n,p}$ begins to asymptotically almost surely have a square-graph with a connected component whose squares together cover all the vertices of $\Gamma_{n,p}$ . We show $p_{c}(n)=\sqrt{\sqrt{6}-2}/\sqrt{n}$ , a polylogarithmic improvement on earlier bounds on $p_{c}(n)$ due to Hagen and the authors. As a corollary, we determine the threshold $p=p_{c}(n)$ at which the random right-angled Coxeter group $W_{\Gamma_{n,p}}$ asymptotically almost surely becomes strongly algebraically thick of order $1$ and has quadratic divergence.

2010 Mathematics Subject Classification:

05C80, 20F65, 57M15, 60B99, 20F55, 20F69

1. Introduction

In this paper we investigate the phase transition for a variant of “square percolation”, with motivation coming from both previous work on clique percolation and from questions in geometric group theory.

Clique percolation was introduced by Derényi, Palla and Vicsek [9] as a simple model for community detection, and quickly became well-studied in network science, from computational, empirical, and theoretical perspectives, see e.g. [7, 9, 18, 20, 21, 22]. In $(k,\ell)$ –clique percolation, to investigate the “community structure” of a graph or network $\Gamma$ one studies the auxiliary $(k,\ell)$ -clique graph whose vertices are the $k$ –cliques of $\Gamma$ and whose edges are those pairs of $k$ –cliques having at least $\ell$ vertices in common.

One of the main research questions in the area was determining the threshold $p=p(n)$ for the emergence of a “giant component” in the auxiliary $(k,\ell)$ -clique graph when the original graph $\Gamma\in\mathcal{G}(n,p)$ is an Erdős–Rényi random graph on $n$ vertices with edge-probability $p$ . This was completely resolved in 2009 by Bollobás and Riordan [7], in a highly impressive paper making sophisticated use of branching processes. In the concluding remarks of their paper, Bollobás and Riordan suggested a study of “square percolation” as a natural extension of their work. More precisely, given a graph $\Gamma$ they suggested studying the component structure of the auxiliary graph whose vertices are the not necessarily induced $4$ -cycles in $\Gamma$ , and whose edges are pairs of $4$ –cycles with a diagonal¹¹1Note that given a $4$ –cycle in a graph, we use the term “diagonal” to refer to the pair of vertices of a diagonal, even though they they may not span an edge in the graph; indeed most of this paper concerns induced $4$ –cycles so that the edge spanned by the diagonal is not in the graph. in common. For $\Gamma\in\mathcal{G}(n,p)$ , they stated that they believed the threshold for the associated auxiliary graph to contain a giant component containing a positive proportion of all squares of $\Gamma$ should be $\lambda_{c}/\sqrt{n}$ , where $\lambda_{c}=\sqrt{\sqrt{6}-2}$ (see the discussion around equation (19) in Section 2.3 of [7]).

A related (but slightly different) notion of “square percolation” arose independently in joint work of the authors with Hagen [5] on the divergence of the random right-angled Coxeter group, providing motivation from geometric group theory for understanding the phase transition in an auxiliary graph formed from the induced $4$ -cycles of an Erdős–Rényi random graph $\Gamma\in\mathcal{G}(n,p)$ . To make this more precise, we make the following definition.

Definition 1.1.

To any graph $\Gamma$ , we associate an auxiliary square-graph, $\square\left({\Gamma}\right)$ , whose vertices are the non-edges of $\Gamma$ , and whose edges are the pairs of non-edges of $\Gamma$ that together induce a $4$ –cycle (a.k.a. square) in $\Gamma$ .

Thus for vertices $a,b,c,d$ in a graph $\Gamma$ , the pair $\{ac,bd\}$ is an edge of $\square(\Gamma)$ if and only if (i) $ac$ and $bd$ are non-edges of $\Gamma$ (and thus vertices of $\square(\Gamma)$ ), and (ii) $ab$ , $bc$ , $cd$ and $da$ are all edges of $\Gamma$ .

Remark 1.2.

This definition of the auxiliary square-graph $\square(\Gamma)$ differs slightly from the one used in the related papers [5, 8]. In those papers, the auxiliary graph had the induced $4$ -cycles as its vertices, and its edges were those pairs of induced $4$ -cycles having a diagonal in common. These two variants of auxiliary square-graphs encode essentially the same information, but the formulation above is more natural from a combinatorial perspective and more convenient for the exploration processes we shall consider in this paper.

We investigate the component structure of $\square(\Gamma)$ , albeit with an unusual twist. With a view to applications in geometric group theory, we will be interested in the question of whether or not $\square(\Gamma)$ has a component that “covers” all of the vertex-set of the original graph $\Gamma$ .

Definition 1.3.

We refer to connected components of $\square\left({\Gamma}\right)$ as square-components of $\Gamma$ . Given a square-component $C$ we define its support to be the collection of vertices of $\Gamma$ given by:

\operatorname{\mathrm{supp}}(C)=\bigcup_{vw\in C}\{v,w\},

and say that the component $C$ of $\square\left({\Gamma}\right)$ covers the vertex set $\operatorname{\mathrm{supp}}(C)\subseteq V(\Gamma)$ . If $C$ covers all of $V(\Gamma)$ , we say it is a square-component with full support

Write $\Gamma\in\mathcal{G}(n,p)$ to denote that $\Gamma$ is an instance of the Erdős–Rényi random graph model with parameters $n$ and $p$ , i.e., that $\Gamma$ is a graph on $n$ vertices obtained by including each edge at random with probability $p$ , independently of all the others.

Our main combinatorial result in this paper is pinpointing the precise threshold $p_{c}(n)$ at which $\Gamma\in\mathcal{G}(n,p)$ asymptotically almost surely²²2As usual, asymptotically almost surely or a.a.s. is shorthand for “with probability tending to $1$ as $n\rightarrow\infty$ .” experiences a phase transition from having only square-components with support of logarithmic order to having a square-component with full support. Throughout this paper we set $\lambda_{c}=\sqrt{\sqrt{6}-2}$ . The following two results establish that the critical threshold probability is $p(n)=\lambda_{c}n^{-1/2}$ by showing highly disparate behavior on either side of this threshold as given by the following two contrasting results.

Theorem 1.4 (Subcritical Behavior).

Let $\lambda<\lambda_{c}$ be fixed. Suppose that $p(n)\leq\lambda n^{-1/2}$ . Then for $\Gamma\in{\mathcal{G}}(n,p)$ , a.a.s. every square-component of $\Gamma$ covers at most $O((\log n)^{2^{32}})$ vertices.

Theorem 1.5 (Supercritical Behavior).

Let $\lambda>\lambda_{c}$ be fixed, and let $f\colon\mathbb{N}\rightarrow\mathbb{R}^{+}$ be a function with $f(n)\rightarrow 0$ and $f(n)n^{2}\rightarrow\infty$ as $n\rightarrow\infty$ . Let $p=p(n)$ be an edge-probability with

\lambda n^{-1/2}\leq p(n)\leq 1-f(n).

Then for $\Gamma\in{\mathcal{G}}(n,p)$ , a.a.s. there is a square-component of $\Gamma$ covering all vertices of $\Gamma$ .

Our proofs of Theorems 1.4 and 1.5 confirm the conjecture of Bollobás and Riordan regarding the location of the phase transition for their version of (non-induced) square percolation. Further, Theorems 1.4 and 1.5 have a direct application to the study of the geometric properties of random right-angled Coxeter group, which we now describe.

Given a graph $\Gamma$ , we define the associated right-angled Coxeter group (RACG) $W_{\Gamma}$ by taking the free group on $V(\Gamma)$ and adding the relations $a^{2}=1$ and $ab=ba$ for all $a\in V(\Gamma)$ , $ab\in E(\Gamma)$ . In this way, the graph $\Gamma$ encodes a finite presentation for the right-angled Coxeter group $W_{\Gamma}$ . Given graphs $\Gamma$ and $\Lambda$ , it is well-known that the associated groups $W_{\Gamma}$ and $W_{\Lambda}$ are isomorphic if and only if the graphs $\Gamma$ and $\Lambda$ are isomorphic, see [19]. Thus algebraic and geometric properties of $W_{\Gamma}$ can be studied via purely graph-theoretic means, as we do in this paper. Indeed, a number of geometric properties of a right-angled Coxeter group $W_{\Gamma}$ admit encodings as graph-theoretic properties of the presentation graph $\Gamma$ . Such properties include thickness and having quadratic divergence, which are both important in geometric group theory (see Section 3 below for a formal definition of these notions). An investigation of right-angled Coxeter groups with quadratic divergence was the main motivation for the work undertaken in this paper.

The correspondence between right-angled Coxeter groups and graphs allows one to define models of random groups based on random graph models. In particular, in this paper we consider the random right-angled Coxeter group, $W_{\Gamma}$ where the presentation graph $\Gamma\in\mathcal{G}_{n,p}$ is an Erdős–Rényi random graph. Using Theorems 1.4 and 1.5 above on square-components in Erdős–Rényi random graphs, we prove the following.

Theorem 1.6 (Criticality for quadratic divergence of RACGs).

Let $\epsilon>0$ . If

\frac{\lambda_{c}+\epsilon}{\sqrt{n}}\leq p(n)\leq 1-\frac{(1+\epsilon)\log{n}}{n}

and $\Gamma\in{\mathcal{G}}(n,p)$ . Then, a.a.s. the right-angled Coxeter group $W_{\Gamma}$ has quadratic divergence and is strongly algebraically thick of order exactly $1$ .

On the other hand, if $p(n)$ satisfies

0\leq p(n)\leq\frac{\lambda-\epsilon}{\sqrt{n}}

then the right-angled Coxeter group $W_{\Gamma}$ a.a.s has at least cubic divergence and is not strongly algebraically thick of order $0$ or $1$ .

The geometric properties of $W_{\Gamma}$ when $\Gamma\in{\mathcal{G}}_{n,p}$ and $p=1-\theta(n^{-2})$ was described in detail by Behrstock, Hagen and Sisto in [6, Theorem V]. Together with their work, our results give an essentially complete picture of quadratic and linear divergence in random right-angled Coxeter groups.

Organization of the paper

In Section 3 we provide additional background material on the geometry of random groups and derive Theorem 1.6 from Theorems 1.4–1.5. In Section 4, we recall some basic facts about branching processes and give an outline of the proof strategy we follow for our main results, and of the ways in which it departs from the framework used by Bolloás and Riordan in their study of clique percolation in random graphs. Theorem 1.4 is proved in Section 5, while Theorem 1.5 is derived in Section 6. We end the paper in Section 7 with some discussion of the results and of further work and related problems.

Acknowledgments

The authors thank Mark Hagen for discussions during the early stages of this work. Behrstock was supported by NSF grant DMS-1710890. Falgas-Ravry was supported by Swedish Research Council grant VR 2016-03488. The authors thank Ela Behrstock for her skillful rendering of Figures 2 and 2. Aspects of this work were motivated by output from software written by the authors (available upon request from the authors, some available online at http://comet.lehman.cuny.edu/behrstock/random.html). Accordingly, this research was supported, in part, by a grant of computer time from the City University of New York High Performance Computing Center under NSF Grants CNS-0855217, CNS-0958379, and ACI-1126113.

2. Graph-theoretic notation and standard notions

Given a set $A$ and $r\in\mathbb{N}$ , let $A^{(r)}$ denote the collection of all subsets of $A$ of cardinality $r$ . So for example $A^{(2)}$ is the collection unordered distinct pairs of elements of $A$ . As a notational convenience, we set $[n]:=\{1,2,\ldots,n\}$ , and we often denote the unordered set $\{u,v\}$ by $uv$ .

A graph is a pair $\Gamma=(V,E)$ , where $V=V(\Gamma)$ is a set of vertices and $E=E(\Gamma)$ is a collection of pairs of vertices referred to as the edges of $\Gamma$ . A subgraph of $\Gamma$ is a graph $G$ with $V(G)\subseteq V(\Gamma)$ and $E(G)\subseteq E(\Gamma)$ . If $V(G)=X$ and $E(G)=V(\Gamma)\cap X^{(2)}$ , then we say $G$ is the subgraph of $\Gamma$ induced by $X$ and denote this fact by $G=\Gamma[X]$ . When there is no risk of confusion, we may abuse notation and use $X$ to refer to both the subset of $V(\Gamma)$ and the associated induced subgraph $\Gamma[X]$ . The complement of a graph $\Gamma=(V,E)$ is the graph $\Gamma^{c}=(V,V^{(2)}\setminus E)$ .

A path of length $\ell$ in a graph $\Gamma$ is an ordered sequence of $\ell+1$ distinct vertices $v_{0},v_{1},\ldots,v_{\ell}$ together with a set of $\ell$ edges $\{v_{i-1}v_{i}:\ i\in[l]\}\subseteq E(\Gamma)$ . Such a path is said to join $v_{0}$ to $v_{\ell}$ . Two vertices are said to be connected in $\Gamma$ if there is a path joining them. Being connected is an equivalence relation on the vertices of $\Gamma$ . A (connected) component of $\Gamma$ is then a nonempty set of vertices from $V(\Gamma)$ that forms an equivalence class under this relation.

In this paper we study squares in graphs. An square, or $4$ –cycle, in $\Gamma$ is a copy of the graph $C_{4}=(\{a,b,c,d\},\{ab,bc,cd,da\})$ as an subgraph of $\Gamma$ . In an abuse of notation, we will denote such a $C_{4}$ by $abcd$ . In other words, if we say “ $abcd$ is a copy of $C_{4}$ /a square in $\Gamma$ ”, we mean “ $ab,bc,cd,da\in E(\Gamma)$ ”. Further if we say “ $abcd$ is an induced $C_{4}$ /square in $\Gamma$ ”, we mean that $abcd$ is a square in $\Gamma$ and that in addition $ac,bd\notin E(\Gamma)$ . A useful notion for studying squares in graphs is that of a link graph: given a vertex $x\in V(\Gamma)$ , the link graph $\Gamma_{x}$ of $x$ is the collection of neighbors of $x$ in $\Gamma$ , i.e., $\Gamma_{x}=\{y\in V(\Gamma):\ xy\in E(\Gamma)\}$ .

By $\Gamma\in{\mathcal{G}}(n,p)$ we mean that $\Gamma$ is a random graph on the vertex set $[n]$ obtained by including each edge $uv$ in $E(\Gamma)$ with probability $p$ , independently of all the others. This is known as the Erdős–Rényi random graph model. Given a sequence of edge probabilities $p=p(n)$ and a graph property $\mathcal{P}$ , we say that a typical instance of $\Gamma\in{\mathcal{G}}(n,p)$ has property $\mathcal{P}$ , or, equivalently, that $\Gamma\in\mathcal{P}$ holds asymptotically almost surely (a.a.s.) if

\lim_{n\rightarrow\infty}\mathbb{P}(\Gamma\in\mathcal{P})=1.

Throughout the paper, we use standard Landau notation: given functions $f,g\colon\mathbb{N}\rightarrow\mathbb{R}^{+}$ , we write $f=o(g)$ for $\lim_{n\rightarrow\infty}f(n)/g(n)=0$ and $f=O(g)$ if there exists a constant $C>0$ such that $\limsup_{n\rightarrow\infty}f(n)/g(n)\leq C$ . Further we write $f=\omega(g)$ for $g=o(f)$ , $f=\Omega(g)$ for $g=O(f)$ . Finally if $f=O(g)$ and $f=\Omega(g)$ both hold, we denote this fact by $f=\Theta(g)$ .

3. Geometric group theory and the $\mathcal{CFS}$ property

Our main result in this paper establishes that $p(n)=\lambda_{c}/\sqrt{n}$ is the threshold for a typical instance $\Gamma$ of the Erdős–Rényi random graph model $\mathcal{G}(n,p)$ to have a square-graph with a component covering all of $V(\Gamma)$ . This property is a.a.s. equivalent to possessing the $\mathcal{CFS}$ –property, defined below.

3.1. Background

Recall that the graph joint $\Gamma_{1}\ast\Gamma_{2}$ of two graphs $\Gamma_{1}$ and $\Gamma_{2}$ is the graph obtained by taking disjoint unions of $\Gamma_{1}$ and $\Gamma_{2}$ , and adding in all edges from $\Gamma_{1}$ to $\Gamma_{2}$ .

Definition 3.1.

A finite graph $\Gamma$ is defined to be $\mathcal{CFS}$ (constructed from squares) if $\Gamma$ has induced subgraphs $K$ and $\Gamma^{\prime}$ with $K$ a (possibly empty) clique so that:

•

$\Gamma=\Gamma^{\prime}\ast K$ , and
•

$\square(\Gamma^{\prime})$ has a component $C$ with $\operatorname{\mathrm{supp}}(C)=V(\Gamma^{\prime})$ .

Dani–Thomas were the first to introduce a special case of the $\mathcal{CFS}$ property for triangle-free graphs in [8]. The $\mathcal{CFS}$ property for arbitrary graphs was then studied by Hagen and the authors in [5], with an eye towards establishing when this property holds a.a.s. in random graphs, while in [15] Levcovitz studied the geometric properties of right-angled Coxeter groups whose presentation graphs do not possess the $\mathcal{CFS}$ property.

With Hagen, the authors determined in [5] the threshold for the $\mathcal{CFS}$ property to hold a.a.s. in Erdős–Rényi random graphs up to a polylogarithmic factor.

Theorem 3.2 (Theorems 5.1 and 5.7 in [5]).

If $p(n)\leq\left(\log n\right)^{-1}/\sqrt{n}$ , then a.a.s. a graph $\Gamma\in{\mathcal{G}}(n,p)$ does not have the $\mathcal{CFS}$ property. On the other hand if $p(n)\geq 5\sqrt{\log n}/\sqrt{n}$ and $(1-p)n^{2}\to\infty$ , then a.a.s. $\Gamma\in{\mathcal{G}}(n,p)$ does have the $\mathcal{CFS}$ property.

Our contribution in this paper is to eliminate the polylogarithmic gap in Theorem 3.2 and thus to determine the precise threshold for the $\mathcal{CFS}$ property in random graphs.

The $\mathcal{CFS}$ property is closely linked to the large scale geometry of right-angled Coxeter groups, connected to divergence and (strong algebraic) thickness. Divergence is a quasi-isometry invariant of groups introduced by Gersten [12] and further developed by Druţu, Mozes and Sapir [10], while thickness was introduced by Behrstock–Druţu–Mosher in [4] and then further refined by Behrstock–Druţu in [3]. We define these notions and explain how they are related below.

Definition 3.3.

Let $(X,d)$ be a geodesic metric space, let $o\in X$ and let $\rho\in(0,1]$ . Given $x,y\in X$ with $d(x,o)=d(y,o)=r$ , we define $d_{\rho r}(x,y)$ to be the infimum of the lengths of paths in $X\setminus B(o,\rho r)$ between $x$ and $y$ , if such a path exists, and $\infty$ otherwise; here $B(o,\rho r)$ denotes the ball of radius $\rho r$ about $o$ . We then set

\delta_{\rho}(r)=\sup_{o\in X}\sup_{x,y}d_{\rho r}(x,y).

The divergence of $X$ is defined to be the collection of functions $\delta_{\rho}:\ r\mapsto\delta_{\rho}(r)$ ,

\text{Div}(X):=\{\delta_{\rho}:\rho\in(0,1]\}.

Given two non-decreasing functions $f,g\colon\mathbb{N}\to\mathbb{R}^{+}$ , we say that $f\lesssim g$ if there exists $C\geq 1$ so that:

f(r)\leq C\cdot g(Cr+C)+Cr+C,

and we say $f\sim g$ if $f\lesssim g$ and $g\lesssim f$ . Importantly, two polynomials that are non decreasing $\mathbb{N}\to\mathbb{R}^{+}$ and have the same degree are equivalent under this relation, and further for $a,b\in\mathbb{N}$ we have $x^{a}\sim x^{b}$ if and only if $a=b$ .

When $X$ is the Cayley graph of a right-angled Coxeter group, it is straightforward to see that $\delta_{\rho}(r)\sim\delta_{1}(r)$ . Therefore when we are referring to the divergence function of $W_{\Gamma}$ , we will mean $\delta_{1}(r)$ . We say that a RACG $W_{\Gamma}$ has quadratic divergence if $\delta_{1}(r)\sim r^{2}$ and linear divergence if $\delta_{1}(r)\sim r$ .

Definition 3.4.

Let $G$ be a finitely generated group.

•

We say that $G$ is strongly algebraically thick of order $0$ if it has linear divergence.
•
We say that $G$ is strongly algebraically thick of order at most $n$ if $G$ has a collection of subgroups $\mathcal{H}=\{H_{\alpha}\}$ so that:
- –
  
  $\left\langle\bigcup_{\alpha}H_{\alpha}\right\rangle$ has finite index in $G$
- –
  
  for $H_{\alpha},H_{\beta}\in\mathcal{H}$ there exists a sequence $H_{0}=H_{\alpha},H_{1},\ldots H_{k-1},H_{k}=H_{\beta}$ of elements of $\mathcal{H}$ so that $H_{i-1}\cap H_{i}$ is infinite for each $1\leq i\leq k$
- –
  
  there exists a constant $M>0$ so that each $H_{\alpha}\in\mathcal{H}$ is $M$ –quasiconvex, that is to say, every pair of points in $H_{\alpha}$ can be connected by an $(M,M)$ –quasigeodesic contained in $H_{\alpha}.$
- –
  
  each $H_{\alpha}\in\mathcal{H}$ is strongly algebraically thick of order at most $n-1$ .

Further, we say that $G$ is strongly algebraically thick of order exactly $n$ if it is strongly algebraically thick of order at most $n$ and not strongly algebraically thick of order at most $n-1$ . We also usually write “thick” as a shorthand for “strongly algebraically thick”.

Behrstock and Druţu discovered that the order of thickness provides upper bounds on the divergence of a metric space. In particular they proved:

Proposition 3.5 ([3, Corollary 4.17]).

Let $G$ be a finitely generated group which is strongly algebraically thick of order at most $n$ . Then for every $\rho\in(0,1]$ , $\delta_{\rho}(r)\lesssim r^{n+1}$ .

The group theoretic motivation for studying the $\mathcal{CFS}$ property is that it provides a graph theoretical proxy for certain geometric properties of right-angled Coxeter groups, such as their divergence. To see that $W_{\Gamma}$ has quadratic divergence when $\Gamma$ has the $\mathcal{CFS}$ property is straightforward, since interpreting the definition of $\mathcal{CFS}$ in the Cayley graph yields a chain of linearly many spaces with linear divergence with each intersecting the next in an infinite diameter set. Indeed, it is an immediate consequence of the definitions that if if $G$ is the direct product of two infinite groups then $G$ has linear divergence, just as a path avoiding a linear-sized ball in the plane has linear length. Hence every finitely generated abelian group of rank at least $2$ has linear divergence (and is thick of order $0$ ).

Now, if $\Gamma=K\star\Gamma^{\prime}$ where $K$ is a clique, then $W_{\Gamma}\cong\mathbb{Z}_{2}^{|K|}\times W_{\Gamma^{\prime}}$ . In such a case $W_{\Gamma^{\prime}}$ is a finite-index subgroup of $W_{\Gamma^{\prime}}$ and thus, up to finite index, we can assume that $\Gamma$ does not contain a vertex sending an edge to all other vertices of $\Gamma$ .

Now, $W_{\Gamma}$ contains a network of convex subgroups generated by the induced square in the full-support component of $\square(\Gamma)$ . Each of these groups is virtually $\mathbb{Z}^{2}$ , that is to say has a finite index subgroup which is a copy of $\mathbb{Z}^{2}$ . Further, two induced squares in $\Gamma$ correspond to incident edges in $\square(\Gamma)$ if and only if the intersection of the associated virtual $\mathbb{Z}^{2}$ subgroups is virtually $\mathbb{Z}$ , that is to say has a finite index subgroup which is a copy of $\mathbb{Z}$ .

Thus, paths in the full-support component of $\square(\Gamma)$ give the connecting sequences needed in Definition 3.4. Hence if $\Gamma$ has the $\mathcal{CFS}$ property, $W_{\Gamma}$ is thick of order at most $1$ and has at most quadratic divergence.

As shown by Dani–Thomas in the triangle-free case [8, Theorem 1.1 and Remark 4,8], and by the present authors with Hagen in the general case (as above) [5, Proposition 3.1], if $\Gamma$ has the $\mathcal{CFS}$ property then the associated right-angled Coxeter group $W_{\Gamma}$ has thickness of order at most $1$ , and hence has at most quadratic divergence. Further, in [6], Behrstock, Hagen and Sisto show that a right-angled Coxeter group $W_{\Gamma}$ has linear divergence (and is thick of order $0$ ) if and only if $\Gamma$ is the join of two non-complete graphs. Finally Levcovitz proved that any graph without $\mathcal{CFS}$ has at least cubic divergence [15], and so we see that $W_{\Gamma}$ has exactly quadratic divergence if and only if $\Gamma$ is not the join of two non-complete graphs and has the $\mathcal{CFS}$ property.

3.2. Proof of threshold for quadratic divergence in random RACGs

Assuming our main theorems about square percolation, Theorems 1.4 and 1.5, we are now in a position to provide a proof of Theorem 1.6 on the threshold for quadratic divergence in RACGs:

Proof of Theorem 1.6 from Theorems 1.4 and 1.5.

Let $\epsilon>0$ , and suppose that

\frac{\lambda_{c}+\epsilon}{\sqrt{n}}\leq p(n)\leq 1-\frac{(1+\epsilon)\log{n}}{n}.

By Theorem 1.5, the graph $\Gamma$ a.a.s. has the $\mathcal{CFS}$ property. Thus, by [5, Proposition 3.1], $W_{\Gamma}$ a.a.s. has at most quadratic divergence and is thick of order at most 1. Further, since $1-p(n)\geq\frac{(1+\epsilon)\log{n}}{n}$ , standard results on the connectivity of Erdős–Rényi random graphs tell us that a.a.s. the complement of $\Gamma$ is connected, and thus that $\Gamma$ itself is a.a.s. not the join of two non-trivial graphs. Thus, [6] implies that $W_{\Gamma}$ is not thick of order $0$ and hence is thick of order exactly one and has precisely quadratic divergence.

On the other hand, if

\frac{\lambda_{c}-\epsilon}{\sqrt{n}}\leq p(n),

then by Theorem 1.4 no component of the square graph can have full support, and thus the graph $\Gamma$ is not $\mathcal{CFS}$ . It then follows from [15], that $W_{\Gamma}$ has at least cubic divergence, and thus by Proposition 3.5 that it is not thick of order 1. ∎

4. Branching processes and proof strategy

4.1. Branching processes

We recall here some basic facts and definitions from the theory of branching processes that we will use in our argument; for a more general treatment of such processes, see e.g. [2].

Definition 4.1.

A Galton–Watson branching process $\mathbf{W}=(W_{t})_{t\in\mathbb{Z}_{\geq 0}}$ with offspring distribution $X$ is a sequence of non-negative integer-valued random variables with $W_{0}=1$ and for all $t\geq 1$ , $W_{t}=\sum_{i=1}^{W_{t-1}}X_{i,t}$ , where the $X_{i,t}$ : $i,t\in\mathbb{N}$ are independent, identically-distributed random variables with $X_{i,t}\sim X$ for all $i,t$ .

A Galton–Watson branching process can be viewed as a random rooted tree: in the zero-th generation there is a root or ancestor, who begets a random number $X_{1,1}\sim X$ of children that form the first generation. In every subsequent generation, each child independently begets a random number of children, with the $i$ -th member of generation $t$ begetting $X_{i,t}\sim X$ children.

Galton–Watson branching processes are a widely studied family of random processes and are the subject of much probabilistic research; see e.g. [2] and the references therein. Here we introduce only some fairly standard elements of the theory that are needed for our argument. A Galton–Watson process $\mathbf{W}$ is said to become extinct if $W_{t}=0$ for some $t\in\mathbb{N}$ . The total progeny of $\mathbf{W}$ is the total number of vertices in the associated tree, which we denote by $W=\sum_{t=0}^{\infty}W_{t}$ ; this quantity is finite if and only if $\mathbf{W}$ becomes extinct.

A key tool in the study of $\mathbf{W}$ is the generating function of its offspring distribution, $f_{X}(t)=\mathbb{E}t^{X}$ . The following standard results from the theory of branching processes relate the probability of extinction for $\mathbf{W}$ to the mean and generating function of its offspring distribution $X$ .

Proposition 4.2 (See e.g. [2]).

Let $\mu=\mathbb{E}X$ and $f(t)=f_{X}(t)$ . Let $\mathbf{W}$ be a Galton–Watson branching process with offspring distribution $X$ . Then the following hold:

(i)

(subcritical regime) if $\mu<1$ , then almost surely $\mathbf{W}$ becomes extinct, and what is more

\mathbb{P}\left(\mathbf{W}\textrm{ has not become extinct by generation }k\right)=\mathbb{P}(W_{k}\neq 0)\leq\mu^{k}.

(ii)

(supercritical regime) if $\mu>1$ , then the probability $\theta_{e}$ that $\mathbf{W}$ becomes extinct is the smallest solution $\theta\in[0,1]$ to the equation

$f(\theta)=\theta,$

and satisfies $\theta_{e}<1$ .

We shall also need the following result on the distribution of the total progeny $W$ of $\mathbf{W}$ .

Proposition 4.3 (Dwass’s formula [11]).

Let $\mathbf{W}$ be a Galton–Watson branching process with offspring distribution $X$ . Then the total progeny $W$ satisfies

\mathbb{P}\left(W=k\right)=\frac{1}{k}\mathbb{P}\left(X_{1}+X_{2}+\cdots X_{k}=k-1\right),

where $X_{1},X_{2},\ldots,X_{k}$ are independent, identically distributed random variables with $X_{i}\sim X$ for all $i\in[k]$ .

4.2. Departures from the Bollobás–Riordan framework

Bollobás and Riordan in [7] developed a powerful branching process framework for the study of clique percolation. Much of that framework can be adapted to the study of the non-induced square percolation we are concerned with in this paper. However there remains a number of significant hurdles which need to be overcome in order to extend their techniques to the present setting.

In the subcritical regime, the structure of squares makes the analysis of exceptional edges and offspring distributions (which are the crux of the argument) differ significantly from the Bollobás–Riordan paper; care is needed to handle the resulting complications correctly. Indeed, Bollobás and Riordan are able to model clique percolation using a Galton–Watson branching process whose offspring distribution is roughly Poisson; however, for square percolation, the offspring distribution is more heavy-tailed, forcing us to resort to somewhat delicate technical arguments.

Further, in the supercritical regime, because of our motivation from geometric group theory, we are interested in the study of induced square percolation. In particular, adding new edges to a graph could destroy some induced squares and hence split apart square-components even as we are trying to build a giant square-component. This situation is quite unlike that in clique percolation, and we have to use a completely different sprinkling argument to obtain our results (inter alia sprinkling vertices rather than edges). Thus here again there are significant complications and major departures from Bollobás and Riordan’s framework in [7].

4.3. Proof strategy

Our results rely on the analysis of a branching process exploration of the square-components of a graph $\Gamma\in{\mathcal{G}}(n,p)$ for some fixed $p=\lambda n^{-1/2}$ where $\lambda>0$ .

We begin with an arbitrary induced square $S_{1}=abcd$ in $\Gamma$ . Its diagonals $ac$ and $bd$ give us two pairs of non-edges which can be used to discover further non-edges of $\Gamma$ belonging to the same square-component. The size of the set $\left(\Gamma_{a}\cap\Gamma_{c}\right)\setminus\{b,d\}$ of common neighbors of $a$ and $c$ in $V(\Gamma)\setminus\{b,d\}$ is a binomially distributed random variable $Z\sim\mathrm{Binom}(n-4,p^{2})$ . Assuming that $\Gamma_{a}\cap\Gamma_{c}$ is an independent set (i.e., contains no edge of $\Gamma$ ) these common neighbors together with $b,d$ give rise to $\binom{Z+2}{2}$ non-edges that lie in the same square-component as $ac$ ; however, since we already knew about the pair $bd$ , only $X=\binom{Z+2}{2}-1$ of these are new. We then pursue our exploration of the square-component of $ac$ by iterating this procedure: for each as-yet untested non-edge $xy$ in our square-component, we can first find the common neighbors of $xy$ , and add as “children” of $xy$ all the new non-edges discovered in this way.

This can be viewed as a Galton–Watson branching process $\mathbf{W}$ with offspring distribution $X$ in a natural way. Assuming the past exploration does not greatly interfere with the distribution of the number of children in our process, the expected number of children at each step is roughly equal to

\mathbb{E}X=\mathbb{E}\left(\binom{Z+2}{2}-1\right)=\mathbb{E}\left(\frac{Z^{2}+3Z}{2}\right).

The expected value of $X$ is readily computed from the first and second moments of the binomial distribution of $Z$ , yielding

\mathbb{E}X=\frac{1}{2}\lambda^{4}+2\lambda^{2}+o(1).

The Galton–Watson process $\mathbf{W}$ becomes critical when the expectation of its offspring distribution is $1$ . Solving

\frac{1}{2}\lambda^{4}+2\lambda^{2}=1

and selecting the non-negative root $\lambda_{c}=\sqrt{\sqrt{6}-2}=0.6704\ldots$ , we thus see that for for any fixed $\lambda<\lambda_{c}$ , our branching process $\mathbf{W}$ is subcritical. We thus expect it to terminate a.a.s. after a fairly small number of steps, from which one can hope to, in turn, deduce that a.a.s. all square-components are small. On the other hand, for any fixed $\lambda>\lambda_{c}$ , $\mathbf{W}$ is supercritical, and with probability strictly bounded away from zero it does not terminate before we have discovered a reasonably large number of non-edges. A second-moment argument can then be used to show that a strictly positive proportion of non-edges must lie in reasonably large square-components. With a little glueing work, we can then hope to show that in fact a strictly positive proportion of non-edges lie in a giant square-component that covers all the vertices of $\Gamma$ .

The above is however a simplification of what is actually required to make the arguments go through, and the situation turns out to be considerably more nuanced than what we described above. A first issue is our assumption that the vertices in $Z$ form an independent set: in the subcritical regime, we need to consider what happens if the set $Z$ of common neighbors of some non-edge $xy$ which we are testing interacts with some other previously discovered vertices, or with vertices in $Z$ . In particular, any “exceptional” edge from $Z$ to previously discovered vertices other than $x$ , $y$ could potentially create many additional squares, and hence add many new pairs to our square-component which are not accounted for by our branching process. Bollobás and Riordan faced a similar problem in their work on clique percolation. However, as stated in the previous subsection, the way they dealt with “exceptional edges” does not quite work for us in the square percolation setting. One issue is that in a copy of $C_{4}$ , vertices on opposite sides of a diagonal are not adjacent, so that the number of $4$ -cycles created by an exceptional edge cannot always be bounded by the degree of a newly discovered vertex. In addition, we note that if it is not dealt with properly, the presence of exceptional edges could significantly affect the future distribution of the number of children in our branching process: if in the example above $a,c$ had three common neighbors among the already discovered vertices rather than two, then the correct number of children for $ac$ in the exploration process would be $\binom{Z+3}{2}-3=\binom{Z+2}{2}-1+Z$ , which has expectation equal to $1+\lambda_{c}^{2}>1$ when $\lambda=\lambda_{c}$ . Finally, for the argument to work, we need not only for a Galton–Watson branching process with offspring distribution $X$ to become a.a.s. extinct within a few generations (which is an easy first moment argument): we also need its total progeny to be a.a.s. small. Here the fact that $X$ is a quadratic function of the binomial random variable $Z$ (and thus rather heavy-tailed) causes complicated issues, which were not faced in [7] (where the offspring distribution was essentially Poisson with mean $<1$ ). Overcoming these problems is the main work done in Section 5.

Secondly, in the supercritical argument, after establishing the a.a.s. existence of many non-edges in reasonably large square-components, we must prove the a.a.s. existence of a giant square-component covering all vertices and a strictly positive proportion of non-edges of $\Gamma$ . Here the crucial point is that, because of the applications in geometric group theory motivating our work, we are considering induced square percolation. The size of a largest square-component in $\Gamma$ is not monotone with respect to the addition of edges to the graph — adding an edge could very well destroy an induced square, thus potentially breaking a large square-component in to several smaller pieces. So we have to use a completely new sprinkling argument to be able to agglomerate all “reasonably large” square-components into a single giant square-component. To do this we reserve some vertices for sprinkling, rather than edges. We use these vertices to build bridges between reasonably large square-components in a sequence of rounds until all such components are joined into one. Finally once we have established the a.a.s. existence of a giant square-component, some care is needed to ensure this square-component covers every vertex of $\Gamma$ . Assembling a giant square-component and ensuring it has full support in this way involves overcoming a number of interesting obstacles, and is the main work done in Section 6.

5. The subcritical regime: proof of Theorem 1.4

Theorem 1.4 will be established as an immediate consequence of a stronger result, Theorem 5.1, which we state and prove below after providing a few preliminary definitions.

Given a graph $\Gamma$ , in addition to the square-graph $\square(\Gamma)$ from Definition 1.1 we shall consider a different but closely related auxiliary graph $\boxtimes(\Gamma)(\Gamma)$ that includes information about all squares in $\Gamma$ (rather than just the induced squares). Explicitly we let $\boxtimes(\Gamma):=(V(\Gamma)^{(2)},\{\{ac,bd\}:\ ab,bc,cd,ad\in E(\Gamma)\})$ be the graph whose vertices are pairs of vertices from $V(\Gamma)$ and whose edges correspond to (not necessarily induced) copies of $C_{4}$ . The support $\mathrm{supp}(C)$ of a component $C$ of $\boxtimes(\Gamma)$ is defined as in Definition 1.3, mutatis mutandis.

Note that the square graph $\square(\Gamma)$ is exactly the subgraph of $\boxtimes(\Gamma)$ induced by the set $\{ab\in V(\Gamma)^{(2)}:\ ab\notin E(\Gamma)\}$ of non-edges of $\Gamma$ . In particular, for every square-component $C$ in $\square(\Gamma)$ , there is a component $C^{\prime}$ in $\boxtimes(\Gamma)$ with $C\subseteq C^{\prime}$ and thus $\operatorname{\mathrm{supp}}(C)\subseteq\operatorname{\mathrm{supp}}(C^{\prime})$ . To establish Theorem 1.4, it is thus enough to prove the following stronger theorem that bounds the size of the support in $\Gamma$ of components of $\boxtimes(\Gamma)$ .

Theorem 5.1.

Let $\lambda<\lambda_{c}$ be fixed. Suppose that $p(n)\leq\lambda n^{-1/2}$ . Then for $\Gamma\in{\mathcal{G}}(n,p)$ , a.a.s. every component of $\boxtimes(\Gamma)$ has a support of size $O((\log n)^{2^{31}})$ .

Since the order of the support of the largest component in $\boxtimes(\Gamma)$ is monotone non-decreasing with respect to the addition of edges to $\Gamma$ , we may assume in the remainder of this section that $p(n)=\lambda n^{-1/2}$ . Further, since $\lambda<\lambda_{c}$ is fixed, there exists a constant $\varepsilon>0$ such that for a binomially distributed random variable $Z\sim\mathrm{Binom}(n,p^{2})$ we have

(5.1)

\displaystyle\mathbb{E}\left(\binom{Z+2}{2}-1\right)=1-\varepsilon.

With this last equality in hand, we are now ready to present and analyse the exploration process that lies at the heart of our proof of Theorem 5.1.

We shall discover a superset of the component of $\boxtimes(\Gamma)$ which contains sine fixed pair $v_{1}v_{2}\in V_{\Gamma}^{(2)}$ . We begin our exploration by finding common neighbors of $v_{1}$ and $v_{2}$ , then adding all pairs of such newly discovered vertices to a set of unexplored pairs. These new pairs obviously lie in the same component of $\boxtimes(\Gamma)$ as $v_{1}v_{2}$ .

After this initial step in the exploration, we proceed as follows. First, we choose a new pair $a_{t}$ from our set of unexplored pairs. By assumption, there exists a previously explored pair $b_{t}$ such that all $4$ edges from $a_{t}$ to $b_{t}$ are present. We continue our exploration by finding the set $Z_{t}$ of common neighbors of the vertices in $a_{t}$ among the previously undiscovered vertices of $\Gamma$ . We then add all pairs $\left(Z_{t}\cup b_{t}\right)^{(2)}\setminus\{b_{t}\}$ to our set of active pairs — these obviously lie in the same component of $\boxtimes(\Gamma)$ as $a_{t}$ — and delete $a_{t}$ from that set. We then repeat the procedure, choosing a new unexplored pair $a_{t+1}$ , finding its common neighbors among undiscovered vertices, etc.

This, however, is not enough to discover the totality of the component of $v_{1}v_{2}$ in $\boxtimes(\Gamma)$ . Indeed, it is possible that the pair $a_{t}$ has additional common neighbors among already discovered vertices (in addition to the two neighbors in $b_{t}$ ), which could give rise to additional, unexplored pairs that lie in the same component of $\boxtimes(\Gamma)$ as $a_{t}$ . To deal with this possibility, we have to add in an exceptional phase in our exploration, which takes care of potential additional edges among the vertices we have discovered. In this exceptional phase, we generously overestimate how many new unexplored pairs could be discovered, and for each of these new pairs we start new and essentially independent versions of our exploration process, which we think of as children processes of our original exploration process.

The key is that, with the exceptional phase factored in, we do discover a superset of the collection of all pairs in the same component of $\boxtimes(\Gamma)$ as our starting pair $v_{1}v_{2}$ . We compare the non-exceptional phase of our exploration to a subcritical branching process and give upper bound on its total progeny, which is small. Obtaining this bound is somewhat tricky (due to the nature of our offspring distribution) and relies on a rather technical application of Dwass’s formula (Proposition 4.3). The remainder of the proof is provided in Lemma 5.8 which shows that, with high probability, we do not run through more than five exceptional phases. In particular, we do not start too many child processes, so that with high probability our overall exploration process stops before we have discovered a large number of vertices.

5.1. An exploration process

Our exploration process will proceed by considering the following for each time $t\geq 0$ :

•

(Discovered vertices.) An ordered set of vertices: $D_{t}=\{v_{1},v_{2},\ldots,v_{d_{t}}\}$ .
•

(Active pairs.) A set of pairs of vertices (ordered lexicographically with respect to the ordering on $D_{t}$ ): $A_{t}=\{x_{1}y_{1},x_{2}y_{2},\ldots,x_{a_{t}}y_{a_{t}}\}\subseteq D_{t}^{(2)}$ .
•

(Discovered pairs.) A set of pairs of vertices: $S_{t}\subseteq D_{t}^{(2)}$ .
•

(Explored edge set.) A set of edges: $E_{t}\subseteq D_{t}^{(2)}\cap E(\Gamma)$ .
•

(Epoch.) An integer: $e_{t}\in\{0,1,2,3,4,5\}$ .

These sets will satisfy:

( $\star$ )

for all $t\geq 0$ and for every active pair $x_{i}y_{i}\in A_{t}$ , the vertices $x_{i}$ and $y_{i}$ have either $0$ or $2$ common neighbors in the graph $(D_{t},E_{t})$ .

The initial state of the exploration consists of the following data, which is seeded by a choice of $v_{1}v_{2}$ , an arbitrary pair of vertices from $V(\Gamma)$ (note that this pair can, alternatively, be thought of as a vertex of $\boxtimes(\Gamma)$ ). We set $D_{0}=\{v_{1},v_{2}\}$ , $A_{0}=S_{0}=\{v_{1}v_{2}\}$ , $E_{0}=\emptyset$ and $e_{0}=0$ .

At each time step $t$ our exploration proceeds as follows, with $\varepsilon$ as given in equation (5.1):

1.

If $|D_{t}|<2^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}}$ and $A_{t}\neq\emptyset$ , let $a={x}{y}$ be the first pair in $A_{t}$ . For each $z\in V(\Gamma)\setminus D_{t}$ we test whether or not $z$ sends an edge in $\Gamma$ to both vertices of $a$ . Set $Z_{t}:=\{z\in V(\Gamma)\setminus D_{t}:\ {z}{x},{z}{y}\in E(\Gamma)\}$ and $F_{t}$ to be the collection of joint neighbors of $x$ and $y$ in $(D_{t},E_{t})$ . (Note, by property $(\star)$ , the set $F_{t}$ is either empty or consists of a pair of discovered vertices.)

We arbitrarily order the vertices in $Z_{t}$ as $\{v_{d_{t}+1},v_{d_{t}+2},\ldots,v_{d_{t+1}}\}$ , and add them to $D_{t}$ to form $D_{t+1}$ . We then set

$A_{t+1}=\left(A_{t}\cup\left(F_{t}\cup Z_{t}\right)^{(2)}\right)\setminus\left(a\cup F_{t}^{(2)}\right)$

to be the new collection of active pairs (which again is ordered lexicographically with respect to the ordering on $D_{t}$ ), set $E_{t+1}=E_{t}\cup\{zx,zy:\ z\in Z_{t}\}$ , set $e_{t+1}=e_{t}$ , $S_{t+1}=S_{t}\cup A_{t+1}$ and then proceed to the next time step $t+1$ of the process. Note that since the only new edges being added are ones connecting a new vertex to $x$ and $y$ and since $xy\notin A_{t+1}$ each pair in $A_{t+1}$ has either $0$ or $2$ common neighbors in $(D_{t+1},E_{t+1})$ , i.e., property ( $\star$ ) is still satisfied in the next time-step.
2.

If $|D_{t}|\geq 2^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}}$ , then we terminate the process and declare large stop.
3.

If $|D_{t}|<2^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}}$ and $A_{t}=\emptyset$ , then we consider $i=|E(\Gamma[D_{t}])\setminus E_{t}|$ .

If $i=0$ or $e_{t}+i\geq 5$ , then we terminate our exploration process and declare extinction stop or exceptional stop, respectively.
Otherwise we set $e_{t+1}=e_{t}+i$ , update $E_{t}$ by setting $E_{t}=E(\Gamma[D_{t}])$ , and set $i_{1}=i$ (which by assumption is $>0$ ). We then run the following subroutines:
1. 3A.
  Set
  
  $Z^{1}_{t}:=\{z\in V(\Gamma)\setminus D_{t}:\ z\textrm{ sends at least three edges into }D_{t}\}.$
  
  We then update our value of $i_{1}$ , setting $i_{1}=|Z^{1}_{t}|$ .
  - •
    
    If $i_{1}>0$ and $e_{t+1}+i_{1}>5$ , then we terminate the whole exploration process and declare exceptional stop.
  - •
    
    Else if $i_{1}>0$ and $e_{t+1}+i_{1}\leq 5$ , we add $Z^{1}_{t}$ to $D_{t}$ , update $E_{t}$ by setting $E_{t}=E(\Gamma[D_{t}])$ , update $S_{t}$ by setting $S_{t}=D_{t}^{(2)}$ . We then update $e_{t+1}$ to $e_{t+1}+i_{1}$ and run through subroutine 3.3A. again.
  - •
    
    Otherwise $i_{1}=0$ and we proceed to subroutine 3.3B.
2. 3B.
  
  Let
  
  $Z^{2}_{t}:=\{z\in V(\Gamma)\setminus D_{t}:\ z\textrm{ sends at least two edges into }D_{t}\},$
  
  Since subroutine 3A. terminated with $i_{1}=0$ each vertex in $Z^{2}_{t}$ sends exactly two edges into $D_{t}$ . We set $D_{t+1}=D_{t}\cup Z^{2}_{t}$ , and let $A_{t+1}$ consist of all pairs of vertices in $D_{t+1}$ containing at least one vertex of $Z^{2}_{t}$ . Further, we set $S_{t+1}=S_{t}\cup A_{t+1}$ .
  
  Once this is done, we proceed to the next time-step $t+1$ in the overall exploration process, observing that property $(\star)$ has been preserved (since by construction every vertex in $Z^{2}_{t}$ has degree exactly two in $(D_{t+1},E_{t+1})$ ).

\labellist

\hair

2pt \pinlabel $D_{t}$ [ ] at 7 8 \pinlabel $F_{t}$ [ ] at 12 40 \pinlabel $x$ [ ] at 36 91 \pinlabel $y$ [ ] at 36 9 \pinlabel $f_{1}$ [ ] at 27 51 \pinlabel $f_{2}$ [ ] at 46 51 \pinlabel $z_{1}$ [ ] at 136 40 \pinlabel $z_{2}$ [ ] at 153 40 \endlabellist\includegraphics[scale=1.0]Figure_Process_1

Figure 1. An illustration of Stage 1 of the Exploration Process, with exploration from the active pair

a=xy

at time

t

. In this example,

F_{t}=\{f_{1},f_{2}\}

and the set of newly discovered vertices is

Z_{t}=\{z_{1},z_{2}\}

. We thus have

A_{t+1}\setminus A_{t}=\{f_{1}z_{1},f_{2}z_{1},f_{1}z_{2},f_{2}z_{2},z_{1}z_{2}\}

and

A_{t}\setminus A_{t+1}=\{xy\}

\labellist

\hair

2pt \pinlabel $v_{1}$ [ ] at 119 99 \pinlabel $v_{2}$ [ ] at 119 18 \pinlabel $w_{1}$ [ ] at 51 64 \pinlabel $w_{2}$ [ ] at 154 64 \pinlabel $z_{1}$ [ ] at 87 65 \pinlabel $z_{2}$ [ ] at 124 3 \endlabellist\includegraphics[scale=1.0]Figure_Process_2

Figure 2. Illustration of Stage 3 of the Exploration Process. Suppose the pairs

w_{1}w_{2},w_{1}z_{1},w_{2}z_{1}\in S_{t}

were discovered while

v_{1}v_{2}

was active. When

w_{1}w_{2}

is active in Stage 1 the pairs

\{v_{1}z_{2},v_{2}z_{2}\}

will be discovered. However the pair

z_{1}z_{2}

will only be discovered in an instance of Stage 3, if the two dashed edges are revealed to be present (thus making

w_{1}z_{1}w_{2}z_{2}

a square).

5.2. Analysing the process

The exploration process defined in the previous subsection can terminate for one of three reasons:

(1)

$|D_{t}|\geq 2^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}}$ (large stop);
(2)

$e_{t}\geq 5$ (exceptional stop);
(3)

$A_{t}=\emptyset$ and $E_{t}=\Gamma[D_{t}]$ (extinction stop).

It follows from the above that the process must in fact terminate within $O\left((\log n)^{2^{32}}\right)$ time-steps. We begin our analysis by noting that, given our aim of proving Theorem 1.4, extinction stops are good for us:

Lemma 5.2.

Suppose that the exploration from ${v_{1}}{v_{2}}$ terminates at time $T$ with an extinction stop. Let $C$ be the component of $\boxtimes(\Gamma)$ containing ${v_{1}}{v_{2}}$ . Then $C\subseteq S_{T}$ . Furthermore, the number of vertices in the support of $C$ is at most $2^{2^{10}}\epsilon^{-2^{10}}\left(\log{n}\right)^{2^{31}}$ and $|C|\leq 2^{2^{11}}\epsilon^{-2^{11}}\left(\log{n}\right)^{2^{32}}$ .

Proof.

We perform our exploration process from the pair $v_{1}v_{2}$ , and assume it terminates with an extinction stop at time $T$ . It is enough to show that given $a=u_{1}u_{2}\in C\cap S_{t}$ , for every neighbor $b=w_{1}w_{2}$ of $a$ in $\boxtimes(\Gamma)$ , there is some $t^{\prime}\leq T$ such that $b\in S_{t^{\prime}}$ .

If $a$ was discovered at a time-step where 1. applied or if $a\in A_{0}$ , then $a\in A_{t}$ and was an active pair at some time $t\geq 0$ . Thus, at some later time-step $t^{\prime}$ where 1. applies, our exploration process selects $a$ as its “exploration pair” and discover all neighbors $z_{1}z_{2}$ of $a$ in $\boxtimes(\Gamma)$ with $z_{1}z_{2}\in\left(Z_{t^{\prime}}\cup F_{t^{\prime}}\right)^{(2)}$ — where $F_{t^{\prime}}$ is the pair we used to discover $a$ , and $Z_{t^{\prime}}$ is the collection of joint neighbors of $u_{1}$ and $u_{2}$ that lie in $V(\Gamma)\setminus D_{t^{\prime}}$ . If $b$ is in this set, then $b\in S_{t^{\prime}}$ .

Otherwise, if we failed to find $b=w_{1}w_{2}$ at this time $t^{\prime}$ , $b$ must contain at least one vertex from $D_{t^{\prime}}\setminus F_{t^{\prime}}$ . By property $(\star)$ of our exploration, at least one of the edges $u_{i}w_{j}$ , $i,j\in\{1,2\}$ lies outside $E_{t^{\prime}}$ .

In particular, since we do not end with a large or exceptional stop, this edge will be uncovered at later time step $t^{\prime\prime}$ where 3.3A. applies. But by the end of 3.3B., all vertices sending at least two edges into $D_{t^{\prime\prime}}$ have been added to $D_{t^{\prime\prime}}$ . Thus all common neighbors of $a$ will be found (since $a\subseteq D_{t^{\prime\prime}}$ , and a common neighbor of $a$ has at least two neighbors in $D_{t^{\prime\prime}}$ ). Hence after the updates $b\subseteq D_{t^{\prime\prime}}$ . There are two options: if both vertices of $b$ are present after 3.3A., then $b\in S_{t^{\prime\prime}}$ , since after 3.3A. all possible pairs of discovered vertices (not already tested) are added to $S_{t^{\prime\prime}}$ . Otherwise, both vertices of $b$ are present after 3.3B., and since at least one of them was discovered in 3.3B., $b$ is added to $A_{t^{\prime\prime}+1}\subseteq S_{t^{\prime\prime}+1}$ , and we are done again.

If on the other hand $a$ was discovered at a step $t$ where 3.3B. applies, then $a$ is added to $A_{t+1}$ , and the above applies. Finally, if $a$ was discovered at a time step $t$ where 3.3A. applies, then in 3.3A. and 3.3B., all common neighbors of $a$ are added to $D_{t}$ , and all such pairs are added to $S_{t}$ or $S_{t+1}$ .

Either way, since $S_{t}\subseteq S_{T}$ for all $t\leq T$ , we see that $b\in S_{T}$ . Thus every neighbor of $a$ in $\boxtimes(\Gamma)$ is eventually discovered by our exploration process, and $C\subseteq S_{T}$ as claimed. Further, since $S_{T}\subseteq D_{T}^{(2)}$ by construction, and since our exploration ends with an extinction stop by the hypothesis of the lemma, we have $|C|\leq\frac{1}{2}|D_{T}|^{2}<2^{2^{11}}\varepsilon^{-2^{11}}(\log n)^{2^{32}}$ as claimed. ∎

We now turn to the technical crux of the analysis.

Lemma 5.3.

If we are at a time-step $t$ of the process where 1. applies, then given the past history of the process, the random variable $|A_{t+1}\setminus A_{t}|$ counting the number of new active pairs discovered by $a_{1}=x_{1}y_{1}$ is stochastically dominated by a random variable $X=\frac{Z^{2}+3Z}{2}$ , where $Z$ is a binomial random variable with parameters $n$ and $p^{2}$ .

Proof.

As observed by Bollobás and Riordan [7, Inequality (3)], the past of the exploration is the intersection of the principal increasing event $\mathcal{U}_{t}=\{E_{t}\subseteq E(\Gamma)\}$ (corresponding to the edges that we have discovered are present in $\Gamma$ ) and a decreasing event $\mathcal{D}$ (corresponding to the intersection of a number of events of the form “at least one of $zx,zy$ is not in $E(\Gamma)$ ”). In particular appealing to Harris’s Lemma [13], the conditional probability that $x$ and $y$ both send an edge to $z\in V(\Gamma)\setminus D_{t}$ in $\Gamma$ given the history is at most the unconditional probability $p^{2}$ . (We note here that the fact $z\notin D_{t}$ is essential — we have no control over the conditional probabilities of edges inside the set of discovered vertices $D_{t}$ .)

Thus conditional on the past history of the exploration process, the distribution of $|Z_{t}|$ is stochastically dominated by a random variable $Z\sim\mathrm{Binom}(n,p^{2})$ . The Lemma then immediately follows from the definition of our exploration process. (Note that here we are using the fact property ( $\star$ ) is maintained throughout our exploration.) ∎

Lemma 5.4.

Let $Z\sim\mathrm{Binom}(n,p^{2})$ and $k\in\mathbb{N}$ . Then $\mathbb{P}(Z\geq 9\log n+9\log k)\leq n^{-5}k^{-6}$ .

Proof.

Recall that in this section, $p=\lambda n^{-1/2}$ , for some constant $\lambda<\lambda_{c}$ . Since $Z\sim\mathrm{Binom}(n,p^{2})$ , we have:

	$\displaystyle\mathbb{P}\Bigl{(}Z\geq 9(\log n+\log k)\Bigr{)}$	$\displaystyle=\sum_{r=\lceil 9(\log n+\log k)\rceil}^{n}\binom{n}{r}p^{2r}(1-p^{2})^{n-r}<\sum_{r=\lceil 9(\log n+\log k)\rceil}^{n}n^{r}\left(\lambda_{c}n^{-1/2}\right)^{2r}$
		$\displaystyle=\sum_{r=\lceil 9(\log n+\log k)\rceil}^{n}\left(\lambda_{c}\right)^{2r}<n\lambda_{c}^{2\lceil 9(\log n+\log k)\rceil}\leq n\exp\left(18\log\lambda_{c}(\log n+\log k)\right),$

where in the last two inequalities we used the fact $(\lambda_{c})^{2}=\sqrt{6}-2<1$ . Since $18\log\lambda_{c}<-6$ , this immediately gives us the desired bound

\mathbb{P}\left(Z\geq 9\log n+9\log k\right)<ne^{-6\log n-6\log k}=n^{-5}k^{-6}.

∎

Corollary 5.5.

$\mathbb{P}\left(\exists\ {x}{y}\in V(\Gamma)^{(2)}\textrm{ : }|\Gamma_{x}\cap\Gamma_{y}|\geq 9\log n\right)\leq n^{-3}$ .

Proof.

Fix ${x}{y}\in V(\Gamma)^{(2)}$ . By Lemma 5.4 with $k=1$ , we have

\displaystyle\mathbb{P}\left(|\Gamma_{x}\cap\Gamma_{y}|\geq 9\log n\right)=\sum_{r=\lceil 9\log n\rceil}^{n-2}\binom{n-2}{r}p^{2r}(1-p^{2})^{n-2r}<\mathbb{P}\left(Z\geq 9\log n\right)\leq n^{-5}.

Taking a union bound over all $\binom{n}{2}<n^{2}$ possible choices of the pair $xy$ , the lemma follows. ∎

We now analyse the total progeny of the Galton–Watson branching process with offspring distribution given by the random variable $X$ from the statement of Lemma 5.3. By (5.1), $\mathbb{E}X=1-\epsilon$ and thus the branching process is subcritical. Unfortunately, combining the Markovian bound on the extinction time from Proposition 4.2(i), with the bounds on the maximum degree in $\boxtimes(\Gamma)$ from Corollary 5.5 does not give us sufficiently good control on the extinction time and total progeny of our Galton–Watson process. Thus we turn to an application of Dwass’s formula to obtain the tighter bounds needed for the proof of Theorem 1.4.

Lemma 5.6.

Let $\mathbf{W}=\left(W_{t}\right)_{t\in\mathbb{Z}_{\geq 0}}$ be a Galton–Watson branching process with an offspring distribution $X$ as in Lemma 5.3. Set $k_{0}=2^{26}\varepsilon^{-2}(\log n)^{5}$ . Then $\mathbf{W}$ is subcritical, and its total progeny $W=\sum_{t=0}^{\infty}W_{t}$ satisfies

\mathbb{P}\left(W\geq k_{0}\right)=O\left(n^{-5}\right).

Proof.

Let $\{X_{k,j}:\ k\in\mathbb{N},\ j\in[k]\}$ be an infinite family of independent, identically distributed copies of $X$ . For each $k\in\mathbb{N}$ and every $j\in[k]$ , write $X_{k,j}$ as $X_{k,j}={X^{a}}_{k,j}+{X^{b}}_{k,j}$ , where ${X^{a}}_{k,j}=\min\left(X_{k,j},2^{8}(\log n+\log k)^{2}\right)$ . Set $\mu_{k}^{a}=\mathbb{E}({X^{a}}_{k,1})$ . By construction, $\mu_{k}^{a}\leq\mathbb{E}[X]=1-\varepsilon$ . Since $X=(Z^{2}+3Z)/2\leq 2Z^{2}$ , where $Z\sim\mathrm{Binom}(n,p^{2})$ , and since $2\left(9(\log n+\log k)\right)^{2}<2^{8}(\log n+\log k)^{2}$ , Lemma 5.4 implies that for every $k\in\mathbb{N}$ and $j\in[k]$ ,

(5.2)

\displaystyle\mathbb{P}\left({X^{b}}_{k,j}>0\right)\leq\mathbb{P}\left(2Z^{2}\geq 2^{8}(\log n+\log k)^{2}\right)\leq\mathbb{P}\left(Z\geq 9\left(\log n+\log k\right)\right)\leq n^{-5}k^{-6}.

Applying Dwass’s formula, Proposition 4.3, to our branching process $\mathbf{W}$ , we have that for any $k\in\mathbb{N}$ ,

	$\displaystyle\mathbb{P}(W=k)$	$\displaystyle=\frac{1}{k}\mathbb{P}\left(\sum_{j=1}^{k}X_{k,j}=k-1\right)$
(5.3)			$\displaystyle\leq\frac{1}{k}\left(\mathbb{P}\left(\sum_{j=1}^{k}\frac{{X^{a}}_{k,j}-\mu_{k}^{a}}{2^{8}(\log n+\log k)^{2}}\geq\frac{k(1-\mu_{k}^{a})-1}{2^{8}(\log n+\log k)^{2}}\right)+\mathbb{P}\left(\sum_{j=1}^{k}{X^{b}}_{k,j}>0\right)\right).$

Since $\mu_{k}^{a}\leq 1-\varepsilon$ , for $k\geq 2\varepsilon^{-1}$ we have that $2(1-\mu_{k}^{a})-\frac{2}{k}>2(1-\mu_{k}^{a})-\varepsilon>\varepsilon$ and hence $k(1-\mu_{k}^{a})-1>\frac{\varepsilon k}{2}$ . Thus we have

s:=\frac{k(1-\mu_{k}^{a})-1}{2^{8}(\log n+\log k)^{2}}>\frac{\varepsilon k}{2^{9}(\log n+\log k)^{2}}:=s^{\prime}.

Since the random variables $({X^{a}}_{k,j}-\mu_{k}^{a})/(2^{8}(\log n+\log k)^{2})$ are by construction independent random variables with mean zero and absolute value at most $1$ , we can apply a standard Chernoff bound [1, Theorem A.1.16] in (5.2) to obtain:

\mathbb{P}\left(\sum_{j=1}^{k}\frac{{X^{a}}_{k,j}-\mu_{1}^{a}}{2^{8}(\log n)^{2}}\geq s\right)\leq e^{-\frac{(s^{\prime})^{2}}{2k}}=e^{-\frac{\varepsilon^{2}k}{2^{19}(\log n+\log k)^{4}}}.

Letting, $k_{0}=k_{0}(n)=\lceil 2^{26}\varepsilon^{-2}(\log n)^{5}\rceil$ we have that for $n$ sufficiently large, all $k\geq k_{0}(n)$ satisfy $k\geq 5\varepsilon^{-2}\cdot 2^{19}(\log{n}+\log{k})^{5}$ . In particular for such $k$ we have

(5.4)

\displaystyle\mathbb{P}\left(\sum_{j=1}^{k}\frac{{X^{a}}_{k,j}-\mu_{k}^{a}}{2^{8}(\log n)^{2}}\geq s\right)\leq e^{-\frac{\epsilon^{2}k}{2^{19}\left(\log n+\log k\right)^{4}}}\leq e^{-5\left(\log n+\log k\right)}\leq n^{-5}k^{-5}.

Further, applying inequality (5.2) and a union bound, we get that

(5.5)

\displaystyle\mathbb{P}\left(\sum_{j=1}^{k}{X^{b}}_{k,j}>0\right)\leq k\mathbb{P}({X^{b}}_{k,1}>0)\leq n^{-5}k^{-5}.

Combining (5.2), (5.4) and (5.5), we finally have

\displaystyle\mathbb{P}(W\geq k_{0})

\displaystyle\leq\sum_{k=k_{0}}^{\infty}\mathbb{P}(W=k)\leq\sum_{k=k_{0}}^{\infty}\frac{1}{k}2n^{-5}k^{-5}=O\left(n^{-5}\right).

∎

We now use Lemma 5.6 to estimate the probability that our exploration ends with a large stop. The key is to view our exploration as a branching process of branching processes: we begin with a subcritical branching process, corresponding to step 1. in our exploration process. Call this the ancestral branching process. When this process becomes extinct, we run through step 3. of our exploration process, potentially adding new active pairs to our otherwise empty set of active pairs $A_{t}$ . For each of these new pairs, we start an independent child branching process. For each of these we repeat the same procedure as for the ancestral branching process (so the child processes can generate their own child processes, and so forth). Thus to bound the total number of pairs discovered over the entire course of our exploration, we must control the growth of this branching process of branching processes.

Lemma 5.7.

The probability that the exploration from $v_{1}v_{2}$ terminates with a large stop is $O(n^{-3})$ .

Proof.

We view our exploration process as a kind of branching process of branching processes, with parent processes begetting children processes as described at the start of this section. Beginning from the single pair $v_{1}v_{2}$ , the exploration of its component in $\boxtimes(\Gamma)$ undertaken at time-steps $t$ when 1. applies is dominated by a subcritical branching process $\mathbf{W}$ as in the statement of Lemma 5.6. When that process terminates, we have discovered a certain set $D_{t_{0}}$ of vertices of $\Gamma$ . If 3. applies, then we add a certain number of vertices to $D_{t_{0}}$ to form $D_{t_{0}+1}$ and then add a subset of the pairs from $D_{t_{0}+1}^{(2)}$ to form $A_{t_{0}+1}$ . We may view each of the pairs $a_{i}$ added to $A_{t_{0}+1}$ at this time as the root of a subcritical independent branching process $\mathbf{W}_{i}$ . There are at most $|D_{t_{0}+1}|^{2}$ of these “child-processes”, and they are stochastically dominated by independent copies $\mathbf{W}_{i}$ of the subcritical branching process $\mathbf{W}$ from Lemma 5.6. When all these branching processes have become extinct, we now have a (larger) set $D_{t_{1}}$ of discovered vertices and we may be back at a time step where 3. applies. We repeat our procedure — adding vertices to form $D_{t^{\prime}+1}$ , adding new pairs to form $A_{t_{1}+1}$ , etc. The whole procedure can begin again at most $5$ times (for otherwise an exceptional stop must have occurred).

We run our exploration ignoring large stops and only applying 1. and 3. until the process terminates (with an exceptional stop or an extinction stop), and show that if an exceptional stop does not occur then with probability $O(n^{-3})$ the final size of the discovered set of vertices is at most $2^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}}$ .

Since we never can consider more than $\binom{n}{2}$ different active pairs, it follows that we never start more than $n^{2}$ branching processes $\mathbf{W}_{i}$ in the course of our exploration. By Lemma 5.6, and a union bound, the probability that one of our at most $n^{2}$ branching processes $\mathbf{W}_{i}$ has a total progeny of more than $k_{0}=2^{26}\varepsilon^{-2}(\log n)^{5}$ is $O(n^{-3})$ . Assume from now on this does not happen. Since the progeny of our processes correspond to pairs $xy\in V(\boxtimes(\Gamma))=V(\Gamma)^{(2)}$ , none of our processes can add more than $2k_{0}$ vertices to the set of discovered vertices $D_{t}$ .

Further, by Corollary 5.5, the probability that there is any pair $xy\in V(\boxtimes(\Gamma))=V(\Gamma)^{(2)}$ such that $|\Gamma_{x}\cap\Gamma_{y}|\geq 9\log n$ is at most $n^{-3}$ . Assume from now on this does not happen. Then in the first time-step $t_{0}$ where 3. applies, we can bound the number of vertices added to $D_{t_{0}}$ : each pair $xy$ can contribute at most $9\log n$ vertices to a $Z^{1}_{t}$ or $Z^{2}_{t}$ , and as we do not have an exceptional stop we can repeat the addition procedure at most $5$ times. In particular we have

	$\displaystyle\|D_{t_{0}+1}\|$	$\displaystyle\leq\left(\left(\left(\left(\left(\left(\|D_{t_{0}}\|\right)^{2}9\log n\right)^{2}9\log n\right)^{2}9\log n)\right)^{2}9\log n\right)^{2}9\log n\right)^{2}9\log n$
		$\displaystyle\leq\left(\|D_{t_{0}}\|9\left(\log n\right)\right)^{2^{6}}\leq\left(18(\log n)k_{0}\right)^{2^{6}}\ .$

The number of child processes started at that time step is at most $|D_{t_{0}+1}|^{2}$ ; by our assumption, each of these discovers at most $2k_{0}$ vertices in total, so that by the next time-step $t_{1}$ when 3. applies, we have

|D_{t_{1}}|\leq|D_{t_{0}+1}|^{2}2k_{0}\leq\left(18(\log n)k_{0}\right)^{2^{7}}2k_{0}:=2k_{1}.

Repeating the analysis above, we obtain

|D_{t_{1}+1}|\leq\left(18(\log n)k_{1}\right)^{2^{6}},

and in the next time-step $t_{2}$ where 3. applies we have

|D_{t_{2}}|\leq|D_{t_{1}+1}|^{2}2k_{0}\leq\left(18(\log n)k_{1}\right)^{2^{7}}2k_{0}=:2k_{2}.

and we can keep going in this way, defining $k_{3}$ , $k_{4}$ mutatis mutandis. If we avoid an exceptional stop, then we must terminate by the fifth time-step $t_{4}$ when 3. applies. Iterating our analysis, we see that the size of the final set of discovered vertices $D_{t_{4}}$ is as most

	$\displaystyle\|D_{t_{4}}\|$	$\displaystyle\leq\left(18(\log n)k_{3}\right)^{2^{7}}2k_{0}$
		$\displaystyle=\left(9(\log n)\left(9(\log n)\left(9(\log n)\left(18(\log n)k_{0}\right)^{2^{7}}2k_{0}\right)^{2^{7}}2k_{0}\right)^{2^{7}}2k_{0}\right)^{2^{7}}2k_{0}$
		$\displaystyle<\left(18(\log n)k_{0}\right)^{2^{29}}<2^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}}.$

This shows that the probability our process terminates with a large stop is $O(n^{-3})$ . ∎

Finally, we compute the probability that an exploration ends with an exceptional stop.

Lemma 5.8.

The probability that the exploration from $v_{1}v_{2}$ terminates with an exceptional stop is $O((\log n)^{30\cdot 2^{31}}n^{-5/2})$ .

Proof.

Suppose that the exploration from $v_{1}v_{2}$ terminates at time $T$ with an exceptional stop. Then we must have discovered at least $5$ exceptional edges at time-steps $t\leq T$ when 3. applied, where we call an edge exceptional if it appeared in $E(\Gamma[D_{t}])\setminus E_{t}$ (type 1) or as the third or above edge from some vertex $z\in Z_{t}^{1}$ to $D_{t}$ (type 2), and where such edges are ordered according to the ordering of the vertices of $D_{t}$ .

Since the exploration did not terminate with a large stop, at each of these time steps we had $|D_{t}|\leq 2^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}}:=\Delta$ at the start of each time-step $t$ . Also, since we did not terminate with a large stop, the time $T$ at which the process terminated must satisfy $T\leq\Delta^{2}$ (this is an upper bound on the number of pairs we could have tested at time steps where 1. or 3. applied).

In any time-step $t$ where 3. applies and we are testing for membership in one of the (at most five) sets $Z^{1}_{t}$ considered in that turn, the probability that a vertex in $V(\Gamma)\setminus D_{t}$ sends at least three edges of $\Gamma$ to the set $D_{t}$ is at most

\sum_{i=3}^{|D_{t}|}\binom{|D_{t}|}{i}p^{i}(1-p)^{|D_{t}|-i}\leq\sum_{i=3}^{\Delta}\Delta^{i}p^{i}=O\left(\Delta^{4}p^{3}\right).

Since we have at most $n$ vertices in $V(\Gamma)\setminus D_{t}$ and at most $T\leq\Delta^{2}$ time-steps to choose from, the probability of having found at least $j$ type 2 exceptional edges for some $1\leq j\leq 5$ is

(5.6)

\displaystyle O\Bigl{(}T^{j}\left(n\Delta^{4}p^{3}\right)^{j}\Bigr{)}=O\left(\Delta^{6j}n^{-j/2}\right).

A similar (but simpler) calculation yields that the probability of having found $5-j$ edges of type 1 is:

(5.7)

\displaystyle O\Bigl{(}T^{5-j}{\left(\Delta^{2}p\right)}^{5-j}\Bigr{)}=O\left(\Delta^{4(5-j)}n^{-(5-j)/2}\right).

Adding the bounds (5.6) and (5.7) together and substituting in the value of $\Delta$ , the Lemma follows. ∎

We are now in a position to prove Theorem 5.1, which, as previously noted, immediately implies Theorem 1.4.

Proof of Theorem 5.1.

Let $v_{1}v_{2}$ be an arbitrary pair of vertices from $V(\Gamma)$ . By Lemmas 5.7 and 5.8, with probability $1-O((\log n)^{30\cdot 2^{31}}n^{-5/2})=1-o(n^{-2})$ the exploration from $v_{1}v_{2}$ terminates with an extinction stop. By Lemma 5.2 we obtain a bound on the size of each component of $v_{1}v_{2}$ in $\boxtimes(\Gamma)$ found by this exploration, and a bound on its support as well. By a simple union bound, with probability $1-o(1)$ , all pairs $v_{1}v_{2}$ lie in components of $\boxtimes(\Gamma)$ supported on sets of size at most $2^{2^{10}}\varepsilon^{-2^{10}}(\log n)^{2^{31}}$ in $V(\Gamma)$ . ∎

6. The supercritical regime: proof of Theorem 1.5

Fix $\lambda>\lambda_{c}$ . Suppose $\lambda n^{-1/2}\leq p(n)\leq(1-f(n))$ , where $f(n)$ is a function with $f(n)=o(1)$ and $f(n)=\omega(n^{-2})$ . Let $\Gamma\in{\mathcal{G}}(n,p)$ . By [5, Theorem 5.1], we know that if $p(n)\geq 5\sqrt{\log{n}/n}$ , then a.a.s. there is a square-component covering all of $V(\Gamma)$ . We may thus restrict our attention in the proof of Theorem 1.5 to the range $\lambda n^{-1/2}\leq p(n)\leq 5\sqrt{\log{n}/n}$ . For such $p(n)$ , there exists $\varepsilon>0$ such that if $Z\sim\mathrm{Binom}(n,p^{2})$ , then

(6.1)

\displaystyle\mathbb{E}\left(\frac{Z^{2}+3Z}{2}\right)\geq 1+\varepsilon.

We shall prove the existence of a giant square-component with full support in four stages: first, we define an exploration process in $\square(\Gamma)$ . In a second stage, we analyse the process to show that a.a.s. a large proportion of non-edges of $\Gamma$ lie in “somewhat large” square-components of $\square(\Gamma)$ . Next, in the (more involved) third stage of the argument, we perform vertex-sprinkling to show a.a.s. a large proportion of non-edges of $\Gamma$ lie in a giant square-component. Finally, we show there a.a.s. a giant square-component covering all of $V(\Gamma)$ .

6.1. An exploration process

We consider an exploration process consisting of the following data at each time $t\geq 0$ :

•

(Discovered vertices.) An ordered set of vertices: $D_{t}=\{v_{1},v_{2},\ldots,v_{d_{t}}\}\subseteq V(\Gamma)$ .
•

(Active pairs.) A set of pairs of vertices (ordered lexicographically with respect to the ordering on $D_{t}$ ): $A_{t}=\{x_{1}y_{1},x_{2}y_{2},\ldots,x_{a_{t}}y_{a_{t}}\}\subseteq D_{t}^{(2)}\setminus E(\Gamma)$ .
•

(Reached pairs.) A set of pairs of vertices: $R_{t}\subseteq D_{t}^{(2)}\setminus E(\Gamma)$ .

These sets will satisfy:

( $\star$ )

for every active pair $x_{i}y_{i}\in A_{t}$ , the vertices $x_{i}$ and $y_{i}$ have at least $2$ common neighbors in the subgraph of $\Gamma$ induced by $D_{t}$ .

The initial state $t=0$ of the exploration consists of an arbitrary induced $C_{4}$ of $\Gamma$ , denoted $v_{1}v_{2}v_{3}v_{4}$ , and the sets: $D_{0}=\{v_{1},v_{2},v_{3},v_{4}\}$ , $A_{0}=\{v_{1}v_{3},v_{2}v_{4}\}$ , and $R_{0}=\emptyset$ .

Our exploration then proceeds as follows:
1. If $|R_{t}|+|A_{t}|>(\log n)^{4}$ , then we terminate the process.
2. If $|R_{t}|+|A_{t}|\leq(\log n)^{4}$ and $A_{t}\neq\emptyset$ , then for each $z\in V(\Gamma)\setminus D_{t}$ we test whether or not $z$ sends an edge in $\Gamma$ to both of $\{x_{1},y_{1}\}$ which are the vertices of the first pair $a_{1}=x_{1}y_{1}$ in the ordered set $A_{t}$ . We then set $Z_{t}:=\{z\in V(\Gamma)\setminus D_{t}:\ zx_{1},zy_{1}\in E(\Gamma_{t})\}$ . Denote by $F_{t}$ the set of common neighbors of $x_{1}$ and $y_{1}$ in $D_{t}$ (which by property $(\star)$ has size at least $2$ ).

We then set $A_{t+1}=\left(A_{t}\setminus\{x_{1}y_{1}\}\right)\cup\left(\left(F_{t}\cup Z_{t}\right)^{(2)}\setminus\left(F_{t}^{(2)}\cup E(\Gamma)\right)\right)$ , $R_{t+1}=R_{t}\cup\{x_{1}y_{1}\}$ and $D_{t+1}=D_{t}\cup Z_{t}$ . We then proceed to the next time-step in the exploration process, noting that property $(\star)$ is maintained.
3. If $|R_{t}|\leq(\log n)^{4}$ and $A_{t}=\emptyset$ , then we terminate the process.

6.2. Many non-edges in somewhat large components

The exploration process defined in the previous subsection can terminate for one of two reasons:

(1)

(Large stop.) $|R_{t}|+|A_{t}|>(\log n)^{4}$ , or
(2)

(Extinction stop.) $A_{t}=\emptyset$ .

The process always terminates after some number $T\leq\left(\log n\right)^{4}$ of time-steps. By construction, at all times $t\geq 0$ the collection of pairs $A_{t}\cup R_{t}$ is a subset of the square-component of $v_{1}v_{3}$ and $v_{2}v_{4}$ in $\square(\Gamma)$ . Our aim is to show that with somewhat large probability $|A_{T}|+|R_{T}|>(\log n)^{4}$ .

Lemma 6.1.

At any time-step $t\geq 0$ , the distribution conditional on the past history of the process of the random variable $X_{t}=|A_{t+1}\setminus A_{t}|$ counting the number of new active pairs discovered by $a_{1}=x_{1}y_{1}$ stochastically dominates a random variable $X^{\prime}$ with mean $\mathbb{E}(X^{\prime})\geq 1+\varepsilon+o(1)$ .

Proof.

Write $E_{t}$ for the set of edges discovered by the process. As observed by Bollobás and Riordan [7, Inequality (3)], the past of the process is the intersection of the principal increasing event $\mathcal{U}=\{E_{t}\subseteq E(\Gamma)\}$ and a decreasing event $\mathcal{D}$ (corresponding to the intersection of a number of events of the form “at least one of $zx,zy$ is not in $\Gamma$ ” for some previously tested pair $xy$ ). In particular, for $xy\in A_{t}$ and $z\in V(\Gamma)\setminus D_{t}$ ,

\mathbb{P}\left(xz,yz\in E(\Gamma)|\mathcal{D}\cap\mathcal{U}\right)=\mathbb{P}\left(xz,yz\in E(\Gamma)|\mathcal{D}\right).

Let $\mathcal{D}^{\prime}(z)$ denote the decreasing event

\mathcal{D}^{\prime}(z)=\bigcap_{x^{\prime}y^{\prime}\in D_{t}^{(2)}\setminus\{xy\}}\{\textrm{at least one of $zx^{\prime},zy^{\prime}$ is not in $\Gamma$}\}.

Note that the event $\{xz,yz\in E(\Gamma)\}$ is independent of $\mathcal{D}\setminus\mathcal{D}^{\prime}(z)$ . Using this fact and appealing to Harris’s Lemma [13], we have

\mathbb{P}\left(xz,yz\in E(\Gamma)|\mathcal{D}\right)=\mathbb{P}\left(xz,yz\in E(\Gamma)|\mathcal{D}\cap\mathcal{D}^{\prime}(z)\right)\geq\mathbb{P}\left(xz,yz\in E(\Gamma)|\mathcal{D}^{\prime}(z)\right).

Now conditional on $\mathcal{D}^{\prime}$ , the probability that both $xz$ and $yz$ are in $E(\Gamma)$ is readily computed: it is equal to

\displaystyle\frac{p^{2}(1-p)^{|D_{t}|-2}}{p^{2}(1-p)^{|D_{t}|-2}+2p(1-p)(1-p)^{|D_{t}|-2}+(1-p)^{2}\left((1-p)^{|D_{t}|-2}+(|D_{t}|-2)p(1-p)^{|D_{t}|-3}\right)}

which is equal to $p^{2}\left(1+O(|D_{t}|p)\right)=p^{2}+o(p^{2})$ (since $|D_{t}|\leq 2\left(|R_{t}|+|A_{t}|\right)\leq 2(\log n)^{4}$ ).

The indicator function of the event $Y_{z}=\{z\in Z_{t}\}$ thus stochastically dominates a Bernoulli random variable with mean $p^{2}+o(p^{2})$ . Further the $(Y_{z})_{z\in V(\Gamma)\setminus D_{t}}$ are independent events given our conditioning (since each such event is only affected by the state of edges from $D_{t}$ to $z$ ). The random variable $|Z_{t}|$ thus stochastically dominates a random variable $Z^{\prime}\sim\mathrm{Binom}(n-|D_{t}|,p^{2}+o(p^{2}))$ . Let $X^{\prime}$ denote the sum of $\frac{(Z^{\prime})^{2}+3Z^{\prime}}{2}$ independent Bernoulli random variables with parameter $1-p$ . For $|D_{t}|\leq 2(\log n)^{4}$ , we have

	$\displaystyle\mathbb{E}\|A_{t+1}\setminus A_{t}\|$	$\displaystyle=\mathbb{E}\left(\frac{\|Z_{t}\|^{2}+3\|Z_{t}\|}{2}-\|E(\Gamma)\cap(F_{t}\cup Z_{t})^{(2)}\|\right)$
		$\displaystyle\geq\mathbb{E}X^{\prime}=(1-p)\mathbb{E}\left(\frac{\|Z^{\prime}\|^{2}+3\|Z^{\prime}\|}{2}\right)\geq 1+\varepsilon+o(1),$

where the inequality follows from the stochastic domination of $Z^{\prime}$ by $Z_{t}$ , and where in the last line we have used (6.1) and the fact that a binomial distribution with parameters $n-|D_{t}|$ and $p^{2}+o(p^{2})$ is close to $\mathrm{Binom}(n,p^{2})$ . ∎

Let $\theta_{e}=\theta_{e}(n,p)$ denote the extinction probability of the supercritical branching process $\mathbf{W}$ with the offspring distribution $X^{\prime}$ given in the proof of Lemma 6.1. Note that by Proposition 4.2(b), $\theta_{e}$ is bounded away from $1$ .

Up to the time when it terminates, our exploration process on the square-component of $v_{1}v_{2}v_{3}v_{4}$ stochastically dominates $\mathbf{W}$ . We now use this fact to show many non-edges of $\Gamma$ lie in “somewhat large” square-components.

Lemma 6.2 (Many squares in large square-components).

Fix $\lambda>\lambda_{c}$ , $p(n)$ satisfying $\lambda n^{-1/2}\leq p(n)\leq 5n^{-1/2}\sqrt{\log n}$ , and $\theta_{e}=\theta_{e}(n,p)$ as above. Then, with probability $1-o(n^{-2})$ the number $N$ of induced $C_{4}$ s in $\Gamma$ which are part of square-components of order at least $(\log n)^{4}$ satisfies

N=(1+o(1))\mathbb{E}N\geq 3p^{4}(1-p)^{2}\binom{n}{4}(1-\theta_{e})(1+o(1)).

Proof.

Given a collection of $4$ vertices $S\in V(\Gamma)^{(4)}$ , let $E_{S}$ be the indicator function of the event that $\Gamma[S]\cong C_{4}$ and that our exploration process from $S$ terminates with a large stop (which is equivalent to $S$ being part of a square-component of order at least $(\log n)^{4}$ ). Conditional on $\Gamma[S]\cong C_{4}$ , Lemma 6.1 implies that $\mathbb{P}(E_{S}=1)\geq 1-\theta_{e}$ (which is the probability that the branching process $\mathbf{W}$ does not become extinct). Applying Wald’s identity, the expectation $\mu_{N}$ of $N$ thus satisfies

	$\displaystyle\mu_{N}=\mathbb{E}N$	$\displaystyle=\mathbb{E}\sum_{S\in V(\Gamma)^{(4)}}E_{S}=\sum_{S\in V(\Gamma)^{(4)}}\mathbb{P}(\Gamma[S]\cong C_{4})\mathbb{P}\left(E_{S}=1\|\Gamma[S]\cong C_{4}\right)$
(6.2)			$\displaystyle\geq 3p^{4}(1-p)^{2}\binom{n}{4}(1-\theta_{e}).$

We now use Chebyshev’s inequality to show $N$ is concentrated around its mean. To do this, we must bound $\mathbb{E}N^{2}=\sum_{S,S^{\prime}\in V(\Gamma)^{(4)}}\mathbb{E}E_{S}E_{S^{\prime}}$ . Consider two collections of $4$ vertices $S,S^{\prime}\in V(\Gamma)^{(4)}$ .

Claim 1.

If $S\cap S^{\prime}=\emptyset$ , then $E_{S}$ and $E_{S^{\prime}}$ satisfy

\mathbb{E}\left(E_{S}E_{S^{\prime}}\right)=\mathbb{E}\left(E_{S}\right)\mathbb{E}\left(E_{S^{\prime}}\right)+O\left((\log n)^{9}n^{-1}\mathbb{E}(E_{S})\right).

Proof.

Our claim is that $E_{S}$ and $E_{S^{\prime}}$ are essentially independent. Indeed, let us first perform our exploration process from $S$ (stopping immediately if $\Gamma[S]$ does not induce a copy of $C_{4}$ ). For $Z\sim\mathrm{Binom}(n,p^{2})$ (and $\lambda n^{-1/2}\leq p\leq 5n^{-1/2}(\log)^{-1/2}$ as everywhere in this section), we have

(6.3)

\displaystyle\mathbb{P}(Z\geq 2^{7}\log n)=\sum_{r=\lceil 2^{7}\log n\rceil}^{n}\binom{n}{r}p^{2r}(1-p^{2})^{n-r}\leq\sum_{r\geq 2^{7}\log n}\left(\frac{en}{r}\cdot 25\frac{\log n}{n}\right)^{r}=o(n^{-5}).

Thus with probability $1-o(n^{-3})$ , the number of vertices added to $D_{t}$ in the last stage of the exploration process from $S$ is at most $2^{7}\log n$ , implying that the set $D_{S}$ of vertices discovered by the process from $S$ has size at most $2(\log n)^{4}+2^{7}\log n$ . Further, the exploration process from $S$ tests at most $\left(\log n\right)^{4}$ pairs in total. This allows us to bound the probability that the exploration process from $S^{\prime}$ interacts with the exploration process from $S$ .

First of all, by Markov’s inequality the probability that a vertex in $S^{\prime}$ is discovered by the process from $S$ is at most

4p^{2}\left(\log n\right)^{4}=O\left((\log n)^{5}n^{-1}\right).

Secondly, the exploration process from $S^{\prime}$ tests at most $(\log n)^{4}$ pairs $xy$ . By Markov’s inequality again, the probability that some $z\in D_{S}$ sends an edge to both vertices in such a pair is at most

(\log n)^{4}|D_{S}|p^{2}=O\left((\log)^{9}n^{-1}\right).

In particular, the probability of $E_{S^{\prime}}=1$ given $E_{S}=1$ differs from $\mathbb{E}E_{S^{\prime}}$ by at most $O((\log n)^{9}n^{-1}$ , as claimed. ∎

Claim 2.

For any $S\in V(\Gamma)^{(4)}$ , we have

\displaystyle\sum_{S^{\prime}\in V(\Gamma)^{(4)}:\ S\cap S^{\prime}\neq\emptyset}\mathbb{E}\left(E_{S}E_{S^{\prime}}\right)=O\left(n^{3}p^{4}\mathbb{E}(E_{S})\right).

\displaystyle(\dagger)

Proof.

Fix $S$ and consider the various ways in which $S$ and $S^{\prime}$ could intersect non-trivially.

•

There is one choice of $S^{\prime}$ with $S=S^{\prime}$ , for which we have $\mathbb{E}(E_{S}E_{S^{\prime}})=\mathbb{E}(S)$ .
•

Next, we have at most $4n$ choices of $S^{\prime}$ with $|S\cap S^{\prime}|=3$ . Write $S=\{a,b,c,d\}$ and $S^{\prime}=\{a,b,c,d^{\prime}\}$ . For $E_{S}E_{S^{\prime}}$ to be non-zero, both $S$ and $S^{\prime}$ must induce copies of $C_{4}$ in $\Gamma$ and moreover $E_{S}$ must occur. This is only possible if $E_{S}=1$ and $d^{\prime}$ sends edges to the two neighbors of $d$ in $\{a,b,c\}$ . Arguing as in Claim 1, these two events are almost independent and occur with probability $(1+o(1))p^{2}\mathbb{E}E_{S}$ . Thus the contribution of $S^{\prime}$ with $|S\cap S^{\prime}|=3$ to the left-hand side of $(\dagger)$ is at most $O(np^{2}\mathbb{E}(E_{S}))$ .
•

There are at most $4n^{3}$ choices of $S^{\prime}$ with $|S\cap S^{\prime}|=1$ . For $E_{S}E_{S^{\prime}}$ to be non-zero, it is necessary for $S^{\prime}$ to induce a copy of $C_{4}$ in $\Gamma$ and for $E_{S}=1$ . Arguing as in Claim 1, these two events are almost independent and occur with probability $(1+o(1))p^{4}\mathbb{E}E_{S}$ . Thus the contribution of $S^{\prime}$ with $|S\cap S^{\prime}|=1$ to the left-hand side of $(\dagger)$ is at most $O(n^{3}p^{4}\mathbb{E}(E_{S}))$ .
•

Finally, there are at most $6n^{2}$ choices of $S$ with $|S\cap S^{\prime}|=2$ . For $E_{S}E_{S^{\prime}}$ to be non-zero, it is necessary for the vertices in $S^{\prime}\setminus S$ to be incident at least three edges in $\Gamma[S^{\prime}]$ and for $E_{S}=1$ . Arguing as in Claim 1, these two events are almost independent and occur with probability at most $(1+o(1))p^{3}\mathbb{E}E_{S}$ . Thus the contribution of $S^{\prime}$ with $|S\cap S^{\prime}|=2$ to the left-hand side of $(\dagger)$ is at most $O(n^{2}p^{3}\mathbb{E}(E_{S}))$ .

Since $np=\omega(1)$ , we have $O(1+np^{2}+n^{2}p^{3}+n^{3}p^{4})=O(n^{3}p^{4})$ , and the analysis above shows the left-hand side of $(\dagger)$ is at most $O(n^{2}p^{3}\mathbb{E}(E_{S}))$ , as claimed. ∎

Together, Claims 1 and 2 imply $\mathbb{E}N^{2}\leq\left(\mathbb{E}N\right)^{2}+O\left(n^{3}p^{4}\mathbb{E}N\right)$ . By inequality 6.2, we know $\mu_{N}=\mathbb{E}N=\Omega(n^{4}p^{4})$ . Since $p=\Omega(n^{-1/2})$ , it follows that

\mathrm{Var}(N)=O\left(n^{2}p^{3}\mathbb{E}(N)\right)=O\left(\frac{\mu_{N}}{n}\right)=O\left(\frac{(\mu_{N})^{2}}{n^{3}}\right).

Applying Chebyshev’s inequality yields that with probability at least $1-o(n^{-2})$ ,

N=(1-o(1))\mu_{N}\geq 3p^{4}(1-p)^{2}\binom{n}{4}(1-\theta_{e})(1-o(1)),

as desired. ∎

Corollary 6.3.

Let $\lambda>\lambda_{c}$ be fixed, and let $p=p(n)$ be an edge-probability satisfying $\lambda n^{-1/2}\leq p\leq 5n^{-1/2}\sqrt{\log n}$ . Then for all $\varepsilon_{1}>0$ sufficiently small, there exist a constant $\varepsilon_{2}>0$ such that if $\Gamma_{1}\in{\mathcal{G}}((1-\varepsilon_{1})n,p(n))$ , then with probability $1-O(n^{-1})$ the number $N_{v}$ of non-edges of $\Gamma_{1}$ that lie in square-components of $\square(\Gamma_{1})$ of order at least $(\log n)^{4}$ , satisfies

N_{v}=(1+o(1))\mathbb{E}(N_{v})\geq\varepsilon_{2}n^{2}.

Proof.

Let $\lambda^{\prime}=(\lambda+\lambda_{c})/2$ . For $\varepsilon_{1}$ sufficiently small, we have $p(n)\geq\lambda^{\prime}(n(1-\varepsilon_{1}))^{-1/2}$ . We now consider $\Gamma_{1}\in{\mathcal{G}}(n(1-\varepsilon_{1}),p)$ .

Ideally, we would now like to directly apply Lemma 6.2 in $\Gamma_{1}$ . However, to ensure the stochastic domination in Lemma 6.1, we started our exploration process from an induced $C_{4}$ rather than a non-edge — so we know that $\Omega(n^{4}p^{4})$ induced $C_{4}$ s are part of square-components of order at least $(\log n)^{4}(1+o(1))$ whereas we want to show $\Omega(n^{2})$ non-edges lie in such components. Since some non-edges could have as many as $\Omega(\log n)$ common neighbors in $\Gamma_{1}$ , it would in principle be possible for $p$ of order $n^{-1/2}$ that, for example, the collection of the diagonals of the induced $C_{4}$ s contained in such “large” components consists of a set of only $O(n^{2}/(\log n)^{2})$ non-edges. We must thus rule out situation.

The simplest way to do this is to run through our proof of Lemma 6.2 again, but this time for the variant of our exploration process from Section 6.1 where we begin with an arbitrary non-edge $v_{1}v_{2}$ of $\Gamma_{1}$ , set $D_{0}=\{v_{1},v_{2}\}$ , $A_{0}=\{v_{1}v_{2}\}$ and $R_{0}=\emptyset$ . We say such an exploration survives infancy if at the first time-step the pair $v_{1}v_{2}$ discovers a set $Z_{1}$ of joint neighbors that spans at least one non-edge $v_{3}v_{4}$ of $\Gamma$ .

For $p$ in the range we are considering the random graph $\Gamma$ a.a.s. does not contain a complete graph on $6$ vertices, and we can use this to give a constant order lower bound on $\theta_{S}$ , the probability the process survives infancy:

	$\displaystyle\theta_{S}$	$\displaystyle\geq\mathbb{P}\left(\|\Gamma_{v_{1}}\cap\Gamma_{v_{2}}\|\geq 6\ \|v_{1}v_{2}\notin E(\Gamma)\right)-\mathbb{P}\left(\Gamma\textrm{ contain a clique on $6$ vertices}\right)$
		$\displaystyle\geq\binom{n-2}{6}p^{12}(1-p^{2})^{n-8}-\binom{n}{6}p^{\binom{6}{2}}=\Theta((np^{2})^{6})=\Theta(1).$

Conditional on surviving infancy, by Lemma 6.1 the exploration process from $v_{3}v_{4}$ stochastically dominates a supercritical branching process $\mathbf{W}$ with extinction probability $\theta_{e}=\theta_{e}((1-\varepsilon_{1})n,p)$ . Applying Wald’s identity, this implies that the number $N_{v}$ of non-edges of $\Gamma_{1}$ that belong to square-components of order at least $(\log n)^{4}$ satisfies

\mathbb{E}(N_{v})\geq\binom{(1-\varepsilon_{1})n}{2}(1-p)\theta_{S}(1-\theta_{e})=\Omega(n^{2}).

We now bound $\mathbb{E}(N_{V})^{2}$ much as we did in Lemma 6.2. Given a pair $xy\in V(\Gamma)^{(2)}$ , write $E_{xy}$ for the event that $xy$ is a non-edge and that our exploration process from $xy$ terminates with a large stop. Claim 1 from the proof of Lemma 6.2 shows mutatis mutandis that if $\{x,y\}\cap\{x^{\prime},y^{\prime}\}=\emptyset$ then

\mathbb{E}\left(E_{xy}E_{x^{\prime}y^{\prime}}\right)=\mathbb{E}(E_{xy})\mathbb{E}(E_{x^{\prime}y^{\prime}})+O\left((\log n)^{9}n^{-1}\right).

For non-disjoint pairs $\{x,y\}$ and $\{x^{\prime},y^{\prime}\}$ , the situation is actually easier than it was in Claim 2: such pairs contribute at most $2n\mathbb{E}N_{v}$ to $\mathbb{E}\left((N_{v})^{2}\right)$ . Thus

\mathrm{Var}(N_{v})=O\left(n\mathbb{E}(N_{v})\right)=O\left(n^{-1}(\mathbb{E}(N_{v}))^{2}\right),

and we conclude that with probability $1-O(n^{-1})$ there are $(1+o(1))\mathbb{E}(N_{v})=\Omega(n^{2})$ non-edges contained in square-components of order at least $(\log n)^{4}$ . The Corollary then follows from a suitable choice of the constant $\varepsilon_{2}$ . ∎

6.3. A connecting lemma

The key to our sprinkling argument is the following, which we use to connect the somewhat large square-components into even larger square-components. We connect square-components by sprinkling in vertices, and looking for complete bipartite graphs with bipartition $\{x_{1},x_{2},y_{1},y_{2}\}\sqcup\{z_{1},z_{2}\}$ , where $x_{1}x_{2}$ , $y_{1}y_{2}$ are non-edges in distinct square-components, and $z_{1}z_{2}$ is a non-edge inside the set of newly sprinkled vertices — see Figure 3 below.

Recall that a $p$ -random bipartite graph with partition $V\sqcup W$ is a graph on the vertex set $V\sqcup W$ obtained by including each pair $\{v,w\}$ with $v\in V,w\in W$ as an edge independently at random with probability $p$ .

Lemma 6.4 (Connecting Lemma).

Let $\lambda>\lambda_{c}$ , $\delta\in(0,\frac{1}{2})$ and $\varepsilon_{1},\varepsilon_{2}>0$ be fixed. Let $V$ be a set of $(1-\delta)n$ vertices, and $W$ be a set of $\frac{\varepsilon_{1}n}{2\log_{2}n}$ vertices disjoint from $V$ . Suppose we are given disjoint subsets $C_{1},C_{2},\ldots C_{r}$ of $V^{(2)}$ and a subset $S\subseteq W^{(2)}$ with the following properties:

(1)

$|S|\geq\frac{(\varepsilon_{1})^{2}n^{2}}{8(\log_{2}n)^{2}}$ ;
(2)

$|C_{i}|\geq M$ for every $i$ : $1\leq i\leq r$ , and some $M$ satisfying: $(\log n)^{4}\leq M\leq\frac{\varepsilon_{2}}{4}n^{2}$ ;
(3)

$\sum_{i}|C_{i}|\geq\varepsilon_{2}n^{2}$ .

Let $p=p(n)$ be an edge probability with

\lambda\frac{1}{\sqrt{n}}<p(n)<5\frac{\sqrt{\log n}}{\sqrt{n}}.

Consider the $p$ -random bipartite graph $B_{p}(V,W)$ with bipartition $V\sqcup W$ . Let Boost be the event that for every $C_{i}$ with $|C_{i}|\leq 2M$ there exists $C_{j}\neq C_{i}$ and a triple $(x_{1}x_{2},y_{1}y_{2},z_{1}z_{2})\in C_{i}\times C_{j}\times S$ such that the restriction $B_{p}(\{x_{1},x_{2},y_{1},y_{2}\},\{z_{1},z_{2}\})$ of $B_{p}(V,W)$ to $\{x_{1},x_{2},y_{1},y_{2}\}\sqcup\{z_{1},z_{2}\}$ is complete. Then for all $n$ sufficiently large we have

\mathbb{P}(\textrm{Boost})\geq 1-\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right).

The proof of the connecting lemma relies on a celebrated inequality of Janson and some careful book-keeping.

Proposition 6.5 (The extended Janson inequality [14]).

Let $U$ be a finite set and $U_{q}$ a $q$ -random subset of $U$ for some $q\in[0,1]$ . Let $\mathcal{F}$ be a family of subsets of $U$ , and for every $F\in\mathcal{F}$ let $I_{F}$ be the indicator function of the event $\{F\subseteq U_{q}\}$ . Set $I_{\mathcal{F}}=\sum_{F\in\mathcal{F}}I_{F}$ , and let $\mu=\mathbb{E}I_{\mathcal{F}}$ and $\Delta=\sum_{F,F^{\prime}\in\mathcal{F}:\ F\cap F\ \neq\emptyset}\mathbb{E}(I_{F}I_{F}^{\prime})$ . Then

\mathbb{P}\left(I_{\mathcal{F}}=0\right)\leq\exp\left(-\frac{\mu^{2}}{2\Delta}\right).

Proof of Lemma 6.4.

Fix $C_{i}$ with $M\leq|C_{i}|\leq 2M$ . Set $M^{\prime}=\min(2M,n)$ . Let $\mathcal{F}_{0}$ denote the collection of connecting triples $(x_{1}x_{2},y_{1}y_{2},z_{1}z_{2})\in C_{i}\times\bigcup_{j\neq i}C_{j}\times S$ . Further let

\mathcal{F}=\{\{x_{i}z_{j}:\ i,j\in[2]\}\cup\{y_{i}z_{j}:\ i,j\in[2]\}:\ (x_{1}x_{2},y_{1}y_{2},z_{1}z_{2})\in\mathcal{F}_{0}\}.

Observe that the elements of $\mathcal{F}$ are subsets of either $6$ or $8$ edges (depending on whether the pairs $x_{1}x_{2}$ and $y_{1}y_{2}$ overlap or not) of the complete bipartite graph $B(V,W)$ with bipartition $V\sqcup W$ . We shall apply Janson’s inequality to $\mathcal{F}$ to give an upper bound on the probability that $C_{i}$ does not connect to $\bigcup_{j\neq i}C_{j}$ via a pair of squares of $B_{p}(V,W)$ . To this end, we must compute and bound the $\mu$ and $\Delta$ parameters for $\mathcal{F}$ . The first of these is straightforward:

(6.4)

\displaystyle\mu:

\displaystyle=\mathbb{E}I_{\mathcal{F}}=\sum_{F\in\mathcal{F}}\mathbb{E}I_{F}\geq|C_{i}|.\bigl{|}\bigcup_{j\neq i}C_{j}\bigr{|}.|S|p^{8}\geq M\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{8}\frac{n^{4}}{(\log n)^{2}}p^{8}

To bound the $\Delta$ parameter, fix a connecting triple $t=(x_{1}x_{2},y_{1}y_{2},z_{1}z_{2})\in C_{i}\times\bigcup_{j\neq i}C_{j}\times S$ , and consider the contribution to $\Delta$ made by pairs $(t,t^{\prime})$ of connecting triples that share at least one edge of $B(V,W)$ ; call such pairs of connecting triples dependent.

Write $L(t)$ for the set $\{x_{1},x_{2},y_{1},y_{2}\}$ (which can have size either $4$ or $3$ — the latter if one of the $x_{i}$ is equal to one of the $y_{j}$ ) and $R(t)$ for the pair $\{z_{1},z_{2}\}$ . Also let $F_{t}\in\mathcal{F}$ be the collection of edges of $B(V,W)$ from $L(t)$ to $R(t)$ . Clearly if $L(t)\cap L(t^{\prime})=\emptyset$ or $R(t)\cap R(t^{\prime})=\emptyset$ , then $(t,t^{\prime})$ do not form a pair of dependent connecting triples.

Fix a connecting triple $t$ . For $(i,j)\in[4]\times[2]$ , let $D_{i,j}(t)$ denote the collection of connecting triples $t^{\prime}$ with $|L(t)\cap L(t^{\prime})|=i$ and $|R(t)\cap R(t^{\prime})|=j$ . Further let $D_{i,j}^{a}(t)$ and $D_{i,j}^{b}(t)$ denote the collection of $t^{\prime}$ in $D_{i,j}(t)$ with $|L(t)|=4$ and $|L(t^{\prime})|=3$ respectively. We shall bound the sizes of the sets $D_{i,j}^{a}(t)$ and $D_{i,j}^{b}(t)$ . Note to begin with that there are at most $\frac{2\varepsilon_{1}n}{\log n}$ ways of deleting a vertex in $R(t)$ and replacing it by a different vertex in $W$ . In particular, for all connecting triples $t$ and all $i\in[|L(t)|]$ , we have

\displaystyle|D_{i,1}^{a}(t)|\leq|D_{i,2}^{a}(t)|\cdot\frac{2\varepsilon_{1}n}{\log n}

and

\displaystyle|D_{i,1}^{b}(t)|\leq|D_{i,2}^{b}(t)|\cdot\frac{2\varepsilon_{1}n}{\log n}.

Thus we may focus on bounding the sizes of $D_{i,j}^{a}(t)$ and $D_{i,j}^{b}(t)$ in the case where $j=2$ .

Case 1, $|L(t)|=4$ :

•

there are at most $6$ ways of splitting $L(t)$ into a pair from $C_{i}$ and a pair from $\bigcup_{j\neq i}C_{j}$ , and at most $24$ ways of deleting a vertex from $L(t)$ and viewing the remaining $3$ vertices as the union of a pair from $C_{i}$ and an (overlapping) pair from $\bigcup_{j\neq i}C_{j}$ , whence $|D_{4,2}^{a}(t)|\leq 6$ and $|D_{3,2}^{b}(t)|\leq 24$ ;
•

there are at most $4n$ ways of deleting one vertex from $L(t)$ and replacing it by another vertex from $V$ . As noted above, there are most $6$ ways of splitting the resulting $4$ -set into a pair from $C_{i}$ and a pair from $\bigcup_{j\neq i}C_{j}$ , whence $|D_{3,2}^{a}(t)|\leq 24n$ ;
•

there are at most $6(n^{2}/2)=3n^{2}$ ways of deleting two vertices from $L(t)$ and replacing them by two other vertices from $V$ , whence (similarly to the above) we have $|D_{2,2}^{a}(t)|\leq 18n^{2}$ ; further, there are at most $6n$ ways of deleting a pair of vertices from $V$ and replacing them by a single vertex from $V$ , whence (similarly to the above, since there are most $6$ ways of viewing three vertices of a pair from $C_{i}$ and an (overlapping) pair from $\bigcup_{j\neq i}C_{j}$ ) we get $|D_{2,2}^{b}(t)|\leq 36n$ ;
•

since $C_{i}$ contains at most $2M$ pairs, there are at most $4(n^{2}/2)M^{\prime}=2n^{2}M^{\prime}$ ways of deleting three vertices in $L(t)$ and replacing them by another three vertices from $V$ in such a way that the resulting set can still be viewed as the union of a pair from $C_{i}$ and a pair from $\bigcup_{j\neq i}C_{j}$ , whence (similarly to the above) we have $|D_{1,2}^{a}(t)|\leq 12n^{2}M^{\prime}$ ; further and similarly there are at most $4M+4M^{\prime}n\leq 8M^{\prime}n$ ways of deleting three vertices in $L(t)$ and replacing them by a pair of vertices from $V$ , whence (as before) we have $|D_{1,2}^{b}(t)|\leq 48M^{\prime}n$ ;

Case 2, $|L(t)|=3$ :

•

there are at most $6$ ways of splitting $L(t)$ into a pair from from $C_{i}$ and a pair from $\bigcup_{j\neq i}C_{j}$ , whence $|D_{3,2}^{b}(t)|\leq 6$ ; further there are at most $6n$ ways of adding a vertex to $L(t)$ and splitting the resulting $4$ -set into two disjoint pairs, whence $|D_{3,2}^{a}(t)|\leq 6n$ ;
•

there are at most $3n$ ways of deleting one vertex from $L(t)$ and replacing it by another vertex from $V$ , whence (as above) $|D_{2,2}^{b}(t)|\leq 18n$ ; further, there are at most $3(n^{2}/2)$ ways of deleting one vertex from $L(t)$ and replacing it by a pair from $V$ , whence (as above) $|D_{2,2}^{a}(t)|\leq 9n^{2}$ ;
•

since $C_{i}$ contains at most $2M$ pairs, there are at most $3M^{\prime}n$ ways of deleting two vertices in $L(t)$ and replacing them by another two vertices from $V$ in such a way that the resulting set can still be viewed as the union of a pair from $C_{i}$ and a pair from $\bigcup_{j\neq i}C_{j}$ , whence $|D_{1,2}^{b}(t)|\leq 18M^{\prime}n$ ; similarly, there are at most $3M^{\prime}(n^{2}/2)$ ways of deleting two vertices in $L(t)$ and replacing them by a triple of vertices from $V$ in such a way that the resulting set can be viewed as the disjoint union of a pair from $C_{i}$ and a pair from $\bigcup_{j\neq i}C_{j}$ , whence $|D_{1,2}^{a}(t)|\leq 9M^{\prime}n^{2}$ . .

Given $t^{\prime}\in D_{i,j}^{a}(t)$ and considering the edges between $L(t)\cup L(t^{\prime})$ and $R(t)\cup R(t^{\prime})$ , we see that

(6.5)

\displaystyle\mathbb{E}I_{F_{t}}I_{F_{t^{\prime}}}=\mathbb{E}I_{F_{t}}p^{2(4-i)+i(2-j)}=\mathbb{E}I_{F_{t}}p^{8-ij}.

Similarly, for $t^{\prime}\in D^{b}_{i,j}(t)$ we have

(6.6)

\displaystyle\mathbb{E}I_{F_{t}}I_{F_{t^{\prime}}}=\mathbb{E}I_{F_{t}}p^{2(3-i)+i(2-j)}=\mathbb{E}I_{F_{t}}p^{6-ij}.

With the bounds on the size of $D_{i,j}^{a}(t)$ and $D_{i,j}^{b}(t)$ derived above and equalities (6.5) and (6.6) in hand, we are now ready to bound the contribution to $\Delta$ from a connecting triple $t$ .

Case 1: $|L(t)|=4$ .

	$\displaystyle\sum\Bigl{\{}\mathbb{E}I_{F_{t}}I_{F_{t^{\prime}}}\Bigr{.}$	$\displaystyle\Bigl{.}:\ (t,t^{\prime})\textrm{ form a pair of dependent connecting triples}\Bigr{\}}$
	$\displaystyle=$	$\displaystyle\ \mathbb{E}I_{F_{t}}\left(\sum_{i=1}^{4}\left(\|D_{i,2}^{a}\|p^{8-2i}+\|D_{i,2}\|^{b}p^{6-2i}\right)+\sum_{i=1}^{4}\left(\|D_{i,1}^{a}\|p^{8-i}+\|D_{i,1}^{b}\|p^{6-i}\right)\right)$
	$\displaystyle\leq$	$\displaystyle\ \mathbb{E}I_{F_{t}}\Bigl{(}\left((12n^{2}M^{\prime}p^{6}+48M^{\prime}np^{4})+(18n^{2}p^{4}+36np^{2})+(24np^{2}+24)+(6)\right)\Bigr{.}$
		$\displaystyle\Bigl{.}\ \ +\frac{2\varepsilon_{1}n}{\log n}\left((12n^{2}M^{\prime}p^{7}+48M^{\prime}np^{5})+(18n^{2}p^{6}+36np^{4})+(24np^{5}+24p^{3})+(6p^{4})\right)\Bigr{)}$
(6.7)		$\displaystyle\leq$	$\displaystyle\ \left(\mathbb{E}I_{F_{t}}\right)2^{10}(np^{2})^{3}\max\left(1,\frac{\varepsilon_{1}}{\log n}M^{\prime}p\right).$

(Note in the last line we use the fact that $np^{2}\geq(\lambda_{c})^{2}>1/4$ , whence $(np)^{-1}\leq 4p$ .)

Case 2: $|L(t)|=3$ .

	$\displaystyle\sum\Bigl{\{}\mathbb{E}I_{F_{t}}I_{F_{t^{\prime}}}\Bigr{.}$	$\displaystyle\Bigl{.}:\ (t,t^{\prime})\textrm{ form a pair of dependent connecting triples}\Bigr{\}}$
	$\displaystyle=$	$\displaystyle\ \mathbb{E}I_{F_{t}}\left(\sum_{i=1}^{3}\left(\|D_{i,2}^{a}\|p^{8-2i}+\|D_{i,2}^{b}\|p^{6-2i}\right)+\sum_{i=1}^{3}\left(\|D_{i,1}^{a}\|p^{8-i}+\|D_{i,1}^{b}\|p^{6-i}\right)\right)$
	$\displaystyle\leq$	$\displaystyle\ \mathbb{E}I_{F_{t}}\Bigl{(}\left((9n^{2}M^{\prime}p^{6}+18nM^{\prime}p^{4})+(9n^{2}p^{4}+18np^{2})+(6np^{2}+6)\right)\Bigr{.}$
		$\displaystyle\Bigl{.}\ \ +\frac{2\varepsilon_{1}n}{\log n}\left((9n^{2}M^{\prime}p^{7}+18nM^{\prime}p^{5})+(9n^{2}p^{6}+18np^{4})+(6np^{5}+6p^{3})\right)\Bigr{)}$
(6.8)		$\displaystyle\leq$	$\displaystyle\ \left(\mathbb{E}I_{F_{t}}\right)2^{10}(np^{2})^{3}\max\left(1,\frac{\varepsilon_{1}}{\log n}M^{\prime}p\right).$

Together, inequalities (6.7) and (6.8) yield that

(6.9)

\displaystyle\Delta\leq\frac{1}{2}\mu.2^{10}(np^{2})^{3}\max\left(1,\frac{\varepsilon_{1}}{\log n}M^{\prime}p\right).

Applying the Extended Janson Inequality, Proposition 6.5, together with the bounds (6.4) and (6.9) on $\mu$ and $\Delta$ , we get:

	$\displaystyle\mathbb{P}(I_{\mathcal{F}}=0)$	$\displaystyle\leq\exp\left(-\frac{\mu^{2}}{2\Delta}\right)\leq\exp\left(-\frac{\mu}{2^{10}(np^{2})^{3}\max\left(1,\frac{\varepsilon_{1}}{\log n}M^{\prime}p\right)}\right)$
		$\displaystyle\leq\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{13}}\frac{M(np^{2})}{(\log n)^{2}\max\left(1,\frac{\varepsilon_{1}}{\log n}M^{\prime}p\right)}\right)$
(6.13)			$\displaystyle\leq\left\{\begin{array}[]{ll}\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{15}}\frac{M}{(\log n)^{2}}\right)&\textrm{if }2M\leq\frac{p^{-1}\log n}{\varepsilon_{1}},\\ \exp\left(-\frac{\varepsilon_{2}\varepsilon_{1}}{2^{15}}\frac{p^{-1}}{\log n}\right)&\textrm{if }\frac{p^{-1}\log n}{\varepsilon_{1}}\leq 2M\leq n,\\ \exp\left(-\frac{\varepsilon_{2}\varepsilon_{1}}{2^{15}}\frac{p^{-1}M}{n\log n}\right)&\textrm{if }n\leq 2M.\end{array}\right.$

Now, the probability that $C_{i}$ fails to connect to $\cup\{C_{j}:\ j\neq i\}$ via a connecting triple is exactly the probability that $I_{\mathcal{F}}=0$ . Applying Markov’s inequality together with (6.3) and using our assumptions that $M\geq(\log n)^{4}$ and that $C_{1},\ldots,C_{r}$ are disjoint, we have for $n$ sufficiently large that

\displaystyle\mathbb{P}(\textrm{Boost})

\displaystyle\geq 1-r\mathbb{P}(I_{\mathcal{F}}=0)\geq 1-\frac{n^{2}}{M}\mathbb{P}(I_{\mathcal{F}}=0)\geq 1-\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right),

provided $n$ is sufficiently large. This concludes the proof of the connecting lemma. ∎

6.4. Sprinkling vertices

With Lemma 6.4 in hand, we can return to the proof of Theorem 1.5. To complete the proof, we shall use a multiple-round vertex-sprinkling argument. We partition $\Gamma\in{\mathcal{G}}(n,p)$ into the union of

(i)

$\Gamma_{1}\in{\mathcal{G}}((1-\varepsilon_{1})n,p)$ on $V_{1}=[(1-\varepsilon_{1})n]$ ,
(ii)

$\Gamma_{2}\in{\mathcal{G}}(\varepsilon_{1}n,p)$ on $V_{2}=[n]\setminus V_{1}$ , and
(iii)

a $p$ -random bipartite graph $B=B_{p}(V_{1},V_{2})$ with bipartition $V_{1}\sqcup V_{2}$ .

We further partition $V_{2}$ into $2\log_{2}n$ sets of size $\frac{\varepsilon_{1}n}{2\log_{2}n}$ , $V_{2}=\sqcup_{i=1}^{2\log_{2}n}V_{2,i}$ (and ignore rounding errors). We say that $\Gamma_{1}$ is a good configuration if it satisfies the conclusion of Corollary 6.3, i.e., if at least $\varepsilon_{2}n^{2}$ non-edges of $\Gamma_{1}$ lie in square-components in $\square(\Gamma_{1})$ of order at least $M_{0}:=(\log n)^{4}$ (this is actually slightly weaker than what Corollary 6.3 gives us, but is all we need here).

We shall condition on $\Gamma_{1}$ being a good configuration when we perform our vertex-sprinkling. By Corollary 6.3, this occurs with probability $1-O(n^{-1})$ . A key observation is that the state of the edges in $\Gamma_{2}$ and $B$ are independent of our conditioning. Our strategy is then to reveal the $2\log_{2}n$ sprinkling sets $V_{2,k}$ one by one, and use them to create bridges between “somewhat large” square-components and thereby increase the minimum order of all “somewhat large” square-components.

More precisely, before stage $k\geq 1$ we have revealed all the edges inside

V_{1,k-1}:=V_{1}\cup\left(\cup_{i=1}^{k-1}V_{2,i}\right).

At this stage, we deem a square-component “large” if it contains at least $M_{k-1}$ non-edges of $\Gamma$ , and “very large” if it contains at least $2M_{k-1}$ non-edges of $\Gamma$ (which constitute, as we recall, the vertices of the square-graph). Now in stage $k$ , we reveal the set $S_{k}$ of non-edges of $\Gamma$ that lie inside $V_{2,k}$ and the edges between $V_{1,k}$ and $V_{2,k}$ . We then merge components as follows: given two square-components $C_{i}$ and $C_{j}$ , a connecting triple is a triple $(x_{1}x_{2},y_{1}y_{2},z_{1}z_{2})\in C_{i}\times C_{j}\times S_{k}$ . Such a connecting triple is active if all edges between the sets $\{x_{1},x_{2},y_{1},y_{2}\}$ and $\{z_{1},z_{2}\}$ are in $\Gamma$ ; in this case the components $C_{i}$ and $C_{j}$ lie inside the same square-component $C$ in $\square(\Gamma[V_{1,k}])$ (see Figure 3). In particular, if both $C_{i}$ and $C_{j}$ contained at least $M_{k-1}$ non-edges, then $C$ must contain at least $M_{k}=2M_{k-1}$ non-edges.

\labellist

\pinlabel

$y_{1}$ [ ] at 0 107 \pinlabel $y_{2}$ [ ] at 0 26 \pinlabel $x_{1}$ [ ] at 0 270 \pinlabel $x_{2}$ [ ] at 0 189 \pinlabel $z_{1}$ [ ] at 210 189 \pinlabel $z_{2}$ [ ] at 210 107 \endlabellist\includegraphics[scale=0.5]Connecting_triple

Figure 3. In this connecting triple, both

x_{1}z_{1}x_{2}z_{2}

and

y_{1}z_{1}y_{2}z_{2}

form induced copies of

C_{4}

, joining up the square-components containing the non-edges

x_{1}x_{2}

and

y_{1}y_{2}

via the non-edge of sprinkled vertices

z_{1}z_{2}

The connecting lemma we proved in the previous subsection immediately implies that with high probability at each stage $k$ , all components which are “large” but not “very large” must join up with at least one other “large” component. We make this explicit with a lemma below. Recall that throughout this section, $\lambda>\lambda_{c}$ is fixed and the edge-probability $p=p(n)$ satisfies $\lambda\frac{1}{\sqrt{n}}<p(n)<5\frac{\sqrt{\log n}}{\sqrt{n}}$ . Let $\varepsilon_{1},\varepsilon_{2}>0$ be the constants whose existence is guaranteed by Corollary 6.3.

Lemma 6.6 (Sprinkling lemma).

Suppose that before stage $k$ , at least $\varepsilon_{2}n^{2}$ non-edges of $\Gamma[V_{1,k-1}]$ lie in square-components of order at least $M_{k-1}=2^{k-1}M_{0}$ in $\square(\Gamma[V_{1,k-1}])$ . Suppose $M_{k-1}\leq\frac{\varepsilon_{2}}{4}n^{2}$ . Then with probability at least

1-2\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right)

when we have revealed the edges from $V_{2,k}$ to $V_{1,k-1}\cup V_{2,k}:=V_{1,k}$ at least $\varepsilon_{2}n^{2}$ non-edges of $\Gamma[V_{1,k}]$ lie in square-components of order at least $M_{k}=2M_{k-1}$ in $\square(\Gamma[V_{1,k}])$ .

In particular, with probability at least

1-4\log_{2}n\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right)

we have that starting from a good configuration $\Gamma[V_{1}]$ , the sprinkling process described above has discovered within $2\log_{2}n-4\log_{2}\log n$ steps a giant square-component containing at least $\varepsilon_{2}n^{2}$ non-edges, and all non-edges from $\Gamma[V_{1}]$ that lie inside components of $\square(\Gamma[V_{1}])$ of size at least $(\log n)^{4}.$

Proof.

Let $S_{k}$ denote the set of non-edges of $V_{2,k}$ . We have $\mathbb{E}|S_{k}|=(1-p)\binom{\frac{\varepsilon_{1}}{\log n}n}{2}=(1-o(1))\frac{\varepsilon_{1}^{2}}{(\log n)^{2}}n^{2}$ By a standard Chernoff bound,

\displaystyle\mathbb{P}\left(|S_{k}|\leq\frac{\varepsilon_{1}n^{2}}{4(\log n)^{2}}\right)

\displaystyle\leq\exp\left(-\frac{(\varepsilon_{1})^{2}}{16}\frac{n^{2}}{(\log n)^{2}}\right)

If $|S_{k}|\geq\frac{\varepsilon_{1}n^{2}}{(\log n)^{2}}$ holds, we can apply Lemma 6.4, concluding that every component of size $M_{k-1}$ is joined with at least one other, resulting in a component of size $M_{k}=2M_{k-1}$ . Thus the desired conclusion for the first part of the lemma holds with probability at least

1-\exp\left(-\frac{(\varepsilon_{1})^{2}}{16}\frac{n^{2}}{(\log n)^{2}}\right)-\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right)\geq 1-2\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right),

as desired.

For the “in particular” part, we first apply a simple union bound to the first $2\log_{2}n-4\log_{2}\log n$ steps of the process, to show that with probability at least

(6.14)

\displaystyle 1-2(2\log_{2}n-4\log_{2}\log n)\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right)

our sprinkling process has uncovered a collection of square-components, each of which contains at least $\frac{\varepsilon_{2}n^{2}}{2}$ non-edges, and, whose union contains at least $\varepsilon_{2}n^{2}$ non-edges and includes all non-edges of $\Gamma[V_{1}]$ coming from components of $\square(\Gamma[V_{1}])$ of size at least $(\log n)^{4}$ . There can be at most $\frac{1}{2}(\varepsilon_{2})^{-1}$ such components. By (6.3), the probability that a fixed pair of such components fails to join up in the next round of sprinkling is at most

\exp\left(-\frac{(\varepsilon_{2})^{2}\varepsilon_{1}}{2^{16}}\frac{p^{-1}n}{\log n}\right)\leq\exp\left(-\frac{(\varepsilon_{2})^{2}\varepsilon_{1}}{2^{16}}\frac{5n^{\frac{3}{2}}}{(\log n)^{\frac{3}{2}}}\right).

Taking the union bound over the at most $\frac{1}{8}(\varepsilon_{2})^{-2}$ pairs of components, we have that the probability any pair of these components fail to join up is, for large $n$ , a lot less than the last term in equation (6.14):

8\log_{2}\log n\exp\left(-\frac{\varepsilon_{2}(\varepsilon_{1})^{2}}{2^{16}}(\log n)^{2}\right).

Combining this with (6.14), we get the claimed bound on the probability of having discovered a giant square-component containing at least $\varepsilon_{2}n^{2}$ non-edges and all non-edges of $\Gamma[V_{1}]$ contained in components of $\square(\Gamma[V_{1}])$ of size at least $(\log n)^{4}$ . ∎

6.5. Covering the whole world

All that now remains to complete the proof of Theorem 1.5 is to show that a.a.s. there is a square-component covering all vertices of $\square(\Gamma)$ . Note that while in Lemma 6.6 we showed that $\square(\Gamma)$ has a giant component, we have not quite shown it is unique: in principle, one could stitch together a rival giant component at the last stage of sprinkling by building numerous bridges between small components. This is of course highly unlikely, and one could show uniqueness of the giant by exploiting the fact that the number of non-edges of $\Gamma$ lying in square-components of order at least $(\log n)^{4}$ is a.a.s. concentrated around its expectation (as shown in Corollary 6.3). However we do not have a nice form for this expectation, so a little care would be needed to show it changes continuously with $n$ to make the argument above fully rigorous. As this paper is already sufficiently long and as the uniqueness of the giant is not our main concern here, we eschew this and focus instead on the problem of ensuring we have a giant component whose support covers all the vertices. We sidestep the issue of the uniqueness of the giant by considering a partition of $[n]$ which allows us to both build a preferred giant and, crucially, to ensure this preferred giant has full support. We begin by establishing a useful corollary of the work in the previous subsections.

Corollary 6.7.

Let $\lambda>\lambda_{c}$ be fixed, and let $p=p(n)$ be an edge-probability satisfying $\lambda n^{-1/2}\leq p\leq 5n^{-1/2}\sqrt{\log n}$ . Let $\Gamma\in{\mathcal{G}}(n,p)$ . Then there for every $\varepsilon_{3}>0$ sufficiently small, there exists a constant $\varepsilon_{4}>0$ such that given fixed sets $U\subseteq U^{\prime}\subseteq[n]$ with $|U|=\lfloor(1-2\varepsilon_{3})n\rfloor$ , $|U^{\prime}|=\lfloor(1-\varepsilon_{3})n\rfloor$ all of the following hold with probability $1-O(n^{-1})$ :

(i)

there are at least $\varepsilon_{4}n^{2}$ non-edges in $\Gamma[U]$ contained in square-components of $\square(\Gamma[U])$ of order at least $(\log n)^{4}$ ;
(ii)

there is a unique square-component in $\square(\Gamma[U^{\prime}])$ containing all non-edges in $\Gamma[U]$ contained in square-components of $\square(\Gamma[U])$ of order at least $(\log n)^{4}$ ;
(iii)

there is a unique square-component in $\square(\Gamma)$ containing all non-edges contained in square-components of $\square(\Gamma[U])$ or $\square(\Gamma[U^{\prime}])$ of order at least $(\log n)^{4}$ .∎

Proof.

The corollary is immediate for sufficiently small $\varepsilon_{3}>0$ from an application of Corollary 6.3 inside $U$ (for part (i)) and two applications of Lemma 6.6 (for parts (ii) and (iii) respectively), together with a suitable choice of the constant $\varepsilon_{4}$ . ∎

We now apply this Corollary to prove Theorem 1.5.

Proof of Theorem 1.5.

First note that if $f(n)$ is any function with $f(n)=o(1)$ and $f(n)=\Omega(n^{-2})$ , and $5n^{-1/2}\sqrt{\log{n}}\leq p(n)\leq 1-f(n)$ , then $\Gamma\in{\mathcal{G}}(n,p)$ has the $\mathcal{CFS}$ property, by [5, Theorem 5.1]. Now assume that $\lambda>\lambda_{c}$ and $\lambda n^{-1/2}\leq p(n)\leq 5n^{-1/2}\sqrt{\log{n}}$ . Pick $\varepsilon_{3}>0$ sufficiently small, and let $\varepsilon_{4}>0$ be the constant whose existence is guaranteed by Corollary 6.7. Partition $[n]$ into $K=\lceil 2(\varepsilon_{3})^{-1}\rceil$ sets

U_{i}=\{x\in[n]:\ (i-1)\varepsilon_{3}n<x\leq i\varepsilon_{3}n\}.

For each pair $(i,j)$ of distinct elements of $[K]$ , we apply Corollary 6.7 to the sets $U=[n]\setminus(U_{i}\cup U_{j})$ and $U^{\prime}=[n]\setminus U_{i}$ ; taking a union bound over all such pairs $(i,j)$ , we see that with probability $1-O(K^{2}n^{-1})=1-O(n^{-1})$ , for every pair of distinct elements $i,j\in[K]$ the following hold:

(1)

at least $\varepsilon_{4}n^{2}$ non-edges in $\Gamma[[n]\setminus(U_{i}\cup U_{j})]$ are contained in components of $\square(\Gamma[[n]\setminus(U_{i}\cup U_{j})])$ of order at least $(\log n)^{4}$ ;
(2)

there is a unique component $C^{\prime}_{ij}$ of $\square(\Gamma[[n]\setminus U_{i}])$ containing all non-edges of $\Gamma[[n]\setminus(U_{i}\cup U_{j})]$ contained in components of $\square(\Gamma[[n]\setminus(U_{i}\cup U_{j})])$ of order at least $(\log n)^{4}$ ;
(3)

there is a unique component $C_{i}$ of $\square(\Gamma)$ containing $C^{\prime}_{ij}$ as well as all non-edges of $\Gamma[[n]\setminus U_{i}]$ contained in components of $\square(\Gamma[[n]\setminus U_{i}])$ of order at least $(\log n)^{4}$ .

We claim that $\forall i,j\in[K]$ we have $C_{i}=C_{j}$ . Indeed for $i\neq j$ , note that $C_{i}\supseteq C^{\prime}_{ij}$ and $C_{j}\supseteq C^{\prime}_{ji}$ . Since both $C^{\prime}_{ij}$ and $C^{\prime}_{ji}$ contain all of the at least $\varepsilon_{4}n^{2}$ non-edges contained in components of $\square(\Gamma[[n]\setminus(U_{i}\cup U_{j})])$ of order at least $(\log n)^{4}$ , it follows that $C_{i}\cap C_{j}\supseteq C^{\prime}_{ij}\supseteq C^{\prime}_{ji}\neq\emptyset$ . Since their intersection is non-empty, $C_{i}$ and $C_{j}$ are the same component of $\square(\Gamma)$ , as claimed. We may thus let $C_{\star}$ denote the a.a.s. unique square-component with $C_{\star}=C_{i}$ for all $i\in[K]$ .

We now show that a.a.s. the support of this component $C_{\star}$ is the whole vertex set $[n]$ . Pick $i\in[K]$ and condition on the event that there is a square-component $C_{i}^{\prime}$ in $\square(\Gamma[[n]\setminus U_{i}])$ of order at least $\varepsilon_{4}n^{2}$ (an event which occurs with probability $1-O(n^{-1})$ , as we saw in (2) above). If two or more such components exist, pick a largest one. Further, condition on each vertex $x\in U_{i}$ having at least $\frac{\varepsilon_{3}}{2}n$ non-neighbors in $\Gamma[U_{i}]$ . By a standard application of Chernoff bounds and a union-bound, this event occurs with probability $1-O(n^{-1})$ .

Having thus conditioned on the state of pairs in $\Gamma[U_{i}]$ and $\Gamma[[n]\setminus U_{i}]$ , we now show that a.a.s. for every vertex $x\in U_{i}$ , there exist $y\in U_{i}$ and $uv\in C_{i}^{\prime}$ such that $xyuv$ induces a copy of $C_{4}$ — so that that $xy$ belongs to the component $C^{\prime}$ of $\square(\Gamma)$ containing $C_{i}^{\prime}$ . Combining this with (3) above (which implie that a.a.s. $C^{\prime}=C_{\star}$ ) and a simple union bound will then yield Theorem 1.5.

Given non-edges $xy\in U_{i}^{(2)}\setminus E(\Gamma[U_{i}])$ and $uv\in C_{i}^{\prime}$ , let $X_{xy,uv}$ be the indicator function of the event that all of $ux$ , $uy$ , $vx$ and $vy$ are edges of $\Gamma$ . Observe that this event is independent of our conditioning. For $x\in U_{i}$ , set

X_{x}=\sum_{y\in U_{i}:\ xy\notin E[U_{i}]}\sum_{uv\in C_{i}^{\prime}}X_{xy,uv}

to be the number of induced $C_{4}$ ’s $xyuv$ of $\Gamma$ with $y\in U_{i}$ and $uv\in C_{i}^{\prime}$ . We shall again apply the Extended Janson Inequality ((Proposition 6.5) to bound $\mathbb{P}(X_{x}=0)$ . Given our conditioning, the expectation of $X_{x}$ is

\mu:=|\{y\in U_{i}:\ xy\notin E(\Gamma)\}|\cdot|C_{i}^{\prime}|p^{4}\geq\frac{\varepsilon_{3}\varepsilon_{4}(\lambda_{c})^{4}}{2}n=\Omega(n).

We now compute the corresponding $\Delta$ -parameter in Janson’s inequality. For $y,y^{\prime}\in U_{i}\setminus\Gamma_{x}[U_{i}]$ and $uv,u^{\prime}v^{\prime}\in C^{\prime}_{i}$ , write $\{xy,uv\}\sim\{xy^{\prime},u^{\prime}v^{\prime}\}$ if $\{ux,uy,vx,vy\}\cap\{u^{\prime}x,u^{\prime}y^{\prime},v^{\prime}x,v^{\prime}y\}\neq\emptyset$ . Note that the random variables $X_{xy,uv}$ and $X_{xy^{\prime},u^{\prime}v^{\prime}}$ are independent unless $\{xy,uv\}\sim\{xy^{\prime},u^{\prime}v^{\prime}\}$ , and further that $\{xy,uv\}\sim\{xy^{\prime},u^{\prime}v^{\prime}\}$ implies $\{u,v\}\cap\{u^{\prime},v^{\prime}\}\neq\emptyset$ .

Pick and fix $y\in U_{i}\setminus\Gamma_{x}[U_{i}]$ and $uv\in C^{\prime}_{i}$ , which can be done in at most $|U_{i}|\cdot|C_{i}^{\prime}|$ ways. We perform a case analysis to determine the contributions to the $\Delta$ -parameter from terms of the form $\mathbb{E}X_{xy,uv}X_{xy^{\prime},u^{\prime}v^{\prime}}$ with $\{xy,uv\}\sim\{xy^{\prime},u^{\prime}v^{\prime}\}$ :

•

there are at most $|U_{i}|$ choices of $y^{\prime}\in U_{i}\setminus\Gamma_{x}[U_{i}]$ with $y^{\prime}\neq y$ such that $\{xy,uv\}\sim\{xy^{\prime},uv\}$ , and for such $y^{\prime}$ we have $\mathbb{E}[X_{xy,uv}X_{xy^{\prime},uv}]=p^{6}$ ;
•

there are at most $n$ choices of $v^{\prime}\in[n]\setminus U_{i}$ with $v^{\prime}\neq v$ such that $\{xy,uv\}\sim\{xy,uv^{\prime}\}$ , and for such $v^{\prime}$ we have $\mathbb{E}[X_{xy,uv}X_{xy,uv^{\prime}}]=p^{6}$ (with the contribution from the symmetric cases $\{xy,uv\}\sim\{xy,u^{\prime}v\}$ analysed similarly);
•

finally, there are at most $n|U_{i}|$ choices of $(v^{\prime},y^{\prime})$ with $v^{\prime}\in[n]\setminus U_{i}$ , $y^{\prime}\in U_{i}\setminus\Gamma_{x}[U_{i}]$ , $v^{\prime}\neq v$ and $y\neq y^{\prime}$ such that $\{xy,uv\}\sim\{xy^{\prime},uv^{\prime}\}$ , and for such $(v^{\prime},y^{\prime})$ we have $\mathbb{E}[X_{xy,uv}X_{xy^{\prime},uv^{\prime}}]=p^{7}$ (with the contribution from the symmetric cases $\{xy,uv\}\sim\{xy^{\prime},u^{\prime}v\}$ analysed similarly).

Putting it all together, we see

\displaystyle\Delta=\sum_{\{xy,uv\}\sim\{xy^{\prime},u^{\prime}v^{\prime}\}}\mathbb{E}[X_{xy,uv}X_{xy^{\prime},uv}]\leq|U_{i}|\cdot|C_{i}^{\prime}|\Bigl{(}|U_{i}|p^{6}+2np^{6}+2n|U_{i}|p^{7}\Bigr{)}.

Since $|U_{i}|\leq\varepsilon_{3}n$ , $|C^{\prime}_{i}|\leq n^{2}/2$ and $p\leq 5\sqrt{\frac{\log{n}}{n}}$ , the above implies

\displaystyle\Delta

\displaystyle\leq 5^{7}(\varepsilon_{3})^{2}n\sqrt{n}(\log n)^{7/2}(1+o(1)),

which for $n$ large enough is at most $\displaystyle\frac{2^{20}\epsilon_{3}}{\epsilon_{4}(\lambda_{c})^{4}}\mu n^{1/2}(\log n)^{7/2}$ . Applying Janson’s inequality,

\displaystyle\mathbb{P}(X_{x}=0)\leq\exp\left(-\frac{\mu^{2}}{2\Delta}\right)\leq\exp\left(-\frac{\epsilon_{4}\lambda_{c}^{4}\mu}{2^{20}(\log n)^{3/2}\sqrt{n}}\right)=o(n^{-1}).

Thus with probability $1-o(n^{-1})$ , $X_{x}>0$ and the vertex $x$ is covered by the square-component in $\square(\Gamma)$ that contains $C_{i}^{\prime}$ . Taking a union bound over $x\in U_{i}$ , with probability $1-o(1)$ , every vertex $x\in U_{i}$ is covered by this square-component (which as we showed is the a.a.s uniquely determined giant component $C_{\star}$ ). Taking a union bound over $i\in[K]$ and combining this with (1)–(3) above, we see that with probability $1-o(1)$ the square-component $C_{\star}$ covers all of $[n]$ . This concludes the proof of Theorem 1.5. ∎

7. Concluding remarks

There are several natural questions arising from our work. To begin with, one could ask for more information about the component structure in $\square(\Gamma)$ : in the subcritical regime, can one get a better upper bound on the order of square-components? In the supercritical regime, can one give good bounds on the order of the second-largest square-component? In particular, can one give better bounds than just $o(n^{2})$ , and can one show its support has size $o(n)$ ? This may be feasible albeit technically challenging.

Another question on the probabilistic side is determining the range of $p=p(n)$ for which the square graph $\square(\Gamma)$ of $\Gamma\in{\mathcal{G}}(n,p)$ is a.a.s. connected, a very different question from the ones considered in this paper. Investigating other parameters such as the diameter of $\square(\Gamma)$ may also lead in interesting directions.

Further afield, one could consider percolation problems for similar structures. We could for example consider the graph on all triples of independent vertices in $\Gamma$ obtained by setting two such triples to be adjacent if they induce a copy of the $6$ -cycle $C_{6}$ or of the complete balanced bipartite graph $K_{3,3}$ in $\Gamma$ . We would guess the techniques in this paper would adapt well to the latter problem, as Bollobás and Riordan also suggest, but that new ideas would be needed for the former.

One could also consider a similar problem for a $p$ -random $r$ -uniform hypergraph $\Gamma$ , by setting a set of $r$ vertex-disjoint non-edges $F_{1},F_{2},\ldots,F_{r}\notin E(\Gamma)$ to be adjacent if all of the $r^{2}$ edges meeting each $F_{i}$ in exactly one vertex are present in $\Gamma$ (in other words, if and only if the union of the $F_{i}$ s induces a copy of the complete balanced $r$ -partite $r$ -uniform hypergraph on $r^{2}$ vertices). Note the case $r=2$ corresponds exactly to square percolation. Levcovitz [16] has provided a quasi-isometry invariant for right-angled Coxeter groups by associating a hypergraph to any such group, so analysis of suitable variants of square percolation for hypergraphs may yield interesting applications in geometric group theory (besides constituting a challenging and rather natural problem in combinatorial probability).

Finally it would be interesting to study other properties of the right-angled Coxeter group $W_{\Gamma}$ when $\Gamma\in{\mathcal{G}}(n,p)$ using tools from random graph theory. In particular, determining the threshold for algebraic thickness of every order, or the exact rate of divergence of $W_{\Gamma}$ for all $p$ would be of great interest (see [6, Question 1]). Doing so will require new group theoretic ideas to translate these properties into graph theoretic language, and the identification of suitably tractable graph theoretic proxies for these in ${\mathcal{G}}(n,p)$ . Work of Levcovitz [16] provides promising progress towards finding combinatorial properties to encode higher rates of polynomial divergence in right-angled Coxeter groups; indeed, as we finalized this paper, Levcovitz released a new preprint [17] that provides such a translation, which we expect will be of use in future work on this problem. As Levcovitz’s work involves hypergraphs, developing new techniques for generalizations of square percolation to hypergraphs will likely be key to further progress.

Finally, one could study thickness and relative hyperbolicity in random right-angled Coxeter groups with presentation graphs drawn from other distributions than the Erdős–Rényi random graph model, such as random regular graphs. We do not know of any work which has been done in this direction at the present time.

References

[1] Alon, N., and Spencer, J. H. The probabilistic method, fourth ed. Wiley-Interscience Series in Discrete Mathematics and Optimization. John Wiley & Sons, Inc., Hoboken, NJ, 2015.
[2] Balister, P. Branching processes. Lecture Note Series, IMS, NUS (2006).
[3] Behrstock, J., and Druţu, C. Divergence, thick groups, and short conjugators. Illinois J. Math. 58, 4 (2014), 939–980.
[4] Behrstock, J., Druţu, C., and Mosher, L. Thick metric spaces, relative hyperbolicity, and quasi-isometric rigidity. Mathematische Annalen 344, 3 (2009), 543–595.
[5] Behrstock, J., Falgas-Ravry, V., Hagen, M. F., and Susse, T. Global structural properties of random graphs. Int. Math. Res. Not. 2018, 5 (2018), 1411–1441.
[6] Behrstock, J., Hagen, M., and Sisto, A. Thickness, relative hyperbolicity, and randomness in Coxeter groups. Algebr. Geom. Topol. 17, 2 (2017), 705–740.
[7] Bollobás, B., and Riordan, O. Clique percolation. Random Structures Algorithms 35, 3 (2009), 294–322.
[8] Dani, P., and Thomas, A. Divergence in right-angled Coxeter groups. Trans. Amer. Math. Soc. 367, 5 (2015), 3549–3577.
[9] Derényi, I., Palla, G., and Vicsek, T. Clique percolation in random networks. Phys. Rev. Lett. 94 (Apr 2005), 160202.
[10] Druţu, C., Mozes, S., and Sapir, M. Divergence in lattices in semisimple Lie groups and graphs of groups. Trans. Amer. Math. Soc. 362, 5 (2010), 2451–2505.
[11] Dwass, M. The total progeny in a branching process and a related random walk. Journal of Applied Probability 6, 3 (1969), 682–686.
[12] Gersten, S. M. Divergence in $3$ -manifold groups. Geom. Funct. Anal. 4, 6 (1994), 633–647.
[13] Harris, T. E. A lower bound for the critical probability in a certain percolation process. In Mathematical Proceedings of the Cambridge Philosophical Society (1960), vol. 56, Cambridge University Press, pp. 13–20.
[14] Janson, S., Łuczak, T., and Ruciński, A. Random graphs, vol. 45. John Wiley & Sons, 2011.
[15] Levcovitz, I. Divergence of $\rm CAT(0)$ cube complexes and Coxeter groups. Algebr. Geom. Topol. 18, 3 (2018), 1633–1673.
[16] Levcovitz, I. A quasi-isometry invariant and thickness bounds for right-angled Coxeter groups. Groups Geom. Dyn. 13, 1 (2019), 349–378.
[17] Levcovitz, I. Characterizing divergence and thickness in right-angled Coxeter groups. arXiv:2007.13796, 2020.
[18] Li, M., Deng, Y., and Wang, B.-H. Clique percolation in random graphs. Phys. Rev. E (3) 92, 4 (2015), 042116–042122.
[19] Mühlherr, B. Automorphisms of graph-universal coxeter groups. Journal of Algebra 200, 2 (1998), 629–649.
[20] Palla, G., Derényi, I., Farkas, I., and Vicsek, T. Uncovering the overlapping community structure of complex networks in nature and society. Nature 435 (June 2005), 814–818.
[21] Tóth, B., Vicsek, T., and Palla, G. Overlapping modularity at the critical point of $k$ -clique percolation. J. Stat. Phys. 151, 3-4 (2013), 689–706.
[22] Wang, B., Cao, L., Suzuki, H., and Aihara, K. Impacts of clustering on interacting epidemics. J. Theoret. Biol. 304 (2012), 121–130.

Square percolation and the threshold for quadratic divergence in random right-angled Coxeter groups

Abstract.

2010 Mathematics Subject Classification:

1. Introduction

Definition 1.1.

Remark 1.2.

Definition 1.3.

Theorem 1.4 (Subcritical Behavior).

Theorem 1.5 (Supercritical Behavior).

Theorem 1.6 (Criticality for quadratic divergence of RACGs).

Organization of the paper

Acknowledgments

2. Graph-theoretic notation and standard notions

3. Geometric group theory and the 𝒞​ℱ​𝒮\mathcal{CFS} property

3.1. Background

Definition 3.1.

Theorem 3.2 (Theorems 5.1 and 5.7 in [5]).

Definition 3.3.

Definition 3.4.

Proposition 3.5 ([3, Corollary 4.17]).

3.2. Proof of threshold for quadratic divergence in random RACGs

Proof of Theorem 1.6 from Theorems 1.4 and 1.5.

4. Branching processes and proof strategy

4.1. Branching processes

Definition 4.1.

Proposition 4.2 (See e.g. [2]).

Proposition 4.3 (Dwass’s formula [11]).

4.2. Departures from the Bollobás–Riordan framework

4.3. Proof strategy

5. The subcritical regime: proof of Theorem 1.4

Theorem 5.1.

5.1. An exploration process

5.2. Analysing the process

Lemma 5.2.

Proof.

Lemma 5.3.

Proof.

Lemma 5.4.

Proof.

Corollary 5.5.

Proof.

Lemma 5.6.

Proof.

Lemma 5.7.

Proof.

Lemma 5.8.

Proof.

Proof of Theorem 5.1.

6. The supercritical regime: proof of Theorem 1.5

6.1. An exploration process

6.2. Many non-edges in somewhat large components

Lemma 6.1.

Proof.

Lemma 6.2 (Many squares in large square-components).

Proof.

Claim 1.

Proof.

Claim 2.

Proof.

Corollary 6.3.

Proof.

6.3. A connecting lemma

Lemma 6.4 (Connecting Lemma).

Proposition 6.5 (The extended Janson inequality [14]).

Proof of Lemma 6.4.

6.4. Sprinkling vertices

Lemma 6.6 (Sprinkling lemma).

Proof.

6.5. Covering the whole world

Corollary 6.7.

Proof.

Proof of Theorem 1.5.

7. Concluding remarks

References

3. Geometric group theory and the $\mathcal{CFS}$ property