This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Short proof of the hypergraph container theorem

Rajko Nenadov School of Computer Science, University of Auckland, New Zealand. Email: [email protected]. Research supported by the New Zealand Marsden Fund.    Huy Tuan Pham Department of Mathematics, Stanford University, Stanford, CA 94305. Email: [email protected]. Research supported by a Clay Research Fellowship and a Stanford Science Fellowship.
Abstract

We present a short and simple proof of the celebrated hypergraph container theorem of Balogh–Morris–Samotij and Saxton–Thomason. On a high level, our argument utilises the idea of iteratively taking vertices of largest degree from an independent set and constructing a hypergraph of lower uniformity which preserves independent sets and inherits edge distribution. The original algorithms for constructing containers also remove in each step vertices of high degree which are not in the independent set. Our modified algorithm postpones this until the end, which surprisingly results in a significantly simplified analysis.

1 Introduction

The method of containers is a powerful technique in combinatorics used to produce a small number of clusters encompassing independent sets of a given hypergraph. While in some applications one follows the idea of the method and the general principles for building such clusters, quite often one can apply off the shelf tools. The most such applicable tool has been developed independently by Balogh, Morris, and Samorij [1] and Saxton and Thomason [9], and it is this result that is commonly referred to as the hypergraph container theorem. For an introduction to the method, the hypergraph container theorem, and its many suprising applications, we refer the reader the ICM survey [2]. A number of different proofs and versions of this result have been obtained since [3, 4, 5, 7, 8, 10, 11]. We present a simple and short proof of a slight generalisation of the original theorem. Two other short proofs have been obtained very recently by Campos and Samotij [6].

Let VV be a finite set. Given a subset XVX\subseteq V, let X={SV:XS}\langle X\rangle=\{S\subseteq V\colon X\subseteq S\}. We say a probability measure ν\nu over 2V2^{V} is (p,K)(p,K)-uniformly-spread if for every non-empty XVX\subseteq V we have ν(X)Kp|X|1/|V|\nu(\langle X\rangle)\leq Kp^{|X|-1}/|V|. Uniform signifies that the measure is fairly uniform from the point of view of elements of VV. Throughout the paper we use V=V()V=V(\mathcal{H}) and N=|V|N=|V|, where \mathcal{H} is a given hypergraph. If all edges in a hypergraph \mathcal{H} have size at most \ell, we say that \mathcal{H} is an ()(\leq\ell)-graph.

Theorem 1.1.

For every \ell\in\mathbb{N} and K,ε>0K,\varepsilon>0 there exists T>0T>0 such that the following holds. Suppose \mathcal{H} is an ()(\leq\ell)-graph, and let ν\nu be (p,K)(p,K)-uniformly-spread measure over 2V2^{V} supported on \mathcal{H}, for some p(0,1]p\in(0,1]. Then for every independent set IV()I\subseteq V(\mathcal{H}) there exists FIF\subseteq I and C=C(F)VC=C(F)\subseteq V such that |F|TNp|F|\leq TNp, ν([C])<ε\nu(\mathcal{H}[C])<\varepsilon, and ICFI\subseteq C\cup F.

If ν\nu is uniform on \mathcal{H}, we obtain original hypergraph container theorems [1, 9]. Dependence of TT on the uniformity is of order O(22)O(2^{\ell^{2}}), which is also along the lines of the original results. Near-optimal dependence was obtained by Balogh and Samotij [3] and Campos and Samotij [6].

2 Proof

Our proof bears resemblance with the proof from [1, 9]. On a high level, we choose FF in Theorem 1.1 by greedily taking vertices from II with largest degree with respect to ν\nu and construct a hypergraph of lower uniformity given by (parts) of hyperedges containing vertices from FF. A common feature in many of the proofs utilising a similar idea is that one also keeps track of the vertices which are not in II but have larger degree than the last chosen vertex in FF. The main novelty here is that we completely avoid this, unless we are in the case the resulting hypergraph of lower uniformity is not sufficiently dense to proceed with the induction. In this case, we show that removing vertices of high degree immediately yields a desired container. It is worth noting that the proofs from [1, 9] also have a similar case distinction, however the analysis in our cases turns out to be significantly simpler.

Theorem 1.1 follows by iterated application of the following lemma, known as the hypergraph container lemma.

Lemma 2.1.

For every \ell\in\mathbb{N} and K>0K>0 there exists δ>0\delta>0 such that the following holds. Suppose \mathcal{H} is an ()(\leq\ell)-graph, and let ν\nu be a (p,K)(p,K)-uniformly-spread measure over 2V2^{V} supported on \mathcal{H}, for some p(0,1]p\in(0,1]. Then for every independent set IVI\subseteq V there exists FIF\subseteq I and C=C(F)VC=C(F)\subseteq V such that |F|Np|F|\leq\ell Np, |C|(1δ)N|C|\leq(1-\delta)N, and ICFI\subseteq C\cup F. Moreover, CC can be unambigously constructed from any FF^IF\subseteq\hat{F}\subseteq I.

Proof.

We prove the lemma by induction on \ell. For =1\ell=1, take F=F=\varnothing and CVC\subseteq V to be the set of all vertices vVv\in V with ν(v)=0\nu(v)=0. As there are at least N/KN/K vertices with strictly positive measure, the lemma holds for δ=1/K\delta=1/K. We now prove the lemma for 2\ell\geq 2. Without loss of generality, we may assume |I|Np|I|\geq Np.

Set F=IF=\varnothing\subseteq I, =2V\mathcal{L}=\varnothing\subseteq 2^{V}, and 𝒟,=\mathcal{D},\mathcal{H}^{\prime}=\varnothing\subseteq\mathcal{H}. Repeat the following for NpNp rounds: Take vIFv\in I\smallsetminus F to be a largest vertex with respect to ν(v)\nu(\langle v\rangle\cap\mathcal{R}), where =[VF]𝒟\mathcal{R}=\mathcal{H}[V\smallsetminus F]\smallsetminus\mathcal{D} (tie-breaking done in some canonical way, e.g. by agreeing on the ordering of VV). Add vv to FF, set =(v)\mathcal{H}^{\prime}=\mathcal{H}^{\prime}\cup(\langle v\rangle\cap\mathcal{R}), and for each X2VX\in 2^{V}\smallsetminus\mathcal{L} of size |X|1|X|\leq\ell-1 such that

ν(X)>Kp|X|/N,\nu(\langle X\rangle\cap\mathcal{H}^{\prime})>Kp^{|X|}/N, (1)

add XX to \mathcal{L} and set 𝒟=𝒟(X)\mathcal{D}=\mathcal{D}\cup(\langle X\rangle\cap\mathcal{R}).

A few observations about the process. First, as ν\nu is (p,K)(p,K)-uniformly-spread the value ν(X)\nu(\langle X\rangle\cap\mathcal{H}^{\prime}) increases by at most ν(X{v})Kp|X|/N\nu(\langle X\cup\{v\}\rangle)\leq Kp^{|X|}/N after adding a vertex vv to FF. Once a subset XX satisfies (1) no more hyperedges which contain XX are added to \mathcal{H}^{\prime}, thus at the end of the process we have

ν(X)2Kp|X|/N\nu(\langle X\rangle\cap\mathcal{H}^{\prime})\leq 2Kp^{|X|}/N (2)

for every XVX\subseteq V of size |X|1|X|\leq\ell-1. Second, given FF^IF\subseteq\hat{F}\subseteq I, we can reconstruct FF from F^\hat{F} together with the order in which the vertices were added, thus we can also reconstruct \mathcal{H^{\prime}} and \mathcal{R}.

We next derive several useful lower bounds on ν()\nu(\mathcal{H}^{\prime}). First we show that if ν(𝒟)\nu(\mathcal{D}) is large, then ν()\nu(\mathcal{H}^{\prime}) is also large. In particular, the following holds:

ν()2pν(𝒟).\displaystyle\nu(\mathcal{H}^{\prime})\geq 2^{-\ell}p\nu(\mathcal{D}). (3)

Indeed, for each e𝒟e\in\mathcal{D} there exists XX\in\mathcal{L} such that eXe\in\langle X\rangle. Thus, Xν(X)ν(𝒟)\sum_{X\in\mathcal{L}}\nu(\langle X\rangle)\geq\nu(\mathcal{D}). On the other hand, we have by (1) that

Xν(X)>XKp|X|/NpXν(X).\sum_{X\in\mathcal{L}}\nu(\langle X\rangle\cap\mathcal{H}^{\prime})>\sum_{X\in\mathcal{L}}Kp^{|X|}/N\geq p\sum_{X\in\mathcal{L}}\nu(\langle X\rangle).

Here in the last inequality we use that ν\nu is (p,K)(p,K)-uniformly spread. Furthermore, each edge ee in \mathcal{H}^{\prime} may contribute to at most 22^{\ell} terms ν(X)\nu(\langle X\rangle\cap\mathcal{H}^{\prime}). Hence,

ν()2Xν(X)>2pν(𝒟),\displaystyle\nu(\mathcal{H}^{\prime})\geq 2^{-\ell}\sum_{X\in\mathcal{L}}\nu(\langle X\rangle\cap\mathcal{H}^{\prime})>2^{-\ell}p\nu(\mathcal{D}),

as claimed in (3).

Next, we show that

ν()(Np)maxvIFν(v).\displaystyle\nu(\mathcal{H}^{\prime})\geq(Np)\max_{v\in I\smallsetminus F}\nu(\langle v\rangle\cap\mathcal{R}). (4)

Let i\mathcal{R}_{i} denote the hypergraph \mathcal{R} at the moment when the ii-th vertex viv_{i} was added to FF (thus =|F|\mathcal{R}=\mathcal{R}_{|F|}). We observe that, since \mathcal{R} is non-increasing and by our choice of vv in each step,

ν()i=1|F|ν(vii)i=1|F|maxvIFν(v|F|),\displaystyle\nu(\mathcal{H}^{\prime})\geq\sum_{i=1}^{|F|}\nu(\langle v_{i}\rangle\cap\mathcal{R}_{i})\geq\sum_{i=1}^{|F|}\max_{v\in I\smallsetminus F}\nu(\langle v\rangle\cap\mathcal{R}_{|F|}),

yielding (4).

Let α=22\alpha=2^{-\ell-2}. We now distinguish two cases, where if ν()\nu(\mathcal{H}^{\prime}) is large, then we can apply the inductive hypothesis to an appropriate (1)(\leq\ell-1)-graph, and otherwise we can immediately find a small container CC for which IFCI\smallsetminus F\subseteq C.

Case 1: ν()αp\nu(\mathcal{H}^{\prime})\geq\alpha p. Let ′′\mathcal{H}^{\prime\prime} denote the (1)(\leq\ell-1)-graph consisting of sets XX such that X=HFX=H^{\prime}\smallsetminus F for some HH^{\prime}\in\mathcal{H}^{\prime}. Set ν\nu^{\prime} to be the probability measure over 2VF2^{V\smallsetminus F} given by

ν(X){ν((X2F)),if X′′,0,otherwise.\nu^{\prime}(X)\,\propto\,\begin{cases}\nu((X\cup 2^{F})\cap\mathcal{H}^{\prime}),&\text{if }X\in\mathcal{H}^{\prime\prime},\\ 0,&\text{otherwise.}\end{cases}

From (2) and ν()αp\nu(\mathcal{H}^{\prime})\geq\alpha p we conclude that ν\nu^{\prime} is (2Kα1,p)(2K\alpha^{-1},p)-uniformly-spread. Also observe that II is an independent set in ′′\mathcal{H}^{\prime\prime}, thus by the induction hypothesis there exists FVF^{\prime}\subseteq V of size |F|(1)Np|F^{\prime}|\leq(\ell-1)Np and C=C(F)C=C(F^{\prime}) such that |C|(1δ)N|C|\leq(1-\delta)N and ICFI\subseteq C\cup F^{\prime}. Note that we can reconstruct CC from F:=FFF:=F\cup F^{\prime}.

Case 2: ν()<αp\nu(\mathcal{H}^{\prime})<\alpha p. By (3), we have ν(𝒟)<1/4\nu(\mathcal{D})<1/4 and hence ν()ν()ν()ν(𝒟)>1/2\nu(\mathcal{R})\geq\nu(\mathcal{H})-\nu(\mathcal{H}^{\prime})-\nu(\mathcal{D})>1/2. By (4), for every vIFv\in I\smallsetminus F we have

ν(v)α/N.\nu(\langle v\rangle\cap\mathcal{R})\leq\alpha/N. (5)

Let now CVFC\subseteq V\smallsetminus F denote the set of all vVFv\in V\smallsetminus F such that ν(v)α/N\nu(\langle v\rangle\cap\mathcal{R})\leq\alpha/N. By (5) we have IFCI\smallsetminus F\subseteq C. Furthermore,

ν()vCν(v)+wV(FC)ν(w)<α+(N|C|)K/N.\nu(\mathcal{R})\leq\sum_{v\in C}\nu(\langle v\rangle\cap\mathcal{R})+\sum_{w\in V\smallsetminus(F\cup C)}\nu(\langle w\rangle\cap\mathcal{R})<\alpha+(N-|C|)\cdot K/N.

Hence, |C|<N(ν()α)N/K<(1δ)N|C|<N-(\nu(\mathcal{R})-\alpha)N/K<(1-\delta)N for δ=1/(4K)\delta=1/(4K). This concludes the construction of desired FF and CC. ∎

For the sake of completeness, we derive Theorem 1.1 from Lemma 2.1.

Proof of Theorem 1.1.

Let δ>0\delta>0 be as given by Lemma 2.1 for \ell and K/εK/\varepsilon (as KK). We prove the theorem for T=log(Kε1)/log(1+δ)T=\ell\log(K\varepsilon^{-1})/\log(1+\delta).

We find a fingerprint FF and a container CC as follows. Set F=F=\varnothing and C=VC=V, and as long as ν([C])ε\nu(\mathcal{H}[C])\geq\varepsilon do the following: Let FF^{\prime} and CC^{\prime} be as given by Lemma 2.1 applied with ν\nu^{\prime} being a probability measure over 2C2^{C} given by ν(X)ν(X)\nu^{\prime}(X)\,\propto\,\nu(X) if X[C]X\in\mathcal{H}[C], and ν(X)=0\nu^{\prime}(X)=0 otherwise. Set F:=FFF:=F\cup F^{\prime} and C:=CC:=C^{\prime}, and proceed to the next iteration.

If ν([C])ε\nu(\mathcal{H}[C])\geq\varepsilon, then for nonempty XCX\subseteq C,

ν(X)ν(X)ν([C])Kp|X|1/NεKεp|X|1/|C|,\nu^{\prime}(\langle X\rangle)\leq\frac{\nu(\langle X\rangle)}{\nu(\mathcal{H}[C])}\leq\frac{Kp^{|X|-1}/N}{\varepsilon}\leq\frac{K}{\varepsilon}p^{|X|-1}/|C|,

and hence ν\nu^{\prime} is (p,K/ε)(p,K/\varepsilon)-uniformly-spread each time we apply Lemma 2.1. Furthermore, if ν([C])ε\nu(\mathcal{H}[C])\geq\varepsilon, then |C|εN/K|C|\geq\varepsilon N/K. In each iteration the set CC shrinks by a factor of 1δ1-\delta, thus we are done after at most log(Kε1)/log(1+δ)\log(K\varepsilon^{-1})/\log(1+\delta) iterations. The set FF grows by at most Np\ell Np in each iteration, which gives an upper bound of TNpTNp on its final size for the above choice of T=T(K,ε)T=T(K,\varepsilon). Due to the last property in Lemma 2.1, the final set CC can be unambiguously constructed from FF. ∎

Acknowledgment.

Ideas used in this paper were developed while the first author was visiting Stanford University in November 2023. The first author thanks Jacob Fox for hospitality. We thank Jacob Fox for helpful comments and Wojciech Samotij for pointing out a subtle issue in an earlier version.

References

  • [1] J. Balogh, R. Morris, and W. Samotij. Independent sets in hypergraphs. J. Am. Math. Soc., 28(3):669–709, 2015.
  • [2] J. Balogh, R. Morris, and W. Samotij. The method of hypergraph containers. In Proceedings of the international congress of mathematicians 2018, ICM 2018, Rio de Janeiro, Brazil, August 1–9, 2018. Volume IV. Invited lectures, pages 3059–3092. Hackensack, NJ: World Scientific; Rio de Janeiro: Sociedade Brasileira de Matemática (SBM), 2018.
  • [3] J. Balogh and W. Samotij. An efficient container lemma. Discrete Anal., 2020:56, 2020. Id/No 17.
  • [4] A. Bernshteyn, M. Delcourt, H. Towsner, and A. Tserunyan. A short nonalgorithmic proof of the containers theorem for hypergraphs. Proc. Am. Math. Soc., 147(4):1739–1749, 2019.
  • [5] M. Bucić, J. Fox, and H. T. Pham. Equivalence between Erdős-Hajnal and polynomial Rödl and Nikiforov conjectures, 2024. arXiv:2403.08303.
  • [6] M. Campos and W. Samotij. Towards an optimal hypergraph container lemma, 2024. arXiv:2408.06617.
  • [7] R. Morris, W. Samotij, and D. Saxton. An asymmetric container lemma and the structure of graphs with no induced 4-cycle. J. Eur. Math. Soc., 26(5):1655–1711, 2024.
  • [8] R. Nenadov. Probabilistic hypergraph containers. Israel Journal of Mathematics, 261:879–897, 2024.
  • [9] D. Saxton and A. Thomason. Hypergraph containers. Invent. Math., 201(3):925–992, 2015.
  • [10] D. Saxton and A. Thomason. Online containers for hypergraphs, with applications to linear equations. J. Comb. Theory, Ser. B, 121:248–283, 2016.
  • [11] D. Saxton and A. Thomason. Simple containers for simple hypergraphs. Comb. Probab. Comput., 25(3):448–459, 2016.