Improved bounds for coloring locally sparse hypergraphs
Abstract
We show that, for every , every -uniform hypergaph of degree and girth at least is efficiently -list colorable. As an application (and to the best of our knowledge) we obtain the currently best algorithm for list-coloring random hypergraphs of bounded average degree.
1 Introduction
In hypergraph coloring one is given a hypergraph and the goal is to find an assignment of one of colors to each vertex so that no hyperedge is monochromatic. In the more general list-coloring problem, a list of allowed colors is specified for each vertex. A graph is -list-colorable if it has a list-coloring no matter how the lists are assigned to each vertex. The list chromatic number, , is the smallest for which is -list colorable.
Hypergraph coloring is a fundamental constraint satisfaction problem with several applications in computer science and combinatorics, that has been studied for over 60 years. In this paper we consider the task of coloring locally sparse hypergraphs and its connection to coloring sparse random hypegraphs.
A hypergraph is -uniform if every hyperedge contains exactly vertices. An -cycle in a -uniform hypergraph is a collection of distinct hyperedges spanned by at most vertices. We say that a -uniform hypergraph has girth at least if it contains no -cycles for . Note that if a -uniform hypergraph has girth at least then every two of its hyperedges have at most one vertex in common.
The main contribution of this paper is to prove the following theorem.
Theorem 1.1.
Let by any -uniform hypergraph, , of maximum degree and girth at least . For all , there exist a positive constant such that if , then
(1) |
Furthermore, if is a hypergraph on vertices then there exists an algorithm that constructs such a coloring in expected polynomial time in .
Remark 1.1.
If is assumed to be constant, then the algorithm of Theorem 1.1 can be efficiently derandomized, i.e., there exists a deterministic algorithm that constructs the promised coloring in polynomial time in .
Theorem 1.1 is interesting for a number of reasons. First, it generalizes a well-known result of Kim [21] for coloring graphs of degree and girth , and it implies the classical theorem of Ajtai, Komlós, Pintz, Spencer and Szemerédi [4] regarding the independence number of -uniform hypergraphs of degree and girth . The latter is a seminal result in combinatorics, with applications in geometry and coding theory [22, 23, 24]. Second, Theorem 1.1 is tight up to a constant [8]. Note also that, without the girth assumption, the best possible bound [11] on the chromatic number of -uniform hypergraphs is , i.e., it is asymptotically worse than the one of Theorem 1.1. For example, there exist graphs of degree whose chromatic number is exactly . Third, when it applies, Theorem 1.1 improves upon a result of Frieze and Mubayi [13] regarding the chromatic number of simple hypergraphs, who showed (1) with an unspecified large leading constant (of order at least ). Finally, Theorem 1.1 can be used to provide the currently best algorithm for list-coloring random -uniform hypergarphs of bounded average degree (to the best of our knowledge). We discuss the connection between locally sparse hypergraphs and sparse random hypergraphs with respect to the task of coloring in the following section.
1.1 Application to coloring pseudo-random hypergraphs
The random -uniform hypergraph is obtained by choosing each of the -element subsets of a vertex set ( ) independently with probability . The chosen subsets are the hyperedges of the hypergraph. Note that for we have the usual definition of the random graph . We say that has a certain property asymptotically almost surely or with high probability, if the probability that has tends to as .
In this paper we are interested in , i.e., the family of random -uniform hypergraphs of bounded average degree . Specifically, we use Theorem 1.1 to prove the following theorem.
Theorem 1.2.
For any constants , , there exists such that for every constant , the random hypergraph can be -list-colored by a deterministic algorithm whose running time is polynomial in asymptotically almost surely.
Remark 1.2.
Note that, for constants, a very standard argument reveals that is essentially equivalent to , namely the uniform distribution over -uniform hypergraphs with vertices and exactly hyperedges. Thus, Theorem 1.2 extends to that model as well.
We note that previous approaches [3, 31, 13] for list-coloring random -uniform hypergraphs of bounded average degree are either randomized, or require significantly larger lists of colors per vertex in order to succeed. Indeed, for the approach of Achlioptas and Molloy [3] matches the bound of Theorem 1.2 but is randomized, while for , and to the best our knowledge, our algorithm uses less colors that any algorithm for (list-)coloring random hypergraphs of bounded average degree that has been rigorously analyzed in the literature. Moreover, it is believed that all efficient algorithms require lists of size at least , as this bound corresponds to the so-called shattering threshold [1, 7, 14] for coloring sparse random hypergraphs, which is also often referred to as the “algorithmic barrier” [1]. This threshold arises in a plethora of random constraint satisfaction problems, and it corresponds to a precise phase transition in the geometry set of solutions. In all of these problems, we are not aware of any efficient algorithm that works beyond the algorithmic barrier, despite the fact that solutions exist for constraint-densities larger than the one in which the shattering phenomenon appears. We refer the reader to [1, 33] for further details.
In order to prove Theorem 1.2, we show that random -uniform hypergraphs of bounded average degree can essentially be treated as hypergraphs of girth and maximum degree for the purposes of list-coloring, and then apply Theorem 1.1. In particular, we identify a pseudo-random family of hypergraphs which we call girth-reducible, and show that almost all -uniform hypergraphs of bounded average degree belong in this class. Then we show that girth-reducible hypergraphs can be colored efficiently using Theorem 1.1.
Formally, a -uniform hypergraph is -degenerate if the induced subhypergraph of all subsets of its vertex set has a vertex of degree at most . The degeneracy of a hypergraph is the smallest value of for which is -degenerate. Note that it is known that -degenerate hypergraphs are -list colorable and that the degeneracy of a hypergraph can be computed efficiently by an algorithm that repeatedly removes minimum degree vertices. Indeed, to list-color a -degenerate hypergraph we repeatedly find a vertex with (remaining) degree at most , assign to it a color that does not appear in any of its neighbors so far, and remove it from the hypergraph. Clearly, if the lists assigned to each vertex are of size at least this procedure always terminates successfully.
Definition 1.3.
For , we say that a -uniform hypergraph of average degree is -girth-reducible if its vertex set can be partitioned in two sets, and , such that:
-
(a)
contains all cycles of length at most , and all vertices of degree larger than ;
-
(b)
subhypergraph is -degenerate;
-
(c)
every vertex in has at most neighbors in .
In words, a hypergraph is -girth-reducible if its vertex set can be seen as the union of two parts: A “low-degeneracy” part, which contains all vertices of degree more than and all cycles of lengths at most , and a “high-girth” part, which induces a hypergraph of maximum degree at most and girth . Moreover, each vertex in the “high-girth” part has only a few neighbors in the “low-degeneracy” part.
Note that given a -girth-reducible hypergraph we can efficiently find the promised partition as follows. We start with , where is the set of vertices that either have degree at least , or they are contained in a cycle of length at most . Let denote the vertices in that violate property (c). While , update as . The correctness of the process lies in the fact that in each step we add to the current a set of vertices that must be in the low-degeneracy part of the hypergraph. Observe also that this process allows us to efficiently check whether a hypergraph is -girth-reducible.
We prove the following theorem regarding the list-chromatic number of girth-reducible hypergraphs.
Theorem 1.4.
For any constants and , there exists such that if is a -girth-reducible, -uniform hypergraph of average degree , then
where . Furthermore, if is a hypergraph on vertices then there exists a deterministic algorithm that constructs such a coloring in time polynomial in .
Proof of Theorem 1.4.
Let . Given lists of colors of size for each vertex of , we first color the vertices of using the greedy algorithm which exploits the low degeneracy of . Now each vertex in has at most forbidden colors in its list as it has at most that many neighbors in . We delete these colors from the list. Observe that if we manage to properly color the induced subgraph using colors from the updated lists, then we are done since every hyperedge with vertices both in and will be automatically “satisfied”, i.e., it cannot be monochromatic. Notice now that the updated list of each vertex still contains at least colors, for sufficiently large . Since the induced subgraph is of girth at least and of maximum degree at most , it is efficiently -list-colorable for sufficiently large per Theorem 1.1 and Remark 1.1. This concludes the proof since .
∎
Moreover, we show that girth-reducibility is a pseudo-random property which is admitted by almost all sparse -uniform hypregraphs.
Theorem 1.5.
For any constants , , there exists such that for every constant , asymptotically almost surely, the random hypergraph is -girth-reducible.
Theorem 1.5 follows by simple, although somewhat technical, considerations on properties of sparse random hypergraphs, which are mainly inspired by the results of Alon, Krivelevich and Sudakov [6] and Łuczak [25]. Observe that combining Theorem 1.5 with Theorem 1.4 immediately implies Theorem 1.2.
Overall, the task of coloring locally sparse hypergraphs is inherently related to the average-case complexity of coloring. In particular, in this section we showed that Theorem 1.1 implies a robust algorithm for hypergraph coloring, namely a deterministic procedure that applies to worst-case -uniform hypergraphs, while at the same using a number of colors that is only a -factor away from the algorithmic barrier for random instances (matching it for ). We remark that this application is inspired by recent results that study the connection between local sparsity and efficient randomized algorithms for coloring sparse regular random graphs [26, 2, 10].
1.2 Technical overview
The intuition behind the proof of Theorem 1.1 comes from the following observation, which we explain in terms of graph coloring for simplicity. Let be a triangle-free graph of degree , and assume that each of its vertices is assigned an arbitrary list of colors. Fix a vertex of , and consider the random experiment in which the neighborhood of is properly list-colored randomly. Since contains no triangles, this amounts to assigning to each neighbor of a color from its list randomly and independently. Assuming that , the expected number of available colors for , i.e., the colors from the list of that do not appear in any of its neighbors, is at least . In fact, a simple concentration argument reveals that the number of available colors for in the end of this experiment is at least with probability that goes to as grows. To put it differently, as long as , the vast majority of valid ways to list-color the neighborhood of “leaves enough room” to color without creating any monochromatic edges.
A completely analogous observation regarding the ways to properly color the neighborhood of a vertex can be made for -uniform hypergraphs. In order to exploit it we employ the so-called semi-random method, which is the main tool behind some of the strongest graph coloring results, e.g., [15, 16, 17, 18, 20, 27, 32], including the one of Kim [21]. (See also the very recent survey [19] of Kang et. al. on the subject.) The idea is to gradually color the hypergraph in iterations until we reach a point where we can finish the coloring with a simple, e.g., greedy, algorithm. In its most basic form, each iteration consists of the following simple procedure (using graph vertex coloring as a canonical example): Assign to each vertex a color chosen uniformly at random; then uncolor any vertex that receives the same color as one of its neighbors. Using the Lovász Local Lemma [11] and concentration inequalities, one typically shows that, with positive probability, the resulting partial coloring has useful properties that allow for the continuation of the argument in the next iteration. (In fact, using the Moser-Tardos algorithm [29] this approach yields efficient, and often times deterministic [9], algorithms.) Specifically, one keeps track of certain parameters of the current partial coloring and makes sure that, in each iteration, these parameters evolve almost as if the coloring was totally random. For example, recalling the heuristic experiment of the previous paragraph, one of the parameters we would like to keep track of in our case is a lower bound on the number of available colors of each vertex in the hypergraph: If this parameter evolves “randomly” throughout the process, then the vertices that remain uncolored in the end are guaranteed to have a non-trivial number of available colors.
Applications of the semi-random method tend to be technically intense and this is even more so in our case, where we have to deal with constraints of large arity. Large constraints introduce several difficulties, but the most important one is that our algorithm has to control many parameters that interact with each other. Roughly, in order to guarantee the properties that allow for the continuation of the argument in the next iteration, for each uncolored vertex , each color in the list of , and each integer , we should keep track of a lower bound on the number of adjacent to hyperedges that have uncolored vertices and vertices colored . Clearly, these parameters are not independent of each other throughout the process, and so the main challenge is to design and analyze a coloring procedure in which all of them, simultaneously, evolve essentially randomly.
1.3 Organization of the paper
2 Background and preliminaries
In this section we give some background on the technical tools that we will use in our proofs.
2.1 The Lovász Local Lemma
As we have already mentioned, one of the key tools we will use in our proof is the Lovász Local Lemma [11].
Theorem 2.1.
Consider a set of (bad) events. For each , let be such that for every . If there is a function satisfying
(2) |
then the probability that none of the events in occurs is at least .
In particular, we will need the following two corollaries of Theorem 2.1. For their proofs, the reader is referred to Chapter 19 in [28].
Corollary 2.2.
Consider a set of (bad) events. For each , let be such that for every . If for every :
-
(a)
;
-
(b)
,
then the probability that none of the events in occurs is strictly positive.
Corollary 2.3.
Consider a set of (bad) events such that for each :
-
(a)
;
-
(b)
is mutually independent of a set of all but at most of the other events.
If then with positive probability, none of the events in occur.
2.2 Talagrand’s inequality
We will also need the following version of Talagrand’s inequality [30] whose proof can be found in Chapter 20 of [28].
Theorem 2.4.
Let be a non-negative random variable, not identically , which is determined by independent trials , and satisfying the following for some :
-
1.
changing the outcome of any trial can affect by at most , and
-
2.
for any , if then there is a set of at most trials whose outcomes certify that ,
then for any ,
3 List-coloring high-girth hypergraphs
In this section we describe the algorithm of Theorem 1.1. As we already explained, our approach is based on the semi-random method. For an excellent exposition both of the method and Kim’s result the reader is referred to [28].
We assume without loss of generality that . Also, it will be convenient to define the parameter , so that the list of each vertex initially has at least colors, and assume that . (The case is Kim’s result.)
We analyze each iteration of our procedure using a probability distribution over the set of (possibly improper) colorings of the uncolored vertices of where, additionally, each vertex is either activated or deactivated. We call a pair of coloring and activation bits assignments for the uncolored vertices of hypergraph a state.
Let denote the set of uncolored vertices in the beginning of the -th iteration. (Initially, all vertices are uncolored.) Also, for each we denote by the list of colors of in the beginning of the -th iteration. Finally, we say that a color is available for in a state if assigning to does not cause any hyperedge whose initially uncolored vertices are all activated in to be monochromatic.
For each vertex , color and iteration , we define a few quantities of interest that our algorithm will attempt to control. Let be the size of . Further, for each , let denote the set of hyperedges that contain and (i) exactly vertices are uncolored and for every ; (ii) the rest vertices of other than are colored . We define .
As it is common in the applications of the semi-random method, we will not attempt to keep track of the values of and , , for every vertex and color , but rather we will focus on their extreme values. In particular, we will define appropriate such that for each the following property holds in the beginning of iteration :
Property P(i): For each vertex , color and :
As a matter of fact, it would be helpful for our analysis (though not necessary) if the inequalities defined in were actually tight. Given that holds, we can always enforce this stronger property in a straightforward way as follows. First, for each vertex such that we choose arbitrarily colors from its list and remove them. Then, for each vertex and color such that we add to the hypergraph new hyperedges of size that contain and new “dummy” vertices. (As it will be evident from the proof, we can always assume that are integers, since our analysis is robust to replacing with and with .) We assign each dummy vertex a list of colors: of them are new and do not appear in the list of any other vertex, and the last one is .
Remark 3.1.
Dummy vertices are only useful for the purposes of our analysis and can be removed at the end of the iteration. Indeed, one could use the technique of “equalizing coin flips” instead. For more details see e.g., [28].
Overall, without loss of generality, at each iteration our goal will be to guarantee that Property holds assuming Property .
Property Q(i): For each vertex , color and :
An iteration.
For the -th iteration we apply the Local Lemma with respect to the probability distribution induced by assigning to each vertex a color chosen uniformly at random from and activating with probability , where . That is, we apply the Moser-Tardos algorithm in the space induced by the variables corresponding to the color and activation bit of each variable in . (We will define the family of bad events for each iteration shortly.)
When the execution of the Moser-Tardos algorithm terminates, we uncolor some of the vertices in in order to get a new partial coloring. In particular, the partial coloring of the hypergraph, set , and the lists of colors for each uncolored vertex in the beginning of iteration are induced as follows. Let be the output state of the application of the Moser-Tardos algorithm in the -th iteration. The list of each vertex , , is induced from by removing every non-available color for in . We obtain the partial coloring for the hypergraph and set for the beginning of iteration by removing the color from every vertex which is either deactivated or is assigned a non-available for it color in .
Overall, the -th iteration of our algorithm can be described at a high-level as follows.
-
1.
Apply the Moser-Tardos algorithm to the probability space induced by assigning to each vertex a color chosen uniformly at random from , and activating with probability .
-
2.
Let be the output state of the Moser-Tardos algorithm.
-
3.
For each vertex , remove any non-available color in to get a list .
-
4.
Uncolor every vertex that has either received a non-available color or is deactivated in , to get a new partial coloring .
Controlling the parameters of interest.
Next we describe the recursive definitions for and which, as we already explained, will determine the behavior of the parameters and , respectively.
Initially, , and for every . Letting
(3) |
we define
(4) | |||||
(5) | |||||
The key lemmas.
We are almost ready to state the main lemmas which guarantee that our procedure eventually reaches a partial list-coloring of with favorable properties that allow us to extend it to a full list-coloring. Before doing so, we need to settle a subtle issue that has to do with the fact that is not sufficiently concentrated around its expectation. To see this, notice for example that drops to zero if is assigned . (Similarly, for , if is assigned then can be affected by a large amount.) To deal with this problem we will focus instead on variable , i.e., the number of hyperedges that contain and (i) exactly vertices of are colored in the end of iteration ; (ii) the rest vertices of did not retain their color during iteration and, further, would be available for them if we ignored the color assigned to . Observe that if is not assigned to then and otherwise.
The first lemma that we prove estimates the expected value of the parameters at the end of the -th iteration. Its proof can be found in Section 4.
Lemma 3.1.
Let and . If holds and for all , then, for every vertex and color :
-
(a)
;
-
(b)
The next step is to prove strong concentration around the mean for our random variables per the following lemma. Its proof can be found in Section 4.
Lemma 3.2.
If holds and , , then for every vertex and color ,
-
(a)
;
-
(b)
.
Armed with Lemmas 3.1, 3.2, a straightforward application of the symmetric Local Lemma, i.e., Corollary 2.3, reveals the following.
Lemma 3.3.
With positive probability, holds for every such that for all and .
In analyzing the recursive equations (4), (5), it would be helpful if we could ignore the “error terms”. The next lemma shows that this is indeed possible. Its proof can be found in Section 4.
Lemma 3.4.
Define , for , and recursively define
(6) | |||||
(7) | |||||
If for all , , for every , and , then
-
(a)
;
-
(b)
.
Remark 3.2.
Note that in Lemma 3.4 is still defined in terms of and not . Note also that in the definition of , the second summand is a function of , , and not .
Lemma 3.5.
There exists such that
-
(a)
For all , and ;
-
(b)
, for every and .
Remark 3.3.
Lemma 3.6.
Let be the integer promised by Lemma 3.5, and assume property holds for the partial coloring in the beginning of the -th iteration. Given , we can find a full list-coloring of in expected polynomial time in the number of vertices of . Also, if is assumed to be constant, then such a coloring can be constructed deterministically in polynomial time.
Proof of Theorem 1.1.
We carry out iterations of our procedure. If fails to hold for any iteration , then we halt. By Lemmas 3.3 and 3.5, (and, therefore, ) holds with positive probability for each iteration and so it is possible to perform iterations. Further, since the application of the LLL in the proof of Lemma 3.3 is within the scope of the so-called variable setting, i.e., the setting considered by [29], the Moser-Tardos algorithm applies and terminates in expected polynomial time. (If is assumed to be constant the deterministic version of the Moser-Tardos algorithm also applies and terminates in polynomial time.) Thus, we can perform (successful) iterations in polynomial time.
After iterations we apply the algorithm of Lemma 3.6 and complete the list-coloring of the input hypergraph.
∎
3.1 Proof of Lemma 3.6
Let denote the set of uncolored vertices in , and the subset of that belongs to a hyperedge . Our goal is to color the vertices in to get a proper list-coloring.
Towards that end, let denote the list of colors for in , and the set of hyperedges (of size ) with uncolored vertices in whose vertices “compete” for with , and recall the conclusion of Lemma 3.5. Let be the probability distribution induced by giving each vertex a color from uniformly at random. For every hyperedge and color such that (i) ; and (ii) for every vertex in , we define to be the event that all vertices of are colored . Let be the family of these (bad) events, and observe that any elementary event (list-coloring) that does not belong in their union is a proper. In other words, if we avoid these bad events we have found a proper list-coloring of the hypergraph. Moreover, for every :
for large enough , since .
Define
and observe that is mutually independent of the events in . The existential claim of Lemma 3.6 follows from Corollary 2.2 as, for every :
(8) | |||||
(9) | |||||
(10) |
concluding the proof. Note that in (8) we used the facts that every hyperedge has at most vertices and , and in (9) we used the fact that .
As for the algorithmic claim of the lemma, since we apply the LLL in the variable setting the Moser-Tardos algorithm applies and it terminates in expected polynomial time. Further, if is assumed to be constant then its deterministic version also applies and terminates in polynomial time, concluding the proof of the lemma.
4 Hypergraph list-coloring proofs
We start by showing a couple of important technical lemmas that will be helpful for these proofs. It will be convenient to define , for every .
Lemma 4.1.
If for all , then
The proof of Lemma 4.1 can be found in Appendix A. A straightforward corollary of Lemma 4.1 is the following.
Corollary 4.2.
If and , then
Proof.
The lower bound follows directly from (59). The upper bound follows from our assumption that which implies that
for sufficiently large . ∎
The proof of the following lemma can also be found in Appendix A.
Lemma 4.3.
If for all , then for every :
4.1 Proof of Lemma 3.1
Proof of part (a).
For every color ,
(11) |
where for the second equality we used our assumption that holds. Therefore, the proof of the first part of the lemma follows from the linearity of expectation.
∎
Proof of part (b) .
Recall the definition of and note that only hyperedges in can be potentially counted by . In particular, unless every uncolored vertex of an edge , , is assigned the same color with in iteration , then if is counted by , it is also counted by . Therefore,
(12) |
and so we focus on bounding .
Fix , where . Our goal will be to show that
(13) |
since combining (4.1) with (12) implies the lemma. To see this, observe that
(14) |
Note that in deriving the second part of the inequality in (14) we first used that for every in order to bound by , and then that for every . Therefore,
(16) | |||||
for sufficiently large . In deriving (16) we used (14) and the facts that:
Towards proving (4.1), for any vertex , consider the events
Let also be the event that and other uncolored vertices of receive color . Since we have assumed that our hypergraph is of girth at least , for any neighbor of and the event is mutually independent of all events , conditional on not occurring. Thus, if , , for every vertex , we obtain
(17) |
since .
Now we claim that for any , and sufficiently large ,
(18) |
To see this, notice that conditional on the probability that is activated is , it is assigned with probability at most , and it retains with probability at most
(19) |
Thus,
for sufficiently large , concluding the proof of (18).
Further, we claim that
(20) |
To show (20) we consider three cases. The first case is that is not activated and (notice that these are two independent events). In this case will not retain its color, and observe that
(21) |
using essentially the same calculations as in showing (18). Thus,
(22) |
In the second case we consider the scenario where is activated and is assigned . Clearly then, the probability that and does not retain is zero. Finally, suppose that is activated and is assigned a color . Our goal is to compute for each so that we can sum up these probabilities over all possible along with (22).
For a vertex let denote the event that is activated and assigned . Using this notation we have:
(23) |
and so below we focus on bounding for each the probability that and conditional on that is activated and assigned and did not occur. Note that in deriving (23) we used (21).
We have:
(24) | |||
(25) |
Note that in deriving (24) we use the fact the girth of the hypergraph is at least which, in particular, implies that any two vertices that contain do not have any other vertex in common.
To further bound (25), we consider the probability that every vertex in is activated and assigned , conditional on that and , for any fixed and . We consider two cases depending on whether or not.
We start with the case where . Let be the event that not every vertex in is activated and assigned . Since our hypergraph has girth at least and the color activations and color assignments are independent over different vertices, we have:
(26) |
for sufficiently large , since .
Next we consider the case . The difference here is that the event is not independent of as before. However, notice that since , and , we can only have when . This means that the occurrence of event the event prohibits the occurrence of event event . Therefore, the event is independent of the event and, thus, we have:
(27) |
for sufficiently large .
Proposition 4.4.
For every color :
(28) |
Proof.
Towards proving (28), a helpful observation is the following.
(29) | ||||
(30) |
for sufficiently large . Note that in (29) we used the fact that for any .
Using (26), (27) and (30), we have:
(31) | |||
(32) | |||
(33) |
for sufficiently large . Note that in deriving (31) we used our previous observation that (27) applies only when , and this can potentially happen only when .
∎
Overall, combining (22), (23) and Proposition 4.4 we see that is at most
where recall that . This concludes the proof of (20).
Finally, combining (17), (18) and (20) we obtain
(34) | |||||
(35) | |||||
(36) |
concluding the proof of (4.1), which was our goal. Note that in (34) we used that is bounded below by a constant according to Corollary 4.2 (the lower bound only requires the assumptions of Lemma 4.1). In (35) we used the fact that for . (We only care about since if then as well and, therefore, .) ∎
4.2 Proof of Lemma 3.2
Let denote the binomial random variable that counts the number of successes in Bernoulli trials, where each trial succeeds with probability . We will find the following lemma useful (see, e.g., Exercise 2.12 in [28]) :
Lemma 4.5.
For any we have
Proof of Part (a).
We will use Theorem 2.4 to show that that the number of colors, , which are removed from during iteration is highly concentrated.
Note that changing the assignment to any neighboring vertex of can change by at most , and changing the assignment to any other vertex cannot affect at all.
If , there are at most groups of at most neighbors of , so that each vertex in each group received the same color, and each group corresponds to a different color from . Thus, the color assignments and activation choices of these vertices certify that .
Since, according to Corollary 4.2, , applying Theorem 2.4 with , , , we obtain
for sufficiently large .
Finally, implies that
∎
Proof of Part (b).
Recall the definition of and let . Let denote the number of hyperedges in which (i) contain exactly uncolored vertices other than ; and (ii) the rest of their vertices are assigned in the end of the -th iteration. Let also be the number of these hyperedges which they contain an uncolored vertex such that (i) ; and (ii) would still not be in even if we ignored the color of .
B definition we have . Therefore, by the linearity of expectation, it suffices to show that and are both sufficiently concentrated. This is because
and, therefore, it is sufficient to prove that
(37) | |||||
(38) |
We first focus on . Let denote the number of hyperedges in which (i) contain exactly uncolored vertices other than ; and (ii) the rest of their vertices were activated and assigned (but did not retain their color necessarily). Clearly, . Further, let denote the random variable that counts all the hyperedges counted by , except for those whose uncolored vertices (other than ) were uncolored because they were activated and received the same color as . Finally, let be the number of hyperedges which (i) contain exactly vertices that are activated and received the same color as ; and (ii) the rest of their vertices were activated and assigned .
Observe that . The idea here is that we cannot directly apply Talagrand’s inequality to and so we consider instead.
First, we consider . Since our hypergraph is of girth at least , changing a choice for some vertex of a hyperedge can only affect whether or not the vertices of remain uncolored, and thus affect by at most . Furthermore, changing a choice for a vertex outside the ones that correspond to the hyperedges in can affect at most one vertex of at most one hyperedge in and, therefore, can affect by at most .
We claim now that if , then there exist at most random choices that certify this event. To see this, notice that if a hyperedge is counted by , then for every that remained uncolored, it must be that either was deactivated, or is contained in a hyperedge such that all the vertices in were activated and received the same color as . Moreover, the event that a variable was activated and received can be verified by the outcome of two random choices. So, overall, we can certify that was counted by by using the outcome of at most random choices.
Finally, observe that and, thus, applying Theorem 2.4 with , and we obtain
(39) |
for sufficiently large , since we have assumed that .
As far as is concerned, note that it is stochastically dominated by and recall Lemma 4.1. Since for every , Lemma 4.5 implies that
(40) |
We follow the same approach for . Let be the number of hyperedges counted by and which also contain an uncolored vertex such that (i) ; and (ii) would still not be in even if we ignored the color of . Further, let be the random variable that counts all the hyperedges counted by , except for those whose uncolored vertices were uncolored because they were activated and received the same color as , and observe that .
Moreover, if , then there exist at most random choices that certify this event. To see this, observe that for each hyperedge counted by , we need the output of at most choices to certify that it is counted by , and the output of at most extra random choices to certify that there is a vertex for which , and would still not be in even if we ignored the color of .
Finally, and, therefore, an almost identical argument to the case for implies (38).
∎
4.3 Proof of Lemma 3.3
We use induction on . Property clearly holds, so we assume that property holds and we prove that with property holds with positive probability. Recall our discussion in the previous section in which we argued that we can assume without loss of generality that property holds.
For every and let be the event that and to be the event that . Clearly, if these bad events are avoided, then holds.
Since property holds, we have that . Therefore, by (4) and Lemmas 3.1, 3.2 we have:
(41) |
Similarly, by (5) and Lemmas 3.1, 3.2 we have:
(42) | ||||
(43) |
where is the hidden constant that multiplies in the statement of Lemma 3.1, Part (b). Note that in deriving (42) we used Lemma 4.1 — which implies that — and our assumptions that — which implies that .
4.4 Proof of Lemma 3.4
Since , for the first part of the lemma it suffices to prove that . Towards that end, at first we observe that for sufficiently large , Corollary 4.2 and the fact that imply:
(44) | |||||
Note that in deriving the first inequality we used the fact that the function is decreasing on the interval since is sufficiently small. For the second one, we used the Taylor Series for around .
We now proceed by using induction. The base case is trivial, so assume that the statement is true for , and consider . Since by our assumption we obtain
(47) |
for sufficiently large . Note that in deriving (4.4) we used the inductive hypothesis; for (4.4) we used (44); and for (47) the fact that and the inductive hypothesis.
The proof of the second part of the lemma is conceptually similar, but more technical, so we present its proof in Appendix A.
4.5 Proof of Lemma 3.5
We proceed by induction. Let . We will assume that for all , and prove that . Towards that end, it will be useful to focus on the family of ratios , ]. Note that, according to Lemma 3.4, this family is well-approximated by the family , . In particular, recalling Lemma 4.3 and applying Lemma 3.4 we obtain:
(48) | |||||
for sufficiently large , since .
Using (48) and the fact that for we can get an improved lower bound for as follows.
(49) | |||||
for sufficiently large .
Using (50) we can now bound as follows.
(51) |
for sufficiently large . Thus, never gets too small for the purposes of our analysis. Lemma 3.4 implies that neither does .
The proof is concluded by observing that (48) implies that , , becomes smaller than for .
5 A sufficient pseudo-random property for coloring
In this section we present the proof of Theorem 1.5. To do so, we build on ideas of Alon, Krivelevich and Sudakov [6] and show that the random hypergraph asymptotically almost surely admits a few useful features.
The first lemma we prove states that all subgraphs of with not too many vertices are sparse and, therefore, of small degeneracy.
Lemma 5.1.
For every constant , there exists such that for any constant , the random hypergraph has the following property asymptotically almost surely: Every vertices of span fewer than hyperedges. Therefore, any subhypergraph of induced by a subset of size , is -degenerate.
Proof.
Given the statement about the sparsity of any subhypergraph of incudced by a set of vertices, the claim about its degeneracy follows from the fact that its average (and, therefore its minimum) degree is at most
So we are left with proving the statement regarding sparsity.
Letting , we see that the probability that there exists a subset which violates the statement of the lemma is at most
(52) | |||||
(53) | |||||
(54) |
for sufficiently large . Note that in the lefthand side of (52) we used the fact that any subset of vertices of size cannot violate the assertion of the lemma, since it can span at most hyperedges. In deriving (54) we used the fact that, for sufficiently large , every summand in (53) is at most , and there exist at most summands. Throughout our derivation we used that for any pair of positive integers , we have .
∎
Next we show that, that for any constant , the number of vertices of that have degree essentially behaves as a Poisson random variable with mean .
Lemma 5.2.
For constants , and sufficiently large, let denote the number of vertices of degree in . Then, asymptotically almost surely,
Proof.
The lemma follows from standard ideas for estimation of the degree distribution of random graphs (see for example the proof of Theorem 3.3 in [12] for the case ). In particular, assume that the vertices of are labeled . Then,
(56) |
Note that in the first inequality above we used the fact that for every any pair of positive integers , we have .
To show concentration of around its expectation, we will use Chebyshev’s inequality. In order to do so, we need to estimate the second moment of and, in particular, . Letting and be a sufficiently large constant, we see that equals
(57) | ||||
(58) |
Note that in deriving (57) we used that for any
for large enough . Moreover, in deriving (58) we use that for every
and that:
Therefore, letting denote the indicator random variable which equals if vertex has degree and otherwise,
for some constant . Note that in deriving (62) we used (58) and that according to (56).
Finally, applying the Chebyshev’s inequality, we obtain that, for any ,
and, thus, the proof is concluded by choosing .
∎
Lemma 5.2 implies the following useful corollary.
Corollary 5.3.
For any constants , let denote the random variable equal to the number of vertices in whose degree is in . There exists a constant such that if then, asymptotically almost surely,
Proof.
Let denote the number of vertices of degree in . Since are constants, using Lemma 5.2 and Stirling’s approximation we see that, asymptotically almost surely,
for sufficiently large and . ∎
Using Lemma 5.1 and Corollary 5.3 we show that, asymptotically almost surely, only a small fraction of vertices of have degree that significantly exceeds its average degree.
Lemma 5.4.
For every constants and , there exists such that for any constant , all but at most vertices of the random hypergraph have degree at most , asymptotically almost surely.
Proof.
Corollary 5.3 implies that the number of vertices with degree in the interval is at most , for sufficiently large .
Suppose now there are more than vertices with degree at least . Denote by a set containing exactly such vertices. According to Lemma 5.1, asymptotically almost surely, the induced subhypergraph has at most
hyperedges. Therefore, the number of hyperedges between the sets of vertices and is at least
for sufficiently large . However, the probability that contains such a subhypergraph is at most
for sufficiently large . Note that in deriving the final equality we used that for any pair of integers , we have that . Therefore, asymptotically almost surely there are at most vertices in with degree greater than , concluding the proof. ∎
Finally, we show that the neighborhood of a typical vertex of is locally tree-like.
Lemma 5.5.
For every constants , asymptotically almost surely, the random hypergraph has a subset of size at most such that the induced hypergraph is of girth at least .
Proof.
Let , denote the number of -, - and -cycles in , respectively. A straightforward calculation reveals that for :
By Markov’s inequality this implies that asymptotically almost surely. Denote by the union of all -, - and - cycles in . Then the induced subhypergraph has girth at least and, asymptotically almost surely, .
∎
We are now ready to prove Theorem 1.5.
Proof of Theorem 1.5.
Our goal will be to find a subset of size that (i) contains all cycles of length at most and every vertex of degree more than ; and (ii) such that, every vertex in has at most neighbors in . Note that in this case, according to Lemma 5.1, is -degenerate, concluding the proof assuming is sufficiently large. A similar idea has been used in [5, 6, 25].
Towards that end, let be the set of vertices of degree more than , and the set of vertices that are contained in a -,- or a -cycle. Notice that can be found in polynomial time and, according to Lemmas 5.4 and 5.5, the size of is at most for sufficiently large and .
We now start with and as long as there exists a vertex having at least neighbors in we do the following. Let be the neighbors of in . We choose an arbitrary hyperedge that contains and and update and by defining and . We keep repeating this operation until is empty.
This process terminates with because, otherwise, we would get a subset of size spanning more than
hyperedges, for sufficiently large . According to Lemma 5.1 however, does not contain any such set asymptotically almost surely.
∎
6 Acknowledgements
The author is grateful to Dimitris Achlioptas, Irit Dinur and anonymous reviewers for detailed comments and feedback.
References
- [1] Dimitris Achlioptas and Amin Coja-Oghlan. Algorithmic barriers from phase transitions. In 49th Annual IEEE Symposium on Foundations of Computer Science, FOCS 2008, October 25-28, 2008, Philadelphia, PA, USA, pages 793–802. IEEE Computer Society, 2008.
- [2] Dimitris Achlioptas, Fotis Iliopoulos, and Alistair Sinclair. Beyond the lovász local lemma: Point to set correlations and their algorithmic applications. In David Zuckerman, editor, 60th IEEE Annual Symposium on Foundations of Computer Science, FOCS 2019, Baltimore, Maryland, USA, November 9-12, 2019, pages 725–744. IEEE Computer Society, 2019.
- [3] Dimitris Achlioptas and Michael Molloy. The analysis of a list-coloring algorithm on a random graph. In Proceedings 38th Annual Symposium on Foundations of Computer Science, pages 204–212. IEEE, 1997.
- [4] Miklós Ajtai, János Komlós, Janos Pintz, Joel Spencer, and Endre Szemerédi. Extremal uncrowded hypergraphs. Journal of Combinatorial Theory, Series A, 32(3):321–335, 1982.
- [5] Noga Alon and Michael Krivelevich. The concentration of the chromatic number of random graphs. Combinatorica, 17(3):303–313, 1997.
- [6] Noga Alon, Michael Krivelevich, and Benny Sudakov. List coloring of random and pseudo-random graphs. Combinatorica, 19(4):453–472, 1999.
- [7] Peter Ayre, Amin Coja-Oghlan, and Catherine Greenhill. Hypergraph coloring up to condensation. Random Structures & Algorithms, 54(4):615–652, 2019.
- [8] Tom Bohman, Alan Frieze, and Dhruv Mubayi. Coloring h-free hypergraphs. Random Structures & Algorithms, 36(1):11–25, 2010.
- [9] Karthekeyan Chandrasekaran, Navin Goyal, and Bernhard Haeupler. Deterministic algorithms for the Lovász local lemma. SIAM J. Comput., 42(6):2132–2155, 2013.
- [10] Ewan Davies, Ross J Kang, François Pirot, and Jean-Sébastien Sereni. An algorithmic framework for coloring locally sparse graphs. arXiv preprint arXiv:2004.07151, 2020.
- [11] Paul Erdős and László Lovász. Problems and results on -chromatic hypergraphs and some related questions. In Infinite and finite sets (Colloq., Keszthely, 1973; dedicated to P. Erdős on his 60th birthday), Vol. II, pages 609–627. Colloq. Math. Soc. János Bolyai, Vol. 10. North-Holland, Amsterdam, 1975.
- [12] Alan Frieze and Michał Karoński. Introduction to random graphs. Cambridge University Press, 2016.
- [13] Alan Frieze and Dhruv Mubayi. Coloring simple hypergraphs. Journal of Combinatorial Theory, Series B, 103(6):767–794, 2013.
- [14] Marylou Gabrié, Varsha Dani, Guilhem Semerjian, and Lenka Zdeborová. Phase transitions in the q-coloring of random hypergraphs. Journal of Physics A: Mathematical and Theoretical, 50(50):505002, 2017.
- [15] A. Johansson. Asympotic choice number for triangle free graphs., 1996.
- [16] A. Johansson. The choice number of sparse graphs. Unpublished manuscript, 1996.
- [17] Jeff Kahn. Asymptotics of the chromatic index for multigraphs. Journal of Combinatorial Theory, Series B, 68(2):233–254, 1996.
- [18] Jeff Kahn. Asymptotics of the list-chromatic index for multigraphs. Random Structures & Algorithms, 17(2):117–156, 2000.
- [19] Dong Yeap Kang, Tom Kelly, Daniela Kühn, Abhishek Methuku, and Deryk Osthus. Graph and hypergraph colouring via nibble methods: A survey. arXiv preprint arXiv:2106.13733, 2021.
- [20] Dong Yeap Kang, Tom Kelly, Daniela Kühn, Abhishek Methuku, and Deryk Osthus. A proof of the erdh o s-faber-lov’asz conjecture. arXiv preprint arXiv:2101.04698, 2021.
- [21] Jeong Han Kim. On Brooks’ theorem for sparse graphs. Combinatorics, Probability and Computing, 4(2):97–132, 1995.
- [22] János Komlós, János Pintz, and Endre Szemerédi. A lower bound for heilbronn’s problem. Journal of the London Mathematical Society, 2(1):13–24, 1982.
- [23] Alexandr Kostochka, Dhruv Mubayi, Vojtěch Rödl, and Prasad Tetali. On the chromatic number of set systems. Random Structures & Algorithms, 19(2):87–98, 2001.
- [24] Hanno Lefmann. Sparse parity-check matrices over GF (q). Combinatorics, Probability and Computing, 14(1-2):147–169, 2005.
- [25] Tomasz Łuczak. The chromatic number of random graphs. Combinatorica, 11(1):45–54, 1991.
- [26] Michael Molloy. The list chromatic number of graphs with small clique number. Journal of Combinatorial Theory, Series B, 134:264–284, 2019.
- [27] Michael Molloy and Bruce Reed. A bound on the total chromatic number. Combinatorica, 18(2):241–280, 1998.
- [28] Michael Molloy and Bruce Reed. Graph colouring and the probabilistic method, volume 23 of Algorithms and Combinatorics. Springer-Verlag, Berlin, 2002.
- [29] Robin A. Moser and Gábor Tardos. A constructive proof of the general Lovász local lemma. J. ACM, 57(2):Art. 11, 15, 2010.
- [30] Michel Talagrand. Concentration of measure and isoperimetric inequalities in product spaces. Publications Mathématiques de l’Institut des Hautes Etudes Scientifiques, 81(1):73–205, 1995.
- [31] Van H Vu. On the choice number of random hypergraphs. Combinatorics Probability and Computing, 9(1):79–95, 2000.
- [32] Van H Vu. A general upper bound on the list chromatic number of locally sparse graphs. Combinatorics, Probability and Computing, 11(1):103–111, 2002.
- [33] Lenka Zdeborová and Florent Krzakala. Phase transitions in the coloring of random graphs. Physical Review E, 76(3):031131, 2007.
Appendix A Proofs omitted from Section 4
A.1 Proof of Lemma 4.1
We proceed by induction. The case is straightforward to verify since for every , while . Therefore, we inductively assume the claim for , and consider the case . Note that the inductive hypothesis implies that since for every and, thus,
(59) |
for sufficiently large . Recalling (4) and (5), we have:
(62) | |||||
(63) |
for sufficiently large , concluding the proof. Note that in deriving (A.1) we used the inductive hypothesis and that to obtain:
In deriving (A.1) we used the facts that , . In deriving (63) we used the fact that and, therefore, the second term in (62) is a negative constant, the inductive hypothesis, and the that , and .
A.2 Proof of Lemma 4.3
We proceed by induction. The base cases are easy to verify since for every and .
We first focus on the case . We assume that the claim is true for and consider . Note that the inductive hypothesis, the facts that for every and , imply:
(64) |
for sufficiently large , for every and . Therefore, recalling (6), (7) and using (64), we have:
for sufficiently large , concluding the proof for the case .
We now focus on . We first observe that
concluding the proof of the base cases.
Assume that the claim holds for all pairs , where and . It suffices to prove that it also holds for any pair , where and . To see this, observe that
(65) | |||||
(66) | |||||
(69) | |||||
(70) | |||||
(71) | |||||
(72) |
for sufficiently large , concluding the proof. Note that in order to get (65) we upper bound in the same way we upper bounded . We keep using the same steps to bound until we get (69). In deriving (69) we used that for every . In going from (69) to (70) we start the summation in the last term from instead from in order to subsume the error term. Finally, in going from (71) to (72) we multiply the term that corresponds to in the summation by in order to subsume the terms of the summation that correspond to .
A.3 Proof of Lemma 3.4, Part(b)
We observe that it suffices to show that (since for every by definition) and proceed by using induction. Again, the base case is trivial, so we assume the statement is true for , and consider . We obtain the following.
(73) | |||||
(75) | |||||
(76) |
for sufficiently large , concluding the proof of the lemma. Note that in deriving (75) we used the first part of Lemma 3.4, i.e., the fact that to obtain
and the fact that
for sufficiently large . In deriving (75) we used Lemma 4.1 to obtain that
and, similarly, that
We also used the fact that, using Corollary 4.2, we obtain
for sufficiently large . Finally, to derive (76) we observe that:
(77) |
where in deriving (77) we used our assumption that and the inductive hypothesis according to which are within a constant factor (arbitrarily close to ) of .