Testing Graph Properties with the Container Method
Abstract
We establish nearly optimal sample complexity bounds for testing the -clique property in the dense graph model. Specifically, we show that it is possible to distinguish graphs on vertices that have a -clique from graphs for which at least edges must be added to form a -clique by sampling and inspecting a random subgraph on only vertices.
We also establish new sample complexity bounds for -testing -colorability. In this case, we show that a sampled subgraph on vertices suffices to distinguish -colorable graphs from those for which any -coloring of the vertices causes at least edges to be monochromatic.
The new bounds for testing the -clique and -colorability properties are both obtained via new extensions of the graph container method. This method has been an effective tool for tackling various problems in graph theory and combinatorics. Our results demonstrate that it is also a powerful tool for the analysis of property testing algorithms.
1 Introduction
Is it possible to test if a graph contains a large clique or if it is -colorable while examining only a small fraction of that graph? These problems are two of the foundational examples in graph property testing that were originally considered in the groundbreaking work of Goldreich, Goldwasser, and Ron [GGR98]. The problems can be specified formally in the dense graph property testing framework defined as follows.
A (simple undirected) graph on vertices is -far from a graph property if we need to add or remove at least edges from to obtain a graph that does have the property . A canonical -tester for with sample cost is a bounded-error111By bounded error we mean there exist absolute constants such that if has property then the algorithm accepts with probability at least and if is -far from then the algorithm accepts with probability at most randomized algorithm that samples a set of vertices of some unknown graph , examines the induced subgraph , and based only on this local view of the graph can distinguish between the case where has property and the case where it is -far from . The sample complexity of , denoted , is the minimum value for which there is a canonical -tester for with sample cost .
1.1 Testing Cliques
The -Clique property is the set of all graphs on vertices that contain a clique on vertices. The problem of testing the -Clique property can be considered in both the large clique regime, where is a constant, and in the small clique regime, where is a function of the number of vertices of the graph. In the small clique regime, it is important to note that the problem is non-trivial only when is also a function of , since we can always add at most edges to any graph to form a -clique.
What is the sample complexity for -testing the -Clique property? Goldreich, Goldwasser, and Ron [GGR98] were the first to consider this question and showed that . This result has remarkable qualitative implications in both the large clique and the small clique regimes. In the large clique regime, it shows that -testing -cliques requires only inspection of a constant-sized subgraph when is an absolute constant. And in the small clique regime, it shows that the sample complexity for -testing -clique property is sublinear in for all when —i.e., when the tester must distinguish graphs with -cliques from those whose -subgraphs all have constant density bounded away from .
Improved bounds on the sample complexity for testing cliques were obtained by Feige, Langberg, and Schechtman [FLS04]. With a new direct analysis of the canonical tester for testing cliques, they showed that . And by considering the restricted problem of distinguishing graphs that consist of a single -clique from those that consist of a single -clique, they showed that .
Our first main result is a new upper bound on the sample complexity of testing the -Clique property that is nearly optimal, since it matches the lower bound of Feige, Langberg, and Schechtman up to polylogarithmic factors.
Theorem 1.
The sample complexity of the -Clique property is .222Here and throughout the article, we use and notation to hide terms that are polylogarithmic in the argument. See Sections 3 and 4 for the precise formulation of the main theorems.
The qualitative implications of Theorem 1 are most striking in the small clique regime. In this regime, the result has a more natural representation when we reformulate it using notation from the Densest -Subgraph (DS) problem. (For more details on the DS problem, see [Man17] and the references therein.)
Theorem 1’ (Alternative formulation).
For every , , and , there is a bounded-error randomized algorithm that distinguishes (i) graphs that contain a -clique from (ii) graphs whose -subgraphs all contain at most edges by inspecting the induced subgraph of the input graph on a random set of only vertices.
This result implies that for every constant and every , we can distinguish graphs with a -clique from graphs whose -subgraphs have density at most by examining only a sublinear portion of the graph. The previous bounds in [GGR98] and [FLS04] showed that sublinear sample complexity is achievable for constant when and , respectively.
This result can also be viewed as a generalization of results of Rácz and Schiffer [RS19] and of Huleihel, Mazumdar, and Pal [HMP21] regarding the sample complexity of the Planted -Clique Detection problem. The latter result shows that sampling an induced subgraph on vertices suffices to distinguish (i) random graphs with edge probability from (ii) random graphs with edge probability containing a planted -clique. The former result shows a similar bound for the restricted case of When is a constant, the bound in Theorem 1’ shows that the same sample complexity (up to logarithmic factors) suffices to solve the more general perfect completeness decision version of the Densest -Subgraph problem as defined in the theorem statement.
Moreover, Theorem 1’ also gives a nearly optimal upper bound on the query complexity of algorithms that solve the Densest -Subgraph problem in the regime where is constant. In this setting, the algorithm is free to query any vertex pair (instead of selecting a set of vertices and querying all the pairs of vertices in to determine exactly), and is free to adaptively select which pairs of vertices to query based on the results of its earlier queries. The bound in the theorem implies that there is a bounded-error algorithm that distinguishes graphs with -cliques from those whose subgraphs have at most edges and has query complexity at most . However, as shown in [RS19, HMP21], at least queries are required to solve the easier -Planted Clique problem, even for adaptive algorithms.
1.2 Testing Colorability
The -Colorable property is the set of all graphs on vertices that are -colorable. The case where corresponds to bipartiteness. For testing bipartiteness, nearly optimal bounds [GGR98, AK02] are known. We focus on the case where . We again consider two different regimes: the low chromatic number regime where is a constant, and the polychromatic regime where is a function of . In the polychromatic regime, we note that -testing the -Colorable property is non-trivial only when (and so also depends on ) since we can always eliminate at most edges from a graph to make it -colorable.
The study of the sample complexity of the -Colorable property has a long history that predates the definition of property testing itself. Rödl and Duke [RD85], building on prior work of Bollobás, Erdős, Simonovits, and Szemerédi [BESS78], showed that is a constant independent of when both and are constant. However, this result relies on the regularity lemma and so the bounds obtained on the sample complexity grow as a tower of height polynomial in . Notably, this means that these results give only very limited results in the polychromatic regime since they give a sublinear bound on the sample complexity of -colorability only for some that are iterated logarithm functions of .
Goldreich, Goldwasser, and Ron [GGR98] obtained much better bounds for -Colorable. They showed that for all . This result provides a significant improvement in the testability of colorability in the polychromatic regime: it shows that -colorability is testable with a sublinear sample complexity when for a constant whenever .
Alon and Krivelevich [AK02] further improved the bounds for testing -colorability. They showed that for all . This result shows that -colorability can be testable with a sublinear sample complexity for all . Sohler [Soh12] obtained another improvement in the low chromatic number regime, showing that and, in particular, when is constant then the sample complexity for testing -colorability is . But in the polychromatic setting, the bound of Alon and Krivelevich remains stronger whenever and .
Our next main result unifies and improves on both of the incomparable results of Alon and Krivelevich [AK02] and Sohler [Soh12].
Theorem 2.
The -Colorable property has sample complexity .
Theorem 2 implies that sublinear sample complexity suffices to distinguish -colorable graphs from graphs whose -colorings all create at least monochromatic edges for all values of when is constant.
We do not know whether the bound in Theorem 2 is optimal. The best lower bound for testing -colorability in the polychromatic regime is [AK02]. Since -testing -colorability is non-trivial only when , this means that there is a quadratic gap between our new upper bound and this lower bound for testing -colorability for large values of .
1.3 The Graph Container Method
Theorems 1 and 2 are both established by analyzing the natural tester that checks whether the sampled induced subgraph has the property that we are testing or not. In the case of large cliques, this tester accepts with probability at least 1/2,333See the Remark at the end of Section 3.3 for a discussion related to this choice of acceptance probability. and in the case of -colorability, the tester always accepts when the graph has the property. The challenge in the analysis, as is usually the case in property testing, is in showing that the tester rejects all graphs that are far from having the property with sufficiently large probability. We address this challenge with the use of the graph container method.
We first note that the graph container method is stated in terms of independent sets. In the case of testing -cliques, our application of the graph container method applies more naturally to testing -independent sets, which is equivalent by considering the complement of a graph.
Very briefly, the graph container method builds upon the observation that though a graph may contain a large number of independent sets, for every graph we can find a much smaller collection of sets of vertices that we call containers such that (i) every independent set is a subset of at least one of the containers, (ii) the containers are small, and (iii) for each container , the induced subgraph is sparse. The graph container method was originally introduced by Kleitman and Winston [KW82] to bound the number of square-free graphs. The method has since been used to great effect, notably by Sapozhenko (see [Sap05] and the references therein), and was later extended to the hypergraph container method [BMS15, ST15] that has seen even more wide use throughout combinatorics. The container method has also been recently applied to algorithmic settings for the problem of approximately counting independent sets in bipartite graphs [JPP23] and the problem of obtaining faster exact algorithms for almost-regular graphs [Zam23].
Our use of the graph container method builds on Kleitman and Winston’s original approach to it. The key component of the method as they introduced it is a simple greedy algorithm for identifying a small set of vertices of an independent set that form its fingerprint. The fingerprint of an independent set in turn uniquely defines the container that contains . Our main technical lemmas provide tight bounds on the size of the containers as a function of the size of the corresponding fingerprints. We describe the algorithm and present the technical lemmas in Section 2.
The tight bounds on the size of containers can then be used to analyze the soundness of testers for -independent sets in the following way. Let be the set of vertices sampled by the tester. Any independent set has a (very small) fingerprint that defines a container with bounded size; we can use this bounded size to show that the probability that more than vertices (including the fingerprint vertices) in are drawn from that container is extremely small. So small, in fact, that we can apply a union bound over all potential fingerprints in to bound the overall probability that the sample includes a large independent set. For all the details of the proof, see Section 3.
For testing -colorability, the overall approach is the same, but a different argument is required to show that for any independent sets in the sample, we can identify fingerprints whose corresponding containers have a small union. See Section 4 for the details.
1.4 Discussion
Theorems 1 and 2 suggest a number of intriguing open problems. We discuss three of them briefly.
Property testing with the container method.
Our results demonstrate that the graph container method provides a useful tool to study the problems of testing cliques and colorability. In subsequent work, we show that the graph container method can be used to study all 0–1 graph partition properties, as defined by Nakar and Ron [NR18]. It would be interesting to see if the method is useful for testing other classes of graph properties as well. For example, the class of 0–1 graph partition properties does not include the property of containing one, or multiple, large cliques, and so we suspect there is a larger class of graph partition properties that the container method can be used on.
We would be remiss if we didn’t also discuss the hypergraph container method. In the subsequent work mentioned above, we have also been able to show that the hypergraph container method can also be used to provide tight analysis of testers for various partition properties of hypergraphs. However, one of the most exciting aspects of the hypergraph container method in combinatorics is that it has led to a number of results to problems in combinatorics that at first glance do not appear to relate directly to hypergraphs. Could the hypergraph container method similarly yield a new analysis technique for obtaining new property testing results in other domains as well?
Query complexity.
What exactly is the relation between the query complexity of (adaptive or non-adaptive) testers that are free to query any potential edge in a graph and the query complexity of the canonical tester that samples a set of vertices and queries all potential edges within the corresponding induced subgraph? This question has been studied extensively in the dense graph setting (see, e.g., [AFKS00, GT03, BT04, BL10, GR10, GR11, GW21] and discussions in [Gol17, BY22]) but remains open for many natural properties of graphs.
As discussed above, Theorem 1 gives a partial answer to this question for the case of testing large cliques, since it shows that the number of edge queries performed by the canonical -tester for the -Clique property is (up to logarithmic factors) identical to the query complexity of the best adaptive testing algorithm for the same problem when . But can adaptivity improve query complexity when is smaller? In particular, is it possible to -test the -Clique property with edge queries?
For -colorability, the analogous question has already been posed even in the setting where : Bogdanov and Li [BL10] conjectured that it is possible to -test -colorability (that is, bipartiteness) with queries. Theorem 2 does not shed light on this conjecture (or its analogues for larger values of ) but it is possible that Lemma 6 or a similar variant could be helpful in this conjecture and other related ones.
Time complexity.
Theorem 1’ only bounds the sample complexity of the Densest -Subgraph problem, but the proof of this bound also immediately implies fairly strong results on the time complexity of the problem. Namely, it shows that distinguishing graphs with -cliques from those whose -subgraphs all have density at most for some constant can be done by sampling vertices and checking whether the induced subgraph on those vertices contains a clique of size . Even by using exhaustive search for the latter task, the resulting time complexity is quasipolynomial in —or, more precisely, it is . Stronger time complexity bounds already exist for this problem [FS97], but it would be interesting to see if the graph container method could be used to directly improve the time complexity bounds for various property testing problems and related approximation problems.
2 The Container Method
The notions of fingerprints and containers of independent sets in graphs are defined according to the procedure defined in Algorithm 1, which is a slight variant of the greedy algorithm that was (implicitly) originally introduced by Kleitman and Winston [KW82]. The algorithm works as follows. The first fingerprint of an independent set is simply the vertex contained in with highest degree in Since is an independent set, any neighbour of must not be contained in and, further, by the fact that has the highest degree in of all vertices in any vertex with higher degree in cannot be in The associated container is constructed by removing all vertices from these two collections. The procedure is repeated until every vertex from has been selected to the fingerprint.
The greedy algorithm for generating fingerprints and containers was originally defined with an appropriate stopping condition so that each independent set generates a single fingerprint and container. For our proofs, however, it will be more convenient to work directly with the sequence of fingerprints and their corresponding containers generated by the algorithm for the independent set . Further, it is also convenient to extend the definition with for all We refer to and as the -th fingerprint and -th container of , respectively.
When we consider the fingerprints or containers of multiple independent sets, we write and to denote the -th fingerprint and container of the independent set . (When the current context specifies a single independent set, we will continue to write only and to keep the notation lighter.)
2.1 Basic Properties of Containers
By construction, the sequence of fingerprints and containers of an independent set satisfy a number of useful properties. For instance, we have
(1) |
so that for each , .
The construction of the algorithm also guarantees that the -th container of an independent set is the same as the -th container of its -th fingerprint.
Proposition 3.
For any graph , any independent set in , and any the fingerprint and container of satisfy
Proof.
If then and the equality holds. Now consider the case that If are the vertices selected in the first rounds of the greedy algorithm on input , then the algorithm selects the same vertices and forms the same container when provided with input instead of . ∎
Another basic property of the containers of any independent set is that we can bound their maximum degree in the following way.
Proposition 4.
For any graph on vertices, any independent set in , and any the -th container of has maximum degree bounded by .
Proof.
If then and so the maximum degree is Now, consider any . Since the containers satisfy the containment ,
and there must be an index for which . This means that the vertex chosen in the th round of the greedy algorithm must have degree at most in the induced subgraph . All vertices that had higher degree in the same graph were excluded from , so all vertices have degree at most in . Since , the maximum degree of is bounded above by as well. ∎
2.2 Graph Container Lemmas
In order to use containers in our proofs, we need stronger structural results about containers when the graph is -far from the properties of interest.
In the case of testing the -Clique property, we first change our focus so that we instead look at graphs that are -far from having the -IndepSet property. We show that for each such graph , every independent set in is a subset of a container of size strictly less than . In fact, our result shows more: there is a container for of size where is at least as large as times the length of the corresponding fingerprint (up to log factors).
Lemma 5 (Graph Container Lemma I).
Let be a graph on vertices that is -far from the -IndepSet property. For any independent set in there exists an index such that the size of the -th container of is bounded by
(2) |
and contains at most edges.
Graph container lemmas like the one above are often used to bound the size of a collection of containers that cover all independent sets. Indeed, Lemma 5 also immediately yields such a bound, showing that for every graph that is -far from the -IndepSet property, there is a collection of containers such that every independent set of is a subset of at least one container. However, for our intended application in the proof of Theorem 1, the size of the collection of containers is not relevant; what we need is the bound on the size of the containers in (2) and, in particular, the trade-off in that bound between the size of the container and its fingerprint.
For testing the -Colorable property, we need a different type of Graph Container lemma which bounds the size of the union of containers for different independent sets in a graph.
Lemma 6 (Graph Container Lemma II).
Let be -far from -colorable, and let be independent sets in Then, there exists such that
(3) |
3 Testing Cliques and Independent Sets
We complete the proof of Theorem 1 in this section. The proof we describe applies to both the -Clique and the -IndepSet properties (by considering the complement of the tested graph) but the latter fits in more naturally with the graph container method, so in the rest of the section we discuss the result entirely in terms of independent sets.
3.1 Container Shrinking Lemma
The main tool in the proof of the Graph Container Lemma for testing independent sets, Lemma 5, is a shrinking lemma that, informally speaking, says that many vertices are excluded in each round of Algorithm 1 whenever the containers contain a large subset for which the induced subgraph is sparse.
Lemma 7 (Container Shrinking Lemma).
Let be a graph on vertices that is -far from the -IndepSet property, let be an independent set in , and let be an index for which the -th container of has cardinality . For any , if there exists a subset of vertices where contains at most edges, then
Proof.
If contains a vertex with degree at least in the graph then at least this number of vertices are eliminated since the vertex selected by the greedy algorithm has degree at least as large as that of , otherwise would not have been included in (and so therefore would not be in ).
Consider now the setting where all the vertices in have degree at most . We claim that contains at least edges. This claim suffices to complete the proof of the lemma, since it implies that at least vertices in have degree at least in , and thus that at least this many vertices from are excluded from by Algorithm 1.
We now complete the proof of the claim that contains at least edges.
Let be chosen uniformly at random from all subsets of of size . Since is -far from having independent sets of size , the induced subgraph contains at least edges. By the lemma hypothesis, contains at most edges. Finally, each edge connecting a vertex in to one in is included in with probability . So by the maximum degree of the vertices in in the graph , the expected number of edges between and is at most . Therefore, the expected number of edges in is at least
In other words, the random graph chosen uniformly at random from the subgraphs of of cardinality has expected density at least . This implies that also has density at least . So it contains at least edges, as we wanted to show. ∎
3.2 Proof of Graph Container Lemma I
We are now ready to complete the proof of the Graph Container Lemma for testing independent sets, restated below.
See 5
Proof.
Let denote the first container in the sequence that contains at most edges. The existence of is guaranteed because is an independent set and has no edges. Define such that . The value of is bounded by , where the lower bound on is due to the fact that is -far from the -IndepSet property.
Define to be the largest index for which . First observe that because otherwise is an independent set, which contradicts the fact that is -far from the -IndepSet property. Hence, by Lemma 7 applied to all values of ,
and since , we conclude that .
Consider now values . When contains more than edges, then the vertices in have average degree at least in this graph. This means that at least vertices in have degree at least in , so at least vertices are removed in the -th iteration of Algorithm 1. Since , there can be at most such iterations before we reach . Therefore, the container satisfies the conclusion of the lemma. ∎
3.3 Proof of Theorem 1
We are now ready to use Lemma 5 to complete the proof of Theorem 1. The proof of the theorem also makes use of the following form of Chernoff’s bound for hypergeometric distributions.
Lemma 8 (Chernoff’s Bound).
Let be drawn from the hypergeometric distribution where elements are drawn without replacement from a population of size where elements are marked and represents the number of marked elements that were drawn. Then for any ,
Proof.
A standard multiplicative form of Chernoff’s bound states that for all , the random variable satisfies . (This standard bound is usually stated for sums of independent variables, but the same bound also applies to hypergeometric random variables as well. See, e.g., [Mul18]). Using the identity , we can apply this inequality with the parameter and simplify. ∎
We are now ready to prove Theorem 1, restated below in its precise form (with the polylogarithmic term) and for the -IndepSet property.
Theorem 1 (Precise formulation).
The sample complexity of the -IndepSet property is
Proof.
Let be a random set of vertices drawn uniformly at random from without replacement, where is a large enough constant. Note that for clarity of presentation, in the rest of the proof we ignore all integer rounding issues as they do not affect the asymptotics of the final result.
If contains a independent set, then contains at least vertices from this independent set with probability at least , since the number of such vertices follows a hypergeometric distribution, and the median of this distribution is at least [Neu70, KB80].
In the remainder of the proof we upper bound the probability that contains a -independent set when is -far from containing a -independent set. For the rest of the argument, let us call a container small when its cardinality is bounded by , the expression (2) in the conclusion of Lemma 5.
Let us denote the vertices of the sampled set as . First observe that, by Lemma 5, for every independent set of size in , there is a and a fingerprint of size that defines a small container such that Hence, the probability that contains an independent set of size at least is at most the probability that there exists some such that contains a set of vertices that form the fingerprint of a small container and contains at least other vertices from We now upper bound this probability.
For any distinct indices , consider the event where the vertices form the fingerprint of a small container . Let denote the number of vertices among the other sampled vertices that belong to . By the bound on the size of small containers, the expected value of is
where the last inequality uses the fact that for a large enough constant and that the problem is non-trivial only when
By the Chernoff bound in Lemma 8, the probability that we draw at least vertices from in the final vertices drawn to form is
Therefore, by applying a union bound over all values and all possible choices of indices from , the probability that any set of at most vertices in form the fingerprint of a small container from which we sample at least vertices in is at most
where the first inequality uses the upper bound and the last inequality again uses the fact that for a large enough constant . Hence, the probability that contains an independent set of size at least is less than ∎
Remark.
The completeness guarantee in the proof of Theorem 1 states that the algorithm correctly accepts graphs that have a -independent set with probability at least , instead of the usual bound typically used in property testing. Since the soundness guarantee is that the algorithm accepts graphs that are -far from the -IndepSet property with probability at most , these guarantees still enable us to apply standard error reduction techniques to obtain algorithms that err with probability at most with a multiplicative factor. In particular, it shows that there is also a standard tester with completeness guarantee with the same asymptotic sample complexity.
Alternatively, it is also possible to obtain the completeness guarantee directly by changing the algorithm slightly to check if the induced subgraph has the -IndepSet property for some appropriate gap parameter .
4 Testing k-Colorability
In this section we prove Theorem 2. We start by proving a graph container lemma in Section 4.1, and complete the proof in Section 4.2.
4.1 Proof of Graph Container Lemma II
In this section we prove the main lemma that balances the trade off between the size of the union and the sum of the fingerprints sizes of a set of containers, restated from Section 2.2.
See 6
Proof.
Assume on the contrary that for all ,
(4) |
We will obtain a contradiction by constructing a -partition of the vertices which shows that is not -far from -colorable.
Initialize the sets Remove vertices from until every vertex appears in one set so that forms a partition of By Proposition 4, each vertex has degree at most in it’s set, and so the sum of edges contained in the sets is at most
In the remainder of the proof, we allocate the remaining vertices in to one of the parts and argue that the number of additional edges is small.
For every let be the set of vertices that are contained in at least one container at the -th step, but not contained in any container at step step of the container procedure. In other words, let
Observe that partitions the vertices in and so to complete the partition of we add the vertices of to in the following way: for each and select an such that and add to set
In order to count the new edges, consider adding the vertices from in reverse order (starting at vertices in and going to ). In this way, when is added to it holds that Hence, by Proposition 4, each contributes at most new edges to (or edges if ) because is contained in
Then, the sum of the edges in each of the sets can be upper bounded by
(5) |
For any and so (4) implies that In the sum (5), the contribution from goes down as increases, and so the sum is maximized when for all Hence, the sum of the edges in each of the sets can be upper bounded by
where the last inequality uses the upper bound on the harmonic series and the bound that we can apply without loss of generality. This is a contradiction with the fact that is -far from -colorable. ∎
4.2 Proof of Theorem 2
We can now complete the proof of Theorem 2, restated below.
Theorem 2 (Precise formulation).
The sample complexity of the -Colorable property is
Proof.
Let be a random set of vertices drawn uniformly at random from without replacement, where is a large enough constant. The tester accepts a graph if and only if is -colorable. When is -colorable, then is also -colorable so the tester always accepts.
In the remainder of the proof, we upper bound the probability that is -colorable when is -far from -colorable. In the following, let us say that containers have a small union when .
Let us denote the vertices of as . For any , any values , and any set of distinct indices in , let us consider the event where the fingerprints define a collection of containers with a small union. When this event occurs, the probability that the remaining vertices in are all drawn from is at most .
For any , there are at most ways to choose sets of at most distinct indices as above. So by applying a union bound argument over all and over all possible choices of indices, the probability that any set of at most vertices in form the fingerprints of a family of containers with a small union from which we sample all the remaining vertices is at most
where the first inequality uses the fact that and the second is obtained by using the fact that for a large enough constant (and that the problem is non-trivial only when and in this regime).
By Lemma 6, for every collection of independent sets in , there is a value for which the fingerprints define containers with a small union. And by Proposition 3, each of these containers satisfy . Therefore, the argument above implies that the probability that the union of any independent sets contains all of , or equivalently that is -colorable, is at most . ∎
References
- [AFKS00] Noga Alon, Eldar Fischer, Michael Krivelevich, and Mario Szegedy. Efficient testing of large graphs. Combinatorica, 20(4):451–476, 2000.
- [AK02] Noga Alon and Michael Krivelevich. Testing k-colorability. SIAM Journal on Discrete Mathematics, 15(2):211–227, 2002.
- [BESS78] Béla Bollobás, Paul Erdős, Miklós Simonovits, and Endre Szemerédi. Extremal graphs without large forbidden subgraphs. Annals of Discrete Mathematics, 3:29–41, 1978.
- [BL10] Andrej Bogdanov and Fan Li. A better tester for bipartiteness? arXiv preprint 1011.0531, November 2010.
- [BMS15] József Balogh, Robert Morris, and Wojciech Samotij. Independent sets in hypergraphs. Journal of the American Mathematical Society, 28(3):669–709, 2015.
- [BT04] Andrej Bogdanov and Luca Trevisan. Lower bounds for testing bipartiteness in dense graphs. In Proceedings. of the 19th IEEE Annual Conference on Computational Complexity, pages 75–81, 2004.
- [BY22] Arnab Bhattacharyya and Yuichi Yoshida. Property Testing: Problems and Techniques. Springer Nature, 2022.
- [FLS04] Uriel Feige, Michael Langberg, and Gideon Schechtman. Graphs with tiny vector chromatic numbers and huge chromatic numbers. SIAM Journal on Computing, 33(6):1338–1368, 2004.
- [FS97] Uriel Feige and Michael Seltser. On the densest k-subgraph problem. Technical Report No. CS97-16, Weizmann Institute of Science, 1997.
- [GGR98] Oded Goldreich, Shari Goldwasser, and Dana Ron. Property testing and its connection to learning and approximation. Journal of the ACM, 45(4):653–750, 1998.
- [Gol17] Oded Goldreich. Introduction to property testing. Cambridge University Press, 2017.
- [GR10] Mira Gonen and Dana Ron. On the benefits of adaptivity in property testing of dense graphs. Algorithmica, 58(4):811–830, 2010.
- [GR11] Oded Goldreich and Dana Ron. Algorithmic aspects of property testing in the dense graphs model. SIAM Journal on Computing, 40(2):376–445, 2011.
- [GT03] Oded Goldreich and Luca Trevisan. Three theorems regarding testing graph properties. Random Structures & Algorithms, 23(1):23–57, 2003.
- [GW21] Oded Goldreich and Avi Wigderson. Non-adaptive vs adaptive queries in the dense graph testing model. In Proceedings of the 62nd IEEE Annual Symposium on Foundations of Computer Science, pages 269–275, 2021.
- [HMP21] Wasim Huleihel, Arya Mazumdar, and Soumyabrata Pal. Random subgraph detection using queries. arXiv preprint arXiv:2110.00744, 2021.
- [JPP23] Matthew Jenssen, Will Perkins, and Aditya Potukuchi. Approximately counting independent sets in bipartite graphs via graph containers. Random Structures & Algorithms, 2023.
- [KB80] Rob Kaas and Jan M Buhrman. Mean, median and mode in binomial distributions. Statistica Neerlandica, 34(1):13–18, 1980.
- [KW82] Daniel J. Kleitman and Kenneth J. Winston. On the number of graphs without 4-cycles. Discrete Mathematics, 41(2):167–172, 1982.
- [Man17] Pasin Manurangsi. Almost-polynomial ratio ETH-hardness of approximating densest k-subgraph. In Proceedings of the 49th Annual ACM SIGACT Symposium on Theory of Computing, pages 954–961, June 2017.
- [Mul18] Wolfgang Mulzer. Five proofs of Chernoff’s bound with applications. Bulletin of EATCS, 1(124), 2018.
- [Neu70] Peter Neumann. Über den median einiger dikreter verteilungen und eine damit zusammenhängende monotone konvergenz. Wissenschaftliche Zeitschrift der Technischen Universität Dresden, 19:29–33, 1970.
- [NR18] Yonatan Nakar and Dana Ron. On the testability of graph partition properties. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques (APPROX/RANDOM 2018), page 13, 2018.
- [RD85] Vojtěch Rödl and Richard A. Duke. On graphs with small subgraphs of large chromatic number. Graphs and Combinatorics, 1(1):91–96, 1985.
- [RS19] Miklós Z. Rácz and Benjamin Schiffer. Finding a planted clique by adaptive probing. arXiv preprint 1903.12050, 2019.
- [Sap05] Alexander Sapozhenko. Systems of containers and enumeration problems. In Stochastic Algorithms: Foundations and Applications (SAGA 2005), pages 1–13. Springer, 2005.
- [Soh12] Christian Sohler. Almost optimal canonical property testers for satisfiability. In Proceedings of the 53rd Annual IEEE Symposium on Foundations of Computer Science, pages 541–550, 2012.
- [ST15] David Saxton and Andrew Thomason. Hypergraph containers. Inventiones mathematicae, 201(3):925–992, 2015.
- [Zam23] Or Zamir. Algorithmic applications of hypergraph and partition containers. In Proceedings of the 55th Annual ACM Symposium on Theory of Computing, 2023.