The complexity of testing all properties of planar graphs, and the role of isomorphism
Abstract
Consider property testing on bounded degree graphs and let denote the proximity parameter. A remarkable theorem of Newman-Sohler (SICOMP 2013) asserts that all properties of planar graphs (more generally hyperfinite) are testable with query complexity only depending on . Recent advances in testing minor-freeness have proven that all additive and monotone properties of planar graphs can be tested in queries. Some properties falling outside this class, such as Hamiltonicity, also have a similar complexity for planar graphs. Motivated by these results, we ask: can all properties of planar graphs can be tested in queries? Is there a uniform query complexity upper bound for all planar properties, and what is the “hardest” such property to test?
We discover a surprisingly clean and optimal answer. Any property of bounded degree planar graphs can be tested in queries. Moreover, there is a matching lower bound, up to constant factors in the exponent. The natural property of testing isomorphism to a fixed graph requires queries, thereby showing that (up to polynomial dependencies) isomorphism to an explicit fixed graph is the hardest property of planar graphs. The upper bound is a straightforward adapation of the Newman-Sohler analysis that tracks dependencies on more carefully. The main technical contribution is the lower bound construction, which is achieved by a special family of planar graphs that are all mutually far from each other.
We can also apply our techniques to get analogous results for bounded treewidth graphs. We prove that all properties of bounded treewidth graphs can be tested in queries. Moreover, testing isomorphism to a fixed forest requires queries.
1 Introduction
Consider the setting of property testing for bounded degree graphs, under the model of random access to a graph adjacency list, as introduced by Goldreich-Ron [GR02]. Let be a graph where and the maximum degree is . We have random access to the list through neighbor queries. There is an oracle that, given and , returns the th neighbor of (if no neighbor exists, it returns ).
For a property of graphs with degree bound , the distance of to is the minimum number of edge additions/removals required to make have , divided by . We say that is -far from if the distance to is more than . A property tester for is a randomized procedure that takes as input (query access to) and a proximity parameter, . If , the tester must accept with probability at least . If is -far from , the tester must reject with probability at least . In our context, the property is called testable if there exists a property tester for whose query complexity is independent of .
One of the grand goals of property testing is to classify testable (graph) properties according to the query complexity of testing them. Of special interest are efficiently testable properties, whose testing complexity is . For bounded degree graphs, there is little clarity on this issue. A recent survey by Goldreich states: “Indeed, it is hard to find a common theme among [efficiently testable] properties…”. [Gol21]
Our work focuses on properties of planar graphs. Even the problem of just testing planarity has received much attention, whose complexity has only recently been shown to be [BSS08, HKNO09, EHNO11, YI15, LR15, KSS19]. Newman-Sohler proved that every planar property (actually, every hyperfinite property) is testable, but they do not provide an explicit complexity bound depending on [NS13]. Kumar-Seshadhri-Stolman recently showed that every additive and monotone planar property can tested in queries [KSS19]. Applying techniques from this result, Levi-Shoshan showed that planar Hamiltonicity can be tested efficiently [LS21]. The latter property is neither additive nor monotone, so it is natural to ask if all planar properties can be tested efficiently. If not, does there exist a uniform complexity bound for all planar properties, and a candidate for the “hardest planar property”? (We note that there is significant work on characterizing testable planar properties, for the unbounded degree case, which is qualitatively different [CMOS11, CS19]. Details are given in §1.2.)
We discover that the answer to this question is the query complexity bound . Up to constant factors in the exponent, the hardest planar property is testing isomorphism to a fixed explicit graph.
Let us give some formalism, and discuss the connection to isomorphism. In our discussions, is basically fixed, so we are considering a non-uniform setting. A planar property is a set of unlabeled bounded degree planar graph with vertices. The input is a labeled graph and the tester is trying to property test if is isomorphic (or equal to, if one ignores the labels) to any member of . Observe that singleton properties, where , are equivalent to testing if is isomorphic to an fixed graph . With this premable, we can state our main results.
Our upper bound is a straightforward adaptation of arguments in Newman-Sohler [NS13].
Theorem 1.1.
Consider a planar property of -vertex, degree-bounded graphs. There is a property tester for that makes queries.
Our main technical result is a matching lower bound for an explicit singleton property. One of the surprises (at least to the authors) is that testing isomorphism to an arbitrary set of planar graphs is not harder, up to polynomial dependencies, than testing isomorphism to a single fixed graph. A recent compendium of open problems by Goldreich states the question of determining the complexity of testing isomorphism (Open Problem 2.4 in [Gol21]). We note that our theorems resolve this question for bounded-degree planar graphs.
Theorem 1.2.
For every sufficiently large , there exists a bounded-degree planar graph on vertices such that property testing requires queries. Equivalently, testing isomorphism to requires queries.
Bounded treewidth classes: In the context of property testing, many results for planar graphs also hold for minor-free classes. It is natural to ask whether the bound of is the “right” answer for any property of minor-free graphs. We discover that to not be the case. For properties of bounded treewidth graphs, the answer is between and . The following theorems are also derived from the same methods for planar graphs, but the constructions are substantially simpler. The lower bound is achieved by a simple construction of forests. The main point of these results is to show that the bounded of achieved for planar properties can be significantly beaten for non-trivial minor-closed families.
Theorem 1.3.
Consider a bounded treewidth property of -vertex, degree-bounded graphs. There is a property tester for that makes queries.
Theorem 1.4.
For every sufficiently large , there exists a bounded-degree forest on vertices such that property testing requires queries.
1.1 Main ideas
One of the key components of property testers for planar graphs is the notion of partition oracles for hyperfinite graphs, as introduced by Hassidim-Kelner-Nguyen-Onak [HKNO09]. Bounded degree planar graphs (and more generally, minor-closed families) are hyperfinite, meaning that one can remove a constant fraction of edges and obtain connected components of constant size. Specifically, one can remove edges to get connected components of size .
Suppose is hyperfinite. A partition oracle is a local procedure that gives access to a hyperfinite decomposition to a graph , without any preprocessing. Given a query vertex , the partition oracle outputs a connected subgraph of size , such that the union of components forms a hyperfinite partition. Recent results give a partition oracle that runs in time per query [LR15, KSS21].
Newman-Sohler used partition oracles to prove that all hyperfinite (and hence planar) properties are testable [NS13]. This is where our results begin. Stripping down the Newman-Sohler arguments to their core, we can essentially treat a planar , up to edge changes, as a distribution over connected planar graphs of size . The partition oracle allows us to sample efficiently from this distribution. (If the input graph is not hyperfinite, the partition oracle can detect that efficiently, and the graph can be directly rejected.) For convenience, let denote this distribution.
Existing theorems in combinatorics show that the number of planar graphs of size is [MR01]; hence, this bounds the support size of . We can learn an approximation of up to TV-distance with queries to the partition oracle. It is not hard, but central to the argument, to show that if , then is -close to . For any property , one can simply check exhaustively if the learned approximation to is close to for any . (While this could be expensive in running time, it requires no further queries to .)
The main challenge is in the lower bound. We need to find a property such that testing requires samples from . We seem to need a converse to the upper bound argument, showing if is large, then and are far from each other. But this is false in general! The support of is a set of graphs, which are mutable objects. Meaning, we can modify dramatically by only modifying a few edges of . (As an extreme case, and could have disjoint supports on graphs that are close to each other.)
All our graphs (both input and hard instance) will be collections of connected components of size . We begin by trying to construct a graph such that: for all , implies that is -far from . Moreover, determining if should require samples from . A candidate is suggested by the problem of uniformity testing of distributions. Suppose we construct where is a uniform distribution on a collection of graphs (each member of which has size ), such that . By standard distribution testing arguments, distinguishing the uniform distribution on from a uniform distribution on half of requires samples. Suppose furthermore that all graphs in are -far from each other and all balanced separators of are at least of size . We prove that any graph such that is supported on half of must be -far from . Moreover, we can construct candidate input graphs , where any property tester can be simulated by sampling from . Putting all the arguments together, we get a bonafide lower bound: property testing requires queries.
What remains is the main technical construction. We need to design , the “suitable family” of graphs for the lower bound. We achieve this family by taking the grid and adding a collection of diagonal edges. We apply a cleaning procedure to get a large family of graphs that are all far from each other. The separator size holds trivially, since each graph in contains a large grid.
1.2 Related Work
Property testing on bounded-degree graphs is a vast topic and we direct the reader to Chapter 9 of Goldreich’s book [Gol17] for an introduction to the subject.
Arguably, the starting point for testing planarity is the seminal work of Benjamini-Schramm-Shapira [BSS08], who showed that all minor-closed properties are testable. This paper also introduced the significance of hyperfiniteness to property testing. Hassidim-Kelner-Nguyen-Onak [HKNO09] introduced the concept of partition oracles, a key tool in property testing for hyperfinite classes. Improving the query time of partition oracles was addressed by Edelman-Hassidim-Nguyen-Onak [EHNO11], Levi-Ron [LR15] and Kumar-Seshadhri-Stolman [KSS21].
The quest for characterizing testable (bounded-degree) graph properties and finding properties testable in is an important theme in property testing. We note that Goldreich’s recent survey explicitly calls these out as Open Problems 2.2 and 2.3 [Gol21]. One of the main inspirations for our work is the result of Newman-Sohler that proves that all hyperfinite properties are testable [NS13]. The recent work of [KSS21] proves that all additive and monotone minor-closed properties are efficiently testable, and Levi-Shoshan show that Hamiltonicity of minor-closed families is also efficiently testable [LS21].
We note an important line of work of testing properties of planar graphs, in the unbounded degree case. This direction was pioneered by Czumaj-Monemizadeh-Onak-Sohler [CMOS11], who showed the bipartitenesss is testable (independent of the size). A recent result of Czumaj-Sohler prove that all testable planar properties in the unbounded degree setting are related to subgraph freeness [CS19].
2 Proof of Theorem 1.1 and Theorem 1.3
We begin with some preliminaries: the notions of hyperfiniteness and partition oracles. Note that all graphs have vertices and degree bound .
Definition 2.1.
A graph is hyperfinite if there exists a subset of edges whose removal results in connected components of size at most .
A classic result of Alon-Seymour-Thomas (Prop. 4.1 of [AST94]) shows that all minor-closed families are -hyperfinite. In Lemma A.3, we prove that all bounded treewidth families are -hyperfinite. A partition oracle gives local access to a hyperfinite decomposition. We give the formal definition below (adapted from Def. 1.1 of [KSS21]).
Definition 2.2.
A procedure is a partition oracle for a minor-closed family of graphs if it satisfies the following properties. The deterministic procedure takes as input random access to , access to a random seed , a proximity parameter , and a vertex of . (We will think of fixing , so we use the notation . All probabilities are with respect to .) The procedure outputs a connected set of vertices, such that the sets , over all , form a partition of .
We say that the partition oracle outputs an -hyperfinite decomposition if the following properties hold. (i) For all , and (ii) with probability (over ), the number of edges between the sets is at most .
The main result of [KSS21] that we use is the following.
Theorem 2.3 (Rephrasing of Theorem 1.2 [KSS21]).
For any bounded degree graph in a minor-closed family, there is a partition oracle that outputs an -hyperfinite partition, and runs in time per query.
For bounded treewidth classes, the partition oracle can output -hyperfinite partitions using standard arguments (Claim A.4)
2.1 Property testers through subgraph count vectors
We use partition oracles to summarize a hyperfinite graph by an approximate count vector. This technique was first used by Newman-Sohler [NS13]. We follow their analysis, but take care to keep track of various dependencies on . This allows for getting the optimal query complexity bound.
Consider any planar property (technically, the following arguments hold for any minor-closed property). For any planar , let denote the partition given by the partition oracle of Theorem 2.3. (Note that the partition is a random variable.) Let be the set of unlabeled graphs in the family with at most vertices. We will always set (though the exact setting will change for bounded-treewidth properties).
For any graph consisting of connected components of size at most and any , let be the number of occurrences of in . Let be the -dimensional vector of these counts. Note that .
Claim 2.4.
Consider two graphs consisting entirely of connected components of size at most . If , then .
Proof:.
For every , we will modify and to equalize the and . We simply delete instances from either or (whichever has the larger count). This operation deletes at most edges. In total, the number of edges deleted is at most . ∎
For the property that we wish to test, construct the following set of count vectors: for every (with vertices and degree bound ) and every subgraph that is an partition of , add to .
Lemma 2.5.
Suppose is a valid -partition of . If , then . If is -far from , then , .
Proof:.
Suppose . In the construction of described above, we can select as and as . Hence, .
Suppose is -far from . Consider any , so , for being an -partition of . By triangle inequality, . By farness, . Because and are respective -partitions, and are at most . Hence, . By Claim 2.4, . ∎
Now, we present the main algorithmic ingredient of our property tester.
Claim 2.6.
Given and a setting of that is a valid -partition, using queries one can compute a vector such that with probability at least .
Proof:.
Fix . We show how to approximate . Pick uar vertex , and using the partition oracle, determine the component of containing . If the component is isomorphic to , declare success. The probability of success is exactly . By Chernoff-Hoeffding, we can get an additive estimate with error probability using samples. Thus, we get an additive estimate for . Applying for all , we get our estimate vector . Note that the total approximation is . The error probability, by union bound, is at most . By the running time bound of the partition oracle, the total number of queries made is . ∎
Now, we prove Theorem 1.1, repeated for convenience. To get the optimal bound for planar properties, we need the right upper bound for . Numerous results in the past show that the answer is . We cite one specific reference.
Theorem 2.7.
(Theorem 5 of [MR01]) A planar graph on vertices and edges can be represented uniquely by bits.
Hence, the number of planar graphs on vertices is .
See 1.1
Proof:.
The tester has two phases. In the first phase, it uses Theorem 2.3 to get a hyperfinite partition. If it succeeds, then the tester proceeds to the second phase. It uses Claim 2.6 to approximate and checks if it lies in .
The tester repeats the following times. For a random seed , it sets up the partition oracle with this seed, and then estimates the number of cut edges via random sampling. This is done by sampling uar vertices , picking a uar neighbor of , and calling the partition oracle on and . If the outputs are different components, then edge is cut. If more than an -fraction of edges are cut, the process is repeated with a new choice of . Otherwise, is fixed, and the tester proceeds to the next phase. If no is found, the tester rejects.
At this point, a suitable has been discovered. We set , to be the size bound of components, as promised by Theorem 2.3. Using Claim 2.6, the tester computes an approximate count vector of , setting . Note that is a fixed set of vectors, independent of the input. The tester then determines if such that . If such a vector exists, it accepts. Otherwise, it rejects. The description of the property tester is complete, and we proceed to the analysis.
Query complexity bound: Note that , , and, by Theorem 2.7, . Hence, by Claim 2.6, the overall query complexity is .
Correctness analysis: If is planar and hence -hyperfinite, with probability at least over , the partition oracle computes a hyperfinite decomposition. Hence, with probability at least , the tester proceeds to the second phase with being a valid -hyperfinite partition. Conversely, with probability at least , if the tester finds a suitable , then is a valid -hyperfinite partition.
Suppose and hence planar. As argued above, with probability , is an -hyperfinite decomposition. By Claim 2.6, with probability , . By the union bound, both conditions hold with probability at least . By Lemma 2.5, . Thus, there exists in such that and the tester accepts with probability at least .
Suppose is -far from . By Lemma 2.5, for all , . By the triangle inequality, . Thus, the tester rejects with probability at least .
∎
The proof of Theorem 1.3 is nearly identical.
See 1.3
Proof:.
We redo the proof of Theorem 1.1. Claim A.4 shows that graphs with treewidth at most admit a partition oracle with . For bounded treewidth graphs, we apply the trivial bound (Claim A.1). We apply these bounds in the above proof, and get a query complexity of queries. ∎
3 Lower bounds through suitable families
The key construct for the lower bounds is given in the following definition. A graph class is monotone if it is closed under edge removals. A graph class is additive if it closed under taking disjoint unions of graphs.
Definition 3.1.
Fix and a monotone, additive graph class . We call a family of distinct graphs (in ) -suitable if the following conditions hold:
-
•
All graphs in have the same number of vertices (denoted ).
-
•
If , , .
-
•
, the size of any balanced separator in is .
The main lemma used in our lower bound says that a large suitable family leads to a property testing lower bound.
Lemma 3.2.
Fix , and let denote an -suitable family of a monotone, additive graph class . Then, for all sufficiently large , there exists a graph with the following property. Any property tester for the property with proximity parameter requires queries.
The proof of Lemma 3.2 requires some tools, that we shall build up in this section. Firstly, using the properties of a suitable family, we can construct two graphs consisting entirely of components in that are far from each other. These graphs form the core of the lower bound.
Claim 3.3.
Let denote an -suitable family of a graph class (as defined in Lemma 3.2). Let denote a set indexing graphs in .
Let be the graph consisting of a disjoint union of all graphs in . For any subset with , let be the disjoint union of two copies of each graph in indexed by . Then the graphs and are far from each other.
Proof:.
Since is -suitable, we know that all graphs are defined on vertices. We know for any , since the number of edge edits needed to change to is at least . Since will be fixed for the remainder of the proof, it will be convenient to refer to as for the rest of the argument. Let us denote the components of (resp ) as (resp as ). Let . contains connected components in each of which occurs twice. Recall from Definition 3.1, are -far from each other. Let denote the set of missing components of in . And let denote the set of components from that are present in (without duplicates). Thus, contains two subgraphs isomorphic to which we denote as and . For , let denote the subgraph of isomorphic to . Similarly, define . Let
denote the smallest set of edge modifications to that produce a graph isomorphic to . We now lower bound . For a component , let denote the edge set of that component. Fix a component such that . The following cases arise.
-
•
Case 1: is connected. Thus, the deletion edits do not disconnect . In this case, after the insertion edits in , we obtain a component in . By the definition of suitable families, if , it takes at least (follow the sentence). If , then obviously gets at least one edit, which is at least edits. In both cases, the number of edits is .
-
•
Case 2: is not connected. Now, we get two subcases depending on size of the biggest component in .
-
1.
The biggest component in has size at least . In this case, maps to some component in . Now we split into two cases. Suppose . Since is a bounded degree graph, the number of insertion edits (by Definition 3.1) is at least . Suppose . In any case must get at least one insertion edit to make it have size . So the number of edits is at least .
-
2.
The biggest component in has size at most . In this case, note that removes a balanced separator of . Thus, the number of deletion edits made to is at least .
-
1.
In all of the above cases we see that for any component with , the number of edits made to is at least . Finally, note that
And therefore, the distance between graphs and is at least as desired. ∎
We can now define the graph used to prove the lower bound Theorem 1.2. For convenience, assume that is a multiple of , the size of graphs in . (If not, we will pad the final graph with extra isolated vertices.) The (unlabeled) graph is a disjoint union of copies of every graph in .
Analogously, for , let be the graph that has copies of every graph in indexed by . For convenience, we refer to any such as a half index set.
3.1 The distinguishing problem
Note that input graph is really a labeled instance, as given by the adjacency list. We now give the explicit distribution of labeled inputs and the main “distinguishing” problem. By Yao’s minimax lemma, it suffices to prove lower bounds for determinstic algorithms over randomized inputs. So let us define the YES and NO distributions.
-
•
YES: Generate a uar labeling of .
-
•
NO: Generate a uar half index set . Then, generate a uar labeling of .
Given an input generated from either of these distribution, the distinguishing problem is to determine, with probability , which distribution came from. (Note that the supports of these distributions are disjoint.)
Claim 3.4.
If there exists a deterministic distinguisher that makes queries (where is independent of ), then there exists an algorithm with the following property. Given input (generated as above), the algorithm gets uar connected components of and determines, with probability , whether came from the YES or NO distribution.
Proof:.
Consider any deterministic algorithm. One can express its behavior as follows. At any stage, it has explored the adjacency lists of some vertices . It can choose to further query the adjacency lists of these vertices. In this case, it further explores the connected components already visited. The algorithm can also choose to visit a new vertex ID . Note that is an arbitrary (but fixed) function of all the vertex IDs and edges seen so far. Since the input is a uar labeling of either or , the probability that lies in a component already visited is at most . With probability at least , lies in a uar connected component (since all components have exactly the same size, ). By a union bound, with probability at least , the algorithm sees a uar set of at most components and makes its decision.
The total number of components is , and hence the probability of seeing a specific set of components is (up to -factors) . Since is sufficiently large, this probability is within -factors of . This implies that with probability at least , the algorithm sees a uniform distribution of multisets of components. Alternately, the algorithm gets uar connected components of . The success probability only changes by -factors. ∎
The proof of Lemma 3.2 is now a straightforward reduction from distribution testing and an application of the tools built thus far.
Proof:.
(of Lemma 3.2) By Claim 3.3, all graphs generated by the NO distribution are -far from the property . By Yao’s minimax lemma, it suffices to show that any deterministic distinguisher requires queries.
By Claim 3.4, if there is a deterministic distinguisher making queries, then there is an algorithm that can distinguish from (with probability ) by querying uar connected components. Standard arguments in distribution testing show that samples are required to distinguish the uniform distribution on the index set from the uniform distribution on a uar half index set (refer to the section titled “An lower bound” on page 13 of [Can20]). By mapping arbitrarily to , getting samples from the uniform distribution in is equivalent to getting uar components from generated from the YES distribution. Analogously, samples from the uniform distribution on a uar half index set is equivalent to uar components from generated from the NO distribution. Thus, any algorithm that distinguishes from with uar connected components requires samples, leading to the lower bound for deterministic distinguishers. ∎
4 A suitable family for planar graphs
The main result of this section is the following theorem.
Theorem 4.1.
There exists a family of planar graphs where each graph has vertices and
-
•
.
-
•
For every , the size of a minimum balanced vertex separator in is
-
•
For every pair of graphs , it takes at least edge edits to go from to . In particular, it holds that and are -far.
We collect some ingredients which will be useful in proving Theorem 4.1. The first important ingredient we need is Whitney’s theorem.
Theorem 4.2 (Whitney’s Theorem. (Theorem 4.3.2 in [Die17])).
Any two planar embeddings of a -connected planar graph are equivalent.
We first define a “base” graph (Definition 4.3). This is obtained in the following manner. Let us start with the grid which we denote as . The label set of is indexed by a pair . We call this labeling the standard grid order. We denote the base graph by , and is constructed on top of by the addition of some specific edges. Essentially, we add edges at the corners; we refer the reader to Fig. 1. We add edges from to , from to , from to and from to . This graph thus differs from only in the edges adjacent to the corner vertices, making all the corners unique. Let us label according to the standard grid order. For , we thus identify as the pair which refers to the coordinate of on the grid. With this labeling, we have . where the set contains all the non grid neighbors of all the corner vertices.
Definition 4.3.
Consider the graph obtained above and consider the planar embedding of (which is unique by Theorem 4.2). We will fix an embedding of where
-
•
The unique corner vertex adjacent to two vertices 2 hops away (according to standard grid order) is located at .
-
•
The unique corner vertex adjacent to two vertices 3 hops away (in the standard grid order) is located at .
-
•
The unique corner vertex adjacent to two vertices 4 hops away (in the standard grid order) is located at .
-
•
The unique corner vertex adjacent to two vertices 5 hops away (in the standard grid order) is located at .
We call this graph (and by abuse of terminology, its embedding) as the base graph.
4.1 Proof of Theorem 4.1
Let us begin by defining the following family of graphs. We will show that all graphs in this family are pairwise non isomorphic.
Construction 4.4.
The family of graphs is a family of graphs labeled according to standard grid order which is obtained from the base graph in the following way. Let
denote an index set for vertices in . Graphs in this family are obtained in the following manner. For each , we add exactly one of the following edges. Either we add
-
•
The upward diagonal , or
-
•
The downward diagonal .
Noting , note that the size of this family is .
Lemma 4.5.
The family defined in Construction 4.4 is a collection of pairwise non isomorphic graphs on vertices.
Proof:.
As mentioned earlier in Definition 4.3, the base graph admits a unique planar drawing. This is due to Theorem 4.2, because the additional edges made our graph -connected. We fix that embedding. Now consider two drawings . They differ in at least one diagonal and therefore they are non isomorphic. ∎
However, we still need to show that this family has a sufficient number of graphs that are far from being isomorphic to each other. We state this in the following lemma.
Lemma 4.6.
There is a greedy procedure that takes as input the family of graphs and returns a family of size at least all of which are pairwise -far from each other.
Proof:.
We begin with the following observations. Consider the ball for any instance of the modified grid , which is the set of graphs reachable from by (diagonal) edge deletions. For any , note that the number of edges in is at most . Thus, the number of possible graphs reachable from after making deletions is at most . We replace these deleted diagonals with diagonals oriented the other way. Choose . We apply the bound .
Let denote the set found by the algorithm below. Using the above bound on , it holds that
Since for any , the maximum degree of any vertex in is at most , and all the graphs in the set disagree in at least edges, it follows that all the graphs in are pairwise -far. Below, we present our greedy procedure.
TakeScoops()
1.
Initialize .
2.
While :
(a)
.
(b)
.
(c)
.
end while
3.
return .
The family is thus the family output with the desired properties as stated in Theorem 4.1.
∎
The proof of the theorem follows immediately from the lemmas.
Proof of Theorem 4.1.
We show that the family obtained in Lemma 4.6 satisfies all the criteria we require to be satisfied in Theorem 4.1. We pick these one by one:
-
•
This is a direct consequence of Lemma 4.6.
-
•
This follows from the fact that the family comprises graphs that contain the grid on vertices as proper induced subgraphs. The grid has balanced separators of size , so all graphs in must have separators at least as large.
-
•
Follows from Lemma 4.6.
This shows that follows the properties described in theorem 4.1, thus completing its proof. ∎
4.1.1 Proof of Theorem 1.2
The proof follows from plugging in the right value for . In Theorem 4.1, we set . We get a suitable family with , where . By Lemma 3.2, we have a explicit singleton property that requires queries to test.
5 A suitable family of trees
This construction is fairly straightforward: we simply pick all distinct trees on vertices. We need to take care to count the number of unrooted trees, so some work is required.
Let denote the set of unlabled and rooted binary trees on vertices. We first recall the following fact.
Fact 5.1.
We would like to lower bound the number of unlabeled, unrooted and binary trees on vertices. To this end, we first make the following definition.
Definition 5.2.
Write . For , let
Note that this also means . Also, note that if then .
Now, we note the following.
Claim 5.3.
For any , .
Proof:.
Suppose not. Then which share as the root where both and . Then as well. However, we also have . Contradiction. ∎
Now, we will produce a family of unlabeled and unrooted binary trees. This is done by picking a single tree from each orbit in . The following is immediate.
Claim 5.4.
.
Proof:.
Since all orbits have size at most , it holds that . ∎
We can now complete the proof of the main lower bound for forests.
Proof:.
(of Theorem 1.4) The suitable family is simply . Since all graphs are of size , it only suffices to verify the separator bound for suitable families. But that trivially holds, since the separator size is non-zero, which is . The size of the family is , by Claim 5.4. Lemma 3.2 gives the lower bound of for property testing an explicit singleton property. ∎
References
- [AST94] Noga Alon, Paul D. Seymour, and Robin Thomas. Planar separators. SIAM J. Discrete Math., 7(2):184–193, 1994.
- [BGHK95] H.L. Bodlaender, J.R. Gilbert, H. Hafsteinsson, and T. Kloks. Approximating treewidth, pathwidth, frontsize, and shortest elimination tree. Journal of Algorithms, 18(2):238–255, 1995.
- [BSS08] I. Benjamini, O. Schramm, and A. Shapira. Every minor-closed property of sparse graphs is testable. In Symposium on the Theory of Computing (STOC), pages 393–402, 2008.
- [Can20] Clément L. Canonne. A Survey on Distribution Testing: Your Data is Big. But is it Blue? Number 9 in Graduate Surveys. Theory of Computing Library, 2020.
- [CMOS11] Artur Czumaj, Morteza Monemizadeh, Krzysztof Onak, and Christian Sohler. Planar graphs: Random walks and bipartiteness testing. In IEEE 52nd Annual Symposium on Foundations of Computer Science, FOCS, pages 423–432. IEEE Computer Society, 2011.
- [CS19] Artur Czumaj and Christian Sohler. A characterization of graph properties testable for general planar graphs with one-sided error (it’s all about forbidden subgraphs). In 60th IEEE Annual Symposium on Foundations of Computer Science, FOCS, pages 1525–1548. IEEE Computer Society, 2019.
- [Die17] Reinhard Diestel. Graph Theory. Springer Publishing Company, Incorporated, 5th edition, 2017.
- [EHNO11] Alan Edelman, Avinatan Hassidim, Huy N. Nguyen, and Krzysztof Onak. An efficient partitioning oracle for bounded-treewidth graphs. In Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques RANDOM, pages 530–541, 2011.
- [FS09] Philippe Flajolet and Robert Sedgewick. Analytic Combinatorics. Cambridge University Press, 2009.
- [Gol17] O. Goldreich. Introduction to Property Testing. Cambridge University Press, 2017.
- [Gol21] Oded Goldreich. Open problems in property testing of graphs. Electron. Colloquium Comput. Complex., page 16, 2021.
- [GR02] O. Goldreich and D. Ron. Property testing in bounded degree graphs. Algorithmica, 32(2):302–343, 2002.
- [Gru12] Hermann Gruber. On balanced separators, treewidth, and cycle rank. Journal of Combinatorics, 3(4):669–681, 2012.
- [HKNO09] A. Hassidim, J. Kelner, H. Nguyen, and K. Onak. Local graph partitions for approximation and testing. In Foundations of Computer Science (FOCS), pages 22–31, 2009.
- [KSS19] Akash Kumar, C. Seshadhri, and Andrew Stolman. Random walks and forbidden minors II: a poly(d )-query tester for minor-closed properties of bounded degree graphs. In STOC 2019., pages 559–567, 2019.
- [KSS21] Akash Kumar, C. Seshadhri, and Andrew Stolman. Random walks and forbidden minors III: poly(d /)-time partition oracles for minor-free graph classes. Electron. Colloquium Comput. Complex., 28:8, 2021.
- [LR15] Reut Levi and Dana Ron. A quasi-polynomial time partition oracle for graphs with an excluded minor. ACM Transactions on Algorithms (TALG), 11(3):24, 2015.
- [LS21] Reut Levi and Nadav Shoshan. Testing hamiltonicity (and other problems) in minor-free graphs. CoRR, abs/2102.11728, 2021.
- [MR01] J. Ian Munro and Venkatesh Raman. Succinct representation of balanced parentheses and static trees. SIAM Journal on Computing, 31(3):762–776, 2001.
- [NS13] Ilan Newman and Christian Sohler. Every property of hyperfinite graphs is testable. SIAM Journal on Computing, 42(3):1095–1112, 2013.
- [Ott48] Richard Otter. The number of trees. Annals of Mathematics, 49(3):583–599, 1948.
- [YI15] Yuichi Yoshida and Hiro Ito. Testing outerplanarity of bounded degree graphs. Algorithmica, 73(1):1–20, 2015.
Appendix A Missing proofs from §2
In this appendix, our objective is to prove Claim A.1 and Claim A.4. The former claim counts the number of graphs with treewidth at most and is proved below.
Claim A.1.
The number of unlabeled bounded treewidth graphs on vertices is at most .
Proof:.
Observe that for a graph on vertices with bounded treewidth has edges. Then, the number of unlabelled graphs of this form, is at most , where . Thus , which implies that the number of bounded treewidth graphs on vertices is at most . ∎
Next, we produce the statment of Claim A.4. It postulates that there exist efficient time implementations of partition oracles for -degree bounde graphs with bounded treewidth.
One key tool we need to prove this theorem is an analog of Alon-Seymour-Thomas style result which proves that a graph with treewidth at most is -hyperfinite.
Let be the size of the smallest balanced separator, whose removal leads to connected components of size at most of the vertices.
We now prove that every treewidth graph is hyperfinite.
Lemma A.3.
Fix , . Then for any graph with there exists a set of edges such that the following hold:
-
•
is a graph with connected components no larger than .
-
•
Proof:.
We begin by showing item 1. Since has treewidth at most , all of its subgraphs also have treewidth . Also, recall , implies that for all -balanced separators of satisfy .
Consider the following recursive procedure. Here is a routine which returns
a three tuple where is the smallest balanced separator of and
and are collections of connected components of each with size at least
.
Decompose
1.
If
(a)
Write .
(b)
.
Note that the sets and produced at any intermediate step of this procedure are unions of connected components (and they are not necessarily connected components themselves). Consider the recursion tree of this process and let us examine the leaves of this recursion tree. The leaf nodes correspond to some subgraph (which is a union of connected components) obtained after repeated applications of balanced separator routine such that . Also, since the procedure repeatedly deletes a -separator, it also holds that all the leaf nodes of the recursion tree correspond to graphs with at least vertices. Thus, since the leaves correspond to disjoint subgraphs, the number of leaves equals and the total number of nodes produced in the recursion is at most . Generating every node requires deleting at most edges. Thus, in all, we delete edges which proves the lemma.
∎
We now prove bounds on partition oracles for bounded treewidth graphs.
Claim A.4.
Let denote the class of graphs with treewidth at most . Let denote a graph with treewidth at most . Then admits an partition oracle with .
Proof:.
Set . Since is a minor-closed class of graphs, it follows from Theorem 2.3 that any admits an partition oracle whose setting on is denoted as . Note that the number of edges that run between the connected components of the underlying partition is at most . We will show how to modify the parition obtained by this oracle and obtain another oracle which returns components with size at most . This is done as follows.
Let denote some connected component in the partition . If , we do not refine . On the other hand, if , we use Lemma A.3 on with parameter . This refines and produces connected components of size at most as desired. And the total number of edges lost is at most . Overall, the total number of edges lost is at most as desired. ∎