Zarankiewicz’s problem for semilinear hypergraphs
Abstract.
A bipartite graph with is semilinear if for some and the edge relation consists of the pairs of points satisfying a fixed Boolean combination of linear equalities and inequalities in variables for some . We show that for a fixed , the number of edges in a -free semilinear is almost linear in , namely for any ; and more generally, for a -free semilinear -partite -uniform hypergraph.
As an application, we obtain the following incidence bound: given points and open boxes with axis parallel sides in such that their incidence graph is -free, there can be at most incidences. The same bound holds if instead of boxes one takes polytopes cut out by the translates of an arbitrary fixed finite set of halfspaces.
We also obtain matching upper and (superlinear) lower bounds in the case of dyadic boxes on the plane, and point out some connections to the model-theoretic trichotomy in -minimal structures (showing that the failure of an almost linear bound for some definable graph allows one to recover the field operations from that graph in a definable manner).
1. Introduction
We fix and let be an -partite and -uniform hypergraph (or just -hypergraph for brevity) with vertex sets having , (hyper-) edge set , and being the total number of vertices.
Zarankiewicz’s problem asks for the maximum number of edges in such a hypergraph (as a function of ) assuming that it does not contain the complete -hypergraph with a fixed number of vertices in each part. The following classical upper bound is due to Kővári, Sós and Turán [kovari1954problem] for and Erdős [erdos1964extremal] for a general : if is -free, then . A probabilistic construction in [erdos1964extremal] also shows that the exponent cannot be substantially improved.
However, stronger bounds are known for restricted families of hypergraphs arising in geometric settings. For example, if is the incidence graph of a set of points and lines in , then is -free, and Kővári-Sós-Turán Theorem implies . The Szemerédi-Trotter Theorem [szemeredi1983extremal] improves this and gives the optimal bound . More generally, [fox2017semi] gives improved bounds for semialgebraic graphs of bounded description complexity. This is generalized to semialgebraic hypergraphs in [do2018zarankiewicz]. In a different direction, the results in [fox2017semi] are generalized to graphs definable in -minimal structures in [basu2018minimal] and, more generally, in distal structures in [chernikov2020cutting].
A related highly nontrivial problem is to understand when the bounds offered by the results in the preceding paragraph are sharp. When is the incidence graph of points and circles of unit radius in , the best known upper bound is , proven in [spencer1984unit] and also implied by the general bound for semialgebraic graphs. Any improvement to this bound will be a step toward resolving the long standing unit distance conjecture of Erdős (an almost linear bound of the form will positively resolve it).
This paper was originally motivated by the following incidence problem. Let be the incidence graph of a set of points and a set of solid rectangles with axis-parallel sides (which we refer to as boxes) in . Assuming that is -free, i.e. no two points belong to two rectangles simultaneously, what is the maximum number of incidences ? In the following theorem, we obtain an almost linear bound (which is much stronger than the bound implied by the aforementioned general result for semialgebraic graphs) and demonstrate that it is close to optimal.
Theorem (A).
-
(1)
For any set of points in and any set of boxes in , if the incidence graph on is -free, then it contains at most incidences (Corollary 2.38 with ).
-
(2)
If all boxes in are dyadic (i.e. direct products of intervals of the form for some integers ), then the number of incidences is at most (Theorem 4.7).
-
(3)
For an arbitrarily large , there exist a set of points and dyadic boxes in so that the incidence graph is -free and the number of incidences is (Proposition 3.5).
Problem 1.1.
While the bound for dyadic boxes is tight, we leave it as an open problem to close the gap between the upper and the lower bounds for arbitrary boxes.
Remark 1.2.
A related result in [fox2008separator] demonstrates that every -free intersection graph of convex sets on the plane satisfies . Note that in Theorem (B) we consider a -free bipartite graph, so in particular there is no restriction on the intersection graph of the boxes in .
Theorem (A.1) admits the following generalization to higher dimensions and more general polytopes.
Theorem (B).
-
(1)
For any set of points and any set of boxes in , if the incidence graph on is -free, then it contains at most incidences (Corollary 2.38).
-
(2)
More generally, given finitely many half-spaces in , let be the family of all possible polytopes in cut out by arbitrary translates of . Then for any set of points in and any set of polytopes in , if the incidence graph on is -free, then it contains at most incidences (Corollary 2.37).
Problem 1.3.
What is the optimal bound on the power of in Theorem (B)? In particular, does it actually have to grow with the dimension ?
Remark 1.4.
A bound similar to Theorem (B.1) and an improved bound for Theorem (A.1) in the -free case are established independently by Tomon and Zakharov in [Tomon], in which the authors also use our Theorem (A.3) to provide a counterexample to a conjecture of Alon et al. [alon2015separation] about the number of edges in a graph of bounded separation dimension, as well as to a conjecture of Kostochka from [kostochka2004coloring]. Some further Ramsey properties of semilinear graphs are demonstrated by Tomon in [tomon2021ramsey].
The upper bounds in Theorems (A.1) and (B) are obtained as immediate applications of a general upper bound for Zarankiewicz’s problem for semilinear hypergraphs of bounded description complexity.
Definition 1.5.
Let be an ordered vector space over an ordered division ring (e.g. viewed as a vector space over itself). A set is semilinear, of description complexity if is a union of at most sets of the form
where and each is a linear function, i.e., of the form
for some and .
We focus on the case in the introduction, in which case these are precisely the semialgebraic sets that can be defined using only linear polynomials.
Remark 1.6.
By a standard quantifier elimination result [van1998tame, §7], every set definable in an ordered vector space over an ordered division ring, in the sense of model theory, is semilinear (equivalently, a projection of a semilinear set is a finite union of semilinear sets).
Definition 1.7.
We say that an -hypergraph is semilinear, of description complexity if there exist some and a semilinear set of description complexity so that is isomorphic to the -hypergraph .
We stress that there is no restriction on the dimensions in this definition. We obtain the following general upper bound.
Theorem (C).
If is a semilinear -hypergraph of description complexity and is -free, then
In particular for any in this case. For a more precise statement, see Corollary 2.36 (in particular, the dependence of the constant in on is at most linear).
Remark 1.8.
It is demonstrated in [mustafa2015zarankiewicz] that a similar bound holds in the situation when is the intersection hypergraph of -dimensional simplices in .
One can get rid of the logarithmic factor entirely by restricting to the family of all finite -hypergraphs induced by a given -free semilinear relation (as opposed to all -free -hypergraphs induced by a given arbitrary semilinear relation as in Theorem (C)).
Theorem (D).
Assume that is semilinear and does not contain the direct product of infinite sets (e.g. if is -free for some ). Then for any -hypergraph of the form for some finite , we have .
This is Corollary 5.12 and follows from a more general Theorem 5.6 connecting linear Zarankiewicz bounds to a model-theoretic notion of linearity of a first-order structure (in the sense that the matroid given by the algebraic closure operator behaves like the linear span in a vector space, as opposed to the algebraic closure in an algebraically closed field — see Definition 5.3).
In particular, for every -free semilinear relation (equivalently, definable with parameters in the first-order structure by Remark 1.6) we have for all . One the other hand, by optimality of the Szemerédi-Trotter bound, for the semialgebraic -free point-line incidence graph we have . Note that in order to define it we use both addition and multiplication, i.e. the field structure. This is not coincidental — as a consequence of the trichotomy theorem in -minimal structures [peterzil1998trichotomy], we observe that the failure of a linear Zarankiewicz bound always allows to recover the field in a definable way (Corollary 5.11). In the semialgebraic case, we have the following corollary that is easy to state (Corollary 5.14).
Theorem (E).
Assume that for some is semialgebraic and -free, but . Then the graph of multiplication restricted to the unit box is definable in .
We conclude with a brief overview of the paper.
In Section 2 we introduce a more general class of hypergraphs definable in terms of coordinate-wise monotone functions (Definition 2.1) and prove an upper Zarankiewicz bound for it (Theorem 2.17). Theorems (A.1), (B) and (C) are then deduced from it in Section 2.5.
In Section 3 we prove Theorem (A.3) by establishing a lower bound on the number of incidences between points and dyadic boxes on the plane, demonstrating that the logarithmic factor is unavoidable (Proposition 3.5).
In Section 4, we establish Theorem (A.2) by obtaining a stronger bound on the number of incidences with dyadic boxes on the plane (Theorem 4.7). We use a different argument relying on a certain partial order specific to the dyadic case to reduce from given by the general theorem above to . Up to a constant factor, this implies the same bound for incidences with general boxes when one only counts incidences that are bounded away from the border (Remark 4.8).
Finally, in Section 5, we prove a general Zarankiewicz bound for definable relations in weakly locally modular geometric first-order structures (Theorem 5.6), deduce Theorem (D) from it (Corollary 5.12) and observe how to recover a real closed field from the failure of Theorem (D) in the -minimal case (Corollary 5.11).
Acknowledgements
We thank the referees for their very helpful suggestions on improving the paper. Artem Chernikov was partially supported by the NSF CAREER grant DMS-1651321 and by a Simons Fellowship. He is grateful to Adam Sheffer for some very helpful conversations, and to the American Insitute of Mathematics for additional support. Sergei Starchenko was supported by the NSF Research Grant DMS-1800806. Terence Tao was partially supported by NSF grant DMS-1764034 and by a Simons Investigator Award.
2. Upper bounds
2.1. Coordinate-wise monotone functions and basic sets
For an integer , by an -grid (or a grid if is clear from the context) we mean a cartesian product of some sets . As usual, denotes the set .
If is a grid, then by a sub-grid we mean a subset of the form for some .
Let be an -grid, an arbitrary set and a function. For , set
and let and be the projection maps.
For and , we write for the element with and . In particular, when , .
Definition 2.1.
Let be an -grid and a linearly ordered set. A function is coordinate-wise monotone if for any , and we have
Remark 2.2.
Let be an -grid and an ordered abelian group. We say that a function is quasi-linear if there exist some functions , , such that
Then every quasi-linear function is coordinate-wise monotone (as for any ).
Example 2.3.
Suppose that is an ordered vector space over an ordered division ring , for , and is a linear function. Then is obviously quasi-linear, hence coordinate-wise monotone.
Remark 2.4.
Let be a grid and a sub-grid. If is a coordinate-wise monotone function then the restriction is a coordinate-wise monotone function on .
Definition 2.5.
Let be an -grid. A subset is a basic set if there exists a linearly ordered set , a coordinate-wise monotone function and such that .
Remark 2.6.
If , then every subset of is basic.
Remark 2.7.
If is given by for some coordinate-wise monotone function , then is a basic set as well. Indeed, we can just add a new element to so that it is a successor of , then .
Similarly, the sets are basic, by inverting the order on .
We have the following “coordinate-splitting” presentation for basic sets.
Proposition 2.8.
Let be an -grid and a basic set. Then there is a linearly ordered set , a coordinate-wise monotone function and a function such that .
Remark 2.9.
The converse of this proposition is also true: an arbitrary linear order can be realized as a subset of some ordered abelian group with the induced ordering (we can take when is at most countable); then define by setting
Proof of Proposition 2.8.
Assume that we are given a coordinate-wise monotone function and with .
For , let be the pre-order on induced by , namely for we set if and only if for some (equivalently, any) we have .
Quotienting by the equivalence relation corresponding to the pre-order if needed, we may assume that each is actually a linear order.
Let be the partial order on with if and only if
Let , where denotes the disjoint union. Clearly is a strict partial order on , i.e. a transitive and anti-symmetric (hence irreflexive) relation.
For any and we define
Claim 2.10.
Let , and .
-
(1)
If , then and .
-
(2)
If , then and .
Proof.
. We have and , hence .
Since and we also have .
is similar. ∎
Let be the transitive closure of . It follows from the above claim that . More explicitly, for , if , and for , if for some . It is not hard to see then that is anti-symmetric, hence it is a strict partial order on .
Claim 2.11.
The union is a strict partial order on .
Proof.
We first show transitivity. Note that and are both transitive, so it suffices to show for that if either or , then . Furthermore, since , we may restrict our attention to the following cases. If with and , then , and so . If with and , then , and so .
To check anti-symmetry, assume and . Since we have for some . We have , contradicting . ∎
Finally, let be an arbitrary linear order on extending . Since extends , for and we have if and only if .
We take and to be the identity maps. Since extends , the map is coordinate-wise monotone. ∎
2.2. Main theorem
Definition 2.12.
Let be an -grid.
-
(1)
Given , we say that a set has grid-complexity (in ) if is the intersection of with at most basic subsets of .
We say that has finite grid-complexity if it has grid-complexity for some .
-
(2)
For integers we say that is -free is does not contain a sub-grid with .
In particular, itself is the only subset of of grid-complexity .
Example 2.13.
Suppose that is an ordered vector space over an ordered division ring, and
for some linear functions . Then each is coordinate-wise monotone (Example 2.3), hence each of the sets
is a basic subset of the grid (the latter by Remark 2.7), and as an intersection of these basic sets has grid-complexity .
Remark 2.14.
-
(1)
Let be an -grid and a subset of of grid-complexity . If is a sub-grid containing , then is also a subset of of grid-complexity .
-
(2)
In particular, if is a subset of grid-complexity , then is a subset of grid-complexity of the grid , where is the projection of on (it is the smallest sub-grid of containing ).
Definition 2.15.
Let be a finite -grid and . For , we will denote by the integer
Example 2.16.
We have , , .
We can now state the main theorem.
Theorem 2.17.
For every integers there are and such that: for any finite -grid and -free subset of grid-complexity we have
Moreover, we can take .
Remark 2.18.
Inspecting the proof, it can be verified that the dependence of on is at most linear.
Remark 2.19.
We use instead of to include the case .
Remark 2.20.
Definition 2.21.
Let be a grid. We extend the definition of to arbitrary finite subsets of as follows. Let be a finite subset, and let , , be the projections of . We define .
If is a finite -grid and , then obviously . Thus Theorem 2.17 is equivalent to the following.
Proposition 2.22.
For every integers there are and such that for any -grid and -free finite subset of grid-complexity we have
Definition 2.23.
For and , let be the maximal size of a -free subset of grid-complexity of some -grid with .
Then Proposition 2.22 can be restated as follows.
Proposition 2.24.
For every integers there are and such that
Remark 2.25.
Notice that .
In the rest of the section we prove Proposition 2.24 by induction on , where for each it is proved by induction on . We will use the following simple recurrence bound.
Fact 2.26.
Let be a function satisfying and for some and . Then for some .
2.3. The base case
Let be a finite grid and a subset of grid-complexity . We will proceed by induction on .
If then . If is -free then one of the sets must have size at most . Hence .
Thus
Remark 2.27.
The same argument shows that for all .
Assume now that the theorem is proved for and all . Let , and .
We choose basic sets such that .
By Proposition 2.8, we can choose a finite linear order and functions and so that
For , and , let
We choose such that
For example we can take to be the minimal element in with . Then
Hence we conclude
This finishes the base case .
2.4. Induction step
We fix and assume that Proposition 2.24 holds for all pairs with and .
Definition 2.28.
Let be a finite -grid.
-
(1)
For integers , we say that a subset is of split grid-complexity if there are basic sets , a subset of grid-complexity , and a subset such that .
-
(2)
For and , let be the maximal size of a -free subset of an -grid of split grid-complexity with .
Remark 2.29.
-
(1)
Note that has grid-complexity at most , which is the reason we do not include a parameter for the grid-complexity of in the split grid-complexity of .
-
(2)
If is of grid-complexity , then it is of split grid-complexity .
-
(3)
If is of split grid-complexity , then it is of grid-complexity .
For the rest of the proof, we abuse notation slightly and refer to the “split grid-complexity” of a set as the “grid-complexity”. To complete the induction step we will prove the following Proposition.
Proposition 2.30.
For any integers there are and such that
We will use the following notations throughout the section:
-
•
is a finite grid with ;
-
•
is a subset of grid-complexity ;
-
•
is the -grid ;
-
•
is a subset of grid-complexity , , and are basic subsets such that .
We proceed by induction on .
The base case of Proposition 2.30.
In this case . If is -free then either is -free or .
In the first case, by induction hypothesis on , there are and such that . In the second case we have .
Since , the conclusion of the proposition follows with .
Induction step of Proposition 2.30. We assume now that the proposition holds for all pairs with and .
Given a tuple , we let . By Proposition 2.8, we can choose a finite linear order , a coordinate-wise monotone function and a function so that
Moreover, by Remark 2.9, we may assume without loss of generality that the coordinate-wise monotone function defining is given by
Definition 2.31.
Given an arbitrary set , we say that a set is an -strip in if
for some , . Likewise, given an arbitrary set , we say that is an -strip in if
for some , . If or , we simply say an -strip or -strip, respectively.
Remark 2.32.
Note the following:
-
(1)
is an -strip, and is an -strip;
-
(2)
every -strip is a subset of the -grid of grid-complexity (using Remark 2.7);
-
(3)
the intersection of any two -strips is an -strip; the same conclusion holds for -strips.
Definition 2.33.
-
(1)
We say that a subset is an -grid if , where is an -strip in and is an -strip in .
-
(2)
If is an -grid, we set
Note that if is a sub-grid of , then .
-
(3)
For an -grid , we will denote by the set .
The induction step for Proposition 2.30 will follow from the following proposition.
Proposition 2.34.
For every integer there are and such that, for any -grid , if the set is -free then
We should stress that in the above proposition and do not depend on , , , and but they may depend on our fixed and .
Given Proposition 2.34, we can apply it to the -grid (so ) and get
It is easy to see that , hence Proposition 2.30 follows with the same and .
We proceed with the proof of Proposition 2.34
Proof of Proposition 2.34.
Fix , and let be the maximal size of a -free set among all -grids with . We need to show that for some and we have
Let be an -grid with .
For and , let
and
Note that for every , is an -strip in , is an -strip in , and their product is an -grid.
Claim 2.35.
There is such that
Proof of Claim.
Let .
Let be the minimal element in with
Then and . Since , we have . The claim follows. ∎
Let be as in the claim. It is not hard to see that the following holds:
It follows that
Hence, by the choice of and using Remark 2.32(2),
Applying the induction hypothesis on and using Fact 2.26 we obtain for some and .
Finally, inspecting the proof, we have shown the following:
-
(1)
for all ;
-
(2)
for all and ;
-
(3)
for all .
Iterating (3), for every we have . Hence, by (2), for every and . Iterating this, we get . Using (1), this implies for all . Hence, by Remark 2.27 and (1) again, for all .
2.5. Some applications
We observe several immediate applications of Theorem 2.17, starting with the following bound for semilinear hypergraphs.
Corollary 2.36.
For every there exist some and satisfying the following.
For any semilinear -free -hypergraph of description complexity (see Definition 1.7), taking we have
Proof.
As a special case with , this implies a bound for the following incidence problem.
Corollary 2.37.
For every there exists some satisfying the following.
Let and be finitely many (closed or open) half-spaces in . Let be the (infinite) family of all possible polytopes in cut out by arbitrary translates of .
For any set of points in and any set of polytopes in , if the incidence graph on is -free, then it contains at most incidences.
Proof.
We can write
where and for depending on whether is an open or a closed half-space.
Every polytope is of the form for some , where is the translate of by the vector , i.e.
Then the incidence relation between points in and polytopes in can be identified with the semilinear set
defined by linear inequalities. The conclusion now follows by Corollary 2.36 with . ∎
In particular, we get a bound for the original question that motivated this paper.
Corollary 2.38.
Let be the family of all (closed or open) boxes in . Then for every there exists some satisfying the following.
For any set of points in and any set of boxes in , if the incidence graph on is -free, then it contains at most incidences.
Proof.
Immediate from Corollary 2.37, since we have half-spaces in so that every box in is cut out by the intersection of their translates. ∎
3. Lower bounds
While we do not know if the bound in Theorem 2.17 is optimal, in this section we show that at least the logarithmic factor is unavoidable already for the incidence relation between points and dyadic boxes in .
We describe a slightly more general construction first. Fix .
Definition 3.1.
Given finite tuples and with , say , we say that and have the same order-type over if
for all , and .
In other words, the tuples and have the same quantifier-free type over the set in the structure .
Remark 3.2.
Assume that is a finite set of points and is a finite set of -dimensional open boxes with axis-parallel sides, with incidences between and .
-
(1)
By perturbing and slightly, we may assume that for every , all points in have pairwise distinct th coordinates , and none of the points in belongs to the border of any of the boxes in , while the incidence graph between and remains unchanged.
-
(2)
Let be the tuple listing all corners of all boxes in . If is an arbitrary set of points with the same order-type as over , then the incidence graph on is isomorphic to the incidence graph on .
We have the following lemma for combining point-box incidence configurations in a higher-dimensional space.
Lemma 3.3.
Given any , assume that:
-
(1)
there exists a set of points with and a set of -dimensional boxes with , with incidences between them, and the incidence graph -free;
-
(2)
there exists a set of points with and a set of -dimensional boxes with , with incidences between them and the incidence graph -free.
Then there exists a set of points with and a set of -dimensional boxes with , so that there are incidences between and and their incidence graph is still -free.
Proof.
By Remark 3.2(1) we may assume that for every , all points in have pairwise distinct th coordinates, for every all points in have pairwise distinct th coordinates, and none of the points is on the border of any of the boxes. Write as . Let be the tuple listing all corners of all boxes in .
Using this, for each we can choose a very small -dimensional box with and such that: for any choice of points , we have that has the same order-type as over . In particular, all the ’s are pairwise disjoint, and the incidence graph between and is isomorphic to the incidence graph between and by Remark 3.2(2).
Contracting and translating while keeping the th coordinate unchanged, for each we can find a copy of the configuration entirely contained in the box , that is:
-
•
all points in and boxes in are contained in ;
-
•
the incidence graph on is isomorphic to the incidence graph on ;
-
•
for all , the th coordinate of every point in is the same as the th coordinate of the corresponding point in .
Let and , then and there are incidences between and .
Write as and as . As all of the th coordinates of the points in are pairwise disjoint, for each we can choose a small interval with , and so that all of the intervals are pairwise disjoint. For each and , we consider the -dimensional box . Let . For each and , contains exactly one point (given by the copy of in ), and the projection of onto the first coordinates is in . Hence the incidence graph between and is isomorphic to the incidence graph between and by the choice of the ’s, in particular the number of incidences is .
Finally, let , then . Note that for and any , i.e. no box in intersects any of the boxes in for . It is now not hard to check that the incidence graph between and is -free (by construction and the assumptions of -freeness of and ), and that there are incidences between and . ∎
Remark 3.4.
It follows from the proof that if all the boxes in and are dyadic (see Definition 4.6), then we can choose the boxes in to be dyadic as well.
Proposition 3.5.
For any , there exist a set of points and a set of dyadic boxes in such that their incidence graph is -free and the number of incidences is .
In particular, substituting , this shows that the number of incidences grows as .
Proof.
Given , assume that there exist -free ‘point – dyadic box’ configurations satisfying (1) and (2) in Lemma 3.3 for some parameters . Then, for any , we can iterate the lemma times and find a -free ‘point – dyadic box’ configuration in with points, dyadic boxes (Remark 3.4), and incidences.
In particular, let and let be arbitrary. We can start with (one dyadic interval containing points in ) and (one point and zero dyadic boxes in ). Taking , we then find a -free configuration with points, dyadic boxes and incidences. Hence for , we have points, boxes and incidences. ∎
Remark 3.6.
We remark that the construction in Lemma 3.3 cannot produce a -free configuration with more than incidences in for any .
Indeed, using the “coordinates” instead of , where the coordinates correspond to the number of points, boxes and incidences respectively, the lemma says that if is attainable in dimensions and is attainable in dimensions, then is attainable in dimensions. Thus, one adds the vector to . We want to maximize the second coordinate of this vector while keeping the first coordinate below , and the optimal way to do it essentially is to add times the vector , which increases by and gives the lower bound.
We thus ask whether in the ‘point-box’ incidence bound in the power of has to grow with the dimension (see Problem 1.3).
4. Dyadic rectangles
In this section we strengthen the bound on the number of incidences with rectangles on the plane with axis-parallel sides given by Corollary 2.38, i.e., , in the special case of dyadic rectangles, using a different argument (which relies on a certain partial order specific to the dyadic case).
4.1. Locally -linear orders
Throughout this section, let be a partially ordered set of size at most , and let be a collection of subsets of (possibly with repetitions) of size at most . As before, we let .
Definition 4.1.
We say that a set is -linear if it contains no antichains of size greater than , and is locally -linear if any interval is -linear.
Note that -linearity is preserved under removing points from .
Definition 4.2.
The collection is said to be a -free arrangement if for any , there are at most sets from containing all of them simultaneously.
Observe that if one removes any number of points from , or removes any number of sets from , one still obtains a -free arrangement. We now state the main theorem of this section.
Theorem 4.3.
Suppose is locally -linear, and is a -free arrangement of -linear subsets of . Then
.
To prove Theorem 4.3, we first need some definitions and a lemma. If , define a parent of to be an element with and no element between and , and similarly define a child of to be an element with and no element between and . We say that is a strict -descendant of if there are some elements such that is a child of , and that is a -descendant of if it is a strict -descendant for some .
Lemma 4.4.
Fix . Let be a -free arrangement of -linear subsets of , and let . Let denote the set of all elements in which have a -descendant with more than children. Then
Proof.
Let denote the set of elements such that every -descendant of has at most children. Then we can rearrange the desired inequality as
The quantity is counting incidences where and .
Given , call a point low if has no descending chain of length under it in . Every can contain at most low points. Indeed, as is -linear, it has at most minimal elements. Removing them, we obtain a -linear set such that every point in it contains an element under it in , and itself has at most minimal elements. Remove them to obtain a -linear set such that each point in it contains a descending chain of length under it in , etc.
Hence each contributes at most incidences with its low points, giving a total contribution of at most to the sum. If is not a low point on , then there are some in , with each one a child of the next one. As is a -free arrangement, among the sets there are at most containing all these points. By definition of , for each there are at most choices for such tuples . Hence is incident to at most sets for which it is not low, and the total number of contributions of incidences in this case is at most , so the claim follows. ∎
Now we prove Theorem 4.3. Let be a natural number to be chosen later, and be another parameter to be chosen later. Define the subsets
of by defining , and for each , defining to be the set of points in that have a -descendant with more than children in . By the above lemma, we have
for all , and hence on telescoping
Claim 4.5.
Let be a point in . Then it has at least distinct descendants in .
Proof.
By definition of there is some -descendant of which has at least children in . Let denote the set of children of , so . By reverse induction for we choose sets of descendants of so that . Then , as wanted.
Let be given. By definition of and pigeonhole principle, there is some and such that and every has a strict -descendant with at least children in . Fix a path of length connecting to , and for let denote the th element on the path (so , and is a child of ). Let , so . Then (otherwise there is some element which has at least different parents in , which would then form an antichain of size contradicting local -linearity of ). Hence
Now by hypothesis every element in has at least children in , denote the set of all the children of the elements in by . Then, again by -linearity, . ∎
Thus if we choose so that
then we will get a contradiction, unless is empty. We conclude, for such and , that
If we take and to be the integer part of , and assume that is sufficiently large relatively to and , then the claim follows.
4.2. Reduction for dyadic rectangles
Definition 4.6.
-
(1)
Define a dyadic interval to be a half-open interval of the form for integers ; we use to denote the length of such an interval.
-
(2)
Define a dyadic box in (dyadic rectangle when ) to be a product of dyadic intervals.
Note that if two dyadic intervals intersect, then one must be contained in the other.
Theorem 4.7.
Fix . Assume we have a collection of points in and a collection of dyadic rectangles in , with the property that the incidence graph contains no , and . Then the number of incidences with and is at most
Proof.
Suppose that we have some nested dyadic rectangles in . As the incidence graph is -free by hypothesis, may contain at most points from . Removing all such rectangles repeatedly we loose only incidences, and thus may assume that any nested sequence in is of length at most . In particular, any rectangle can be repeated at most times in . Then, possibly increasing the number of incidences by a multiple , we may assume that there are no repetitions in .
We now define a relation on by declaring if and . This is a locally -linear partial order (by the previous paragraph: antisymmetry holds as there are no repetitions in , and using that all rectangles are dyadic, any antichain of size inside an interval would give a nested sequence of rectangles of length ).
For each point in , let be a subset of consisting of all those rectangles in that contain ; then is a -linear set (again, any antichain gives a nested sequence of rectangles of the same length). Finally, , hence the collection is a -free arrangement and the claim now follows from Theorem 4.3 with . ∎
Remark 4.8.
For a non-dyadic rectangle , let denote the rectangle with the same center as R, but whose lengths and heights have been shrunk by a factor of . Define a “good incidence” to be a pair where is a point lying in , not just in . Then the dyadic bound in Theorem 4.7 implies that for a family of arbitrary (not necessarily dyadic) rectangles with no ’s, one still gets the -type bound for the number of good incidences.
The reason is as follows. First we can randomly translate and dilate (non-isotropically, with the horizontal and vertical coordinates dilated separately) the configuration of points and rectangles by some translation parameter and a pair of dilation parameters for each of the coordinates. While there is no invariant probability measure on the space of dilatations, one can for instance pick a large number (much larger than the number of points and rectangles, etc.), dilate horizontally by a random dilation between and (using say the Haar measure) making (with positive probability) the horizontal side length close to a power of two; then a vertical dilation will achieve a similar effect for the vertical side length; and then translate by a random amount in (chosen uniformly at random) placing the rectangle very close to a dyadic one with positive probability. If is a rectangle that is randomly dilated and translated this way, then with probability , there will be a dyadic rectangle stuck between and . If the original rectangles have no , then neither will these new dyadic rectangles. The expected number of incidences amongst the dyadic rectangles is at least times the number of good incidences amongst the original rectangles. Hence any incidence bound we get on dyadic rectangles implies the corresponding bound for good incidences for non-dyadic rectangles (losing a factor of ).
5. A connection to model-theoretic linearity
In this section we obtain a stronger bound in Theorem 2.17 (without the logarithmic factor) under a stronger assumption that the whole semilinear relation is -free (Corollary 5.12). And we show that if this stronger bound doesn’t hold for a given semialgebraic relation, then the field operations can be recovered from this relation (see Corollary 5.14 for the precise statement). These results are deduced in Section 5.2 from a more general model-theoretic theorem proved in Section 5.1.
5.1. Main theorem
We recall some standard model-theoretic notation and definitions, and refer to [marker2006model] for a general introduction to model theory, and to [berenstein2012weakly] for further details on geometric structures.
Recall that denotes the algebraic closure operator, i.e. if is a first-order structure, and is a finite tuple in , then if it belongs to some finite -definable subset of (this generalizes linear span in vector spaces and algebraic closure in fields). Throughout this section we follow the standard model theoretic notation: depending on the context, writing denotes either the union of two subsets of , or the tuple obtained by concatenating the (possibly infinite) tuples of elements of .
Definition 5.1.
A complete first-order theory in a language is geometric if for any model we have the following.
-
(1)
The algebraic closure in satisfies the Exchange Principle:
if are singletons in , and , then .
-
(2)
eliminates quantifier:
for every -formula with a single variable and a tuple of variables there exists some such that for every , if has more than solutions in , then it has infinitely many solutions in .
In models of a geometric theory, the algebraic closure operator gives rise to a matroid, and given a finite tuple in and , is the minimal cardinality of a subtuple of so that (in an algebraically closed field, this is just the transcendence degree of over the field generated by ). Finally, given a finite tuple and sets , we write to denote that .
Remark 5.2.
If is geometric, then it is easy to check that is an independence relation, i.e. it satisfies the following properties for all tuples and :
-
•
;
-
•
(extension) if and is arbitrary, then there exists some so that and (which means that belongs to exactly the same -definable subsets of as ).
-
•
(monotonicity) ;
-
•
(symmetry) ;
-
•
(transitivity) and ;
-
•
(non-degeneracy) if and , then .
The following property expresses that the matroid defined by the algebraic closure is linear, in the sense that the closure operator behaves more like span in vector spaces, as opposed to algebraic closure in fields.
Definition 5.3.
[berenstein2012weakly, Definition 2.1] A geometric theory is weakly locally modular if for any saturated and small subsets of there exists some small set such that .
Recall that a linearly ordered structure is -minimal if every definable subset of is a finite union of intervals (see e.g. [van1998tame]).
Example 5.4.
[berenstein2012weakly, Section 3.2] An -minimal structure is linear (i.e. any normal interpretable family of plane curves in has dimension ) if and only if it is weakly locally modular.
In particular, every theory of an ordered vector space over an ordered division ring is weakly locally modular (so Theorem 5.6 applies to semi-linear relations).
The following is a key model-theoretic lemma.
Lemma 5.5.
Assume that is geometric and weakly locally modular, and is -saturated. Assume that is an -ary relation defined by a formula with parameters in a finite tuple , and contains no -grid with each infinite. Then for any there exists some so that .
Proof of Lemma 5.5.
Assume not, then there exist some in such that , but for every , where .
By weak local modularity, for each there exists some small set so that
By extension of , we may assume that for all . Hence by transitivity , where .
Let .
Claim (A).
For every , .
Proof.
Fix . As and , by symmetry and transitivity we have
Note that for every , hence , and clearly . Hence , and in particular . ∎
Claim (B).
For every , .
Proof.
Fix . Then by definition. But as by transitivity, if then we would get , contradicting the assumption. ∎
By induction we will choose sequences of tuples in such that for every we have:
-
(1)
for all ;
-
(2)
(as tuples) for all ;
-
(3)
.
Fix , and assume that we already chose some sequences for satisfying (1)–(3).
Claim (C).
We have .
Proof.
If , this claim becomes , hence holds by Claim (A). So assume . We will show by induction that for each we have
For this is equivalent to , which holds by (3) for . So we assume this holds for , that is we have , and show it for . By assumption and transitivity we have
Also by (3) for . Then by transitivity again , which concludes the inductive step.
In particular, for we get , that is . By transitivity and Claim (A) this implies , and we conclude by symmetry. ∎
Using Claim (C) and extension of , we can choose a sequence so that and for every . By Claim (B) we have , hence , hence , so in particular all the tuples are pairwise-distinct and satisfies (1) and (2). By symmetry and transitivity of we get . This concludes the inductive step in the construction of the sequences.
Finally, as (1) holds for all and is contained in , it follows that , and hence for every . By (1) each of the sets is infinite — contradicting the assumption on . This concludes the proof of the lemma. ∎
Theorem 5.6.
Assume that is a geometric, weakly locally modular theory, and . Assume that and is an -formula without parameters, with . Then there exists some satisfying the following.
Given , consider the -ary relation
Then for every , exactly one of the following two cases must occur:
-
(1)
is not -free for any ;
-
(2)
for any finite -grid we have
Proof.
Assume that is an elementary extension of and . Then, for a fixed ,
is -free if and only if
is -free, as this can be expressed by a first-order formula and . Similarly, for a fixed , for every finite -grid if and only if for every finite -grid (as for every fixed sizes of this condition can be expressed by a first-order formula). Hence, passing to an elementary extension, we may assume that is -saturated.
As eliminates , there exists some such that for any , and tuple , the fiber
is finite if and only if it has size .
Given an arbitrary such that is -free, Lemma 5.5 and compactness imply that for every tuple , there exists some such that the fiber is finite, hence . This easily implies that for any finite -grid we have . ∎
Remark 5.7.
In the binary case, a similar observation was made by Evans in the context of certain stable theories [evans2005trivial, Proposition 3.1].
Restricting to distal structures, we can relax the assumption “ is -free for some ” to “ does not contain a direct product of infinite sets” in Theorem 5.6 (we refer to e.g. the introduction in [chernikov2018regularity] or [chernikov2020cutting] for a general discussion of model-theoretic distality and its connections to combinatorics).
Corollary 5.8.
Assume that is a distal, geometric, weakly locally modular theory, , and is an -formula without parameters, with . Then there exists some satisfying the following.
Assume that and the -ary relation does not contain an -grid with each infinite. Then for any finite -grid .
Proof.
By [chernikov2020cutting, Theorem 5.12], if is a distal structure with elimination of , then there exists some such that for every , is not -free if and only if for some infinite . The conclusion now follows by Theorem 5.6. ∎
Remark 5.9.
Weaker bounds for non-cartesian relations definable in arbitrary distal theories are established in [chernikov2018model, chernikov2021model].
Now we show that in the -minimal case, this result actually characterizes weak local modularity. By the trichotomy theorem in -minimal structures [peterzil1998trichotomy] we have the following equivalence.
Fact 5.10.
Let be an -minimal (-)saturated structure. The following are equivalent:
-
•
is not linear (see Example 5.4);
-
•
is not weakly locally modular;
-
•
there exists a real closed field definable in .
Corollary 5.11.
Let be an -minimal structure. The following are equivalent:
-
(1)
is weakly locally modular;
-
(2)
Corollary 5.8 holds in ;
-
(3)
for every and every definable (with parameters) , if is -free for some , then there exist some and such that: for any and with we have
-
(4)
there is no infinite field definable in .
Proof.
(1) (2) by Corollary 5.8, and (2) (3) is obvious.
(3) (4) Assume that is an infinite field definable in , by -minimality. Then the point-line incidence relation on corresponds to a -free definable relation for some . By the standard lower bound for Szemerédi-Trotter, the number of incidences satisfies , hence cannot satisfy (3).
(4) (1) If is not weakly locally modular, by Fact 5.10 a real closed field is definable in . ∎
5.2. Applications to semialgebraic relations
Corollary 5.12.
Assume that is semilinear and does not contain a direct product of infinite sets (e.g. if is -free for some ). Then there exists some so that for any -hypergraph of the form for some finite , with , we have .
Proof.
We recall the following special case of the trichotomy theorem in o-minimal structures restricted to semialgebraic relations.
Fact 5.13.
[marker1992additive, Theorem 1.3] Let be a semialgebraic but not semilinear set. Then (i.e. the graph of multiplication restricted to the unit box) is definable in the first-order structure .
Using it, we have the following more explicit variant of Corollary 5.11 in the semialgebraic case.
Corollary 5.14.
Let be a semialgebraic set, and consider the first-order structure . Then the following are equivalent.
-
(1)
For any and any -ary relation not containing an -grid with each infinite, there exists some so that for every finite -grid .
-
(2)
For every and definable (with parameters) in , if is -free for some , then there exist some and such that: for any and with we have
-
(3)
is not definable in .
Proof.
(1) (2) is obvious.
(2) (3) Using , the -free point-line incidence relation in is definable restricted to , and the standard configurations witnessing the lower bound in Szemerédi-Trotter can be scaled down to the unit box.