This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Zarankiewicz’s problem for semilinear hypergraphs

Abdul Basit Department of Mathematics
Iowa State University
Ames, IA, 50011, USA
[email protected]
Artem Chernikov Department of Mathematics
University of California Los Angeles
Los Angeles, CA 90095-1555
[email protected]
Sergei Starchenko Department of Mathematics
University of Notre Dame
Notre Dame, IN, 46656, USA
[email protected]
Terence Tao Department of Mathematics
University of California Los Angeles
Los Angeles, CA 90095-1555
[email protected]
 and  Chieu-Minh Tran Department of Mathematics
University of Notre Dame
Notre Dame, IN, 46656, USA
[email protected]
Abstract.

A bipartite graph H=(V1,V2;E)H=\left(V_{1},V_{2};E\right) with |V1|+|V2|=n|V_{1}|+|V_{2}|=n is semilinear if VidiV_{i}\subseteq\mathbb{R}^{d_{i}} for some did_{i} and the edge relation EE consists of the pairs of points (x1,x2)V1×V2(x_{1},x_{2})\in V_{1}\times V_{2} satisfying a fixed Boolean combination of ss linear equalities and inequalities in d1+d2d_{1}+d_{2} variables for some ss. We show that for a fixed kk, the number of edges in a Kk,kK_{k,k}-free semilinear HH is almost linear in nn, namely |E|=Os,k,ε(n1+ε)|E|=O_{s,k,\varepsilon}(n^{1+\varepsilon}) for any ε>0\varepsilon>0; and more generally, |E|=Os,k,r,ε(nr1+ε)|E|=O_{s,k,r,\varepsilon}(n^{r-1+\varepsilon}) for a Kk,,kK_{k,\ldots,k}-free semilinear rr-partite rr-uniform hypergraph.

As an application, we obtain the following incidence bound: given n1n_{1} points and n2n_{2} open boxes with axis parallel sides in d\mathbb{R}^{d} such that their incidence graph is Kk,kK_{k,k}-free, there can be at most Ok,ε(n1+ε)O_{k,\varepsilon}(n^{1+\varepsilon}) incidences. The same bound holds if instead of boxes one takes polytopes cut out by the translates of an arbitrary fixed finite set of halfspaces.

We also obtain matching upper and (superlinear) lower bounds in the case of dyadic boxes on the plane, and point out some connections to the model-theoretic trichotomy in oo-minimal structures (showing that the failure of an almost linear bound for some definable graph allows one to recover the field operations from that graph in a definable manner).

1. Introduction

We fix r2r\in\mathbb{N}_{\geq 2} and let H=(V1,,Vr;E)H=\left(V_{1},\ldots,V_{r};E\right) be an rr-partite and rr-uniform hypergraph (or just rr-hypergraph for brevity) with vertex sets V1,,VrV_{1},\ldots,V_{r} having |Vi|=ni|V_{i}|=n_{i}, (hyper-) edge set EE, and n=i=1rnin=\sum_{i=1}^{r}n_{i} being the total number of vertices.

Zarankiewicz’s problem asks for the maximum number of edges in such a hypergraph HH (as a function of n1,,nrn_{1},\ldots,n_{r}) assuming that it does not contain the complete rr-hypergraph Kk,,kK_{k,\ldots,k} with k>0k>0 a fixed number of vertices in each part. The following classical upper bound is due to Kővári, Sós and Turán [kovari1954problem] for r=2r=2 and Erdős [erdos1964extremal] for a general rr: if HH is Kk,,kK_{k,\ldots,k}-free, then |E|=Or,k(nr1kr1)|E|=O_{r,k}\left(n^{r-\frac{1}{k^{r-1}}}\right). A probabilistic construction in [erdos1964extremal] also shows that the exponent cannot be substantially improved.

However, stronger bounds are known for restricted families of hypergraphs arising in geometric settings. For example, if HH is the incidence graph of a set of n1n_{1} points and n2n_{2} lines in 2\mathbb{R}^{2}, then HH is K2,2K_{2,2}-free, and Kővári-Sós-Turán Theorem implies |E|=O(n3/2)|E|=O(n^{3/2}). The Szemerédi-Trotter Theorem [szemeredi1983extremal] improves this and gives the optimal bound |E|=O(n4/3)|E|=O(n^{4/3}). More generally, [fox2017semi] gives improved bounds for semialgebraic graphs of bounded description complexity. This is generalized to semialgebraic hypergraphs in [do2018zarankiewicz]. In a different direction, the results in [fox2017semi] are generalized to graphs definable in oo-minimal structures in [basu2018minimal] and, more generally, in distal structures in [chernikov2020cutting].

A related highly nontrivial problem is to understand when the bounds offered by the results in the preceding paragraph are sharp. When HH is the incidence graph of n1n_{1} points and n2n_{2} circles of unit radius in 2\mathbb{R}^{2}, the best known upper bound is |E|=O(n4/3)|E|=O(n^{4/3}), proven in [spencer1984unit] and also implied by the general bound for semialgebraic graphs. Any improvement to this bound will be a step toward resolving the long standing unit distance conjecture of Erdős (an almost linear bound of the form |E|=O(n1+c/loglogn)|E|=O(n^{1+c/\log\log n}) will positively resolve it).

This paper was originally motivated by the following incidence problem. Let HH be the incidence graph of a set of n1n_{1} points and a set of n2n_{2} solid rectangles with axis-parallel sides (which we refer to as boxes) in 2\mathbb{R}^{2}. Assuming that HH is K2,2K_{2,2}-free, i.e. no two points belong to two rectangles simultaneously, what is the maximum number of incidences |E||E|? In the following theorem, we obtain an almost linear bound (which is much stronger than the bound implied by the aforementioned general result for semialgebraic graphs) and demonstrate that it is close to optimal.

Theorem (A).
  1. (1)

    For any set PP of n1n_{1} points in 2\mathbb{R}^{2} and any set RR of n2n_{2} boxes in 2\mathbb{R}^{2}, if the incidence graph on P×RP\times R is Kk,kK_{k,k}-free, then it contains at most Ok(nlog4(n))O_{k}\left(n\log^{4}(n)\right) incidences (Corollary 2.38 with d=2d=2).

  2. (2)

    If all boxes in RR are dyadic (i.e. direct products of intervals of the form [s2t,(s+1)2t)[s2^{t},(s+1)2^{t}) for some integers s,ts,t), then the number of incidences is at most Ok(nlog(100+n1)loglog(100+n1))O_{k}\left(n\frac{\log(100+n_{1})}{\log\log(100+n_{1})}\right) (Theorem 4.7).

  3. (3)

    For an arbitrarily large nn, there exist a set of nn points and nn dyadic boxes in 2\mathbb{R}^{2} so that the incidence graph is K2,2K_{2,2}-free and the number of incidences is Ω(nlog(n)loglog(n))\Omega\left(n\frac{\log(n)}{\log\log(n)}\right) (Proposition 3.5).

Problem 1.1.

While the bound for dyadic boxes is tight, we leave it as an open problem to close the gap between the upper and the lower bounds for arbitrary boxes.

Remark 1.2.

A related result in [fox2008separator] demonstrates that every Kk,kK_{k,k}-free intersection graph of nn convex sets on the plane satisfies |E|=Ok(n)|E|=O_{k}(n). Note that in Theorem (B) we consider a Kk,kK_{k,k}-free bipartite graph, so in particular there is no restriction on the intersection graph of the boxes in RR.

Theorem (A.1) admits the following generalization to higher dimensions and more general polytopes.

Theorem (B).
  1. (1)

    For any set PP of n1n_{1} points and any set BB of n2n_{2} boxes in d\mathbb{R}^{d}, if the incidence graph on P×BP\times B is Kk,kK_{k,k}-free, then it contains at most Od,k(nlog2dn)O_{d,k}\left(n\log^{2d}n\right) incidences (Corollary 2.38).

  2. (2)

    More generally, given finitely many half-spaces H1,,HsH_{1},\ldots,H_{s} in d\mathbb{R}^{d}, let \mathcal{F} be the family of all possible polytopes in d\mathbb{R}^{d} cut out by arbitrary translates of H1,,HsH_{1},\ldots,H_{s}. Then for any set PP of n1n_{1} points in d\mathbb{R}^{d} and any set FF of n2n_{2} polytopes in \mathcal{F}, if the incidence graph on P×FP\times F is Kk,kK_{k,k}-free, then it contains at most Ok,s(nlogsn)O_{k,s}\left(n\log^{s}n\right) incidences (Corollary 2.37).

Problem 1.3.

What is the optimal bound on the power of logn\log n in Theorem (B)? In particular, does it actually have to grow with the dimension dd?

Remark 1.4.

A bound similar to Theorem (B.1) and an improved bound for Theorem (A.1) in the K2,2K_{2,2}-free case are established independently by Tomon and Zakharov in [Tomon], in which the authors also use our Theorem (A.3) to provide a counterexample to a conjecture of Alon et al. [alon2015separation] about the number of edges in a graph of bounded separation dimension, as well as to a conjecture of Kostochka from [kostochka2004coloring]. Some further Ramsey properties of semilinear graphs are demonstrated by Tomon in [tomon2021ramsey].

The upper bounds in Theorems (A.1) and (B) are obtained as immediate applications of a general upper bound for Zarankiewicz’s problem for semilinear hypergraphs of bounded description complexity.

Definition 1.5.

Let VV be an ordered vector space over an ordered division ring RR (e.g. \mathbb{R} viewed as a vector space over itself). A set XVdX\subseteq V^{d} is semilinear, of description complexity (s,t)(s,t) if XX is a union of at most tt sets of the form

{x¯Vd:f1(x¯)0,,fp(x¯)0,fp+1(x¯)<0,,fs(x¯)<0},\left\{\bar{x}\in V^{d}:f_{1}\left(\bar{x}\right)\leq 0,\ldots,f_{p}\left(\bar{x}\right)\leq 0,f_{p+1}\left(\bar{x}\right)<0,\ldots,f_{s}\left(\bar{x}\right)<0\right\}\mbox{,}

where psp\leq s\in\mathbb{N} and each fi:VdVf_{i}:V^{d}\to V is a linear function, i.e., of the form

f(x1,,xd)=λ1x1++λdxd+af\left(x_{1},\ldots,x_{d}\right)=\lambda_{1}x_{1}+\ldots+\lambda_{d}x_{d}+a

for some λiR\lambda_{i}\in R and aVa\in V.

We focus on the case V=R=V=R=\mathbb{R} in the introduction, in which case these are precisely the semialgebraic sets that can be defined using only linear polynomials.

Remark 1.6.

By a standard quantifier elimination result [van1998tame, §7], every set definable in an ordered vector space over an ordered division ring, in the sense of model theory, is semilinear (equivalently, a projection of a semilinear set is a finite union of semilinear sets).

Definition 1.7.

We say that an rr-hypergraph HH is semilinear, of description complexity (s,t)(s,t) if there exist some di,Vidid_{i}\in\mathbb{N},V_{i}\subseteq\mathbb{R}^{d_{i}} and a semilinear set Xd=i[r]diX\subseteq\mathbb{R}^{d}=\prod_{i\in[r]}\mathbb{R}^{d_{i}} of description complexity (s,t)(s,t) so that HH is isomorphic to the rr-hypergraph (V1,,Vr;Xi[r]Vi)\left(V_{1},\ldots,V_{r};X\cap\prod_{i\in[r]}V_{i}\right).

We stress that there is no restriction on the dimensions did_{i} in this definition. We obtain the following general upper bound.

Theorem (C).

If HH is a semilinear rr-hypergraph of description complexity (s,t)(s,t) and HH is Kk,,kK_{k,\ldots,k}-free, then

|E|=Or,s,t,k(nr1logs(2r11)(n)).|E|=O_{r,s,t,k}\left(n^{r-1}\log^{s(2^{r-1}-1)}(n)\right).

In particular |E|=Or,s,t,k,ε(nr1+ε)|E|=O_{r,s,t,k,\varepsilon}\left(n^{r-1+\varepsilon}\right) for any ε>0\varepsilon>0 in this case. For a more precise statement, see Corollary 2.36 (in particular, the dependence of the constant in Or,s,t,kO_{r,s,t,k} on kk is at most linear).

Remark 1.8.

It is demonstrated in [mustafa2015zarankiewicz] that a similar bound holds in the situation when HH is the intersection hypergraph of (d1)(d-1)-dimensional simplices in d\mathbb{R}^{d}.

One can get rid of the logarithmic factor entirely by restricting to the family of all finite rr-hypergraphs induced by a given Kk,,kK_{k,\ldots,k}-free semilinear relation (as opposed to all Kk,,kK_{k,\ldots,k}-free rr-hypergraphs induced by a given arbitrary semilinear relation as in Theorem (C)).

Theorem (D).

Assume that Xd=i[r]diX\subseteq\mathbb{R}^{d}=\prod_{i\in[r]}\mathbb{R}^{d_{i}} is semilinear and XX does not contain the direct product of rr infinite sets (e.g. if XX is Kk,,kK_{k,\ldots,k}-free for some kk). Then for any rr-hypergraph HH of the form (V1,,Vr;Xi[r]Vi)\left(V_{1},\ldots,V_{r};X\cap\prod_{i\in[r]}V_{i}\right) for some finite VidiV_{i}\subseteq\mathbb{R}^{d_{i}}, we have |E|=OX(nr1)|E|=O_{X}(n^{r-1}).

This is Corollary 5.12 and follows from a more general Theorem 5.6 connecting linear Zarankiewicz bounds to a model-theoretic notion of linearity of a first-order structure (in the sense that the matroid given by the algebraic closure operator behaves like the linear span in a vector space, as opposed to the algebraic closure in an algebraically closed field — see Definition 5.3).

In particular, for every Kk,kK_{k,k}-free semilinear relation Xd1×d2X\subseteq\mathbb{R}^{d_{1}}\times\mathbb{R}^{d_{2}} (equivalently, XX definable with parameters in the first-order structure (,<,+)(\mathbb{R},<,+) by Remark 1.6) we have |X(V1×V2)|=O(n)|X\cap(V_{1}\times V_{2})|=O(n) for all Viidi,|Vi|=ni,n=n1+n2V_{i}\subseteq\mathbb{R}^{d_{i}}_{i},|V_{i}|=n_{i},n=n_{1}+n_{2}. One the other hand, by optimality of the Szemerédi-Trotter bound, for the semialgebraic K2,2K_{2,2}-free point-line incidence graph X={(x1,x2;y1,y2)4:x2=y1x1+y2}2×2X=\{(x_{1},x_{2};y_{1},y_{2})\in\mathbb{R}^{4}:x_{2}=y_{1}x_{1}+y_{2}\}\subseteq\mathbb{R}^{2}\times\mathbb{R}^{2} we have |X(V1×V2)|=Ω(n43)|X\cap(V_{1}\times V_{2})|=\Omega(n^{\frac{4}{3}}). Note that in order to define it we use both addition and multiplication, i.e. the field structure. This is not coincidental — as a consequence of the trichotomy theorem in oo-minimal structures [peterzil1998trichotomy], we observe that the failure of a linear Zarankiewicz bound always allows to recover the field in a definable way (Corollary 5.11). In the semialgebraic case, we have the following corollary that is easy to state (Corollary 5.14).

Theorem (E).

Assume that Xd=i[r]diX\subseteq\mathbb{R}^{d}=\prod_{i\in[r]}\mathbb{R}^{d_{i}} for some r,dir,d_{i}\in\mathbb{N} is semialgebraic and Kk,,kK_{k,\ldots,k}-free, but |Xi[r]Vi|O(nr1)|X\cap\prod_{i\in[r]}V_{i}|\neq O(n^{r-1}). Then the graph of multiplication ×[0,1]\times\restriction_{[0,1]} restricted to the unit box is definable in (,<,+,X)(\mathbb{R},<,+,X).

We conclude with a brief overview of the paper.

In Section 2 we introduce a more general class of hypergraphs definable in terms of coordinate-wise monotone functions (Definition 2.1) and prove an upper Zarankiewicz bound for it (Theorem 2.17). Theorems (A.1), (B) and (C) are then deduced from it in Section 2.5.

In Section 3 we prove Theorem (A.3) by establishing a lower bound on the number of incidences between points and dyadic boxes on the plane, demonstrating that the logarithmic factor is unavoidable (Proposition 3.5).

In Section 4, we establish Theorem (A.2) by obtaining a stronger bound on the number of incidences with dyadic boxes on the plane (Theorem 4.7). We use a different argument relying on a certain partial order specific to the dyadic case to reduce from log4(n)\log^{4}(n) given by the general theorem above to log(n)\log(n). Up to a constant factor, this implies the same bound for incidences with general boxes when one only counts incidences that are bounded away from the border (Remark 4.8).

Finally, in Section 5, we prove a general Zarankiewicz bound for definable relations in weakly locally modular geometric first-order structures (Theorem 5.6), deduce Theorem (D) from it (Corollary 5.12) and observe how to recover a real closed field from the failure of Theorem (D) in the oo-minimal case (Corollary 5.11).

Acknowledgements

We thank the referees for their very helpful suggestions on improving the paper. Artem Chernikov was partially supported by the NSF CAREER grant DMS-1651321 and by a Simons Fellowship. He is grateful to Adam Sheffer for some very helpful conversations, and to the American Insitute of Mathematics for additional support. Sergei Starchenko was supported by the NSF Research Grant DMS-1800806. Terence Tao was partially supported by NSF grant DMS-1764034 and by a Simons Investigator Award.

2. Upper bounds

2.1. Coordinate-wise monotone functions and basic sets

For an integer r>0r\in\mathbb{N}_{>0}, by an rr-grid (or a grid if rr is clear from the context) we mean a cartesian product B=B1××BrB=B_{1}{\times}\dotsb{\times}B_{r} of some sets B1,,BrB_{1},\ldots,B_{r}. As usual, [r][r] denotes the set {1,2,,r}\left\{1,2,\ldots,r\right\}.

If B=B1××BrB=B_{1}{\times}\dotsb{\times}B_{r} is a grid, then by a sub-grid we mean a subset CBC\subseteq B of the form C=C1××CrC=C_{1}\times\dotsb\times C_{r} for some CiBiC_{i}\subseteq B_{i}.

Let BB be an rr-grid, SS an arbitrary set and f:BSf:B\to S a function. For i[r]i\in[r], set

Bi=B1×Bi1×Bi+1××Br,B^{i}=B_{1}\times\cdots B_{i-1}\times B_{i+1}\times\cdots\times B_{r},

and let πi:BBi\pi_{i}:B\to B_{i} and πi:BBi\pi^{i}:B\to B^{i} be the projection maps.

For aBia\in B^{i} and bBib\in B_{i}, we write aiba\oplus_{i}b for the element cBc\in B with πi(c)=a\pi^{i}(c)=a and πi(c)=b\pi_{i}(c)=b. In particular, when i=ri=r, arb=(a,b)a\oplus_{r}b=(a,b).

Definition 2.1.

Let BB be an rr-grid and (S,<)(S,<) a linearly ordered set. A function f:BSf\colon B\to S is coordinate-wise monotone if for any i[r]i\in[r], a,aBia,a^{\prime}\in B^{i} and b,bBib,b^{\prime}\in B_{i} we have

f(aib)f(aib)f(aib)f(aib).f(a\oplus_{i}b)\leq f(a\oplus_{i}b^{\prime})\Longleftrightarrow f(a^{\prime}\oplus_{i}b)\leq f({a^{\prime}}\oplus_{i}b^{\prime}).
Remark 2.2.

Let B=B1××BrB=B_{1}{\times}\dotsb{\times}B_{r} be an rr-grid and Γ\Gamma an ordered abelian group. We say that a function f:BΓf\colon B\to\Gamma is quasi-linear if there exist some functions fi:BiΓf_{i}\colon B_{i}\to\Gamma, i[r]i\in[r], such that

f(x1,,xr)=f1(x1)++fr(xr).f(x_{1},\dotsc,x_{r})=f_{1}(x_{1})+\dotsb+f_{r}(x_{r}).

Then every quasi-linear function is coordinate-wise monotone (as f(aib)f(aib)fi(b)fi(b)f(a\oplus_{i}b)\leq f({a}\oplus_{i}b^{\prime})\Leftrightarrow f_{i}(b)\leq f_{i}(b^{\prime}) for any aBia\in B^{i}).

Example 2.3.

Suppose that VV is an ordered vector space over an ordered division ring RR, did_{i}\in\mathbb{N} for i[r]i\in[r], and f:Vd1××VdrVf:V^{d_{1}}\times\cdots\times V^{d_{r}}\to V is a linear function. Then ff is obviously quasi-linear, hence coordinate-wise monotone.

Remark 2.4.

Let BB be a grid and CBC\subseteq B a sub-grid. If f:BSf\colon B\to S is a coordinate-wise monotone function then the restriction fCf{\restriction C} is a coordinate-wise monotone function on CC.

Definition 2.5.

Let BB be an rr-grid. A subset XBX\subseteq B is a basic set if there exists a linearly ordered set (S,<)(S,<), a coordinate-wise monotone function f:BSf\colon B\to S and lSl\in S such that X={bB:f(b)<l}X=\left\{b\in B\colon f(b)<l\right\}.

Remark 2.6.

If r=1r=1, then every subset of B=B1B=B_{1} is basic.

Remark 2.7.

If XBX\subseteq B is given by X={bB:f(b)l}X=\left\{b\in B\colon f(b)\leq l\right\} for some coordinate-wise monotone function f:BSf\colon B\to S, then XX is a basic set as well. Indeed, we can just add a new element ll^{\prime} to SS so that it is a successor of ll, then X={bB:f(b)<l}X=\left\{b\in B:f(b)<l^{\prime}\right\}.

Similarly, the sets {bB:f(b)>l},{bB:f(b)l}\left\{b\in B\colon f(b)>l\right\},\left\{b\in B\colon f(b)\geq l\right\} are basic, by inverting the order on SS.

We have the following “coordinate-splitting” presentation for basic sets.

Proposition 2.8.

Let B=B1××BrB=B_{1}{\times}\dotsb{\times}B_{r} be an rr-grid and XBX\subseteq B a basic set. Then there is a linearly ordered set (S,<)(S,<), a coordinate-wise monotone function fr:BrSf^{r}\colon B^{r}\to S and a function fr:BrSf_{r}\colon B_{r}\to S such that X={brrbr:fr(br)<fr(br)}X=\left\{b^{r}\oplus_{r}b_{r}\colon f^{r}(b^{r})<f_{r}(b_{r})\right\}.

Remark 2.9.

The converse of this proposition is also true: an arbitrary linear order (S,<)(S,<) can be realized as a subset of some ordered abelian group (G,+,<)(G,+,<) with the induced ordering (we can take G:=G:=\mathbb{Q} when SS is at most countable); then define f:BSf:B\to S by setting

f(brrbr):=fr(br)fr(br), and l:=0.f(b^{r}\oplus_{r}b_{r}):=f^{r}(b^{r})-f_{r}(b_{r}),\mbox{ and }l:=0.
Proof of Proposition 2.8.

Assume that we are given a coordinate-wise monotone function f:BSf\colon B\to S and lSl\in S with X={bB:f(b)<l}X=\left\{b\in B\colon f(b)<l\right\}.

For i[r]i\in[r], let i\leq_{i} be the pre-order on BiB_{i} induced by ff, namely for b,bBib,b^{\prime}\in B_{i} we set bibb\leq_{i}b^{\prime} if and only if for some (equivalently, any) aBia\in B^{i} we have f(aib)f(aib)f(a\oplus_{i}b)\leq f(a\oplus_{i}b^{\prime}).

Quotienting BiB_{i} by the equivalence relation corresponding to the pre-order i\leq_{i} if needed, we may assume that each i\leq_{i} is actually a linear order.

Let <r<^{r} be the partial order on BrB^{r} with (b1,,br1)<r(b1,,br1)(b_{1},\dotsc,b_{r-1})<^{r}(b^{\prime}_{1},\dotsc,b^{\prime}_{r-1}) if and only if

(b1,,br1)(b1,,br1) and bjjbj for all j[r1].(b_{1},\dotsc,b_{r-1})\neq(b^{\prime}_{1},\dotsc,b^{\prime}_{r-1})\text{ and }b_{j}\leq_{j}b^{\prime}_{j}\text{ for all }j\in[r-1].

Let T:=Br˙BrT:=B^{r}\dot{\cup}B_{r}, where ˙\dot{\cup} denotes the disjoint union. Clearly <r<^{r} is a strict partial order on TT, i.e. a transitive and anti-symmetric (hence irreflexive) relation.

For any brBrb^{r}\in B^{r} and brBrb_{r}\in B_{r} we define

brbr if f(brrbr)<l, and brbr otherwise.b^{r}\triangleleft b_{r}\text{ if }f(b^{r}\oplus_{r}b_{r})<l\text{, and }b_{r}\triangleleft b^{r}\text{ otherwise}.
Claim 2.10.

Let a1,a2Bra_{1},a_{2}\in B^{r}, and b1,b2Brb_{1},b_{2}\in B_{r}.

  1. (1)

    If a1b1a2b2a_{1}\triangleleft b_{1}\triangleleft a_{2}\triangleleft b_{2}, then b2<rb1b_{2}<_{r}b_{1} and a1b2a_{1}\triangleleft b_{2}.

  2. (2)

    If b1a1b2a2b_{1}\triangleleft a_{1}\triangleleft b_{2}\triangleleft a_{2}, then b2<rb1b_{2}<_{r}b_{1} and b1a2b_{1}\triangleleft a_{2}.

Proof.

(1)(1). We have f(a2rb1)lf(a_{2}\oplus_{r}b_{1})\geq l and f(a2rb2)<lf(a_{2}\oplus_{r}b_{2})<l, hence b2<rb1b_{2}<_{r}b_{1}.

Since f(a1rb1)<lf(a_{1}\oplus_{r}b_{1})<l and b2<rb1b_{2}<_{r}b_{1} we also have f(a1rb2)<lf(a_{1}\oplus_{r}b_{2})<l.

(2)(2) is similar. ∎

Let t\triangleleft^{t} be the transitive closure of \triangleleft. It follows from the above claim that t=\triangleleft^{t}=\triangleleft\cup\triangleleft{\circ}\triangleleft. More explicitly, for b1,b2Brb_{1},b_{2}\in B_{r}, b1tb2b_{1}\triangleleft^{t}b_{2} if b2<rb1b_{2}<_{r}b_{1}, and for a1,a2Bra_{1},a_{2}\in B^{r}, a1ta2a_{1}\triangleleft^{t}a_{2} if f(a1b)<l<f(a2b)f(a_{1}\oplus b)<l<f(a_{2}\oplus b) for some bBrb\in B_{r}. It is not hard to see then that t\triangleleft^{t} is anti-symmetric, hence it is a strict partial order on TT.

Claim 2.11.

The union <rt<^{r}\cup\triangleleft^{t} is a strict partial order on TT.

Proof.

We first show transitivity. Note that <r<^{r} and t\triangleleft^{t} are both transitive, so it suffices to show for x,y,zTx,y,z\in T that if either x<rytzx<^{r}y\triangleleft^{t}z or xty<rzx\triangleleft^{t}y<^{r}z, then xtzx\triangleleft^{t}z. Furthermore, since t=\triangleleft^{t}=\triangleleft\cup\triangleleft{\circ}\triangleleft, we may restrict our attention to the following cases. If a1<ra2ba_{1}<^{r}a_{2}\triangleleft b with a1,a2Bra_{1},a_{2}\in B^{r} and bBrb\in B_{r}, then f(a1rb)<f(a2rb)<lf(a_{1}\oplus_{r}b)<f(a_{2}\oplus_{r}b)<l, and so a1ba_{1}\triangleleft b. If ba1<ra2b\triangleleft a_{1}<^{r}a_{2} with a1,a2Bra_{1},a_{2}\in B^{r} and bBrb\in B_{r}, then f(a2rb)>f(a1rb)lf(a_{2}\oplus_{r}b)>f(a_{1}\oplus_{r}b)\geq l, and so ba2b\triangleleft a_{2}.

To check anti-symmetry, assume a1<ra2a_{1}<^{r}a_{2} and a2ta1a_{2}\triangleleft^{t}a_{1}. Since a1,a2Bra_{1},a_{2}\in B^{r} we have a2ba1a_{2}\triangleleft b\triangleleft a_{1} for some bBrb\in B_{r}. We have f(a1rb)l>f(a2rb)f(a_{1}\oplus_{r}b)\geq l>f(a_{2}\oplus_{r}b), contradicting a1<ra2a_{1}<^{r}a_{2}. ∎

Finally, let \prec be an arbitrary linear order on T=Br˙BrT=B^{r}\dot{\cup}B_{r} extending <rt<^{r}\cup\triangleleft^{t}. Since \prec extends \triangleleft, for aBra\in B^{r} and bBrb\in B_{r} we have (a,b)X(a,b)\in X if and only if aba\prec b.

We take fr:BrTf^{r}\colon B^{r}\to T and fr:BrTf_{r}\colon B_{r}\to T to be the identity maps. Since \prec extends <r<^{r}, the map frf^{r} is coordinate-wise monotone. ∎

2.2. Main theorem

Definition 2.12.

Let B=B1××BrB=B_{1}{\times}\dotsb{\times}B_{r} be an rr-grid.

  1. (1)

    Given ss\in\mathbb{N}, we say that a set XBX\subseteq B has grid-complexity ss (in BB) if XX is the intersection of BB with at most ss basic subsets of BB.

    We say that XX has finite grid-complexity if it has grid-complexity ss for some ss\in\mathbb{N}.

  2. (2)

    For integers k1,krk_{1},\dotsc k_{r} we say that XBX\subseteq B is Kk1,,krK_{k_{1},\dotsc,k_{r}}-free is XX does not contain a sub-grid C1××CrSC_{1}\times\dotsb\times C_{r}\subseteq S with |Ci|=ki|C_{i}|=k_{i}.

In particular, BB itself is the only subset of BB of grid-complexity 0.

Example 2.13.

Suppose that VV is an ordered vector space over an ordered division ring, d=d1++drd=d_{1}+\ldots+d_{r}\in\mathbb{N} and

X={x¯Vd:f1(x¯)0,,fp(x¯)0,fp+1(x¯)<0,,fs(x¯)<0},X=\left\{\bar{x}\in V^{d}:f_{1}\left(\bar{x}\right)\leq 0,\ldots,f_{p}\left(\bar{x}\right)\leq 0,f_{p+1}\left(\bar{x}\right)<0,\ldots,f_{s}\left(\bar{x}\right)<0\right\}\mbox{,}

for some linear functions fi:VdV,i[s]f_{i}:V^{d}\to V,i\in[s]. Then each fif_{i} is coordinate-wise monotone (Example 2.3), hence each of the sets

{x¯Vd:fi(x¯)<0},{x¯Vd:fi(x¯)0}\left\{\bar{x}\in V^{d}:f_{i}(\bar{x})<0\right\},\left\{\bar{x}\in V^{d}:f_{i}(\bar{x})\leq 0\right\}

is a basic subset of the grid Vd1××VdrV^{d_{1}}\times\ldots\times V^{d_{r}} (the latter by Remark 2.7), and XVd1××VdrX\subseteq V^{d_{1}}\times\ldots\times V^{d_{r}} as an intersection of these ss basic sets has grid-complexity ss.

Remark 2.14.
  1. (1)

    Let BB be an rr-grid and ABA\subseteq B a subset of BB of grid-complexity ss. If CBC\subseteq B is a sub-grid containing AA, then AA is also a subset of CC of grid-complexity ss.

  2. (2)

    In particular, if ABA\subseteq B is a subset of grid-complexity ss, then AA is a subset of grid-complexity ss of the grid A1××ArA_{1}{\times}\dotsb{\times}A_{r}, where Ai:=πi(A)A_{i}:=\pi_{i}(A) is the projection of AA on BiB_{i} (it is the smallest sub-grid of BB containing AA).

Definition 2.15.

Let B=B1××BrB=B_{1}{\times}\dotsb{\times}B_{r} be a finite rr-grid and ni:=|Bi|n_{i}:=|B_{i}|. For j{0,r}j\in\{0,\dotsc r\}, we will denote by δjr(B)\delta_{j}^{r}(B) the integer

δjr(B):=i1<i2<<ij[r]ni1ni2nij.\delta_{j}^{r}(B):=\sum_{i_{1}<i_{2}<\dotsb<i_{j}\in[r]}n_{i_{1}}\cdot n_{i_{2}}\cdot\ldots\cdot n_{i_{j}}.
Example 2.16.

We have δ0r(B)=1\delta^{r}_{0}(B)=1, δ1r(B)=n1++nr\delta^{r}_{1}(B)=n_{1}+\dotsb+n_{r}, δrr(B)=n1n2nr\delta_{r}^{r}(B)=n_{1}n_{2}\dotsb n_{r}.

We can now state the main theorem.

Theorem 2.17.

For every integers r2,s0,k2r\geq 2,s\geq 0,k\geq 2 there are α=α(r,s,k)\alpha=\alpha(r,s,k)\in\mathbb{R} and β=β(r,s)\beta=\beta(r,s)\in\mathbb{N} such that: for any finite rr-grid BB and Kk,,kK_{k,\dotsc,k}-free subset ABA\subseteq B of grid-complexity ss we have

|A|αδr1r(B)logβ(δr1r(B)+1).|A|\leq\alpha\delta^{r}_{r-1}(B)\log^{\beta}\left(\delta^{r}_{r-1}(B)+1\right).

Moreover, we can take β(r,s):=s(2r11)\beta(r,s):=s(2^{r-1}-1).

Remark 2.18.

Inspecting the proof, it can be verified that the dependence of α\alpha on kk is at most linear.

Remark 2.19.

We use logβ(δr1r(B)+1)\log^{\beta}\left(\delta^{r}_{r-1}(B)+1\right) instead of logβ(δr1r(B))\log^{\beta}\left(\delta^{r}_{r-1}(B)\right) to include the case δr1r(B)1\delta^{r}_{r-1}(B)\leq 1.

Remark 2.20.

If in Theorem 2.17 AA is only assumed to be a union of at most tt sets of grid-complexity ss, then the same bound holds with α:=tα\alpha^{\prime}:=t\cdot\alpha (if A=i[t]AiA=\bigcup_{i\in[t]}A_{i} is Kk,,kK_{k,\dotsc,k}-free, then each AiA_{i} is also Kk,,kK_{k,\dotsc,k}-free, so we can apply Theorem 2.17 to each of the AiA_{i}’s and bound |A||A| by the sum of their bounds).

Definition 2.21.

Let B=B1××BrB=B_{1}{\times}\dotsb{\times}B_{r} be a grid. We extend the definition of δjr\delta^{r}_{j} to arbitrary finite subsets of BB as follows. Let ABA\subseteq B be a finite subset, and let Ai:=πi(A)A_{i}:=\pi_{i}(A), i[r]i\in[r], be the projections of AA. We define δjr(A):=δjr(A1××Ar)\delta^{r}_{j}(A):=\delta^{r}_{j}(A_{1}{\times}\dotsb{\times}A_{r}).

If BB is a finite rr-grid and ABA\subseteq B, then obviously δjr(A)δjr(B)\delta^{r}_{j}(A)\leq\delta^{r}_{j}(B). Thus Theorem 2.17 is equivalent to the following.

Proposition 2.22.

For every integers r2,s0,k2r\geq 2,s\geq 0,k\geq 2 there are α=α(r,s,k)\alpha=\alpha(r,s,k)\in\mathbb{R} and β=s(2r11)\beta=s(2^{r-1}-1)\in\mathbb{N} such that for any rr-grid BB and Kk,,kK_{k,\dotsc,k}-free finite subset ABA\subseteq B of grid-complexity s\leq s we have

|A|αδr1r(A)logβ(δr1r(A)+1).|A|\leq\alpha\delta^{r}_{r-1}(A)\log^{\beta}(\delta^{r}_{r-1}(A)+1).
Definition 2.23.

For r1,s0,k2r\geq 1,s\geq 0,k\geq 2 and nn\in\mathbb{N}, let Fr,k(s,n)F_{r,k}(s,n) be the maximal size of a Kk,,kK_{k,\dotsc,k}-free subset AA of grid-complexity ss of some rr-grid BB with δr1r(B)n\delta_{r-1}^{r}(B)\leq n.

Then Proposition 2.22 can be restated as follows.

Proposition 2.24.

For every integers r2,s0,k2r\geq 2,s\geq 0,k\geq 2 there are α=α(r,s,k)\alpha=\alpha(r,s,k)\in\mathbb{R} and β=β(r,s)\beta=\beta(r,s)\in\mathbb{N} such that

Fr,k(s,n)αnlogβ(n+1).F_{r,k}(s,n)\leq\alpha n\log^{\beta}(n+1).
Remark 2.25.

Notice that Fr,k(s,0)=0F_{r,k}(s,0)=0.

In the rest of the section we prove Proposition 2.24 by induction on rr, where for each rr it is proved by induction on ss. We will use the following simple recurrence bound.

Fact 2.26.

Let μ:\mu\colon\mathbb{N}\to\mathbb{N} be a function satisfying μ(0)=0\mu(0)=0 and μ(n)2μ(n/2)+αnlogβ(n+1))\mu(n)\leq 2\mu(\lfloor n/2\rfloor)+\alpha n\log^{\beta}(n+1)) for some α\alpha\in\mathbb{R} and β\beta\in\mathbb{N}. Then μ(n)αnlogβ+1(n+1)\mu(n)\leq\alpha^{\prime}n\log^{\beta+1}(n+1) for some α=α(α,β)\alpha^{\prime}=\alpha^{\prime}(\alpha,\beta)\in\mathbb{R}.

2.3. The base case r=2r=2

Let B=B1×B2B=B_{1}{\times}B_{2} be a finite grid and ABA\subseteq B a subset of grid-complexity ss. We will proceed by induction on ss.

If s=0s=0 then A=B1×B2A=B_{1}\times B_{2}. If AA is Kk,kK_{k,k}-free then one of the sets B1,B2B_{1},B_{2} must have size at most kk. Hence |A|k(|B1|+|B2|)=kδ12(B)|A|\leq k(|B_{1}|+|B_{2}|)=k\delta^{2}_{1}(B).

Thus

F2,k(0,n)kn.F_{2,k}(0,n)\leq kn.
Remark 2.27.

The same argument shows that Fr,k(0,n)knF_{r,k}(0,n)\leq kn for all r2r\geq 2.

Assume now that the theorem is proved for r=2r=2 and all s<ss^{\prime}<s. Let n1:=|B1|n_{1}:=|B_{1}|, n2:=|B2|n_{2}:=|B_{2}| and n:=δ12(B)=n1+n2n:=\delta^{2}_{1}(B)=n_{1}+n_{2}.

We choose basic sets X1,XsBX_{1},\dotsc X_{s}\subseteq B such that A=Bj[s]XjA=B\cap\bigcap_{j\in[s]}X_{j}.

By Proposition 2.8, we can choose a finite linear order (S,<)(S,<) and functions f1:B1Sf_{1}\colon B_{1}\to S and f2:B2Sf_{2}\colon B_{2}\to S so that

Xs={(x1,x2)B1×B2:f1(x1)<f2(x2)}.X_{s}=\left\{(x_{1},x_{2})\in B_{1}\times B_{2}\colon f_{1}(x_{1})<f_{2}(x_{2})\right\}.

For lSl\in S, i{1,2}i\in\{1,2\} and {<,=,>,,}\square\in\{<,=,>,\leq,\geq\}, let

Bil={bBi:fi(b)l}.B_{i}^{\square l}=\left\{b\in B_{i}\colon f_{i}(b)\square l\right\}.

We choose hSh\in S such that

|B1<h|+|B2<h|n/2 and |B1>h|+|B2>h|n/2.|B_{1}^{<h}|+|B_{2}^{<h}|\leq n/2\text{ and }|B_{1}^{>h}|+|B_{2}^{>h}|\leq n/2.

For example we can take hh to be the minimal element in f1(B1)f2(B2)f_{1}(B_{1})\cup f_{2}(B_{2}) with |B1h|+|B2h|n/2|B_{1}^{\leq h}|+|B_{2}^{\leq h}|\geq n/2. Then

Xs=[(B1<h×B2<h)Xs][(B1>h×B2>h)Xs]\displaystyle X_{s}=\left[(B_{1}^{<h}\times B_{2}^{<h})\cap X_{s}\right]\cup\left[(B_{1}^{>h}\times B_{2}^{>h})\cap X_{s}\right]
(B1<h×B2h)(B1=h×B2>h).\displaystyle\cup(B_{1}^{<h}\times B_{2}^{\geq h})\cup(B_{1}^{=h}\times B_{2}^{>h}).

Hence we conclude

F2,k(s,n)2F2,k(s,n/2)+2F2,k(s1,n).F_{2,k}(s,n)\leq 2F_{2,k}(s,\lfloor n/2\rfloor)+2F_{2,k}(s-1,n).

Applying induction hypothesis on ss, and using Fact 2.26 and Remark 2.25 we obtain F2,k(s,n)αn(logn)βF_{2,k}(s,n)\leq\alpha n(\log n)^{\beta} for some α=α(s,k)\alpha=\alpha(s,k)\in\mathbb{R} and β=β(s)\beta=\beta(s)\in\mathbb{N}.

This finishes the base case r=2r=2.

2.4. Induction step

We fix r3r\in\mathbb{N}_{\geq 3} and assume that Proposition 2.24 holds for all pairs (r,s)(r^{\prime},s) with r<rr^{\prime}<r and ss\in\mathbb{N}.

Definition 2.28.

Let B=B1××BrB=B_{1}{\times}\dotsb{\times}B_{r} be a finite rr-grid.

  1. (1)

    For integers t,ut,u\in\mathbb{N}, we say that a subset ABA\subseteq B is of split grid-complexity (t,u)(t,u) if there are basic sets X1,,XuBX_{1},\dotsc,X_{u}\subseteq B, a subset ArB1××Br1A^{r}\subseteq B_{1}{\times}\dotsb{\times}B_{r-1} of grid-complexity tt, and a subset ArBrA_{r}\subseteq B_{r} such that A=(Ar×Ar)i[u]XiA=(A^{r}\times A_{r})\cap\bigcap_{i\in[u]}X_{i}.

  2. (2)

    For t,u0,k2t,u\geq 0,k\geq 2 and nn\in\mathbb{N}, let Gk(t,u,n)G_{k}(t,u,n) be the maximal size of a Kk,,kK_{k,\dotsc,k}-free subset AA of an rr-grid BB of split grid-complexity (t,u)(t,u) with δr1r(B)n\delta_{r-1}^{r}(B)\leq n.

Remark 2.29.
  1. (1)

    Note that ArA_{r} has grid-complexity at most 11, which is the reason we do not include a parameter for the grid-complexity of ArA_{r} in the split grid-complexity of AA.

  2. (2)

    If ABA\subseteq B is of grid-complexity ss, then it is of split grid-complexity (0,s)(0,s).

  3. (3)

    If ABA\subseteq B is of split grid-complexity (t,u)(t,u), then it is of grid-complexity t+ut+u.

For the rest of the proof, we abuse notation slightly and refer to the “split grid-complexity” of a set as the “grid-complexity”. To complete the induction step we will prove the following Proposition.

Proposition 2.30.

For any integers t,u0,k2,r3t,u\geq 0,k\geq 2,r\geq 3 there are α=α(r,k,t,u)\alpha^{\prime}=\alpha^{\prime}(r,k,t,u)\in\mathbb{R} and β=β(r,k,t,u)\beta^{\prime}=\beta^{\prime}(r,k,t,u)\in\mathbb{N} such that

Gk(t,u,n)αnlogβ(n+1).G_{k}(t,u,n)\leq\alpha^{\prime}n\log^{\beta^{\prime}}(n+1).

We will use the following notations throughout the section:

  • B=B1××BrB=B_{1}{\times}\dotsb{\times}B_{r} is a finite grid with n=δr1r(B)n=\delta^{r}_{r-1}(B);

  • ABA\subseteq B is a subset of grid-complexity (t,u)(t,u);

  • BrB^{r} is the (r1)(r-1)-grid Br:=B1××Br1B^{r}:=B_{1}{\times}\dotsb{\times}B_{r-1};

  • ArBrA^{r}\subseteq B^{r} is a subset of grid-complexity tt, ArBrA_{r}\subseteq B_{r}, and X1,XuBX_{1},\dotsc X_{u}\subseteq B are basic subsets such that A=(Ar×Ar)i[u]XiA=(A^{r}{\times}A_{r})\cap\bigcap_{i\in[u]}X_{i}.

We proceed by induction on uu.

The base case u=0u=0 of Proposition 2.30.

In this case A=Ar×ArA=A^{r}\times A_{r}. If AA is Kk,,kK_{k,\dotsc,k}-free then either ArA^{r} is Kk,,kK_{k,\dotsc,k}-free or |Ar|<k|A_{r}|<k.

In the first case, by induction hypothesis on rr, there are α=α(r1,t,k)\alpha=\alpha(r-1,t,k) and β=β(r1,t)\beta=\beta(r-1,t) such that |Ar|αδr2r1(Br)logβ(δr2r1(Br)+1)|A^{r}|\leq\alpha\delta^{r-1}_{r-2}(B^{r})\log^{\beta}(\delta^{r-1}_{r-2}(B^{r})+1). In the second case we have |A||Br|k=δr1r1(Br)k|A|\leq|B^{r}|k=\delta^{r-1}_{r-1}(B^{r})k.

Since n=δr1r(B)=δr1r1(Br)+δr2r1(Br)|Br|n=\delta^{r}_{r-1}(B)=\delta^{r-1}_{r-1}(B^{r})+\delta^{r-1}_{r-2}(B^{r})|B_{r}|, the conclusion of the proposition follows with α:=α,β:=β\alpha^{\prime}:=\alpha,\beta^{\prime}:=\beta.

Induction step of Proposition 2.30. We assume now that the proposition holds for all pairs (t,u)(t,u^{\prime}) with u<uu^{\prime}<u and tt\in\mathbb{N}.

Given a tuple x=(x1,,xr)Bx=(x_{1},\ldots,x_{r})\in B, we let xr:=(x1,,xr1)x^{r}:=(x_{1},\ldots,x_{r-1}). By Proposition 2.8, we can choose a finite linear order (S,<)(S,<), a coordinate-wise monotone function fr:BrSf^{r}\colon B^{r}\to S and a function fr:BrSf_{r}\colon B_{r}\to S so that

Xu={xrrxrBr×Br:fr(xr)<fr(xr)}.X_{u}=\left\{x^{r}\oplus_{r}x_{r}\in B^{r}\times B_{r}\colon f^{r}(x^{r})<f_{r}(x_{r})\right\}.

Moreover, by Remark 2.9, we may assume without loss of generality that the coordinate-wise monotone function defining XuX_{u} is given by

f(xrrxr)=fr(xr)fr(xr).f(x^{r}\oplus_{r}x_{r})=f^{r}(x^{r})-f_{r}(x_{r}).
Definition 2.31.

Given an arbitrary set CrBrC^{r}\subseteq B^{r}, we say that a set HrCrH^{r}\subseteq C^{r} is an frf^{r}-strip in CrC^{r} if

Hr={xrCr:l11fr(xr)2l2}H^{r}=\left\{x^{r}\in C^{r}\colon l_{1}\triangleleft_{1}f^{r}(x^{r})\triangleleft_{2}l_{2}\right\}

for some l1,l2Sl_{1},l_{2}\in S, 1,2{<,}\triangleleft_{1},\triangleleft_{2}\in\{<,\leq\}. Likewise, given an arbitrary set CrBrC_{r}\subseteq B_{r}, we say that HrCrH_{r}\subseteq C_{r} is an frf_{r}-strip in CrC_{r} if

Hr={xrCr:l11fr(xr)2l2}H_{r}=\left\{x_{r}\in C_{r}\colon l_{1}\triangleleft_{1}f_{r}(x_{r})\triangleleft_{2}l_{2}\right\}

for some l1,l2Sl_{1},l_{2}\in S, 1,2{<,}\triangleleft_{1},\triangleleft_{2}\in\{<,\leq\}. If Cr=ArC^{r}=A^{r} or Cr=ArC_{r}=A_{r}, we simply say an frf^{r}-strip or frf_{r}-strip, respectively.

Remark 2.32.

Note the following:

  1. (1)

    ArA^{r} is an frf^{r}-strip, and ArA_{r} is an frf_{r}-strip;

  2. (2)

    every frf^{r}-strip is a subset of the (r1)(r-1)-grid BrB^{r} of grid-complexity t+2t+2 (using Remark 2.7);

  3. (3)

    the intersection of any two frf^{r}-strips is an frf^{r}-strip; the same conclusion holds for frf_{r}-strips.

Definition 2.33.
  1. (1)

    We say that a subset HBH\subseteq B is an ff-grid if H=Hr×HrH=H^{r}\times H_{r}, where HrBrH^{r}\subseteq B^{r} is an frf^{r}-strip in BrB^{r} and HrBrH_{r}\subseteq B_{r} is an frf_{r}-strip in BrB_{r}.

  2. (2)

    If H=Hr×HrH=H^{r}\times H_{r} is an ff-grid, we set

    Δ(H):=|Hr|+δr2r1(Hr)|Hr| (see Definition 2.21 for δr2r1).\Delta(H):=|H^{r}|+\delta^{r-1}_{r-2}(H^{r})|H_{r}|\text{ (see Definition ~{}\ref{defn:delta} for }\delta^{r-1}_{r-2}).

    Note that if HH is a sub-grid of BB, then Δ(H)=δr1r(H)\Delta(H)=\delta^{r}_{r-1}(H).

  3. (3)

    For an ff-grid HH, we will denote by AHA_{H} the set AHA\cap H.

The induction step for Proposition 2.30 will follow from the following proposition.

Proposition 2.34.

For every integer k2,r3k\geq 2,r\geq 3 there are α=α(r,k,t,u)\alpha^{\prime}=\alpha^{\prime}(r,k,t,u)\in\mathbb{R} and β=β(r,t,u)\beta^{\prime}=\beta^{\prime}(r,t,u)\in\mathbb{N} such that, for any ff-grid HH, if the set AHA_{H} is Kk,,kK_{k,\dotsc,k}-free then

|AH|αΔ(H)logβ(Δ(H)+1).|A_{H}|\leq\alpha^{\prime}\Delta(H)\log^{\beta^{\prime}}(\Delta(H)+1).

We should stress that in the above proposition α\alpha^{\prime} and β\beta^{\prime} do not depend on fr,frf^{r},f_{r}, BB, ArA^{r}, and ArA_{r} but they may depend on our fixed tt and uu.

Given Proposition 2.34, we can apply it to the ff-grid H:=Ar×ArH:=A^{r}\times A_{r} (so AH=AA_{H}=A) and get

|A|αΔ(H)logβ(Δ(H)+1).|A|\leq\alpha^{\prime}\Delta(H)\log^{\beta^{\prime}}(\Delta(H)+1).

It is easy to see that Δ(Ar×Ar)δr1r(B)\Delta(A^{r}\times A_{r})\leq\delta^{r}_{r-1}(B), hence Proposition 2.30 follows with the same α\alpha^{\prime} and β\beta^{\prime}.

We proceed with the proof of Proposition 2.34

Proof of Proposition 2.34.

Fix mm\in\mathbb{N}, and let L(m)L(m) be the maximal size of a Kk,,kK_{k,\dotsc,k}-free set AHA_{H} among all ff-grids HBH\subseteq B with Δ(H)m\Delta(H)\leq m. We need to show that for some α=α(k)\alpha^{\prime}=\alpha^{\prime}(k)\in\mathbb{R} and β\beta^{\prime}\in\mathbb{N} we have

L(m)αmlogβ(m+1).L(m)\leq\alpha^{\prime}m\log^{\beta^{\prime}}(m+1).

Let H=Hr×HrH=H^{r}\times H_{r} be an ff-grid with Δ(H)m\Delta(H)\leq m.

For lSl\in S and {<,=,>,,}\square\in\{<,=,>,\leq,\geq\}, let

Hr,l:={xrHr:fr(xr)l}H^{r,\square l}:=\left\{x^{r}\in H^{r}\colon f^{r}(x^{r})\square l\right\}

and

Hrl:={xrHr:fr(xr)l}.H_{r}^{\square l}:=\left\{x_{r}\in H_{r}\colon f_{r}(x_{r})\square l\right\}.

Note that for every lSl\in S, Hr,lH^{r,\square l} is an frf^{r}-strip in HrH^{r}, HrlH_{r}^{\square l} is an frf_{r}-strip in HrH_{r}, and their product is an ff-grid.

Claim 2.35.

There is hSh\in S such that

Δ(Hr,<h×Hr<h)m/2 and Δ(Hr,>h×Hr>h)m/2.\Delta(H^{r,<h}\times H_{r}^{<h})\leq m/2\text{ and }\Delta(H^{r,>h}\times H_{r}^{>h})\leq m/2.
Proof of Claim.

Let δ:=δr2r1(Hr)\delta:=\delta^{r-1}_{r-2}(H^{r}).

Let hh be the minimal element in fr(Hr)fr(Hr)f^{r}(H^{r})\cup f_{r}(H_{r}) with

|Hr,h|+δ|Hrh|m/2.|H^{r,\leq h}|+\delta|H_{r}^{\leq h}|\geq m/2.

Then |Hr,<h|+δ|Hr<h|m/2|H^{r,<h}|+\delta|H_{r}^{<h}|\leq m/2 and |Hr,>h|+δ|Hr>h|m/2|H^{r,>h}|+\delta|H_{r}^{>h}|\leq m/2. Since Hr,<h,Hr,>hHrH^{r,<h},H^{r,>h}\subseteq H^{r}, we have δr2r1(Hr,<h),δr2r1(Hr,>h)δ\delta^{r-1}_{r-2}(H^{r,<h}),\delta^{r-1}_{r-2}(H^{r,>h})\leq\delta. The claim follows. ∎

Let hh be as in the claim. It is not hard to see that the following holds:

(Hr,h×Hrh)Xu=(Hr,<h×Hrh)(Hr,=h×Hr>h),\displaystyle\left(H^{r,\leq h}\times H_{r}^{\geq h}\right)\cap X_{u}=\left(H^{r,<h}\times H_{r}^{\geq h}\right)\cup\left(H^{r,=h}\times H_{r}^{>h}\right),
(Hr,h×Hrh)Xu=.\displaystyle\left(H^{r,\geq h}\times H_{r}^{\leq h}\right)\cap X_{u}=\emptyset.

It follows that

AHXu=[(Hr,<h×Hr<h)Xu][(Hr,>h×Hr>h)Xu]\displaystyle A_{H}\cap X_{u}=\left[(H^{r,<h}\times H_{r}^{<h})\cap X_{u}\right]\cup\left[(H^{r,>h}\times H_{r}^{>h})\cap X_{u}\right]
(Hr,<h×Hrh)(Hr,=h×Hr>h).\displaystyle\cup(H^{r,<h}\times H_{r}^{\geq h})\cup(H^{r,=h}\times H_{r}^{>h}).

Hence, by the choice of hh and using Remark 2.32(2),

L(m)2L(m/2)+2Gk(t+2,u1,m).L(m)\leq 2L(\lfloor m/2\rfloor)+2G_{k}(t+2,u-1,m).

Applying the induction hypothesis on uu and using Fact 2.26 we obtain L(m)αmlogβ(m+1)L(m)\leq\alpha^{\prime}m\log^{\beta^{\prime}}(m+1) for some α=α(k)\alpha^{\prime}=\alpha^{\prime}(k)\in\mathbb{R} and β\beta^{\prime}\in\mathbb{N}.

This finishes the proof of Proposition 2.34, and hence of the induction step of Proposition 2.24. ∎

Finally, inspecting the proof, we have shown the following:

  1. (1)

    β(2,s)s\beta(2,s)\leq s for all ss\in\mathbb{N};

  2. (2)

    β(r,t,0)β(r1,t)\beta^{\prime}(r,t,0)\leq\beta(r-1,t) for all r3r\geq 3 and tt\in\mathbb{N};

  3. (3)

    β(r,t,u)β(r,t+2,u1)+1\beta^{\prime}(r,t,u)\leq\beta^{\prime}(r,t+2,u-1)+1 for all r3,t0,u1r\geq 3,t\geq 0,u\geq 1.

Iterating (3), for every r3,s1r\geq 3,s\geq 1 we have β(r,s)β(r,0,s)β(r,2s,0)+s\beta(r,s)\leq\beta^{\prime}(r,0,s)\leq\beta^{\prime}(r,2s,0)+s. Hence, by (2), β(r,s)β(r1,2s)+s\beta(r,s)\leq\beta(r-1,2s)+s for every r3r\geq 3 and s1s\geq 1. Iterating this, we get β(r,s)β(2,2r2s)+si=0r32i\beta(r,s)\leq\beta(2,2^{r-2}s)+s\sum_{i=0}^{r-3}2^{i}. Using (1), this implies β(r,s)si=0r22i=s(2r11)\beta(r,s)\leq s\sum_{i=0}^{r-2}2^{i}=s(2^{r-1}-1) for all r3,s1r\geq 3,s\geq 1. Hence, by Remark 2.27 and (1) again, β(r,s)s(2r11)\beta(r,s)\leq s(2^{r-1}-1) for all r2,s0r\geq 2,s\geq 0.

2.5. Some applications

We observe several immediate applications of Theorem 2.17, starting with the following bound for semilinear hypergraphs.

Corollary 2.36.

For every r,s,t,k,r2r,s,t,k\in\mathbb{N},r\geq 2 there exist some α=α(r,s,t,k)\alpha=\alpha(r,s,t,k)\in\mathbb{R} and β(r,s):=s(2r11)\beta(r,s):=s(2^{r-1}-1) satisfying the following.

For any semilinear Kk,,kK_{k,\ldots,k}-free rr-hypergraph H=(V1,,Vr;E)H=(V_{1},\ldots,V_{r};E) of description complexity (s,t)(s,t) (see Definition 1.7), taking V:=i[r]ViV:=\prod_{i\in[r]}V_{i} we have

|E|αδr1r(V)logβ(δr1r(V)+1).|E|\leq\alpha\delta^{r}_{r-1}(V)\log^{\beta}\left(\delta^{r}_{r-1}(V)+1\right).
Proof.

By assumption the edge relation EE can be defined by a union of tt sets, each of which is defined ss linear equalities and inequalities, hence of grid-complexity s\leq s (see Example 2.13). The conclusion follows by Theorem 2.17 and Remark 2.20. ∎

As a special case with r=2r=2, this implies a bound for the following incidence problem.

Corollary 2.37.

For every s,ks,k\in\mathbb{N} there exists some α=α(s,k)\alpha=\alpha(s,k)\in\mathbb{R} satisfying the following.

Let dd\in\mathbb{N} and H1,,HsdH_{1},\ldots,H_{s}\subseteq\mathbb{R}^{d} be finitely many (closed or open) half-spaces in d\mathbb{R}^{d}. Let \mathcal{F} be the (infinite) family of all possible polytopes in d\mathbb{R}^{d} cut out by arbitrary translates of H1,,HsH_{1},\ldots,H_{s}.

For any set PP of n1n_{1} points in d\mathbb{R}^{d} and any set FF of n2n_{2} polytopes in \mathcal{F}, if the incidence graph on P×FP\times F is Kk,kK_{k,k}-free, then it contains at most αnlogsn\alpha n\log^{s}n incidences.

Proof.

We can write

Hi={x¯=(x1,,xd)d:j[d]ai,jxjibi},H_{i}=\left\{\bar{x}=(x_{1},\ldots,x_{d})\in\mathbb{R}^{d}:\sum_{j\in[d]}a_{i,j}x_{j}\square_{i}b_{i}\right\},

where ai,j,bia_{i,j},b_{i}\in\mathbb{R} and i{>,}\square_{i}\in\{>,\geq\} for i[s],j[d]i\in[s],j\in[d] depending on whether HiH_{i} is an open or a closed half-space.

Every polytope FF\in\mathcal{F} is of the form i[s](y¯i+Hi)\bigcap_{i\in[s]}(\bar{y}_{i}+H_{i}) for some (y¯1,,y¯s)sd(\bar{y}_{1},\ldots,\bar{y}_{s})\in\mathbb{R}^{sd}, where y¯i+Hi\bar{y}_{i}+H_{i} is the translate of HiH_{i} by the vector y¯i=(yi,1,,yi,d)d\bar{y}_{i}=(y_{i,1},\ldots,y_{i,d})\in\mathbb{R}^{d}, i.e.

y¯i+Hi={x¯d:j[d]ai,jxj+j[d](ai,j)yjibi}.\bar{y}_{i}+H_{i}=\left\{\bar{x}\in\mathbb{R}^{d}:\sum_{j\in[d]}a_{i,j}x_{j}+\sum_{j\in[d]}(-a_{i,j})y_{j}\square_{i}b_{i}\right\}.

Then the incidence relation between points in d\mathbb{R}^{d} and polytopes in \mathcal{F} can be identified with the semilinear set

{(x¯;(yi,j)i[s],j[d])d×sd:i[s]j[d]ai,jxj+j[d](ai,j)yi,jibi}\left\{\left(\bar{x};(y_{i,j})_{i\in[s],j\in[d]}\right)\in\mathbb{R}^{d}\times\mathbb{R}^{sd}:\bigwedge_{i\in[s]}\sum_{j\in[d]}a_{i,j}x_{j}+\sum_{j\in[d]}(-a_{i,j})y_{i,j}\square_{i}b_{i}\right\}

defined by ss linear inequalities. The conclusion now follows by Corollary 2.36 with r=2r=2. ∎

In particular, we get a bound for the original question that motivated this paper.

Corollary 2.38.

Let d\mathcal{F}_{d} be the family of all (closed or open) boxes in d\mathbb{R}^{d}. Then for every kk there exists some α=α(d,k)\alpha=\alpha(d,k) satisfying the following.

For any set PP of n1n_{1} points in d\mathbb{R}^{d} and any set FF of n2n_{2} boxes in d\mathcal{F}_{d}, if the incidence graph on P×FP\times F is Kk,kK_{k,k}-free, then it contains at most αnlog2dn\alpha n\log^{2d}n incidences.

Proof.

Immediate from Corollary 2.37, since we have 2d2d half-spaces in d\mathbb{R}^{d} so that every box in d\mathbb{R}^{d} is cut out by the intersection of their translates. ∎

3. Lower bounds

While we do not know if the bound β(2,s)s\beta(2,s)\leq s in Theorem 2.17 is optimal, in this section we show that at least the logarithmic factor is unavoidable already for the incidence relation between points and dyadic boxes in 2\mathbb{R}^{2}.

We describe a slightly more general construction first. Fix d>0d\in\mathbb{N}_{>0}.

Definition 3.1.

Given finite tuples p¯=(p1,,pn),q¯=(q1,,qn)\bar{p}=(p_{1},\ldots,p_{n}),\bar{q}=(q_{1},\ldots,q_{n}) and r¯=(r1,,rm)\bar{r}=(r_{1},\ldots,r_{m}) with pi,qi,ridp_{i},q_{i},r_{i}\in\mathbb{R}^{d}, say pi=(pi,1,,pi,d),qi=(qi,1,,qi,d),ri=(ri,1,,ri,d)p_{i}=(p_{i,1},\ldots,p_{i,d}),q_{i}=(q_{i,1},\ldots,q_{i,d}),r_{i}=(r_{i,1},\ldots,r_{i,d}), we say that p¯\bar{p} and q¯\bar{q} have the same order-type over r¯\bar{r} if

pi,jpi,jqi,jqi,j andp_{i,j}\square p_{i^{\prime},j^{\prime}}\iff q_{i,j}\square q_{i^{\prime},j^{\prime}}\mbox{ and}
pi,jrk,jqi,jrk,jp_{i,j}\square r_{k,j^{\prime}}\iff q_{i,j}\square r_{k,j^{\prime}}

for all {<,>,=}\square\in\{<,>,=\}, 1i,in,1j,jd1\leq i,i^{\prime}\leq n,1\leq j,j^{\prime}\leq d and 1km1\leq k\leq m.

In other words, the tuples (pi,j:1in,1jd)(p_{i,j}:1\leq i\leq n,1\leq j\leq d) and (qi,j:1in,1jd)(q_{i,j}:1\leq i\leq n,1\leq j\leq d) have the same quantifier-free type over the set {ri,j:1im,1jd}\{r_{i,j}:1\leq i\leq m,1\leq j\leq d\} in the structure (,<)(\mathbb{R},<).

Remark 3.2.

Assume that P={p1,,pn}dP=\{p_{1},\ldots,p_{n}\}\subseteq\mathbb{R}^{d} is a finite set of points and BB is a finite set of dd-dimensional open boxes with axis-parallel sides, with II incidences between PP and BB.

  1. (1)

    By perturbing PP and BB slightly, we may assume that for every 1jd1\leq j\leq d, all points in PP have pairwise distinct jjth coordinates p1,j,,pn,jp_{1,j},\ldots,p_{n,j}, and none of the points in PP belongs to the border of any of the boxes in BB, while the incidence graph between PP and BB remains unchanged.

  2. (2)

    Let r¯\bar{r} be the tuple listing all corners of all boxes in BB. If P={p1,,pn}dP^{\prime}=\{p^{\prime}_{1},\ldots,p^{\prime}_{n}\}\subseteq\mathbb{R}^{d} is an arbitrary set of points with the same order-type as PP over r¯\bar{r}, then the incidence graph on P×BP\times B is isomorphic to the incidence graph on P×BP^{\prime}\times B.

We have the following lemma for combining point-box incidence configurations in a higher-dimensional space.

Lemma 3.3.

Given any d,n1,n2,n1,n2,m,m>0d,n_{1},n_{2},n^{\prime}_{1},n^{\prime}_{2},m,m^{\prime}\in\mathbb{N}_{>0}, assume that:

  1. (1)

    there exists a set of points Pd1d1P^{d-1}\subseteq\mathbb{R}^{d-1} with |Pd1|=n1|P^{d-1}|=n_{1} and a set of (d1)(d-1)-dimensional boxes Bd1B^{d-1} with |Bd1|=n2|B^{d-1}|=n_{2}, with mm incidences between them, and the incidence graph K2,2K_{2,2}-free;

  2. (2)

    there exists a set of points PddP^{d}\subseteq\mathbb{R}^{d} with |Pd|=n1|P^{d}|=n^{\prime}_{1} and a set of dd-dimensional boxes BdB^{d} with |Bd|=n2|B^{d}|=n^{\prime}_{2}, with mm^{\prime} incidences between them and the incidence graph K2,2K_{2,2}-free.

Then there exists a set of points PdP\subseteq\mathbb{R}^{d} with |P|=n1n1|P|=n_{1}n^{\prime}_{1} and a set of dd-dimensional boxes BB with |B|=n1n2+n1n2|B|=n_{1}n^{\prime}_{2}+n^{\prime}_{1}n_{2}, so that there are n1m+mn1n_{1}m^{\prime}+mn^{\prime}_{1} incidences between PP and BB and their incidence graph is still K2,2K_{2,2}-free.

Proof.

By Remark 3.2(1) we may assume that for every 1jd1\leq j\leq d, all points in PdP^{d} have pairwise distinct jjth coordinates, for every 1jd11\leq j\leq d-1 all points in Pd1P^{d-1} have pairwise distinct jjth coordinates, and none of the points is on the border of any of the boxes. Write Pd1P^{d-1} as p1,,pn1p_{1},\ldots,p_{n_{1}}. Let r¯\bar{r} be the tuple listing all corners of all boxes in Bd1B^{d-1}.

Using this, for each pip_{i} we can choose a very small (d1)(d-1)-dimensional box βi\beta_{i} with piβip_{i}\in\beta_{i} and such that: for any choice of points piβi,1in1p^{\prime}_{i}\in\beta_{i},1\leq i\leq n_{1}, we have that (p1,,pn1)(p^{\prime}_{1},\ldots,p^{\prime}_{n_{1}}) has the same order-type as (p1,,pn1)(p_{1},\ldots,p_{n_{1}}) over r¯\bar{r}. In particular, all the βi\beta_{i}’s are pairwise disjoint, and the incidence graph between Pd1P^{d-1} and Bd1B^{d-1} is isomorphic to the incidence graph between (pi,,pn1)(p^{\prime}_{i},\ldots,p^{\prime}_{n_{1}}) and Bd1B^{d-1} by Remark 3.2(2).

Contracting and translating while keeping the ddth coordinate unchanged, for each 1in11\leq i\leq n_{1} we can find a copy (Pid,Bid)(P^{d}_{i},B^{d}_{i}) of the configuration (Pd,Bd)(P^{d},B^{d}) entirely contained in the box βi×\beta_{i}\times\mathbb{R}, that is:

  • all points in PidP^{d}_{i} and boxes in BidB^{d}_{i} are contained in βi×\beta_{i}\times\mathbb{R};

  • the incidence graph on (Pid,Bid)(P^{d}_{i},B^{d}_{i}) is isomorphic to the incidence graph on (Pd,Bd)(P^{d},B^{d});

  • for all ii, the ddth coordinate of every point in PidP_{i}^{d} is the same as the ddth coordinate of the corresponding point in PdP^{d}.

Let P:=1in1PidP:=\bigcup_{1\leq i\leq n_{1}}P^{d}_{i} and B:=1in1BidB^{\prime}:=\bigcup_{1\leq i\leq n_{1}}B^{d}_{i}, then |P|=n1n1,|B|=n1n2|P|=n_{1}n^{\prime}_{1},|B^{\prime}|=n_{1}n^{\prime}_{2} and there are n1mn_{1}m^{\prime} incidences between PP and BB^{\prime}.

Write PdP^{d} as q1,,qn1q_{1},\ldots,q_{n^{\prime}_{1}} and Bd1B^{d-1} as c1,,cn2c_{1},\ldots,c_{n_{2}}. As all of the ddth coordinates of the points in PdP^{d} are pairwise disjoint, for each 1jn11\leq j\leq n^{\prime}_{1} we can choose a small interval IjI_{j}\subseteq\mathbb{R} with qj,dIjq_{j,d}\in I_{j}, and so that all of the intervals Ij,1jn1I_{j},1\leq j\leq n^{\prime}_{1} are pairwise disjoint. For each 1jn11\leq j\leq n^{\prime}_{1} and clBd1c_{l}\in B^{d-1}, we consider the dd-dimensional box cj,l:=cl×Ijc_{j,l}:=c_{l}\times I_{j}. Let Bj:={cj,l:1ln2}B_{j}:=\{c_{j,l}:1\leq l\leq n_{2}\}. For each 1in11\leq i\leq n_{1} and 1jn11\leq j\leq n^{\prime}_{1}, (βi×)(d1×Ij)(\beta_{i}\times\mathbb{R})\cap(\mathbb{R}^{d-1}\times I_{j}) contains exactly one point qi,jq_{i,j} (given by the copy of qjq_{j} in PidP_{i}^{d}), and the projection qi,jq^{\prime}_{i,j} of qi,jq_{i,j} onto the first d1d-1 coordinates is in βi\beta_{i}. Hence the incidence graph between PP and BjB_{j} is isomorphic to the incidence graph between Pd1P^{d-1} and Bd1B^{d-1} by the choice of the βi\beta_{i}’s, in particular the number of incidences is mm.

Finally, let B:=B1jn1BjB:=B^{\prime}\cup\bigcup_{1\leq j\leq n^{\prime}_{1}}B_{j}, then |B|=n1n2+n1n2|B|=n_{1}n^{\prime}_{2}+n^{\prime}_{1}n_{2}. Note that cj,lcj,l=c_{j,l}\cap c_{j^{\prime},l^{\prime}}=\emptyset for jjj\neq j^{\prime} and any l,ll,l^{\prime}, i.e. no box in BjB_{j} intersects any of the boxes in BjB_{j^{\prime}} for jjj\neq j^{\prime}. It is now not hard to check that the incidence graph between PP and BB is K2,2K_{2,2}-free (by construction and the assumptions of K2,2K_{2,2}-freeness of (Pd,Bd)(P^{d},B^{d}) and (Pd1,Bd1)(P^{d-1},B^{d-1})), and that there are n1m+mn1n_{1}m^{\prime}+mn^{\prime}_{1} incidences between PP and BB. ∎

Remark 3.4.

It follows from the proof that if all the boxes in Bd1B^{d-1} and BdB^{d} are dyadic (see Definition 4.6), then we can choose the boxes in BB to be dyadic as well.

Proposition 3.5.

For any \ell\in\mathbb{N}, there exist a set PP of \ell^{\ell} points and a set BB of \ell^{\ell} dyadic boxes in 2\mathbb{R}^{2} such that their incidence graph is K2,2K_{2,2}-free and the number of incidences is \ell\ell^{\ell}.

In particular, substituting n:=n:=\ell^{\ell}, this shows that the number of incidences grows as Ω(nlognloglogn)\Omega\left(n\frac{\log n}{\log\log n}\right).

Proof.

Given dd, assume that there exist K2,2K_{2,2}-free ‘point – dyadic box’ configurations satisfying (1) and (2) in Lemma 3.3 for some parameters d,n1,n2,n1,n2,m,md,n_{1},n_{2},n^{\prime}_{1},n^{\prime}_{2},m,m^{\prime}. Then, for any jj\in\mathbb{N}, we can iterate the lemma jj times and find a K2,2K_{2,2}-free ‘point – dyadic box’ configuration in d\mathbb{R}^{d} with n1jn1n_{1}^{j}n^{\prime}_{1} points, n1jn2+jn1j1n1n2n_{1}^{j}n^{\prime}_{2}+jn_{1}^{j-1}n^{\prime}_{1}n_{2} dyadic boxes (Remark 3.4), and n1jm+jn1j1n1mn_{1}^{j}m^{\prime}+jn_{1}^{j-1}n^{\prime}_{1}m incidences.

In particular, let d=2d=2 and let \ell be arbitrary. We can start with n1=,n2=1,m=n_{1}=\ell,n_{2}=1,m=\ell (one dyadic interval containing n1n_{1} points in \mathbb{R}) and n1=1,n2=0,m=0n^{\prime}_{1}=1,n^{\prime}_{2}=0,m^{\prime}=0 (one point and zero dyadic boxes in 2\mathbb{R}^{2}). Taking j:=j:=\ell, we then find a K2,2K_{2,2}-free configuration with \ell^{\ell} points, \ell^{\ell} dyadic boxes and \ell\ell^{\ell} incidences. Hence for n:=kkn:=k^{k}, we have nn points, nn boxes and Ω(nlognloglogn)\Omega\left(n\frac{\log n}{\log\log n}\right) incidences. ∎

Remark 3.6.

We remark that the construction in Lemma 3.3 cannot produce a K2,2K_{2,2}-free configuration with more than O(nlognloglogn)O\left(n\frac{\log n}{\log\log n}\right) incidences in d\mathbb{R}^{d} for any dd.

Indeed, using the “coordinates” (logn1,n2n1,mn1)\left(\log n^{\prime}_{1},\frac{n^{\prime}_{2}}{n^{\prime}_{1}},\frac{m^{\prime}}{n^{\prime}_{1}}\right) instead of (n1,n2,m)(n^{\prime}_{1},n^{\prime}_{2},m^{\prime}), where the coordinates correspond to the number of points, boxes and incidences respectively, the lemma says that if (logn1,n2n1,mn1)\left(\log n_{1},\frac{n_{2}}{n_{1}},\frac{m}{n_{1}}\right) is attainable in d1d-1 dimensions and (logn1,n2n1,mn1)\left(\log n^{\prime}_{1},\frac{n^{\prime}_{2}}{n^{\prime}_{1}},\frac{m^{\prime}}{n^{\prime}_{1}}\right) is attainable in dd dimensions, then (logn1+logn1,n2n1+n2n1,mn1+mn1)\left(\log n^{\prime}_{1}+\log n_{1},\frac{n^{\prime}_{2}}{n^{\prime}_{1}}+\frac{n_{2}}{n_{1}},\frac{m^{\prime}}{n^{\prime}_{1}}+\frac{m}{n_{1}}\right) is attainable in dd dimensions. Thus, one adds the vector (n2n1,mn1)\left(\frac{n_{2}}{n_{1}},\frac{m}{n_{1}}\right) to (n2n1,mn1)\left(\frac{n^{\prime}_{2}}{n^{\prime}_{1}},\frac{m^{\prime}}{n^{\prime}_{1}}\right). We want to maximize the second coordinate of this vector while keeping the first coordinate below 11, and the optimal way to do it essentially is to add n1n_{1} times the vector (1n1,1)\left(\frac{1}{n_{1}},1\right), which increases logn1\log n^{\prime}_{1} by n1logn1n_{1}\log n_{1} and gives the lognloglogn\frac{\log n}{\log\log n} lower bound.

We thus ask whether in the ‘point-box’ incidence bound in d\mathbb{R}^{d} the power of logn\log n has to grow with the dimension dd (see Problem 1.3).

4. Dyadic rectangles

In this section we strengthen the bound on the number of incidences with rectangles on the plane with axis-parallel sides given by Corollary 2.38, i.e., Ok(nlog4n)O_{k}\left(n\log^{4}n\right), in the special case of dyadic rectangles, using a different argument (which relies on a certain partial order specific to the dyadic case).

4.1. Locally dd-linear orders

Throughout this section, let (P,)(P,\leq) be a partially ordered set of size at most n1n_{1}, and let LL be a collection of subsets of PP (possibly with repetitions) of size at most n2n_{2}. As before, we let n=n1+n2n=n_{1}+n_{2}.

Definition 4.1.

We say that a set SPS\subseteq P is dd-linear if it contains no antichains of size greater than dd, and (P,)(P,\leq) is locally dd-linear if any interval [a,b]={xP:axb}[a,b]=\{x\in P:a\leq x\leq b\} is dd-linear.

Note that dd-linearity is preserved under removing points from PP.

Definition 4.2.

The collection LL is said to be a Kk,kK_{k,k}-free arrangement if for any a1akPa_{1}\neq\ldots\neq a_{k}\in P, there are at most k1k-1 sets from LL containing all of them simultaneously.

Observe that if one removes any number of points from PP, or removes any number of sets from LL, one still obtains a Kk,kK_{k,k}-free arrangement. We now state the main theorem of this section.

Theorem 4.3.

Suppose (P,<)(P,<) is locally dd-linear, and LL is a Kk,kK_{k,k}-free arrangement of dd-linear subsets of PP. Then

L||=Od,k(nlog(100+n1)loglog(100+n1))\sum_{\ell\in L}|\ell|=O_{d,k}\left(n\frac{\log(100+n_{1})}{\log\log(100+n_{1})}\right)

.

To prove Theorem 4.3, we first need some definitions and a lemma. If xPx\in P, define a parent of xx to be an element yPy\in P with y>xy>x and no element between xx and yy, and similarly define a child of xx to be an element zPz\in P with z<xz<x and no element between zz and xx. We say that zz is a strict tt-descendant of xx if there are some elements z0=x>z1>>zt=zz_{0}=x>z_{1}>\ldots>z_{t}=z such that zi+1z_{i+1} is a child of ziz_{i}, and that zz is a tt-descendant of xx if it is a strict ss-descendant for some 0st0\leq s\leq t.

Lemma 4.4.

Fix d,kd,k\in\mathbb{N}. Let LL be a Kk,kK_{k,k}-free arrangement of dd-linear subsets of PP, and let m>0m>0. Let PP^{\prime} denote the set of all elements in PP which have a (k1)(k-1)-descendant with more than mm children. Then

L||L|P|+d(k1)|L|+(k1)mk1(|P||P|).\sum_{\ell\in L}|\ell|\leq\sum_{\ell\in L}|\ell\cap P^{\prime}|+d(k-1)|L|+(k-1)m^{k-1}(|P|-|P^{\prime}|).
Proof.

Let P′′:=P\PP^{\prime\prime}:=P\backslash P^{\prime} denote the set of elements xPx\in P such that every (k1)(k-1)-descendant of xx has at most mm children. Then we can rearrange the desired inequality as

L|P′′|d(k1)|L|+(k1)mk1|P′′|.\sum_{\ell\in L}|\ell\cap P^{\prime\prime}|\leq d(k-1)|L|+(k-1)m^{k-1}|P^{\prime\prime}|.

The quantity L|P′′|\sum_{\ell\in L}|\ell\cap P^{\prime\prime}| is counting incidences (x,)(x,\ell) where L\ell\in L and xP′′x\in P^{\prime\prime}\cap\ell.

Given L\ell\in L, call a point xx\in\ell low if xx has no descending chain of length k1k-1 under it in \ell. Every \ell can contain at most d(k1)d(k-1) low points. Indeed, as \ell is dd-linear, it has at most dd minimal elements. Removing them, we obtain a dd-linear set 1\ell_{1}\subseteq\ell such that every point in it contains an element under it in \ell, and 1\ell_{1} itself has at most dd minimal elements. Remove them to obtain a dd-linear set 21\ell_{2}\subseteq\ell_{1} such that each point in it contains a descending chain of length 22 under it in \ell, etc.

Hence each L\ell\in L contributes at most d(k1)d(k-1) incidences with its low points, giving a total contribution of at most d(k1)|L|d(k-1)|L| to the sum. If xx is not a low point on \ell, then there are some z1<<zk1<xz_{1}<\ldots<z_{k-1}<x in \ell, with each one a child of the next one. As LL is a Kk,kK_{k,k}-free arrangement, among the sets L\ell\in L there are at most k1k-1 containing all these points. By definition of P′′P^{\prime\prime}, for each xP′′x\in P^{\prime\prime} there are at most mk1m^{k-1} choices for such tuples (z1,,zk1)(z_{1},\ldots,z_{k-1}). Hence xx is incident to at most (k1)mk1(k-1)m^{k-1} sets L\ell\in L for which it is not low, and the total number of contributions of incidences in this case is at most (k1)mk1|P′′|(k-1)m^{k-1}|P^{\prime\prime}|, so the claim follows. ∎

Now we prove Theorem 4.3. Let tt be a natural number to be chosen later, and m>0m>0 be another parameter to be chosen later. Define the subsets

P=P0P1PtP=P_{0}\supset P_{1}\supset\dots\supset P_{t}

of PP by defining P0:=PP_{0}:=P, and for each i=0,,t1i=0,\dots,t-1, defining Pi+1P_{i+1} to be the set of points in PiP_{i} that have a (k1)(k-1)-descendant with more than mm children in (Pi,<)(P_{i},<). By the above lemma, we have

L|Pi|L|Pi+1|+d(k1)|L|+(k1)mk1(|Pi||Pi+1|)\sum_{\ell\in L}|\ell\cap P_{i}|\leq\sum_{\ell\in L}|\ell\cap P_{i+1}|+d(k-1)|L|+(k-1)m^{k-1}(|P_{i}|-|P_{i+1}|)

for all i=0,,t1i=0,\dots,t-1, and hence on telescoping

L||L|Pt|+d(k1)t|L|+(k1)mk1n1.\sum_{\ell\in L}|\ell|\leq\sum_{\ell\in L}|\ell\cap P_{t}|+d(k-1)t|L|+(k-1)m^{k-1}n_{1}.
Claim 4.5.

Let xx be a point in PtP_{t}. Then it has at least mt(kdk)t1\frac{m^{t}}{(kd^{k})^{t-1}} distinct descendants in PP.

Proof.

By definition of PtP_{t} there is some (k1)(k-1)-descendant xPt1x^{\prime}\in P_{t-1} of xx which has at least mm children in Pt1P_{t-1}. Let St1Pt1S_{t-1}\subseteq P_{t-1} denote the set of children of xx^{\prime}, so |St1|m|S_{t-1}|\geq m. By reverse induction for i=t1,t2,,0i=t-1,t-2,\ldots,0 we choose sets SiPiS_{i}\subseteq P_{i} of descendants of xx so that |Si1||Si|mkdk|S_{i-1}|\geq\frac{|S_{i}|m}{kd^{k}}. Then |S0|mt(kdk)t1|S_{0}|\geq\frac{m^{t}}{(kd^{k})^{t-1}}, as wanted.

Let SiS_{i} be given. By definition of PiP_{i} and pigeonhole principle, there is some 0sk10\leq s\leq k-1 and SiSiS^{\prime}_{i}\subseteq S_{i} such that |Si||Si|k|S^{\prime}_{i}|\geq\frac{|S_{i}|}{k} and every ySiy\in S^{\prime}_{i} has a strict ss-descendant zyPi1z_{y}\in P_{i-1} with at least mm children in Pi1P_{i-1}. Fix a path IyI_{y} of length ss connecting yy to zyz_{y}, and for 0rs0\leq r\leq s let zyrz^{r}_{y} denote the rrth element on the path IyI_{y} (so zy0=yz_{y}^{0}=y, zys=zyz_{y}^{s}=z_{y} and zyr+1z_{y}^{r+1} is a child of zyrz_{y}^{r}). Let Ir:={zyr:ySi}I^{r}:=\{z^{r}_{y}:y\in S^{\prime}_{i}\}, so I0=SiI^{0}=S^{\prime}_{i}. Then |Ir+1||Ir|d|I^{r+1}|\geq\frac{|I^{r}|}{d} (otherwise there is some element zIr+1z\in I^{r+1} which has at least d+1d+1 different parents in IrI^{r}, which would then form an antichain of size d+1d+1 contradicting local dd-linearity of PP). Hence

|Is||I0|ds|Si|dk1|Si|kdk1.|I^{s}|\geq\frac{|I^{0}|}{d^{s}}\geq\frac{|S^{\prime}_{i}|}{d^{k-1}}\geq\frac{|S_{i}|}{kd^{k-1}}.

Now by hypothesis every element in IsI^{s} has at least mm children in Pi1P_{i-1}, denote the set of all the children of the elements in IsI^{s} by Si1Pi1S_{i-1}\subseteq P_{i-1}. Then, again by dd-linearity, |Si1||Is|md|Si|mkdk|S_{i-1}|\geq\frac{|I^{s}|m}{d}\geq\frac{|S_{i}|m}{kd^{k}}. ∎

Thus if we choose m,tm,t so that

(mkdk)t>n1\left(\frac{m}{kd^{k}}\right)^{t}>n_{1}

then we will get a contradiction, unless PtP_{t} is empty. We conclude, for such mm and tt, that

L||d(k1)t|L|+(k1)mk1n1.\sum_{\ell\in L}|\ell|\leq d(k-1)t|L|+(k-1)m^{k-1}n_{1}.

If we take m:=(clog(100+n1)loglog(100+n1))1k1m:=\left(\frac{c\log(100+n_{1})}{\log\log(100+n_{1})}\right)^{\frac{1}{k-1}} and tt to be the integer part of clog(100+n1)loglog(100+n1)\frac{c\log(100+n_{1})}{\log\log(100+n_{1})}, and assume that cc is sufficiently large relatively to kk and dd, then the claim follows.

4.2. Reduction for dyadic rectangles

Definition 4.6.
  1. (1)

    Define a dyadic interval to be a half-open interval II of the form I=[s2t,(s+1)2t)I=[s2^{t},(s+1)2^{t}) for integers s,ts,t; we use |I|=2t|I|=2^{t} to denote the length of such an interval.

  2. (2)

    Define a dyadic box in d\mathbb{R}^{d} (dyadic rectangle when d=2d=2) to be a product I1××IdI_{1}\times\ldots\times I_{d} of dyadic intervals.

Note that if two dyadic intervals intersect, then one must be contained in the other.

Theorem 4.7.

Fix kk\in\mathbb{N}. Assume we have a collection PP of n1n_{1} points in 2\mathbb{R}^{2} and a collection RR of n2n_{2} dyadic rectangles in 2\mathbb{R}^{2}, with the property that the incidence graph contains no Kk,kK_{k,k}, and n=n1+n2n=n_{1}+n_{2}. Then the number of incidences (p,I×J)(p,I\times J) with pPp\in P and pI×JRp\in I\times J\in R is at most

Ok(nlog(100+n1)loglog(100+n1)).O_{k}\left(n\frac{\log(100+n_{1})}{\log\log(100+n_{1})}\right).
Proof.

Suppose that we have some nested dyadic rectangles D1D2DkD_{1}\supseteq D_{2}\supseteq\ldots\supseteq D_{k} in RR. As the incidence graph is Kk,kK_{k,k}-free by hypothesis, DkD_{k} may contain at most (k1)(k-1) points from PP. Removing all such rectangles repeatedly we loose only (k1)n2(k-1)n_{2} incidences, and thus may assume that any nested sequence in RR is of length at most k1k-1. In particular, any rectangle can be repeated at most k1k-1 times in RR. Then, possibly increasing the number of incidences by a multiple (k1)(k-1), we may assume that there are no repetitions in RR.

We now define a relation \leq on RR by declaring I×JI×JI\times J\leq I^{\prime}\times J^{\prime} if III\subseteq I^{\prime} and JJJ\supseteq J^{\prime}. This is a locally (k1)(k-1)-linear partial order (by the previous paragraph: antisymmetry holds as there are no repetitions in RR, and using that all rectangles are dyadic, any antichain of size kk inside an interval would give a nested sequence of rectangles of length kk).

For each point pp in PP, let p\ell_{p} be a subset of RR consisting of all those rectangles in RR that contain pp; then p\ell_{p} is a (k1)(k-1)-linear set (again, any antichain gives a nested sequence of rectangles of the same length). Finally, pRRpp\in R\iff R\in\ell_{p}, hence the collection {p:pP}\{\ell_{p}:p\in P\} is a Kk,kK_{k,k}-free arrangement and the claim now follows from Theorem 4.3 with d:=k1d:=k-1. ∎

Remark 4.8.

For a non-dyadic rectangle RR, let 0.99R0.99R denote the rectangle with the same center as R, but whose lengths and heights have been shrunk by a factor of 0.990.99. Define a “good incidence” to be a pair (p,R)(p,R) where pp is a point lying in 0.99R0.99R, not just in RR. Then the dyadic bound in Theorem 4.7 implies that for a family of arbitrary (not necessarily dyadic) rectangles with no Kk,kK_{k,k}’s, one still gets the O(nlognloglogn)O\left(\frac{n\log n}{\log\log n}\right)-type bound for the number of good incidences.

The reason is as follows. First we can randomly translate and dilate (non-isotropically, with the horizontal and vertical coordinates dilated separately) the configuration of points and rectangles by some translation parameter and a pair of dilation parameters (s,t)(s,t) for each of the coordinates. While there is no invariant probability measure on the space of dilatations, one can for instance pick a large number NN (much larger than the number of points and rectangles, etc.), dilate horizontally by a random dilation between 1/N1/N and NN (using say the dt/tdt/t Haar measure) making (with positive probability) the horizontal side length close to a power of two; then a vertical dilation will achieve a similar effect for the vertical side length; and then translate by a random amount in [N,N]2[-N,N]^{2} (chosen uniformly at random) placing the rectangle very close to a dyadic one with positive probability. If RR is a rectangle that is randomly dilated and translated this way, then with probability >1010>10^{-10}, there will be a dyadic rectangle RR^{\prime} stuck between RR and 0.99R0.99R. If the original rectangles have no Kk,kK_{k,k}, then neither will these new dyadic rectangles. The expected number of incidences amongst the dyadic rectangles is at least 101010^{-10} times the number of good incidences amongst the original rectangles. Hence any incidence bound we get on dyadic rectangles implies the corresponding bound for good incidences for non-dyadic rectangles (losing a factor of 101010^{10}).

5. A connection to model-theoretic linearity

In this section we obtain a stronger bound in Theorem 2.17 (without the logarithmic factor) under a stronger assumption that the whole semilinear relation XX is Kk,,kK_{k,\ldots,k}-free (Corollary 5.12). And we show that if this stronger bound doesn’t hold for a given semialgebraic relation, then the field operations can be recovered from this relation (see Corollary 5.14 for the precise statement). These results are deduced in Section 5.2 from a more general model-theoretic theorem proved in Section 5.1.

5.1. Main theorem

We recall some standard model-theoretic notation and definitions, and refer to [marker2006model] for a general introduction to model theory, and to [berenstein2012weakly] for further details on geometric structures.

Recall that acl\operatorname{acl} denotes the algebraic closure operator, i.e. if =(M,)\mathcal{M}=(M,\ldots) is a first-order structure, AMA\subseteq M and aa is a finite tuple in MM, then aacl(A)a\in\operatorname{acl}(A) if it belongs to some finite AA-definable subset of M|a|M^{|a|} (this generalizes linear span in vector spaces and algebraic closure in fields). Throughout this section we follow the standard model theoretic notation: depending on the context, writing BCBC denotes either the union of two subsets B,CB,C of MM, or the tuple obtained by concatenating the (possibly infinite) tuples B,CB,C of elements of MM.

Definition 5.1.

A complete first-order theory TT in a language \mathcal{L} is geometric if for any model =(M,)T\mathcal{M}=(M,\ldots)\models T we have the following.

  1. (1)

    The algebraic closure in \mathcal{M} satisfies the Exchange Principle:

    if a,ba,b are singletons in \mathcal{M}, AMA\subseteq M and bacl(A,a)acl(A)b\in\operatorname{acl}(A,a)\setminus\operatorname{acl}(A), then aacl(A,b)a\in\operatorname{acl}(A,b).

  2. (2)

    TT eliminates \exists^{\infty} quantifier:

    for every \mathcal{L}-formula φ(x,y)\varphi(x,y) with xx a single variable and yy a tuple of variables there exists some kk\in\mathbb{N} such that for every bM|y|b\in M^{|y|}, if φ(x,b)\varphi(x,b) has more than kk solutions in MM, then it has infinitely many solutions in MM.

In models of a geometric theory, the algebraic closure operator acl\operatorname{acl} gives rise to a matroid, and given aa a finite tuple in MM and AMA\subseteq M, dim(a/A)\dim(a/A) is the minimal cardinality of a subtuple aa^{\prime} of aa so that acl(aA)=acl(aA)\operatorname{acl}(a\cup A)=\operatorname{acl}(a^{\prime}\cup A) (in an algebraically closed field, this is just the transcendence degree of aa over the field generated by AA). Finally, given a finite tuple aa and sets C,BMC,B\subseteq M, we write aCBa\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{C}B to denote that dim(a/BC)=dim(a/C)\dim\left(a/BC\right)=\dim\left(a/C\right).

Remark 5.2.

If TT is geometric, then it is easy to check that \mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}} is an independence relation, i.e.  it satisfies the following properties for all tuples a,a,b,b,da,a^{\prime},b,b^{\prime},d and C,DMC,D\subseteq M:

  • aCbacl(a,C)Cacl(b,C)a\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{C}b\iff\operatorname{acl}(a,C)\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{C}\operatorname{acl}(b,C);

  • (extension) if aCba\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{C}b and dd is arbitrary, then there exists some aa^{\prime} so that aCbda^{\prime}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{C}bd and aCbaa^{\prime}\equiv_{Cb}a (which means that aa^{\prime} belongs to exactly the same CbCb-definable subsets of M|a|M^{|a|} as aa).

  • (monotonicity) aaCbbaCbaa^{\prime}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{C}bb^{\prime}\implies a\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{C}b;

  • (symmetry) aCbbCaa\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{C}b\implies b\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{C}a;

  • (transitivity) aDbbaDbba\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}bb^{\prime}\iff a\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{Db}b^{\prime} and aDba\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}b;

  • (non-degeneracy) if aCba\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{C}b and dacl(a,C)acl(b,C)d\in\operatorname{acl}(a,C)\cap\operatorname{acl}(b,C), then dacl(C)d\in\operatorname{acl}(C).

The following property expresses that the matroid defined by the algebraic closure is linear, in the sense that the closure operator behaves more like span in vector spaces, as opposed to algebraic closure in fields.

Definition 5.3.

[berenstein2012weakly, Definition 2.1] A geometric theory TT is weakly locally modular if for any saturated T\mathcal{M}\models T and A,BA,B small subsets of \mathcal{M} there exists some small set CABC\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{\emptyset}AB such that Aacl(AC)acl(BC)BA\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{\operatorname{acl}(AC)\cap\operatorname{acl}(BC)}B.

Recall that a linearly ordered structure =(M,<,)\mathcal{M}=(M,<,\ldots) is oo-minimal if every definable subset of MM is a finite union of intervals (see e.g.  [van1998tame]).

Example 5.4.

[berenstein2012weakly, Section 3.2] An oo-minimal structure is linear (i.e. any normal interpretable family of plane curves in TT has dimension 1\leq 1) if and only if it is weakly locally modular.

In particular, every theory of an ordered vector space over an ordered division ring is weakly locally modular (so Theorem 5.6 applies to semi-linear relations).

The following is a key model-theoretic lemma.

Lemma 5.5.

Assume that TT is geometric and weakly locally modular, and =(M,)T\mathcal{M}=(M,\ldots)\models T is 1\aleph_{1}-saturated. Assume that EMd1××MdrE\subseteq M^{d_{1}}\times\ldots\times M^{d_{r}} is an rr-ary relation defined by a formula with parameters in a finite tuple bb, and EE contains no rr-grid A=i[r]AiA=\prod_{i\in[r]}A_{i} with each AiMdiA_{i}\subseteq M^{d_{i}} infinite. Then for any (a1,,ar)E(a_{1},\ldots,a_{r})\in E there exists some i[r]i\in[r] so that aiacl({aj:j[r]{i}},b)a_{i}\in\operatorname{acl}\left(\left\{a_{j}:j\in[r]\setminus\{i\}\right\},b\right).

Proof of Lemma 5.5.

Assume not, then there exist some (a1,,ar)(a_{1},\ldots,a_{r}) in \mathcal{M} such that (a1,,ar)E(a_{1},\ldots,a_{r})\in E, but aiacl(ai,b)a_{i}\notin\operatorname{acl}\left(a_{\neq i},b\right) for every i[r]i\in[r], where ai:={aj:j[r]{i}}a_{\neq i}:=\left\{a_{j}:j\in[r]\setminus\{i\}\right\}.

By weak local modularity, for each i[r]i\in[r] there exists some small set CiC_{i}\subseteq\mathcal{M} so that

Ci{a1,,ar}{b} and aiacl(ai,Ci)acl(ai,b,Ci)aib.C_{i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{\emptyset}\left\{a_{1},\ldots,a_{r}\right\}\cup\{b\}\textrm{ and }a_{i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{\operatorname{acl}(a_{i},C_{i})\cap\operatorname{acl}(a_{\neq i},b,C_{i})}a_{\neq i}b.

By extension of \mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}, we may assume that Cia1,,ar,b,C<iC_{i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{\emptyset}a_{1},\ldots,a_{r},b,C_{<i} for all i[r]i\in[r]. Hence by transitivity Ca1,,ar,bC\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{\emptyset}a_{1},\ldots,a_{r},b, where C:=i[r]CiC:=\bigcup_{i\in[r]}C_{i}.

Let D:=i[r]acl(ai,b,C)D:=\bigcap_{i\in[r]}\operatorname{acl}\left(a_{\neq i},b,C\right).

Claim (A).

For every i[r]i\in[r], aiDaia_{i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}a_{\neq i}.

Proof.

Fix i[r]i\in[r]. As Ca1,,ar,bC\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{\emptyset}a_{1},\ldots,a_{r},b and aiacl(ai,Ci)acl(ai,b,Ci)aiba_{i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{\operatorname{acl}(a_{i},C_{i})\cap\operatorname{acl}(a_{\neq i},b,C_{i})}a_{\neq i}b, by symmetry and transitivity we have

aiacl(ai,Ci)acl(ai,b,Ci)aibC.a_{i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{\operatorname{acl}(a_{i},C_{i})\cap\operatorname{acl}(a_{\neq i},b,C_{i})}a_{\neq i}bC.

Note that acl(ai,Ci)acl(aj,C)\operatorname{acl}(a_{i},C_{i})\subseteq\operatorname{acl}(a_{\neq j},C) for every ij[r]i\neq j\in[r], hence acl(ai,Ci)acl(ai,b,Ci)D\operatorname{acl}(a_{i},C_{i})\cap\operatorname{acl}(a_{\neq i},b,C_{i})\subseteq D, and clearly Dacl(ai,b,C)D\subseteq\operatorname{acl}(a_{\neq i},b,C). Hence aiDaibCa_{i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}a_{\neq i}bC, and in particular aiDaia_{i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}a_{\neq i}. ∎

Claim (B).

For every i[r]i\in[r], aiacl(D)a_{i}\notin\operatorname{acl}(D).

Proof.

Fix i[r]i\in[r]. Then acl(D)acl(ai,b,C)\operatorname{acl}(D)\subseteq\operatorname{acl}(a_{\neq i},b,C) by definition. But as CaibaiC\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{a_{\neq i}b}a_{i} by transitivity, if aiacl(ai,b,C)a_{i}\in\operatorname{acl}(a_{\neq i},b,C) then we would get aiacl(ai,b)a_{i}\in\operatorname{acl}(a_{\neq i},b), contradicting the assumption. ∎

By induction we will choose sequences of tuples α¯i=(ait)t,i[r]\bar{\alpha}_{i}=(a_{i}^{t})_{t\in\mathbb{N}},i\in[r] in \mathcal{M} such that for every i[r]i\in[r] we have:

  1. (1)

    aitDα¯<ia>iaia^{t}_{i}\equiv_{D\bar{\alpha}_{<i}a_{>i}}a_{i} for all tt\in\mathbb{N};

  2. (2)

    aitaisa^{t}_{i}\neq a_{i}^{s} (as tuples) for all sts\neq t\in\mathbb{N};

  3. (3)

    α¯iDα¯<ia>i\bar{\alpha}_{i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}\bar{\alpha}_{<i}a_{>i}.

Fix i[r]i\in[r], and assume that we already chose some sequences a¯j\bar{a}_{j} for 1j<i1\leq j<i satisfying (1)–(3).

Claim (C).

We have aiDα¯<ia>ia_{i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}\bar{\alpha}_{<i}a_{>i}.

Proof.

If i=1i=1, this claim becomes aiDaia_{i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}a_{\neq i}, hence holds by Claim (A). So assume i2i\geq 2. We will show by induction that for each l=1,,i1l=1,\ldots,i-1 we have

α¯i1α¯ilDα¯<ila>i1.\bar{\alpha}_{i-1}\ldots\bar{\alpha}_{i-l}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}\bar{\alpha}_{<i-l}a_{>i-1}.

For l=1l=1 this is equivalent to α¯i1Dα¯<i1a>i1\bar{\alpha}_{i-1}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}\bar{\alpha}_{<i-1}a_{>i-1}, which holds by (3) for i1i-1. So we assume this holds for l<i1l<i-1, that is we have α¯i1α¯ilDα¯<ila>i1\bar{\alpha}_{i-1}\ldots\bar{\alpha}_{i-l}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}\bar{\alpha}_{<i-l}a_{>i-1}, and show it for l+1l+1. By assumption and transitivity we have

α¯i1α¯ilDα¯i(l+1)α¯<i(l+1)a>i1.\bar{\alpha}_{i-1}\ldots\bar{\alpha}_{i-l}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D\bar{\alpha}_{i-(l+1)}}\bar{\alpha}_{<i-(l+1)}a_{>i-1}.

Also α¯i(l+1)Dα¯<i(l+1)a>i1\bar{\alpha}_{i-(l+1)}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}\bar{\alpha}_{<i-(l+1)}a_{>i-1} by (3) for i(l+1)<ii-(l+1)<i. Then by transitivity again α¯i1α¯ilα¯i(l+1)Dα¯<i(l+1)a>i1\bar{\alpha}_{i-1}\ldots\bar{\alpha}_{i-l}\bar{\alpha}_{i-(l+1)}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}\bar{\alpha}_{<i-(l+1)}a_{>i-1}, which concludes the inductive step.

In particular, for l=i1l=i-1 we get α¯<iDa>i1\bar{\alpha}_{<i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}a_{>i-1}, that is α¯<iDaia>i\bar{\alpha}_{<i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}a_{i}a_{>i}. By transitivity and Claim (A) this implies α¯<ia>iDai\bar{\alpha}_{<i}a_{>i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}a_{i}, and we conclude by symmetry. ∎

Using Claim (C) and extension of \mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}, we can choose a sequence α¯i=(ait)t\bar{\alpha}_{i}=(a^{t}_{i})_{t\in\mathbb{N}} so that aitDα¯<ia>iaia^{t}_{i}\equiv_{D\bar{\alpha}_{<i}a_{>i}}a_{i} and aitDα¯<ia>iai<ta^{t}_{i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}\bar{\alpha}_{<i}a_{>i}a_{i}^{<t} for every tt\in\mathbb{N}. By Claim (B) we have aiacl(D)a_{i}\notin\operatorname{acl}(D), hence aitacl(D)a_{i}^{t}\notin\operatorname{acl}(D), hence aitacl(α¯<i,a>i,ai<t)a^{t}_{i}\notin\operatorname{acl}\left(\bar{\alpha}_{<i},a_{>i},a_{i}^{<t}\right), so in particular all the tuples (ait)t(a^{t}_{i})_{t\in\mathbb{N}} are pairwise-distinct and α¯i\bar{\alpha}_{i} satisfies (1) and (2). By symmetry and transitivity of \mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}} we get α¯iDα¯<ia>i\bar{\alpha}_{i}\mathop{\mathchoice{\displaystyle\kern 5.71527pt\hbox to0.0pt{\hss$\displaystyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\displaystyle\smile$\hss}\kern 5.71527pt}{\textstyle\kern 5.71527pt\hbox to0.0pt{\hss$\textstyle\mid$\hss}\lower 3.87495pt\hbox to0.0pt{\hss$\textstyle\smile$\hss}\kern 5.71527pt}{\scriptstyle\kern 2.80048pt\hbox to0.0pt{\hss$\scriptstyle\mid$\hss}\lower 1.89871pt\hbox to0.0pt{\hss$\scriptstyle\smile$\hss}\kern 2.80048pt}{\scriptscriptstyle\kern 1.42882pt\hbox to0.0pt{\hss$\scriptscriptstyle\mid$\hss}\lower 0.96873pt\hbox to0.0pt{\hss$\scriptscriptstyle\smile$\hss}\kern 1.42882pt}}_{D}\bar{\alpha}_{<i}a_{>i}. This concludes the inductive step in the construction of the sequences.

Finally, as (1) holds for all i[r]i\in[r] and bb is contained in DD, it follows that (a1t1,,artr)b(a1,,ar)(a^{t_{1}}_{1},\ldots,a^{t_{r}}_{r})\equiv_{b}(a_{1},\ldots,a_{r}), and hence (a1t1,,artr)E(a^{t_{1}}_{1},\ldots,a^{t_{r}}_{r})\in E for every (t1,,tr)r(t_{1},\ldots,t_{r})\in\mathbb{N}^{r}. By (1) each of the sets {ait:t},i[r]\left\{a^{t}_{i}:t\in\mathbb{N}\right\},i\in[r] is infinite — contradicting the assumption on EE. This concludes the proof of the lemma. ∎

Theorem 5.6.

Assume that TT is a geometric, weakly locally modular theory, and T\mathcal{M}\models T. Assume that r2r\in\mathbb{N}_{\geq 2} and φ(x¯1,,x¯r,y¯)\varphi(\bar{x}_{1},\ldots,\bar{x}_{r},\bar{y}) is an \mathcal{L}-formula without parameters, with |x¯i|=di,|y¯|=e|\bar{x}_{i}|=d_{i},|\bar{y}|=e. Then there exists some α=α(φ)>0\alpha=\alpha(\varphi)\in\mathbb{R}_{>0} satisfying the following.

Given bMeb\in M^{e}, consider the rr-ary relation

Eb:={(a1,,ar)Md1××Mdr:φ(a1,,ar,b)}.E_{b}:=\left\{(a_{1},\ldots,a_{r})\in M^{d_{1}}\times\ldots\times M^{d_{r}}:\mathcal{M}\models\varphi(a_{1},\ldots,a_{r},b)\right\}.

Then for every bMeb\in M^{e}, exactly one of the following two cases must occur:

  1. (1)

    EbE_{b} is not Kk,,kK_{k,\ldots,k}-free for any kk\in\mathbb{N};

  2. (2)

    for any finite rr-grid Bi[r]MdiB\subseteq\prod_{i\in[r]}M^{d_{i}} we have

    |EbB|αδr1r(B).|E_{b}\cap B|\leq\alpha\delta^{r}_{r-1}(B).
Proof.

Assume that 𝒩=(N,)\mathcal{N}=(N,\ldots) is an elementary extension of \mathcal{M} and bMeb\in M^{e}. Then, for a fixed kk\in\mathbb{N},

Eb={(a1,,ar)Md1××Mdr:φ(a1,,ar,b)}E_{b}=\{(a_{1},\ldots,a_{r})\in M^{d_{1}}\times\ldots\times M^{d_{r}}:\mathcal{M}\models\varphi(a_{1},\ldots,a_{r},b)\}

is Kk,,kK_{k,\ldots,k}-free if and only if

Eb={(a1,,ar)Nd1××Ndr:𝒩φ(a1,,ar,b)}E^{\prime}_{b}=\{(a_{1},\ldots,a_{r})\in N^{d_{1}}\times\ldots\times N^{d_{r}}:\mathcal{N}\models\varphi(a_{1},\ldots,a_{r},b)\}

is Kk,,kK_{k,\ldots,k}-free, as this can be expressed by a first-order formula ψ(y)\psi(y) and ψ(b)𝒩ψ(b)\mathcal{M}\models\psi(b)\iff\mathcal{N}\models\psi(b). Similarly, for a fixed α\alpha\in\mathbb{R}, |EbB|αδr1r(B)|E_{b}\cap B|\leq\alpha\delta^{r}_{r-1}(B) for every finite rr-grid Bi[r]MdiB\subseteq\prod_{i\in[r]}M^{d_{i}} if and only if |EbB|αδr1r(B)|E^{\prime}_{b}\cap B|\leq\alpha\delta^{r}_{r-1}(B) for every finite rr-grid Bi[r]NdiB\subseteq\prod_{i\in[r]}N^{d_{i}} (as for every fixed sizes of B1,,BrB_{1},\ldots,B_{r} this condition can be expressed by a first-order formula). Hence, passing to an elementary extension, we may assume that \mathcal{M} is 1\aleph_{1}-saturated.

As TT eliminates \exists^{\infty}, there exists some m=m(φ)m=m(\varphi)\in\mathbb{N} such that for any i[r]i\in[r], bMeb\in M^{e} and tuple a¯:=(ajMdj:j[r]{i})\bar{a}:=\left(a_{j}\in M^{d_{j}}:j\in[r]\setminus\{i\}\right), the fiber

Ea¯;bi:={aMdi:φ(a1,,ai1,a,ai+1,,ar;b)}E^{i}_{\bar{a};b}:=\left\{a^{*}\in M^{d_{i}}:\mathcal{M}\models\varphi(a_{1},\ldots,a_{i-1},a^{*},a_{i+1},\ldots,a_{r};b)\right\}

is finite if and only if it has size m\leq m.

Given an arbitrary bMeb\in M^{e} such that EbE_{b} is Kk,,kK_{k,\ldots,k}-free, Lemma 5.5 and compactness imply that for every tuple (a1,,ar)Eb(a_{1},\ldots,a_{r})\in E_{b}, there exists some i[r]i\in[r] such that the fiber Ea¯;biE^{i}_{\bar{a};b} is finite, hence |Ea¯;bi|m|E^{i}_{\bar{a};b}|\leq m. This easily implies that for any finite rr-grid Bi[r]MdiB\subseteq\prod_{i\in[r]}M^{d_{i}} we have |EbB|mδr1r(B)|E_{b}\cap B|\leq m\delta^{r}_{r-1}(B). ∎

Remark 5.7.

In the binary case, a similar observation was made by Evans in the context of certain stable theories [evans2005trivial, Proposition 3.1].

Restricting to distal structures, we can relax the assumption “EbE_{b} is Kk,,kK_{k,\ldots,k}-free for some kk” to “EbE_{b} does not contain a direct product of infinite sets” in Theorem 5.6 (we refer to e.g. the introduction in [chernikov2018regularity] or [chernikov2020cutting] for a general discussion of model-theoretic distality and its connections to combinatorics).

Corollary 5.8.

Assume that TT is a distal, geometric, weakly locally modular theory, T\mathcal{M}\models T, r2r\in\mathbb{N}_{\geq 2} and φ(x¯1,,x¯r,y¯)\varphi(\bar{x}_{1},\ldots,\bar{x}_{r},\bar{y}) is an \mathcal{L}-formula without parameters, with |x¯i|=di,|y¯|=e|\bar{x}_{i}|=d_{i},|\bar{y}|=e. Then there exists some α=α(φ)>0\alpha=\alpha(\varphi)\in\mathbb{R}_{>0} satisfying the following.

Assume that bMeb\in M^{e} and the rr-ary relation EbE_{b} does not contain an rr-grid A=i[r]AiA=\prod_{i\in[r]}A_{i} with each AiMdiA_{i}\subseteq M^{d_{i}} infinite. Then |EbB|αδr1r(B)|E_{b}\cap B|\leq\alpha\delta^{r}_{r-1}(B) for any finite rr-grid BB.

Proof.

By [chernikov2020cutting, Theorem 5.12], if \mathcal{M} is a distal structure with elimination of \exists^{\infty}, then there exists some k=k(φ)k=k(\varphi)\in\mathbb{N} such that for every bMeb\in M^{e}, EbE_{b} is not Kk,,kK_{k,\ldots,k}-free if and only if i[r]AiEb\prod_{i\in[r]}A_{i}\subseteq E_{b} for some infinite AiMdiA_{i}\subseteq M^{d_{i}}. The conclusion now follows by Theorem 5.6. ∎

Remark 5.9.

Weaker bounds for non-cartesian relations definable in arbitrary distal theories are established in [chernikov2018model, chernikov2021model].

Now we show that in the oo-minimal case, this result actually characterizes weak local modularity. By the trichotomy theorem in oo-minimal structures [peterzil1998trichotomy] we have the following equivalence.

Fact 5.10.

Let \mathcal{M} be an oo-minimal (1\aleph_{1}-)saturated structure. The following are equivalent:

  • \mathcal{M} is not linear (see Example 5.4);

  • \mathcal{M} is not weakly locally modular;

  • there exists a real closed field definable in \mathcal{M}.

Corollary 5.11.

Let \mathcal{M} be an oo-minimal structure. The following are equivalent:

  1. (1)

    \mathcal{M} is weakly locally modular;

  2. (2)

    Corollary 5.8 holds in \mathcal{M};

  3. (3)

    for every d1,d2d_{1},d_{2}\in\mathbb{N} and every definable (with parameters) XMd1×Md2X\subseteq M^{d_{1}}\times M^{d_{2}}, if XX is Kk,kK_{k,k}-free for some kk\in\mathbb{N}, then there exist some β<43\beta<\frac{4}{3} and α\alpha such that: for any nn and BiMdiB_{i}\subseteq M^{d_{i}} with |Bi|=n|B_{i}|=n we have

    |XB1×B2|αnβ;|X\cap B_{1}\times B_{2}|\leq\alpha n^{\beta};
  4. (4)

    there is no infinite field definable in \mathcal{M}.

Proof.

(1) \Rightarrow (2) by Corollary 5.8, and (2) \Rightarrow (3) is obvious.

(3) \Rightarrow (4) Assume that \mathcal{R} is an infinite field definable in \mathcal{M}, char()=0\textrm{char}(\mathcal{R})=0 by oo-minimality. Then the point-line incidence relation on 2\mathcal{R}^{2} corresponds to a K2,2K_{2,2}-free definable relation Ed×dE\subseteq\mathcal{M}^{d}\times\mathcal{M}^{d} for some dd. By the standard lower bound for Szemerédi-Trotter, the number of incidences satisfies Ω(n4/3)\Omega(n^{4/3}), hence EE cannot satisfy (3).

(4) \Rightarrow (1) If \mathcal{M} is not weakly locally modular, by Fact 5.10 a real closed field \mathcal{R} is definable in \mathcal{M}. ∎

5.2. Applications to semialgebraic relations

Corollary 5.12.

Assume that Xd=i[r]diX\subseteq\mathbb{R}^{d}=\prod_{i\in[r]}\mathbb{R}^{d_{i}} is semilinear and XX does not contain a direct product of rr infinite sets (e.g. if XX is Kk,,kK_{k,\ldots,k}-free for some kk). Then there exists some α=α(X)\alpha=\alpha(X) so that for any rr-hypergraph HH of the form (V1,,Vr;Xi[r]Vi)\left(V_{1},\ldots,V_{r};X\cap\prod_{i\in[r]}V_{i}\right) for some finite VidiV_{i}\subseteq\mathbb{R}^{d_{i}}, with i=1r|Vi|=n\sum_{i=1}^{r}|V_{i}|=n, we have |E|αnr1|E|\leq\alpha n^{r-1}.

Proof.

As every oo-minimal structure is distal and every semilinear relation is definable in an ordered vector space over \mathbb{R} which is oo-minimal and locally modular (Example 5.4), the result follows by Corollary 5.8. ∎

We recall the following special case of the trichotomy theorem in o-minimal structures restricted to semialgebraic relations.

Fact 5.13.

[marker1992additive, Theorem 1.3] Let XnX\subseteq\mathbb{R}^{n} be a semialgebraic but not semilinear set. Then ×[0,1]2\times\restriction_{[0,1]^{2}} (i.e. the graph of multiplication restricted to the unit box) is definable in the first-order structure (,<,+,X)(\mathbb{R},<,+,X).

Using it, we have the following more explicit variant of Corollary 5.11 in the semialgebraic case.

Corollary 5.14.

Let XdX\subseteq\mathbb{R}^{d} be a semialgebraic set, and consider the first-order structure =(,<,+,X)\mathcal{M}=(\mathbb{R},<,+,X). Then the following are equivalent.

  1. (1)

    For any rr\in\mathbb{N} and any rr-ary relation Yi[r]diY\subseteq\prod_{i\in[r]}\mathbb{R}^{d_{i}} not containing an rr-grid A=i[r]AiA=\prod_{i\in[r]}A_{i} with each AidiA_{i}\subseteq\mathbb{R}^{d_{i}} infinite, there exists some α\alpha\in\mathbb{R} so that |YB|αδr1r(B)|Y\cap B|\leq\alpha\delta^{r}_{r-1}(B) for every finite rr-grid BB.

  2. (2)

    For every d1,d2d_{1},d_{2}\in\mathbb{N} and Yd1×d2Y\subseteq\mathbb{R}^{d_{1}}\times\mathbb{R}^{d_{2}} definable (with parameters) in \mathcal{M}, if YY is Kk,kK_{k,k}-free for some kk\in\mathbb{N}, then there exist some β<43\beta<\frac{4}{3} and α\alpha such that: for any nn and BidiB_{i}\subseteq\mathbb{R}^{d_{i}} with |Bi|=n|B_{i}|=n we have

    |XB1×B2|αnβ.|X\cap B_{1}\times B_{2}|\leq\alpha n^{\beta}.
  3. (3)

    ×[0,1]2\times\restriction_{[0,1]^{2}} is not definable in \mathcal{M}.

Proof.

(1) \Rightarrow (2) is obvious.

(2) \Rightarrow (3) Using ×[0,1]2\times\restriction_{[0,1]^{2}}, the K2,2K_{2,2}-free point-line incidence relation in 2\mathbb{R}^{2} is definable restricted to [0,1]2[0,1]^{2}, and the standard configurations witnessing the lower bound in Szemerédi-Trotter can be scaled down to the unit box.

(3) \Rightarrow (1) Assume that (1) does not hold in (,<,+,X)(\mathbb{R},<,+,X). Then necessarily some YY definable in (,<,+,X)(\mathbb{R},<,+,X) is not semilinear by Corollary 5.12. By Fact 5.13, if YY is not semilinear then ×[0,1]2\times\restriction_{[0,1]^{2}} is definable in the structure (,<,+,Y)(\mathbb{R},<,+,Y), hence in (,<,+,X)(\mathbb{R},<,+,X). ∎

References