This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Discordant sets and ergodic Ramsey theory

Vitaly Bergelson Department of Mathematics, Ohio State University, Columbus, OH 43210, USA [email protected] Jake Huryn Department of Mathematics, Ohio State University, Columbus, OH 43210, USA [email protected]  and  Rushil Raghavan Department of Mathematics, Ohio State University, Columbus, OH 43210, USA [email protected]
Abstract.

We explore the properties of non-piecewise syndetic sets with positive upper density, which we call discordant, in countably infinite amenable (semi)groups. Sets of this kind are involved in many questions of Ramsey theory and manifest the difference in complexity between the classical van der Waerden’s theorem and Szemerédi’s theorem. We generalize and unify old constructions and obtain new results about these historically interesting sets. Along the way, we draw from various corners of mathematics, including classical Ramsey theory, ergodic theory, number theory, and topological and symbolic dynamics.

2020 Mathematics Subject Classification:
05D10, 37A44

1. Introduction

Let AA be a subset of the set \mathbb{N} of positive integers. We begin by defining some notions of largeness that can be imposed on AA.

  • AA is an IP set if it contains a finite sums set, i.e., a set of the form

    FS((an)n)={iIai:I is finite and nonempty}\operatorname{FS}((a_{n})_{n\in\mathbb{N}})={\left\{\sum_{i\in I}a_{i}\colon\text{$I\subseteq\mathbb{N}$ is finite and nonempty}\right\}}

    for some infinite sequence (an)n(a_{n})_{n\in\mathbb{N}} of positive integers.

  • AA is thick if it contains arbitrarily long intervals: for each \ell\in\mathbb{N}, there exists aa\in\mathbb{N} such that {a,a+1,,a+1}A\{a,a+1,\dots,a+\ell-1\}\subseteq A.

  • AA is syndetic if its gaps are bounded: this means that, if we write A={a1<a2<}A=\{a_{1}<a_{2}<\cdots\}, the successive difference ai+1aia_{i+1}-a_{i} is bounded. Equivalently, AA is syndetic if there exist t1,,trt_{1},\dots,t_{r}\in\mathbb{N} such that =i=1rAti\mathbb{N}=\bigcup_{i=1}^{r}A-t_{i}, where At={x:x+tA}A-t=\{x\in\mathbb{N}\colon x+t\in A\}.

  • AA is piecewise syndetic if it can be written as the intersection of a thick set with a syndetic set. Equivalently, AA is piecewise syndetic if there exist t1,,trt_{1},\dots,t_{r}\in\mathbb{N} for which i=1rAti\bigcup_{i=1}^{r}A-t_{i} is thick.

  • The upper density of AA is

    d¯(A)=lim supn|A{1,,n}|n.\overline{d}(A)=\limsup_{n\to\infty}\frac{|A\cap\{1,\dots,n\}|}{n}.

    When the limit in question exists, this quantity is called the density of AA, denoted by d(A)d(A). A set should be thought of as large in terms of (upper) density if this quantity is positive.

Although we have called each of the above properties a “notion of largeness”, they are not on equal footing. For all these definitions, \mathbb{N} is large, and if AA has a large subset, then AA is itself large. However, only some of these notions of largeness are partition regular, meaning that if AA is large then, for any finite partition A=i=1rCiA=\bigcup_{i=1}^{r}C_{i} into disjoint sets, at least one CiC_{i} is large. This third property underlies much of classical Ramsey theory. The partition regularity of IP sets is a famous theorem of Hindman [28], and the partition regularity of piecewise syndeticity is a classical fact, often rediscovered (see, e.g., [29, Lemma 2.4] and [22, Theorem 1.24]).111A related fact, that if =i=1rCi\mathbb{N}=\bigcup_{i=1}^{r}C_{i} then at least one CiC_{i} is piecewise syndetic, appears to have been first observed by Brown [15, Lemma 1]. The partition regularity of positive upper density is an easy exercise for the reader, as is checking that thickness and syndeticity fail to be partition regular.

One of the earliest partition results of Ramsey theory is van der Waerden’s theorem, which says that given any finite partition =i=1rCi\mathbb{N}=\bigcup_{i=1}^{r}C_{i}, at least one CiC_{i} is AP-rich, i.e., contains arithmetic progressions of any finite cardinality [35]. A useful equivalent form of this theorem states that the family of AP-rich sets is partition regular. Van der Waerden’s theorem has a corresponding density version, Szemerédi’s theorem [34], one of whose many equivalent formulations guarantees that any set having positive upper density is AP-rich.

In seeking a density version of Hindman’s theorem, Erdős asked whether d(A)>0d(A)>0 is enough to imply that AA contains a shift of a finite sums set. (Of course we cannot ask that AA contain a finite sums set; consider, e.g., the set of odd numbers.) This question was answered negatively in unpublished work of E. Straus, who showed that, for any ε>0\varepsilon>0, one can find AA\subseteq\mathbb{N} with d(A)1εd(A)\geq 1-\varepsilon such that AnA-n is not an IP set for any nn\in\mathbb{Z}. In [19, p. 105], Erdős writes the following:

I first hoped that Hindman’s theorem on all subsums εiai\sum\varepsilon_{i}a_{i} can be generalised for sets of positive density, but the following example of E. Straus seems to give a counterexample to all such attempts: Let p1<p2<p_{1}<p_{2}<\cdots be a set of primes tending to infinity fast, and let εi<\sum\varepsilon_{i}<\infty. Consider the set of integers a1<a2<a_{1}<a_{2}<\cdots so that aiα(modpi)a_{i}\not\equiv\alpha\pmod{p_{i}} where |α|εipi|\alpha|\leq\varepsilon_{i}p_{i}. The density of this sequence is i(1εi)>0\prod_{i}(1-\varepsilon_{i})>0, and it is not difficult to see that this sequence furnishes the required counterexample.

See [4, Theorem 2.20] for more discussion and a detailed presentation of Straus’ construction.

Recall that if AA is piecewise syndetic, then a union of finitely many shifts of AA is thick. Since thick sets are obviously AP-rich, it follows from van der Waerden’s theorem that one of the shifts of AA (and hence AA itself) is AP-rich. This tempts us to ask, in analogy with Erdős’ question, whether d(A)>0d(A)>0 implies that AA is piecewise syndetic: a positive answer to this question would mean that we have just proven Szemerédi’s theorem. However, Straus’ construction again provides a counterexample; it has positive density but is not piecewise syndetic. Indeed, it is an easy exercise to show that any thick set is IP, so any piecewise syndetic set contains a shift of an IP set by Hindman’s theorem. Thus Straus’ example cannot be piecewise syndetic.

It is counterexamples of the above kind we are interested in—those sets which are non-piecewise syndetic but have positive (upper) density. We will call such sets discordant. Another classical example of a discordant set is the set of squarefree numbers. However, this set has shifts which are IP [12, Theorem 2.8], and so has a different nature than Straus’ counterexample. More constructions of discordant sets can be found in [13] along with applications to Ramsey theory. A fruitful analogy can be drawn between discordant sets and nowhere dense sets of positive measure, both being small in a “topological” sense but large in an “analytic” sense. This analogy will be developed and more closely investigated in the following sections.

We will now more closely examine the relationship between van der Waerden’s theorem, Szemerédi’s theorem, and discordant sets. We observed above, using van der Waerden’s theorem, that any piecewise syndetic set is AP-rich. On the other hand, given that every piecewise syndetic set is AP-rich, the partition regularity of piecewise syndeticity tells us that, for any finite partition =i=1rCi\mathbb{N}=\bigcup_{i=1}^{r}C_{i}, there is some ii for which CiC_{i} is AP-rich. Thus van der Waerden’s theorem is equivalent, in this informal sense, to the fact that every piecewise syndetic set is AP-rich. Because of this, the existence of discordant sets can be interpreted as a sign that Szemerédi’s infamously difficult theorem is deeper than van der Waerden’s.222Another equivalent form of Szemerédi’s theorem says that any subset of \mathbb{N} having positive upper Banach density is AP-rich. In contrast to density, for which one averages along the intervals {1,,n}\{1,\dots,n\}, the upper Banach density of a set AA is defined to be the supremum of all densities of AA averaged along any sequence of intervals of increasing length. Any piecewise syndetic set has positive upper Banach density, a fact which clarifies the relationship between van der Waerden’s and Szemerédi’s theorems and strengthens the perspective we take here. The striking abundance of contexts in which discordant sets arise, some of which will be provided in the ensuing sections, can be taken as evidence for the versatility of Szemerédi’s theorem and strengthens our interest in discordant sets.

The distinction between van der Waerden’s theorem and Szemerédi’s theorem can also be profitably analyzed from the point of view of dynamics and ergodic theory. Indeed, as initiated by Furstenberg?s ergodic-theoretic proof of Szemerédi?s theorem [21], one can elucidate the properties of a subset of \mathbb{N} by examining the dynamical and ergodic properties of the orbit closure of its indicator function. To be more explicit, given AA\subseteq\mathbb{N}, consider its indicator function 𝟏A{0,1}\mathbf{1}_{A}\in\{0,1\}^{\mathbb{N}}, where we view {0,1}\{0,1\}^{\mathbb{N}} as having the product topology (making {0,1}\{0,1\}^{\mathbb{N}} homeomorphic to the Cantor set). Then, letting T:{0,1}{0,1}T\colon\{0,1\}^{\mathbb{N}}\to\{0,1\}^{\mathbb{N}} denote the left-shift map (i.e., T(a1,a2,)=(a2,a3,))T(a_{1},a_{2},\dots)=(a_{2},a_{3},\dots)), define the orbit closure of 𝟏A\mathbf{1}_{A} as

𝒪(𝟏A)={Tn(𝟏A):n}¯.\mathcal{O}(\mathbf{1}_{A})=\overline{\{T^{n}(\mathbf{1}_{A})\colon n\in\mathbb{N}\}}.

The resulting topological dynamical system (𝒪(𝟏A),T)\mathcal{O}(\mathbf{1}_{A}),T) can be made into a measure-preserving system. In this setting, the difference in difficulty between van der Waerden’s and Szemerédi’s theorems is indicated by the existence or nonexistence of nontrivial minimal subsystems of 𝒪(𝟏A)\mathcal{O}(\mathbf{1}_{A}). In the presence of a nontrivial minimal subsystem, a purely topological-dynamical recurrence theorem, the topological analogue to van der Waerden’s theorem, can be used to obtain a streamlined proof of the classical van der Waerden theorem [23, Theorem 0.3]. On the other hand, when 𝒪(𝟏A)\mathcal{O}(\mathbf{1}_{A}) does not possess nontrivial minimal subsystems, a considerably more difficult proof via measurable dynamics is necessary to prove Szemerédi’s theorem. In particular, those sets AA\subseteq\mathbb{N} for which 𝒪(𝟏A)\mathcal{O}(\mathbf{1}_{A}) has a nontrivial minimal subsystem are exactly the piecewise syndetic sets (Theorem 45). This dynamical and ergodic approach leads not only to a proof of these classical theorems but also to far-reaching generalizations which still elude purely combinatorial proofs, such as versions of the polynomial Szemerédi theorem (e.g. [10, Theorem 1.1] and [11, Theorem 7.3]).

Finally, because the existence of discordant sets is a nontrivial byproduct of the interplay between the notions of density and piecewise syndeticity, both very basic definitions of Ramsey theory, these special sets are particularly nice vehicles for advancing Ramsey theory in different settings and showing the bountiful connections between Ramsey theory and other areas of mathematics. Therefore our paper is not only about discordant sets but also is meant as an accessible (but admittedly very non-exhaustive) demonstration of the diversity of Ramsey theory.

Outline of the paper

This paper concerns itself with the study of subsets of countably infinite semigroups, so some definitions must be introduced to allow us to work in this generality. This is done in Section 2, where we also provide the necessary prerequisites from Ramsey theory, ergodic theory, and topological dynamics.

In Section 3 we give an ergodic-theoretic construction of discordant sets that can be done in many semigroups, generalizing a construction based on irrational rotation of the torus /\mathbb{R}/\mathbb{Z}. In particular, we prove that the sets of return times to certain nowhere dense sets of positive measure form discordant sets. This section lays the foundation for the analogy between nowhere dense sets of positive measure and discordant sets, which we revisit and reexamine several times.

In Section 4, we prove in two ways that the squarefree numbers are discordant. The first proof is purely algebraic while the second uses the dynamical results of the previous section. We also give several wide-ranging generalizations of this fact.

Our view narrows in Section 5, where we take a brief excursion to SL2()\operatorname{SL}_{2}(\mathbb{Z}). Although the classical notions of density fail to exist in this group since SL2()\operatorname{SL}_{2}(\mathbb{Z}) is not amenable, we define a natural analogue of density which allows us to construct discordant sets using the results of the preceding section.

In Section 6, we study a special kind of discordant set which is particularly bad for recurrence in measure-preserving systems. We establish the existence of such discordant sets with density cc for each c(0,1)c\in(0,1) for any countably infinite abelian group and also prove the existence of discordant sets with density cc for each c(0,1)c\in(0,1) in any countably infinite commutative cancellative semigroup.

We shift focus in Section 7. First, we generalize the notion of a disjunctive {0,1}\{0,1\}-sequence, one in which every finite {0,1}\{0,1\}-word occurs, to an arbitrary countably infinite semigroup, and show that such sequences are “topologically typical” in the sense that the family of such sequences is comeager. This allows us to state that, in cancellative semigroups, non-piecewise syndetic sets (and hence discordant sets) are topologically atypical. We then consider special types of disjunctive sequences by measuring the limiting frequency with which every finite word occurs. In contrast to how one defines a normal sequence by saying that these frequencies approach 2n2^{-n}, where nn is the length of the word, we define an extremely non-averageable, or ENA, sequence by declaring that these frequencies should fail to converge as badly as possible. We prove that ENA sequences are topologically typical in arbitrary countably infinite amenable cancellative semigroups, strengthening the previous result for such semigroups.

Finally, in Section 8 we give a topological-dynamical characterization of non-piecewise syndetic sets in arbitrary countably infinite semigroups by looking at their orbit closures. We conclude the paper by giving combinatorial applications of these dynamical results.

Acknowledgements

We would like to thank Dona Strauss for helpful conversations. Her input was instrumental in guiding us toward the proof of Theorem 38. We are also indebted to the anonymous referee for their numerous helpful comments and corrections. The latter two authors were partially supported by the Ohio State Arts and Sciences Undergraduate Research Scholarship.

2. Preliminaries

Throughout this paper, GG will stand for a countably infinite semigroup.

When we say that GG is a left cancellative semigroup, we mean that for any g,h1,h2Gg,h_{1},h_{2}\in G, if gh1=gh2gh_{1}=gh_{2} then h1=h2h_{1}=h_{2}; GG is right cancellative if instead h1g=h2gh_{1}g=h_{2}g implies h1=h2h_{1}=h_{2}; and GG is cancellative if it is both left and right cancellative.

Given gGg\in G, define gA={ga:aA}gA=\{ga\colon a\in A\} and g1A={xG:gxA}g^{-1}A=\{x\in G\colon gx\in A\}; the sets AgAg and Ag1Ag^{-1} are defined analogously. Also, given BGB\subseteq G, define

AB={ab:aA,bB},A1B=aAa1B,andAB1=bBAb1.AB=\{ab\colon a\in A,b\in B\},\quad A^{-1}B=\bigcup_{a\in A}a^{-1}B,\quad\text{and}\quad AB^{-1}=\bigcup_{b\in B}Ab^{-1}.

For any set XX, we will write 𝒫(X)\mathcal{P}(X) to denote the set of subsets of XX. We write YcY^{\mathrm{c}} for the complement of YY, when the superset is understood. In particular, for AGA\subseteq G, AcA^{\mathrm{c}} always denotes the complement relative to GG.

2.1. Thickness, syndeticity, and piecewise syndeticity

Let AGA\subseteq G.

  • AA is right thick if for any finite FGF\subseteq G there exists gGg\in G such that FgAFg\subseteq A.

  • AA is right syndetic if there exists some finite HGH\subseteq G such that G=H1AG=H^{-1}A.

  • AA is right piecewise syndetic if there is a finite HGH\subseteq G for which H1AH^{-1}A is right thick.

Since GG is not assumed to be commutative, there are also “left” versions of these definitions obtained by replacing FgFg with gFgF and H1AH^{-1}A with AH1AH^{-1}. We warn the reader that the left and right versions do not necessarily coincide in noncommutative (semi)groups.333Here are some examples to show that the left- and right-handed notions really are different. In F2F_{2}, the free group on aa and bb, it is easily seen that F2aF_{2}a is right thick but not left thick, and left syndetic but not right syndetic. For a less noncommutative example, consider the following subset of the Heisenberg group H3()H_{3}(\mathbb{Z}): A={(1xy01z001)H3():|y|z}.A={\left\{{\left(\begin{smallmatrix}1&x&y\\ 0&1&z\\ 0&0&1\end{smallmatrix}\right)}\in H_{3}(\mathbb{Z})\colon|y|\leq z\right\}}. It can be shown that AA is left thick but not right piecewise syndetic (and thus not right thick). The group H3()H_{3}(\mathbb{Z}) is “almost abelian”, having nilpotency class 22, so we see from this example that not much noncommutativity is needed to separate the right- and left-handed notions. See [8, Section 2] for many similar examples. Henceforth we will deal only with the right-handed definitions and omit the word “right”.

Note that, if (Fn)(F_{n}) is an increasing sequence of finite subsets of GG satisfying G=FnG=\bigcup F_{n}, then AGA\subseteq G is thick if and only if FngnA\bigcup F_{n}g_{n}\subseteq A for some sequence (gn)(g_{n}) in GG.

We now outline the relationship between thick, syndetic, and piecewise syndetic sets. First, the family of syndetic sets and the family of thick sets are dual, in the following sense.

Lemma 1.

Let AGA\subseteq G. Then AA is syndetic if and only if AcA^{\mathrm{c}} is not thick.

Proof.

Suppose AA is syndetic, so G=H1AG=H^{-1}A for some finite HGH\subseteq G. Let gGg\in G. Since gH1Ag\in H^{-1}A, there exists hHh\in H with hgAhg\in A. Hence HgAHg\cap A\neq\varnothing, i.e., HgAcHg\nsubseteq A^{\mathrm{c}}. This proves that AcA^{\mathrm{c}} is not thick.

Conversely, suppose AA is not syndetic, and let FGF\subseteq G be finite. Since F1AGF^{-1}A\neq G, find gGg\in G such that gF1Ag\notin F^{-1}A. Then FgAcFg\subseteq A^{\mathrm{c}}, so AcA^{\mathrm{c}} is thick. ∎

From this duality we obtain a useful equivalent description of piecewise syndetic sets, which we described in introduction for subsets of (,+)(\mathbb{N},+).

Lemma 2.

Let AGA\subseteq G. Then AA is piecewise syndetic if and only if there exist a syndetic set SGS\subseteq G and a thick set TGT\subseteq G such that A=STA=S\cap T.

Proof.

First suppose AA is piecewise syndetic, and find a finite HGH\subseteq G such that H1AH^{-1}A is thick. Let T=AH1AT=A\cup H^{-1}A, and let S=ATcS=A\cup T^{\mathrm{c}}, so that A=STA=S\cap T. Pick any xGx\in G; we will show that G=(Hx{x})1SG=(Hx\cup\{x\})^{-1}S. Indeed, let gGg\in G. If gx1Sg\notin x^{-1}S, then xgSc=AcTH1Axg\in S^{\mathrm{c}}=A^{\mathrm{c}}\cap T\subseteq H^{-1}A, so g(Hx)1Sg\in(Hx)^{-1}S.

Now suppose SGS\subseteq G is syndetic and TGT\subseteq G is thick. Fix a finite HGH\subseteq G satisfying G=H1SG=H^{-1}S. Let FGF\subseteq G be finite and pick gGg\in G satisfying HFgTHFg\subseteq T. By the syndeticity of SS, for any fFf\in F we can find hHh\in H such that fgh1Sfg\in h^{-1}S. We also know that hfgThfg\in T, so fgh1(ST)fg\in h^{-1}(S\cap T). This proves that FgH1(ST)Fg\subseteq H^{-1}(S\cap T), and hence that H1(ST)H^{-1}(S\cap T) is thick. ∎

2.2. Densities in semigroups

In this subsection, GG is assumed to be a countably infinite amenable cancellative semigroup. Given these assumptions, amenability is equivalent to the existence of a Følner sequence.444Amenability is commonly defined as follows. For each gGg\in G, define λg:GG\lambda_{g}\colon G\to G by λg(h)=gh\lambda_{g}(h)=gh. Then GG is called (left) amenable if there exists a positive normalized linear functional LL on the space B(G)B(G) of bounded functions GG\to\mathbb{R} which satisfies L(fλg)=L(f)L(f\circ\lambda_{g})=L(f) for each fB(G)f\in B(G); LL is called an invariant mean for GG. For countably infinite left cancellative semigroups, the existence of Følner sequences and amenability are equivalent [1]. See [32] for a comprehensive study of the notion of amenability. A (left) Følner sequence in GG is a sequence Φ=(Φn)\Phi=(\Phi_{n}) of nonempty finite subsets of GG satisfying

limn|ΦngΦn||Φn|=1,or equivalentlylimn|ΦngΦn||Φn|=0,\lim_{n\to\infty}\frac{|\Phi_{n}\cap g\Phi_{n}|}{|\Phi_{n}|}=1,\quad\text{or equivalently}\quad\lim_{n\to\infty}\frac{|\Phi_{n}\bigtriangleup g\Phi_{n}|}{|\Phi_{n}|}=0,

for any gGg\in G. The upper density of AGA\subseteq G with respect to Φ\Phi is defined as

d¯Φ(A)=lim supn|AΦn||Φn|.\overline{d}_{\Phi}(A)=\limsup_{n\to\infty}\frac{|A\cap\Phi_{n}|}{|\Phi_{n}|}.

When the corresponding limit exists this quantity is the density of AA with respect to Φ\Phi, denoted by dΦ(A)d_{\Phi}(A), and we also define the lower density d¯Φ(A)\underline{d}_{\Phi}(A) to be the corresponding lim inf\liminf. Note that d¯Φ(A)=1d¯Φ(Ac)\underline{d}_{\Phi}(A)=1-\overline{d}_{\Phi}(A^{\mathrm{c}}). Observe also that it follows quickly from the pigeonhole principle and the cancellativity of GG that Følner sequences satisfy |Φn||\Phi_{n}|\to\infty, guaranteeing a degree of non-degeneracy for these definitions.

A subset AGA\subseteq G will be called discordant if it is not piecewise syndetic and there is some Følner sequence Φ\Phi in GG such that d¯Φ(G)>0\overline{d}_{\Phi}(G)>0. We would like to also guarantee, when possible, the stronger condition d¯Φ(A)>0\underline{d}_{\Phi}(A)>0, or better, dΦ(A)>0d_{\Phi}(A)>0.

In the introduction, we defined density in (,+)(\mathbb{N},+) with respect to the Følner sequence ({1,,n})n(\{1,\dots,n\})_{n\in\mathbb{N}}. We will reserve dd, leaving the Følner sequence unspecified, to refer to this density (and similarly with d¯\overline{d} and d¯\underline{d}). A set AA\subseteq\mathbb{N} for which d(A)d(A) exists is sometimes said to have natural density. Closely related are densities defined with respect to the Følner sequences ({n,,n})(\{-n,\dots,n\}) in \mathbb{Z} and ({n,,n}d)(\{-n,\dots,n\}^{d}) in d\mathbb{Z}^{d}; we will also use dd to refer to these without danger of ambiguity.

Upper density has some useful properties, which we briefly describe here. The most primitive is that it is invariant under shifts. In other words, for any gGg\in G and AGA\subseteq G, one has d¯Φ(gA)=d¯Φ(g1A)=d¯Φ(A)\overline{d}_{\Phi}(gA)=\overline{d}_{\Phi}(g^{-1}A)=\overline{d}_{\Phi}(A). We record the following similar result as we will use it later on. (The proof of shift-invariance can be seen in the proof of Lemma 3.)

Lemma 3.

Suppose gGg\in G and AGA\subseteq G satisfy AgA=A\cap gA=\varnothing. Then d¯Φ(AgA)=2d¯Φ(A)\overline{d}_{\Phi}(A\cup gA)=2\,\overline{d}_{\Phi}(A).

Proof.

It is clear that d¯Φ\overline{d}_{\Phi} is subadditive. For the opposite inequality, let ε>0\varepsilon>0 and find nn\in\mathbb{N} such that

|AΦn||Φn|>d¯(A)ε3and|gΦnΦn||Φn|<ε3.\frac{|A\cap\Phi_{n}|}{|\Phi_{n}|}>\overline{d}(A)-\frac{\varepsilon}{3}\quad\text{and}\quad\frac{|g\Phi_{n}\setminus\Phi_{n}|}{|\Phi_{n}|}<\frac{\varepsilon}{3}.

Then

|(AgA)Φn||Φn||AΦn||Φn|+|gAgΦn||Φn||gΦnΦn||Φn|.\frac{|(A\cup gA)\cap\Phi_{n}|}{|\Phi_{n}|}\geq\frac{|A\cap\Phi_{n}|}{|\Phi_{n}|}+\frac{|gA\cap g\Phi_{n}|}{|\Phi_{n}|}-\frac{|g\Phi_{n}\setminus\Phi_{n}|}{|\Phi_{n}|}.

Now since |gAgΦn|=|g(AΦn)|=|AΦn||gA\cap g\Phi_{n}|=|g(A\cap\Phi_{n})|=|A\cap\Phi_{n}| by cancellativity, we get

|(AgA)Φn||Φn|>(d¯Φ(A)ε3)+(d¯Φ(A)ε3)ε3>2d¯Φ(A)ε.\frac{|(A\cup gA)\cap\Phi_{n}|}{|\Phi_{n}|}>{\left(\overline{d}_{\Phi}(A)-\frac{\varepsilon}{3}\right)}+{\left(\overline{d}_{\Phi}(A)-\frac{\varepsilon}{3}\right)}-\frac{\varepsilon}{3}>2\,\overline{d}_{\Phi}(A)-\varepsilon.\qed

Finally, let us remark that density is related to thickness and syndeticity in the following way: AA is thick if and only if d¯Φ(A)=1\overline{d}_{\Phi}(A)=1 for some Φ\Phi, and AA is syndetic if and only if d¯Φ(A)>0\overline{d}_{\Phi}(A)>0 for all Φ\Phi; see [8, Theorem 2.6]. More discussion of thickness, syndeticity, piecewise syndeticity, the various notions of density, and other definitions of Ramsey theory can be found in, among other places, [8, Sections 1 and 2].

2.3. Partition regularity and Straus’ construction

To begin this subsection we will prove two lemmas about piecewise syndeticity and partition regularity. The following lemma states that the family of piecewise syndetic subsets of any countably infinite semigroup is partition regular. A different proof of this fact appears in [27, Lemma 7.2]. Since, as we will see, non-piecewise syndetic sets can be seen as discrete analogues of nowhere dense sets, Theorem 4 is a sort of discrete analogue to the Baire category theorem on unions of nowhere dense sets. We will not use Theorem 4 later on, but include it for completeness and to support the discussion given in the introduction and the analogy we are hoping to emphasize. A different proof, using topological dynamics, will be presented in the final section (Theorem 48).

Theorem 4.

Suppose AGA\subseteq G is piecewise syndetic. Then for any finite partition A=i=1rCiA=\bigcup_{i=1}^{r}C_{i} into disjoint sets, there exists ii for which CiC_{i} is piecewise syndetic.

Proof.

We will prove this for r=2r=2, and the claim will follow by induction. By Lemma 2, we may write A=STA=S\cap T for a syndetic set SGS\subseteq G and thick TGT\subseteq G. Let S=C2cSS^{\prime}=C_{2}^{\mathrm{c}}\cap S, so we get C1=STC_{1}=S^{\prime}\cap T (since C1C_{1} and C2C_{2} are disjoint) and C2=S(S)cC_{2}=S\cap(S^{\prime})^{\mathrm{c}}. Since either SS^{\prime} is syndetic or (S)c(S^{\prime})^{\mathrm{c}} is thick by Lemma 1, one of C1C_{1} and C2C_{2} must be piecewise syndetic.555Note that this proof does not use the definition of piecewise syndeticity, but only that the family of piecewise syndetic sets is the family of intersections of two dual families (“dual” in the sense of Lemma 1). In other words, we have shown that if 𝒜,𝒫(G)\mathcal{A},\mathcal{B}\subseteq\mathcal{P}(G) are any two dual families of subsets of GG that are upward closed, then the family {AB:A𝒜 and B}\{A\cap B\colon\text{$A\in\mathcal{A}$ and $B\in\mathcal{B}$}\} is guaranteed to be partition regular.

The following easy but important lemma demonstrates another connection between piecewise syndeticity and partition regularity. We record it as the general phenomenon behind why Straus’ construction produces discordant sets. We will apply this lemma by starting with a family 𝒫(G)\mathcal{B}\subseteq\mathcal{P}(G) and extending it to a GG-invariant family 𝒜={gB:B and gG}\mathcal{A}=\{gB\colon B\in\text{$\mathcal{B}$ and $g\in G$}\}.

Lemma 5.

Suppose 𝒜𝒫(G)\mathcal{A}\subseteq\mathcal{P}(G) satisfies the following:

  1. (a)

    𝒜\mathcal{A} contains all thick subsets of GG.

  2. (b)

    𝒜\mathcal{A} is partition regular.

  3. (c)

    if BAGB\subseteq A\subseteq G and B𝒜B\in\mathcal{A}, then A𝒜A\in\mathcal{A}.

  4. (d)

    if A𝒜A\in\mathcal{A} and gGg\in G then gA𝒜gA\in\mathcal{A}.

Then 𝒜\mathcal{A} contains all piecewise syndetic subsets of GG.

Proof.

Let AGA\subseteq G be piecewise syndetic and find a finite set HGH\subseteq G such that H1AH^{-1}A is thick. Then H1A=hHh1AH^{-1}A=\bigcup_{h\in H}h^{-1}A, so by (a) and (b), h1A𝒜h^{-1}A\in\mathcal{A} for some hHh\in H. Thus h(h1A)𝒜h(h^{-1}A)\in\mathcal{A}, so finally A𝒜A\in\mathcal{A} since h(h1A)Ah(h^{-1}A)\subseteq A. ∎

With this lemma, we give a simplified version of Straus’ construction to obtain a set with positive lower density, no shift of which is IP.

Theorem 6.

Let (an)(a_{n}) be a sequence in \mathbb{N} satisfying 1/an<1\sum 1/a_{n}<1. Then the following set is discordant:

A=n(an+n1).A=\mathbb{N}\setminus\bigcup_{n\in\mathbb{N}}(a_{n}\mathbb{N}+n-1).
Proof.

We leave it as an exercise to verify that every thick subset of \mathbb{N} is IP; thus, by Hindman’s theorem and Lemma 5, it suffices to show that AA is not a shift of an IP set (that is, a set of the form B+tB+t for some IP set BB\subseteq\mathbb{N} and tt\in\mathbb{N}). This follows immediately from the construction of AA and the observation that if BB\subseteq\mathbb{N} is IP, then BkB\cap k\mathbb{N}\neq\varnothing for all kk\in\mathbb{N}.

To see that d¯(A)>0\underline{d}(A)>0, we compute that

|A{1,,k}|kk(k/a1+k/a2++k/an)k>1n1an\frac{|A\cap\{1,\dots,k\}|}{k}\geq\frac{k-(k/a_{1}+k/a_{2}+\cdots+k/a_{n})}{k}>1-\sum_{n\in\mathbb{N}}\frac{1}{a_{n}}

for each k<an+1k<a_{n+1}, since there are no more than k/ak/a multiples of aa in {1,,k}\{1,\dots,k\}. ∎

For conciseness, we excuse ourselves from proving that d(A)d(A) exists, where AA is the set defined in Theorem 6. The reader is encouraged to look toward the sources we have mentioned or, alternatively, Theorem 23 to see how this could be done. One may also alter the set described in Theorem 6 to obtain a discordant set without invoking Hindman’s theorem as follows. Pick (an)(a_{n}) satisfying n/an<1\sum n/a_{n}<1 and set A=(an+{0,,n1})A=\mathbb{N}\setminus\bigcup(a_{n}\mathbb{N}+\{0,\dots,n-1\}). One shows similarly that d¯(A)>0\underline{d}(A)>0, and a totally self-contained proof that AA is not piecewise syndetic is not hard.

Note that Theorem 6 gives us discordant sets with lower densities arbitrarily close to 11. (A discordant set cannot have upper density equal to 11, since any set of upper density 11 is thick.) By generalizing the notion of an IP set, a similar construction can be done in many different groups. Discussion of this construction and a complete characterization of such groups is given in [18, Chapter 4] and [7].

2.4. Preliminaries from ergodic theory and dynamics

In this section we review some basic notions and theorems of ergodic theory and dynamics. This section is utilitarian, and hence we do not venture past what we will need. For this reason we omit proofs or provide a rudimentary outline and refer the reader elsewhere.

Let (X,,μ)(X,\mathcal{B},\mu) be a probability space. A measurable map T:XXT\colon X\to X is measure preserving if μ(T1E)=μ(E)\mu(T^{-1}E)=\mu(E) for each EE\in\mathcal{B}. If GG is a countably infinite semigroup, a measure-preserving action of GG on (X,,μ)(X,\mathcal{B},\mu) is an action of GG on XX such that xgxx\mapsto gx is measure-preserving for each gGg\in G. In this case, we say that μ\mu is GG-invariant. We refer to (X,,μ,T)(X,\mathcal{B},\mu,T) and (X,,μ,G)(X,\mathcal{B},\mu,G) as measure-preserving systems.

Given a measure-preserving system, one has the following version of the von Neumann ergodic theorem.

Theorem 7.

Suppose GG is a countably infinite amenable cancellative semigroup with a Følner sequence (Φn)(\Phi_{n}) and let (X,,μ,G)(X,\mathcal{B},\mu,G) be a measure-preserving system. Let fL2(X)f\in L^{2}(X), and let ff^{*} be the orthogonal projection of ff onto the subspace of L2(X)L^{2}(X) consisting of GG-invariant functions. Then

limn1|Φn|gΦnf(gx)=f(x)in L2(X).\lim_{n\to\infty}\frac{1}{|\Phi_{n}|}\sum_{g\in\Phi_{n}}f(gx)=f^{*}(x)\quad\text{in $L^{2}(X)$.}
Proof.

See [36, Theorem 1.14.1] for a proof in the case G=G=\mathbb{Z}. The proof for more general GG is analogous; see [3, Theorem 4.15]. ∎

With additional constraints on the action of GG, more can be said. The system (X,,μ,G)(X,\mathcal{B},\mu,G) is called ergodic if there are no nontrivial GG-invariant subsets, that is, if EE\in\mathcal{B} satisfies μ(Eg1E)=0\mu(E\bigtriangleup g^{-1}E)=0 for all gGg\in G, then μ(E){0,1}\mu(E)\in\{0,1\}. In this case, the orthogonal projection of ff onto the subspace of GG-invariant functions is Xf𝑑μ\int_{X}f\,d\mu, so Theorem 7 states that

limn1|Φn|gΦnf(gx)=Xf𝑑μin L2(X).\lim_{n\to\infty}\frac{1}{|\Phi_{n}|}\sum_{g\in\Phi_{n}}f(gx)=\int_{X}f\,d\mu\quad\text{in $L^{2}(X)$.}

If XX is a compact topological space on which GG acts by continuous maps, the pair (X,G)(X,G) is a (topological) dynamical system. If EXE\subseteq X and xXx\in X, we define RE(x)={gG:gxE}R_{E}(x)=\{g\in G\colon gx\in E\}. When GG is \mathbb{Z} or \mathbb{R}, RE(x)R_{E}(x) can be thought of as the set of visiting times of xx to EE. The system (X,G)(X,G) is minimal if there is no proper nonempty closed subset CXC\subseteq X invariant under the action of GG (in the sense that GCCGC\subseteq C). Equivalently, the action of GG on XX is minimal if, for each xGx\in G, its orbit {gx:gG}\{gx\colon g\in G\} is dense.

When XX is a compact metric space with a Borel σ\sigma-algebra \mathcal{B}, we say that (X,G)(X,G) is uniquely ergodic if there is exactly one GG-invariant Borel probability measure on XX, i.e., there exists exactly one normalized measure μ\mu on \mathcal{B} which makes (X,,μ,G)(X,\mathcal{B},\mu,G) into a measure-preserving system. Under the condition of unique ergodicity, one has the following convenient pointwise ergodic theorem.

Theorem 8.

Let (X,G)(X,G) be a uniquely ergodic topological dynamical system where XX is a compact metric space and GG is a countably infinite amenable cancellative semigroup. Assume μ\mu is the unique GG-invariant Borel probability measure and let (Φn)(\Phi_{n}) be a Følner sequence in GG. Then, for each measurable f:Xf\colon X\to\mathbb{R} whose points of discontinuity are contained in a set of measure zero and each xXx\in X,

limn1|Φn|gΦnf(gx)=f𝑑μ.\lim_{n\to\infty}\frac{1}{|\Phi_{n}|}\sum_{g\in\Phi_{n}}f(gx)=\int f\,d\mu.
Proof.

For continuous ff, this can be proved in complete analogy with the well-known proof of the case G=G=\mathbb{Z}; see [36, Theorem 6.19]. For more general ff, the statement follows by a standard approximation argument. ∎

A fact we will find useful is that a minimal action by isometries on a compact metric space is automatically uniquely ergodic.

2.5. Cantor spaces

The following construction will be very useful. Given any sequence (Xn)n(X_{n})_{n\in\mathbb{N}} of finite sets having the discrete topology, their topological product X=XnX=\prod X_{n} is a compact space homeomorphic to the classical Cantor set. We will use this construction frequently, and we will call such a space a Cantor space. To make the notation less cluttered and easier to read, we will use boldface letters for elements of a Cantor space, e.g., 𝐱=(xn)X\mathbf{x}=(x_{n})\in X. (This is also consistent with our use of 𝟏\mathbf{1} for the indicator function.)

Given nn\in\mathbb{N} and yiXiy_{i}\in X_{i} for i{1,,n}i\in\{1,\dots,n\}, we define the set

V(y1,,yn)={𝐱X:xi=yi for each i{1,,n}}.V(y_{1},\dots,y_{n})=\{\mathbf{x}\in X\colon x_{i}=y_{i}\text{ for each }i\in\{1,\dots,n\}\}.

The collection of all sets having this form constitute an open base for the topology on XX.

It will be convenient to also give Cantor spaces a metric space structure. Giving each XnX_{n} the discrete metric δn\delta_{n}, the space XX can be made into a compact metric space via the product metric δ\delta given by

(1) δ(𝐱,𝐲)=n2nδn(xn,yn).\delta(\mathbf{x},\mathbf{y})=\sum_{n\in\mathbb{N}}2^{-n}\delta_{n}(x_{n},y_{n}).

Assuming 𝐱𝐲\mathbf{x}\neq\mathbf{y}, let nn\in\mathbb{N} be the smallest integer satisfying xn=ynx_{n}=y_{n} and xn+1yn+1x_{n+1}\neq y_{n+1} (take n=0n=0 if x1y1x_{1}\neq y_{1}). Then δ\delta is equivalent to the more common and conceptually simpler metric δ\delta^{\prime} defined by δ(𝐱,𝐲)=1/(n+1)\delta^{\prime}(\mathbf{x},\mathbf{y})=1/(n+1). Observe that, given a collection of bijections fn:XnXnf_{n}\colon X_{n}\to X_{n} for each nn\in\mathbb{N}, the induced map f:XXf\colon X\to X is an isometry of XX.

There is one more construction that we describe here. Analogously to the natural topological and metric structures on XX, we can endow each XnX_{n} with the normalized counting measure. The resulting (uniform) product measure μ\mu is a probability measure on the Borel σ\sigma-algebra \mathcal{B} of XX satisfying μ(V(y1,,yn))=1/(|X1||Xn|)\mu(V(y_{1},\dots,y_{n}))=1/(|X_{1}|\cdots|X_{n}|). Clearly, μ\mu is invariant under the isometries ff described in the previous paragraph.

3. Discordant sets from visiting times

As foreshadowed by the introduction, discordant sets naturally arise from dynamical systems, one of the most familiar being irrational rotation of the one-dimensional torus 𝕋=/\mathbb{T}=\mathbb{R}/\mathbb{Z}. Our search for discordant sets is motivated by the fact that if α𝕋\alpha\in\mathbb{T} is irrational and U𝕋U\subseteq\mathbb{T} has nonempty interior, then {n:nαU}\{n\in\mathbb{Z}\colon n\alpha\in U\} is syndetic. The punchline of this direction of reasoning is that if E𝕋E\subseteq\mathbb{T} is nowhere dense and has positive measure, then for almost every x𝕋x\in\mathbb{T}, the set of visiting times RE(x)={n:x+nαE}R_{E}(x)=\{n\in\mathbb{Z}\colon x+n\alpha\in E\} is discordant. Here RE(x)R_{E}(x) is non-piecewise syndetic for any x𝕋x\in\mathbb{T} by Lemma 9 below, whereas the positive upper density of RE(x)R_{E}(x) is guaranteed only for almost every xx by an application of the von Neumann ergodic theorem.666In fact, one can easily see that RE(x)R_{E}(x) has positive natural density for almost every xx by using the pointwise ergodic theorem. Theorem 10 is a generalization of this construction which yields many discordant sets in many settings, and is the core of our analogy between discordant sets and nowhere dense sets of positive measure.

Let us note that the restriction “almost every xx” is a significant and unavoidable constraint. One can construct a closed nowhere dense set E𝕋E\subseteq\mathbb{T} of positive measure for which {n:x+nαE}\{n\in\mathbb{Z}\colon x+n\alpha\in E\} is empty for infinitely many x𝕋x\in\mathbb{T}.

The importance of the irrationality of α\alpha is that (𝕋,xx+α)(\mathbb{T},x\mapsto x+\alpha) is minimal if and only if α\alpha is irrational. In our more general setting we replace this system with a minimal action of a semigroup GG on a compact space XX.

In this setting, we wish to connect properties of RE(x)={gG:gxE}R_{E}(x)=\{g\in G\colon gx\in E\} as a subset of GG with the properties of EE as a subset of XX. Lemma 9 connects the piecewise syndeticity of RE(x)R_{E}(x) with the topological properties of EE and Theorem 10 connects the density of RE(x)R_{E}(x) with the measure of EE, allowing us to construct discordant sets.

In analogy with our notation for subsets of GG, we write A1E=aAa1EA^{-1}E=\bigcup_{a\in A}a^{-1}E for AGA\subseteq G and EXE\subseteq X.

Lemma 9.

Let (X,G)(X,G) be a minimal topological dynamical system, where GG is a countably infinite cancellative semigroup. Then, for any nowhere dense EXE\subseteq X and xXx\in X, the set RE(x)R_{E}(x) is not piecewise syndetic.

Proof.

It is a classical fact that (X,G)(X,G) is minimal if and only if RV(x)R_{V}(x) is syndetic for each nonempty open VXV\subseteq X and xXx\in X; see, e.g., [22, Theorem 1.15]. We reproduce the argument here for the convenience of the reader. The “if” direction is evident, so suppose VXV\subseteq X is open and nonempty and xXx\in X. Let Y=G1VY=G^{-1}V; we claim that Y=XY=X. Indeed, if yXYy\in X\setminus Y, then gyVgy\notin V for all gg, so yy cannot have dense orbit, contradicting the minimality of (X,G)(X,G). By compactness, there is a finite set HGH\subseteq G such that X=H1VX=H^{-1}V. Now fix gGg\in G and find hHh\in H such that gxh1Vgx\in h^{-1}V. Then hgRV(x)hg\in R_{V}(x), so G=H1RV(x)G=H^{-1}R_{V}(x), proving that RV(x)R_{V}(x) is syndetic.

Now let EXE\subseteq X be nowhere dense and let FGF\subseteq G be finite. Then F1RE(x)=RF1E(x)F^{-1}R_{E}(x)=R_{F^{-1}E}(x), and F1EF^{-1}E is nowhere dense. Let VV be an open set lying in XF1EX\setminus F^{-1}E. Then F1RE(x)F^{-1}R_{E}(x) is disjoint from the syndetic set RV(x)R_{V}(x), and so cannot be thick. Since FF was arbitrary, RE(x)R_{E}(x) cannot be piecewise syndetic. ∎

Suppose (X,G)(X,G) is a topological dynamical system. If GG is amenable, there always exists a GG-invariant measure μ\mu on XX by a version of the Bogolyubov–Krylov theorem (see [36, Theorem 6.9] for a proof of the case G=G=\mathbb{Z}; the theorem for general GG is proved analogously). Moreover, μ\mu can taken to be ergodic by invoking the so-called ergodic decomposition.

Theorem 10.

Let (X,G)(X,G) be a minimal topological dynamical system, where GG is a countably infinite amenable cancellative semigroup with a Følner sequence Φ=(Φn)\Phi=(\Phi_{n}). Let μ\mu be a GG-invariant probability measure on the Borel σ\sigma-algebra \mathcal{B} of XX. If EXE\subseteq X is nowhere dense with positive measure, then there exists zXz\in X such that RE(z)R_{E}(z) is discordant. If, in addition, (X,,μ,G)(X,\mathcal{B},\mu,G) is ergodic, d¯Φ(RE(z))μ(E)\overline{d}_{\Phi}(R_{E}(z))\geq\mu(E) for μ\mu-almost every zXz\in X.

Proof.

Let f=𝟏Ef=\mathbf{1}_{E}. By Theorem 7,

limn1|Φn|gΦnf(gx)=f(x),\lim_{n\to\infty}\frac{1}{|\Phi_{n}|}\sum_{g\in\Phi_{n}}f(gx)=f^{*}(x),

where ff^{*} is the orthogonal projection of ff onto the space of GG-invariant functions in L2(X,μ)L^{2}(X,\mu), and the convergence is in L2L^{2}. Extract a subsequence Φ=(Φnk)k\Phi^{\prime}=(\Phi_{n_{k}})_{k\in\mathbb{N}} such that limk|Φnk|1gΦnkf(gx)=f(x)\lim_{k\to\infty}|\Phi_{n_{k}}|^{-1}\sum_{g\in\Phi_{n_{k}}}f(gx)=f^{*}(x) almost everywhere. For each kk,

1|Φnk|gΦnkf(gx)dμ=1|Φnk|gΦnkf(gx)𝑑μ=1|Φnk|gΦnkμ(E)=μ(E).\int\frac{1}{|\Phi_{n_{k}}|}\sum_{g\in\Phi_{n_{k}}}f(gx)\,d\mu=\frac{1}{|\Phi_{n_{k}}|}\sum_{g\in\Phi_{n_{k}}}\int f(gx)\,d\mu=\frac{1}{|\Phi_{n_{k}}|}\sum_{g\in\Phi_{n_{k}}}\mu(E)=\mu(E).

Hence, by the dominated convergence theorem, f𝑑μ=μ(E)\int f^{*}\,d\mu=\mu(E). Thus, there is a set of positive measure on which f>0f^{*}>0. Choose zz such that f(z)>0f^{*}(z)>0 and

dΦ(RE(z))=limk1|Φnk|gΦnkf(gz)=f(z).d_{\Phi^{\prime}}(R_{E}(z))=\lim_{k\to\infty}\frac{1}{|\Phi_{n_{k}}|}\sum_{g\in\Phi_{n_{k}}}f(gz)=f^{*}(z).

Then d¯Φ(RE(z))dΦ(RE(z))=f(z)>0\overline{d}_{\Phi}(R_{E}(z))\geq d_{\Phi^{\prime}}(R_{E}(z))=f^{*}(z)>0, and, since (X,G)(X,G) is minimal, RE(z)R_{E}(z) is not piecewise syndetic by Lemma 9.

In the presence of ergodicity, ff^{*} is the constant function f𝑑μ=μ(E)\int f\,d\mu=\mu(E), so we may take zz to be any element for which limk|Φnk|1gΦnkf(gz)=f(z)\lim_{k\to\infty}|\Phi_{n_{k}}|^{-1}\sum_{g\in\Phi_{n_{k}}}f(gz)=f^{*}(z). As we noted, such zz form a subset of XX of full measure. ∎

It is worth noting that, in the setting of Theorem 10, XX possesses a large supply of nowhere dense sets of positive measure if it is an infinite metric space.

Theorem 11.

Let (X,G)(X,G) and μ\mu be as in Theorem 10, and assume that XX is a metric space which is not finite.777 It is evident from the proof of Theorem 11 that the condition “metric space” can be weakened slightly, at the cost of more cumbersome hypotheses. Then for any c(0,1)c\in(0,1), there exists a nowhere dense set EXE\subseteq X with μ(E)=c\mu(E)=c.

Proof.

We first claim that μ\mu is a non-atomic measure. Let xXx\in X. Then μ({gx})=μ(g1{gx})μ({x})\mu(\{gx\})=\mu(g^{-1}\{gx\})\geq\mu(\{x\}) for any gGg\in G. Since (X,G)(X,G) is minimal, {gx:gG}\{gx\colon g\in G\} is dense in XX, and is in particular infinite. It follows that μ({x})=0\mu(\{x\})=0 since μ\mu is a probability measure.

Now let {yn}nX\{y_{n}\}_{n\in\mathbb{N}}\subseteq X be a countable dense subset of XX. Since μ\mu is non-atomic, μ({yn})=0\mu(\{y_{n}\})=0 for all nn\in\mathbb{N}. Hence by the continuity of μ\mu from above, we can find an open neighborhood UnXU_{n}\subseteq X of yny_{n} with μ(Un)<(1c)/2n\mu(U_{n})<(1-c)/2^{n}. Set E0=XUnE_{0}=X\setminus\bigcup U_{n}, so that E0E_{0} is a nowhere dense subset of XX with μ(E0)>c\mu(E_{0})>c. The claim now follows from a theorem of Sierpiński [Si] which states that in any non-atomic measure space (Y,𝒩,ν)(Y,\mathcal{N},\nu) with ν(Y)<\nu(Y)<\infty, the measure ν\nu takes every value in the interval [0,ν(Y)][0,\nu(Y)]. ∎

4. Discordant sets and generalizations of squarefree numbers

It is known that the set of squarefree numbers in \mathbb{N} is discordant, and in this section we describe a wide-ranging generalization of this fact and examine in detail some special cases.

4.1. A characterization and a construction

We will prove non-piecewise syndeticity in two ways; the first uses a characterization (Lemma 12) of non-piecewise syndetic sets, and the second uses Lemma 9. Ramsey theory aficionados will notice that Lemma 12 can be rephrased as a characterization of sets satisfying the dual property, in the sense of Lemma 1, to piecewise syndeticity, often denoted by PS.

Lemma 12.

Let GG be a countably infinite semigroup and let AGA\subseteq G. Then AA is not piecewise syndetic if and only if, for any finite HGH\subseteq G, there exists a syndetic SGS\subseteq G such that HSA=HS\cap A=\varnothing.

Proof.

Let HGH\subseteq G be finite and consider the set BH={gG:HgA=}B_{H}=\{g\in G\colon Hg\cap A=\varnothing\}. Observe that BHc=H1AB_{H}^{\mathrm{c}}=H^{-1}A, since gBHcg\in B_{H}^{\mathrm{c}} if and only if hgAhg\in A for some hHh\in H. Thus, by Lemma 1, BHB_{H} is syndetic if and only if H1AH^{-1}A is not thick. In particular, BHB_{H} is syndetic for all finite HGH\subseteq G if and only if AA is not piecewise syndetic. ∎

Before constructing non-piecewise syndetic sets using Lemma 12, we first recall a general group-theoretic form of the Chinese remainder theorem.888A generalization of this form of the Chinese remainder theorem is described in the StackExchange discussion SE2954401, although we will not need this generality.

Lemma 13 (Chinese remainder theorem).

Let GG be a group with normal subgroups N1,,NnN_{1},\dots,N_{n}, where n2n\geq 2 and |G/N1|,,|G/Nn||G/N_{1}|,\dots,|G/N_{n}| are pairwise coprime. Define N=i=1nNiN=\bigcap_{i=1}^{n}N_{i} and φ:G/Ni=1nG/Ni\varphi\colon G/N\to\prod_{i=1}^{n}G/N_{i} by φ(gN)=(gN1,,gNn)\varphi(gN)=(gN_{1},\dots,gN_{n}). Then φ\varphi is an isomorphism.

Proof.

It is immediate from the first isomorphism theorem that φ\varphi is a well-defined injective homomorphism. So, it suffices to show that |G/N|i=1n|G/Ni||G/N|\geq\prod_{i=1}^{n}|G/N_{i}|. Indeed, for each ii, |G/Ni||G/N_{i}| divides |G/N||G/N| since |G/N|=|G/Ni||Ni/N||G/N|=|G/N_{i}|\cdot|N_{i}/N|. Thus the inequality holds since |G/N1|,,|G/Nn||G/N_{1}|,\dots,|G/N_{n}| are pairwise coprime. ∎

We now combine the Chinese remainder theorem with Lemmas 12 and 9 to obtain an abundance of non-piecewise syndetic sets. Although Lemma 12 shows that we can remove “thicker and thicker” syndetic sets to produce non-piecewise syndetic sets, Theorem 14 shows that in special situations (when the syndetic subsets are suitably “arithmetically/algebraically independent”, as the Chinese remainder theorem will guarantee) this is not necessary.

Theorem 14.

Let GG be a countably infinite group and (Nn)(N_{n}) be a sequence of normal finite-index subgroups of GG such that for any n2n\geq 2, the subgroups N1,,NnN_{1},\dots,N_{n} satisfy the hypotheses of Lemma 13. Then, for any sequence (gn)(g_{n}) in GG, the set A=GgnNnA=G\setminus\bigcup g_{n}N_{n} is not piecewise syndetic.

In principle, the set AA may be empty. Below we will see many examples where this is not the case. Notice that we can equivalently state the conclusion of Theorem 14 as “every piecewise syndetic subset of GG contains an element (in fact, infinitely many elements) of gnNn\bigcup g_{n}N_{n}”. We will give two proofs of this fact.

Proof 1.

Fix F={f1,,fn}GF=\{f_{1},\dots,f_{n}\}\subseteq G. Let N=i=1nNiN=\bigcap_{i=1}^{n}N_{i}, and let φ\varphi be the isomorphism guaranteed by the Chinese remainder theorem. Then, setting x=φ1(f11g1N1,,fn1gnNn)x=\varphi^{-1}(f_{1}^{-1}g_{1}N_{1},\dots,f_{n}^{-1}g_{n}N_{n}), we have fixgiNif_{i}x\in g_{i}N_{i} for each i{1,,n}i\in\{1,\dots,n\}, so F(xN)A=F(xN)\cap A=\varnothing. Since the subgroups NiN_{i} have finite index, NN is syndetic, so Lemma 12 completes the proof. ∎

Our second proof of Theorem 14, though longer, uses Lemma 9 and strengthens our analogy between discordant sets and nowhere dense sets of positive measure. In the remainder of this subsection we will use the definitions made in Subsections 2.4 and 2.5.

Proof 2.

Consider the Cantor space X=G/NnX=\prod G/N_{n}. Given h1,,hnGh_{1},\dots,h_{n}\in G, define

(2) V(h1,,hn)={𝐱X:xiNi=hiNi for each i{1,,n}}.V(h_{1},\dots,h_{n})=\{\mathbf{x}\in X\colon\text{$x_{i}N_{i}=h_{i}N_{i}$ for each $i\in\{1,\dots,n\}$}\}.

Sets having the form given in (2) constitute an open base for the topology on XX.

Let GG act by componentwise multiplication on XX, i.e., g𝐱=(gxnNn)ng\mathbf{x}=(gx_{n}N_{n})_{n\in\mathbb{N}}. The Chinese remainder theorem guarantees that this action is minimal. To see this, let 𝐱X\mathbf{x}\in X and h1,,hnGh_{1},\dots,h_{n}\in G be given. By the Chinese remainder theorem, find hGh\in G so that (hN1,,hNn)=(h1x11N1,,hnxn1Nn)(hN_{1},\dots,hN_{n})=(h_{1}x_{1}^{-1}N_{1},\dots,h_{n}x_{n}^{-1}N_{n}). Then h𝐱V(h1,,hn)h\mathbf{x}\in V(h_{1},\dots,h_{n}) since hxiNi=hNixi=hixi1Nixi=hiNihx_{i}N_{i}=hN_{i}x_{i}=h_{i}x_{i}^{-1}N_{i}x_{i}=h_{i}N_{i}. This proves that 𝐱\mathbf{x} has dense orbit.

To apply Lemma 9, we simply observe that A=RE((gn1Nn)n)A=R_{E}((g_{n}^{-1}N_{n})_{n\in\mathbb{N}}), where

(3) E={𝐱X:xnNnNn for each n}.E=\{\mathbf{x}\in X\colon x_{n}N_{n}\neq N_{n}\text{ for each }n\in\mathbb{N}\}.

The set EE is nowhere dense, since it is closed and is not a superset of any set given in (2). ∎

As one might expect from this second proof, we can obtain a density result for Theorem 14 by applying Theorem 10.

Theorem 15.

Let GG, XX, and (Nn)(N_{n}) be as in Proof 2 of Theorem 14, and set cn=|G/Nn|c_{n}=|G/N_{n}| for each nn\in\mathbb{N}. Suppose GG is amenable, possessing a Følner sequence Φ\Phi. Let μ\mu be the probability measure on the Borel σ\sigma-algebra \mathcal{B} of XX satisfying μ(V(h1,,hn))=1/(c1cn)\mu(V(h_{1},\dots,h_{n}))=1/(c_{1}\cdots c_{n}). Finally, given 𝐱X\mathbf{x}\in X, set A𝐱=GxnNnA_{\mathbf{x}}=G\setminus\bigcup x_{n}N_{n}. Then, for μ\mu-almost every 𝐱X\mathbf{x}\in X,

(4) d¯Φ(A𝐱)=n(11cn).\overline{d}_{\Phi}(A_{\mathbf{x}})=\prod_{n\in\mathbb{N}}{\left(1-\frac{1}{c_{n}}\right)}.
Proof.

Since GG acts minimally on XX by isometries, the system (X,,μ,G)(X,\mathcal{B},\mu,G) is uniquely ergodic, and in particular ergodic. Let EXE\subseteq X be as in (3), so that A𝐱=RE((xn1Nn)n)A_{\mathbf{x}}=R_{E}((x_{n}^{-1}N_{n})_{n\in\mathbb{N}}). Then by Theorem 10, we have, for μ\mu-almost every 𝐱X\mathbf{x}\in X,

d¯Φ(A𝐱)=μ(E)=n(11cn).\overline{d}_{\Phi}(A_{\mathbf{x}})=\mu(E)=\prod_{n\in\mathbb{N}}{\left(1-\frac{1}{c_{n}}\right)}.

(That μ(E)=(11/cn)\mu(E)=\prod(1-1/c_{n}) can be easily checked since EE is closed.) ∎

In practice, Theorem 15 will not be useful for constructing explicit discordant sets as we cannot easily control for which 𝐱\mathbf{x} equation (4) holds.

4.2. Some number-theoretic examples of discordant sets

In this subsection we apply Theorem 14 to construct some discordant sets. In the following subsection we provide proofs of positive density for all but one of the constructions. The most primitive example is the following.

Example 16.

The set QQ of squarefree integers is not piecewise syndetic:

Q=p primep2.Q=\mathbb{Z}\setminus\bigcup_{\text{$p$ prime}}p^{2}\mathbb{Z}.

It is classical that d(Q)=6/π2d(Q)=6/\pi^{2}, so that QQ is discordant.

Example 16 can be generalized a great deal. We provide five generalizations below.

Example 17.

Let =(bn)\mathscr{B}=(b_{n}) be a sequence of pairwise coprime positive integers and let \mathcal{F}_{\mathscr{B}} denote the set of \mathscr{B}-free integers

=nbn,\mathcal{F}_{\mathscr{B}}=\mathbb{Z}\setminus\bigcup_{n\in\mathbb{N}}b_{n}\mathbb{Z},

consisting of integers not divisible by any element of \mathscr{B}. Then \mathcal{F}_{\mathscr{B}} is not piecewise syndetic, and if 1/bn<\sum 1/b_{n}<\infty, then d()=(11/bn)d(\mathcal{F}_{\mathscr{B}})=\prod(1-1/b_{n}) [25, Theorem 9, Section V.5]. We provide a proof of this fact below (Theorem 23). A well-known but notable special case is that the set of kk-free integers, those divisible by no perfect kkth power, has density 1/ζ(k)1/\zeta(k).

Example 18.

Let KK be a finite extension of \mathbb{Q}, let k2k\geq 2, and let QK,kQ_{K,k} denote the kk-free integers KK:

QK,k=𝒪K𝔭𝒪Kprime ideal𝔭k,Q_{K,k}=\mathcal{O}_{K}\setminus\bigcup_{\begin{subarray}{c}\mathfrak{p}\subseteq\mathcal{O}_{K}\\ \text{prime ideal}\end{subarray}}\mathfrak{p}^{k},

where 𝒪K\mathcal{O}_{K} denotes the ring of integers in KK. Using an appropriate ring-theoretic version of the Chinese remainder theorem, we may apply Theorem 14 to show that QK,kQ_{K,k} is not piecewise syndetic. Moreover, as K/K/\mathbb{Q} is a finite extension, 𝒪Kd\mathcal{O}_{K}\cong\mathbb{Z}^{d} as additive groups, where dd is the degree of K/K/\mathbb{Q}. Define Φ\Phi to be a sequence of balls (pulled back from d\mathbb{Z}^{d} to 𝒪K\mathcal{O}_{K} under this isomorphism), with respect to the L1L^{1} norm, centered at 0 and having increasing radius. Then dΦ(QK,k)=1/ζK(k)d_{\Phi}(Q_{K,k})=1/\zeta_{K}(k) as shown in [17, Corollary 4.6], where ζK\zeta_{K} denotes the Dedekind zeta function of KK. Equivalently, dΦ(QK,k)=𝔭(11/N(𝔭)k)d_{\Phi}(Q_{K,k})=\prod_{\mathfrak{p}}(1-1/N(\mathfrak{p})^{k}), where N(𝔭)N(\mathfrak{p}) denotes the absolute norm of 𝔭\mathfrak{p}, i.e., the cardinality of the residue field 𝒪K/𝔭\mathcal{O}_{K}/\mathfrak{p}. Hence QK,kQ_{K,k} is discordant.

Example 19.

Given kk\in\mathbb{N} and a prime pp, write ep(k)e_{p}(k) to denote the exponent on pp in the prime factorization of kk. In [4, Theorem 3.9], it is shown in a somewhat cumbersome way that, for any infinite set PP of primes and any corresponding collection of nonempty subsets (Yp)pP(Y_{p})_{p\in P} of \mathbb{N}, the set

A={k:ep(k)Yp for each pP}A=\{k\in\mathbb{N}\colon\text{$e_{p}(k)\notin Y_{p}$ for each $p\in P$}\}

is not piecewise syndetic. This strengthens the non-piecewise syndeticity result of Example 16 and, in certain cases, Example 17, since it is clear that AA is a superset of the (pmin(Yp))pP(p^{\min(Y_{p})})_{p\in P}-free numbers. Below, we state this result in better generality and deduce it easily from Theorem 14.

Given any bb\in\mathbb{N} and kk\in\mathbb{Z}, define eb(k)e_{b}(k) to be the largest power of bb dividing kk (set eb(0)=e_{b}(0)=\infty). Now let =(bn)\mathscr{B}=(b_{n}) be a sequence of pairwise coprime positive integers and let u=(un)u=(u_{n}) be any sequence of positive integers. Define

u={k:ebn(k)un for each n}.\mathcal{F}_{\mathscr{B}}^{u}=\{k\in\mathbb{Z}\colon\text{$e_{b_{n}}(k)\neq u_{n}$ for each $n\in\mathbb{N}$}\}.

Then u(bnun+1+bnun)\mathcal{F}_{\mathscr{B}}^{u}\subseteq\mathbb{Z}\setminus\bigcup(b_{n}^{u_{n}+1}\mathbb{N}+b_{n}^{u_{n}}), so u\mathcal{F}_{\mathscr{B}}^{u} is not piecewise syndetic by Theorem 14. In Theorem 24 below, we prove that, if bnun<\sum b_{n}^{-u_{n}}<\infty, then

d(u)=n(1bn1bnun+1).d(\mathcal{F}_{\mathscr{B}}^{u})=\prod_{n\in\mathbb{N}}{\left(1-\frac{b_{n}-1}{b_{n}^{u_{n}+1}}\right)}.
Example 20.

The set CC consisting of pairs (a,b)(a,b) of coprime integers can be expressed as

C=(×)p prime(p×p),C=(\mathbb{Z}\times\mathbb{Z})\setminus\bigcup_{\text{$p$ prime}}(p\mathbb{Z}\times p\mathbb{Z}),

making clear that CC is not piecewise syndetic by Theorem 14. It is well known that d(C)=6/π2d(C)=6/\pi^{2}, which will follow from the more general Theorem 26 (in other words, the probability that two “randomly chosen” integers are coprime is 6/π26/\pi^{2}; see, e.g., [26, Theorem 332]). Many possible generalizations in this direction can be imagined.

Example 21.

Our final example will take place in a noncommutative setting. Consider the Heisenberg group H3()H_{3}(\mathbb{Z}), the matrix group

{(1ac01b001):a,b,c}.{\left\{{\begin{pmatrix}1&a&c\\ 0&1&b\\ 0&0&1\end{pmatrix}}\colon a,b,c\in\mathbb{Z}\right\}}.

Associating these matrices with triples (a,b,c)(a,b,c), we view H3()H_{3}(\mathbb{Z}) as 3\mathbb{Z}^{3} with the group operation (a1,b1,c1)(a2,b2,c2)=(a1+a2,b1+b2,a1b2+c1+c2)(a_{1},b_{1},c_{1})\cdot(a_{2},b_{2},c_{2})=(a_{1}+a_{2},b_{1}+b_{2},a_{1}b_{2}+c_{1}+c_{2}). For each nn\in\mathbb{N}, the kernel n3n\mathbb{Z}^{3} of the entrywise reduction map H3()H3(/n)H_{3}(\mathbb{Z})\to H_{3}(\mathbb{Z}/n\mathbb{Z}) is a normal subgroup of H3()H_{3}(\mathbb{Z}) of index n3n^{3}. Hence, for any sequence (bn)(b_{n}) of pairwise coprime positive integers, the set H3()bn3H_{3}(\mathbb{Z})\setminus\bigcup b_{n}\mathbb{Z}^{3} is not piecewise syndetic in H3()H_{3}(\mathbb{Z}) by Theorem 14. We will see in Theorem 27 that this set has density (11/bn3)\prod(1-1/b_{n}^{3}) with respect to an appropriate Følner sequence in H3()H_{3}(\mathbb{Z}).

4.3. Computing density

Before computing the density of some of the non-piecewise syndetic sets we constructed in Subsection 4.2, we make some definitions and prove a general lemma.

Suppose we have a countably infinite amenable cancellative semigroup GG with a Følner sequence Φ\Phi. Given a function f:Gf\colon G\to\mathbb{R}, we define

A¯Φ(f)=lim supn1|Φn|gΦnf(g),\overline{A}_{\Phi}(f)=\limsup_{n\to\infty}\frac{1}{|\Phi_{n}|}\sum_{g\in\Phi_{n}}f(g),

and likewise A¯Φ(f)\underline{A}_{\Phi}(f) and AΦ(f)A_{\Phi}(f) in the obvious way. We will be most interested in the case when AΦ(f)A_{\Phi}(f) exists. Assuming this limit exists, it has some obvious but useful properties. First, given EGE\subseteq G, we have AΦ(𝟏E)=dΦ(E)A_{\Phi}(\mathbf{1}_{E})=d_{\Phi}(E). Second, if f1,f2:Gf_{1},f_{2}\colon G\to\mathbb{R} satisfy f1f2f_{1}\leq f_{2}, then AΦ(f1)AΦ(f2)A_{\Phi}(f_{1})\leq A_{\Phi}(f_{2}). Finally, fAΦ(f)f\mapsto A_{\Phi}(f) is finitely additive: AΦ(f1+f2)=AΦ(f1)+AΦ(f2)A_{\Phi}(f_{1}+f_{2})=A_{\Phi}(f_{1})+A_{\Phi}(f_{2}).

In order to prove that a set of the form GEnG\setminus\bigcup E_{n} has density, we will impose a number of nice properties on the family (En)(E_{n}). We adopt the notation EI=iIEiE_{I}=\bigcap_{i\in I}E_{i} for II\subseteq\mathbb{N}. In addition, we use 𝒫k()\mathcal{P}_{k}(\mathbb{N}) to refer to the set of kk-element subsets of \mathbb{N}.

Specifically, we say that (En)(E_{n}) is Φ\Phi-inclusion-exclusion good, or Φ\Phi-I.E. good if

  1. (a)

    dΦ(En)d_{\Phi}(E_{n}) exists for each nn\in\mathbb{N}, and ndΦ(En)<\sum_{n\in\mathbb{N}}d_{\Phi}(E_{n})<\infty;

  2. (b)

    EI=E_{I}=\varnothing for every infinite II\subseteq\mathbb{N};

  3. (c)

    for every finite II\subseteq\mathbb{N}, dΦ(EI)=iIdΦ(Ei)d_{\Phi}(E_{I})=\prod_{i\in I}d_{\Phi}(E_{i}); and

  4. (d)

    for each kk\in\mathbb{N}, AΦ(I𝒫k()𝟏EI)=I𝒫k()dΦ(EI)A_{\Phi}(\sum_{I\in\mathcal{P}_{k}(\mathbb{N})}\mathbf{1}_{E_{I}})=\sum_{I\in\mathcal{P}_{k}(\mathbb{N})}d_{\Phi}(E_{I}).

For Φ\Phi-I.E. good families, the following lemma states that an asymptotic version of the inclusion-exclusion principle holds.

Lemma 22.

Suppose GG and Φ\Phi are as above and (En)(E_{n}) is a Φ\Phi-I.E. good family of subsets of GG. Write =GEn\mathcal{F}=G\setminus\bigcup E_{n}. Then dΦ()=(1dΦ(En))d_{\Phi}(\mathcal{F})=\prod(1-d_{\Phi}(E_{n})).

Proof.

By property (c) of Φ\Phi-I.E. goodness and the standard inclusion-exclusion principle, d¯Φ()(1dΦ(En))\overline{d}_{\Phi}(\mathcal{F})\leq\prod(1-d_{\Phi}(E_{n})), since Gi=1nEi\mathcal{F}\subseteq G\setminus\bigcup_{i=1}^{n}E_{i} for each nn\in\mathbb{N}. Now let nn\in\mathbb{N}. Using the identity

n(1xn)=n=0(1)nI𝒫n()iIxi,\prod_{n\in\mathbb{N}}(1-x_{n})=\sum_{n=0}^{\infty}(-1)^{n}\sum_{I\in\mathcal{P}_{n}(\mathbb{N})}\prod_{i\in I}x_{i},

which holds as long as xi<\sum x_{i}<\infty, properties (a) and (c) of Φ\Phi-I.E. goodness yield

n(1dΦ(En))=n=0(1)nI𝒫n()dΦ(EI).\prod_{n\in\mathbb{N}}(1-d_{\Phi}(E_{n}))=\sum_{n=0}^{\infty}(-1)^{n}\sum_{I\in\mathcal{P}_{n}(\mathbb{N})}d_{\Phi}(E_{I}).

Thus dΦ()=(1dΦ(En))d_{\Phi}(\mathcal{F})=\prod(1-d_{\Phi}(E_{n})) will follow if we can demonstrate that

d¯Φ()1idΦ(Ei)+I𝒫2()dΦ(EI)I𝒫2n1()dΦ(EI).\underline{d}_{\Phi}(\mathcal{F})\geq 1-\sum_{i\in\mathbb{N}}d_{\Phi}(E_{i})+\sum_{I\in\mathcal{P}_{2}(\mathbb{N})}d_{\Phi}(E_{I})-\cdots-\sum_{I\in\mathcal{P}_{2n-1}(\mathbb{N})}d_{\Phi}(E_{I}).

To do this, it suffices, by the finite additivity of fAΦ(f)f\mapsto A_{\Phi}(f) and property (d) of Φ\Phi-I.E. goodness, to show

(5) 𝟏1i𝟏Ei+I𝒫2()𝟏EII𝒫2n1()𝟏EI.\mathbf{1}_{\mathcal{F}}\geq 1-\sum_{i\in\mathbb{N}}\mathbf{1}_{E_{i}}+\sum_{I\in\mathcal{P}_{2}(\mathbb{N})}\mathbf{1}_{E_{I}}-\cdots-\sum_{I\in\mathcal{P}_{2n-1}(\mathbb{N})}\mathbf{1}_{E_{I}}.

Let ff denote the function on the right side of (5) and let rr\in\mathbb{N}. If rr\in\mathcal{F}, then clearly f(r)=1f(r)=1. Otherwise, due to property (b) of Φ\Phi-I.E. goodness, we may suppose rr is contained in exactly k1k\geq 1 elements of (En)(E_{n}). Then

f(r)=i=02n1(1)i(ki)=(k12n1)0,f(r)=\sum_{i=0}^{2n-1}(-1)^{i}\binom{k}{i}=-\binom{k-1}{2n-1}\leq 0,

so indeed 𝟏f\mathbf{1}_{\mathcal{F}}\geq f. This verifies that d¯Φ()=d¯Φ()=(1dΦ(En))\overline{d}_{\Phi}(\mathcal{F})=\underline{d}_{\Phi}(\mathcal{F})=\prod(1-d_{\Phi}(E_{n})). ∎

Using this lemma, we compute the density of sets constructed in Examples 17 and 19. The following two theorems use the standard Følner sequence ({1,,n})n(\{1,\dots,n\})_{n\in\mathbb{N}} in \mathbb{N}, which (as with the notation dd) we omit reference to.

Theorem 23.

Let =(bn)\mathscr{B}=(b_{n}) be a sequence of pairwise coprime positive integers satisfying 1/bn<\sum 1/b_{n}<\infty. Then d()=(11/bn)d(\mathcal{F}_{\mathscr{B}})=\prod(1-1/b_{n}).

Proof.

We prove this statement for =bn\mathcal{F}_{\mathscr{B}}=\mathbb{N}\setminus\bigcup b_{n}\mathbb{N} for convenience, but it is of no consequence, since \mathcal{F}_{\mathscr{B}} is symmetric about 0. We must show that (bn)(b_{n}\mathbb{N}) is I.E. good (with respect to the standard Følner sequence as mentioned above). Only property (d) is not immediate. Notice that it suffices to prove the case k=1k=1, since any intersection of sets of the form bb\mathbb{Z} again has this form.

Write f=𝟏bnf=\sum\mathbf{1}_{b_{n}\mathbb{N}}. Since 𝟏b1𝟏b1+𝟏b2f\mathbf{1}_{b_{1}\mathbb{N}}\leq\mathbf{1}_{b_{1}\mathbb{N}}+\mathbf{1}_{b_{2}\mathbb{N}}\leq\cdots\leq f, clearly A¯(f)1/bn\underline{A}(f)\geq\sum 1/b_{n}. To prove A¯(f)1/bn\overline{A}(f)\leq\sum 1/b_{n}, the key point is that, for each k,nk,n\in\mathbb{N},

1ki=1k𝟏bn(i)1bn,\frac{1}{k}\sum_{i=1}^{k}\mathbf{1}_{b_{n}\mathbb{N}}(i)\leq\frac{1}{b_{n}},

which is just a statement of the fact that there are no more than k/bnk/b_{n} multiples of bnb_{n} in {1,,k}\{1,\dots,k\}. It follows that, for each kk\in\mathbb{N},

1ki=1kf(i)n1bn,\frac{1}{k}\sum_{i=1}^{k}f(i)\leq\sum_{n\in\mathbb{N}}\frac{1}{b_{n}},

whence A¯(f)=A¯(f)=1/bn\overline{A}(f)=\underline{A}(f)=\sum 1/b_{n}. Thus (bn)(b_{n}\mathbb{N}) is I.E. good. ∎

Theorem 24.

Let =(bn)\mathscr{B}=(b_{n}) be a sequence of pairwise coprime positive integers and let u=(un)u=(u_{n}) be a sequence of positive integers for which bnun<\sum b_{n}^{-u_{n}}<\infty. Then

d(u)=n(1bn1bnun+1).d(\mathcal{F}_{\mathscr{B}}^{u})=\prod_{n\in\mathbb{N}}{\left(1-\frac{b_{n}-1}{b_{n}^{u_{n}+1}}\right)}.
Proof.

As in Theorem 23 we confine ourselves to \mathbb{N}. For each nn\in\mathbb{N} define Bn={k:ebn(k)=un}B_{n}=\{k\in\mathbb{N}\colon e_{b_{n}}(k)=u_{n}\}, so that u=Bn\mathcal{F}_{\mathscr{B}}^{u}\cap\mathbb{N}=\mathbb{N}\setminus\bigcup B_{n}. We aim to show that (Bn)(B_{n}) is an I.E. good family. (The density d(Bn)d(B_{n}) is easy to compute but is also derived below.) Clearly property (b) of I.E. goodness is satisfied.

Given II\subseteq\mathbb{N}, define also BI=iIBiB_{I}=\bigcap_{i\in I}B_{i}, bI=iIbib_{I}=\prod_{i\in I}b_{i}, and rI=iIbiuir_{I}=\prod_{i\in I}b_{i}^{u_{i}}. Observe that BIrIB_{I}\subseteq r_{I}\mathbb{N}. Define CI=BI/rIC_{I}=B_{I}/r_{I} to be the set of positive integers not divisible by bib_{i} for any iIi\in I. Observe that CIC_{I} is bIb_{I}-periodic, i.e., for each kk\in\mathbb{N}, one has kCIk\in C_{I} if and only if bI+kCIb_{I}+k\in C_{I}. In particular, d(CI)=|CI{1,,bI}|/bId(C_{I})=|C_{I}\cap\{1,\dots,b_{I}\}|/b_{I}. Now consider the isomorphism φ:/bIiI/bi\varphi\colon\mathbb{Z}/b_{I}\mathbb{Z}\overset{\sim}{\to}\prod_{i\in I}\mathbb{Z}/b_{i}\mathbb{Z} afforded by the classical Chinese remainder theorem. An element x/bIx\in\mathbb{Z}/b_{I}\mathbb{Z} is in CIC_{I} if and only if φ(x)\varphi(x) is nonzero in every coordinate, from which it follows that |CI{1,,bI}|=iI(bi1)|C_{I}\cap\{1,\dots,b_{I}\}|=\prod_{i\in I}(b_{i}-1). From this we get

d(BI)=d(rICI)=1bIrIiI(bi1)=iIbi1biui+1=iId(Bi),d(B_{I})=d(r_{I}C_{I})=\frac{1}{b_{I}r_{I}}\prod_{i\in I}(b_{i}-1)=\prod_{i\in I}\frac{b_{i}-1}{b_{i}^{u_{i}+1}}=\prod_{i\in I}d(B_{i}),

proving that property (c) above holds for (Bn)(B_{n}). In addition, the formula we have obtained for d(Bi)d(B_{i}) and the hypotheses on \mathscr{B} and uu verify property (a).

Now fix kk\in\mathbb{N} and set f=I𝒫k()𝟏BIf=\sum_{I\in\mathcal{P}_{k}(\mathbb{N})}\mathbf{1}_{B_{I}}. As in Theorem 23 we have A¯(f)d(BI)\underline{A}(f)\geq\sum d(B_{I}). To prove that A¯(f)d(BI)\overline{A}(f)\leq\sum d(B_{I}), fix ε>0\varepsilon>0. Select NN\in\mathbb{N} such that

I𝒫k()max(I)>N1rI<ε2.\sum_{\begin{subarray}{c}I\in\mathcal{P}_{k}(\mathbb{N})\\ \max(I)>N\end{subarray}}\frac{1}{r_{I}}<\frac{\varepsilon}{2}.

Now let I𝒫k()I\in\mathcal{P}_{k}(\mathbb{N}) and note that, since BI=rICIB_{I}=r_{I}C_{I} where CIC_{I} is bIb_{I}-periodic, we have, for each aa\in\mathbb{N} and mabIrIm\geq ab_{I}r_{I},

|BI{1,,m}|m(a+1)|BI{1,,bIrI}|abIrI=a+1ad(BI).\frac{|B_{I}\cap\{1,\dots,m\}|}{m}\leq\frac{(a+1)\cdot|B_{I}\cap\{1,\dots,b_{I}r_{I}\}|}{ab_{I}r_{I}}=\frac{a+1}{a}\cdot d(B_{I}).

Find aa\in\mathbb{N} for which I𝒫k()d(BI)/a<ε/2\sum_{I\in\mathcal{P}_{k}(\mathbb{N})}d(B_{I})/a<\varepsilon/2 and set R=maxI𝒫k({1,,N})bIrIR=\max_{I\in\mathcal{P}_{k}(\{1,\dots,N\})}b_{I}r_{I}. Then, for each maRm\geq aR,

1mk=1mf(k)\displaystyle\frac{1}{m}\sum_{k=1}^{m}f(k) =I𝒫k({1,,N})|BI{1,,m}|m+I𝒫k()max(I)>N|BI{1,,m}|m\displaystyle=\sum_{I\in\mathcal{P}_{k}(\{1,\dots,N\})}\frac{|B_{I}\cap\{1,\dots,m\}|}{m}+\sum_{\begin{subarray}{c}I\in\mathcal{P}_{k}(\mathbb{N})\\ \max(I)>N\end{subarray}}\frac{|B_{I}\cap\{1,\dots,m\}|}{m}
I𝒫k({1,,N})a+1ad(BI)+I𝒫k()max(I)>N1rI\displaystyle\leq\sum_{I\in\mathcal{P}_{k}(\{1,\dots,N\})}\frac{a+1}{a}\cdot d(B_{I})+\sum_{\begin{subarray}{c}I\in\mathcal{P}_{k}(\mathbb{N})\\ \max(I)>N\end{subarray}}\frac{1}{r_{I}}
<(I𝒫k()d(BI))+ε2+ε2\displaystyle<\left(\sum_{I\in\mathcal{P}_{k}(\mathbb{N})}d(B_{I})\right)+\frac{\varepsilon}{2}+\frac{\varepsilon}{2}

(we have |BI{1,,m}|/m1/rI|B_{I}\cap\{1,\dots,m\}|/m\leq 1/r_{I} since BIrIB_{I}\subseteq r_{I}\mathbb{N}). This verifies that (Bn)(B_{n}) is indeed I.E. good and completes the proof. ∎

We next compute the density of the sets constructed in Examples 20 and 21. To do so, we will need a technical lemma.

Lemma 25.

Let GG and HH be countably infinite amenable cancellative semigroups with Følner sequences Φ\Phi and Ψ\Psi, respectively. Let (Dn)(D_{n}) be a Φ\Phi-I.E. good family of subsets of GG and (En)(E_{n}) be a Ψ\Psi-I.E. good family of subsets of HH. Then (Dn×En)(D_{n}\times E_{n}) is a (Φ×Ψ)(\Phi\times\Psi)-I.E. good family of G×HG\times H, where Φ×Ψ=(Φn×Ψn)n\Phi\times\Psi=(\Phi_{n}\times\Psi_{n})_{n\in\mathbb{N}}.

Proof.

The fact that Φ×Ψ\Phi\times\Psi is a Følner sequence for G×HG\times H is easy and left to the reader. We outline the proof that (Dn×En)(D_{n}\times E_{n}) satisfies property (d) of (Φ×Ψ)(\Phi\times\Psi)-I.E. goodness; the others are straightforward. We have, for each kk\in\mathbb{N},

AΦ×Ψ(I𝒫k()𝟏DI×EI)=limnI𝒫k()|DIΦn||Φn||EIΨn||Ψn|.A_{\Phi\times\Psi}{\left(\sum_{I\in\mathcal{P}_{k}(\mathbb{N})}\mathbf{1}_{D_{I}\times E_{I}}\right)}=\lim_{n\to\infty}\sum_{I\in\mathcal{P}_{k}(\mathbb{N})}\frac{|D_{I}\cap\Phi_{n}|}{|\Phi_{n}|}\cdot\frac{|E_{I}\cap\Psi_{n}|}{|\Psi_{n}|}.

But since

AΦ(I𝒫k()𝟏DI)=limnI𝒫k()|DIΦn||Φn|=I𝒫k()dΦ(DI),A_{\Phi}{\left(\sum_{I\in\mathcal{P}_{k}(\mathbb{N})}\mathbf{1}_{D_{I}}\right)}=\lim_{n\to\infty}\sum_{I\in\mathcal{P}_{k}(\mathbb{N})}\frac{|D_{I}\cap\Phi_{n}|}{|\Phi_{n}|}=\sum_{I\in\mathcal{P}_{k}(\mathbb{N})}d_{\Phi}(D_{I}),

and likewise with EIE_{I}, proving property (d) reduces to verifying the following claim: given doubly-indexed sequences (ak,n)(a_{k,n}) and (bk,n)(b_{k,n}) of nonnegative quantities satisfying

  • limnak,n=ak\lim_{n\to\infty}a_{k,n}=a_{k} and limnbk,n=bk\lim_{n\to\infty}b_{k,n}=b_{k} for each kk, and

  • limnkak,n=ak<\lim_{n\to\infty}\sum_{k\in\mathbb{N}}a_{k,n}=\sum a_{k}<\infty and limnkbk,n=bk<\lim_{n\to\infty}\sum_{k\in\mathbb{N}}b_{k,n}=\sum b_{k}<\infty,

one has limnkak,nbk,n=akbk\lim_{n\to\infty}\sum_{k\in\mathbb{N}}a_{k,n}b_{k,n}=\sum a_{k}b_{k}. ∎

Theorem 26.

Let d2d\geq 2 and let (bn,1),,(bn,d)(b_{n,1}),\dots,(b_{n,d}) be sequences such that, for each ii, (bn,i)(b_{n,i}) consists of pairwise coprime positive integers. Then

d(dn(bn,1××bn,d))=n(11bn,1bn,d).d{\left(\mathbb{Z}^{d}\setminus\bigcup_{n\in\mathbb{N}}(b_{n,1}\mathbb{Z}\times\cdots\times b_{n,d}\mathbb{Z})\right)}=\prod_{n\in\mathbb{N}}{\left(1-\frac{1}{b_{n,1}\cdots b_{n,d}}\right)}.
Proof.

This follows from Lemma 22 and Theorem 23 via repeated applications of Lemma 25. ∎

To apply Lemma 25 to the Heisenberg group H3()H_{3}(\mathbb{Z}), viewed as 3\mathbb{Z}^{3} with the group operation described in Example 21, a laborious but straightforward computation shows that

(6) Φ=({n,,n}×{n2,,n2}×{n,,n})n\Phi=(\{-n,\dots,n\}\times\{-n^{2},\dots,n^{2}\}\times\{-n,\dots,n\})_{n\in\mathbb{N}}

is a Følner sequence in H3()H_{3}(\mathbb{Z}).

Theorem 27.

Let (bn)(b_{n}) be a sequence of pairwise coprime positive integers. Then, letting Φ\Phi be as in (6), we have

dΦ(H3()nbn3)=n(11bn3).d_{\Phi}{\left(H_{3}(\mathbb{Z})\setminus\bigcup_{n\in\mathbb{N}}b_{n}\mathbb{Z}^{3}\right)}=\prod_{n\in\mathbb{N}}{\left(1-\frac{1}{b_{n}^{3}}\right)}.
Proof.

We may again apply Lemma 22, Theorem 23, and Lemma 25 to deduce that 3bn3\mathbb{Z}^{3}\setminus\bigcup b_{n}\mathbb{Z}^{3} is Φ\Phi-I.E. good as a subset of (3,+)(\mathbb{Z}^{3},+) (notice that Φ\Phi-I.E. goodness is unaffected by passing to a subsequence of Φ\Phi—in our case from ({n,,n})n(\{-n,\dots,n\})_{n\in\mathbb{N}} to ({n2,,n2})n(\{-n^{2},\dots,n^{2}\})_{n\in\mathbb{N}}). Since the definition of density does not depend on the group operation, we are done. ∎

5. Discordant sets in SL2()\operatorname{SL}_{2}(\mathbb{Z})

Let GG denote the free semigroup generated by aa and bb with identity 1G1\in G. It is well known that GG is not amenable. To see why, suppose Φ\Phi is a Følner sequence for GG, and form the partition G={1}aGbGG=\{1\}\cup aG\cup bG. One of these parts must have positive upper density with respect to Φ\Phi, and since {1}\{1\} does not, suppose without loss of generality that c=d¯Φ(aG)>0c=\overline{d}_{\Phi}(aG)>0. Then aG,baG,b2aG,aG,baG,b^{2}aG,\dots are disjoint subsets of GG each with upper density c>0c>0. But Lemma 3 guarantees that this is impossible, which is our desired contradiction. This argument can be modified without too much difficulty to show that nonabelian free groups are not amenable, and moreover any group possessing a nonabelian free subgroup is not amenable.

We now turn our attention to the group SL2()\operatorname{SL}_{2}(\mathbb{Z}). Since SL2()\operatorname{SL}_{2}(\mathbb{Z}) contains nonabelian free subgroups,999For example, it is well-known that the matrices (1201)\big{(}\!{\begin{smallmatrix}1&2\\ 0&1\end{smallmatrix}}\!\big{)} and (1021)\big{(}\!{\begin{smallmatrix}1&0\\ 2&1\end{smallmatrix}}\!\big{)} generate a free subgroup of SL2()\operatorname{SL}_{2}(\mathbb{Z}) (see [33]). it is not amenable. However, we will define a natural density-like notion for subsets of SL2()\operatorname{SL}_{2}(\mathbb{Z}) and construct sets which are large with respect to this notion but not piecewise syndetic. Although this density does not enjoy shift-invariance (which in the amenable case is assured by Følner sequences), we consider this result an interesting aside which has independent value. In particular, it makes more robust the analogy with nowhere dense sets of positive measure, showing that it extends beyond the amenable case.

We first leverage Theorem 14 to produce an ample supply of non-piecewise syndetic subsets of SL2()\operatorname{SL}_{2}(\mathbb{Z}). Recall the congruence subgroups of SL2()\operatorname{SL}_{2}(\mathbb{Z}), which are subgroups of the form

Γ(n)={(abcd)SL2():(abcd)(1001)(modn)}.\Gamma(n)={\left\{{\begin{pmatrix}a&b\\ c&d\end{pmatrix}}\in\operatorname{SL}_{2}(\mathbb{Z})\colon{\begin{pmatrix}a&b\\ c&d\end{pmatrix}}\equiv{\begin{pmatrix}1&0\\ 0&1\end{pmatrix}}\pmod{n}\right\}}.

The subgroup Γ(n)\Gamma(n) is normal in SL2()\operatorname{SL}_{2}(\mathbb{Z}) for each nn\in\mathbb{N} since it is the kernel of the homomorphism φn:SL2()SL2(/n)\varphi_{n}\colon\operatorname{SL}_{2}(\mathbb{Z})\to\operatorname{SL}_{2}(\mathbb{Z}/n\mathbb{Z}) defined by reducing each matrix entrywise modulo nn.

Theorem 28.

Let (bn)(b_{n}) be a sequence of pairwise coprime positive integers, and define A=SL2()Γ(bn)A=\operatorname{SL}_{2}(\mathbb{Z})\setminus\bigcup\Gamma(b_{n}). Then AA is not piecewise syndetic.

Proof.

Fix any coprime m,nm,n\in\mathbb{N}. Since Γ(m)Γ(n)=Γ(mn)\Gamma(m)\cap\Gamma(n)=\Gamma(mn), to apply Theorem 14 it suffices to show that Γ(m)Γ(n)=SL2()\Gamma(m)\Gamma(n)=\operatorname{SL}_{2}(\mathbb{Z}). We claim that we may always find, for any TSL2(/m)T\in\operatorname{SL}_{2}(\mathbb{Z}/m\mathbb{Z}), some SSL2()S\in\operatorname{SL}_{2}(\mathbb{Z}) such that φm(S)=T\varphi_{m}(S)=T and φn(S)=I\varphi_{n}(S)=I. Granted this, fix SSL2()S\in\operatorname{SL}_{2}(\mathbb{Z}), and find S1,S2SL2()S_{1},S_{2}\in\operatorname{SL}_{2}(\mathbb{Z}) such that φm(S1)=φm(S)\varphi_{m}(S_{1})=\varphi_{m}(S), φn(S1)=I\varphi_{n}(S_{1})=I, φm(S2)=I\varphi_{m}(S_{2})=I, and φn(S2)=φn(S)\varphi_{n}(S_{2})=\varphi_{n}(S). Then SS11S21ker(φm)ker(φn)=Γ(mn)SS_{1}^{-1}S_{2}^{-1}\in\ker(\varphi_{m})\cap\ker(\varphi_{n})=\Gamma(mn), so there exists TΓ(mn)T\in\Gamma(mn) such that S=(TS2)S1Γ(m)Γ(n)S=(TS_{2})S_{1}\in\Gamma(m)\Gamma(n). Hence Γ(m)Γ(n)=SL2()\Gamma(m)\Gamma(n)=\operatorname{SL}_{2}(\mathbb{Z}), as desired.

To prove the claim, let T=(abcd)T=\big{(}\!{\begin{smallmatrix}a&b\\ c&d\end{smallmatrix}}\!\big{)}, treating the entries as elements of \mathbb{Z}. Find x,yx,y\in\mathbb{Z} such that mx+ny=1mx+ny=1 and set

S=(abcd)=(mx+anybnycnymx+dny),S^{\prime}={\begin{pmatrix}a^{\prime}&b^{\prime}\\ c^{\prime}&d^{\prime}\end{pmatrix}}={\begin{pmatrix}mx+any&bny\\ cny&mx+dny\end{pmatrix}},

so ST(modm)S^{\prime}\equiv T\pmod{m} and SI(modn)S^{\prime}\equiv I\pmod{n}. Note that det(S)1(modmn)\det(S^{\prime})\equiv 1\pmod{mn}. It follows that gcd(a,b,mn)=1\gcd(a^{\prime},b^{\prime},mn)=1. Letting qq denote the product of primes dividing aa^{\prime} but not bb^{\prime}, we see that aa^{\prime} and b+mnqb^{\prime}+mnq are coprime. Therefore, there exist u,vu,v\in\mathbb{Z} such that au+(b+mnq)v=1a^{\prime}u+(b^{\prime}+mnq)v=1. It also follows from det(S)1(modmn)\det(S^{\prime})\equiv 1\pmod{mn} that there exists ss\in\mathbb{Z} such that

det(ab+mnqcd)=mns+1.\det{\begin{pmatrix}a^{\prime}&b^{\prime}+mnq\\ c^{\prime}&d^{\prime}\end{pmatrix}}=mns+1.

Then

S=(ab+mnqc+mnsvdmnsu)S={\begin{pmatrix}a^{\prime}&b^{\prime}+mnq\\ c^{\prime}+mnsv&d^{\prime}-mnsu\end{pmatrix}}

is an element of SL2()\operatorname{SL}_{2}(\mathbb{Z}) satisfying φm(S)=T\varphi_{m}(S)=T and φn(S)=I\varphi_{n}(S)=I. ∎

We will now introduce a notion of density for subsets of SL2()\operatorname{SL}_{2}(\mathbb{Z}). Define

Fn={(abcd)SL2():|a|,|b|,|c|,|d|n}.F_{n}={\left\{\begin{pmatrix}a&b\\ c&d\end{pmatrix}\in\operatorname{SL}_{2}(\mathbb{Z})\colon|a|,|b|,|c|,|d|\leq n\right\}}.

We then define the upper density of ASL2()A\subseteq\operatorname{SL}_{2}(\mathbb{Z}) to be

d¯(A)=lim supn|AFn||Fn|,\overline{d}(A)=\limsup_{n\to\infty}\frac{|A\cap F_{n}|}{|F_{n}|},

and define d¯\underline{d} and dd analogously. We first make some crude estimates on the growth of the sets FnF_{n} and Γ(k)Fn\Gamma(k)\cap F_{n}.

Lemma 29.

There is a positive integer NN such that |Fn|(12/π2)n2|F_{n}|\geq(12/\pi^{2})n^{2} for all nNn\geq N.

Proof.

We will produce, for sufficiently large nn, at least (12/π2)n2(12/\pi^{2})n^{2} matrices (abcd)SL2()\big{(}\!{\begin{smallmatrix}a&b\\ c&d\end{smallmatrix}}\!\big{)}\in\operatorname{SL}_{2}(\mathbb{Z}) such that max{|a|,|b|,|c|,|d|}{|a|,|b|}\max\{|a|,|b|,|c|,|d|\}\in\{|a|,|b|\} and |a|,|b|,|c|,|d|n|a|,|b|,|c|,|d|\leq n. As we saw in Example 20,

limn|{a,b:|a|,|b|n and gcd(a,b)=1}|(2n+1)2=6π2.\lim_{n\to\infty}\frac{|\{a,b\in\mathbb{Z}\colon\text{$|a|,|b|\leq n$ and $\gcd(a,b)=1$}\}|}{(2n+1)^{2}}=\frac{6}{\pi^{2}}.

Thus there is a positive integer NN such that for all nNn\geq N, there are at least (3/π2)(2n+1)2>(12/π2)n2(3/\pi^{2})(2n+1)^{2}>(12/\pi^{2})n^{2} pairs (a,b)(a,b) such that max{|a|,|b|}n\max\{|a|,|b|\}\leq n and gcd(a,b)=1\gcd(a,b)=1. Given such a pair (a,b)(a,b), using the Euclidean algorithm, we can find c,dc,d\in\mathbb{Z} with |c||a||c|\leq|a| and |d||b||d|\leq|b| such that adbc=1ad-bc=1. Then (abcd)Fn\big{(}\!{\begin{smallmatrix}a&b\\ c&d\end{smallmatrix}}\!\big{)}\in F_{n}, as desired. ∎

Lemma 30.

For each k2k\geq 2 and nkn\geq k, we have |Γ(k)Fn|(96/k2)n2|\Gamma(k)\cap F_{n}|\leq(96/k^{2})n^{2}.

Proof.

Define

Λ(k)={(abcd)Γ(k):max{|a|,|b|,|c|,|d|}{|a|,|b|}}.\Lambda(k)={\left\{\begin{pmatrix}a&b\\ c&d\end{pmatrix}\in\Gamma(k)\colon\max\{|a|,|b|,|c|,|d|\}\in\{|a|,|b|\}\right\}}.

Given MΓ(k)FnM\in\Gamma(k)\cap F_{n}, either MM or the matrix given by swapping the rows and columns of MM is in Λ(k)Fn\Lambda(k)\cap F_{n}. Thus |Γ(k)Fn|2|Λ(k)Fn||\Gamma(k)\cap F_{n}|\leq 2\cdot|\Lambda(k)\cap F_{n}|.

There are at most (2n+1)/k2((2n+1+k)/k)2(4n/k)2\lceil(2n+1)/k\rceil^{2}\leq((2n+1+k)/k)^{2}\leq(4n/k)^{2} pairs (a,b)2(a,b)\in\mathbb{Z}^{2} with |a|,|b|n|a|,|b|\leq n such that a1(modk)a\equiv 1\pmod{k} and b0(modk)b\equiv 0\pmod{k}. In particular, there are at most (4n/k)2(4n/k)^{2} such pairs (a,b)(a,b) with gcd(a,b)=1\gcd(a,b)=1. Given such a pair, one can find, using the Euclidean algorithm, c,dc,d\in\mathbb{Z} such that adbc=1ad-bc=1, |d||b||d|\leq|b|, |c||a||c|\leq|a|. Then

{(x,y)2:axby=1}={(d+mb,c+ma):m},\{(x,y)\in\mathbb{Z}^{2}\colon ax-by=1\}=\{(d+mb,c+ma)\colon m\in\mathbb{Z}\},

so there are at most three pairs (x,y)(x,y) such that axby=1ax-by=1 and max{|x|,|y|}max{|a|,|b|}\max\{|x|,|y|\}\leq\max\{|a|,|b|\}, and in particular there are at most three (x,y)(x,y) such that (abyx)Λ(k)Fn\big{(}\!{\begin{smallmatrix}a&b\\ y&x\end{smallmatrix}}\!\big{)}\in\Lambda(k)\cap F_{n}.

Combining all of this, we have |Γ(k)Fn|6(4n/k)2=(96/k2)n2|\Gamma(k)\cap F_{n}|\leq 6(4n/k)^{2}=(96/k^{2})n^{2}. ∎

With these estimates, we can show that under certain conditions, the sets in Theorem 28 have positive lower density.

Theorem 31.

Let (bn)(b_{n}) be a sequence of pairwise coprime positive integers such that 8π2/bn2<1\sum 8\pi^{2}/b_{n}^{2}<1. Then A=SL2()Γ(bn)A=\operatorname{SL}_{2}(\mathbb{Z})\setminus\bigcup\Gamma(b_{n}) has positive lower density.

Proof.

For 2n<bm2\leq n<b_{m}, the only element of Γ(bm)Fn\Gamma(b_{m})\cap F_{n} is the identity matrix II. For m2m\geq 2, write Γ(bm)=Γ(bm){I}\Gamma^{\prime}(b_{m})=\Gamma(b_{m})\setminus\{I\} and Γ(b1)=Γ(b1)\Gamma^{\prime}(b_{1})=\Gamma(b_{1}), so that Γ(bm)=Γ(bm)\bigcup\Gamma(b_{m})=\bigcup\Gamma^{\prime}(b_{m}). Then, for 2n<bm2\leq n<b_{m}, we have Γ(bm)Fn=\Gamma^{\prime}(b_{m})\cap F_{n}=\varnothing, so

AFn=Fnbmn(Γ(bm)Fn).A\cap F_{n}=F_{n}\setminus\bigcup_{b_{m}\leq n}(\Gamma^{\prime}(b_{m})\cap F_{n}).

For sufficiently large nn, we then have, applying Lemmas 29 and 30,

|AFn||Fn|1bmn|Γ(bm)Fn||Fn|1bmn(96/bm2)n2(12/π2)n21m8π2bm2.\frac{|A\cap F_{n}|}{|F_{n}|}\geq 1-\sum_{b_{m}\leq n}\frac{|\Gamma^{\prime}(b_{m})\cap F_{n}|}{|F_{n}|}\geq 1-\sum_{b_{m}\leq n}\frac{(96/b_{m}^{2})n^{2}}{(12/\pi^{2})n^{2}}\geq 1-\sum_{m\in\mathbb{N}}\frac{8\pi^{2}}{b_{m}^{2}}.

Thus d¯(A)18π2/bn2>0\underline{d}(A)\geq 1-\sum 8\pi^{2}/b_{n}^{2}>0, as desired. ∎

6. Recurrence and anti-recurrence

Let GG be a countably infinite group. A set AGA\subseteq G is a set of (measurable) recurrence if for all measure preserving systems (X,,μ,G)(X,\mathcal{B},\mu,G) and for all YXY\subseteq X with μ(Y)>0\mu(Y)>0, there is a gAg\in A not equal to the identity such that μ(YgY)>0\mu(Y\cap gY)>0. Recall that in a measure-preserving system, we assume μ(X)=1\mu(X)=1, so sets of recurrence exist (for example, take A=GA=G).

Call AGA\subseteq G an anti-recurrence set, or an AR set, if for all gGg\in G, gAgA is not a set of recurrence. We will begin by showing that any piecewise syndetic set is a shift of a set of recurrence, so any AR set with positive upper density is therefore discordant.

Lemma 32.

Let TGT\subseteq G be thick. Then TT is a set of recurrence.

Proof.

Let TGT\subseteq G be thick. First, we will show that there is an infinite sequence (gn)(g_{n}) of elements of GG such that for all pairs i,ji,j\in\mathbb{N} with i<ji<j, we have gi1gjTg_{i}^{-1}g_{j}\in T. Indeed, such a set can be constructed recursively: given g1,,gng_{1},\dots,g_{n}, we can find gn+1g_{n+1} such that {g11,g21,,gn1}gn+1T\{g_{1}^{-1},g_{2}^{-1},\dots,g_{n}^{-1}\}g_{n+1}\subseteq T.

Let (X,,μ,G)(X,\mathcal{B},\mu,G) be a measure preserving system, and let YXY\subseteq X have μ(Y)>0\mu(Y)>0. There exist gi,gjg_{i},g_{j} with i<ji<j such that μ(giYgjY)>0\mu(g_{i}Y\cap g_{j}Y)>0. Then, μ(Ygi1gjY)>0\mu(Y\cap g_{i}^{-1}g_{j}Y)>0, as desired. ∎

Lemma 33.

Let AGA\subseteq G be a set of recurrence partitioned as A=i=1rAiA=\bigcup_{i=1}^{r}A_{i}. Then at least one AiA_{i} is a set of recurrence.

Proof.

Assume that no AiA_{i} is a set of recurrence. Then, for each ii, there is a measure preserving system (Xi,i,μi,G)(X_{i},\mathcal{B}_{i},\mu_{i},G) and YiXiY_{i}\subseteq X_{i} with μi(Yi)>0\mu_{i}(Y_{i})>0 such that for all gAig\in A_{i} we have μi(YigYi)=0\mu_{i}(Y_{i}\cap gY_{i})=0. The actions of GG on each XiX_{i} induce an action on X=X1××XnX=X_{1}\times\dots\times X_{n}, which is measure preserving with respect to the product measure μ\mu of the measures μi\mu_{i}. Let Y=Y1××YnY=Y_{1}\times\dots\times Y_{n}. We have μ(Y1××Yn)>0\mu(Y_{1}\times\dots\times Y_{n})>0, but for all gAg\in A, since gAig\in A_{i} for some ii, we have μi(YigYi)=0\mu_{i}(Y_{i}\cap gY_{i})=0, and thus μ(YgY)=0\mu(Y\cap gY)=0. Therefore, AA is not a set of recurrence. This establishes the contrapositive. ∎

Theorem 34.

If AA is AR, then AA is not piecewise syndetic.

Proof.

Apply Lemma 5 with 𝒜={gA:A is a set of recurrence}\mathcal{A}=\{gA\colon A\text{ is a set of recurrence}\}, noting the previous two lemmas. ∎

In [5], it is noted that Straus’ example of a discordant set is also an AR set. We will follow their terminology and refer to AR sets with positive upper density as Straus sets. In what follows, we investigate the question of whether a given discordant set is a Straus set. We begin by showing that Example 17 provides large class of non-examples.

Theorem 35.

Let =(bn)\mathscr{B}=(b_{n}) be a sequence of pairwise coprime positive integers such that 1/bn<\sum 1/b_{n}<\infty. Define =bn\mathcal{F}_{\mathscr{B}}=\mathbb{Z}\setminus\bigcup b_{n}\mathbb{Z}. Then \mathcal{F}_{\mathscr{B}} is a discordant set which is not AR.

Proof.

As noted in Example 17, \mathcal{F}_{\mathscr{B}} is discordant. Now let bb\in\mathcal{F}_{\mathscr{B}}. By [12, Theorem 2.8], b\mathcal{F}_{\mathscr{B}}-b contains an IP set. Since IP sets are sets of recurrence, b\mathcal{F}_{\mathscr{B}}-b is also a set of recurrence, and hence \mathcal{F}_{\mathscr{B}} cannot be an AR set. ∎

Not all countably infinite amenable groups admit Straus sets. For example, in the group of finitely supported even permutations of \mathbb{N}, every set with positive upper Banach density is a set of recurrence (in fact, such a set must be IP). In [5, Theorem 1.23], it is shown that Straus sets exist in a locally compact, second countable, amenable group if and only if the von Neumann kernel of that group is not cocompact. In such groups, for all ε>0\varepsilon>0, there is a Straus set EE with d(E)>1εd(E)>1-\varepsilon. Thus, it is natural to ask whether such a group has a Straus set EE with d(E)=cd(E)=c for each c(0,1)c\in(0,1). We prove that this is the case for all countably infinite abelian groups in Theorem 38. First, we need the following lemma, a special case of a lemma appearing in [5].

Lemma 36 (cf. [5, Lemma 5.1]).

Let GG be a countably infinite amenable group and let Φ\Phi be a Følner sequence in GG. Let (An)(A_{n}) be a sequence of subsets of GG such that dΦ(An)d_{\Phi}(A_{n}) exists for every nn, dΦ(An)<\sum d_{\Phi}(A_{n})<\infty, and dΦ(A1An)d_{\Phi}(A_{1}\cup\dots\cup A_{n}) exists for every nn. Then there are cofinite subsets AnAnA_{n}^{\prime}\subseteq A_{n} such that for C=AnC=\bigcup A_{n}^{\prime}, dΦ(C)=limdΦ(A1An)d_{\Phi}(C)=\lim d_{\Phi}(A_{1}\cup\dots\cup A_{n}).

Before proving Theorem 38, we give a proof of the case G=G=\mathbb{Z} which does not rely on tools from ergodic theory. This also serves as a model of the proof of Theorem 38 for arbitrary discrete countably infinite abelian groups.

For the proof of Theorem 37, the notion of well-distribution for a sequence in 𝕋\mathbb{T} will be used. For a more detailed exposition of this theory, see [31]. A sequence (xn)(x_{n}) in 𝕋\mathbb{T} is called well-distributed if for any measurable A𝕋A\subseteq\mathbb{T} with μ(A)=0\mu(\partial A)=0,

limNM1NMn=M+1N𝟏A(xn)=μ(A),\lim_{N-M\to\infty}\frac{1}{N-M}\sum_{n=M+1}^{N}\mathbf{1}_{A}(x_{n})=\mu(A),

where μ\mu is the Lebesgue measure and A\partial A is the boundary of AA, i.e., the closure of AA minus the interior of AA. Equivalently, (xn)(x_{n}) is well-distributed if for any Følner sequence Φ\Phi in \mathbb{Z} and any measurable A𝕋A\subseteq\mathbb{T} with μ(A)=0\mu(\partial A)=0,

limn1|Φn|kΦn𝟏A(xk)=μ(A).\lim_{n\to\infty}\frac{1}{|\Phi_{n}|}\sum_{k\in\Phi_{n}}\mathbf{1}_{A}(x_{k})=\mu(A).
Theorem 37.

Let c(0,1)c\in(0,1) and let Φ\Phi be a Følner sequence in \mathbb{Z}. Then, there is a Straus set AA\subseteq\mathbb{Z} such that dΦ(A)=cd_{\Phi}(A)=c.

Proof.

Let t(0,1)t\in(0,1), and let α𝕋\alpha\in\mathbb{T} be irrational. For a set A𝕋A\subseteq\mathbb{T}, let RAR_{A} denote {n:nαA}\{n\in\mathbb{Z}\colon n\alpha\in A\}. For n0n\geq 0, let rn=t/((2n+1)2n+1)r_{n}=t/((2n+1)\cdot 2^{n+1}), let

Bn=k=nn(rn+kα,rn+kα),B_{n}=\bigcup_{k=-n}^{n}(-r_{n}+k\alpha,r_{n}+k\alpha),

and let Cn=k=0nBkC_{n}=\bigcup_{k=0}^{n}B_{k}. Since each of BnB_{n} and CnC_{n} have boundaries with measure zero, and since (nα)(n\alpha) is well-distributed in 𝕋\mathbb{T}, we have dΦ(RBn)=μ(Bn)d_{\Phi}(R_{B_{n}})=\mu(B_{n}) and dΦ(RCn)=μ(Cn)d_{\Phi}(R_{C_{n}})=\mu(C_{n}) for all nn. Since μ(Bn)2t\sum\mu(B_{n})\leq 2t, by Lemma 36, there are cofinite subsets RBnRBnR^{\prime}_{B_{n}}\subseteq R_{B_{n}} such that

dΦ(n=0RBn)=limnμ(Cn)=μ(n=0Bn).d_{\Phi}{\left(\bigcup_{n=0}^{\infty}R^{\prime}_{B_{n}}\right)}=\lim_{n\to\infty}\mu(C_{n})=\mu{\left(\bigcup_{n=0}^{\infty}B_{n}\right)}.

Let fn(t)=μ(Cn)f_{n}(t)=\mu(C_{n}), and let f(t)=limnfn(t)f(t)=\lim_{n\to\infty}f_{n}(t). For all tt,

|f(t)fn(t)|k=n+1μ(Bk)12n,|f(t)-f_{n}(t)|\leq\sum_{k=n+1}^{\infty}\mu(B_{k})\leq\frac{1}{2^{n}},

so the convergence is uniform. Since the boundary of each interval has measure zero, the functions fnf_{n} are continuous, so ff is continuous. Moreover, limt1f(t)=1\lim_{t\to 1}f(t)=1 (since limt1μ(B0)=1\lim_{t\to 1}\mu(B_{0})=1) and limt0f(t)=0\lim_{t\to 0}f(t)=0, so we can choose tt so that dΦ(RBn)=1cd_{\Phi}\left(\bigcup R_{B_{n}}^{\prime}\right)=1-c. Let A=RBnA=\mathbb{Z}\setminus\bigcup R^{\prime}_{B_{n}}. Then, dΦ(A)=cd_{\Phi}(A)=c. It remains to show that AA is AR. Let kk\in\mathbb{Z}. Then

(Ak){n:(rk2+nα,rk2+nα)(rk2,rk2)}(A-k)\cap{\left\{n\in\mathbb{Z}\colon{\left(\frac{-r_{k}}{2}+n\alpha,\frac{r_{k}}{2}+n\alpha\right)}\cap{\left(\frac{-r_{k}}{2},\frac{r_{k}}{2}\right)}\neq\varnothing\right\}}

is finite, so AkA-k cannot be a set of recurrence. ∎

In fact, a version of Theorem 37 holds true for all countably infinite abelian groups.

Theorem 38.

Let GG be a (discrete) countably infinite abelian group. Let c(0,1)c\in(0,1), and let Φ\Phi be a Følner sequence in GG. Then, there is a Straus set AGA\subseteq G such that dΦ(A)=cd_{\Phi}(A)=c.

Proof.

We will construct a compact abelian group XX and a homomorphism φ:GX\varphi\colon G\to X with dense image. Depending on GG, this construction can occur three different ways.

First, assume that GG has an element of infinite order, i.e., GG is not a torsion group. We define a homomorphism from GG to X=𝕋X=\mathbb{T} as follows. By [20, Theorems 23.1 and 24.1], GG can be embedded in TT\oplus\bigoplus_{\mathbb{N}}\mathbb{Q}, where TT is a torsion group. Let ι\iota be the embedding from GG into this direct sum. Let gg be an element of GG with infinite order. Then, there is a projection ρ\rho from TT\oplus\bigoplus_{\mathbb{N}}\mathbb{Q} onto \mathbb{Q} such that ρ(ι(g))\rho(\iota(g)) is nonzero. Choose an irrational α𝕋\alpha\in\mathbb{T}. Then the map hρ(ι(h))αh\mapsto\rho(\iota(h))\alpha is a homomorphism from GG to 𝕋\mathbb{T} with dense image, so in this case, we will let X=𝕋X=\mathbb{T} and let φ:G𝕋\varphi\colon G\to\mathbb{T} be such that φ(h)=ρ(ι(h))α\varphi(h)=\rho(\iota(h))\alpha.

Next, assume that GG is a torsion group. For pp a prime and nn\in\mathbb{N}, let Hp,nH_{p,n} be the Prüfer pp-group [p]\mathbb{Z}[p^{\infty}]. (By definition, for n1,n2n_{1},n_{2}\in\mathbb{N}, Hp,n1Hp,n2H_{p,n_{1}}\cong H_{p,n_{2}}.) By [20, Theorems 23.1 and 24.1], GG can be embedded into p primenHp,n\bigoplus_{p\text{ prime}}\bigoplus_{n\in\mathbb{N}}H_{p,n}. Let ii be the corresponding inclusion map, and let πp,n\pi_{p,n} be the projection from the direct sum onto Hp,nH_{p,n}. For convenience, we abbreviate ρp,n=πp,ni\rho_{p,n}=\pi_{p,n}\circ i. There are two cases.

First, assume there is some pair (p,n)(p,n) such that |ρp,n(G)||\rho_{p,n}(G)| is infinite. Note that Hp,nH_{p,n} is isomorphic to the subgroup {a/pk:a,k}\{a/p^{k}\colon a\in\mathbb{Z},k\in\mathbb{N}\} of 𝕋\mathbb{T}; call this isomorphism ψ\psi. Let φ=ψρp,n\varphi=\psi\circ\rho_{p,n}. Then φ:G𝕋\varphi\colon G\to\mathbb{T} is a homomorphism with dense image, so we set X=𝕋X=\mathbb{T}.

On the other hand, assume that for all pairs (p,n)(p,n), |ρp,n(G)||\rho_{p,n}(G)| is finite. Then GG is embedded in pnρp,n(G)\bigoplus_{p}\bigoplus_{n}\rho_{p,n}(G). Since GG is a subgroup of a direct sum of finite cyclic groups, by [20, Theorem 18.1], there is an infinite sequence (nj)j(n_{j})_{j\in\mathbb{N}} of integers greater than 11 such that Gj/njG\cong\bigoplus_{j}\mathbb{Z}/{n_{j}}\mathbb{Z}. We will assume without loss of generality that nj+1njn_{j+1}\geq n_{j} for all jj\in\mathbb{N}. Then there is a dense embedding φ\varphi from Gj/njG\cong\bigoplus_{j}\mathbb{Z}/{n_{j}}\mathbb{Z} to the group X=j/njX=\prod_{j}\mathbb{Z}/{n_{j}}\mathbb{Z}. For each njn_{j}, define a metric δj\delta_{j} on /nj\mathbb{Z}/n_{j}\mathbb{Z} by δj([x],[y])=mink|xy+knj|\delta_{j}([x],[y])=\min_{k\in\mathbb{Z}}|x-y+kn_{j}|, where x,yx,y\in\mathbb{Z} and [x][x] and [y][y] are the equivalence classes of xx and yy in /nj\mathbb{Z}/n_{j}\mathbb{Z}. Observe that δj(x,y){0,1,,nj/2}\delta_{j}(x,y)\in\{0,1,\dots,\lfloor n_{j}/2\rfloor\} for all x,y/njx,y\in\mathbb{Z}/n_{j}\mathbb{Z}. We can give XX the product metric δ(𝐱,𝐲)=jnj2jδj(xj,yj)\delta(\mathbf{x},\mathbf{y})=\sum_{j\in\mathbb{N}}n_{j}^{-2j}\delta_{j}(x_{j},y_{j}). The Haar measure μ\mu on XX is the product measure of the normalized counting measures on each /nj\mathbb{Z}/{n_{j}}\mathbb{Z}. We will show that for any a[0,1]a\in[0,1], the set {𝐱X:δ(𝐱,0)=a}\{\mathbf{x}\in X\colon\delta(\mathbf{x},0)=a\} has measure zero. Since δ\delta is invariant under the group operation of XX, this will imply {𝐱X:δ(𝐱,𝐲)=a}\{\mathbf{x}\in X\colon\delta(\mathbf{x},\mathbf{y})=a\} has measure zero for all 𝐲X\mathbf{y}\in X.

Let a[0,1]a\in[0,1]; we first claim that there is at most one sequence 𝐬X\mathbf{s}\in X with sj{0,1,,nj/2}s_{j}\in\{0,1,\dots,\lfloor n_{j}/2\rfloor\} for all jj\in\mathbb{N} and

(7) jsjnj2j=a.\sum_{j\in\mathbb{N}}\frac{s_{j}}{n_{j}^{2j}}=a.

Assume there are two distinct such sequences 𝐫,𝐬X\mathbf{r},\mathbf{s}\in X. Let kk be the smallest positive integer with skrks_{k}\neq r_{k}. Then

|jksjrjnj2j|1nk2k.\left|\sum_{j\leq k}\frac{s_{j}-r_{j}}{n_{j}^{2j}}\right|\geq\frac{1}{n_{k}^{2k}}.

However,

|j>ksjrjnj2j|j>k|sjrj|nj2jj>k1nj2j1j>k1nk2j1=1nk2k+1nk2nk21<1nk2k,\left|\sum_{j>k}\frac{s_{j}-r_{j}}{n_{j}^{2j}}\right|\leq\sum_{j>k}\frac{|s_{j}-r_{j}|}{n_{j}^{2j}}\leq\sum_{j>k}\frac{1}{n_{j}^{2j-1}}\leq\sum_{j>k}\frac{1}{n_{k}^{2j-1}}=\frac{1}{n_{k}^{2k+1}}\frac{n_{k}^{2}}{n_{k}^{2}-1}<\frac{1}{n_{k}^{2k}},

so we cannot have

jsjnj2j=a=jrjnj2j.\sum_{j\in\mathbb{N}}\frac{s_{j}}{n_{j}^{2j}}=a=\sum_{j\in\mathbb{N}}\frac{r_{j}}{n_{j}^{2j}}.

Now assume 𝐬X\mathbf{s}\in X satisfies (7). We will show

μ({𝐱X:δj(xj,0)=sj for all j})=0.\mu(\{\mathbf{x}\in X\colon\delta_{j}(x_{j},0)=s_{j}\text{ for all }j\in\mathbb{N}\})=0.

Let μj\mu_{j} be the normalized counting measure on /nj\mathbb{Z}/n_{j}\mathbb{Z}, so μj(A)=|A|/nj\mu_{j}(A)=|A|/n_{j} for any A/njA\subseteq\mathbb{Z}/n_{j}\mathbb{Z}. If nj=2n_{j}=2, then μj({x/nj:δj(x,0)=sj})=1/2<2/3\mu_{j}(\{x\in\mathbb{Z}/n_{j}\mathbb{Z}\colon\delta_{j}(x,0)=s_{j}\})=1/2<2/3, and if nj>2n_{j}>2, then μj({x/nj:δj(x,0)=sj})2/nj2/3\mu_{j}(\{x\in\mathbb{Z}/n_{j}\mathbb{Z}\colon\delta_{j}(x,0)=s_{j}\})\leq 2/n_{j}\leq 2/3. Thus for all kk\in\mathbb{N},

μ({𝐱X:δj(xj,0)=sj for all j})\displaystyle\mu(\{\mathbf{x}\in X\colon\delta_{j}(x_{j},0)=s_{j}\text{ for all }j\in\mathbb{N}\}) μ({𝐱X:δj(xj,0)=sj for all jk})\displaystyle\leq\mu(\{\mathbf{x}\in X\colon\delta_{j}(x_{j},0)=s_{j}\text{ for all }j\leq k\})
(2/3)k,\displaystyle\leq(2/3)^{k},

so

μ({𝐱X:δj(xj,0)=sj for all j})=0\mu(\{\mathbf{x}\in X\colon\delta_{j}(x_{j},0)=s_{j}\text{ for all }j\in\mathbb{N}\})=0

and thus μ({𝐱X:δ(𝐱,0)=a})=0\mu(\{\mathbf{x}\in X\colon\delta(\mathbf{x},0)=a\})=0. If there is no sequence 𝐬X\mathbf{s}\in X with sj{0,1,,nj/2}s_{j}\in\{0,1,\dots,\lfloor n_{j}/2\rfloor\} satisfying (7), we also have μ({𝐱X:δ(𝐱,0)=a})=0\mu(\{\mathbf{x}\in X\colon\delta(\mathbf{x},0)=a\})=0.

Here, the casework ends, and for GG, the following is true. We have a homomorphism with dense image φ:GX\varphi\colon G\to X, where XX is a compact abelian group. Moreover, XX is endowed with an invariant metric δ\delta such that for xXx\in X and a[0,1]a\in[0,1], we have μ({yX:δ(x,y)=a})=0\mu(\{y\in X\colon\delta(x,y)=a\})=0, where μ\mu is the normalized Haar measure on XX. Additionally, δ(x,y)1\delta(x,y)\leq 1 for all x,yXx,y\in X.

We define a natural GG-action on XX by isometries via gx=x+φ(g)gx=x+\varphi(g). Since im(φ)\operatorname{im}(\varphi) is dense, the action is uniquely ergodic, with the normalized Haar measure μ\mu as the unique invariant measure. Thus, by Theorem 8, for any f:Xf\colon X\to\mathbb{R} for which the set of discontinuities has measure zero and xXx\in X,

(8) limn1|Φn|gΦnf(gx)=Xf𝑑μ.\lim_{n\to\infty}\frac{1}{|\Phi_{n}|}\sum_{g\in\Phi_{n}}f(gx)=\int_{X}f\,d\mu.

Suppose YXY\subseteq X is a Borel set with μ(Y)=0\mu(\partial Y)=0, and write RY={gG:φ(g)Y}R_{Y}=\{g\in G\colon\varphi(g)\in Y\}. Taking x=0x=0 and f=𝟏Yf=\mathbf{1}_{Y}, equation (8) becomes

dΦ(RY)=limn1|Φn|gΦn𝟏Y(φ(g))=X𝟏Y𝑑μ=μ(Y),d_{\Phi}(R_{Y})=\lim_{n\to\infty}\frac{1}{|\Phi_{n}|}\sum_{g\in\Phi_{n}}\mathbf{1}_{Y}(\varphi(g))=\int_{X}\mathbf{1}_{Y}\,d\mu=\mu(Y),

since the set of discontinuities of 𝟏Y\mathbf{1}_{Y} is exactly Y\partial Y.

The remainder of the argument proceeds along the same lines as in Theorem 37. Enumerate G={g1,g2,}G=\{g_{1},g_{2},\dots\}, let t(0,1)t\in(0,1), and for n1n\geq 1, let rn=t/(n2n1)r_{n}=t/(n\cdot 2^{n-1}). Let

Bn=k=1n({xX:δ(0,x)<rn}+φ(gk)),B_{n}=\bigcup^{n}_{k=1}(\{x\in X\colon\delta(0,x)<r_{n}\}+\varphi(g_{k})),

and let Cn=k=1nBkC_{n}=\bigcup^{n}_{k=1}B_{k}. Using Lemma 36, for each nn, find a cofinite RBnRBnR_{B_{n}}^{\prime}\subset R_{B_{n}} such that

dΦ(n=1RBn)=limnμ(Cn)=μ(n=1Bn).d_{\Phi}{\left(\bigcup^{\infty}_{n=1}R_{B_{n}}^{\prime}\right)}=\lim_{n\to\infty}\mu(C_{n})=\mu{\left(\bigcup^{\infty}_{n=1}B_{n}\right)}.

Let fn(t)=μ(Cn)f_{n}(t)=\mu(C_{n}). Note that since the construction of the sets CnC_{n} depends continuously on tt, the functions fnf_{n} are continuous. Let f=limnfnf=\lim_{n\to\infty}f_{n}. The convergence of (fn)(f_{n}) to ff is uniform, so ff is continuous. Note that limt1f(t)=1\lim_{t\to 1}f(t)=1 (since diam(X)1\text{diam}\,(X)\leq 1) and limt0f(t)=0\lim_{t\to 0}f(t)=0, so we can choose tt so that dΦ(RBn)=1cd_{\Phi}(\bigcup R^{\prime}_{B_{n}})=1-c. Then, if A=GRBnA=G\setminus\bigcup R^{\prime}_{B_{n}}, we have dΦ(A)=cd_{\Phi}(A)=c. Let nn\in\mathbb{N}. Then (Ag){hG:δ(φ(h),0)<rk}(A-g)\cap\{h\in G\colon\delta(\varphi(h),0)<r_{k}\} is finite, so AgkA-g_{k} cannot be a set of recurrence. ∎

Theorem 38 can be extended to countably infinite commutative cancellative semigroups, allowing us to conclude that discordant subsets with any density in (0,1)(0,1) exist in, e.g., (,+)(\mathbb{N},+) and (,×)(\mathbb{N},\times) as well.

Corollary 39.

Let GG be a countably infinite commutative cancellative semigroup, let Φ\Phi be a Følner sequence in GG, and let c(0,1)c\in(0,1). Then there is a discordant set AGA\subseteq G with dΦ(A)=cd_{\Phi}(A)=c.

Proof.

We can embed GG in a countably infinite abelian group G~\widetilde{G} by a process similar to that of embedding an integral domain in a field (see [30, Exercise 1.1.8] for more details about this construction). The elements of G~\widetilde{G} can be thought formally as “fractions” of elements of GG, so for any gG~g\in\widetilde{G}, there is an hGh\in G such that ghGgh\in G. We will show that Φ\Phi is a Følner sequence in G~\widetilde{G} (see [6, Theorem 2.12]). Let gG~g\in\widetilde{G}, and let hGh\in G be such that hgGhg\in G. For each nn\in\mathbb{N},

|ΦngΦn||Φn||ΦnhgΦn||Φn|+|gΦnhgΦn||Φn|=|ΦnhgΦn||Φn|+|ΦnhΦn||Φn|,\frac{|\Phi_{n}\bigtriangleup g\Phi_{n}|}{|\Phi_{n}|}\leq\frac{|\Phi_{n}\bigtriangleup hg\Phi_{n}|}{|\Phi_{n}|}+\frac{|g\Phi_{n}\bigtriangleup hg\Phi_{n}|}{|\Phi_{n}|}=\frac{|\Phi_{n}\bigtriangleup hg\Phi_{n}|}{|\Phi_{n}|}+\frac{|\Phi_{n}\bigtriangleup h\Phi_{n}|}{|\Phi_{n}|},

so

limn|ΦngΦn||Φn|=0,\lim_{n\to\infty}\frac{|\Phi_{n}\bigtriangleup g\Phi_{n}|}{|\Phi_{n}|}=0,

and thus Φ\Phi is a Følner sequence in G~\widetilde{G}. By Theorem 38, there is a discordant set A~G~\widetilde{A}\subseteq\widetilde{G} with dΦ(A~)=cd_{\Phi}(\widetilde{A})=c. Set A=A~GA=\widetilde{A}\cap G. We claim that AA is not piecewise syndetic in GG. Suppose, toward the contrapositive, that there is a finite HGH\subseteq G such that H1AH^{-1}A is thick. Let FG~F\subseteq\widetilde{G} be finite and select gGg\in G such that FgGFg\subseteq G. Then there is an hGh\in G such that FghH1AFgh\subseteq H^{-1}A. But then H1AH1A~H^{-1}A\subseteq H^{-1}\widetilde{A} is piecewise syndetic in G~\widetilde{G}, which establishes the claim. Thus AA is a non-piecewise syndetic subset of GG with dΦ(A)=cd_{\Phi}(A)=c. ∎

7. Large subsets of {0,1}G\{0,1\}^{G}

Let GG be a countably infinite semigroup. In this section we will identify a subset AGA\subseteq G with its indicator function 𝟏A{0,1}G\mathbf{1}_{A}\in\{0,1\}^{G}. We will say that 𝟏A\mathbf{1}_{A} is thick, syndetic, etc. whenever AA possesses these properties. Thus, families of subsets of GG are identified with subsets of {0,1}G\{0,1\}^{G}. To study the topological properties of subsets of {0,1}G\{0,1\}^{G}, we give {0,1}G\{0,1\}^{G} the product topology, making it a Cantor space. In this space, a base for the topology is given by the cylinder sets, which have the form

V(L1,L2)={α{0,1}G:α(L1)={1} and α(L2)={0}}V(L_{1},L_{2})=\{\alpha\in\{0,1\}^{G}\colon\text{$\alpha(L_{1})=\{1\}$ and $\alpha(L_{2})=\{0\}$}\}

for finite, disjoint L1,L2GL_{1},L_{2}\subseteq G. Let us write

={(L1,L2)𝒫(G)×𝒫(G):L1 and L2 are finite and disjoint}.\mathcal{L}=\{(L_{1},L_{2})\in\mathcal{P}(G)\times\mathcal{P}(G)\colon\text{$L_{1}$ and $L_{2}$ are finite and disjoint}\}.

7.1. Background and motivation

Recall that if XX is a topological space homeomorphic to the Cantor set, it is a Baire space, meaning that it cannot be written as a countable union of nowhere dense sets. A subset of XX which is a countable union of nowhere dense sets is called meager, and a subset of XX whose complement is meager is comeager. The fact that XX is a Baire space means that no subset can be both meager and comeager, so one can think of the elements of a comeager set as being “topologically generic”. We wish to stress in this section the analogy between topology and measure theory in which “meager” corresponds to “measure 0” and “comeager” corresponds to “measure 1”.

We will describe comeager subsets of {0,1}G\{0,1\}^{G}, whose elements can thus be thought of as “topologically generic” subsets of GG. Let us first motivate our results by examining them in a familiar setting. Consider the case G=G=\mathbb{N}, where {0,1}\{0,1\}^{\mathbb{N}} is the space of infinite binary sequences. An infinite binary sequence is called disjunctive if every finite binary word occurs within it. (Formally, α{0,1}\alpha\in\{0,1\}^{\mathbb{N}} is disjunctive if, for each kk\in\mathbb{N} and ω{0,1}k\omega\in\{0,1\}^{k}, there exists nn\in\mathbb{N} such that α(n+i1)=ω(i)\alpha(n+i-1)=\omega(i) for each 1ik1\leq i\leq k.) The subset of {0,1}\{0,1\}^{\mathbb{N}} consisting of disjunctive sequences is large both topologically and measure-theoretically: it is comeager and of measure 1.

However, the topological and measure-theoretical worlds diverge once we consider the limiting frequencies with which finite subwords occur. Given a sequence α{0,1}\alpha\in\{0,1\}^{\mathbb{N}} and a finite word ω{0,1}k\omega\in\{0,1\}^{k}, one says that ω\omega occurs in α\alpha with a limiting frequency

(9) limn|{1ink+1:α(i+j1)=ω(j) for each 1jk}|n.\lim_{n\to\infty}\frac{|\{1\leq i\leq n-k+1\colon\text{$\alpha(i+j-1)=\omega(j)$ for each $1\leq j\leq k$}\}|}{n}.

It is a classical result of Émile Borel [14] that the set of normal sequences, those for which every word of length kk occurs as a subword with limiting frequency 2k2^{-k}, has measure 1. Moreover, as we will see, it is a meager set.

On the other hand, one can consider sequences for which limits analogous to (9) (but modified slightly to avoid counting overlapping sequences) fail to exist as badly as possible. For these sequences, the lim sup\limsup of the frequency of any finite word is 1 and the corresponding lim inf\liminf is 0. We will call such sequences extremely non-averageable (or ENA). It turns out that the set of ENA sequences, in contrast to the set of normal sequences, is comeager [16, Theorem 1].

We will make the following generalizations of the facts we have just stated: assuming only cancellativity, we prove an analogue in a countably infinite semigroup GG to the statement that the set of disjunctive sequences is comeager (Theorem 40); assuming also amenability, we prove an analogue to the statement that the set of ENA sequences is comeager (Theorem 42). A generalization of measure-theoretical normality to countably infinite amenable semigroups and the corresponding results can be found in [6].

Since the set of normal sequences and the set of ENA sequences are disjoint, we see also that the set normal sequences is meager, and that the set of ENA sequences has measure 0 (see [6, Proposition 4.7] regarding the former statement in arbitrary amenable cancellative semigroups). This allows us to give another motivation for the results of this section. One reason we gave for studying discordant sets is that they illustrate how topological and analytic notions of largeness for subsets of GG diverge; in this section, we construct subsets of {0,1}G\{0,1\}^{G} on which basic topological and analytic notions of largeness disagree.

Finally, we remark that, although we prove the results of this section in the space {0,1}G\{0,1\}^{G}, their generalizations to {0,,n1}G\{0,\dots,n-1\}^{G} (which can be thought of as the space of nn-part partitions, or nn-colorings, of GG) are apparent and can be achieved with the same arguments.

7.2. Disjunctive elements of {0,1}G\{0,1\}^{G}

In this subsection we will assume that GG is cancellative. Call α{0,1}G\alpha\in\{0,1\}^{G} disjunctive if for any (L1,L2)(L_{1},L_{2})\in\mathcal{L} there is some gGg\in G for which αV(L1g,L2g)\alpha\in V(L_{1}g,L_{2}g).101010In the notation of Section 8, we can equivalently say that α\alpha is disjunctive if and only if, for every (L1,L2)(L_{1},L_{2})\in\mathcal{L}, there exists gGg\in G for which gαV(L1,L2)g\alpha\in V(L_{1},L_{2}). This definition has the benefit of making sense for non-cancellative semigroups. Cancellativity here guarantees that L1gL2g=L_{1}g\cap L_{2}g=\varnothing for all gGg\in G. Note that, for G=G=\mathbb{N}, this definition specializes to the definition of disjunctive sequences we gave above.

Theorem 40.

The set of disjunctive elements of {0,1}G\{0,1\}^{G} is comeager.

Proof.

For any (L1,L2)(L_{1},L_{2})\in\mathcal{L}, let AL1,L2=gGV(L1g,L2g)A_{L_{1},L_{2}}=\bigcup_{g\in G}V(L_{1}g,L_{2}g), so that the set of disjunctive elements of {0,1}G\{0,1\}^{G} is

A=(L1,L2)AL1,L2.A=\bigcap_{(L_{1},L_{2})\in\mathcal{L}}A_{L_{1},L_{2}}.

Now let (L1,L2),(M1,M2)(L_{1},L_{2}),(M_{1},M_{2})\in\mathcal{L}. Since GG is cancellative and infinite, we can find some gGg\in G for which (L1gL2g)(M1M2)=(L_{1}g\cup L_{2}g)\cap(M_{1}\cup M_{2})=\varnothing. Then we have 𝟏L1gM1AL1,L2V(M1,M2),\mathbf{1}_{L_{1}g\cup M_{1}}\in A_{L_{1},L_{2}}\cap V(M_{1},M_{2}), so that AL1,L2A_{L_{1},L_{2}} is dense, since it has nontrivial intersection with each element of the open base for {0,1}G\{0,1\}^{G}. Since AL1,L2A_{L_{1},L_{2}} is a union of open sets, we have written AA as a countable intersection of open dense sets, which proves that AA is comeager. ∎

We deduce as a relevant consequence of this theorem that the set of non-piecewise syndetic elements of {0,1}G\{0,1\}^{G} is meager, since any disjunctive element of {0,1}G\{0,1\}^{G} is thick. If GG is non-cancellative, the set of piecewise syndetic elements of {0,1}G\{0,1\}^{G} may fail to be comeager. For example, if there exists zGz\in G such that for all gGg\in G, one has gz=zgz=z, then α{0,1}G\alpha\in\{0,1\}^{G} is piecewise syndetic if and only if α(z)=1\alpha(z)=1, so the family of piecewise syndetic sets cannot be dense.

7.3. Extremely non-averageable elements of {0,1}G\{0,1\}^{G}

In this subsection we will assume that GG is both cancellative and amenable. Fix a Følner sequence Φ\Phi in GG. We will begin by defining Φ\Phi-normality of elements of {0,1}G\{0,1\}^{G}, as defined in [6], for comparison with the definition we will give for ENA. We will also utilize the notion of Φ\Phi-normality in Theorem 43.

Given α{0,1}G\alpha\in\{0,1\}^{G}, one says that α\alpha is Φ\Phi-normal if, for each finite KGK\subseteq G and ω{0,1}K\omega\in\{0,1\}^{K},

limn|{gG:KgΦn and α(hg)=ω(h) for each hK}||Φn|=12|K|.\lim_{n\to\infty}\frac{|\{g\in G\colon\text{$Kg\subseteq\Phi_{n}$ and $\alpha(hg)=\omega(h)$ for each $h\in K$}\}|}{|\Phi_{n}|}=\frac{1}{2^{|K|}}.

The following classical lemma regarding Følner sequences, which states that they have a property analogous to thickness, assures us that the notion of Φ\Phi-normality is meaningful.

Lemma 41.

Let Φ\Phi be a Følner sequence in a countably infinite cancellative semigroup GG. Let KGK\subseteq G be finite. Then there exists NN\in\mathbb{N} such that, for each nNn\geq N, KgnΦnKg_{n}\subseteq\Phi_{n} for some gnGg_{n}\in G.

Proof.

Find NN large enough that, for each nNn\geq N and hKh\in K,

11|K|<|ΦnhΦn||Φn|=|h(h1ΦnΦn)||Φn|=|h1ΦnΦn||Φn|.1-\frac{1}{|K|}<\frac{|\Phi_{n}\cap h\Phi_{n}|}{|\Phi_{n}|}=\frac{|h(h^{-1}\Phi_{n}\cap\Phi_{n})|}{|\Phi_{n}|}=\frac{|h^{-1}\Phi_{n}\cap\Phi_{n}|}{|\Phi_{n}|}.

Then hKh1Φn\bigcap_{h\in K}h^{-1}\Phi_{n}\neq\varnothing, so let gng_{n} be an element of this set. Then KgnΦnKg_{n}\subseteq\Phi_{n}. ∎

Now we define extreme non-averageability in {0,1}G\{0,1\}^{G}. The following definitions are admittedly unwieldy because we wish to measure the frequency of occurrences without overlap of our finite sequence. This is because we are interested in the maximal frequency with which the finite sequence can occur, and when allowing for overlaps, this frequency is difficult to predict. We would be pleased if a simpler equivalent definition of ENA in an arbitrary countably infinite amenable cancellative semigroup could be found.

Fix a finite KGK\subseteq G. Given a finite set XGX\subseteq G, define

𝒴K,X={YX:KYXKyKz= for all y,zY, and |Y| is maximal}.\mathcal{Y}_{K,X}=\{Y\subseteq X\colon\text{$KY\subseteq X$, $Ky\cap Kz=\varnothing$ for all $y,z\in Y$, and $|Y|$ is maximal}\}.

Now let ω{0,1}K\omega\in\{0,1\}^{K}, and α{0,1}G\alpha\in\{0,1\}^{G}, and define

d¯Φ(K,ω,α)=lim supnmaxY𝒴K,Φn|{yY:α(hy)=ω(h) for all hK}||Y|.\overline{d}_{\Phi}(K,\omega,\alpha)=\limsup_{n\to\infty}\max_{Y\in\mathcal{Y}_{K,\Phi_{n}}}\frac{|\{y\in Y\colon\text{$\alpha(hy)=\omega(h)$ for all $h\in K$}\}|}{|Y|}.

We observe that d¯Φ(K,ω,α)\overline{d}_{\Phi}(K,\omega,\alpha) is well-defined since 𝒴K,Φn={}\mathcal{Y}_{K,\Phi_{n}}=\{\varnothing\} for only finitely many nn by Lemma 41. Define d¯Φ(K,ω,α)\underline{d}_{\Phi}(K,\omega,\alpha) to be the corresponding lim inf\liminf. Hence, d¯Φ(K,ω,α)\overline{d}_{\Phi}(K,\omega,\alpha) and d¯Φ(K,ω,α)\underline{d}_{\Phi}(K,\omega,\alpha) give the (upper and lower) limiting frequencies, measured along Φ\Phi, with which the finite binary word ω\omega occurs without overlaps in α\alpha. For convenience, we write

𝒦={(K,ω):KG is finite and ω{0,1}K}.\mathcal{K}=\{(K,\omega)\colon\text{$K\subseteq G$ is finite and $\omega\in\{0,1\}^{K}$}\}.

We say that α\alpha is Φ\Phi-ENA if d¯Φ(K,ω,α)=1\overline{d}_{\Phi}(K,\omega,\alpha)=1 and d¯Φ(K,ω,α)=0\underline{d}_{\Phi}(K,\omega,\alpha)=0 for each (K,ω)𝒦(K,\omega)\in\mathcal{K}.

With these definitions in hand, we proceed to the main result of this subsection.

Theorem 42.

The set of Φ\Phi-ENA elements of {0,1}G\{0,1\}^{G} is comeager.

Proof.

For each (K,ω)𝒦(K,\omega)\in\mathcal{K}, let

AK,ω={α{0,1}G:d¯Φ(K,ω,α)=1 and d¯Φ(K,ω,α)=0},A_{K,\omega}=\{\alpha\in\{0,1\}^{G}\colon\text{$\overline{d}_{\Phi}(K,\omega,\alpha)=1$ and $\underline{d}_{\Phi}(K,\omega,\alpha)=0$}\},

so that the set of Φ\Phi-ENA elements of {0,1}G\{0,1\}^{G} is (K,ω)𝒦AK,ω\bigcap_{(K,\omega)\in\mathcal{K}}A_{K,\omega}. Fix (K,ω)𝒦(K,\omega)\in\mathcal{K}; it suffices to show that AK,ωA_{K,\omega} is comeager.

Find a subsequence Φ=(Φnk)k\Phi^{\prime}=(\Phi_{n_{k}})_{k\in\mathbb{N}} of Φ\Phi which satisfies

limk|Φn1|++|Φnk1||Φnk|=0.\lim_{k\to\infty}\frac{|\Phi_{n_{1}}|+\cdots+|\Phi_{n_{k-1}}|}{|\Phi_{n_{k}}|}=0.

Define Ψ1=Φ1\Psi_{1}=\Phi_{1} and, for kk\in\mathbb{N}, Ψk+1=Φnk+1m=1kΦnm\Psi_{k+1}=\Phi_{n_{k+1}}\setminus\bigcup_{m=1}^{k}\Phi_{n_{m}}; let Ψ\Psi denote the Følner sequence (Ψn)n(\Psi_{n})_{n\in\mathbb{N}}. Fix gKg\in K. Now for each nn\in\mathbb{N} choose Yn𝒴K,ΨnY_{n}\in\mathcal{Y}_{K,\Psi_{n}} and define

Bn+\displaystyle B_{n}^{+} ={β{0,1}G:for all yYn and hKβ(hy)=ω(h)},\displaystyle=\{\beta\in\{0,1\}^{G}\colon\text{for all $y\in Y_{n}$ and $h\in K$, $\beta(hy)=\omega(h)$}\},
Bn\displaystyle B_{n}^{-} ={β{0,1}G:for all yΨn and hKβ(hy)ω(g)}.\displaystyle=\{\beta\in\{0,1\}^{G}\colon\text{for all $y\in\Psi_{n}$ and $h\in K$, $\beta(hy)\neq\omega(g)$}\}.

The definition of 𝒴K,Ψn\mathcal{Y}_{K,\Psi_{n}} ensures that the sets Bn±B_{n}^{\pm} are nonempty for all large enough nn, and they are clearly open since they are cylinder sets. In particular, Bn+=V(ω1({1})Yn,ω1({0})Yn)B_{n}^{+}=V(\omega^{-1}(\{1\})Y_{n},\omega^{-1}(\{0\})Y_{n}), and either Bn=V(,KΨn)B_{n}^{-}=V(\varnothing,K\Psi_{n}) or Bn=V(KΨn,)B_{n}^{-}=V(K\Psi_{n},\varnothing), depending on whether ω(g)\omega(g) is 11 or 0. The set BnB_{n}^{-} is chosen this way to ensure that for all Yn𝒴K,ΨnY_{n}\in\mathcal{Y}_{K,\Psi_{n}}, there is no yYny\in Y_{n} such that β(hy)=ω(h)\beta(hy)=\omega(h) for all hKh\in K. In particular, β(gy)ω(g)\beta(gy)\neq\omega(g) for all yy. Finally, define Cn±=m=nBm±C_{n}^{\pm}=\bigcup_{m=n}^{\infty}B_{m}^{\pm} for all nn\in\mathbb{N}.

Let (L1,L2)(L_{1},L_{2})\in\mathcal{L}. Since the sets Ψn\Psi_{n} are disjoint and L1L_{1} and L2L_{2} are finite, cancellativity gives that for all large enough nn\in\mathbb{N}, Bn±V(L1,L2)B_{n}^{\pm}\cap V(L_{1},L_{2})\neq\varnothing and hence Cn±V(L1,L2)C_{n}^{\pm}\cap V(L_{1},L_{2})\neq\varnothing. It follows that Cn±C_{n}^{\pm} is open and dense. Define C=(Cn+Cn)C=\bigcap(C_{n}^{+}\cap C_{n}^{-}), so that CC is a comeager subset of {0,1}G\{0,1\}^{G}. Moreover, if γC\gamma\in C, then

(10) d¯Ψ(K,ω,γ)=1andd¯Ψ(K,ω,γ)=0\overline{d}_{\Psi}(K,\omega,\gamma)=1\qquad\text{and}\qquad\underline{d}_{\Psi}(K,\omega,\gamma)=0

by construction. Since limk|Φnk|/|Ψk|=1\lim_{k\to\infty}|\Phi_{n_{k}}|/|\Psi_{k}|=1, (10) must hold with respect to Φ\Phi^{\prime} in addition to Ψ\Psi, and thus also with respect to Φ\Phi. Hence CC is a comeager subset of AK,ωA_{K,\omega}, as we had hoped. ∎

It is important to note that the property of Φ\Phi-ENA, as well as Φ\Phi-normality, is dependent on Φ\Phi. The following theorem shows that this is true in a strong sense.

Theorem 43.

Let α{0,1}G\alpha\in\{0,1\}^{G} be disjunctive. Then there exist Følner sequences Ψ1\Psi^{1} and Ψ2\Psi^{2} in GG such that α\alpha is simultaneously Ψ1\Psi^{1}-normal and Ψ2\Psi^{2}-ENA.

Proof.

Let Φ\Phi be a Følner sequence of GG, and suppose β1,β2{0,1}G\beta_{1},\beta_{2}\in\{0,1\}^{G} are Φ\Phi-normal and Φ\Phi-ENA, respectively. For each nn\in\mathbb{N} and i{1,2}i\in\{1,2\}, define Xni=Φnβi1({1})X_{n}^{i}=\Phi_{n}\cap\beta_{i}^{-1}(\{1\}) and Yni=Φnβi1({0})Y_{n}^{i}=\Phi_{n}\cap\beta_{i}^{-1}(\{0\}) and find gniGg_{n}^{i}\in G such that αV(Xnigni,Ynigni)\alpha\in V(X_{n}^{i}g_{n}^{i},Y_{n}^{i}g_{n}^{i}). Then define the Følner sequences Ψi=(Ψni)n\Psi^{i}=(\Psi_{n}^{i})_{n\in\mathbb{N}} by Ψni=Φngni\Psi_{n}^{i}=\Phi_{n}g_{n}^{i}. These Følner sequences have the desired properties. ∎

8. Topological dynamics and piecewise syndeticity

Let GG denote any countably infinite (not necessarily cancellative) semigroup. In this section we explore piecewise syndeticity of subsets of GG in terms of dynamical properties of elements of {0,1}G\{0,1\}^{G} and their orbit closures. Some of the results presented below are well-known for subsets of \mathbb{N}. Using these results we revisit and examine in greater detail the comparison between van der Waerden’s theorem and Szemerédi’s theorem made in the introduction.

We adopt the notation of the previous section. For each gGg\in G, let ρg:GG\rho_{g}\colon G\to G be the map hhgh\mapsto hg. We will consider the dynamical system ({0,1}G,G)(\{0,1\}^{G},G), where GG acts on {0,1}G\{0,1\}^{G} by gα=αρgg\alpha=\alpha\circ\rho_{g}. In the familiar settings G=G=\mathbb{N} and G=G=\mathbb{Z}, this is just the left-shift action on infinite one- or two-sided sequences. In light of the natural correspondence between {0,1}G\{0,1\}^{G} and 𝒫(G)\mathcal{P}(G), we may view this dynamical system instead as (𝒫(G),G)(\mathcal{P}(G),G). In this case, gg acts on 𝒫(G)\mathcal{P}(G) by AAg1A\mapsto Ag^{-1}. This action is continuous, since for any (L1,L2)(L_{1},L_{2})\in\mathcal{L}, we have g1V(L1,L2)=V(L1g,L2g)g^{-1}V(L_{1},L_{2})=V(L_{1}g,L_{2}g) (for this identity to hold in the non-cancellative case, we adopt the convention that V(A,B)=V(A,B)=\varnothing if ABA\cap B\neq\varnothing, in view of the fact that g1V(L1,L2)=g^{-1}V(L_{1},L_{2})=\varnothing if and only if L1gL2gL_{1}g\cap L_{2}g\neq\varnothing) .

Given α{0,1}G\alpha\in\{0,1\}^{G}, we define the orbit closure of α\alpha to be

𝒪(α)={gα:gG}¯.\mathcal{O}(\alpha)=\overline{\{g\alpha\colon g\in G\}}.

Now fix an increasing sequence (Fn)(F_{n}) of finite subsets of GG such that Fn=G\bigcup F_{n}=G. Let us note that for any α{0,1}G\alpha\in\{0,1\}^{G}, we have β𝒪(α)\beta\in\mathcal{O}(\alpha) if and only if, for all nn\in\mathbb{N}, there exists gGg\in G such that β(x)=(gα)(x)\beta(x)=(g\alpha)(x) for each xFnx\in F_{n}. This simple but crucial property of orbit closures in the system ({0,1}G,G)(\{0,1\}^{G},G) follows immediately from the definition of the product topology on {0,1}G\{0,1\}^{G}.

We illustrate what this means in the familiar case G=G=\mathbb{N}. In this setting, β𝒪(α)\beta\in\mathcal{O}(\alpha) if and only if, for any nn\in\mathbb{N}, α\alpha can be shifted to agree with β\beta on {1,,n}\{1,\dots,n\}. Put another way, β𝒪(α)\beta\in\mathcal{O}(\alpha) means that one can find arbitrarily long initial segments of β\beta as subwords of α\alpha.

Before stating the results, we give convenient reformulations of the notions of syndeticity, piecewise syndeticity, and thickness in terms of indicator functions. An element α{0,1}G\alpha\in\{0,1\}^{G} is syndetic if and only if there exists a finite set HGH\subseteq G such that, for each xGx\in G, we have α(hx)=1\alpha(hx)=1 for some hHh\in H. Similarly, α\alpha is piecewise syndetic if and only if there exists a finite HGH\subseteq G and an infinite sequence (gi)(g_{i}) in GG such that, for each nn\in\mathbb{N} and xFnx\in F_{n}, we have α(hxgn)=1\alpha(hxg_{n})=1 for some hHh\in H. Finally, α\alpha is thick if and only if 𝟏G𝒪(α)\mathbf{1}_{G}\in\mathcal{O}(\alpha), so by Lemma 1, α\alpha is syndetic if and only if 𝟏𝒪(α)\mathbf{1}_{\varnothing}\notin\mathcal{O}(\alpha).

Lemma 44.

An element α{0,1}G\alpha\in\{0,1\}^{G} is piecewise syndetic if and only if the orbit closure 𝒪(α)\mathcal{O}(\alpha) contains a syndetic element.

Proof.

Suppose α{0,1}G\alpha\in\{0,1\}^{G} is piecewise syndetic and find a finite set HGH\subseteq G and an infinite sequence (gn)(g_{n}) in GG which witnesses α\alpha’s piecewise syndeticity, in the sense of the previous paragraph. We claim that there is a decreasing sequence (Nm)(N_{m}) of infinite subsets of \mathbb{N} with the property that α(xgn1)=α(xgn2)\alpha(xg_{n_{1}})=\alpha(xg_{n_{2}}) for each xFmx\in F_{m} and n1,n2Nmn_{1},n_{2}\in N_{m}. Indeed, for any mm, there are finitely many functions Fm{0,1}F_{m}\to\{0,1\}, so such a sequence can easily be constructed inductively by applying the pigeonhole principle. Hence we can define β{0,1}G\beta\in\{0,1\}^{G} by β(x)=α(xgn)=(gnα)(x)\beta(x)=\alpha(xg_{n})=(g_{n}\alpha)(x) for any xFmx\in F_{m} and nNmn\in N_{m}. By construction, β𝒪(α)\beta\in\mathcal{O}(\alpha). To show that β\beta is syndetic, let xGx\in G, find mm such that xFmx\in F_{m}, and find nNmn\in N_{m} with nmn\geq m. Since xFnx\in F_{n}, we can find hHh\in H such that α(hxgn)=1\alpha(hxg_{n})=1. Then β(hx)=α(hxgn)=1\beta(hx)=\alpha(hxg_{n})=1, so β\beta is syndetic.

Now suppose α{0,1}G\alpha\in\{0,1\}^{G} and β𝒪(α)\beta\in\mathcal{O}(\alpha) is syndetic. Let HGH\subseteq G be a finite set which witnesses β\beta’s syndeticity. Fix nn\in\mathbb{N} and find mm\in\mathbb{N} such that HFnFmHF_{n}\subseteq F_{m}. Find gmGg_{m}\in G such that β(y)=(gmα)(y)\beta(y)=(g_{m}\alpha)(y) for each yFmy\in F_{m}. Let xFnx\in F_{n} and find hHh\in H for which β(hx)=1\beta(hx)=1. Then hxFmhx\in F_{m}, so α(hxgm)=β(hx)=1\alpha(hxg_{m})=\beta(hx)=1. This proves that α\alpha is piecewise syndetic. ∎

Recall that subsystem of a dynamical system (X,G)(X,G) is a nonempty closed GG-invariant subset of XX. A minimal subsystem is a subsystem which is itself a minimal dynamical system. By Zorn’s lemma, any dynamical system has a minimal subsystem (see, e.g., [24, Theorem 2.22]). We will call the subsystem of ({0,1}G,G)(\{0,1\}^{G},G) consisting of the fixed point 𝟏\mathbf{1}_{\varnothing} (the zero function) the trivial subsystem.

The following theorem is a restatement of Lemma 44 in terms of the minimal subsystems of orbit closures.

Theorem 45.

An element α{0,1}G\alpha\in\{0,1\}^{G} is piecewise syndetic if and only if 𝒪(α)\mathcal{O}(\alpha) possesses a nontrivial minimal subsystem.

In particular, if α\alpha is discordant, 𝒪(α)\mathcal{O}(\alpha) does not have a nontrivial minimal subsystem. Moreover, sets of positive upper density having this property are exactly the discordant sets.

Proof.

Suppose α{0,1}G\alpha\in\{0,1\}^{G} is piecewise syndetic. By Lemma 44, there exists β𝒪(α)\beta\in\mathcal{O}(\alpha) which is syndetic. Then 𝒪(β)𝒪(α)\mathcal{O}(\beta)\subseteq\mathcal{O}(\alpha) and 𝟏𝒪(β)\mathbf{1}_{\varnothing}\notin\mathcal{O}(\beta), so 𝒪(β)\mathcal{O}(\beta) has a nontrivial minimal subsystem. It follows that 𝒪(α)\mathcal{O}(\alpha) has a minimal subsystem as well.

Conversely, suppose 𝒪(α)\mathcal{O}(\alpha) possesses a nontrivial minimal subsystem XX. Then 𝟏X\mathbf{1}_{\varnothing}\notin X, so every element of XX is syndetic. Thus, by Lemma 44, α\alpha is piecewise syndetic. ∎

The existence or nonexistence of nontrivial minimal subsystems in 𝒪(α)\mathcal{O}(\alpha) has significant combinatorial consequences, which we presently examine.

Let us specialize to the case G=G=\mathbb{N}. As stated in the introduction, there is an equivalent formulation of van der Waerden’s theorem as a topological recurrence theorem:

Theorem 46 (Topological van der Waerden’s theorem).

Let (X,T)(X,T) be a minimal111111In the sources, one will not see the condition of minimality explicitly imposed in the formulations of Theorem 46. We of course do not lose generality by making this assumption as we may always pass to a minimal subsystem. Moreover, the topological proofs of this theorem of which the authors are aware, with the notable exception of [9, Proposition L], use minimality in a crucial way. topological dynamical system where XX is a compact metric space. Then for any \ell\in\mathbb{N} and ε>0\varepsilon>0 there exists dd\in\mathbb{N} and xXx\in X such that the points x,Tdx,,T(1)dxx,T^{d}x,\dots,T^{(\ell-1)d}x are within ε\varepsilon of each other.

Proof.

This is a special case of [23, Theorem 1.4]. See also [22, Theorem 2.6] or [2, Corollary 3.9]. ∎

From here the classical van der Waerden’s theorem can be derived along the following lines. Suppose α{0,1}\alpha\in\{0,1\}^{\mathbb{N}} is piecewise syndetic. Then 𝒪(α)\mathcal{O}(\alpha) possesses a nontrivial minimal subsystem YY by Theorem 45. Since 𝟏Y\mathbf{1}_{\varnothing}\notin Y, every element of YY is syndetic. Giving {0,1}\{0,1\}^{\mathbb{N}} the product metric described in Subsection 2.5 and applying Theorem 46 inside (Y,T)(Y,T) shows that, for any \ell\in\mathbb{N}, the system YY possesses an element with a length-\ell arithmetic progression. Since Y𝒪(α)Y\subset\mathcal{O}(\alpha), for any \ell\in\mathbb{N}, there is an element of 𝒪(α)\mathcal{O}(\alpha) with a length-\ell arithmetic progression. It follows that α\alpha is AP-rich. (As we saw in the introduction, this is equivalent, via the partition regularity of piecewise syndetic sets, to the formulation of van der Waerden’s theorem in terms of partitions of \mathbb{N}.)

This discussion supports our position taken in the introduction on the distinction between van der Waerden’s and Szemerédi’s theorems. Specifically, if the orbit closure of any element of {0,1}\{0,1\}^{\mathbb{N}} having positive upper density were to possess a nontrivial minimal subsystem, then Szemerédi’s theorem would follow immediately from the topological version of van der Waerden’s theorem using the argument we gave in the preceding paragraph.

As our next combinatorial application of Theorem 45, we will prove the partition regularity of piecewise syndetic sets from a dynamical perspective (cf. Theorem 4). We first address the following question: For which α{0,1}G\alpha\in\{0,1\}^{G} is 𝒪(α)\mathcal{O}(\alpha) itself minimal? The answer is that every finite subword occurring in α\alpha must occur syndetically in α\alpha, as the following lemma states. The result is well known in the case when GG is \mathbb{Z} or \mathbb{N}; see, e.g., [22, Proposition 1.22].

Lemma 47.

Let α{0,1}G\alpha\in\{0,1\}^{G}. Then 𝒪(α)\mathcal{O}(\alpha) is minimal if and only if, for all (L1,L2)(L_{1},L_{2})\in\mathcal{L}, the set {gG:gαV(L1,L2)}\{g\in G\colon g\alpha\in V(L_{1},L_{2})\} is either empty or syndetic.

Proof.

This is a consequence of the following general phenomenon: Let (X,G)(X,G) be a topological dynamical system where XX is a metric space, and let xXx\in X. Then 𝒪(x)={gx:gG}¯\mathcal{O}(x)=\overline{\{gx\colon g\in G\}} is minimal if and only if, for each open VXV\subseteq X, RV(x)R_{V}(x) is either empty or syndetic. Clearly, if 𝒪(x)\mathcal{O}(x) is minimal, then this condition holds (see the proof of Lemma 9).

Conversely, suppose y𝒪(x)y\in\mathcal{O}(x). We will show that x𝒪(y)x\in\mathcal{O}(y), which will prove that 𝒪(x)\mathcal{O}(x) is minimal. To this end, fix nn\in\mathbb{N} and find a finite set HGH\subseteq G such that H1RB(1/(2n),x)(x)=GH^{-1}R_{B(1/(2n),x)}(x)=G. Find a sufficiently small ε>0\varepsilon>0 so that hB(ε,y)B(1/(2n),hy)hB(\varepsilon,y)\subseteq B(1/(2n),hy) for each hHh\in H. Next find gGg\in G for which gxB(ε,y)gx\in B(\varepsilon,y) and find hHh\in H satisfying hgRB(1/(2n),x)(x)hg\in R_{B(1/(2n),x)}(x). Then hgxB(1/(2n),hy)B(1/(2n),x)hgx\in B(1/(2n),hy)\cap B(1/(2n),x), i.e., d(hy,x)<1/nd(hy,x)<1/n. This proves that x𝒪(y)x\in\mathcal{O}(y), as claimed. ∎

Before continuing, we remark that, as in the previous section, the preceding can be easily generalized to statements about {0,1,,r}G\{0,1,\dots,r\}^{G}. In this setting, we say that α{0,1,,r}G\alpha\in\{0,1,\dots,r\}^{G} is (piecewise) syndetic if 𝟏α1({1,,r}){0,1}G\mathbf{1}_{\alpha^{-1}(\{1,\dots,r\})}\in\{0,1\}^{G} is (piecewise) syndetic in the usual sense. We continue to refer to {𝟏}\{\mathbf{1}_{\varnothing}\} as the trivial subsystem. We leave the proofs of the generalizations to the interested reader. Using this, we give the promised result on the partition regularity of piecewise syndetic sets. The following proof in the case G=G=\mathbb{Z} can be found in [22, Theorem 1.24].

Theorem 48.

Let AGA\subseteq G be piecewise syndetic and suppose it is partitioned as A=i=1rCiA=\bigcup_{i=1}^{r}C_{i}. Then CiC_{i} is piecewise syndetic for some ii.

Proof.

Define α{0,1,,r}G\alpha\in\{0,1,\dots,r\}^{G} by α(g)=0\alpha(g)=0 if gAg\notin A and α(g)=i\alpha(g)=i if gCig\in C_{i}. By Lemma 45, 𝒪(α)\mathcal{O}(\alpha) has a nontrivial minimal subsystem XX. Let βX\beta\in X. By Lemma 47 and the nontriviality of XX, for all iRangeβi\in\text{Range}\,\beta the set {gG:β(g)=i}\{g\in G\colon\beta(g)=i\} is syndetic. Let iRangeβi\in\text{Range}\,\beta. Define β{0,1}G\beta^{\prime}\in\{0,1\}^{G} by β(g)=1\beta^{\prime}(g)=1 if β(g)=i\beta(g)=i and β(g)=0\beta^{\prime}(g)=0 otherwise. Then β\beta^{\prime} is a syndetic element in 𝒪(𝟏Ci)\mathcal{O}(\mathbf{1}_{C_{i}}), so CiC_{i} must be piecewise syndetic. ∎

It is worth mentioning that Theorem 48 is easily proved with the help of topological algebra in βG\beta G, the Stone–Čech compactification of GG, that is, the space of ultrafilters on GG. It is well known that AGA\subseteq G is piecewise syndetic if and only if some shift g1Ag^{-1}A lies in a minimal idempotent of βG\beta G [30, Theorem 4.43]. From here partition regularity follows from the definition of an ultrafilter. This approach to Ramsey theory—learning about subsets of GG by studying βG\beta G—has proved very fruitful, leading to, for example, a slick proof of Hindman’s theorem (see [30, Corollary 5.9] or [3, Theorem 1.2]). We do not pursue this further but direct the curious reader towards the references we have just mentioned.

References

  • [1] L. Argabright and C. Wilde “Semigroups satisfying a strong Følner condition” In Proceedings of the American Mathematical Society 18.4, 1967, pp. 587–591
  • [2] V. Bergelson “Ergodic theory and diophantine problems” In Topics in Symbolic Dynamics and Applications, London Mathematical Society Lecture Note Series Cambridge University Press, 2000, pp. 167–206
  • [3] V. Bergelson “Minimal idempotents and ergodic Ramsey theory” In Topics in Dynamics and Ergodic Theory, London Mathematical Society Lecture Note Series Cambridge University Press, 2003, pp. 8–39
  • [4] V. Bergelson, M. Beiglböck, N. Hindman and D. Strauss “Multiplicative structures in additively large sets” In Journal of Combinatorial Theory, Series A 113.7, 2006, pp. 1219–1242
  • [5] V. Bergelson, J.. Christopherson, D. Robertson and P. Zorin-Kranich “Finite product sets and minimally almost periodic groups” In Journal of Functional Analysis 270.6, 2016, pp. 2126–2167
  • [6] V. Bergelson, T. Downarowicz and M. Misiurewicz “A fresh look at the notion of normality” In Annali della Scuola Normale Superiore di Pisa, Classe di Scienze, To appear
  • [7] V. Bergelson and H. Furstenberg “WM groups and Ramsey theory” In Topology and its Applications 156.16, 2009, pp. 2572–2580
  • [8] V. Bergelson, N. Hindman and R. McCutcheon “Notions of size and combinatorial properties of quotient sets in semigroups” In Topology Proceedings 23, 1998, pp. 23–60
  • [9] V. Bergelson and A. Leibman “Set-polynomials and polynomial extension of the Hales-Jewett theorem” In Annals of Mathematics 150, 1999, pp. 33–75
  • [10] V. Bergelson, A. Leibman and E. Lesigne “Intersective polynomials and polynomial Szemerédi theorem” In Advances in Mathematics 219, 2008, pp. 369–388
  • [11] V. Bergelson and R. McCutcheon “An ergodic IP polynomial Szemerédi theorem” In Memoirs of the AMS 146.695, 2000, pp. 1–106
  • [12] V. Bergelson and I. Rusza “Squarefree numbers, IP sets, and ergodic theory” In Paul Erdős and his Mathematics 4, 5 Janos Bolyai Mathematical Society, 2002, pp. 147–160
  • [13] A. Bernardino, R. Pacheco and M. Silva “The gap structure of a family of integer subsets” #P1.47 In The Electronic Journal of Combinatorics 21.1, 2014
  • [14] É. Borel “Les probabilités dénombrables et leurs applications arithmétiques” In Rendiconti del Circolo Matematico di Palermo 27, 1909, pp. 247–271
  • [15] T.. Brown “An interesting combinatorial method in the theory of locally finite semigroups” In Pacific Journal of Mathematics 36, 1971, pp. 285–289
  • [16] C. Calude and T. Zamfirescu “Most numbers obey no probability laws” In Publicationes Mathematicae Debrecen 54 (Supplement), 1999, pp. 619–623
  • [17] F. Cellarosi and I. Vinogradov “Ergodic properties of kk-free integers in number fields” In Journal of Modern Dynamics 7.3, 2013, pp. 461–488
  • [18] J.. Christopherson “Closed ideals in the Stone-Čech compactification of a countable semigroup and some applications to ergodic theory and topological dynamics”, 2014
  • [19] P. Erdős “A survey of problems in combinatorial number theory” In Annals of Discrete Mathematics 6, 1980, pp. 89–115
  • [20] L. Fuchs “Infinite Abelian Groups” Elsevier Science, 1970
  • [21] H. Furstenberg “Ergodic behavior of diagonal measures and a theorem of Szemerédi on arithmetic progressions” In Journal d’Analyse Mathématique 31, 1977, pp. 204–256
  • [22] H. Furstenberg “Recurrence in Ergodic Theory and Combinatorial Number Theory” Princeton University Press, 1981
  • [23] H. Furstenberg and B. Weiss “Topological dynamics and combinatorial number theory” In Journal d’Analyse Mathématique 34, 1978, pp. 61–85
  • [24] W.. Gottschalk and G.. Hedlund “Topological Dynamics” American Mathematical Society, 1955
  • [25] H. Halberstam and K. Roth “Sequences” Oxford University Press, 1966
  • [26] G.. Hardy and E.. Wright “An Introduction to the Theory of Numbers” Oxford University Press, 2008
  • [27] N. Hindman “Algebra in βS\beta S and its applications to Ramsey theory” In Mathematica Japonica 44, 1996, pp. 581–625
  • [28] N. Hindman “Finite sums from sequences within cells of a partition of \mathbb{N} In Journal of Combinatorial Theory, Series A 17.1, 1974, pp. 1–11
  • [29] N. Hindman “Preimages of points under the natural map from β(×)\beta(\mathbb{N}\times\mathbb{N}) to β×β\beta\mathbb{N}\times\beta\mathbb{N} In Proceedings of the American Mathematical Society 37.2, 1973, pp. 603–608
  • [30] N. Hindman and D. Strauss “Algebra in the Stone-Čech Compactificaton” De Gruyter, 2012
  • [31] L. Kuipers and H. Neiderreiter “Uniform Distribution of Sequences” Wiley, 1974
  • [32] A… Paterson “Amenability” American Mathematical Society, 1988
  • [33] I.. Sanov “A property of a representation of a free group” In Doklady Akademii Nauk SSSR 57, 1947, pp. 657–659
  • [34] E. Szemerédi “On sets of integers containing no kk elements in arithmetic progression” In Acta Arithmetica 27, 1975, pp. 199–243
  • [35] B.. Waerden “Beweis einer Baudetschen Vermutung” In Nieuw. Arch. Wisk. 15, 1927, pp. 212–216
  • [36] P. Walters “An Introduction to Ergodic Theory” Springer-Verlag, 1982
\missing

Si