This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Singmaster’s conjecture in the interior of Pascal’s triangle

Kaisa Matomäki Department of Mathematics and Statistics
University of Turku, 20014 Turku
Finland
[email protected]
Maksym Radziwiłł Department of Mathematics, Caltech, 1200 E California Blvd, Pasadena, CA, 91125
USA
[email protected]
Xuancheng Shao Department of Mathematics, University of Kentucky
715 Patterson Office Tower
Lexington, KY 40506
USA
[email protected]
Terence Tao Department of Mathematics, UCLA
405 Hilgard Ave
Los Angeles CA 90095
USA
[email protected]
 and  Joni Teräväinen Mathematical Institute, University of Oxford
Woodstock Road
Oxford OX2 6GG
United Kingdom
[email protected]
Abstract.

Singmaster’s conjecture asserts that every natural number greater than one occurs at most a bounded number of times in Pascal’s triangle; that is, for any natural number t2t\geq 2, the number of solutions to the equation (nm)=t\binom{n}{m}=t for natural numbers 1m<n1\leq m<n is bounded. In this paper we establish this result in the interior region exp(log2/3+εn)mnexp(log2/3+εn)\exp(\log^{2/3+\varepsilon}n)\leq m\leq n-\exp(\log^{2/3+\varepsilon}n) for any fixed ε>0\varepsilon>0. Indeed, when tt is sufficiently large depending on ε\varepsilon, we show that there are at most four solutions (or at most two in either half of Pascal’s triangle) in this region. We also establish analogous results for the equation (n)m=t(n)_{m}=t, where (n)mn(n1)(nm+1)(n)_{m}\coloneqq n(n-1)\dots(n-m+1) denotes the falling factorial.

1. Introduction

In 1971, Singmaster [22] conjectured that any natural number greater than one only appeared in Pascal’s triangle a bounded number of times. In asymptotic notation111Our conventions for asymptotic notation are set out in Section 1.5., we can express this conjecture as

Conjecture 1.1 (Singmaster’s conjecture).

For any natural number t2t\geq 2, the number of integer solutions 1m<n1\leq m<n to the equation

(1.1) (nm)=t\binom{n}{m}=t

is O(1)O(1).

Note that we can exclude the edges m=0,m=nm=0,m=n of Pascal’s triangle from consideration since (nm)=1\binom{n}{m}=1 in these cases. Currently the largest known number of solutions to (1.1) for a given tt is eight, arising from t=3003t=3003 and

(1.2) (n,m)=(3003,1),(78,2),(15,5),(14,6),(14,8),(15,10),(78,76),(3003,3002).(n,m)=(3003,1),(78,2),(15,5),(14,6),(14,8),(15,10),(78,76),(3003,3002).

For the purposes of attacking this conjecture, we may of course assume tt to be larger than any given absolute constant, which we shall implicitly do in the sequel. In particular we can assume that the iterated logarithms

log2tloglogt;log3tlogloglogt\log_{2}t\coloneqq\log\log t;\quad\log_{3}t\coloneqq\log\log\log t

are well-defined and positive.

In view of the symmetry

(1.3) (nm)=(nnm)\binom{n}{m}=\binom{n}{n-m}

we may restrict attention to the left half

(1.4) {(m,n)×:1mn/2}\{(m,n)\in\mathbb{N}\times\mathbb{N}:1\leq m\leq n/2\}

of Pascal’s triangle. For solutions to (1.1) in this half (1.4) of the triangle, we have

t=(nm)(2mm)4m/mt=\binom{n}{m}\geq\binom{2m}{m}\asymp 4^{m}/\sqrt{m}

by Stirling’s approximation (2.4), and thus we have the upper bound

(1.5) m1log4logt+O(log2t).m\leq\frac{1}{\log 4}\log t+O(\log_{2}t).

Since n(nm)n\mapsto\binom{n}{m} is an increasing function of nn for fixed m1m\geq 1, nn is uniquely determined by mm and tt. Thus by (1.5) we have at most O(logt)O(\log t) solutions to the equation (nm)=t\binom{n}{m}=t, a fact already observed in the original paper [22] of Singmaster. This bound was improved to O(logt/log2t)O(\log t/\log_{2}t) by Abbott, Erdős, and Hansen [1], to O(logtlog3t/log22t)O(\log t\log_{3}t/\log_{2}^{2}t) by Kane [14], and finally to O(logtlog3t/log23t)O(\log t\log_{3}t/\log_{2}^{3}t) in a followup work of Kane [15]. This remains the best known unconditional bound for the total number of solutions, although it was observed in [1] that the improved bound Oε(log2/3+εt)O_{\varepsilon}(\log^{2/3+\varepsilon}t) was available for any ε>0\varepsilon>0 assuming the conjecture of Cramér [9].

From the elementary inequalities

(nm)mm!<(nm)nmm!\frac{(n-m)^{m}}{m!}<\binom{n}{m}\leq\frac{n^{m}}{m!}

and some rearranging we see that any solution to (nm)=t\binom{n}{m}=t obeys the bounds

(tm!)1/mn<(tm!)1/m+m.(tm!)^{1/m}\leq n<(tm!)^{1/m}+m.

Applying Stirling’s approximation (2.4) (and also nmn\geq m) we can thus obtain the order of magnitude of nn as a function of mm and tt:

(1.6) nmt1/mn\asymp mt^{1/m}

or equivalently

(1.7) nmexp(logtm).\frac{n}{m}\asymp\exp\left(\frac{\log t}{m}\right).

In particular we see that nn grows extremely rapidly when the ratio m/logtm/\log t becomes small. This makes the difficulty of the problem increase as m/logtm/\log t approaches zero, and indeed treating the case of small values of m/logtm/\log t is the main obstruction to making further progress on bounding the total number of solutions.

Remark 1.2.

In the left half (1.4) of Pascal’s triangle, a finer application of Stirling’s approximation in [14, (3.1)] gave the more precise estimate

n=(tm!)1/m+m12+O(mt1/m).n=(tm!)^{1/m}+\frac{m-1}{2}+O(mt^{-1/m}).

We will not explicitly use this estimate here.

In this paper we study the opposite regime in which m/logtm/\log t is relatively large, or equivalently (by (1.7)) nn and mm are somewhat comparable (in the doubly logarithmic sense log2nlog2m\log_{2}n\asymp\log_{2}m). More precisely, we have the following result:

Theorem 1.3 (Singmaster’s conjecture in the interior of Pascal’s triangle).

Let 0<ε<10<\varepsilon<1, and assume that tt is sufficiently large depending on ε\varepsilon. Then there are at most two solutions to (1.1) in the region exp(log2/3+εn)mn/2\exp(\log^{2/3+\varepsilon}n)\leq m\leq n/2. By (1.3), we thus have at most four solutions to (1.1) in the region exp(log2/3+εn)mnexp(log2/3+εn)\exp(\log^{2/3+\varepsilon}n)\leq m\leq n-\exp(\log^{2/3+\varepsilon}n). Furthermore, in the smaller region exp(log2/3+εn)mn/exp(log1εn)\exp(\log^{2/3+\varepsilon}n)\leq m\leq n/\exp(\log^{1-\varepsilon^{\prime}}n) there is at most one solution, whenever 0<ε<ε2/3+ε0<\varepsilon^{\prime}<\frac{\varepsilon}{2/3+\varepsilon} and tt is sufficiently large depending on both ε\varepsilon and ε\varepsilon^{\prime}.

Remark 1.4.

The bound of two (or four) solutions is absolutely sharp, in view of the infinite family of solutions observed in [18], [23], [27] to the equation

(n+1m+1)=(nm+2)\binom{n+1}{m+1}=\binom{n}{m+2}

given by n=F2j+2F2j+31n=F_{2j+2}F_{2j+3}-1, m=F2jF2j+31m=F_{2j}F_{2j+3}-1, where FjF_{j} denotes the jthj^{th} Fibonacci number. See also [13] for further analysis of equations of this type. Besides this infinite family of collisions, and the “trivial” ones generated by (1.3), (n0)=1\binom{n}{0}=1, and (nm)=((nm)1)\binom{n}{m}=\binom{\binom{n}{m}}{1}, the only further known collisions between binomial coefficients arise from the identities (n2)=(nm)\binom{n}{2}=\binom{n^{\prime}}{m^{\prime}} for

(n,n,m)=(16,10,3),(21,2,4),(52,22,3),(120,36,3),(153,19,5),(221,17,8)(n,n^{\prime},m^{\prime})=(16,10,3),(21,2,4),(52,22,3),(120,36,3),(153,19,5),(221,17,8)

as well as the example in (1.2). It was conjectured by de Weger [11] that these above examples generate all the non-trivial collisions (nm)=(nm)=t\binom{n}{m}=\binom{n^{\prime}}{m^{\prime}}=t; this would of course imply Singmaster’s conjecture. This conjecture has been verified for (m,m)=(2,3)(m,m^{\prime})=(2,3) [3], for (m,m)=(2,4)(m,m^{\prime})=(2,4) [21], [10], for (m,m)=(2,5)(m,m^{\prime})=(2,5) [8], for (m,m)=(3,4)(m,m^{\prime})=(3,4) [20], [11], and (m,m)=(2,6),(2,8),(3,6),(4,6),(4,8)(m,m^{\prime})=(2,6),(2,8),(3,6),(4,6),(4,8) [26], and for n106n\leq 10^{6} or t1060t\leq 10^{60} in [5].

Remark 1.5.

In view of Theorem 1.3, we now see that to prove Conjecture 1.1, we may restrict attention without loss of generality to the region 2mexp(log2/3+εn)2\leq m\leq\exp(\log^{2/3+\varepsilon}n) for any fixed ε>0\varepsilon>0, or equivalently (by (1.7)) to 2mlogtlog23/2εt2\leq m\leq\frac{\log t}{\log^{3/2-\varepsilon}_{2}t} for any fixed ε>0\varepsilon>0. It follows from the conjecture of de Weger [11] mentioned in Remark 1.4 that for tt sufficiently large there is only at most one solution in this region, that is to say all but a finite number of binomial coefficients (nm)\binom{n}{m} for 2mexp(log2/3+εn)2\leq m\leq\exp(\log^{2/3+\varepsilon}n) are distinct. In this direction, the number of solutions to the equation (nm)=(nm)\binom{n}{m}=\binom{n^{\prime}}{m^{\prime}} for fixed 2m<m2\leq m<m^{\prime} has been shown (via Siegel’s theorem on integral points) to be finite in [4] (see also the earlier result [16] treating the case (m,m)=(2,p)(m,m^{\prime})=(2,p) for an odd prime pp). This implies that there are no collisions in the regime 2mw(n)2\leq m\leq w(n) if ww is a function of nn that goes to infinity sufficiently slowly as nn\to\infty. Unfortunately, due to the reliance on Siegel’s theorem, the function ww given by these arguments is completely ineffective.

Remark 1.6.

For some previous bounds of this type, in [1] it was shown that the number of solutions to (1.1) in the range n5/6mn/2n^{5/6}\leq m\leq n/2 was O(log3/4t)O(\log^{3/4}t), while the arguments in [14, §7], after some manipulation, show that the number of solutions to (1.1) in the range exp(log1/2+εn)mn5/6\exp(\log^{1/2+\varepsilon}n)\leq m\leq n^{5/6} is Oε(logt/log23t)O_{\varepsilon}(\log t/\log_{2}^{3}t).

Remark 1.7.

The implied quantitative bounds in the hypothesis “tt is sufficiently large depending on ε\varepsilon” are effective; however, we have made no attempt whatsoever to optimize them in this paper, and will likely be too large to be of use in numerical verification of Singmaster’s conjecture in their current form.

1.1. An analog for falling factorials

The methods used to handle the equation (1.1) can be modified to treat the variant equation

(1.8) (n)m=t(n)_{m}=t

for integers 1m<n1\leq m<n and t2t\geq 2, where (n)m(n)_{m} denotes the falling factorial

(n)mn(n1)(nm+1)=m!(nm).(n)_{m}\coloneqq n(n-1)\dots(n-m+1)=m!\binom{n}{m}.

We exclude the cases m=0,m=nm=0,m=n since (n)0=1(n)_{0}=1 and (n)n=(n)n1=n!(n)_{n}=(n)_{n-1}=n!. In [1, Theorem 4] it was shown that for any t2t\geq 2 the number of integer solutions (m,n)(m,n) to (1.8) with 1mn11\leq m\leq n-1 is O(logt)O(\sqrt{\log t}). We do not directly improve upon this bound here, but can obtain an analogue of Theorem 1.3:

Theorem 1.8 (Falling factorial multiplicity in the interior).

Let 0<ε<10<\varepsilon<1, and assume that tt is sufficiently large depending on ε\varepsilon. Then there are at most two integer solutions to (1.8) in the region exp(log2/3+εn)m<n\exp(\log^{2/3+\varepsilon}n)\leq m<n.

We establish this result in Section 5. Note that the bound of two is best possible, as can be seen from the infinite family of solutions

(a2a)a22a=(a2a1)a22a+1(a^{2}-a)_{a^{2}-2a}=(a^{2}-a-1)_{a^{2}-2a+1}

for any integer a>2a>2, and more generally

((a)b)(a)ba=((a)b1)(a)ba+b1((a)_{b})_{(a)_{b}-a}=((a)_{b}-1)_{(a)_{b}-a+b-1}

whenever 2b<a2\leq b<a are integers.

1.2. Strategy of proof

Theorem 1.3 is a consequence of two Propositions that we now describe. The proof of Theorem 1.8 will follow a similar pattern as described here and we refer the reader to Section 5 for details.

Proposition 1.9 (Distance estimate).

Let ε>0\varepsilon>0. Suppose we have two solutions (n,m),(n,m)(n,m),(n^{\prime},m^{\prime}) to (1.1) in the left half (1.4) of Pascal’s triangle. Then one has

mmεexp(log2/3+ε(n+n))m^{\prime}-m\ll_{\varepsilon}\exp(\log^{2/3+\varepsilon}(n+n^{\prime}))

for any ε>0\varepsilon>0. Furthermore, if

m,mexp(log2/3+ε(n+n))m,m^{\prime}\geq\exp(\log^{2/3+\varepsilon}(n+n^{\prime}))

then we additionally have

nnεexp(log2/3+ε(n+n)).n^{\prime}-n\ll_{\varepsilon}\exp(\log^{2/3+\varepsilon}(n+n^{\prime})).

Note how this proposition is consistent with the example in Remark 1.4. We shall discuss the proof of Proposition 1.9 in Section 1.3. For the application to Theorem 1.3, Proposition 1.9 localizes all solutions to (1.1) to a region of small diameter. To conclude Theorem 1.3, we can now proceed by adapting the Taylor expansion arguments of Kane [14], [15], in which one views nn as an analytic function of mm (keeping tt fixed) and exploits the non-vanishing of certain derivatives of this function; see Section 2. This is what the proposition below accomplishes. In fact in our analysis only two derivatives of this function are needed (i.e., we only need to exploit the convexity properties of nn as a function of mm).

Proposition 1.10 (Kane-type estimate).

Let ε>0\varepsilon>0. Suppose that (n,m)(n,m) is a solution to (1.1) in the left-half (1.4) of Pascal’s triangle. There there exists at most one other solution (n,m)(n,m)(n^{\prime},m^{\prime})\neq(n,m) to (1.1) with m<mm^{\prime}<m, n>nn^{\prime}>n and

|mm|+|nn|exp((log2t)1ε).|m-m^{\prime}|+|n-n^{\prime}|\ll\exp((\log_{2}t)^{1-\varepsilon}).

With these two Propositions at hand it is easy to deduce Theorem 1.3.

Deduction of Theorem 1.3.

Let ε>0\varepsilon>0, let tt be sufficiently large depending on ε\varepsilon, and let (n,m)(n,m) be the solution to (1.1) in the region

(1.9) {(n,m):exp(log2/3+εn)mn/2}\{(n,m):\exp(\log^{2/3+\varepsilon}n)\leq m\leq n/2\}

with the maximal value of mm (if there are no such solutions then of course Theorem 1.3 is trivial). For brevity we allow all implied constants in the following arguments to depend on ε\varepsilon. If (n,m)(n^{\prime},m^{\prime}) is any other solution in this region, then m<mm^{\prime}<m and n>nn^{\prime}>n. From (1.7) we have

mlogtlognlogtlog12/3+εmlogtlog212/3+εtm\gg\frac{\log t}{\log n}\geq\frac{\log t}{\log^{\frac{1}{2/3+\varepsilon}}m}\gg\frac{\log t}{\log_{2}^{\frac{1}{2/3+\varepsilon}}t}

thanks to (1.5). From further application of (1.7) we then have

nexp(O(log212/3+εt)).n\ll\exp(O(\log_{2}^{\frac{1}{2/3+\varepsilon}}t)).

Similarly for nn^{\prime}. Applying Proposition 1.9 (with ε\varepsilon replaced by a sufficiently small quantity), we conclude that

(1.10) mm,nnεexp(O(log21εt))m-m^{\prime},n^{\prime}-n\ll_{\varepsilon^{\prime}}\exp(O(\log^{1-\varepsilon^{\prime}}_{2}t))

whenever 1ε>2/32/3+ε1-\varepsilon^{\prime}>\frac{2/3}{2/3+\varepsilon}, or equivalently ε<ε2/3+ε\varepsilon^{\prime}<\frac{\varepsilon}{2/3+\varepsilon}. The result now follows from Proposition 1.10. ∎

Remark 1.11.

The above arguments showed that for tt sufficiently large depending on ε\varepsilon, there were at most four solutions to (1.1) in the region exp(log2/3+εn)mnexp(log2/3+εn)\exp(\log^{2/3+\varepsilon}n)\leq m\leq n-\exp(\log^{2/3+\varepsilon}n). A modification of the argument also shows that there cannot be exactly three such solutions. For if this were the case, we see from (1.3) that there must be a solution (n,m)(n,m) with n=2mn=2m, so that mlogtm\asymp\log t by Stirling’s approximation. For all other solutions (n,m)(n^{\prime},m^{\prime}) to (1.1) we have nn+1n^{\prime}\geq n+1, hence

(nn/2)=t=(nm)(n+1m)\binom{n}{n/2}=t=\binom{n^{\prime}}{m^{\prime}}\geq\binom{n+1}{m^{\prime}}

and hence (by Stirling’s approximation)

(n+1m)(12+O(1n))(n+1(n+1)/2).\binom{n+1}{m^{\prime}}\leq\left(\frac{1}{2}+O\left(\frac{1}{n}\right)\right)\binom{n+1}{(n+1)/2}.

By Stirling’s approximation (or the central limit theorem of de Moivre and Laplace) this forces |mn+12|n|m^{\prime}-\frac{n+1}{2}|\gg\sqrt{n}, thus |mm|m1/2|m^{\prime}-m|\gg m^{1/2}. But this contradicts (1.10).

1.3. Proof methods

We now discuss the method of proof of Proposition 1.9, which is our main new contribution. In contrast to the “Archimedean” arguments of Kane (such as Proposition 1.10) that use real and complex analysis of the binomial coefficients (nm)\binom{n}{m}, the proof of Proposition 1.9 relies more on “non-Archimedean” arguments, based on evaluating the pp-adic valuations vp((nm))v_{p}\left(\binom{n}{m}\right) for various primes pp, defined as the number of times pp divides (nm)\binom{n}{m}. From the classical Legendre formula

(1.11) vp(n!)=j=1npj,v_{p}(n!)=\sum_{j=1}^{\infty}\left\lfloor\frac{n}{p^{j}}\right\rfloor,

where x\lfloor x\rfloor is the integer part of xx, we see that

(1.12) vp((nm))=j=1(npjmpjnmpj)=j=1({mpj}+{nmpj}{npj})\begin{split}v_{p}\left(\binom{n}{m}\right)&=\sum_{j=1}^{\infty}\left(\left\lfloor\frac{n}{p^{j}}\right\rfloor-\left\lfloor\frac{m}{p^{j}}\right\rfloor-\left\lfloor\frac{n-m}{p^{j}}\right\rfloor\right)\\ &=\sum_{j=1}^{\infty}\left(\left\{\frac{m}{p^{j}}\right\}+\left\{\frac{n-m}{p^{j}}\right\}-\left\{\frac{n}{p^{j}}\right\}\right)\end{split}

where {x}xx\{x\}\coloneqq x-\lfloor x\rfloor denotes the fractional part of xx. Note that the summands here vanish whenever pj>np^{j}>n. From this identity we see that if (n,m),(n,m)(n,m),(n^{\prime},m^{\prime}) are two solutions to (1.1) then we must have

(1.13) j=1({mpj}+{nmpj}{npj})=j=1({mpj}+{nmpj}{npj})\sum_{j=1}^{\infty}\left(\left\{\frac{m}{p^{j}}\right\}+\left\{\frac{n-m}{p^{j}}\right\}-\left\{\frac{n}{p^{j}}\right\}\right)=\sum_{j=1}^{\infty}\left(\left\{\frac{m^{\prime}}{p^{j}}\right\}+\left\{\frac{n^{\prime}-m^{\prime}}{p^{j}}\right\}-\left\{\frac{n^{\prime}}{p^{j}}\right\}\right)

for all primes pp. Our strategy will be to apply this equation with pp set equal to a random prime 𝐩\mathbf{p} drawn uniformly amongst all primes in the interval [P,P+Plog100P][P,P+P\log^{-100}P] where the scale PP is something like exp(log2/3+ε/2(n+n))\exp(\log^{2/3+\varepsilon/2}(n+n^{\prime})), and inspect the distribution of the resulting random variables on the left and right-hand sides of (1.13) in order to obtain a contradiction when m,mm,m^{\prime} or n,nn,n^{\prime} are sufficiently well separated. In order to do this we need some information concerning the equidistribution of fractional parts such as {n𝐩j}\{\frac{n}{\mathbf{p}^{j}}\}. This will be provided by the following estimate, proven in Section 4. There and later the letter pp always denotes a prime.

Proposition 1.12 (Equidistribution estimate).

Let ε>0\varepsilon>0 and P2P\geq 2 and let II be an interval contained in [P,2P][P,2P]. Let M,NM,N be real numbers with M,N=O(exp(log3/2εP))M,N=O(\exp(\log^{3/2-\varepsilon}P)), and let jj be a natural number.

  • (i)

    For all A>0A>0,

    pIe(Np+Mpj)=Ie(Nt+Mtj)dtlogt+Oε,A(PlogAP).\sum_{p\in I}e\left(\frac{N}{p}+\frac{M}{p^{j}}\right)=\int_{I}e\left(\frac{N}{t}+\frac{M}{t^{j}}\right)\ \frac{dt}{\log t}+O_{\varepsilon,A}(P\log^{-A}P).
  • (ii)

    Let W:2W\colon\mathbb{R}^{2}\to\mathbb{C} be a smooth 2\mathbb{Z}^{2}-periodic function. Then, for all A>0A>0,

    pIW(Np,Mpj)=IW(Nt,Mtj)dtlogt+Oε,A(WC3PlogAP),\sum_{p\in I}W\left(\frac{N}{p},\frac{M}{p^{j}}\right)=\int_{I}W\left(\frac{N}{t},\frac{M}{t^{j}}\right)\ \frac{dt}{\log t}+O_{\varepsilon,A}(\|W\|_{C^{3}}P\log^{-A}P),

    where

    WC3j=03supx2|jW(x)|.\|W\|_{C^{3}}\coloneqq\sum_{j=0}^{3}\sup_{x\in\mathbb{R}^{2}}|\nabla^{j}W(x)|.

One can generalize this proposition to control the joint equidistribution of any bounded number of expressions of the form {npj}\{\frac{n}{p^{j}}\}, but for our applications it will suffice to understand the equidistribution of pairs {Np}\{\frac{N}{p}\}, {Mpj}\{\frac{M}{p^{j}}\}.

When it comes to the proof of Proposition 1.12, the first step is to use Fourier expansion to reduce part (ii) of the proposition to part (i). For part (i), the case where |N|P+|M|Pj\frac{|N|}{P}+\frac{|M|}{P^{j}} is small (say logO(A)P\leq\log^{O(A)}P) is easily handled using the prime number theorem with classical error term. In the regime where |N|P+|M|Pj\frac{|N|}{P}+\frac{|M|}{P^{j}} is large, we use Vaughan’s identity to decompose the sum in (i) into type I and II sums, and assert that these exhibit cancellation; the type I and II bounds are given in (4.9) and (4.11).

Both type I and type II sums can be handled using Vinogradov’s bound for sums of the form nIe(f(n))\sum_{n\in I}e(f(n)) with ff smooth, although we need to first cut from II small intervals around zeros of the first logP\log P derivatives of N/t+M/tjN/t+M/t^{j}. This way we obtain that the sum in (i) exhibits cancellation. It is here that the restriction N,M=O(exp(log3/2εP))N,M=O(\exp(\log^{3/2-\varepsilon}P)) arises; even under the Riemann hypothesis we do not know how to relax this requirement222Using standard randomness heuristics one could tentatively conjecture that this restriction N,M=O(exp(log3/2εP))N,M=O(\exp(\log^{3/2-\varepsilon}P)) could be relaxed to N,M=O(exp(Pc))N,M=O(\exp(P^{c})) for some constant c>0c>0; this would improve the range exp(log2/3+εn)mn/2\exp(\log^{2/3+\varepsilon}n)\leq m\leq n/2 in Theorem 1.3 to logCnmn/2\log^{C}n\leq m\leq n/2 for some constant C>0C>0..

Once the equidistribution estimate, Proposition 1.12, is established, the analysis of the distribution of both sides of (1.13) is relatively straightforward, as long as the scale PP is chosen so that the powers PjP^{j} do not lie close to various integer combinations of m,n,m,nm,n,m^{\prime},n^{\prime}. However, there are some delicate cases when two of the numbers n,m,nm,n,m,nmn,m,n-m,n^{\prime},m^{\prime},n^{\prime}-m^{\prime} are “commensurable” in the sense that one of them is close to a rational multiple of the other, where the rational multiplier has small height. Commensurable integers are also known to generate some exceptional examples of integer factorial ratios [6], [7], [25]. Fortunately, we can handle these cases in our context by an analysis of covariances between various fractional parts {n1𝐩},{n2𝐩}\{\frac{n_{1}}{\mathbf{p}}\},\{\frac{n_{2}}{\mathbf{p}}\}, in particular taking advantage of the fact that these covariances are non-negative up to small errors, and small unless n1,n2n_{1},n_{2} are very highly commensurable.

1.4. Acknowledgments

KM was supported by Academy of Finland grant no. 285894. MR acknowledges the support of NSF grant DMS-1902063 and a Sloan Fellowship. XS was supported by NSF grant DMS-1802224. TT was supported by a Simons Investigator grant, the James and Carol Collins Chair, the Mathematical Analysis & Application Research Fund Endowment, and by NSF grant DMS-1764034. JT was supported by a Titchmarsh Fellowship.

1.5. Notation

We use XYX\ll Y, X=O(Y)X=O(Y), or YXY\gg X to denote the estimate |X|CY|X|\leq CY for some constant CC. If we wish to permit this constant to depend on one or more parameters we shall indicate this by appropriate subscripts, thus for instance Oε,A(Y)O_{\varepsilon,A}(Y) denotes a quantity bounded in magnitude by Cε,AYC_{\varepsilon,A}Y for some quantity Cε,AC_{\varepsilon,A} depending only on ε,A\varepsilon,A. We write XYX\asymp Y for XYXX\ll Y\ll X.

We use 1E1_{E} to denote the indicator of an event EE, thus 1E1_{E} equals 11 when EE is true and 0 otherwise.

We let ee denote the standard real character e(x)e2πixe(x)\coloneqq e^{2\pi ix}.

2. Derivative estimates

We generalize the binomial coefficient (nm)\binom{n}{m} to real 0mn0\leq m\leq n by the formula

(nm)Γ(n+1)Γ(m+1)Γ(nm+1)\binom{n}{m}\coloneqq\frac{\Gamma(n+1)}{\Gamma(m+1)\Gamma(n-m+1)}

where

Γ(x)eγxxn=1(1+xn)1ex/n\Gamma(x)\coloneqq\frac{e^{-\gamma x}}{x}\prod_{n=1}^{\infty}\left(1+\frac{x}{n}\right)^{-1}e^{x/n}

is the Gamma function (with γ\gamma the Euler–Mascheroni constant). This is of course consistent with the usual definition of the binomial coefficient. Observe that the digamma function

ψ(x)ΓΓ(x)=γ+n=01n+11n+x\psi(x)\coloneqq\frac{\Gamma^{\prime}}{\Gamma}(x)=-\gamma+\sum_{n=0}^{\infty}\frac{1}{n+1}-\frac{1}{n+x}

is a smooth increasing concave function on (0,+)(0,+\infty), with

ψ(x)=n=01(n+x)2\psi^{\prime}(x)=\sum_{n=0}^{\infty}\frac{1}{(n+x)^{2}}

positive and decreasing, and

ψ′′(x)=n=02(n+x)3\psi^{\prime\prime}(x)=-\sum_{n=0}^{\infty}\frac{2}{(n+x)^{3}}

negative. For future reference we also observe the standard asymptotics

(2.1) ψ(x)\displaystyle\psi(x) =logx+O(1x)\displaystyle=\log x+O\left(\frac{1}{x}\right)
(2.2) ψ(x)\displaystyle\psi^{\prime}(x) =1x+O(1x2)\displaystyle=\frac{1}{x}+O\left(\frac{1}{x^{2}}\right)
(2.3) ψ′′(x)\displaystyle\psi^{\prime\prime}(x) =1x2+O(1x3)\displaystyle=-\frac{1}{x^{2}}+O\left(\frac{1}{x^{3}}\right)

and the Stirling approximation

(2.4) logΓ(x)=xlogxx12logx+log2π+O(1x)\log\Gamma(x)=x\log x-x-\frac{1}{2}\log x+\log\sqrt{2\pi}+O\left(\frac{1}{x}\right)

for any x1x\geq 1; see e.g., [2, §6.1, 6.3, 6.4]. One could also extend these functions meromorphically to the entire complex plane, but we will not need to do so here.

From the increasing nature of ψ\psi we see that n(nm)n\mapsto\binom{n}{m} is strictly increasing on [m,+)[m,+\infty) for fixed real m>0m>0, and from Stirling’s approximation (2.4) we see that it goes to infinity as nn\to\infty. Thus for given t>1t>1, we see from the inverse function theorem that there exists a unique smooth function ft:[0,+)[0,+)f_{t}\colon[0,+\infty)\to[0,+\infty) with ft(m)>mf_{t}(m)>m for all mm, such that

(2.5) (ft(m)m)=t.\binom{f_{t}(m)}{m}=t.

In particular, the equation (1.1) holds for given integers 1mn1\leq m\leq n and t2t\geq 2 if and only if n=ft(m)n=f_{t}(m). This function ftf_{t} was analyzed by Kane [14], who among other things was able to extend ftf_{t} holomorphically to a certain sector, which then allowed him to estimate high derivatives of this function. However, for our analysis we will only need to control the first few derivatives of ftf_{t}, which can be estimated by hand:

Proposition 2.1 (Estimates on the first few derivatives).

Let t,mt,m be sufficiently large with mft(m)/2m\leq f_{t}(m)/2. Then

(2.6) ft(m)mt1/mf_{t}(m)\asymp mt^{1/m}

and

(2.7) ft(m)(ft(m)2m)logtm2-f^{\prime}_{t}(m)\asymp(f_{t}(m)-2m)\frac{\log t}{m^{2}}

and

(2.8) ft′′(m)ft(m)(logtm2)2.f^{\prime\prime}_{t}(m)\asymp f_{t}(m)\left(\frac{\log t}{m^{2}}\right)^{2}.

In particular, ftf_{t} is convex and decreasing in this regime.

The bound (2.6) can be viewed as a generalization of (1.6) to non-integer values of n,m,tn,m,t.

Proof.

Taking logarithms in (2.5) we have

(2.9) logΓ(ft(m)+1)logΓ(ft(m)m+1)logΓ(m+1)=logt.\log\Gamma(f_{t}(m)+1)-\log\Gamma(f_{t}(m)-m+1)-\log\Gamma(m+1)=\log t.

Writing n=ft(m)2mn=f_{t}(m)\geq 2m, we thus see from the mean value theorem that

mψ(nθm+1)logΓ(m+1)=logtm\psi(n-\theta m+1)-\log\Gamma(m+1)=\log t

for some 0θ10\leq\theta\leq 1 depending on t,mt,m. Applying (2.1), we conclude that

log(nθm)=1m(logt+logΓ(m+1))+O(1n)\log(n-\theta m)=\frac{1}{m}(\log t+\log\Gamma(m+1))+O(\frac{1}{n})

which implies that

nnθmexp(1m(logt+logΓ(m+1)))n\asymp n-\theta m\asymp\exp(\frac{1}{m}(\log t+\log\Gamma(m+1)))

and the claim (2.6) then follows from Stirling’s approximation (2.4).

If we differentiate (2.9) we obtain

(2.10) ft(m)ψ(ft(m)+1)(ft(m)1)ψ(ft(m)m+1)ψ(m+1)=0.f^{\prime}_{t}(m)\psi(f_{t}(m)+1)-(f^{\prime}_{t}(m)-1)\psi(f_{t}(m)-m+1)-\psi(m+1)=0.

In particular we obtain the first derivative formula

(2.11) ft(m)=ψ(m+1)ψ(nm+1)ψ(n+1)ψ(nm+1).f^{\prime}_{t}(m)=\frac{\psi(m+1)-\psi(n-m+1)}{\psi(n+1)-\psi(n-m+1)}.

From (2.2) and the mean value theorem we have

(2.12) ψ(n+1)ψ(nm+1)mn\psi(n+1)-\psi(n-m+1)\asymp\frac{m}{n}

while from either the mean-value theorem and (2.2) (if mnm\asymp n) or from (2.1) (if say mn/4m\leq n/4) we see that

ψ(nm+1)ψ(m+1)n2mnlognm.\psi(n-m+1)-\psi(m+1)\asymp\frac{n-2m}{n}\log\frac{n}{m}.

We conclude that

ft(m)n2mmlognm-f^{\prime}_{t}(m)\asymp\frac{n-2m}{m}\log\frac{n}{m}

and the claim (2.7) follows from (2.6).

Differentiating (2.10) again, we conclude

ft′′(m)ψ(n+1)+(ft(m))2ψ(n+1)ft′′(m)ψ(nm+1)(ft(m)1)2ψ(nm+1)ψ(m+1)=0.f^{\prime\prime}_{t}(m)\psi(n+1)+(f^{\prime}_{t}(m))^{2}\psi^{\prime}(n+1)-f^{\prime\prime}_{t}(m)\psi(n-m+1)-(f^{\prime}_{t}(m)-1)^{2}\psi^{\prime}(n-m+1)-\psi^{\prime}(m+1)=0.

which we can rearrange using (2.11) as

ft′′(m)(ψ(n+1)ψ(nm+1))3\displaystyle f^{\prime\prime}_{t}(m)(\psi(n+1)-\psi(n-m+1))^{3} =(ψ(n+1)ψ(nm+1))2ψ(m+1)\displaystyle=(\psi(n+1)-\psi(n-m+1))^{2}\psi^{\prime}(m+1)
+(ψ(n+1)ψ(m+1))2ψ(nm+1)\displaystyle\quad+(\psi(n+1)-\psi(m+1))^{2}\psi^{\prime}(n-m+1)
(ψ(nm+1)ψ(m+1))2ψ(n+1).\displaystyle\quad-(\psi(n-m+1)-\psi(m+1))^{2}\psi^{\prime}(n+1).

From (2.12), (2.6) it thus suffices to show that

(ψ(n+1)ψ(nm+1))2ψ(m+1)\displaystyle(\psi(n+1)-\psi(n-m+1))^{2}\psi^{\prime}(m+1)\quad
+(ψ(n+1)ψ(m+1))2ψ(nm+1)\displaystyle+(\psi(n+1)-\psi(m+1))^{2}\psi^{\prime}(n-m+1)\quad
(ψ(nm+1)ψ(m+1))2ψ(n+1)\displaystyle-(\psi(n-m+1)-\psi(m+1))^{2}\psi^{\prime}(n+1) mn2log2(n/m).\displaystyle\asymp\frac{m}{n^{2}}\log^{2}(n/m).

The quantity (ψ(n+1)ψ(nm+1))2ψ(m+1)(\psi(n+1)-\psi(n-m+1))^{2}\psi^{\prime}(m+1) is non-negative and is of size O(m/n2)O(m/n^{2}) by (2.12), (2.2). Thus it will suffice to show that

(ψ(n+1)ψ(m+1))2ψ(nm+1)(ψ(nm+1)ψ(m+1))2ψ(n+1)mn2log2(n/m).(\psi(n+1)-\psi(m+1))^{2}\psi^{\prime}(n-m+1)-(\psi(n-m+1)-\psi(m+1))^{2}\psi^{\prime}(n+1)\asymp\frac{m}{n^{2}}\log^{2}(n/m).

We split the left-hand side as the sum of

(ψ(m+1)ψ(n+1))2(ψ(nm+1)ψ(n+1))(\psi(m+1)-\psi(n+1))^{2}(\psi^{\prime}(n-m+1)-\psi^{\prime}(n+1))

and

ψ(n+1)[(ψ(n+1)ψ(m+1))2(ψ(nm+1)ψ(m+1))2]\displaystyle\psi^{\prime}(n+1)[(\psi(n+1)-\psi(m+1))^{2}-(\psi(n-m+1)-\psi(m+1))^{2}]
=(ψ(n+1)ψ(m+1)+ψ(nm+1)ψ(m+1))(ψ(n+1)ψ(nm+1))ψ(n+1).\displaystyle=(\psi(n+1)-\psi(m+1)+\psi(n-m+1)-\psi(m+1))(\psi(n+1)-\psi(n-m+1))\psi^{\prime}(n+1).

From (2.1), (2.3), and the mean value theorem the first term is positive and comparable to mn2log2nm\frac{m}{n^{2}}\log^{2}\frac{n}{m}; similarly, from (2.1), (2.2), and (2.12) the second term is positive and bounded above by O(mn2lognm)O(\frac{m}{n^{2}}\log\frac{n}{m}). The claim follows. ∎

To apply these derivative bounds, we use the following lemma that implicitly appears in [14], [15]:

Lemma 2.2 (Small non-zero derivative implies few integer values).

Let k1k\geq 1 be a natural number, and suppose that f:If:I\to\mathbb{R} is a smooth function on an interval II of some length |I||I| such that one has the derivative bound

(2.13) 0<|1k!f(k)(x)|<|I|k(k+1)/20<\left|\frac{1}{k!}f^{(k)}(x)\right|<|I|^{-k(k+1)/2}

for all xIx\in I. Then there are at most kk integers mIm\in I for which f(m)f(m) is also an integer.

Proof.

Suppose for contradiction that there are k+1k+1 distinct integers m1,,mk+1Im_{1},\dots,m_{k+1}\in I with f(m1),,f(mk+1)f(m_{1}),\dots,f(m_{k+1}) an integer. By Lagrange interpolation, the function

(2.14) P(x)i=1k+11jk+1:jixmjmimjf(mi)P(x)\coloneqq\sum_{i=1}^{k+1}\prod_{1\leq j\leq k+1:j\neq i}\frac{x-m_{j}}{m_{i}-m_{j}}f(m_{i})

is a polynomial of degree at most kk such that f(x)P(x)f(x)-P(x) vanishes at m1,,mk+1m_{1},\dots,m_{k+1}. By many applications of Rolle’s theorem (see [14, Corollary 2.1]), there must then exist xIx_{*}\in I such that f(k)(x)P(k)(x)f^{(k)}(x_{*})-P^{(k)}(x_{*}) vanishes. From (2.14), 1k!P(k)(x)\frac{1}{k!}P^{(k)}(x) (which is the degree kk coefficient of P(x)P(x)) is an integer multiple of 11i<jk+1|mimj||I|k(k+1)/2\frac{1}{\prod_{1\leq i<j\leq k+1}|m_{i}-m_{j}|}\geq|I|^{-k(k+1)/2}, and thus either vanishes or has magnitude at least |I|k(k+1)/2|I|^{-k(k+1)/2}. But this contradicts (2.13). ∎

As an application of these bounds, we can locally control the number of solutions (1.1) in the region n1/2+εmn/2n^{1/2+\varepsilon}\leq m\leq n/2, thus giving a version of Theorem 1.3 in a small interval:

Corollary 2.3.

Let 0<ε<10<\varepsilon<1, let tt be sufficiently large depending on ε\varepsilon, and suppose that (n,m)(n,m) is a solution to (1.1) in the left half (1.4) of Pascal’s triangle with mn1/2+εm\geq n^{1/2+\varepsilon}. Then there is at most one other solution (n,m)(n^{\prime},m^{\prime}) to (1.1) in the interval m[mmε/10,m]m^{\prime}\in[m-m^{\varepsilon/10},m].

Proof.

From (1.7) and the hypothesis n1/2+εmn/2n^{1/2+\varepsilon}\leq m\leq n/2 we have

(2.15) logtlog2tmlogt.\frac{\log t}{\log_{2}t}\ll m\ll\log t.

For xx in the interval I[mmε/10,m]I\coloneqq[m-m^{\varepsilon/10},m], we then have logtx=logtm+O(m2+ε/10logt)=logtm+O(1)\frac{\log t}{x}=\frac{\log t}{m}+O(m^{-2+\varepsilon/10}\log t)=\frac{\log t}{m}+O(1), and so we see from Proposition 2.1 and (2.15) that ft(x)nf_{t}(x)\asymp n and

0<|ft′′(x)|n(logtm2)2nm2log22tnm2log2m0<|f^{\prime\prime}_{t}(x)|\ll n\left(\frac{\log t}{m^{2}}\right)^{2}\ll\frac{n}{m^{2}}\log_{2}^{2}t\ll\frac{n}{m^{2}}\log^{2}m

for all xIx\in I. Since mn1/2+εm\geq n^{1/2+\varepsilon} and tt is sufficiently large depending on ε\varepsilon, mm is also sufficiently large depending on ε\varepsilon, and we have

0<|ft′′(x)|<|I|30<|f^{\prime\prime}_{t}(x)|<|I|^{-3}

for all xIx\in I. Applying Lemma 2.2, there are at most two integers mIm^{\prime}\in I with ft(m)f_{t}(m^{\prime}) an integer. Since mm is already one of these integers, the claim follows. ∎

The same method, using higher derivative estimates on ftf_{t}, also gives similar results (with weaker bounds on the number of solutions) for m<n1/2+εm<n^{1/2+\varepsilon}; see [14], [15]. However, we will only need to apply this method in the mn1/2+εm\geq n^{1/2+\varepsilon} regime here.

We are now ready to prove Proposition 1.10.

Proof of Proposition 1.10.

Let ε>0\varepsilon>0, let tt be sufficiently large depending on ε\varepsilon, and let (n,m)(n,m) be a solution to (1.1) in the region

(2.16) {(n,m):exp(log2/3+εn)mn/2}\{(n,m):\exp(\log^{2/3+\varepsilon}n)\leq m\leq n/2\}

For brevity we allow all implied constants in the following arguments to depend on ε\varepsilon. Suppose (n,m)(n^{\prime},m^{\prime}) is another solution in this region with m<mm^{\prime}<m, n>nn^{\prime}>n and

mm,nnεexp(O(log21εt)).m-m^{\prime},n^{\prime}-n\ll_{\varepsilon^{\prime}}\exp(O(\log^{1-\varepsilon^{\prime}}_{2}t)).

From (2.7) and convexity (and the bounds mlogtm\ll\log t and mm1m-m^{\prime}\geq 1) we have

nn\displaystyle n^{\prime}-n =ft(m)ft(m)\displaystyle=f_{t}(m^{\prime})-f_{t}(m)
ft(m)(mm)\displaystyle\geq f^{\prime}_{t}(m)(m^{\prime}-m)
(n2m)logtm2(mm)\displaystyle\gg(n-2m)\frac{\log t}{m^{2}}(m-m^{\prime})
n2mm\displaystyle\gg\frac{n-2m}{m}
=nm2\displaystyle=\frac{n}{m}-2

and thus

n/mεexp(O(log21εt))n/m\ll_{\varepsilon^{\prime}}\exp(O(\log^{1-\varepsilon^{\prime}}_{2}t))

From (1.7) we have nlogtn\gg\log t, hence log21εtlog1εn\log^{1-\varepsilon^{\prime}}_{2}t\ll\log^{1-\varepsilon^{\prime}}n, and so for some constant C>0C>0, mn/exp(Clog1εn)n9/10m\geq n/\exp(C\log^{1-\varepsilon^{\prime}}n)\geq n^{9/10} (shrinking ε\varepsilon^{\prime} slightly if necessary) if tt is sufficiently large depending on ε\varepsilon^{\prime}. The result now follows from Corollary 2.3.

It remains to establish Proposition 1.9. This will be the objective of the next two sections of the paper.

3. The distance bound

In this section we assume Proposition 1.12 and use it to establish Proposition 1.9.

Throughout this section 0<ε<10<\varepsilon<1 will be fixed; we can assume it to be small. We may assume that tt is sufficiently large depending on ε\varepsilon, as the claim is trivial otherwise. We may assume that m<mm^{\prime}<m, hence also n>nn^{\prime}>n. We assume for sake of contradiction that at least one of the claims

(3.1) mmexp(log2/3+εn)m-m^{\prime}\geq\exp(\log^{2/3+\varepsilon}n^{\prime})

and

(3.2) m,m,nnexp(log2/3+εn)m,m^{\prime},n^{\prime}-n\geq\exp(\log^{2/3+\varepsilon}n^{\prime})

is true, as the claim is trivial otherwise. This allows us to select a “good” scale:

Lemma 3.1 (Selection of scale).

With the above assumptions, there exists P>1P>1 obeying the following axioms:

  • (i)

    (m,m,n,nm,m^{\prime},n,n^{\prime} not too large) We have m,m,n,nexp(log32ε10P)m,m^{\prime},n,n^{\prime}\leq\exp(\log^{\frac{3}{2}-\frac{\varepsilon}{10}}P). (In particular, PP will be sufficiently large depending on ε\varepsilon, since otherwise t=Oε(1)t=O_{\varepsilon}(1).)

  • (ii)

    (Dichotomy) If a,a,b,ba,a^{\prime},b,b^{\prime} are integers with |a|,|a|,|b|,|b|log1/100P|a|,|a^{\prime}|,|b|,|b^{\prime}|\leq\log^{1/100}P, and jj is a natural number, then either

    (3.3) |am+am+bn+bn|Pj/log1000P|am+a^{\prime}m^{\prime}+bn+b^{\prime}n^{\prime}|\leq P^{j}/\log^{1000}P

    or

    |am+am+bn+bn|Pjlog1000P.|am+a^{\prime}m^{\prime}+bn+b^{\prime}n^{\prime}|\geq P^{j}\log^{1000}P.
  • (iii)

    (Separation) At least one of the statements

    mmPlog100Pm-m^{\prime}\geq P\log^{100}P

    and

    m,m,nnPlog100Pm,m^{\prime},n^{\prime}-n\geq P\log^{100}P

    is true.

Proof.

We restrict PP to be a power of two in the range

exp(log2/3+ε/2n)Pexp(2log2/3+ε/2n);\exp(\log^{2/3+\varepsilon/2}n^{\prime})\leq P\leq\exp(2\log^{2/3+\varepsilon/2}n^{\prime});

such a choice will automatically obey (i) since n>n>m>mn^{\prime}>n>m>m^{\prime} and (iii) since we assumed that either (3.1) or (3.2) holds. There are log2/3+ε/2n\gg\log^{2/3+\varepsilon/2}n^{\prime} choices for PP. Some of these will not obey (ii), but we can control the number of exceptions as follows. Firstly, observe that the conclusion (3.3) will hold unless j=O(log1/3n)j=O(\log^{1/3}n^{\prime}), so we may restrict attention to this range of jj. The number of possible tuples (a,a,b,b,j)(a,a^{\prime},b,b^{\prime},j) is then O(log4/100Plog1/3n)O(\log^{4/100}P\log^{1/3}n^{\prime}). For each such tuple, we see from the restriction on PP that the number of PP with

Pj/log1000P<|am+am+bn+bn|<Pjlog1000PP^{j}/\log^{1000}P<|am+a^{\prime}m^{\prime}+bn+b^{\prime}n^{\prime}|<P^{j}\log^{1000}P

is at most O(log2n)O(\log_{2}n^{\prime}) (since am+am+bn+bnam+a^{\prime}m^{\prime}+bn+b^{\prime}n^{\prime} is of size O((n)2)O((n^{\prime})^{2}), say). Thus we see that the total number of PP which fail to obey (ii) is at most

O(log4/100Plog1/3nlog2n)O(\log^{4/100}P\log^{1/3}n^{\prime}\log_{2}n^{\prime})

which is negligible compared to the total number of choices, which is log2/3+ε/2n\gg\log^{2/3+\varepsilon/2}n^{\prime}. Thus we can find a choice of PP which obeys all of (i), (ii), and (iii), giving the claim. ∎

Henceforth we fix a scale PP obeying the properties in Lemma 3.1. We now introduce a relation \approx on the reals by declaring xyx\approx y if |xy|P/log1000P|x-y|\leq P/\log^{1000}P. Thus, by Lemma 3.1(ii), if am+am+bn+bn0am+a^{\prime}m^{\prime}+bn+b^{\prime}n^{\prime}\not\approx 0 for a,a,b,ba,a^{\prime},b,b^{\prime} as in Lemma 3.1(ii) then |am+am+bn+bn|Plog1000P|am+a^{\prime}m^{\prime}+bn+b^{\prime}n^{\prime}|\geq P\log^{1000}P. Also, from Lemma 3.1(iii), at least one of the statements

mmm\not\approx m^{\prime}

and

m,m,nn0m,m^{\prime},n^{\prime}-n\not\approx 0

is true.

We introduce a random variable 𝐩\mathbf{p}, which is drawn uniformly from the primes in the interval I[P,P+Plog100P]I\coloneqq[P,P+P\log^{-100}P] (note that there is at least one such prime thanks to the prime number theorem). From (1.13) we surely have

j=1({m𝐩j}+{nm𝐩j}{n𝐩j})=j=1({m𝐩j}+{nm𝐩j}{n𝐩j}).\sum_{j=1}^{\infty}\left(\left\{\frac{m}{\mathbf{p}^{j}}\right\}+\left\{\frac{n-m}{\mathbf{p}^{j}}\right\}-\left\{\frac{n}{\mathbf{p}^{j}}\right\}\right)=\sum_{j=1}^{\infty}\left(\left\{\frac{m^{\prime}}{\mathbf{p}^{j}}\right\}+\left\{\frac{n^{\prime}-m^{\prime}}{\mathbf{p}^{j}}\right\}-\left\{\frac{n^{\prime}}{\mathbf{p}^{j}}\right\}\right).

We can restrict attention to those jj with jlog1/2Pj\leq\log^{1/2}P, since the summands vanish otherwise. For any real number NN, we may take covariances of both sides of this identity with the random variable {N𝐩}\{\frac{N}{\mathbf{p}}\} to conclude that

(3.4) jlog1/2P(cj(N,m)+cj(N,nm)cj(N,n))=jlog1/2P(cj(N,m)+cj(N,nm)cj(N,n))\sum_{j\leq\log^{1/2}P}\left(c_{j}(N,m)+c_{j}(N,n-m)-c_{j}(N,n)\right)=\sum_{j\leq\log^{1/2}P}\left(c_{j}(N,m^{\prime})+c_{j}(N,n^{\prime}-m^{\prime})-c_{j}(N,n^{\prime})\right)

for any real number NN, where the covariances cj(N,M)c_{j}(N,M) are defined as

cj(N,M)\displaystyle c_{j}(N,M) 𝐄{N𝐩}{M𝐩j}𝐄{N𝐩}𝐄{M𝐩j}\displaystyle\coloneqq\mathbf{E}\left\{\frac{N}{\mathbf{p}}\right\}\left\{\frac{M}{\mathbf{p}^{j}}\right\}-\mathbf{E}\left\{\frac{N}{\mathbf{p}}\right\}\mathbf{E}\left\{\frac{M}{\mathbf{p}^{j}}\right\}
𝐄(12{N𝐩})(12{M𝐩j})𝐄(12{N𝐩})𝐄(12{M𝐩j}).\displaystyle\coloneqq\mathbf{E}\left(\frac{1}{2}-\left\{\frac{N}{\mathbf{p}}\right\}\right)\left(\frac{1}{2}-\left\{\frac{M}{\mathbf{p}^{j}}\right\}\right)-\mathbf{E}\left(\frac{1}{2}-\left\{\frac{N}{\mathbf{p}}\right\}\right)\mathbf{E}\left(\frac{1}{2}-\left\{\frac{M}{\mathbf{p}^{j}}\right\}\right).

We now compute these covariances:

Proposition 3.2 (Covariance estimates).

Let N,M{m,n,mn,m,n,nm}N,M\in\{m,n,m-n,m^{\prime},n^{\prime},n^{\prime}-m^{\prime}\}, and jj be a natural number with 1jlog1/2P1\leq j\leq\log^{1/2}P.

  • (i)

    If j2j\geq 2, then cj(N,M)log10Pc_{j}(N,M)\ll\log^{-10}P.

  • (ii)

    If j=1j=1 and N0N\approx 0 or M0M\approx 0, then cj(N,M)log1000Pc_{j}(N,M)\ll\log^{-1000}P.

  • (iii)

    If j=1j=1, N,M0N,M\not\approx 0 and there exist coprime natural numbers 1a,blog1/100P1\leq a,b\leq\log^{1/100}P such that aNbMaN\approx bM, then cj(N,M)=112ab+O(log1/1000P)c_{j}(N,M)=\frac{1}{12ab}+O(\log^{-1/1000}P).

  • (iv)

    If j=1j=1 and N,MN,M are not of the form in (ii) or (iii), then cj(N,M)log1/1000Pc_{j}(N,M)\ll\log^{-1/1000}P.

Remark 3.3.

The term 112ab\frac{1}{12ab} appearing in Proposition 3.2(iii) is also the covariance between {n𝐱}\{n{\bf x}\} and {m𝐱}\{m{\bf x}\} for 𝐱{\bf x} drawn randomly from the unit interval whenever n,mn,m are natural numbers with an=bman=bm for some coprime a,ba,b; see [24, Section 2]. Indeed, both assertions are proven by the same Fourier-analytic argument, and Proposition 3.2 endows the linear span of the six functions {N𝐩}\{\frac{N}{\mathbf{p}}\} for N{m,n,mn,m,n,nm}N\in\{m,n,m-n,m^{\prime},n^{\prime},n^{\prime}-m^{\prime}\} with an inner product closely related to the norm N()N() studied in [24], the structure of which is the key to obtaining a contradiction from our separation hypotheses on nn,mmn-n^{\prime},m-m^{\prime}.

Proof of Proposition 3.2 assuming Proposition 1.12.

We first dispose of the easy case (ii). If N0N\approx 0, then {N𝐩}log1000P\{\frac{N}{\mathbf{p}}\}\leq\log^{-1000}P, and the claim follows from the triangle inequality; similarly if M0M\approx 0 or actually if MPj/log1000PM\leq P^{j}/\log^{1000}P. Hence by Lemma 3.1(ii), we may from now on assume that

NPlog1000PandMPjlog1000P.N\geq P\log^{1000}P\quad\text{and}\quad M\geq P^{j}\log^{1000}P.

To handle the remaining cases we use the truncated Fourier expansion

(3.5) 12{x}=0<|n|N0e(nx)2πin+O(11+N0dist(x,))=0<|n|N0e(nx)2πin+O(1dist(x,)N01/2+1N01/2)\begin{split}\frac{1}{2}-\{x\}&=\sum_{0<|n|\leq N_{0}}\frac{e(nx)}{2\pi in}+O\left(\frac{1}{1+N_{0}\mathrm{dist}(x,\mathbb{Z})}\right)\\ &=\sum_{0<|n|\leq N_{0}}\frac{e(nx)}{2\pi in}+O\left(1_{\mathrm{dist}(x,\mathbb{Z})\leq N_{0}^{-1/2}}+\frac{1}{N_{0}^{1/2}}\right)\end{split}

that holds for any N01N_{0}\geq 1 (see e.g. [12, Formula (4.18)]).

Our primary tool is Proposition 1.12. Note that, for tIt\in I, logt=logP+O(log99P)\log t=\log P+O(\log^{-99}P), so that together with the prime number theorem Proposition 1.12 implies that

(3.6) 𝐄W(N𝐩,M𝐩j)=1|I|IW(Nt,Mtj)𝑑t+Oε(WC3log99P)\mathbf{E}W\left(\frac{N}{\mathbf{p}},\frac{M}{\mathbf{p}^{j}}\right)=\frac{1}{|I|}\int_{I}W\left(\frac{N}{t},\frac{M}{t^{j}}\right)\ dt+O_{\varepsilon}(\|W\|_{C^{3}}\log^{-99}P)

for any smooth 2\mathbb{Z}^{2}-periodic W:2W\colon\mathbb{R}^{2}\to\mathbb{C} and that, for any M,N=O(exp(log3/2ε/2P))M^{\prime},N^{\prime}=O(\exp(\log^{3/2-\varepsilon/2}P)),

(3.7) 𝐄e(N𝐩+M𝐩j)=1|I|Ie(Nt+Mtj)𝑑t+Oε(log99P)\mathbf{E}e\left(\frac{N^{\prime}}{\mathbf{p}}+\frac{M^{\prime}}{\mathbf{p}^{j}}\right)=\frac{1}{|I|}\int_{I}e\left(\frac{N^{\prime}}{t}+\frac{M^{\prime}}{t^{j}}\right)\ dt+O_{\varepsilon}(\log^{-99}P)

Applying (3.6) with WW a suitable cutoff localized to the region {(x,y):dist(x,)2N01/2}\{(x,y):\mathrm{dist}(x,\mathbb{Z})\leq 2N_{0}^{-1/2}\} that equals one on {(x,y):dist(x,)N01/2}\{(x,y):\mathrm{dist}(x,\mathbb{Z})\leq N_{0}^{-1/2}\} chosen so that WC3N03/2\|W\|_{C^{3}}\ll N_{0}^{3/2}, we see that, for any N0[1,log20P]N_{0}\in[1,\log^{20}P] we have

𝐏(dist(N𝐩,)N01/2)1|I|I1dist(Nt,)2N01/2𝑑t+N01/2.\mathbf{P}\left(\mathrm{dist}\left(\frac{N}{\mathbf{p}},\mathbb{Z}\right)\leq N_{0}^{-1/2}\right)\ll\frac{1}{|I|}\int_{I}1_{\mathrm{dist}(\frac{N}{t},\mathbb{Z})\leq 2N_{0}^{-1/2}}\ dt+N_{0}^{-1/2}.

Since NPlog1000PN\geq P\log^{1000}P, the first term on the right-hand side can be computed to be O(N01/2)O(N_{0}^{-1/2}). Thus

(3.8) 𝐏(dist(N𝐩,)N01/2)N01/2\mathbf{P}\left(\mathrm{dist}\left(\frac{N}{\mathbf{p}},\mathbb{Z}\right)\leq N_{0}^{-1/2}\right)\ll N_{0}^{-1/2}

and a similar argument gives

(3.9) 𝐏(dist(M𝐩j,)N01/2)N01/2.\mathbf{P}\left(\mathrm{dist}\left(\frac{M}{\mathbf{p}^{j}},\mathbb{Z}\right)\leq N_{0}^{-1/2}\right)\ll N_{0}^{-1/2}.

To prepare for the proofs of parts (i), (iii) and (iv), let us first show that, for 1jlog1/2P1\leq j\leq\log^{1/2}P, we have

(3.10) 𝐄(12{M𝐩j})log10P.\mathbf{E}\left(\frac{1}{2}-\left\{\frac{M}{\mathbf{p}^{j}}\right\}\right)\ll\log^{-10}P.

We use the Fourier expansion (3.5) with N0=log20PN_{0}=\log^{20}P. Averaging over pIp\in I and applying (3.9) to handle the first error term, we see that

𝐄(12{M𝐩j})=0<|m|log20P12πim𝐄e(mM𝐩j)+O(log10P).\mathbf{E}\left(\frac{1}{2}-\left\{\frac{M}{\mathbf{p}^{j}}\right\}\right)=\sum_{0<|m|\leq\log^{20}P}\frac{1}{2\pi im}\mathbf{E}e\left(m\frac{M}{\mathbf{p}^{j}}\right)+O(\log^{-10}P).

By the triangle inequality and (3.7), it suffices to show that, for every non-zero integer m=O(log20P)m=O(\log^{20}P),

1|I|Ie(mMtj)𝑑tlog11P.\frac{1}{|I|}\int_{I}e\left(m\frac{M}{t^{j}}\right)\ dt\ll\log^{-11}P.

Recalling that MPjlog1000PM\geq P^{j}\log^{1000}P, this estimate follows from a standard integration by parts (see e.g. [12, Lemma 8.9]). Similarly

(3.11) 𝐄(12{N𝐩})log10P.\mathbf{E}\left(\frac{1}{2}-\left\{\frac{N}{\mathbf{p}}\right\}\right)\ll\log^{-10}P.

Furthermore, using similarly (3.5), (3.8), (3.9) and (3.7), we see that, whenever 1N0log20P1\leq N_{0}\leq\log^{20}P,

(3.12) 𝐄(12{N𝐩})(12{M𝐩j})=0<|m|,|n|<N014π2mn1|I|Ie(nNt+mMtj)𝑑t+O(1N01/2).\mathbf{E}\left(\frac{1}{2}-\left\{\frac{N}{\mathbf{p}}\right\}\right)\left(\frac{1}{2}-\left\{\frac{M}{\mathbf{p}^{j}}\right\}\right)=-\sum_{0<|m|,|n|<N_{0}}\frac{1}{4\pi^{2}mn}\frac{1}{|I|}\int_{I}e\left(n\frac{N}{t}+m\frac{M}{t^{j}}\right)\ dt+O\left(\frac{1}{N_{0}^{1/2}}\right).

Now we are ready to prove (i), (iii), and (iv). Let us start with (i). In light of (3.10), (3.11) and (3.12) with N0=log20PN_{0}=\log^{20}P, it suffices to show that

1|I|Ie(nNt+mMtj)𝑑tlog11P.\frac{1}{|I|}\int_{I}e\left(n\frac{N}{t}+m\frac{M}{t^{j}}\right)\ dt\ll\log^{-11}P.

whenever n,m=O(log20P)n,m=O(\log^{20}P) are non-zero integers. Applying a change of variables t=P/st=P/s, we reduce to showing that

(3.13) 1/(1+log100P)1e(as+bsj)𝑑slog200P\int_{1/(1+\log^{-100}P)}^{1}e(as+bs^{j})\ ds\ll\log^{-200}P

(say), where anN/Pa\coloneqq nN/P and bmM/Pjb\coloneqq mM/P^{j}. By hypothesis, we have |a|,|b|log1000P|a|,|b|\geq\log^{1000}P. Since 2jlog1/2P2\leq j\leq\log^{1/2}P, the derivative a+jbsj1a+jbs^{j-1} of the phase as+bsjas+bs^{j} is at least log200P\log^{200}P outside of an interval of length at most O(log200P)O(\log^{-200}P), and (3.13) now follows from a standard integration by parts (see e.g. [12, Lemma 8.9]). This concludes the proof of (i).

Let us now turn to (iv). In light of (3.10), (3.11) and (3.12) with N0=log1/500PN_{0}=\log^{1/500}P, it suffices to show that

1|I|Ie(nN+mMt)𝑑tlog1/500P\frac{1}{|I|}\int_{I}e\left(\frac{nN+mM}{t}\right)\ dt\ll\log^{-1/500}P

whenever n,m=O(log1/500P)n,m=O(\log^{1/500}P) are non-zero integers. From the hypothesis (iv) and Lemma 3.1(ii) (after factoring out any common multiple of nn and mm), we have |nN+mM|Plog1000P|nN+mM|\geq P\log^{1000}P. The claim (iv) now follows from integration by parts.

Finally we show (iii). In light of (3.10), (3.11) and (3.12) with N0=log1/500PN_{0}=\log^{1/500}P, it suffices to show that

0<|n|,|m|log1/500P14π2mn1|I|Ie(nN+mMt)𝑑t=112ab+O(log1/1000P).-\sum_{0<|n|,|m|\leq\log^{1/500}P}\frac{1}{4\pi^{2}mn}\frac{1}{|I|}\int_{I}e\left(\frac{nN+mM}{t}\right)\ dt=\frac{1}{12ab}+O(\log^{-1/1000}P).

Let us first consider those n,m=O(log1/500P)n,m=O(\log^{1/500}P) for which nN+mM0nN+mM\not\approx 0. By Lemma 3.1(ii) |nN+mM|Plog1000P|nN+mM|\geq P\log^{1000}P and similarly to case (iv), the contribution of such pairs (n,m)(n,m) is acceptable.

Consider now the case nNmMnN\approx-mM for some non-zero integers n,m=O(log1/500P)n,m=O(\log^{1/500}P). By assumption also aNbMaN\approx bM for some co-prime positive integers a,blog1/100Pa,b\leq\log^{1/100}P. and hence by Lemma 3.1(ii) amMbnM-amM\approx bnM which contradicts the assumption M0M\not\approx 0 unless (n,m)(n,m) is a multiple of (a,b)(a,-b). On the other hand if (n,m)(n,m) is a multiple of (a,b)(a,-b), then nNmMnN\approx-mM by Lemma 3.1(ii).

Thus it remains to show that

0<|k|log1/500Pmax{a,b}14π2k2ab1|I|Ie(kaNkbMt)𝑑t=112ab+O(log1/1000P).\sum_{0<|k|\leq\frac{\log^{1/500}P}{\max\{a,b\}}}\frac{1}{4\pi^{2}k^{2}ab}\frac{1}{|I|}\int_{I}e\left(\frac{kaN-kbM}{t}\right)\ dt=\frac{1}{12ab}+O(\log^{-1/1000}P).

Since aNbMaN\approx bM we have, for every klog1/500Pk\leq\log^{1/500}P,

1|I|Ie(kaNkbMt)𝑑t=1O(log100P)\frac{1}{|I|}\int_{I}e\left(\frac{kaN-kbM}{t}\right)\ dt=1-O(\log^{-100}P)

and so it suffices to show that

0<|k|log1/500Pmax{a,b}14π2k2ab=112ab+O(log1/1000P)\sum_{0<|k|\leq\frac{\log^{1/500}P}{\max\{a,b\}}}\frac{1}{4\pi^{2}k^{2}ab}=\frac{1}{12ab}+O(\log^{-1/1000}P)

This is trivial for ablog1/1000Pab\geq\log^{1/1000}P. For ablog1/1000Pab\leq\log^{1/1000}P the claim follows from the Basel identity

k=11k2=π26\sum_{k=1}^{\infty}\frac{1}{k^{2}}=\frac{\pi^{2}}{6}

and the tail bound

klog1/1000P1k2log1/1000P.\sum_{k\geq\log^{1/1000}P}\frac{1}{k^{2}}\ll\log^{-1/1000}P.

Now we can get back to proving Proposition 1.9 assuming Proposition 1.12. From Proposition 3.2(i) and (3.4) we see that

(3.14) c1(N,m)+c1(N,nm)c1(N,n)=c1(N,m)+c1(N,nm)c1(N,n)+O(δ)c_{1}(N,m)+c_{1}(N,n-m)-c_{1}(N,n)=c_{1}(N,m^{\prime})+c_{1}(N,n^{\prime}-m^{\prime})-c_{1}(N,n^{\prime})+O(\delta)

for N{m,n,nm,m,n,mn}N\in\{m,n,n-m,m^{\prime},n^{\prime},m^{\prime}-n^{\prime}\}, where for brevity we introduce the error tolerance

δlog1/1000P.\delta\coloneqq\log^{-1/1000}P.

We can now arrive at the desired contradiction by some case analysis (reminiscent of that in [24, 25]) using the remaining portions of Proposition 3.2, as follows.

Case m0m^{\prime}\approx 0

Applying (3.14) with N=mN=m, we conclude from Proposition 3.2(ii) that

(3.15) c1(m,m)+c1(m,nm)c1(m,n)=c1(m,nm)c1(m,n)+O(δ).c_{1}(m,m)+c_{1}(m,n-m)-c_{1}(m,n)=c_{1}(m,n^{\prime}-m^{\prime})-c_{1}(m,n^{\prime})+O(\delta).

From Lemma 3.1(iii) we have m0m\not\approx 0 (and hence also nm,nm,n0n-m,n^{\prime}-m^{\prime},n^{\prime}\not\approx 0, since these quantities are greater than or equal to mm), hence by Proposition 3.2(iii) we have c1(m,m)=112+O(δ)c_{1}(m,m)=\frac{1}{12}+O(\delta). Furthermore, since m0m^{\prime}\approx 0, we see from Lemma 3.1(ii) that, for 1a,blog1/100P1\leq a,b\leq\log^{1/100}P, amb(nm)am\approx b(n^{\prime}-m^{\prime}) if and only if ambnam\approx bn^{\prime}. Hence Proposition 3.2(iii) (iv) implies that

c1(m,nm)=c1(m,n)+O(δ).c_{1}(m,n^{\prime}-m^{\prime})=c_{1}(m,n^{\prime})+O(\delta).

Plugging these facts into (3.15) and rearranging, we obtain

112+c1(m,nm)=c1(m,n)+O(δ).\frac{1}{12}+c_{1}(m,n-m)=c_{1}(m,n)+O(\delta).

But by Proposition 3.2(iii), (iv) we know that c1(m,nm)O(δ)c_{1}(m,n-m)\geq-O(\delta), so that

c1(m,n)112+O(δ)c_{1}(m,n)\geq\frac{1}{12}+O(\delta)

But since mnm\not\approx n (because mn/2m\leq n/2 and m0m\not\approx 0), another application of Proposition 3.2(iii), (iv) gives

c1(m,n)12112+O(δ),c_{1}(m,n)\leq\frac{1}{2}\frac{1}{12}+O(\delta),

which is a contradiction.

Since mm^{\prime} was the smallest element of {m,n,nm,m,n,mn}\{m,n,n-m,m^{\prime},n^{\prime},m^{\prime}-n^{\prime}\}, we now thus have N0N\not\approx 0 for all N{m,n,nm,m,n,mn}N\in\{m,n,n-m,m^{\prime},n^{\prime},m^{\prime}-n^{\prime}\}, and case (ii) of Proposition 3.2 no longer applies.

Case mmm\not\approx m^{\prime} and m0m^{\prime}\not\approx 0

We apply (3.14) with N=mN=m^{\prime} to conclude that

(3.16) c1(m,m)+c1(m,nm)c1(m,n)=c1(m,m)+c1(m,nm)c1(m,n)+O(δ).c_{1}(m^{\prime},m)+c_{1}(m^{\prime},n-m)-c_{1}(m^{\prime},n)=c_{1}(m^{\prime},m^{\prime})+c_{1}(m^{\prime},n^{\prime}-m^{\prime})-c_{1}(m^{\prime},n^{\prime})+O(\delta).

Now if there are no co-prime positive integers a,blog1/100Pa,b\leq\log^{1/100}P such that ambnam^{\prime}\approx bn^{\prime} or amb(nm)am^{\prime}\approx b(n^{\prime}-m^{\prime}), then by Proposition 3.2(iv) we have

c1(m,nm)c1(m,n)=O(δ)c_{1}(m^{\prime},n^{\prime}-m^{\prime})-c_{1}(m^{\prime},n^{\prime})=O(\delta)

On the other hand, if such co-prime integers exist, then ambnam^{\prime}\approx bn^{\prime} if and only if (ab)mb(nm)(a-b)m^{\prime}\approx b(n^{\prime}-m^{\prime}) and necessarily a>ba>b, so that by Proposition 3.2(iii) we have in this case

(3.17) c1(m,nm)c1(m,n)=112(ab)b112abO(δ)O(δ).c_{1}(m^{\prime},n^{\prime}-m^{\prime})-c_{1}(m^{\prime},n^{\prime})=\frac{1}{12(a-b)b}-\frac{1}{12ab}-O(\delta)\geq-O(\delta).

Since Proposition 3.2(iii) also gives c1(m,m)1/12+O(δ)c_{1}(m^{\prime},m^{\prime})\geq 1/12+O(\delta), combining with (3.16) we obtain that

(3.18) c1(m,m)+c1(m,nm)c1(m,n)112O(δ).c_{1}(m^{\prime},m)+c_{1}(m^{\prime},n-m)-c_{1}(m^{\prime},n)\geq\frac{1}{12}-O(\delta).

On the other hand, since mmm^{\prime}\not\approx m, we also have mnmm^{\prime}\not\approx n-m since nmm>mn-m\geq m>m^{\prime}. By Proposition 3.2(iii), (iv), we have

c1(m,m)+c1(m,nm)11212+11212+O(δ),c_{1}(m^{\prime},m)+c_{1}(m^{\prime},n-m)\leq\frac{1}{12}\cdot\frac{1}{2}+\frac{1}{12}\cdot\frac{1}{2}+O(\delta),

which can be improved to

(3.19) c1(m,m)+c1(m,nm)11213+11212+O(δ),c_{1}(m^{\prime},m)+c_{1}(m^{\prime},n-m)\leq\frac{1}{12}\cdot\frac{1}{3}+\frac{1}{12}\cdot\frac{1}{2}+O(\delta),

unless both m2mm\approx 2m^{\prime} and nm2mn-m\approx 2m^{\prime}. Since by Proposition 3.2(iii), (iv) we have c1(m,n)O(δ)c_{1}(m^{\prime},n)\geq-O(\delta), the estimate (3.19) contradicts (3.18).

Hence we must have both m2mm\approx 2m^{\prime} and nm2mn-m\approx 2m^{\prime}. But then Lemma 3.1(ii) forces n4mn\approx 4m^{\prime}, hence by Proposition 3.2(iii)

c1(m,m)+c1(m,nm)c1(m,n)=11212+1121211214+O(δ),c_{1}(m^{\prime},m)+c_{1}(m^{\prime},n-m)-c_{1}(m^{\prime},n)=\frac{1}{12}\cdot\frac{1}{2}+\frac{1}{12}\cdot\frac{1}{2}-\frac{1}{12}\cdot\frac{1}{4}+O(\delta),

and we again contradict (3.18).

Case mmm\approx m^{\prime} and m0m^{\prime}\not\approx 0

By Lemma 3.1(iii), we must have nnn\not\approx n^{\prime}. We apply (3.14) for N=nN=n to obtain

(3.20) c1(n,m)+c1(n,nm)c1(n,n)=c1(n,m)+c1(n,nm)c1(n,n)+O(δ).c_{1}(n,m)+c_{1}(n,n-m)-c_{1}(n,n)=c_{1}(n,m^{\prime})+c_{1}(n,n^{\prime}-m^{\prime})-c_{1}(n,n^{\prime})+O(\delta).

Since mmm\approx m^{\prime}, we have by Proposition 3.2(iii), (iv) (using also Lemma 3.1(ii)) that c1(n,m)=c1(n,m)+O(δ)c_{1}(n,m)=c_{1}(n,m^{\prime})+O(\delta). Proposition 3.2(iii) also gives c1(n,n)=1/12+O(δ)c_{1}(n,n)=1/12+O(\delta). Plugging these into (3.20) and rearranging, we obtain

(3.21) c1(n,nm)+c1(n,n)=112+c1(n,nm)+O(δ).c_{1}(n,n-m)+c_{1}(n,n^{\prime})=\frac{1}{12}+c_{1}(n,n^{\prime}-m^{\prime})+O(\delta).

Since nnn\not\approx n^{\prime} and m0m\not\approx 0, we see from Proposition 3.2(iii), (iv) that

(3.22) c1(n,nm)+c1(n,n)11212+11212+O(δ)c_{1}(n,n-m)+c_{1}(n,n^{\prime})\leq\frac{1}{12}\cdot\frac{1}{2}+\frac{1}{12}\cdot\frac{1}{2}+O(\delta)

which can be improved to

(3.23) c1(n,nm)+c1(n,n)11213+11212+O(δ)c_{1}(n,n-m)+c_{1}(n,n^{\prime})\leq\frac{1}{12}\cdot\frac{1}{3}+\frac{1}{12}\cdot\frac{1}{2}+O(\delta)

unless 2(nm)n2(n-m)\approx n and n2nn^{\prime}\approx 2n. Now (3.23) contradicts (3.21) since by Proposition 3.2(iii), (iv) c1(n,nm)O(δ)c_{1}(n,n^{\prime}-m^{\prime})\geq-O(\delta).

Hence we can assume that 2(nm)n2(n-m)\approx n and n2nn^{\prime}\approx 2n. But using mmm\approx m^{\prime} and Lemma 3.1(ii) this implies that 2(nm)3n2(n^{\prime}-m^{\prime})\approx 3n, so that by (3.21) and Proposition 3.2(iii) we obtain

c1(n,nm)+c1(n,n)=112+c1(n,nm)+O(δ)=112+112123+O(δ).c_{1}(n,n-m)+c_{1}(n,n^{\prime})=\frac{1}{12}+c_{1}(n,n^{\prime}-m^{\prime})+O(\delta)=\frac{1}{12}+\frac{1}{12}\cdot\frac{1}{2\cdot 3}+O(\delta).

contradicting (3.22).

Remark 3.4.

Morally speaking, the ability to obtain a contradiction here reflects the fact that one cannot have an identity of the form

(3.24) {mx}+{(nm)x}{nx}={mx}+{(nm)x}{nx}\{mx\}+\{(n-m)x\}-\{nx\}=\{m^{\prime}x\}+\{(n^{\prime}-m^{\prime})x\}-\{n^{\prime}x\}

for almost all real numbers xx and some integers 1mn/21\leq m\leq n/2, 1mn/21\leq m^{\prime}\leq n^{\prime}/2 unless one has both m=mm=m^{\prime} and n=nn=n^{\prime} (this type of connection goes back to Landau [17, p. 116]). This latter fact is easily established by inspecting the jump discontinuities of both sides of (3.24), but it is also possible to establish it by computing the covariances of both sides of (3.24) with {Nx}\{Nx\} for various choices of NN, and the arguments above can be viewed as an adaptation of this latter method.

It remains to establish Proposition 1.12. This will be established in the next section.

4. Equidistribution

In this section we prove Proposition 1.12. Fix ε,A\varepsilon,A. We may assume that PP is sufficiently large depending on ε,A\varepsilon,A, as the claim is trivial otherwise. If we have PjMlogAPP^{j}\geq M\log^{A}P then we can replace in both parts of the proposition MPj\frac{M}{P^{j}} by 0 with negligible error, so we may assume that either M=0M=0 or Pj<MlogAPP^{j}<M\log^{A}P. In either event we may thus assume that jlog1/2Pj\leq\log^{1/2}P. Next, by partitioning II into at most log100P\log^{100}P intervals of length at most Plog100PP\log^{-100}P and using the triangle inequality, it suffices (after suitable adjustment of PP, AA) to assume that I[P,P+Plog100P]I\subset[P,P+P\log^{-100}P]. In particular we have

(4.1) Pj1tj12Pj1P^{j-1}\leq t^{j-1}\leq 2P^{j-1}

for all tIt\in I.

Let us first reduce Proposition 1.12(ii) to Proposition 1.12(i). We perform a Fourier expansion

W(x,y)=n,mcn,me(nx+my)W(x,y)=\sum_{n,m\in\mathbb{Z}}c_{n,m}e(nx+my)

where by integration by parts the Fourier coefficients

cn,m=2/2W(x,y)e(nxmy)𝑑x𝑑yc_{n,m}=\int_{\mathbb{R}^{2}/\mathbb{Z}^{2}}W(x,y)e(-nx-my)\ dxdy

obey the bounds

|cn,m|WC3(1+|n|+|m|)3.|c_{n,m}|\ll\|W\|_{C^{3}}(1+|n|+|m|)^{-3}.

By the triangle inequality, the contributions of those frequencies n,mn,m with |n|+|m|log2AP|n|+|m|\geq\log^{2A}P is then acceptable. By a further application of the triangle inequality, Proposition 1.12(ii) follows from showing that

pIe(nNp+mMpj)=Ie(nNt+mMtj)dtlogt+Oε,A(Plog10AP)\sum_{p\in I}e\left(n\frac{N}{p}+m\frac{M}{p^{j}}\right)=\int_{I}e\left(n\frac{N}{t}+m\frac{M}{t^{j}}\right)\ \frac{dt}{\log t}+O_{\varepsilon,A}(P\log^{-10A}P)

whenever n,mn,m are integers with |n|+|m|log2AP|n|+|m|\leq\log^{2A}P. But this follows from Proposition 1.12(i) by adjusting the values of ε,A,M,N\varepsilon,A,M,N suitably.

The proof of part (i) will use the standard tools of Vaughan’s identity and Vinogradov’s exponential sum estimates. We state a suitable form of the latter tool here:

Lemma 4.1 (Vinogradov’s exponential sum estimate).

Let X2X\geq 2, FX4F\geq X^{4}, and α1\alpha\geq 1. Let I[X,2X]I\subset[X,2X] be an interval. Let f(x)f(x) be a smooth function on II satisfying for all tIt\in I

(4.2) αr3Ftrr!|f(r)(t)|αr3F\displaystyle\alpha^{-r^{3}}F\leq\frac{t^{r}}{r!}|f^{(r)}(t)|\leq\alpha^{r^{3}}F

for all integers 1r10logF/(logX)+1.1\leq r\leq 10\lceil\log F/(\log X)\rceil+1. Assume further that

(4.3) (logα)(logF)2(logX)3<103.\displaystyle(\log\alpha)\frac{(\log F)^{2}}{(\log X)^{3}}<10^{-3}.

Then we have

(4.4) nIe(f(n))αXexp(218(logX)3/(logF)2),\displaystyle\sum_{n\in I}e(f(n))\ll\alpha X\exp(-2^{-18}(\log X)^{3}/(\log F)^{2}),

where the implied constant is absolute.

Proof.

This is essentially [12, Theorem 8.25] with minor modifications (the modification needed is that we only assume (4.2) for rr in a certain range, not all integers r1r\geq 1.).

Let R:=10logF/(logX)R:=10\lceil\log F/(\log X)\rceil, and as in [12, p. 217], let

Fn(q):=0rRαr(n)qr,αr(n)f(r)(n)r!.\displaystyle F_{n}(q):=\sum_{0\leq r\leq R}\alpha_{r}(n)q^{r},\quad\alpha_{r}(n)\coloneqq\frac{f^{(r)}(n)}{r!}.

Let Sf(I)S_{f}(I) denote the sum in (4.4). By Taylor’s formula, for any q1q\geq 1 we have

Sf(I)=nIe(Fn(q))+O(q+XqR+1maxtI|f(R+1)(t)|(R+1)!).\displaystyle S_{f}(I)=\sum_{n\in I}e(F_{n}(q))+O\left(q+Xq^{R+1}\frac{\max_{t\in I}|f^{(R+1)}(t)|}{(R+1)!}\right).

Let 𝒬:={xy:1xV,1yV}\mathcal{Q}:=\{xy:1\leq x\leq V,1\leq y\leq V\}\cap\mathbb{N}, where 𝒬\mathcal{Q} is interpreted as a multiset. Also let Q=|𝒬|=V2Q=|\mathcal{Q}|=V^{2}. Then

Sf(I)=nI|𝒬|1q𝒬e(Fn(q))+O(Q+XQR+1maxtI|f(R+1)(t)|(R+1)!).\displaystyle S_{f}(I)=\sum_{n\in I}|\mathcal{Q}|^{-1}\sum_{q\in\mathcal{Q}}e(F_{n}(q))+O\left(Q+XQ^{R+1}\frac{\max_{t\in I}|f^{(R+1)}(t)|}{(R+1)!}\right).

We take V=X1/4V=X^{1/4} in which case by (4.2) the error term is

(4.5) X1/2+XX(R+1)/2α(R+1)3F/XR+1X1/2+X(R+1)/4α(R+1)3(FX1(R+1)/4).\begin{split}&\ll X^{1/2}+X\cdot X^{(R+1)/2}\alpha^{(R+1)^{3}}F/X^{R+1}\\ &\ll X^{1/2}+X^{-(R+1)/4}\alpha^{(R+1)^{3}}\cdot(FX^{1-(R+1)/4}).\end{split}

The term in the parenthesis is FX3/4F10/41\leq FX^{3/4}F^{-10/4}\leq 1. Using also (4.3) we see that (4.5) is X1/2\ll X^{1/2} which is in particular smaller than the right-hand side of (4.4). The sum q𝒬e(Fn(q))\sum_{q\in\mathcal{Q}}e(F_{n}(q)) is precisely the one estimated in [12, pp. 217–225]. The only assumption needed of ff in that argument is (4.2), and the only restriction on FF and XX there is FX4F\geq X^{4}. Hence, we conclude that the lemma holds by following the analysis there verbatim. ∎

We now apply this estimate to obtain an estimate for an exponential sum over integers.

Proposition 4.2 (Exponential sums over integers).

Let ε>0\varepsilon>0, A1A\geq 1, X2X\geq 2, 2jlog1/2X2\leq j\ll\log^{1/2}X, and let N,MN,M be real numbers with N,Mexp(O(log3/2εX))N,M\ll\exp(O(\log^{3/2-\varepsilon}X)). Let II be an interval in [X,X+Xlog100X][X,X+X\log^{-100}X]. Then

(4.6) nIe(Nn+Mnj)ε,AX(1+F)clogO(A)X+XlogAX\sum_{n\in I}e\left(\frac{N}{n}+\frac{M}{n^{j}}\right)\ll_{\varepsilon,A}X(1+F)^{-c}\log^{O(A)}X+X\log^{-A}X

for some absolute constant c>0c>0, where

F|N|X+|M|Xj.F\coloneqq\frac{|N|}{X}+\frac{|M|}{X^{j}}.
Proof.

We may assume without loss of generality that AA is sufficiently large, and XX is sufficiently large depending on ε,A\varepsilon,A. By hypothesis we have Fexp(O(log3/2εX))F\ll\exp(O(\log^{3/2-\varepsilon}X)). We may assume that FlogCAXF\geq\log^{CA}X for a large absolute constant CC, since the claim is trivial otherwise.

Let f:If\colon I\to\mathbb{R} denote the phase function

f(t)Nt+Mtj.f(t)\coloneqq\frac{N}{t}+\frac{M}{t^{j}}.

Then for any r1r\geq 1 and tIt\in I we have

(4.7) trr!|f(r)(t)|=|Nt+Mrtj|X1|N+Mr/tj1|\frac{t^{r}}{r!}|f^{(r)}(t)|=\left|\frac{N}{t}+\frac{M_{r}}{t^{j}}\right|\asymp X^{-1}|N+M_{r}/t^{j-1}|

where

Mr(r+j1j1)M.M_{r}\coloneqq\binom{r+j-1}{j-1}M.

Since

1(r+j1j1)(r+j)r=exp(rlog(r+j))=exp(O(r2log2X))1\leq\binom{r+j-1}{j-1}\leq(r+j)^{r}=\exp(r\log(r+j))=\exp(O(r^{2}\log_{2}X))

we conclude that

Mr=exp(O(r2log2X))MM_{r}=\exp(O(r^{2}\log_{2}X))M

and

|N|X+|Mr|Xj=exp(O(r2log2X))F.\frac{|N|}{X}+\frac{|M_{r}|}{X^{j}}=\exp(O(r^{2}\log_{2}X))F.

If |Mr||N|Xj1/4|M_{r}|\leq|N|X^{j-1}/4 then from the triangle inequality and (4.1) we have

X1|N+Mr/tj1|X1|N|=exp(O(r2log2X))F.X^{-1}|N+M_{r}/t^{j-1}|\asymp X^{-1}|N|=\exp(O(r^{2}\log_{2}X))F.

Consider then the case |Mr|>|N|Xj1/4|M_{r}|>|N|X^{j-1}/4. We have the upper bound

X1|N+Mr/tj1||Mr|Xjexp(O(r2log2X))FX^{-1}|N+M_{r}/t^{j-1}|\ll\frac{|M_{r}|}{X^{j}}\ll\exp(O(r^{2}\log_{2}X))F

for all tIt\in I from the triangle inequality. Furthermore, since the function t1/tj1t\mapsto-1/t^{j-1} has derivative j/Xj\asymp j/X^{j} on II, we also have, for all tt outside of an interval of length O(Xlog2AX)O(X\log^{-2A}X), the lower bound

X1|N+Mr/tj1||Mr|Xjlog3AXexp(O(r2log2X))Flog3AX.X^{-1}|N+M_{r}/t^{j-1}|\gg\frac{|M_{r}|}{X^{j}}\log^{-3A}X\gg\exp(O(r^{2}\log_{2}X))F\log^{-3A}X.

If we set αlog4AX\alpha\coloneqq\log^{4A}X and AA is sufficiently large, then we conclude from (4.7) and the bounds above that the estimate (4.2) holds for all 1rlogX1\leq r\leq\log X and all tIt\in I outside the union of O(logX)O(\log X) intervals of length O(Xlog2AX)O(X\log^{-2A}X). The contribution of these exceptional intervals to (4.6) is negligible, and removing them splits II up into at most O(logX)O(\log X) subintervals, so by the triangle inequality it suffices to show that

nIe(Nn+Mnj)ε,AXlog2AX\sum_{n\in I^{\prime}}e\left(\frac{N}{n}+\frac{M}{n^{j}}\right)\ll_{\varepsilon,A}X\log^{-2A}X

for any subinterval II^{\prime} with the property that (4.2) holds for all tIt\in I^{\prime} and 1rlogX1\leq r\leq\log X. If FX4F\geq X^{4}, we may apply Lemma 4.1 to conclude that

nIe(Nn+Mnj)Xlog4AXexp(clog2εX)\sum_{n\in I^{\prime}}e\left(\frac{N}{n}+\frac{M}{n^{j}}\right)\ll X\log^{4A}X\exp(-c\log^{2\varepsilon}X)

for some absolute constant c>0c>0, and the claim follows. If instead F<X4F<X^{4}, we can apply the Weyl inequality [12, Theorem 8.4] with k=5k=5 to conclude that

nIe(Nn+Mnj)αO(1)(F/X5+1/F)cXlogX\sum_{n\in I^{\prime}}e\left(\frac{N}{n}+\frac{M}{n^{j}}\right)\ll\alpha^{O(1)}(F/X^{5}+1/F)^{c}X\log X

for some absolute constant c>0c>0; since FlogCAXF\geq\log^{CA}X, we obtain the claim by taking CC large enough. ∎

Now we prove Proposition 1.12(i). We may assume without loss of generality that j2j\geq 2, since for j=1j=1 we can absorb the MM terms into the NN term (and add a dummy term with M=0M=0 and j=2j=2, say). By summation by parts (see e.g. [19, Lemma 2.2]), and adjusting AA as necessary, it suffices to show that

pIe(Np+Mpj)logp=Ie(Nt+Mtj)𝑑t+Oε,A(Plog10AP)\sum_{p\in I}e\left(\frac{N}{p}+\frac{M}{p^{j}}\right)\log p=\int_{I}e\left(\frac{N}{t}+\frac{M}{t^{j}}\right)\ dt+O_{\varepsilon,A}(P\log^{-10A}P)

for all intervals I[P,P+Plog100P]I\subset[P,P+P\log^{-100}P]. This is equivalent to

nIe(Nn+Mnj)Λ(n)=Ie(Nt+Mtj)𝑑t+Oε,A(Plog10AP),\sum_{n\in I}e\left(\frac{N}{n}+\frac{M}{n^{j}}\right)\Lambda(n)=\int_{I}e\left(\frac{N}{t}+\frac{M}{t^{j}}\right)\ dt+O_{\varepsilon,A}(P\log^{-10A}P),

where Λ\Lambda is the von Mangoldt function, since the contribution of the prime powers is negligible. We introduce the quantity

F|N|P+|M|Pj.F\coloneqq\frac{|N|}{P}+\frac{|M|}{P^{j}}.

If FlogCAPF\leq\log^{CA}P for some large absolute constant C>0C>0, then the total variation of the phase tNt+Mtjt\mapsto\frac{N}{t}+\frac{M}{t^{j}} is O(logCAP)O(\log^{CA}P), and the claim readily follows from a further summation by parts (see e.g. [19, Lemma 2.2]) and the prime number theorem (with classical error term). Thus we may assume that

(4.8) F>logCAP.F>\log^{CA}P.

In this case, a change of variables t=P/st=P/s gives

Ie(Nt+Mtj)𝑑t=PP/Ie(NPs+MPjsj)dss2.\int_{I}e\left(\frac{N}{t}+\frac{M}{t^{j}}\right)\ dt=-P\int_{P/I}e\left(\frac{N}{P}s+\frac{M}{P^{j}}s^{j}\right)\ \frac{ds}{s^{2}}.

The derivative of the phase here is N/P+jsj1M/PjN/P+js^{j-1}M/P^{j} which, once CC is large enough, is log10AP\geq\log^{10A}P for all sP/Is\in P/I apart from an interval of length at most O(log10AP)O(\log^{-10A}P). Hence by partial integration we get that

Ie(Nt+Mtj)𝑑tPlog10AP\int_{I}e\left(\frac{N}{t}+\frac{M}{t^{j}}\right)\ dt\ll P\log^{-10A}P

if CC is large enough, so it remains to establish the bound

nIe(Nn+Mnj)Λ(n)Plog10AP\sum_{n\in I}e\left(\frac{N}{n}+\frac{M}{n^{j}}\right)\Lambda(n)\ll P\log^{-10A}P

under the hypothesis (4.8).

By Vaughan’s identity in the form of [12, Proposition 13.4] (with y=z=P1/3y=z=P^{1/3}), followed by a shorter-than-dyadic decomposition, we can write

Λ(n)=rR(αr1(n)+αrlog(n)+βrγr(n))\displaystyle\Lambda(n)=\sum_{r\leq R}(\alpha_{r}*1(n)+\alpha_{r}^{\prime}*\log(n)+\beta_{r}*\gamma_{r}(n))

for n[P,2P]n\in[P,2P], where * denotes Dirichlet convolution, and

R\displaystyle R logO(1)P,\displaystyle\ll\log^{O(1)}P,
|αr(n)|,|αr(n)|,|βr(n)|,|γr(n)|\displaystyle|\alpha_{r}(n)|,|\alpha^{\prime}_{r}(n)|,|\beta_{r}(n)|,|\gamma_{r}(n)| logP,\displaystyle\ll\log P,
supp(αr),supp(αr)\displaystyle\mathrm{supp}(\alpha_{r}),\mathrm{supp}(\alpha^{\prime}_{r}) [Mr,(1+log100P)Mr],\displaystyle\subset[M_{r},(1+\log^{-100}P)M_{r}],
supp(βr)\displaystyle\mathrm{supp}(\beta_{r}) [Kr,(1+log100P)Kr],\displaystyle\subset[K_{r},(1+\log^{-100}P)K_{r}],
supp(γr)\displaystyle\mathrm{supp}(\gamma_{r}) [Nr,(1+log100P)Nr],\displaystyle\subset[N_{r},(1+\log^{-100}P)N_{r}],
1Mr\displaystyle 1\leq M_{r} P2/3;\displaystyle\ll P^{2/3};
P1/3Kr,Nr\displaystyle P^{1/3}\ll K_{r},N_{r} P2/3\displaystyle\ll P^{2/3}

(the bound for the coefficients arising from Vaughan’s identiy is logP\ll\log P since 1Λ=log1\ast\Lambda=\log). By the triangle inequality, it thus suffices to establish the Type I estimates

(4.9) nIe(Nn+Mnj)(αr1)(n)ε,APlog11AP\sum_{n\in I}e\left(\frac{N}{n}+\frac{M}{n^{j}}\right)(\alpha_{r}*1)(n)\ll_{\varepsilon,A}P\log^{-11A}P

and

(4.10) nIe(Nn+Mnj)(αrlog)(n)ε,APlog11A+1P\sum_{n\in I}e\left(\frac{N}{n}+\frac{M}{n^{j}}\right)(\alpha^{\prime}_{r}*\log)(n)\ll_{\varepsilon,A}P\log^{-11A+1}P

as well as the Type II estimates

(4.11) nIe(Nn+Mnj)(βrγr)(n)ε,APlog11AP\sum_{n\in I}e\left(\frac{N}{n}+\frac{M}{n^{j}}\right)(\beta_{r}*\gamma_{r})(n)\ll_{\varepsilon,A}P\log^{-11A}P

for all 1rR1\leq r\leq R and I[P,P+Plog100P]I\subset[P,P+P\log^{-100}P]. The second Type I estimate (4.10) follows from the first Type I estimate (4.9) (replacing αr\alpha_{r} with αr\alpha^{\prime}_{r}) and a summation by parts (see e.g. [19, Lemma 2.2]), so it suffices to establish (4.9) and (4.11).

We begin with (4.9). By the triangle inequality, the left-hand side is bounded by

logPm[Mr,(1+log100P)Mr]|n1mIe(Nmn+Mmjnj)|.\ll\log P\sum_{m\in[M_{r},(1+\log^{-100}P)M_{r}]}\left|\sum_{n\in\frac{1}{m}\cdot I}e\left(\frac{N}{mn}+\frac{M}{m^{j}n^{j}}\right)\right|.

Applying Proposition 4.2 with X=P/mX=P/m and N/mN/m and M/mjM/m^{j} in place of NN and MM, we can bound this by

ε,APlogP((1+F)clogO(A)P+log20AP)\ll_{\varepsilon,A}P\log P\left((1+F)^{c}\log^{O(A)}P+\log^{-20A}P\right)

for some constant c>0c>0, and the claim now follows from (4.8).

Now we establish (4.11). We can assume that KrNrPK_{r}N_{r}\asymp P, as the sum vanishes otherwise. By the triangle inequality, the left-hand side is bounded by

logPm[Kr,(1+log100P)Kr]|n1mIγr(n)e(Nmn+Mmjnj)|.\ll\log P\sum_{m\in[K_{r},(1+\log^{-100}P)K_{r}]}\left|\sum_{n\in\frac{1}{m}\cdot I}\gamma_{r}(n)e\left(\frac{N}{mn}+\frac{M}{m^{j}n^{j}}\right)\right|.

By Cauchy–Schwarz it suffices to show that

m[Kr,(1+log100P)Kr]|n1mIγr(n)e(Nmn+Mmjnj)|2KrNr2log30AP\sum_{m\in[K_{r},(1+\log^{-100}P)K_{r}]}\left|\sum_{n\in\frac{1}{m}\cdot I}\gamma_{r}(n)e\left(\frac{N}{mn}+\frac{M}{m^{j}n^{j}}\right)\right|^{2}\ll K_{r}N^{2}_{r}\log^{-30A}P

(say). Rearranging, it suffices to show that

(4.12) n,n[Nr,(1+log100P)Nr]γr(n)γr(n)¯Xn,nKrNr2log30AP\sum_{n,n^{\prime}\in[N_{r},(1+\log^{-100}P)N_{r}]}\gamma_{r}(n)\overline{\gamma_{r}(n^{\prime})}X_{n,n^{\prime}}\ll K_{r}N^{2}_{r}\log^{-30A}P

where

Xn,nm[Kr,(1+log100P)Kr]1nI1nIe(N(nn)nnm+M((n)jnj)nj(n)jmj).X_{n,n^{\prime}}\coloneqq\sum_{m\in[K_{r},(1+\log^{-100}P)K_{r}]\cap\frac{1}{n}\cdot I\cap\frac{1}{n^{\prime}}\cdot I}e\left(\frac{N(n^{\prime}-n)}{nn^{\prime}m}+\frac{M((n^{\prime})^{j}-n^{j})}{n^{j}(n^{\prime})^{j}m^{j}}\right).

By Proposition 4.2, we have

Xn,nε,AKr((1+|nn|NrF)clogO(A)P+log40AP)X_{n,n^{\prime}}\ll_{\varepsilon,A}K_{r}\left(\left(1+\frac{|n^{\prime}-n|}{N_{r}}F\right)^{-c}\log^{O(A)}P+\log^{-40A}P\right)

for some absolute constant 0<c<10<c<1. Bounding γr(n)γr(n)¯log2P\gamma_{r}(n)\overline{\gamma_{r}(n^{\prime})}\ll\log^{2}P and noting that

n[Nr,(1+log100P)Nr](1+|nn|NrF)cNrFc\sum_{n\in[N_{r},(1+\log^{-100}P)N_{r}]}\left(1+\frac{|n^{\prime}-n|}{N_{r}}F\right)^{-c}\ll N_{r}F^{-c}

for all n[Nr,(1+log100P)Nr]n^{\prime}\in[N_{r},(1+\log^{-100}P)N_{r}], we obtain the claim (4.12) from (4.8). This completes the proof of Proposition 1.12.

5. Multiplicity of the falling factorial

In this section we establish Theorem 1.8. We first observe that if 1mn1\leq m\leq n solves (1.8) for some sufficiently large tt, then

t=(n)m(m)m=m!(m/e)mt=(n)_{m}\geq(m)_{m}=m!\gg(m/e)^{m}

by Stirling’s formula. Hence we have an analogue of (1.5):

(5.1) mlogtlog2tm\ll\frac{\log t}{\log_{2}t}

Next, since

(nm)m<(n)mnm(n-m)^{m}<(n)_{m}\leq n^{m}

we have

(5.2) t1/mn<t1/m+mt^{1/m}\leq n<t^{1/m}+m

and we obtain an analogue

(5.3) nt1/m=exp(logtm)n\asymp t^{1/m}=\exp\left(\frac{\log t}{m}\right)

of (1.6), (1.7).

Next, we obtain the following analogue of Proposition 1.9.

Proposition 5.1 (Distance estimate).

Suppose we have two solutions (n,m),(n,m)(n,m),(n^{\prime},m^{\prime}) to (1.8) in region {(m,n)2:1mn}\{(m,n)\in\mathbb{N}^{2}:1\leq m\leq n\}. Then one has

(5.4) mmlog(n+n).m^{\prime}-m\ll\log(n+n^{\prime}).

Furthermore, if

(5.5) exp(log2/3+ε(n+n))m,m(n+n)2/3\exp(\log^{2/3+\varepsilon}(n+n^{\prime}))\leq m,m^{\prime}\leq(n+n^{\prime})^{2/3}

for some ε>0\varepsilon>0, then we additionally have

(5.6) nnA,εm+mlogA(m+m)n^{\prime}-n\ll_{A,\varepsilon}\frac{m+m^{\prime}}{\log^{A}(m+m^{\prime})}

for any A>0A>0.

Proof.

We begin with (5.4). We follow the arguments from [1, Proof of Theorem 4]. Taking 22-valuations v2v_{2} of both sides of (1.8) and using (1.11) we have

j=1(n2jnm2j)=j=1(n2jnm2j).\sum_{j=1}^{\infty}\left(\left\lfloor\frac{n}{2^{j}}\right\rfloor-\left\lfloor\frac{n-m}{2^{j}}\right\rfloor\right)=\sum_{j=1}^{\infty}\left(\left\lfloor\frac{n^{\prime}}{2^{j}}\right\rfloor-\left\lfloor\frac{n^{\prime}-m^{\prime}}{2^{j}}\right\rfloor\right).

The summands here vanish unless jlog(n+n)j\leq\log(n+n^{\prime}). Writing x=x+O(1)\lfloor x\rfloor=x+O(1), we conclude that

1jlog(n+n)m2j+O(log(n+n))=1jlog(n+n)m2j+O(log(n+n))\sum_{1\leq j\leq\log(n+n^{\prime})}\frac{m}{2^{j}}+O(\log(n+n^{\prime}))=\sum_{1\leq j\leq\log(n+n^{\prime})}\frac{m^{\prime}}{2^{j}}+O(\log(n+n^{\prime}))

and (5.4) follows.

Now we prove (5.6). Fix A,ε>0A,\varepsilon>0. We may assume without loss of generality that m<mm^{\prime}<m, so that n>nn^{\prime}>n by (1.8). We may also assume tt is sufficiently large depending on A,εA,\varepsilon, as the claim is trivial otherwise; from (5.5) this also implies that m,m,n,nm,m^{\prime},n,n^{\prime} are sufficiently large depending on A,εA,\varepsilon. Henceforth all implied constants are permitted to depend on A,εA,\varepsilon. By (5.5) we have

log2/3+εnlogm\log^{2/3+\varepsilon}n\leq\log m

while from (5.3) we have lognlogtm\log n\asymp\frac{\log t}{m}. From this and (5.1) we have

(5.7) mlogtlog2tm\asymp\frac{\log t}{\log_{2}t}

and then

lognlog212/3+εt.\log n\ll\log^{\frac{1}{2/3+\varepsilon}}_{2}t.

Similarly for m,nm^{\prime},n^{\prime}. From (5.4) we conclude that

(5.8) mmlog212/3+εtlog12/3+εm.m-m^{\prime}\ll\log^{\frac{1}{2/3+\varepsilon}}_{2}t\ll\log^{\frac{1}{2/3+\varepsilon}}m.

In particular mmm\asymp m^{\prime} and, combining (5.3) with (5.8) and (5.7), also nnt1/m1/mnn\asymp n^{\prime}t^{1/m-1/m^{\prime}}\asymp n^{\prime}. Hence from (5.5) we see that

(5.9) n,nm3/2.n,n^{\prime}\gg m^{3/2}.

Also we have

logtm=logtm+O(logtlog12/3+εmm2)=logtm+O(m1/2)\frac{\log t}{m^{\prime}}=\frac{\log t}{m}+O\left(\frac{\log t\log^{\frac{1}{2/3+\varepsilon}}m}{m^{2}}\right)=\frac{\log t}{m}+O(m^{-1/2})

(say), hence on exponentiating and using (5.2), (5.9)

(5.10) n=exp(logtm)+O(m)=n+O(m1/2n).n^{\prime}=\exp\left(\frac{\log t}{m^{\prime}}\right)+O(m)=n+O(m^{-1/2}n).

Suppose that we could find a prime p>mp>m obeying the inequalities

(5.11) max(1{nnp},1mp)<{nmp}<1;{nnp}<1mp.\max\left(1-\left\{\frac{n^{\prime}-n}{p}\right\},1-\frac{m}{p}\right)<\left\{\frac{n-m}{p}\right\}<1;\quad\left\{\frac{n^{\prime}-n}{p}\right\}<1-\frac{m}{p}.

These inequalities imply in particular that

{nmp}1+mp[0,1) and {nmp}+{nnp}1+mp[0,1),\left\{\frac{n-m}{p}\right\}-1+\frac{m}{p}\in[0,1)\text{ and }\left\{\frac{n-m}{p}\right\}+\left\{\frac{n^{\prime}-n}{p}\right\}-1+\frac{m}{p}\in[0,1),

so that these quantities respectively equal {np}\{\frac{n}{p}\} and {np}\{\frac{n^{\prime}}{p}\}. Consequently, if (5.11) hold, then we would have

(5.12) {np}={nmp}1+mp<mp\left\{\frac{n}{p}\right\}=\left\{\frac{n-m}{p}\right\}-1+\frac{m}{p}<\frac{m}{p}

and (since m<mm^{\prime}<m)

(5.13) {np}={nmp}+{nnp}1+mpmpmp.\left\{\frac{n^{\prime}}{p}\right\}=\left\{\frac{n-m}{p}\right\}+\left\{\frac{n^{\prime}-n}{p}\right\}-1+\frac{m}{p}\geq\frac{m}{p}\geq\frac{m^{\prime}}{p}.

Now (5.12) implies that pp divides (n)m(n)_{m}, while (5.13) implies that pp does not divide (n)m(n^{\prime})_{m^{\prime}}. This contradicts the assumption (n)m=t=(n)m(n)_{m}=t=(n^{\prime})_{m^{\prime}}. Thus there cannot be any prime p2mp\geq 2m obeying (5.11).

Let w1:[0,1]w_{1}\colon\mathbb{R}\to[0,1] be a suitable smooth \mathbb{Z}-periodic function supported on the region

{x:{x}(1log2Am,1)}\{x\in\mathbb{R}:\{x\}\in(1-\log^{-2A}m,1)\}

chosen so that 01w1log2Am\int_{0}^{1}w_{1}\gg\log^{-2A}m and w1C3log6Am\|w_{1}\|_{C^{3}}\ll\log^{6A}m, and let w2:[0,1]w_{2}\colon\mathbb{R}\to[0,1] similarly be a smooth \mathbb{Z}-periodic function supported on the region

{y:{y}(log2Am,1/2)}\{y\in\mathbb{R}:\{y\}\in(\log^{-2A}m,1/2)\}

chosen so that w2(y)=1w_{2}(y)=1 when {y}[2log2Am,1/4]\{y\}\in[2\log^{-2A}m,1/4] and w2C3log6Am\|w_{2}\|_{C^{3}}\ll\log^{6A}m. Let 𝐩\mathbf{p} be a prime drawn uniformly from all the primes in [2m,100m][2m,100m]. As 𝐩\mathbf{p} does not obey (5.11), we have

𝐄w1(nm𝐩)w2(nn𝐩)=0\mathbf{E}w_{1}\left(\frac{n-m}{\mathbf{p}}\right)w_{2}\left(\frac{n^{\prime}-n}{\mathbf{p}}\right)=0

and hence by Proposition 1.12 (and dyadic decomposition)

2100w1(nmtm)w2(nntm)𝑑tlog100Am,\int_{2}^{100}w_{1}\left(\frac{n-m}{tm}\right)w_{2}\left(\frac{n^{\prime}-n}{tm}\right)\ dt\ll\log^{-100A}m,

or on changing variables t=1/st=1/s

(5.14) 1/1001/2w1(nmms)w2(nnms)𝑑slog100Am.\int_{1/100}^{1/2}w_{1}\left(\frac{n-m}{m}s\right)w_{2}\left(\frac{n^{\prime}-n}{m}s\right)\ ds\ll\log^{-100A}m.

On the other hand, by (5.10), (5.9) we have

(5.15) nmmnmm1/2nnm+m1/2.\frac{n-m}{m}\gg\frac{n}{m}\gg m^{1/2}\frac{n^{\prime}-n}{m}+m^{1/2}.

We perform a Fourier expansion

w1(x)=ce(x),w_{1}(x)=\sum_{\ell\in\mathbb{Z}}c_{\ell}e(\ell x),

where by integration by parts the Fourier coefficients obey the bounds

|c|(1+||)3log6Am.|c_{\ell}|\ll(1+|\ell|)^{-3}\log^{6A}m.

Thus (5.14) can then be rewritten as

(5.16) c1/1001/2w2(nnms)e(nmms)𝑑slog100Am.\sum_{\ell\in\mathbb{Z}}c_{\ell}\int_{1/100}^{1/2}w_{2}\left(\frac{n^{\prime}-n}{m}s\right)e\left(\frac{n-m}{m}\ell s\right)\ ds\ll\log^{-100A}m.

By (5.15) and integration by parts, one readily establishes the bound

1/1001/2w2(nnms)e(nmms)𝑑slog6Am||m1/2\int_{1/100}^{1/2}w_{2}\left(\frac{n^{\prime}-n}{m}s\right)e\left(\frac{n-m}{m}\ell s\right)\ ds\ll\frac{\log^{6A}m}{|\ell|m^{1/2}}

for 0\ell\neq 0. Thus the total contribution to the left-hand side of (5.16) from the terms with 0\ell\neq 0 is negligible, and hence

c01/1001/2w2(nnms)𝑑slog100Am.c_{0}\int_{1/100}^{1/2}w_{2}\left(\frac{n^{\prime}-n}{m}s\right)\ ds\ll\log^{-100A}m.

Since c0=01w1log2Amc_{0}=\int_{0}^{1}w_{1}\gg\log^{-2A}m and w2w_{2} equals 11 on [2log2Am,1/4][2\log^{-2A}m,1/4], we have

(5.17) f(nnm)log98Amf\left(\frac{n^{\prime}-n}{m}\right)\ll\log^{-98A}m

where

f(θ)1/1001/212log2Am{θs}1/4𝑑s.f(\theta)\coloneqq\int_{1/100}^{1/2}1_{2\log^{-2A}m\leq\{\theta s\}\leq 1/4}\ ds.

However, direct calculation shows that when θ3\theta\geq 3, we have

f(θ)θ16nθ2141n+1/100θsn+1/4𝑑sθθ1=1,f(\theta)\geq\sum_{\frac{\theta}{16}\leq n\leq\frac{\theta}{2}-\frac{1}{4}}\int_{\mathbb{R}}1_{n+1/100\leq\theta s\leq n+1/4}\ ds\gg\theta\cdot\theta^{-1}=1,

when 1/2<θ<31/2<\theta<3, we have

f(θ)130θ120θ𝑑s1,f(\theta)\geq\int_{\frac{1}{30\theta}}^{\frac{1}{20\theta}}\ ds\asymp 1,

and, when 8logAmθ1/28\log^{-A}m\leq\theta\leq 1/2, we have

f(θ)1/41/2𝑑s1.f(\theta)\geq\int_{1/4}^{1/2}\ ds\asymp 1.

Hence (5.17) can only hold if

nnmlogAm,\frac{n^{\prime}-n}{m}\ll\log^{-A}m,

giving the claim (5.6). ∎

Now we adapt the analysis from Section 2. We extend the falling factorial (n)m(n)_{m} to real nm0n\geq m\geq 0 by the formula

(n)mΓ(n+1)Γ(nm+1).(n)_{m}\coloneqq\frac{\Gamma(n+1)}{\Gamma(n-m+1)}.

From the increasing nature of the digamma function ψ\psi we see that for fixed mm, (n)m(n)_{m} increases from Γ(m+2)\Gamma(m+2) when nn goes from m+1m+1 to infinity. Applying the inverse function theorem, we conclude that for any sufficiently large tt there is a unique smooth function gt:{m>0:Γ(m+2)t}g_{t}\colon\{m>0:\Gamma(m+2)\leq t\}\to\mathbb{R} such that for any m>0m>0 with Γ(m+2)t\Gamma(m+2)\leq t, one has gt(m)mg_{t}(m)\geq m and

(5.18) (gt(m))m=t.(g_{t}(m))_{m}=t.

Indeed, one could simply set gt(m)ft/Γ(m+1)(m)g_{t}(m)\coloneqq f_{t/\Gamma(m+1)}(m), where ftf_{t} is the function studied in Section 2.

We have an analogue of Proposition 2.1:

Proposition 5.2 (Estimates on the first few derivatives).

Let C>1C>1, and let t,mt,m be sufficiently large depending on CC with Γ(m+2)t\Gamma(m+2)\leq t. Then

(5.19) gt(m)t1/m.g_{t}(m)\asymp t^{1/m}.

In the range mgt(m)/2m\leq g_{t}(m)/2, we have

(5.20) gt(m)gt(m)logtm2-g^{\prime}_{t}(m)\asymp g_{t}(m)\frac{\log t}{m^{2}}

and in the range mgt(m)Clog2gt(m)m\leq g_{t}(m)-C\log^{2}g_{t}(m), one has

(5.21) 0<gt′′(m)gt(m)(logtm2)2+C1log3m.0<g^{\prime\prime}_{t}(m)\ll g_{t}(m)\left(\frac{\log t}{m^{2}}\right)^{2}+C^{-1}\log^{-3}m.
Proof.

Write n=gt(m)mn=g_{t}(m)\geq m. First note that (5.19) is simply (5.3). Taking logarithms in (5.18) we have

(5.22) logΓ(gt(m)+1)logΓ(gt(m)m+1)=logt.\log\Gamma(g_{t}(m)+1)-\log\Gamma(g_{t}(m)-m+1)=\log t.

If we differentiate (5.22) we obtain

(5.23) gt(m)ψ(gt(m)+1)(gt(m)1)ψ(gt(m)m+1)=0.g^{\prime}_{t}(m)\psi(g_{t}(m)+1)-(g^{\prime}_{t}(m)-1)\psi(g_{t}(m)-m+1)=0.

In particular we obtain the first derivative formula

(5.24) gt(m)=ψ(nm+1)ψ(n+1)ψ(nm+1).g^{\prime}_{t}(m)=\frac{-\psi(n-m+1)}{\psi(n+1)-\psi(n-m+1)}.

In the regime mn/2m\leq n/2 we can then obtain (5.20) from (2.12), (2.1), (5.19).

Differentiating (5.23) again, we conclude

gt′′(m)ψ(n+1)+(gt(m))2ψ(n+1)gt′′(m)ψ(nm+1)(gt(m)1)2ψ(nm+1)=0g^{\prime\prime}_{t}(m)\psi(n+1)+(g^{\prime}_{t}(m))^{2}\psi^{\prime}(n+1)-g^{\prime\prime}_{t}(m)\psi(n-m+1)-(g^{\prime}_{t}(m)-1)^{2}\psi^{\prime}(n-m+1)=0

which we can rearrange using (5.24) as

(5.25) gt′′(m)(ψ(n+1)ψ(nm+1))3=ψ(n+1)2ψ(nm+1)ψ(nm+1)2ψ(n+1).\begin{split}g^{\prime\prime}_{t}(m)(\psi(n+1)-\psi(n-m+1))^{3}&=\psi(n+1)^{2}\psi^{\prime}(n-m+1)\\ &\quad-\psi(n-m+1)^{2}\psi^{\prime}(n+1).\end{split}

Suppose first that mn/2m\leq n/2. Then (2.12) applies, and it suffices to show that

ψ(n+1)2ψ(nm+1)ψ(nm+1)2ψ(n+1)(mn)3n(logtm2)2.\psi(n+1)^{2}\psi^{\prime}(n-m+1)-\psi(n-m+1)^{2}\psi^{\prime}(n+1)\asymp\left(\frac{m}{n}\right)^{3}n\left(\frac{\log t}{m^{2}}\right)^{2}.

By (5.19) the right-hand side is mlog2nn2\asymp\frac{m\log^{2}n}{n^{2}}. On the other hand, from the mean value theorem and (2.1), (2.2), (2.3) we have

0<ψ(n+1)2(ψ(nm+1)ψ(n+1))mlog2nn20<\psi(n+1)^{2}(\psi^{\prime}(n-m+1)-\psi^{\prime}(n+1))\asymp\frac{m\log^{2}n}{n^{2}}

and

0<(ψ(n+1)2ψ(nm+1)2)ψ(n+1)mlognn20<(\psi(n+1)^{2}-\psi(n-m+1)^{2})\psi^{\prime}(n+1)\ll\frac{m\log n}{n^{2}}

giving the claim.

Now suppose that n/2mnClog2nn/2\leq m\leq n-C\log^{2}n. From (2.1), (2.2) we have

ψ(n+1)ψ(nm+1)\displaystyle\psi(n+1)-\psi(n-m+1) lognnm\displaystyle\asymp\log\frac{n}{n-m}
ψ(n+1)2(ψ(nm+1)ψ(n+1))\displaystyle\psi(n+1)^{2}(\psi^{\prime}(n-m+1)-\psi^{\prime}(n+1)) log2nnm\displaystyle\asymp\frac{\log^{2}n}{n-m}
0<(ψ(n+1)2ψ(nm+1)2)ψ(n+1)\displaystyle 0<(\psi(n+1)^{2}-\psi(n-m+1)^{2})\psi^{\prime}(n+1) log2nn\displaystyle\ll\frac{\log^{2}n}{n}

and hence by (5.25)

gt′′(m)log2n(nm)log3nnm.g_{t}^{\prime\prime}(m)\asymp\frac{\log^{2}n}{(n-m)\log^{3}\frac{n}{n-m}}.

Since n2nmClog2n\frac{n}{2}\geq n-m\geq C\log^{2}n, we have

(nm)log3nnmClog5n(n-m)\log^{3}\frac{n}{n-m}\gg C\log^{5}n

(as can be seen by checking the cases nmnn-m\leq\sqrt{n} and nm>nn-m>\sqrt{n} separately), and the claim follows. ∎

Now we can establish Theorem 1.8. Let C>0C>0 be a large absolute constant, let ε>0\varepsilon>0, and suppose that tt is sufficiently large depending on ε,C\varepsilon,C. Let (n,m)(n,m) be the integer solution to (1.8) in the region exp(log2/3+εn)mn1\exp(\log^{2/3+\varepsilon}n)\leq m\leq n-1 with a maximal value of mm; we may assume that such a solution exists, since we are done otherwise. If (n,m)(n^{\prime},m^{\prime}) is any other solution in this region, then m<mm^{\prime}<m and n<nn<n^{\prime}. Note that n,n,m,mn,n^{\prime},m,m^{\prime} are sufficiently large depending on ε,C\varepsilon,C. From Proposition 5.1 and (5.3) we have

mmlognlogtmm-m^{\prime}\ll\log n^{\prime}\ll\frac{\log t}{m^{\prime}}

and from (5.3) and (5.1)

mlogtlognlogtlog12/3+εmlogtlog212/3+εtm^{\prime}\asymp\frac{\log t}{\log n^{\prime}}\geq\frac{\log t}{\log^{\frac{1}{2/3+\varepsilon}}m^{\prime}}\gg\frac{\log t}{\log^{\frac{1}{2/3+\varepsilon}}_{2}t}

and thus mmm^{\prime}\asymp m and logtm=logtm+O(1)\frac{\log t}{m}=\frac{\log t}{m^{\prime}}+O(1). Hence nnn\asymp n^{\prime} and

(5.26) mmlogn.m-m^{\prime}\ll\log n.

First suppose that mn1/2log10nm\leq n^{1/2}\log^{10}n. Here we will exploit the fact that nn grows rapidly as mm decreases. From Proposition 5.1 we have

nnεmlog200mmlog100n.n^{\prime}-n\ll_{\varepsilon}\frac{m}{\log^{200}m}\ll\frac{m}{\log^{100}n}.

On the other hand, from (5.20) and the mean value theorem we have

nn=gt(m)gt(m)nlogtm2(mm)nmn^{\prime}-n=g_{t}(m^{\prime})-g_{t}(m)\gg\frac{n\log t}{m^{2}}(m-m^{\prime})\geq\frac{n}{m}

thanks to (5.1) and the trivial bound mm1m-m^{\prime}\geq 1. Thus we have

nmmlog100n\frac{n}{m}\ll\frac{m}{\log^{100}n}

but this contradicts the hypothesis mn1/2log10nm\leq n^{1/2}\log^{10}n.

Now suppose we are in the regime

n1/2log10n<mnClog2n.n^{1/2}\log^{10}n<m\leq n-C\log^{2}n.

Here we will take advantage of the convexity properties of gtg_{t}. From (5.26), mm^{\prime} lies in the interval [mO(logn),m][m-O(\log n),m]. By (5.19), for all xx in this interval, we have

gt(x)t1/xt1/mng_{t}(x)\asymp t^{1/x}\asymp t^{1/m}\asymp n

and by (5.21), we have

0<gt′′(x)\displaystyle 0<g^{\prime\prime}_{t}(x) gt(x)(logtx2)2+C1log3x\displaystyle\ll g_{t}(x)\left(\frac{\log t}{x^{2}}\right)^{2}+C^{-1}\log^{-3}x
n(logtm2)2+C1log3m\displaystyle\ll n\left(\frac{\log t}{m^{2}}\right)^{2}+C^{-1}\log^{-3}m
n(lognm)2+C1log3n\displaystyle\ll n\left(\frac{\log n}{m}\right)^{2}+C^{-1}\log^{-3}n
C1log3n\displaystyle\ll C^{-1}\log^{-3}n

since m>n1/2log10nm>n^{1/2}\log^{10}n. Applying Lemma 2.2 with k=2k=2, we see (for CC large enough) that there are at most two integers mm^{\prime} in this interval with gt(m)g_{t}(m^{\prime}) an integer, giving Theorem 1.8 follows in this case.

It remains to handle the case

(5.27) nClog2n<mn1.n-C\log^{2}n<m\leq n-1.

Recall from (5.26) that mm^{\prime} lies in the interval [mO(logn),m][m-O(\log n),m]. From (5.3), (5.27) we have

mnlogtlog2tm\asymp n\asymp\frac{\log t}{\log_{2}t}

so m=mO(log2t)m^{\prime}=m-O(\log_{2}t). From (5.3) again we thus also have

mnlogtlog2t.m^{\prime}\asymp n^{\prime}\asymp\frac{\log t}{\log_{2}t}.

From (1.8) we have

nnmn1n1mn+1n+1m=(nm)(nm+1).\frac{n^{\prime}}{n^{\prime}-m^{\prime}}\frac{n^{\prime}-1}{n^{\prime}-1-m^{\prime}}\dots\frac{n+1}{n+1-m^{\prime}}=(n-m^{\prime})\dots(n-m+1).

The right-hand side is at most exp(O(log2tlog3t))\exp(O(\log_{2}t\log_{3}t)). This implies that nnlog3tn^{\prime}-n\ll\log_{3}t, since otherwise the left hand side would be, for any C1C\geq 1,

(nnm+1+Clog3t)Clog3texp(C2log3tlog2t)\gg\left(\frac{n}{n-m^{\prime}+1+C\log_{3}t}\right)^{C\log_{3}t}\gg\exp\left(\frac{C}{2}\log_{3}t\log_{2}t\right)

which contradicts the bound for the right hand side when CC is sufficiently large.

In particular we have from the triangle inequality that

nm,nmClog22t.n-m,n^{\prime}-m^{\prime}\ll C\log^{2}_{2}t.

Making the change of variables :=nm\ell:=n-m, it now suffices to show that there are at most two integer solutions to the equation

(5.28) (n)n=t(n)_{n-\ell}=t

in the regime 1Clog22t1\leq\ell\ll C\log^{2}_{2}t. We write this equation (5.28) as

n!=t!n!=t\ell!

or equivalently

n=ht()n=h_{t}(\ell)

where ht(x)Γ1(tΓ(x+1))1h_{t}(x)\coloneqq\Gamma^{-1}(t\Gamma(x+1))-1, and Γ1:[1,+)[2,+)\Gamma^{-1}\colon[1,+\infty)\to[2,+\infty) is the inverse of the gamma function. Here we will exploit the very slowly varying nature of hth_{t}. From Stirling’s formula we have

ht(x)logtlog2th_{t}(x)\asymp\frac{\log t}{\log_{2}t}

whenever 1xClog22t1\leq x\ll C\log^{2}_{2}t. Taking the logarithmic derivative of the equation

Γ(ht(x)+1)=tΓ(x+1)\Gamma(h_{t}(x)+1)=t\Gamma(x+1)

we have

ht(x)ψ(ht(x)+1)=ψ(x+1).h^{\prime}_{t}(x)\psi(h_{t}(x)+1)=\psi(x+1).

Hence by (2.1)

ht(x)logxloght(x)log3tlog2th^{\prime}_{t}(x)\asymp\frac{\log x}{\log h_{t}(x)}\ll\frac{\log_{3}t}{\log_{2}t}

in the regime 1xClog22t1\leq x\ll C\log^{2}_{2}t. In particular, for two solutions (n,),(n,)(n,\ell),(n^{\prime},\ell^{\prime}) to (5.28) in this regime we have

(5.29) nnlog3tlog2t||.n-n^{\prime}\ll\frac{\log_{3}t}{\log_{2}t}|\ell-\ell^{\prime}|.

For fixed nn there is at most one 1\ell\geq 1 solving (5.28). We conclude that for two distinct solutions (n,),(n,)(n,\ell),(n^{\prime},\ell^{\prime}) to (5.28) in this regime, we have |nn|1|n-n^{\prime}|\geq 1, and hence the separation

||log2tlog3t.|\ell-\ell^{\prime}|\gg\frac{\log_{2}t}{\log_{3}t}.

Now suppose we have three solutions (n1,1),(n2,2),(n3,3)(n_{1},\ell_{1}),(n_{2},\ell_{2}),(n_{3},\ell_{3}) to (5.28) in this regime. We can order 1<2<3\ell_{1}<\ell_{2}<\ell_{3}, so that n1<n2<n3n_{1}<n_{2}<n_{3}. From the preceding discussion we have

log2tlog3t21,32Clog22t\frac{\log_{2}t}{\log_{3}t}\ll\ell_{2}-\ell_{1},\ell_{3}-\ell_{2}\ll C\log^{2}_{2}t

and

1n2n1,n3n2Clog2tlog3t.1\leq n_{2}-n_{1},n_{3}-n_{2}\ll C\log_{2}t\log_{3}t.

If 2j2^{j} is a power of 22 that divides an integer in (n1,n2](n_{1},n_{2}] as well as an integer in (n2,n3](n_{2},n_{3}], then we must therefore have 2jClog2tlog3t2^{j}\ll C\log_{2}t\log_{3}t, so that jlog3tj\ll\log_{3}t. Thus, there must exist i=1,2i=1,2 such that the interval (ni,ni+1](n_{i},n_{i+1}] only contains multiples of 2j2^{j} when jlog3tj\ll\log_{3}t. Fix this ii. Taking 22-adic valuations of (5.28) using (1.11) we have

j=1ni2j=v2(t)+j=1i2j\sum_{j=1}^{\infty}\left\lfloor\frac{n_{i}}{2^{j}}\right\rfloor=v_{2}(t)+\sum_{j=1}^{\infty}\left\lfloor\frac{\ell_{i}}{2^{j}}\right\rfloor

and

j=1ni+12j=v2(t)+j=1i+12j\sum_{j=1}^{\infty}\left\lfloor\frac{n_{i+1}}{2^{j}}\right\rfloor=v_{2}(t)+\sum_{j=1}^{\infty}\left\lfloor\frac{\ell_{i+1}}{2^{j}}\right\rfloor

and thus

(5.30) j=1(ni+12jni2j)=j=1(i+12ji2j).\sum_{j=1}^{\infty}\left(\left\lfloor\frac{n_{i+1}}{2^{j}}\right\rfloor-\left\lfloor\frac{n_{i}}{2^{j}}\right\rfloor\right)=\sum_{j=1}^{\infty}\left(\left\lfloor\frac{\ell_{i+1}}{2^{j}}\right\rfloor-\left\lfloor\frac{\ell_{i}}{2^{j}}\right\rfloor\right).

Since

(5.31) i+1ilog2tlog3t,\ell_{i+1}-\ell_{i}\gg\frac{\log_{2}t}{\log_{3}t},

we certainly have i+1i2\ell_{i+1}-\ell_{i}\geq 2, and the right-hand side of (5.30) is at least

i+12i2i+1i.\left\lfloor\frac{\ell_{i+1}}{2}\right\rfloor-\left\lfloor\frac{\ell_{i}}{2}\right\rfloor\gg\ell_{i+1}-\ell_{i}.

By construction, the terms on the left-hand side of (5.30) vanish unless jlog3tj\ll\log_{3}t, in which case they are equal to ni+1ni2j+O(1)\frac{n_{i+1}-n_{i}}{2^{j}}+O(1). Thus the left-hand side of (5.30) is at most O(ni+1ni+log3t)O(n_{i+1}-n_{i}+\log_{3}t). Thus

i+1ini+1ni+log3t.\ell_{i+1}-\ell_{i}\ll n_{i+1}-n_{i}+\log_{3}t.

But from (5.29) one has ni+1nilog3tlog2t(i+1i)n_{i+1}-n_{i}\ll\frac{\log_{3}t}{\log_{2}t}(\ell_{i+1}-\ell_{i}). Hence i+1ilog3t\ell_{i+1}-\ell_{i}\ll\log_{3}t. But this contradicts (5.31). This concludes the proof of Theorem 1.8.

References

  • [1] H. L. Abbott, P. Erdős, and D. Hanson. On the number of times an integer occurs as a binomial coefficient. Amer. Math. Monthly, 81:256–261, 1974.
  • [2] Milton Abramowitz and Irene A. Stegun. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables. Dover, New York City, ninth Dover printing, tenth GPO printing edition, 1964.
  • [3] È. T. Avanesov. Solution of a problem on figurate numbers. Acta Arith., 12:409–420, 1966/67.
  • [4] F. Beukers, T. N. Shorey, and R. Tijdeman. Irreducibility of polynomials and arithmetic progressions with equal products of terms. In Number theory in progress, Vol. 1 (Zakopane-Kościelisko, 1997), pages 11–26. de Gruyter, Berlin, 1999.
  • [5] Aart Blokhuis, Andries Brouwer, and Benne de Weger. Binomial collisions and near collisions. Integers, 17:Paper No. A64, 8, 2017.
  • [6] J. W. Bober. Factorial ratios, hypergeometric series, and a family of step functions. J. Lond. Math. Soc. (2), 79(2):422–444, 2009.
  • [7] J. William Bober. Integer ratios of factorials, hypergeometric functions, and related step functions. ProQuest LLC, Ann Arbor, MI, 2009. Thesis (Ph.D.)–University of Michigan.
  • [8] Yann Bugeaud, Maurice Mignotte, Samir Siksek, Michael Stoll, and Szabolcs Tengely. Integral points on hyperelliptic curves. Algebra Number Theory, 2(8):859–885, 2008.
  • [9] Harald Cramér. On the order of magnitude of the difference between consecutive prime numbers. Acta Arith., 2:23–46, 1936.
  • [10] B. M. M. de Weger. A binomial Diophantine equation. Quart. J. Math. Oxford Ser. (2), 47(186):221–231, 1996.
  • [11] Benjamin M. M. de Weger. Equal binomial coefficients: some elementary considerations. J. Number Theory, 63(2):373–386, 1997.
  • [12] H. Iwaniec and E. Kowalski. Analytic number theory, volume 53 of American Mathematical Society Colloquium Publications. American Mathematical Society, Providence, RI, 2004.
  • [13] H. Jenkins. Repeated binomial coefficients and high-degree curves. Integers, 16:Paper No. A69, 14, 2016.
  • [14] Daniel Kane. New bounds on the number of representations of TT as a binomial coefficient. Integers, 4:A7, 10, 2004.
  • [15] Daniel M. Kane. Improved bounds on the number of ways of expressing tt as a binomial coefficient. Integers, 7:A53, 7, 2007.
  • [16] P. Kiss. On the number of solutions of the Diophantine equation (xp)=(y2)\binom{x}{p}=\binom{y}{2}. Fibonacci Quart., 26(2):127–130, 1988.
  • [17] Edmund Landau. Collected works. Vol. 1. Thales-Verlag, Essen, 1985. With a contribution in English by G. H. Hardy and H. Heilbronn, Edited and with a preface in English by L. Mirsky, I. J. Schoenberg, W. Schwarz and H. Wefelscheid.
  • [18] D. A. Lind. The quadratic field Q(5)Q(\surd 5) and a certain Diophantine equation. Fibonacci Quart., 6(3):86–93, 1968.
  • [19] Kaisa Matomäki, Maksym Radziwiłł, and Terence Tao. Correlations of the von Mangoldt and higher divisor functions I. Long shift ranges. Proc. Lond. Math. Soc. (3), 118(2):284–350, 2019.
  • [20] L. J. Mordell. On the integer solutions of y(y+1)=x(x+1)(x+2)y(y+1)=x(x+1)(x+2). Pacific J. Math., 13:1347–1351, 1963.
  • [21] Ákos Pintér. A note on the Diophantine equation (x4)=(y2)\binom{x}{4}=\binom{y}{2}. Publ. Math. Debrecen, 47(3-4):411–415, 1995.
  • [22] David Singmaster. Research Problems: How Often Does an Integer Occur as a Binomial Coefficient? Amer. Math. Monthly, 78(4):385–386, 1971.
  • [23] David Singmaster. Repeated binomial coefficients and Fibonacci numbers. Fibonacci Quart., 13(4):295–298, 1975.
  • [24] K. Soundararajan. Integral Factorial Ratios. arXiv e-prints, page arXiv:1901.05133, January 2019.
  • [25] K. Soundararajan. Integral factorial ratios: irreducible examples with height larger than 1. Philos. Trans. Roy. Soc. A, 378(2163):20180444, 13, 2020.
  • [26] Roelof J. Stroeker and Benjamin M. M. de Weger. Elliptic binomial Diophantine equations. Math. Comp., 68(227):1257–1281, 1999.
  • [27] Craig A. Tovey. Multiple occurrences of binomial coefficients. Fibonacci Quart., 23(4):356–358, 1985.