Singmaster’s conjecture in the interior of Pascal’s triangle
Abstract.
Singmaster’s conjecture asserts that every natural number greater than one occurs at most a bounded number of times in Pascal’s triangle; that is, for any natural number , the number of solutions to the equation for natural numbers is bounded. In this paper we establish this result in the interior region for any fixed . Indeed, when is sufficiently large depending on , we show that there are at most four solutions (or at most two in either half of Pascal’s triangle) in this region. We also establish analogous results for the equation , where denotes the falling factorial.
1. Introduction
In 1971, Singmaster [22] conjectured that any natural number greater than one only appeared in Pascal’s triangle a bounded number of times. In asymptotic notation111Our conventions for asymptotic notation are set out in Section 1.5., we can express this conjecture as
Conjecture 1.1 (Singmaster’s conjecture).
For any natural number , the number of integer solutions to the equation
(1.1) |
is .
Note that we can exclude the edges of Pascal’s triangle from consideration since in these cases. Currently the largest known number of solutions to (1.1) for a given is eight, arising from and
(1.2) |
For the purposes of attacking this conjecture, we may of course assume to be larger than any given absolute constant, which we shall implicitly do in the sequel. In particular we can assume that the iterated logarithms
are well-defined and positive.
In view of the symmetry
(1.3) |
we may restrict attention to the left half
(1.4) |
of Pascal’s triangle. For solutions to (1.1) in this half (1.4) of the triangle, we have
by Stirling’s approximation (2.4), and thus we have the upper bound
(1.5) |
Since is an increasing function of for fixed , is uniquely determined by and . Thus by (1.5) we have at most solutions to the equation , a fact already observed in the original paper [22] of Singmaster. This bound was improved to by Abbott, Erdős, and Hansen [1], to by Kane [14], and finally to in a followup work of Kane [15]. This remains the best known unconditional bound for the total number of solutions, although it was observed in [1] that the improved bound was available for any assuming the conjecture of Cramér [9].
From the elementary inequalities
and some rearranging we see that any solution to obeys the bounds
Applying Stirling’s approximation (2.4) (and also ) we can thus obtain the order of magnitude of as a function of and :
(1.6) |
or equivalently
(1.7) |
In particular we see that grows extremely rapidly when the ratio becomes small. This makes the difficulty of the problem increase as approaches zero, and indeed treating the case of small values of is the main obstruction to making further progress on bounding the total number of solutions.
Remark 1.2.
In this paper we study the opposite regime in which is relatively large, or equivalently (by (1.7)) and are somewhat comparable (in the doubly logarithmic sense ). More precisely, we have the following result:
Theorem 1.3 (Singmaster’s conjecture in the interior of Pascal’s triangle).
Let , and assume that is sufficiently large depending on . Then there are at most two solutions to (1.1) in the region . By (1.3), we thus have at most four solutions to (1.1) in the region . Furthermore, in the smaller region there is at most one solution, whenever and is sufficiently large depending on both and .
Remark 1.4.
The bound of two (or four) solutions is absolutely sharp, in view of the infinite family of solutions observed in [18], [23], [27] to the equation
given by , , where denotes the Fibonacci number. See also [13] for further analysis of equations of this type. Besides this infinite family of collisions, and the “trivial” ones generated by (1.3), , and , the only further known collisions between binomial coefficients arise from the identities for
as well as the example in (1.2). It was conjectured by de Weger [11] that these above examples generate all the non-trivial collisions ; this would of course imply Singmaster’s conjecture. This conjecture has been verified for [3], for [21], [10], for [8], for [20], [11], and [26], and for or in [5].
Remark 1.5.
In view of Theorem 1.3, we now see that to prove Conjecture 1.1, we may restrict attention without loss of generality to the region for any fixed , or equivalently (by (1.7)) to for any fixed . It follows from the conjecture of de Weger [11] mentioned in Remark 1.4 that for sufficiently large there is only at most one solution in this region, that is to say all but a finite number of binomial coefficients for are distinct. In this direction, the number of solutions to the equation for fixed has been shown (via Siegel’s theorem on integral points) to be finite in [4] (see also the earlier result [16] treating the case for an odd prime ). This implies that there are no collisions in the regime if is a function of that goes to infinity sufficiently slowly as . Unfortunately, due to the reliance on Siegel’s theorem, the function given by these arguments is completely ineffective.
Remark 1.6.
Remark 1.7.
The implied quantitative bounds in the hypothesis “ is sufficiently large depending on ” are effective; however, we have made no attempt whatsoever to optimize them in this paper, and will likely be too large to be of use in numerical verification of Singmaster’s conjecture in their current form.
1.1. An analog for falling factorials
The methods used to handle the equation (1.1) can be modified to treat the variant equation
(1.8) |
for integers and , where denotes the falling factorial
We exclude the cases since and . In [1, Theorem 4] it was shown that for any the number of integer solutions to (1.8) with is . We do not directly improve upon this bound here, but can obtain an analogue of Theorem 1.3:
Theorem 1.8 (Falling factorial multiplicity in the interior).
Let , and assume that is sufficiently large depending on . Then there are at most two integer solutions to (1.8) in the region .
We establish this result in Section 5. Note that the bound of two is best possible, as can be seen from the infinite family of solutions
for any integer , and more generally
whenever are integers.
1.2. Strategy of proof
Theorem 1.3 is a consequence of two Propositions that we now describe. The proof of Theorem 1.8 will follow a similar pattern as described here and we refer the reader to Section 5 for details.
Proposition 1.9 (Distance estimate).
Note how this proposition is consistent with the example in Remark 1.4. We shall discuss the proof of Proposition 1.9 in Section 1.3. For the application to Theorem 1.3, Proposition 1.9 localizes all solutions to (1.1) to a region of small diameter. To conclude Theorem 1.3, we can now proceed by adapting the Taylor expansion arguments of Kane [14], [15], in which one views as an analytic function of (keeping fixed) and exploits the non-vanishing of certain derivatives of this function; see Section 2. This is what the proposition below accomplishes. In fact in our analysis only two derivatives of this function are needed (i.e., we only need to exploit the convexity properties of as a function of ).
Proposition 1.10 (Kane-type estimate).
With these two Propositions at hand it is easy to deduce Theorem 1.3.
Deduction of Theorem 1.3.
Let , let be sufficiently large depending on , and let be the solution to (1.1) in the region
(1.9) |
with the maximal value of (if there are no such solutions then of course Theorem 1.3 is trivial). For brevity we allow all implied constants in the following arguments to depend on . If is any other solution in this region, then and . From (1.7) we have
thanks to (1.5). From further application of (1.7) we then have
Similarly for . Applying Proposition 1.9 (with replaced by a sufficiently small quantity), we conclude that
(1.10) |
whenever , or equivalently . The result now follows from Proposition 1.10. ∎
Remark 1.11.
The above arguments showed that for sufficiently large depending on , there were at most four solutions to (1.1) in the region . A modification of the argument also shows that there cannot be exactly three such solutions. For if this were the case, we see from (1.3) that there must be a solution with , so that by Stirling’s approximation. For all other solutions to (1.1) we have , hence
and hence (by Stirling’s approximation)
By Stirling’s approximation (or the central limit theorem of de Moivre and Laplace) this forces , thus . But this contradicts (1.10).
1.3. Proof methods
We now discuss the method of proof of Proposition 1.9, which is our main new contribution. In contrast to the “Archimedean” arguments of Kane (such as Proposition 1.10) that use real and complex analysis of the binomial coefficients , the proof of Proposition 1.9 relies more on “non-Archimedean” arguments, based on evaluating the -adic valuations for various primes , defined as the number of times divides . From the classical Legendre formula
(1.11) |
where is the integer part of , we see that
(1.12) |
where denotes the fractional part of . Note that the summands here vanish whenever . From this identity we see that if are two solutions to (1.1) then we must have
(1.13) |
for all primes . Our strategy will be to apply this equation with set equal to a random prime drawn uniformly amongst all primes in the interval where the scale is something like , and inspect the distribution of the resulting random variables on the left and right-hand sides of (1.13) in order to obtain a contradiction when or are sufficiently well separated. In order to do this we need some information concerning the equidistribution of fractional parts such as . This will be provided by the following estimate, proven in Section 4. There and later the letter always denotes a prime.
Proposition 1.12 (Equidistribution estimate).
Let and and let be an interval contained in . Let be real numbers with , and let be a natural number.
-
(i)
For all ,
-
(ii)
Let be a smooth -periodic function. Then, for all ,
where
One can generalize this proposition to control the joint equidistribution of any bounded number of expressions of the form , but for our applications it will suffice to understand the equidistribution of pairs , .
When it comes to the proof of Proposition 1.12, the first step is to use Fourier expansion to reduce part (ii) of the proposition to part (i). For part (i), the case where is small (say ) is easily handled using the prime number theorem with classical error term. In the regime where is large, we use Vaughan’s identity to decompose the sum in (i) into type I and II sums, and assert that these exhibit cancellation; the type I and II bounds are given in (4.9) and (4.11).
Both type I and type II sums can be handled using Vinogradov’s bound for sums of the form with smooth, although we need to first cut from small intervals around zeros of the first derivatives of . This way we obtain that the sum in (i) exhibits cancellation. It is here that the restriction arises; even under the Riemann hypothesis we do not know how to relax this requirement222Using standard randomness heuristics one could tentatively conjecture that this restriction could be relaxed to for some constant ; this would improve the range in Theorem 1.3 to for some constant ..
Once the equidistribution estimate, Proposition 1.12, is established, the analysis of the distribution of both sides of (1.13) is relatively straightforward, as long as the scale is chosen so that the powers do not lie close to various integer combinations of . However, there are some delicate cases when two of the numbers are “commensurable” in the sense that one of them is close to a rational multiple of the other, where the rational multiplier has small height. Commensurable integers are also known to generate some exceptional examples of integer factorial ratios [6], [7], [25]. Fortunately, we can handle these cases in our context by an analysis of covariances between various fractional parts , in particular taking advantage of the fact that these covariances are non-negative up to small errors, and small unless are very highly commensurable.
1.4. Acknowledgments
KM was supported by Academy of Finland grant no. 285894. MR acknowledges the support of NSF grant DMS-1902063 and a Sloan Fellowship. XS was supported by NSF grant DMS-1802224. TT was supported by a Simons Investigator grant, the James and Carol Collins Chair, the Mathematical Analysis & Application Research Fund Endowment, and by NSF grant DMS-1764034. JT was supported by a Titchmarsh Fellowship.
1.5. Notation
We use , , or to denote the estimate for some constant . If we wish to permit this constant to depend on one or more parameters we shall indicate this by appropriate subscripts, thus for instance denotes a quantity bounded in magnitude by for some quantity depending only on . We write for .
We use to denote the indicator of an event , thus equals when is true and otherwise.
We let denote the standard real character .
2. Derivative estimates
We generalize the binomial coefficient to real by the formula
where
is the Gamma function (with the Euler–Mascheroni constant). This is of course consistent with the usual definition of the binomial coefficient. Observe that the digamma function
is a smooth increasing concave function on , with
positive and decreasing, and
negative. For future reference we also observe the standard asymptotics
(2.1) | ||||
(2.2) | ||||
(2.3) |
and the Stirling approximation
(2.4) |
for any ; see e.g., [2, §6.1, 6.3, 6.4]. One could also extend these functions meromorphically to the entire complex plane, but we will not need to do so here.
From the increasing nature of we see that is strictly increasing on for fixed real , and from Stirling’s approximation (2.4) we see that it goes to infinity as . Thus for given , we see from the inverse function theorem that there exists a unique smooth function with for all , such that
(2.5) |
In particular, the equation (1.1) holds for given integers and if and only if . This function was analyzed by Kane [14], who among other things was able to extend holomorphically to a certain sector, which then allowed him to estimate high derivatives of this function. However, for our analysis we will only need to control the first few derivatives of , which can be estimated by hand:
Proposition 2.1 (Estimates on the first few derivatives).
Let be sufficiently large with . Then
(2.6) |
and
(2.7) |
and
(2.8) |
In particular, is convex and decreasing in this regime.
Proof.
Taking logarithms in (2.5) we have
(2.9) |
Writing , we thus see from the mean value theorem that
for some depending on . Applying (2.1), we conclude that
which implies that
and the claim (2.6) then follows from Stirling’s approximation (2.4).
If we differentiate (2.9) we obtain
(2.10) |
In particular we obtain the first derivative formula
(2.11) |
From (2.2) and the mean value theorem we have
(2.12) |
while from either the mean-value theorem and (2.2) (if ) or from (2.1) (if say ) we see that
We conclude that
Differentiating (2.10) again, we conclude
which we can rearrange using (2.11) as
From (2.12), (2.6) it thus suffices to show that
The quantity is non-negative and is of size by (2.12), (2.2). Thus it will suffice to show that
We split the left-hand side as the sum of
and
From (2.1), (2.3), and the mean value theorem the first term is positive and comparable to ; similarly, from (2.1), (2.2), and (2.12) the second term is positive and bounded above by . The claim follows. ∎
Lemma 2.2 (Small non-zero derivative implies few integer values).
Let be a natural number, and suppose that is a smooth function on an interval of some length such that one has the derivative bound
(2.13) |
for all . Then there are at most integers for which is also an integer.
Proof.
Suppose for contradiction that there are distinct integers with an integer. By Lagrange interpolation, the function
(2.14) |
is a polynomial of degree at most such that vanishes at . By many applications of Rolle’s theorem (see [14, Corollary 2.1]), there must then exist such that vanishes. From (2.14), (which is the degree coefficient of ) is an integer multiple of , and thus either vanishes or has magnitude at least . But this contradicts (2.13). ∎
As an application of these bounds, we can locally control the number of solutions (1.1) in the region , thus giving a version of Theorem 1.3 in a small interval:
Corollary 2.3.
Proof.
From (1.7) and the hypothesis we have
(2.15) |
For in the interval , we then have , and so we see from Proposition 2.1 and (2.15) that and
for all . Since and is sufficiently large depending on , is also sufficiently large depending on , and we have
for all . Applying Lemma 2.2, there are at most two integers with an integer. Since is already one of these integers, the claim follows. ∎
The same method, using higher derivative estimates on , also gives similar results (with weaker bounds on the number of solutions) for ; see [14], [15]. However, we will only need to apply this method in the regime here.
We are now ready to prove Proposition 1.10.
Proof of Proposition 1.10.
Let , let be sufficiently large depending on , and let be a solution to (1.1) in the region
(2.16) |
For brevity we allow all implied constants in the following arguments to depend on . Suppose is another solution in this region with , and
From (2.7) and convexity (and the bounds and ) we have
and thus
From (1.7) we have , hence , and so for some constant , (shrinking slightly if necessary) if is sufficiently large depending on . The result now follows from Corollary 2.3.
∎
It remains to establish Proposition 1.9. This will be the objective of the next two sections of the paper.
3. The distance bound
Throughout this section will be fixed; we can assume it to be small. We may assume that is sufficiently large depending on , as the claim is trivial otherwise. We may assume that , hence also . We assume for sake of contradiction that at least one of the claims
(3.1) |
and
(3.2) |
is true, as the claim is trivial otherwise. This allows us to select a “good” scale:
Lemma 3.1 (Selection of scale).
With the above assumptions, there exists obeying the following axioms:
-
(i)
( not too large) We have . (In particular, will be sufficiently large depending on , since otherwise .)
-
(ii)
(Dichotomy) If are integers with , and is a natural number, then either
(3.3) or
-
(iii)
(Separation) At least one of the statements
and
is true.
Proof.
We restrict to be a power of two in the range
such a choice will automatically obey (i) since and (iii) since we assumed that either (3.1) or (3.2) holds. There are choices for . Some of these will not obey (ii), but we can control the number of exceptions as follows. Firstly, observe that the conclusion (3.3) will hold unless , so we may restrict attention to this range of . The number of possible tuples is then . For each such tuple, we see from the restriction on that the number of with
is at most (since is of size , say). Thus we see that the total number of which fail to obey (ii) is at most
which is negligible compared to the total number of choices, which is . Thus we can find a choice of which obeys all of (i), (ii), and (iii), giving the claim. ∎
Henceforth we fix a scale obeying the properties in Lemma 3.1. We now introduce a relation on the reals by declaring if . Thus, by Lemma 3.1(ii), if for as in Lemma 3.1(ii) then . Also, from Lemma 3.1(iii), at least one of the statements
and
is true.
We introduce a random variable , which is drawn uniformly from the primes in the interval (note that there is at least one such prime thanks to the prime number theorem). From (1.13) we surely have
We can restrict attention to those with , since the summands vanish otherwise. For any real number , we may take covariances of both sides of this identity with the random variable to conclude that
(3.4) |
for any real number , where the covariances are defined as
We now compute these covariances:
Proposition 3.2 (Covariance estimates).
Let , and be a natural number with .
-
(i)
If , then .
-
(ii)
If and or , then .
-
(iii)
If , and there exist coprime natural numbers such that , then .
-
(iv)
If and are not of the form in (ii) or (iii), then .
Remark 3.3.
The term appearing in Proposition 3.2(iii) is also the covariance between and for drawn randomly from the unit interval whenever are natural numbers with for some coprime ; see [24, Section 2]. Indeed, both assertions are proven by the same Fourier-analytic argument, and Proposition 3.2 endows the linear span of the six functions for with an inner product closely related to the norm studied in [24], the structure of which is the key to obtaining a contradiction from our separation hypotheses on .
Proof of Proposition 3.2 assuming Proposition 1.12.
We first dispose of the easy case (ii). If , then , and the claim follows from the triangle inequality; similarly if or actually if . Hence by Lemma 3.1(ii), we may from now on assume that
To handle the remaining cases we use the truncated Fourier expansion
(3.5) |
that holds for any (see e.g. [12, Formula (4.18)]).
Our primary tool is Proposition 1.12. Note that, for , , so that together with the prime number theorem Proposition 1.12 implies that
(3.6) |
for any smooth -periodic and that, for any ,
(3.7) |
Applying (3.6) with a suitable cutoff localized to the region that equals one on chosen so that , we see that, for any we have
Since , the first term on the right-hand side can be computed to be . Thus
(3.8) |
and a similar argument gives
(3.9) |
To prepare for the proofs of parts (i), (iii) and (iv), let us first show that, for , we have
(3.10) |
We use the Fourier expansion (3.5) with . Averaging over and applying (3.9) to handle the first error term, we see that
By the triangle inequality and (3.7), it suffices to show that, for every non-zero integer ,
Recalling that , this estimate follows from a standard integration by parts (see e.g. [12, Lemma 8.9]). Similarly
(3.11) |
Now we are ready to prove (i), (iii), and (iv). Let us start with (i). In light of (3.10), (3.11) and (3.12) with , it suffices to show that
whenever are non-zero integers. Applying a change of variables , we reduce to showing that
(3.13) |
(say), where and . By hypothesis, we have . Since , the derivative of the phase is at least outside of an interval of length at most , and (3.13) now follows from a standard integration by parts (see e.g. [12, Lemma 8.9]). This concludes the proof of (i).
Let us now turn to (iv). In light of (3.10), (3.11) and (3.12) with , it suffices to show that
whenever are non-zero integers. From the hypothesis (iv) and Lemma 3.1(ii) (after factoring out any common multiple of and ), we have . The claim (iv) now follows from integration by parts.
Finally we show (iii). In light of (3.10), (3.11) and (3.12) with , it suffices to show that
Let us first consider those for which . By Lemma 3.1(ii) and similarly to case (iv), the contribution of such pairs is acceptable.
Consider now the case for some non-zero integers . By assumption also for some co-prime positive integers . and hence by Lemma 3.1(ii) which contradicts the assumption unless is a multiple of . On the other hand if is a multiple of , then by Lemma 3.1(ii).
Thus it remains to show that
Since we have, for every ,
and so it suffices to show that
This is trivial for . For the claim follows from the Basel identity
and the tail bound
∎
Now we can get back to proving Proposition 1.9 assuming Proposition 1.12. From Proposition 3.2(i) and (3.4) we see that
(3.14) |
for , where for brevity we introduce the error tolerance
We can now arrive at the desired contradiction by some case analysis (reminiscent of that in [24, 25]) using the remaining portions of Proposition 3.2, as follows.
Case
Applying (3.14) with , we conclude from Proposition 3.2(ii) that
(3.15) |
From Lemma 3.1(iii) we have (and hence also , since these quantities are greater than or equal to ), hence by Proposition 3.2(iii) we have . Furthermore, since , we see from Lemma 3.1(ii) that, for , if and only if . Hence Proposition 3.2(iii) (iv) implies that
Plugging these facts into (3.15) and rearranging, we obtain
But by Proposition 3.2(iii), (iv) we know that , so that
But since (because and ), another application of Proposition 3.2(iii), (iv) gives
which is a contradiction.
Since was the smallest element of , we now thus have for all , and case (ii) of Proposition 3.2 no longer applies.
Case and
We apply (3.14) with to conclude that
(3.16) |
Now if there are no co-prime positive integers such that or , then by Proposition 3.2(iv) we have
On the other hand, if such co-prime integers exist, then if and only if and necessarily , so that by Proposition 3.2(iii) we have in this case
(3.17) |
Since Proposition 3.2(iii) also gives , combining with (3.16) we obtain that
(3.18) |
On the other hand, since , we also have since . By Proposition 3.2(iii), (iv), we have
which can be improved to
(3.19) |
unless both and . Since by Proposition 3.2(iii), (iv) we have , the estimate (3.19) contradicts (3.18).
Case and
By Lemma 3.1(iii), we must have . We apply (3.14) for to obtain
(3.20) |
Since , we have by Proposition 3.2(iii), (iv) (using also Lemma 3.1(ii)) that . Proposition 3.2(iii) also gives . Plugging these into (3.20) and rearranging, we obtain
(3.21) |
Since and , we see from Proposition 3.2(iii), (iv) that
(3.22) |
which can be improved to
(3.23) |
unless and . Now (3.23) contradicts (3.21) since by Proposition 3.2(iii), (iv) .
Hence we can assume that and . But using and Lemma 3.1(ii) this implies that , so that by (3.21) and Proposition 3.2(iii) we obtain
contradicting (3.22).
Remark 3.4.
Morally speaking, the ability to obtain a contradiction here reflects the fact that one cannot have an identity of the form
(3.24) |
for almost all real numbers and some integers , unless one has both and (this type of connection goes back to Landau [17, p. 116]). This latter fact is easily established by inspecting the jump discontinuities of both sides of (3.24), but it is also possible to establish it by computing the covariances of both sides of (3.24) with for various choices of , and the arguments above can be viewed as an adaptation of this latter method.
It remains to establish Proposition 1.12. This will be established in the next section.
4. Equidistribution
In this section we prove Proposition 1.12. Fix . We may assume that is sufficiently large depending on , as the claim is trivial otherwise. If we have then we can replace in both parts of the proposition by with negligible error, so we may assume that either or . In either event we may thus assume that . Next, by partitioning into at most intervals of length at most and using the triangle inequality, it suffices (after suitable adjustment of , ) to assume that . In particular we have
(4.1) |
for all .
Let us first reduce Proposition 1.12(ii) to Proposition 1.12(i). We perform a Fourier expansion
where by integration by parts the Fourier coefficients
obey the bounds
By the triangle inequality, the contributions of those frequencies with is then acceptable. By a further application of the triangle inequality, Proposition 1.12(ii) follows from showing that
whenever are integers with . But this follows from Proposition 1.12(i) by adjusting the values of suitably.
The proof of part (i) will use the standard tools of Vaughan’s identity and Vinogradov’s exponential sum estimates. We state a suitable form of the latter tool here:
Lemma 4.1 (Vinogradov’s exponential sum estimate).
Let , , and . Let be an interval. Let be a smooth function on satisfying for all
(4.2) |
for all integers Assume further that
(4.3) |
Then we have
(4.4) |
where the implied constant is absolute.
Proof.
This is essentially [12, Theorem 8.25] with minor modifications (the modification needed is that we only assume (4.2) for in a certain range, not all integers .).
Let , and as in [12, p. 217], let
Let denote the sum in (4.4). By Taylor’s formula, for any we have
Let , where is interpreted as a multiset. Also let . Then
We take in which case by (4.2) the error term is
(4.5) |
The term in the parenthesis is . Using also (4.3) we see that (4.5) is which is in particular smaller than the right-hand side of (4.4). The sum is precisely the one estimated in [12, pp. 217–225]. The only assumption needed of in that argument is (4.2), and the only restriction on and there is . Hence, we conclude that the lemma holds by following the analysis there verbatim. ∎
We now apply this estimate to obtain an estimate for an exponential sum over integers.
Proposition 4.2 (Exponential sums over integers).
Let , , , , and let be real numbers with . Let be an interval in . Then
(4.6) |
for some absolute constant , where
Proof.
We may assume without loss of generality that is sufficiently large, and is sufficiently large depending on . By hypothesis we have . We may assume that for a large absolute constant , since the claim is trivial otherwise.
Let denote the phase function
Then for any and we have
(4.7) |
where
Since
we conclude that
and
If then from the triangle inequality and (4.1) we have
Consider then the case . We have the upper bound
for all from the triangle inequality. Furthermore, since the function has derivative on , we also have, for all outside of an interval of length , the lower bound
If we set and is sufficiently large, then we conclude from (4.7) and the bounds above that the estimate (4.2) holds for all and all outside the union of intervals of length . The contribution of these exceptional intervals to (4.6) is negligible, and removing them splits up into at most subintervals, so by the triangle inequality it suffices to show that
for any subinterval with the property that (4.2) holds for all and . If , we may apply Lemma 4.1 to conclude that
for some absolute constant , and the claim follows. If instead , we can apply the Weyl inequality [12, Theorem 8.4] with to conclude that
for some absolute constant ; since , we obtain the claim by taking large enough. ∎
Now we prove Proposition 1.12(i). We may assume without loss of generality that , since for we can absorb the terms into the term (and add a dummy term with and , say). By summation by parts (see e.g. [19, Lemma 2.2]), and adjusting as necessary, it suffices to show that
for all intervals . This is equivalent to
where is the von Mangoldt function, since the contribution of the prime powers is negligible. We introduce the quantity
If for some large absolute constant , then the total variation of the phase is , and the claim readily follows from a further summation by parts (see e.g. [19, Lemma 2.2]) and the prime number theorem (with classical error term). Thus we may assume that
(4.8) |
In this case, a change of variables gives
The derivative of the phase here is which, once is large enough, is for all apart from an interval of length at most . Hence by partial integration we get that
if is large enough, so it remains to establish the bound
under the hypothesis (4.8).
By Vaughan’s identity in the form of [12, Proposition 13.4] (with ), followed by a shorter-than-dyadic decomposition, we can write
for , where denotes Dirichlet convolution, and
(the bound for the coefficients arising from Vaughan’s identiy is since ). By the triangle inequality, it thus suffices to establish the Type I estimates
(4.9) |
and
(4.10) |
as well as the Type II estimates
(4.11) |
for all and . The second Type I estimate (4.10) follows from the first Type I estimate (4.9) (replacing with ) and a summation by parts (see e.g. [19, Lemma 2.2]), so it suffices to establish (4.9) and (4.11).
We begin with (4.9). By the triangle inequality, the left-hand side is bounded by
Applying Proposition 4.2 with and and in place of and , we can bound this by
for some constant , and the claim now follows from (4.8).
Now we establish (4.11). We can assume that , as the sum vanishes otherwise. By the triangle inequality, the left-hand side is bounded by
By Cauchy–Schwarz it suffices to show that
(say). Rearranging, it suffices to show that
(4.12) |
where
By Proposition 4.2, we have
for some absolute constant . Bounding and noting that
for all , we obtain the claim (4.12) from (4.8). This completes the proof of Proposition 1.12.
5. Multiplicity of the falling factorial
In this section we establish Theorem 1.8. We first observe that if solves (1.8) for some sufficiently large , then
by Stirling’s formula. Hence we have an analogue of (1.5):
(5.1) |
Next, since
we have
(5.2) |
and we obtain an analogue
(5.3) |
Next, we obtain the following analogue of Proposition 1.9.
Proposition 5.1 (Distance estimate).
Suppose we have two solutions to (1.8) in region . Then one has
(5.4) |
Furthermore, if
(5.5) |
for some , then we additionally have
(5.6) |
for any .
Proof.
We begin with (5.4). We follow the arguments from [1, Proof of Theorem 4]. Taking -valuations of both sides of (1.8) and using (1.11) we have
The summands here vanish unless . Writing , we conclude that
and (5.4) follows.
Now we prove (5.6). Fix . We may assume without loss of generality that , so that by (1.8). We may also assume is sufficiently large depending on , as the claim is trivial otherwise; from (5.5) this also implies that are sufficiently large depending on . Henceforth all implied constants are permitted to depend on . By (5.5) we have
while from (5.3) we have . From this and (5.1) we have
(5.7) |
and then
Similarly for . From (5.4) we conclude that
(5.8) |
In particular and, combining (5.3) with (5.8) and (5.7), also . Hence from (5.5) we see that
(5.9) |
Also we have
(say), hence on exponentiating and using (5.2), (5.9)
(5.10) |
Suppose that we could find a prime obeying the inequalities
(5.11) |
These inequalities imply in particular that
so that these quantities respectively equal and . Consequently, if (5.11) hold, then we would have
(5.12) |
and (since )
(5.13) |
Now (5.12) implies that divides , while (5.13) implies that does not divide . This contradicts the assumption . Thus there cannot be any prime obeying (5.11).
Let be a suitable smooth -periodic function supported on the region
chosen so that and , and let similarly be a smooth -periodic function supported on the region
chosen so that when and . Let be a prime drawn uniformly from all the primes in . As does not obey (5.11), we have
and hence by Proposition 1.12 (and dyadic decomposition)
or on changing variables
(5.14) |
On the other hand, by (5.10), (5.9) we have
(5.15) |
We perform a Fourier expansion
where by integration by parts the Fourier coefficients obey the bounds
Thus (5.14) can then be rewritten as
(5.16) |
By (5.15) and integration by parts, one readily establishes the bound
for . Thus the total contribution to the left-hand side of (5.16) from the terms with is negligible, and hence
Since and equals on , we have
(5.17) |
where
However, direct calculation shows that when , we have
when , we have
and, when , we have
Hence (5.17) can only hold if
giving the claim (5.6). ∎
Now we adapt the analysis from Section 2. We extend the falling factorial to real by the formula
From the increasing nature of the digamma function we see that for fixed , increases from when goes from to infinity. Applying the inverse function theorem, we conclude that for any sufficiently large there is a unique smooth function such that for any with , one has and
(5.18) |
Indeed, one could simply set , where is the function studied in Section 2.
We have an analogue of Proposition 2.1:
Proposition 5.2 (Estimates on the first few derivatives).
Let , and let be sufficiently large depending on with . Then
(5.19) |
In the range , we have
(5.20) |
and in the range , one has
(5.21) |
Proof.
If we differentiate (5.22) we obtain
(5.23) |
In particular we obtain the first derivative formula
(5.24) |
In the regime we can then obtain (5.20) from (2.12), (2.1), (5.19).
Now we can establish Theorem 1.8. Let be a large absolute constant, let , and suppose that is sufficiently large depending on . Let be the integer solution to (1.8) in the region with a maximal value of ; we may assume that such a solution exists, since we are done otherwise. If is any other solution in this region, then and . Note that are sufficiently large depending on . From Proposition 5.1 and (5.3) we have
and thus and . Hence and
(5.26) |
First suppose that . Here we will exploit the fact that grows rapidly as decreases. From Proposition 5.1 we have
On the other hand, from (5.20) and the mean value theorem we have
thanks to (5.1) and the trivial bound . Thus we have
but this contradicts the hypothesis .
Now suppose we are in the regime
Here we will take advantage of the convexity properties of . From (5.26), lies in the interval . By (5.19), for all in this interval, we have
and by (5.21), we have
since . Applying Lemma 2.2 with , we see (for large enough) that there are at most two integers in this interval with an integer, giving Theorem 1.8 follows in this case.
It remains to handle the case
(5.27) |
Recall from (5.26) that lies in the interval . From (5.3), (5.27) we have
so . From (5.3) again we thus also have
From (1.8) we have
The right-hand side is at most . This implies that , since otherwise the left hand side would be, for any ,
which contradicts the bound for the right hand side when is sufficiently large.
In particular we have from the triangle inequality that
Making the change of variables , it now suffices to show that there are at most two integer solutions to the equation
(5.28) |
in the regime . We write this equation (5.28) as
or equivalently
where , and is the inverse of the gamma function. Here we will exploit the very slowly varying nature of . From Stirling’s formula we have
whenever . Taking the logarithmic derivative of the equation
we have
Hence by (2.1)
in the regime . In particular, for two solutions to (5.28) in this regime we have
(5.29) |
For fixed there is at most one solving (5.28). We conclude that for two distinct solutions to (5.28) in this regime, we have , and hence the separation
Now suppose we have three solutions to (5.28) in this regime. We can order , so that . From the preceding discussion we have
and
If is a power of that divides an integer in as well as an integer in , then we must therefore have , so that . Thus, there must exist such that the interval only contains multiples of when . Fix this . Taking -adic valuations of (5.28) using (1.11) we have
and
and thus
(5.30) |
Since
(5.31) |
we certainly have , and the right-hand side of (5.30) is at least
By construction, the terms on the left-hand side of (5.30) vanish unless , in which case they are equal to . Thus the left-hand side of (5.30) is at most . Thus
But from (5.29) one has . Hence . But this contradicts (5.31). This concludes the proof of Theorem 1.8.
References
- [1] H. L. Abbott, P. Erdős, and D. Hanson. On the number of times an integer occurs as a binomial coefficient. Amer. Math. Monthly, 81:256–261, 1974.
- [2] Milton Abramowitz and Irene A. Stegun. Handbook of Mathematical Functions with Formulas, Graphs, and Mathematical Tables. Dover, New York City, ninth Dover printing, tenth GPO printing edition, 1964.
- [3] È. T. Avanesov. Solution of a problem on figurate numbers. Acta Arith., 12:409–420, 1966/67.
- [4] F. Beukers, T. N. Shorey, and R. Tijdeman. Irreducibility of polynomials and arithmetic progressions with equal products of terms. In Number theory in progress, Vol. 1 (Zakopane-Kościelisko, 1997), pages 11–26. de Gruyter, Berlin, 1999.
- [5] Aart Blokhuis, Andries Brouwer, and Benne de Weger. Binomial collisions and near collisions. Integers, 17:Paper No. A64, 8, 2017.
- [6] J. W. Bober. Factorial ratios, hypergeometric series, and a family of step functions. J. Lond. Math. Soc. (2), 79(2):422–444, 2009.
- [7] J. William Bober. Integer ratios of factorials, hypergeometric functions, and related step functions. ProQuest LLC, Ann Arbor, MI, 2009. Thesis (Ph.D.)–University of Michigan.
- [8] Yann Bugeaud, Maurice Mignotte, Samir Siksek, Michael Stoll, and Szabolcs Tengely. Integral points on hyperelliptic curves. Algebra Number Theory, 2(8):859–885, 2008.
- [9] Harald Cramér. On the order of magnitude of the difference between consecutive prime numbers. Acta Arith., 2:23–46, 1936.
- [10] B. M. M. de Weger. A binomial Diophantine equation. Quart. J. Math. Oxford Ser. (2), 47(186):221–231, 1996.
- [11] Benjamin M. M. de Weger. Equal binomial coefficients: some elementary considerations. J. Number Theory, 63(2):373–386, 1997.
- [12] H. Iwaniec and E. Kowalski. Analytic number theory, volume 53 of American Mathematical Society Colloquium Publications. American Mathematical Society, Providence, RI, 2004.
- [13] H. Jenkins. Repeated binomial coefficients and high-degree curves. Integers, 16:Paper No. A69, 14, 2016.
- [14] Daniel Kane. New bounds on the number of representations of as a binomial coefficient. Integers, 4:A7, 10, 2004.
- [15] Daniel M. Kane. Improved bounds on the number of ways of expressing as a binomial coefficient. Integers, 7:A53, 7, 2007.
- [16] P. Kiss. On the number of solutions of the Diophantine equation . Fibonacci Quart., 26(2):127–130, 1988.
- [17] Edmund Landau. Collected works. Vol. 1. Thales-Verlag, Essen, 1985. With a contribution in English by G. H. Hardy and H. Heilbronn, Edited and with a preface in English by L. Mirsky, I. J. Schoenberg, W. Schwarz and H. Wefelscheid.
- [18] D. A. Lind. The quadratic field and a certain Diophantine equation. Fibonacci Quart., 6(3):86–93, 1968.
- [19] Kaisa Matomäki, Maksym Radziwiłł, and Terence Tao. Correlations of the von Mangoldt and higher divisor functions I. Long shift ranges. Proc. Lond. Math. Soc. (3), 118(2):284–350, 2019.
- [20] L. J. Mordell. On the integer solutions of . Pacific J. Math., 13:1347–1351, 1963.
- [21] Ákos Pintér. A note on the Diophantine equation . Publ. Math. Debrecen, 47(3-4):411–415, 1995.
- [22] David Singmaster. Research Problems: How Often Does an Integer Occur as a Binomial Coefficient? Amer. Math. Monthly, 78(4):385–386, 1971.
- [23] David Singmaster. Repeated binomial coefficients and Fibonacci numbers. Fibonacci Quart., 13(4):295–298, 1975.
- [24] K. Soundararajan. Integral Factorial Ratios. arXiv e-prints, page arXiv:1901.05133, January 2019.
- [25] K. Soundararajan. Integral factorial ratios: irreducible examples with height larger than 1. Philos. Trans. Roy. Soc. A, 378(2163):20180444, 13, 2020.
- [26] Roelof J. Stroeker and Benjamin M. M. de Weger. Elliptic binomial Diophantine equations. Math. Comp., 68(227):1257–1281, 1999.
- [27] Craig A. Tovey. Multiple occurrences of binomial coefficients. Fibonacci Quart., 23(4):356–358, 1985.