Upper bounds for moments of zeta sums
Abstract.
We establish upper bounds for moments of zeta sums using results on shifted moments of the Riemann zeta function under the Riemann hypothesis.
Mathematics Subject Classification (2010): 11M06
Keywords: zeta sums, moments
1. Introduction
Character sums have been extensively studied in the literature as they have many important applications in number theory. In [Harper23], A. J. Harper studied sizes of the sums given by
where and is a non-principal Dirichlet character modulo a large prime . Following the notation in [Harper23], we shall refer the first sum above as a zeta sum.
Building on his work concerning moments of random multiplicative functions, Harper [Harper23] showed that the low moments of zeta sums (and also character sums) have “better than squareroot cancellation”. More precisely, he proved that uniformly for and ,
where
In [Szab], B. Szabó obtained sharp upper bounds on shifted moments of Dirichlet -function at points on the critical line and then applied the results to show under the generalized Riemann hypothesis (GRH) that for a fixed real number and a large integer , we have for ,
(1.1) |
where denotes the set of primitive Dirichlet characters modulo and denotes Euler’s totient function. A similar result is given in [G&Zhao2024] for moments of quadratic Dirichlet character sums under GRH.
We note that the zeta sums behave very much like character sums. In fact, other than periodicity, the function for a fixed is totally multiplicative and is unimodular. Thus, one expects to establish results analogous to (1.1) for moments of zeta sums and it is the aim of this paper to achieve this. For this, we define for real numbers ,
We are interested in bounding from the above. We first observe that as pointed out in [Harper23] that using [Ivic, Lemma 1.2] that when is large and , we have . Moreover, note that by [Montgomery94, Chap. 7,(34)], we have when . As the term dominates when , we deduce that when is large enough and , we have for any real ,
We may therefore focus on the case . In fact, we shall assume that for any throughout the paper as this is often the most interesting case regarding character sums. For this case, we establish the following result concerning the size of under the Riemann hypothesis (RH).
Theorem 1.1.
With the notation as above and assume the truth of RH. For any real number , large real numbers such that for any , we have
(1.2) |
We note that by Hölder’s inequality, we have for any real number ,
The above together with Theorem 1.1 then implies that for any , upon choosing large enough. We remark here that it is shown in [Harper23] that one has , so that our result above improves upon this when is slightly smaller than .
Our proof of Theorem 1.1 follows the approaches in [Szab]. A key ingredient used in the proof is a result of M. J. Curran [Curran] on shifted moments of the Riemann zeta function .
2. Preliminaries
In this section, we include some results concerning shifted moments of the Riemann zeta function. The first one is quoted from [Curran, Theorem 1.1].
Proposition 2.1.
With the notation as above and assume the truth of RH. Let be a fixed integer and be fixed non-negative real numbers. Let be a large real number and let be a real -tuple with for a fixed . Then
Here the implied constant depends on and the but not on or the .
We remark here that [Curran, Theorem 1.1] is stated for but an inspection of the proof indicates that it continues to hold for with any . We also note that
where the last estimation above follows from [MVa1, Corollary 1.17]. Also by [MVa1, Corollary 1.17], we see that for , we have
Moreover, by [MVa1, Corollary 13.16], we see that for , we have under the RH that
Based on these observations, for be given as in Proposition 2.1, we now introduce the function defined by
(2.1) |
The above discussions together with Proposition 2.1 allows us to derive the following simplified version on shifted moments of the Riemann zeta function.
Corollary 2.2.
With the notation as above and assume the truth of RH. Let be a fixed integer and be fixed non-negative real numbers. Let be a large real number and let be a real -tuple with for a fixed . Then
Here the implied constant depends on and the but not on or the .
We also note the following upper bounds on moments of the Riemann zeta function, which can be obtained by modifying the proof of [Curran, Theorem 1.1].
Lemma 2.3.
With the notation as above and assume the truth of RH. Let be a fixed integer and be fixed non-negative real numbers. Let be a large real number and let be a real -tuple with for any fixed . Then for large real number and ,
We end this section by including an estimation for an average of the moments of the Riemann zeta function.
Proposition 2.4.
With the notation as above and assume the truth of RH. We have for any real numbers , with being fixed,
(2.2) |
Proof.
Our proof follows closely that of [Szab, Proposition 3]. Without loss of generality, we prove (2.2) only for the case where the sign in front of is in what follows. We have by symmetry that for each fixed and any fixed integer ,
(2.3) |
where .
We let and for . We further denote .
Observe that for any , we have . Thus if we denote , then form a partition of . We apply Hölder’s inequality twice to deduce that for ,
(2.4) |
We denote for ,
We then deduce from (2.3) and (2.4) that
(2.5) |
where
We now distinguish two cases in the last summation of (2.5) according to the size of .
Case 1: . First note that for any fixed , is in a fixed region of size . For fixed and , is in a fixed region of size as ). Similar considerations then imply that the volume of the region is . Also, by the definition of we have so that , where is the function defined in (2.1). We deduce from the definition of that , so that , which implies that . Similarly, we have for any . Moreover, we have for any , so that we have . We then deduce from Theorem 2.1 that for ,
Here, we adopt the convention throughout the paper that any empty product is defined to be and any empty sum is defined to be . Observe that we have , so that
(2.6) |
We now set to see that in this case deduce from the above that
We deduce from (2.6) and the above that
(2.7) |
where the last estimation above follows by noting that we have .
Case 2 . The volume of the region is . For each , we have . Also, similar to Case 1, we have for .
As , we see that
(2.8) |
We now set to see that in this case deduce from the above that
3. Proof of Theorem 1.1
3.1. Initial treatments
As we explained in the paragraph below Theorem 1.1, it suffices to establish (1.2). We let be a non-negative smooth function supported on , satisfying for with a parameter to be chosen later and such that for all integers . We denote the Mellin transform of by and we observe that repeated integration by parts gives that, for any integer and ,
(3.1) |
We insert the function into the definition of and apply the triangle inequality to obtain that
(3.2) |
We further apply the Mellin inversion to obtain that
Observe that by [iwakow, Corollary 5.20] that under RH, we have for and any ,
(3.3) |
The bounds in (3.1) and (3.3) allow us to shift the line of integration in (3.2) to to obtain that
(3.4) |
We split the last integral above according to whether or not for some to be specified later, obtaining
It follows from the above that in order to establish Theorem 1.1, it remains to prove the following results.
Lemma 3.2.
With the notation as above and assume the truth of RH. We have for large enough and any real number ,
(3.6) |
Lemma 3.3.
With the notation as above and assume the truth of RH. We have for large enough and any real number ,
(3.7) |
Lemma 3.4.
With the notation as above and assume the truth of RH. We have for large enough and any real number ,
(3.8) |
3.5. Proof of Lemma 3.2
We apply (3.1) and Hölder’s inequality to deduce that, as ,
(3.9) |
Now we note that by (3.1),
(3.10) |
We also apply (3.3) to see that
(3.11) |
where the last estimation above follows from [Sound2009, Corollary B], which asserts that
3.6. Proof of Lemma 3.3
We deduce from (3.5) by symmetry and Hölder’s inequality that,
3.7. Proof of Lemma 3.4
We apply the Cauchy-Schwarz inequality to see that
(3.12) |
We first note that it follows from [iwakow, Theorem 9.1] that for arbitrary complex numbers , we have for and any ,
We apply the above to and keep in mind our assumption that to see that
(3.13) |
We next note that
(3.14) |
By the remark made in the paragraph below Theorem 1.1, we see that
(3.15) |
To estimate the first expression on the right-hand side of (3.14), we apply Perron’s formula as given in [MVa1, Corollary 5.3] to see that
(3.16) |
where
(3.17) |
Here the last estimation above follows from [MVa1, Corollary 1.17]. We now consider the moments of the horizontal integrals in (3.16). We may assume that , otherwise the lemma is trivial. By symmetry we only need to consider only one of them. Note that we have in that range and , which allows us to apply Hölder’s inequality to get
(3.18) |
where the last estimation above follows from Lemma 2.3, which implies that for , we have under RH,
We treat the moments of the vertical integral in (3.16) using Hölder’s inequality (by noting that ), Proposition 2.4 and the assumption to see that
(3.19) |
We conclude from (3.16)-(3.19) that
(3.20) |
We then deduce from (3.12)-(3.15), (3.20) and recall that we have to see that the estimation given in (3.8) is valid. This completes the proof of the lemma.
Acknowledgments. The author is supported in part by NSFC grant 11871082. This work grows out of discussions with Changhao Chen and Nankun Hong on large inequalities for Dirichlet polynomials when the author visited Anhui University in April 2024. The author is debt to them for the inspiration of this paper and many helpful suggestions on the writing of the manuscript.