Trinomials, singular moduli and Riffaut’s conjecture
Abstract
Riffaut [22] conjectured that a singular modulus of degree cannot be a root of a trinomial with rational coefficients. We show that this conjecture follows from the GRH and obtain partial unconditional results.
1 Introduction
A singular modulus is the -invariant of an elliptic curve with complex multiplication. Given a singular modulus we denote by the discriminant of the associated imaginary quadratic order. We denote by the class number of the imaginary quadratic order of discriminant . Recall that two singular moduli and are conjugate over if and only if , and that there are singular moduli of a given discriminant . In particular, . For all details, see, for instance, [12, §7 and §11].
In this article we study the following question.
Problem 1.1.
Can a singular modulus of degree be a root of a trinomial with rational coefficients?
Here and below a trinomial is an abbreviation for a monic trinomial non-vanishing at ; in other words, a polynomial of the form with and .
This problem emerged in the work of Riffaut [22] on the effective André-Oort conjecture. We invite the reader to consult the article of Riffaut for more context and motivation. Riffaut conjectured that the answer is negative, but, as he admits, “much about trinomials is known, but this knowledge is still insufficient to rule out such a possibility”.
We believe, however, that the problem is motivated on its own, independently of Riffaut’s work, because it is of interest to learn more about the relationship between two very classical objects like rational trinomials and singular moduli.
In Section 9 we prove that Riffaut’s conjecture follows from the GRH.
Theorem 1.2.
Assume the Generalized Riemann Hypothesis for the Dirichlet -functions. Then a singular modulus of degree at least cannot be a root of a trinomial with rational coefficients.
We also obtain some partial unconditional results. To state them, we have to introduce some definitions that will be used throughout the article.
Let be an imaginary quadratic discriminant. We call trinomial discriminant if and the singular moduli of discriminant are roots of a trinomial with rational coefficients. If this trinomial is of the form then we say that is a trinomial discriminant of signature .
Note that a trinomial discriminant may, a priori, admit several signatures. However, there can be at most finitely many of them, and all of them can be effectively computed in terms of . This follows from the results of article [3] and the following property: for any singular modulus and positive integer we have , see [22, Lemma 2.6].
Now we are ready to state our unconditional results. First of all, in Sections 6 and 10 we show that a trinomial discriminant cannot be too small, and, with at most one exception, cannot be too large either.
Theorem 1.3.
Every trinomial discriminant satisfies , and at most one trinomial discriminant satisfies . In particular, the set of trinomial discriminants is finite.
Next, in Section 7 we show that trinomial discriminants are of rather special form.
Theorem 1.4.
Every trinomial discriminant is of the form or , where and are distinct odd prime numbers. In particular, trinomial discriminants are odd and fundamental.
Finally, in Section 12 we show that trinomials vanishing at singular moduli are themselves quite special.
Theorem 1.5.
Let be a trinomial discriminant of signature . Assume that . Then .
Plan of the article.
In Section 2 we remind general facts about singular moduli, to be used throughout the article. In Section 3 we introduce and study the basic notion of suitable integer. A positive integer is called suitable for a discriminant if there exists such that and belongs to the standard fundamental domain (plus a certain coprimality condition must be satisfied). We give various recipes for detecting suitable integers of arbitrary discriminants, so far without any reference to trinomials.
In Section 4 we obtain some metrical properties of roots of trinomials, both in the complex and non-archimedean setting. Applying them to singular moduli that are roots of a trinomial, we obtain the “principal inequality”, a basic tool instrumental for the rest of the article. In Section 5 we use the “principal inequality” to study suitable integers of trinomial discriminants: they turn out to be very large, of order of magnitude , and densely spaced.
In Section 6 we show that trinomial discriminants cannot be too small (the first statement of Theorem 1.3). The argument uses the results of the previous sections and computations with PARI [26] and SAGE [27].
In Section 7 we prove Theorem 1.4 on the structure of trinomial discriminants, using careful analysis of suitable integers. In the follow-up Section 8 we show that suitable integers of trinomial discriminants are prime numbers.
In Section 9 we obtain the conditional result (Theorem 1.2) and in Section 10 we obtain an unconditional upper bound for all but one trinomial discriminant (the second statement of Theorem 1.3). The principal arguments of these sections already appeared elsewhere [15, 17, 21], and we only had to adapt them to our situation.
Acknowledgments.
We thank Michael Filaseta, Andrew Granville, Sanoli Gun, Tanmay Khale, Chazad Movahhedi, Olivier Ramaré, Igor Shparlinski and the mathoverflow user Lucia for helpful suggestions.
We are most grateful to the anonymous referees, who corrected several mistakes, and made many very useful comments that helped us to improve the presentation.
All calculations were performed using PARI [26] or SAGE [27]. We thank Bill Allombert and Karim Belabas for the PARI tutorial. The reader may consult https://github.com/yuribilu/trinomials to view the PARI script used for this article.
Yuri Bilu thanks the University of Valparaiso and the Institute of Mathematical Sciences (Chennai) for stimulating working conditions.
Florian Luca worked on this project during visits to the Institute of Mathematics of Bordeaux from March to June 2019, and to the Max Planck Institute for Mathematics in Bonn from September 2019 to February 2020. He thanks these institutions for hospitality and support.
Amalia Pizarro-Madariaga thanks the Max Planck Institute for Mathematics in Bonn for hospitality and stimulating working conditions.
1.1 Some general conventions
Throughout the article we use as a quantitative version of the familiar notation: means that .
We denote by the Kronecker symbol. The general definition of the Kronecker symbol can be found, for instance, on page 202 of [14]. In this article, however, we will always use it in the special case when is a discriminant (and, in particular, ) and is a prime number. In this case is just the Legendre symbol if is an odd prime, and
We use the standard notation for the number of prime divisors (counted without multiplicities). If is a prime of a number field then we denote the -adic valuation (normalized so that its group of values is ).
2 Generalities on singular moduli
In this section we summarize some properties of singular moduli used in the article. Unless the contrary is stated explicitly, everywhere below the letter stands for an imaginary quadratic discriminant; that is, and .
Denote by the standard fundamental domain: the open hyperbolic triangle with vertices
together with the geodesics and . It is well-known (see, for instance, [4, Proposition 2.5] and the references therein) that there is a one-to-one correspondence between the singular moduli of discriminant and the set of triples of integers with , satisfying and
If then belongs to , and the corresponding singular modulus is .
We call a singular modulus dominant if in the corresponding triple we have . For every there exists exactly one dominant singular modulus of discriminant .
The inequality
holds true for every ; see, for instance, [5, Lemma 1]. In particular, if is a singular modulus of discriminant corresponding to the triple then
This implies that
(2.1) | |||||
(2.2) | |||||
(2.3) |
These inequalities will be systematically used in the sequel, sometimes without special reference.
3 Suitable integers
Everywhere in this section is an imaginary quadratic discriminant and a positive integer.
Call an integer suitable for if there exist such that . Note that is always suitable, and that a suitable satisfies : this follows from the fact that belongs to the standard fundamental domain, or directly from the relation and the inequalities . Moreover, equality is possible only when and , and we have the strict inequality when .
In the following proposition we collect some useful tools for detecting suitable integers.
Proposition 3.1.
-
1.
Assume that , that is a square , and that . Then is suitable for .
-
2.
Let be suitable for and a divisor of such that . Then is suitable for as well.
-
3.
Let be a prime number satisfying and . Then is suitable for .
-
4.
Assume that is even, that , and that . Then or is suitable for .
-
5.
Assume that . Let be an integer such that . Then is suitable for . In particular, if then and are suitable for .
-
6.
Assume that , where and are positive odd integers with . Then is suitable for if is odd, and is suitable if is even.
The proof requires a simple lemma, telling that exhaust all squares .
Lemma 3.2.
Let be a positive integer and an integer. Then there exists an integer satisfying and .
Proof.
Since implies that we may assume that . Now set . ∎
Proof of Proposition 3.1.
- 1.
- 2.
- 3.
- 4.
- 5.
- 6.
∎
We want to extend item 6 to the case when and are not coprime. This is possible in the special case when is an odd prime power with even exponent.
Proposition 3.3.
Let be an odd prime number and a positive integer such that . Write . (In particular, .) Define
Assume that . Then is suitable for .
Proof.
Setting
a routine verification shows that
(3.1) | |||||
(3.2) | |||||
(3.3) | |||||
The only part of the verification which is not completely trivial is coprimality of the entries of the triple in (3.3). To see this, just note that the difference of the third and the first entry is . This already proves coprimarity in the case of even , when , because none of the entries is divisible by . And if is odd, in which case , we simply note that the middle entry is odd, because (and this is because both and are ).
Thus, we have proved that is suitable for when . It remains to note that . Indeed, for the function admits global maximum at , and this maximum is equal to . ∎
Proposition 3.4.
Let be a prime number or . If then admits a suitable integer distinct from and .
Proof.
There can exist at most integers satisfying
Therefore if the only suitable integers for are and then the set consists of at most elements. Hence . This contradicts the assumption , because the largest discriminant with class number is , see Subsection 6.2. ∎
4 Roots of trinomials and the principal inequality
In this section we establish some elementary metrical properties of roots of trinomials, both in the complex and -adic setting. The complex result, applied to singular moduli, will yield that non-dominant singular moduli of trinomial discriminant are very close to each other in absolute value. We call this the “principal inequality”; it will indeed be of crucial importance for the rest of the article.
Everything is based on the following property: if are roots of a trinomial then
(4.1) |
4.1 The complex case
We start from the following observation.
Proposition 4.1.
Let be roots of a trinomial . Assume that . Then
(4.2) |
and
(4.3) |
While (4.3) is weaker than (4.2), it has the advantage that and are not involved. Hence it may be used to check whether given numbers are roots of some trinomial. This will be used in Section 6.
Proof.
Recall that we call a trinomial discriminant of signature if and the singular moduli of discriminant are roots of a trinomial of the form with rational coefficients.
Corollary 4.2 (The “principal inequality”).
Let and be non-dominant singular moduli of trinomial discriminant of signature . Assume that and . Then
(4.4) |
In particular, we have the inequalities
(4.5) | ||||
(4.6) | ||||
(4.7) |
4.2 The non-archimedean case
In this subsection is a field of characteristic complete with respect to a non-archimedean absolute value . The results of this subsection will be used only in Subsection 6.2 and Section 12.
Proposition 4.3.
-
1.
The roots of a trinomial may have at most distinct absolute values. In other words, the set consists of at most elements.
-
2.
Assume now this set has exactly distinct elements and , with . Then for any roots with we have
(4.8) and for any roots with we have
(4.9)
Proof.
If are roots with then the determinant in (4.1) has the term which has a strictly larger absolute value than the other terms. Hence the determinant cannot vanish. This proves item 1.
Now let be roots with . Again expanding the determinant, we obtain , which proves (4.8).
It turns out that a trinomial having roots of distinct absolute values must have, in the algebraic closure , exactly “small” roots and “big” roots.
Proposition 4.4.
Proof.
Denote the roots of in counted with multiplicities. Let be the number of roots of absolute value . The coefficient of in is given by
In this sum exactly one term is of absolute value , while the other terms are of strictly smaller absolute value. Hence the coefficient of does not vanish, which implies that . ∎
Remark 4.5.
As the anonymous referees suggested, item 1 of Proposition 4.3, and Proposition 4.4 can be proved using the “Newton polygons”, as in Section 6.3 of [10]. While our proof of Proposition 4.4 does not use Newton polygons, the present simple argument was inspired by the referees’ comments. Our initial statement of Proposition 4.4 was weaker, and the proof was long and ugly.
5 Suitable integers for trinomial discriminants
In this section denotes a trinomial discriminant unless the contrary is stated explicitly. The following property is crucial.
Proposition 5.1.
Let be a trinomial discriminant admitting at least distinct suitable integers other than . Let be suitable for . Then we have if , and if .
(It follows from the proof that, assuming large enough, can be replaced by any .)
Here are some immediate consequences.
Corollary 5.2.
Let be trinomial and a prime number such that . Then if and if .
Proof.
Corollary 5.3.
Let be trinomial, , and suitable for . Assume that . Then is a prime number, and . Moreover, if then .
Proof.
Before proving Proposition 5.1, we obtain the following preliminary statement.
Proposition 5.4.
Let be a trinomial discriminant, . Then it admits at most one suitable satisfying . If then can be replaced by .
Proof.
Assume that admits suitable and satisfying
with to be specified later. Let be triples where our occur, and the corresponding singular moduli. We have
A calculation shows that
when and , or when and . (In fact, in the latter case can be replaced by .) Hence . Comparing this with the “principal inequality” (4.7), we obtain
which is impossible because . ∎
Proof of Proposition 5.1.
Let and be the smallest suitable integers for . By the assumption, admits a suitable integer . Proposition 5.4 implies that , where can be replaced by if .
Now assume that . We will see that this leads to a contradiction. We again let and be singular moduli for and . Then
(5.1) |
When , the right-hand side of (5.1) does not exceed . Then
which clearly contradicts (4.7).
Now assume that and . Then instead of (5.1) we have
The right-hand side is again bounded by , and we complete the proof in the same way. ∎
Another important property of suitable integers is that they are of the same order of magnitude.
Proposition 5.5.
Let be suitable integers for a trinomial discriminant . Then .
6 Small discriminants
Recall that we call a discriminant trinomial if and singular moduli of this discriminant are roots of a trinomial with rational coefficients.
In this section we show that trinomial discriminants must be odd and not too small.
Theorem 6.1.
Let be trinomial discriminant. Then is odd and satisfies .
This theorem is an immediate consequence of Theorem 6.2, Proposition 6.3 and Theorem 6.6 proved below.
Most of the arguments of this section are computer-assisted. We use packages PARI [26] and SAGE [27]. The reader may consult https://github.com/yuribilu/trinomials to view our PARI scripts. All computation were performed on a personal computer with 2.70 GHz processor and 16.0 GB RAM.
6.1 Small discriminants with class number larger than
In this subsection we prove the following.
Theorem 6.2.
There are no trinomial discriminants with and .
First of all, we show that sufficiently large trinomial discriminants must be odd.
Proposition 6.3.
Let be a trinomial discriminant with . Then is odd.
Proof.
Next, we dispose of the discriminants in the range .
Proposition 6.4.
There are no trinomial discriminants with and .
Proof.
The proof is by a PARI script. For every such our script finds singular moduli of discriminant such that inequality (4.3) does not hold. More precisely, for every in the range , except , our script finds satisfying
For the exceptional one has the same inequality, but with 0.001 instead of .
The total running time was less than 6 minutes. ∎
Unfortunately, this method fails for discriminants with class number . Each of those admits one real singular modulus (the dominant one) and two complex conjugate singular moduli ; for them inequality (4.3) is trivially true. In Subsection 6.2 we use a totally different method to show that discriminants with cannot be trinomial.
To dismiss larger discriminants, we show that they admit a small prime with .
Proposition 6.5.
-
1.
Every odd discriminant with admits a prime such that .
-
2.
Every odd discriminant with admits a prime such that .
Proof.
We use again a PARI script. It works in 3 steps. In what follows is a (large) positive number and is the largest prime number such that
the product being over primes in the indicated range. Let also be the first two primes larger than . We have , , for and , , for .
Building the list of residues
In the first step we use successively the Chinese Remainder Theorem to generate the list of residues such that and for every odd prime . There are
such residues altogether, which gives residues for and residues for .
Building the list of discriminants
For every residue class from the previous list, and every residue class with we find the smallest negative number belonging to both, and we include this in the list only if . We obtain the full list of odd discriminants with the properties and for all (including ). We end up with 32567861 discriminants for and with 4450 discriminants for .
Sieving
Now we sieve our list modulo every prime , by deleting from the list the discriminants with . The list was emptied after for and after for .
The bottleneck steps are building the list of discriminants and sieving modulo : they require most of processor time and memory. The total running time was less than 5 minutes for and less than 0.1 second for . ∎
Now we are ready to prove Theorem 6.2.
6.2 Discriminants with class number
In this subsection we prove the following theorem.
Theorem 6.6.
There are no trinomial discriminants with class number .
There are 25 discriminants with class number : the full list of them, found by SAGE command cm_orders, is the top row of Table 1 below. As we have seen, they cannot be dismissed using the method of Proposition 6.4. Instead, we use a version of the argument from [19].
We start by some general discussion. Assume that we are in the following situation: is a -irreducible polynomial with splitting field of degree , and is a prime number such that
, but . | (6.1) |
Assume further that
is unramified in and inert in the quadratic subfield of . | (6.2) |
(The latter assumption can be suppressed, but it holds in all cases that interest us, and many arguments below simplify when it is imposed.)
Let be the roots of . Since , every must be divisible by a prime ideal above . Since , these ideals must be distinct. Hence splits in in at least distinct primes. Since is inert in the quadratic subfield, it splits in exactly primes: , with . Since is unramified, we have for .
Now assume that divides a trinomial . Then, setting , Proposition 4.3 implies that
(6.3) |
To make use of this, we need the following classical fact. Some versions of it were known already to Lucas [20] or even earlier, but we prefer to include the proof for the reader’s convenience.
Proposition 6.7.
Let be a prime number, a number field, a prime of and a -adic unit, but not a root of unity. Assume that is unramified over . Let be the order of (the smallest positive integer such that ). Then for every positive integer such that we have . Furthermore, for we have
(6.4) |
and for we have
(6.5) |
Proof.
We only have to prove (6.4) and (6.5), because is obvious. Note also that divides , the order of the multiplicative group ; in particular, .
Assume first that . Then
Since , the second factor is congruent to modulo . In particular, it is not divisible by . Hence , and both (6.4), (6.5) are true in this case.
Now assume that and . Write . Since , we have . Hence
Since is unramified and , we have as well. Hence
and (6.4) follows by induction in .
Now let . When we can prove that in the same way as we proved when . Hence (6.5) is true is this case. Now assume that and write
Since , we have and . Since is unramified, this implies that , or, in other words,
and we complete the proof by induction as before. ∎
Corollary 6.8.
Proof.
We have by (6.1) and (6.3). Proposition 6.7 implies that . Furthermore, for we use (6.1) and (6.4) to obtain
Thus, both and divide . Since , this proves (6.6) in the case .
In the case the same argument gives
We have clearly , which implies that the maximum above is . ∎
Remark 6.9.
-
1.
The lower bound (6.6) is quite strong, but to profit from it in practical situations, we must be able to calculate and . We do it as follows. Let be the monic polynomial of degree whose roots are , and its discriminant. Then , because and . In particular, is the smallest positive for which and
-
2.
Polynomials are very easy to calculate consecutively. Indeed,
and for general we have ,
which implies the recurrence relations
Theorem 6.6 is an easy consequence of the following statement.
Proposition 6.10.
Let be a discriminant with class number . Assume that is trinomial of signature . Then , where can be found in Table 1.
Proof.
It is again by a PARI script. Let be the Hilbert Class Polynomial of (the monic polynomial whose roots are the singular moduli of discriminant ). A verification with PARI shows that, for each of the 25 possible , it has one real root (the dominant one) and two complex conjugate roots.
Let be the largest positive integer such that . We want to apply Corollary 6.8 to the polynomial
We pick a prime number such that , but ; our script shows the existence of at least one such in all the 25 cases. If there are several with this property, we take the largest of them. The prime chosen for each can be seen in Table 1. As we verified, each of our primes satisfies , which means that it is unramified in (the splitting field of ) and inert in , the quadratic subfield of , so we are indeed in the set-up of Corollary 6.8.
We calculate and as defined in Corollary 6.8, using the method outlined in Remark 6.9. Their values are in Table 1 as well. Corollary 6.8 implies that (6.6) holds true. As our script verified, we always have . This implies the lower bound
(6.7) |
Now let us bound in terms of from above. Let be the real root of and the complex conjugate roots. (Our definition of implies that are the singular moduli of discriminant , of which one is real and the other two are complex conjugates.) Set . Then we have inequality (4.2):
A quick calculation with PARI shows that , which implies that
(6.8) |
On the other hand, we can estimate from below using the classical Liouville inequality: if is a non-zero complex algebraic number of degree , then
Here is the usual absolute logarithmic height444There is no risk of confusing the height and the class number , not only because the former is roman and the latter is italic, but also because the class number notation does not occur in this subsection, and heights do not occur outside this subsection..
Our is an algebraic number of degree , and not a root of unity, because among its -conjugates there is , of absolute value distinct from . Hence we may apply the Liouville inequality to .
We have clearly . To estimate , note that the -conjugates of are the numbers with . Of them, only and are greater than in absolute value. Also, is a root of the polynomial
where is the free term of . Hence
It follows that
Together with (6.8) this implies that
Comparing this with the lower bound (6.7), we obtain with
Upper bounds for and produced by our script can be found in Table 1.
The total running time was less than 2 seconds. ∎
7 Structure of trinomial discriminants
In this section we prove Theorem 1.4. For convenience, we reproduce the statement here.
Theorem 7.1.
A trinomial discriminant must be of the form or , where are distinct odd prime numbers, . In particular, a trinomial discriminant is fundamental.
In this section denotes a trinomial discriminant; in particular, is odd and by Theorem 6.1. The proof is split into many steps which correspond to Subsection 7.1–7.6 below.
7.1 may have at most prime divisors
Assume that has distinct (odd) prime divisors . Set and . We may assume that . We have clearly for . Item 6 of Proposition 3.1 implies that both and are suitable for . Using Proposition 5.1 we obtain , which is impossible when .
.
7.2 is not a square
Assume that , with . Among the three primes there is one, call it , which does not divide . This must be suitable, because . Corollary 5.2 implies now that , which is impossible when .
7.3 If then
Assume that , where is a prime number and a positive integer. Since is not a square, must be odd. Assume that . Let be an odd prime divisor of . Then , which implies that . In addition to this, implies that . Hence is suitable for , and Corollary 5.3 implies now that
which is impossible when . Thus, and .
We are left with the case when has exactly two odd prime divisors and , with . In the sequel we write
We want to show that .
7.4 We have
Assume that . Let us show first of all that we must have . Indeed, assume the contrary: and . Writing , Proposition 3.3 implies that is suitable for . However, since , we have , which implies that the minimum is, actually, . Thus, is suitable for . Item 6 of Proposition 3.1 implies that one of the three numbers
is suitable as well. Since none of these numbers is equal to , Proposition 5.1 applies, and we obtain
(7.1) |
which is impossible when .
Thus, we have , that is, . In this case we have suitable integers
Let us show that . If then we must have
It follows that , which implies that . Furthermore, , which implies that . Hence
a contradiction.
Thus, , and Proposition 5.1 applies. If then , and we have , which is impossible. Now assume that
(7.2) |
in which case
When this implies that . When we have which again implies . Thus, in any case. From (7.2) we deduce that , and we end up with (7.1).
This shows that . Hence we proved that .
7.5 We have
Assume that . If then is suitable for by item 6 of Proposition 3.1. Proposition 3.4 implies that there must be a suitable integer distinct from and . Proposition 5.1 implies now that
which is impossible when .
Thus, . Hence , because is not a square.
Item 6 of Proposition 3.1 implies that is suitable. We want to show that there is one more suitable integer, distinct from and . If then an easy application of Proposition 3.3 implies that is suitable, so in the sequel we will assume that , that is, . Note that we must have : otherwise , a contradiction.
We consider two cases.
7.5.1 The case
7.5.2 The case
7.6 We have
The only remaining possibilities are and , and we have to dismiss the former. Thus, let us assume that . Defining
(7.3) |
Since is a discriminant, is a discriminant as well, and we have two possible cases.
Case 1: The discriminant admits a suitable integer which is not a power of
Item 2 of Proposition 3.1 implies that admits a suitable prime . Then and , which implies that and . Item 3 of Proposition 3.1 implies that is suitable for as well.
And if then from we deduce . Now Corollary 5.3 implies that , which is again impossible.
Case 2: Every integer suitable for is a power of
Item 2 of Proposition 3.1 implies that in this case the list of suitable integers for consists of consecutive powers of :
The suitable integer occurs in only one triple in , and each of the suitable integers occurs in exactly triples. Hence .
On the other hand, from we deduce that , which implies that (the largest fundamental discriminant of class number not exceeding is , see [29, Table 4 on page 936]). Hence , or, equivalently, must be suitable for . This implies, in particular, that . Hence , defined in (7.3), is equal to . This shows that is suitable for .
We claim that is suitable for as well. Indeed, since is suitable for , there exist such that and . Using we obtain
Now a routine verification shows that
This proves that is suitable for .
Thus, both and are suitable for . Since , we have . Hence . Proposition 5.1 implies that
which is impossible when .
This completes the proof of Theorem 7.1. ∎
8 Primality of suitable integers
As before, denotes a trinomial discriminant unless the contrary is stated explicitly. In particular, by Theorem 6.1, and, according to Theorem 7.1, we have or where are distinct odd prime numbers.
As we have seen in Corollary 5.3, suitable integers for trinomial discriminants are prime numbers with some rare exceptions. It turns out that there are no exceptions at all.
Proposition 8.1.
Let be a trinomial discriminant and suitable for . Then is prime and satisfies .
Remark 8.2.
Since , this implies, in particular, that .
Before proving Proposition 8.1, observe that, in the case , the primes are of the same order of magnitude up to a logarithmic factor.
Proposition 8.3.
If then
(8.1) |
Proof.
It suffices to prove the lower estimate in (8.1); the upper estimate will then follow automatically. Thus, let us assume that and prove that .
9 A conditional result
In this section we prove Theorem 1.2. Let us reproduce it here for convenience.
Theorem 9.1.
Assume GRH. Then a singular modulus of degree at least cannot be a root of a trinomial with rational coefficients. In other words, GRH implies that trinomial discriminants do not exist.
In this section, by the RH we mean the Riemann Hypothesis for the Riemann -function, and by GRH the Generalized Riemann Hypothesis for Dirichlet -functions.
Due to the results of the previous sections, Theorem 9.1 is an easy consequence of the following statement.
Proposition 9.2.
Assume GRH. Let be an integer with , and a primitive odd real Dirichlet character modulo . Then there exists a prime number such that and
(9.1) |
Recall that a real character is called odd if . Restricting to odd characters is purely opportunistic here: the same argument, with very insignificant changes, applies to even real primitive characters as well. But we apply estimate (9.1) only to real odd characters, and making this assumption allows us to shorten the proof. The assumption is of similar nature: it can be dropped, making the proof a bit more complicated, but this is unnecessary because we will apply (9.1) only to with at most prime divisors.
Remark 9.3.
Unconditionally, the bound holds for every ; this is a classical result of Linnik and Vinogradov. Unfortunately, the implied constant in this estimate depends ineffectively on , because of ineffectiveness of Siegel’s bound for the exception real zero of . In Section 10 we imitate the Linnik-Vinogradov argument in the form given in [21], but with Siegel’s Theorem replaced by Tatuzawa’s Theorem [25], obtaining this way an unconditional explicit upper bound for all but one trinomial discriminants.
Proof of Theorem 9.1 (assuming Proposition 9.2).
Let be a trinomial discriminant. Theorem 6.1 implies that . We apply Proposition 9.2 with the character , which is an odd real Dirichlet character . Moreover, it is primitive because is fundamental, see Theorem 7.1. Note also that , again by Theorem 7.1. We find (assuming GRH) a prime satisfying and , which contradicts Corollary 5.2. ∎
The proof of Proposition 9.2 is an adaptation of the argument developed by Lamzouri et al. in [17]. Their Theorem 1.4 implies, in our case, the estimate
(9.2) |
Of course, it is asymptotically much sharper than (9.1), but (9.2) is not suitable for our purposes because of the term .
9.1 Lemmas from [17]
In this subsection we recall some technical lemmas proved in [17], and give simplified versions of them. We use the notation of [17] whenever possible; our only major deviation from the set-up of [17] is that we denote the modulus by , while it is usually denoted by therein.
For and a Dirichlet character define
Here is, of course, the von Mangoldt function.
Denote by the Euler–Mascheroni constant, and define
(9.3) |
(see equation (2.2) on [17, page 2395]). The following is combination of Lemmas 2.1 and 2.4 from [17].
Lemma 9.4.
Recall (see Subsection 1.1) that means that .
We will use the following simplified version of this lemma for large .
Lemma 9.5.
The proof of this lemma is left out, being an easy calculation.
For define
The next lemma combines Lemmas 2.2 and 2.3 from [17], in the special case of odd real characters.
Lemma 9.6.
Let be an integer and be a primitive real odd Dirichlet character modulo . Assume GRH. Then for we have
(9.7) |
where
Note that we denote by the quantity from [17].
We again give a simplified version (of the lower bound only, we do not need the upper bound).
Lemma 9.7.
Proof.
Finally, the following is (a consequence of) Lemma 3.1 from [17] (which is unconditional, unlike the previous lemmas).
Lemma 9.8.
Let be an integer and be a real number. Then
9.2 Proof of Proposition 9.2
Assume the contrary: for every . Since is a real character, this implies that
where we set . Since , we have , which means that we may use estimates (9.5) and (9.8). We may also use (9.4) with replaced by , which gives
Finally, Lemma 9.8 and the assumption imply that
Combining all these estimates, we obtain
which can be re-written as
When the left-hand side does not exceed , which implies the inequality . Substituting , we obtain
This inequality is impossible when . ∎
10 Bounding all but one trinomial discriminants
In this section we prove the following theorem.
Theorem 10.1.
There exists at most one trinomial discriminant satisfying .
Call a positive integer coarse555as opposed to smooth if is either prime or a product of two distinct primes each exceeding . We deduce Theorem 10.1 from the following statement.
Theorem 10.2.
With at most one exception, every coarse integer has the following property. Let be a primitive real Dirichlet character . Then there exists a prime such that .
Theorem 10.2 will be proved in Subsection 10.2, after we establish, in Subsection 10.1, an explicit version of the Burgess estimate for coarse moduli.
10.1 Explicit Burgess for coarse moduli
Everywhere in this subsection is a positive integer and a primitive Dirichlet character . For with denote
A classical result of Burgess [8, 9] implies that . We need a version of this inequality explicit in all parameters. Such a version is available in the case of prime modulus [6, 15, 28], but we need a slightly more general version of it, for coarse moduli, as defined in the beginning of Section 10.
Theorem 10.3.
Let be a coarse integer. Then for every as above we have
Note that we did not try to optimize the numerical constant . Probably, sharper constants are possible, as the work of Booker [6] and Treviño [28] suggests.
Remark 10.4.
The following lemma is quite standard, but we did not find a suitable reference.
Lemma 10.5.
Assume that is square-free, and let be a polynomial with the following property: there exists such that
Then the sum
satisfies , where is the number of distinct roots of modulo .
Proof.
Let be the prime factorization of (recall that is square-free), and set . Our character has a unique presentation as , where each is a primitive character . Then
where ; see, for instance, equation (12.21) in [15]. Since each has a simple root modulo , the Hasse-Weil bound applies (see, for instance, [24], Theorem 2C′ on page 43). The result follows. ∎
Proof of Theorem 10.3.
Denote . We want to prove that
(10.1) |
We follow rather closely the argument from the book of Iwaniec and Kowalski [15, pages 327–329], where we set . In particular, we will use induction in .
If then (10.1) follows from the trivial estimate , and if then (10.1) follows from the Pólya-Vinogradov inequality , see [15, Theorem 12.5]. Hence we may assume in the sequel that
(10.2) |
We fix positive integers , to be specified later, such that
(10.3) |
Since is coarse,
(10.4) |
For a residue class denote by the number of presentation of as , where
and denote the inverse of modulo .
Arguing as in [15, page 327], we find
(10.5) |
where
Using Hölder’s inequality, we estimate
where
We have the following estimates:
(10.6) |
Now we are going to complete the proof, assuming them. Estimates (10.6) themselves will be proved afterwards.
Set
From the hypotheses and the inequality (10.2) we deduce that (10.3) indeed holds. Moreover, satisfies .
Since , estimate (10.1) holds true, by induction, with replaced by :
Now let us estimate . We have clearly
Furthermore, using (10.2) and , we obtain
It follows that . We obtain
Substituting all this in (10.5), we obtain
as wanted.
We are left with the estimates from (10.6). The estimate is obvious. The estimate for is Lemma 12.7 from [15, page 328]. The only difference is that in [15] is a prime number (and denoted ). However, it is only needed therein that every integer between and is co-prime with , which is true in our case, see (10.4).
Finally, let us prove the estimate for . The proof is very similar to that of [15, Lemma 12.8]. We write
Note that implies that , see (10.4). Now if a quadruple has the property that
(10.7) |
then
by Lemma 10.5 applied to the polynomial
(Here, the from Lemma 10.5 is the that is distinct from all other .) And a simple combinatorial argument shows that exactly quadruples do not satisfy (10.7). Hence
which is slightly sharper than wanted. The theorem is proved. ∎
10.2 Proof of Theorem 10.2
Set
(10.8) |
Note that is a multiplicative function.
The following statement is a version of Proposition 3.1 from Pollack [21].
Proposition 10.6.
Let and be as in Theorem 10.3 (in particular, is coarse). Then for we have
(10.9) |
(Recall that means .)
Proof.
Set , so that (10.1) can be written as
(10.10) |
Let be a real number satisfying , to be specified later, and set . Intuitively, one should think of as “large” (not much smaller than ) and “small”. As in [21], we use the “Dirichlet hyperbola formula”
Here the first double sum will give the main contribution, while the second and the third double sums will be absorbed in the error term.
Using (10.10), we estimate the last two double sums:
For the first double sum we have the expression
(10.11) |
where
We have clearly . To estimate we use partial summation. For an integer set . Using (10.10) we estimate
Hence
Combining all these estimates, we obtain
with
We set the “optimal” . Our assumption implies that indeed . We obtain
as wanted. ∎
The following lemma is the classical theorem of Tatuzawa [25].
Lemma 10.7.
Let . Then, with at most one exception, for every positive integer the following holds. Let be a primitive real character . Then .
Proof of Theorem 10.2.
We assume that is coarse and not the exceptional one from Lemma 10.7, where we set . Set . We have and . Hence, combining Proposition 10.6 and Lemma 10.7 with , we obtain
(10.12) |
On the other hand, if is such that then
(10.13) |
Now assume that for all primes . We have two cases.
If is prime, then, by (10.13) and the multiplicativity of , for we have when is a full square, and otherwise.
Now assume that is a product of two primes, being the smallest of them; in particular, . Since , we have for any in the range . It follows that when is a full square or times a full square, and otherwise.
11 The quantities , and
This section is preparatory for the “signature theorem”, proved in Section 12. As before, denotes a trinomial discriminant unless the contrary is stated explicitly. The following three quantities will play a crucial role in the sequel:
-
•
, the class number;
-
•
, the largest absolute value of a non-dominant singular modulus of discriminant ;
-
•
, the absolute norm , where is a singular modulus of discriminant ; it clearly depends only on and not on the particular choice of .
For an arbitrary (not necessarily trinomial) discriminant we have upper estimates
(for the first one see, for instance, Theorems 10.1 and 14.3 in [13, Chapter 12]). It turns out that for trinomial discriminants one can do much better.
Proposition 11.1.
Let be a trinomial discriminant. Then
(11.1) |
An immediate application is the following much sharper form of Corollary 4.2 (the “principal inequality”).
Corollary 11.2 (refined “principal inequality”).
Let be a trinomial discriminant of signature and non-dominant singular moduli of discriminant . Then
For subsequent applications we need to estimate the product . Proposition 11.1 implies the estimate
However, this is insufficient for us: we need an estimate of the shape on both sides.
Proposition 11.3.
Let be a trinomial discriminant. Then
Finally, we need an estimate for . It is known that for any discriminant , see [2, 18]. For trinomial discriminants one can say much more.
Proposition 11.4.
Let be a trinomial discriminant. Then
(11.2) |
In particular,
(11.3) |
Propositions 11.1, 11.3 and 11.4 are proved in the subsequent subsections. Since is fixed, we may omit it in the sequel and write simply , and .
11.1 Proof of Proposition 11.1
According to Proposition 8.1, all suitable integers except are prime numbers not exceeding . Each of them occurs in most triples (see the proof of Proposition 3.4). Hence . Theorem 2 in [23, page 69] states that
(11.4) |
Since , we can apply this with . We obtain
which proves the upper estimate for .
The lower bound for follows from the work of Watkins [29], which implies that a fundamental discriminant with satisfies : see Table 4 on page 936 of [29]. But , a contradiction.
Now let be a non-dominant singular modulus of discriminant and the corresponding suitable integer. Since is not dominant, we have , which implies that . Hence
which proves the upper estimate for .
As for the lower estimate , it holds true for any discriminant , not only trinomial discriminants, due to the following lemma.
Lemma 11.5.
Let be a singular modulus of discriminant (not necessarily trinomial). Then .
For the proof see [1, Corollary 5.3].
11.2 Proof of Proposition 11.3: the upper estimate
Let be all suitable integers for . Since occurs in one triple from and each in at most two triples, we have
Now note that by Proposition 5.5, and , see Remark 8.2. Hence we may use (11.4) with . We obtain
Now, using that , we obtain
as wanted.
(Note that our argument is quite loose: the numerical constant can be easily improved.)
11.3 Proof of Proposition 11.3: the lower estimate
We need a lemma. Recall (see Section 2) that denotes the standard fundamental domain, and that
Lemma 11.6.
For the following is true.
-
1.
If then .
-
2.
If then
The proof can be found in [4, Proposition 2.2].
Now we are ready to prove the lower estimate from Proposition 11.3. We consider two cases. Assume first that there exists with such that satisfies
Lemma 11.6 implies that the non-dominant singular modulus satisfies . Hence . Using Proposition 11.1, we obtain
as wanted.
Now assume that for every with the number as above satisfies . Let be the smallest real number such that
for every like this. In particular, .
Lemma 11.6 implies that
Now let us estimate . Note that for we have
Hence every suitable satisfies
which can be rewritten as
Since all such are prime and each occurs in at most triples , we have
(We have to add rather than because can accidentally be a prime number.) Using (11.4), we estimate
which implies that
We will estimate each of the terms
separately.
The function is strictly decreasing on , which implies that
To estimate the second term, note that, since , we have
Hence , and
This proves that
better than wanted.
11.4 Proof of Proposition 11.4
Let be the singular moduli of discriminant , with dominant. Then
where the second inequality follows from Corollary 11.2.
12 The signature theorem
In this section we prove Theorem 1.5. Let us reproduce the statement here.
Theorem 12.1.
Let be a trinomial discriminant of signature . Assume that . Then .
As before, in this section we assume that is a trinomial discriminant. In particular, is odd, square-free and has at most prime divisors. We also use the notation , etc. from Section 11. Throughout this section is the Hilbert Class Field of . It is an unramified abelian extension of of degree , generated over by any singular modulus of discriminant . It is also Galois over , of degree . Denoting
we have , where denotes the complex conjugation. Note that for every ; see, for instance, [12, Lemma 9.3].
We denote by the absolute norm on ; that is, for we set
If is a singular modulus of discriminant then . Indeed, , and is a subfield of of degree .
The strategy is as follows. We introduce a certain non-zero algebraic integer and estimate from above using the “principal inequality” as given in Corollary 11.2. Compared with the trivial lower estimate , this would imply the following weaker version of Theorem 12.1: when is large we have .
Using this, and applying Proposition 4.4, we obtain a non-trivial lower bound for . Comparing it with the previously obtained upper bound, we prove Theorem 12.1.
We start with some lemmas.
Lemma 12.2.
Let be a trinomial discriminant and distinct singular moduli of discriminant . Then .
Proof.
Lemma 12.3.
A trinomial may have at most real roots.
Proof.
The derivative may have at most real roots, and the result follows by the Theorem of Rolle.
Alternatively, one may use the “Descartes’ rule of signs”, which implies that the trinomial may have at most positive and at most negative roots. ∎
Since by Proposition 11.1, Lemma 12.3 implies that there must exist at least non-real singular moduli of discriminant . In particular, there exist two singular moduli such that and . (We denote by the complex conjugate of .) Thus, are distinct non-dominant singular moduli of discriminant . We set
This is a non-zero (by Lemma 12.2) real algebraic integer.
Proposition 12.4.
Let be a trinomial discriminant of signature . Then
Proof.
-
1.
For or we have . Hence in these two cases
(12.1) -
2.
For every singular modulus of discriminant there exists exactly one element such that is dominant. We claim that is then dominant as well. Indeed, since (the dominant singular modulus is real), we have , as wanted.
Now let be such that is dominant, and so is . Then there exist exactly elements such that one of is dominant: they are
For these we have the upper estimate
(12.2) -
3.
For the remaining elements none of is dominant. Hence for those we have
(12.3)
Corollary 12.5.
We have either or .
Proof.
To improve on this, we need a non-trivial lower estimate for .
Proposition 12.6.
Assume that . Then .
Proof.
For every rational prime number we want to estimate from below. Let be an -prime above and the ramification index of in . Note that
(12.4) |
because is unramified over . We denote by the -adic valuation and define . Then, clearly, for any .
Proposition 4.3 implies that the set
may consist of at most elements. In case it consists of just one element, this element is , where, as before, we use notation and . Hence for every we have
which implies that . Therefore in this case
(12.5) |
Now assume that consists of distinct elements: , with
Since , Proposition 4.4 implies that at most singular moduli of discriminant have -valuation . It follows that there exist at most elements such that one of has -valuation . For the remaining elements we have
which implies that for these we have . Therefore
(12.6) |
where the last inequality follows from .
We are now ready to prove Theorem 12.1.
References
- [1] Yuri Bilu, Bernadette Faye, and Huilin Zhu, Separating singular moduli and the primitive element problem, Q. J. Math. 71 (2020), no. 4, 1253–1280. MR 4186519
- [2] Yuri Bilu, Philipp Habegger, and Lars Kühne, No singular modulus is a unit, Int. Math. Res. Not. IMRN (2020), no. 24, 10005–10041. MR 4190395
- [3] Yuri Bilu and Florian Luca, Trinomials with given roots, Indag. Math. (N.S.) 31 (2020), no. 1, 33–42. MR 4052259
- [4] Yuri Bilu, Florian Luca, and Amalia Pizarro-Madariaga, Rational products of singular moduli, J. Number Theory 158 (2016), 397–410. MR 3393559
- [5] Yuri Bilu, David Masser, and Umberto Zannier, An effective “theorem of André” for -points on a plane curve, Math. Proc. Cambridge Philos. Soc. 154 (2013), no. 1, 145–152. MR 3002589
- [6] Andrew R. Booker, Quadratic class numbers and character sums, Math. Comp. 75 (2006), no. 255, 1481–1492. MR 2219039
- [7] Matteo Bordignon, Partial Gaussian sums and the Pólya–Vinogradov inequality for primitive characters, arXiv:2001.05114 (2020).
- [8] D. A. Burgess, On character sums and -series, Proc. London Math. Soc. (3) 12 (1962), 193–206. MR 0132733
- [9] , On character sums and -series. II, Proc. London Math. Soc. (3) 13 (1963), 524–536. MR 0148626
- [10] J. W. S. Cassels, Local Fields, London Mathematical Society Student Texts, vol. 3, Cambridge University Press, Cambridge, 1986. MR 861410
- [11] Yong-Gao Chen, On the Siegel-Tatuzawa-Hoffstein theorem, Acta Arith. 130 (2007), no. 4, 361–367. MR 2365711
- [12] David A. Cox, Primes of the Form , second ed., Pure and Applied Mathematics (Hoboken), John Wiley & Sons, Inc., Hoboken, NJ, 2013, Fermat, class field theory, and complex multiplication. MR 3236783
- [13] Loo Keng Hua, Introduction to Number Theory, Springer-Verlag, Berlin-New York, 1982, Translated from the Chinese by Peter Shiu. MR 665428
- [14] Kenneth Ireland and Michael Rosen, A classical introduction to modern number theory, second ed., Graduate Texts in Mathematics, vol. 84, Springer-Verlag, New York, 1990. MR 1070716
- [15] Henryk Iwaniec and Emmanuel Kowalski, Analytic Number Theory, American Mathematical Society Colloquium Publications, vol. 53, American Mathematical Society, Providence, RI, 2004. MR 2061214
- [16] Niraek Jain-Sharma, Tanmay Khale, and Mengzhen Liu, Explicit burgess bound for composite moduli, Int. J. Number Theory, to appear, arXiv:2010.09530.
- [17] Youness Lamzouri, Xiannan Li, and Kannan Soundararajan, Conditional bounds for the least quadratic non-residue and related problems, Math. Comp. 84 (2015), no. 295, 2391–2412. MR 3356031
- [18] Yingkun Li, Singular units and isogenies between CM elliptic curves, Compos. Math. 157 (2021), no. 5, 1022–1035. MR 4251608
- [19] Florian Luca and Antonin Riffaut, Linear independence of powers of singular moduli of degree three, Bull. Aust. Math. Soc. 99 (2019), no. 1, 42–50. MR 3896878
- [20] Edouard Lucas, Theorie des Fonctions Numeriques Simplement Periodiques, Amer. J. Math. 1 (1878), no. 4, 289–321. MR 1505176
- [21] Paul Pollack, Bounds for the first several prime character nonresidues, Proc. Amer. Math. Soc. 145 (2017), no. 7, 2815–2826. MR 3637932
- [22] Antonin Riffaut, Equations with powers of singular moduli, Int. J. Number Theory 15 (2019), no. 3, 445–468. MR 3925747
- [23] J. Barkley Rosser and Lowell Schoenfeld, Approximate formulas for some functions of prime numbers, Illinois J. Math. 6 (1962), 64–94. MR 0137689
- [24] Wolfgang M. Schmidt, Equations over Finite Fields. An Elementary Approach, Lecture Notes in Mathematics, Vol. 536, Springer-Verlag, Berlin-New York, 1976. MR 0429733
- [25] Tikao Tatuzawa, On a theorem of Siegel, Jap. J. Math. 21 (1951), 163–178. MR 0051262
- [26] The PARI Group, PARI/GP (Version 2.11.0), 2018, Bordeaux, http://pari.math.u-bordeaux.fr/.
- [27] The Sage Developers, Sagemath, the Sage Mathematics Software System (Version 8.6), 2019, https://www.sagemath.org.
- [28] Enrique Treviño, The Burgess inequality and the least th power non-residue, Int. J. Number Theory 11 (2015), no. 5, 1653–1678. MR 3376232
- [29] Mark Watkins, Class numbers of imaginary quadratic fields, Math. Comp. 73 (2004), no. 246, 907–938. MR 2031415