Solving -adic polynomial equations using Jarratt’s Method
Abstract.
We implement an iterative numerical method to solve polynomial equations in the -adic numbers, where . This method is a simplified -adic analogue of Jarratt’s method for finding roots of functions over the real numbers. We establish that our method has a higher order of convergence than J.F.T. Rabago’s [9] -adic version of Olver’s method from 2016. Moreover, we weaken the initial conditions in Rabago’s method, which allows us to start the iteration with a multiple root of the congruence .
Key words and phrases:
Jarratt’s method, p-adic polynomials, Thurston’s Method, Hensel’s lemma2020 Mathematics Subject Classification:
11D88,11C08,65H041. Introduction
Solving polynomial equations is one of the oldest problems in mathematics which dates back to the Babylonians. There are several analytic, algebraic and geometric approaches for finding roots of polynomials. In this paper we discuss an iterative method to approximate solutions of polynomial equations of the form in the -adic numbers, where .
Well-known numerical methods to approximate roots of functions on the real numbers are Newton’s Method, Olver’s Method [8], Halley’s method [3] and others. In this paper, we look at a method developed by P. Jarratt in 1966 [5]. This method has a higher order of convergence than all of the afore-mentioned methods. We set up a -adic analogue of Jarratt’s method. Analogues of Olver’s and Halley’s method in the -adic setting have previously been established by J.F.T. Rabago [9]. Apart from improving the order of convergence, we also weaken the initial conditions in Rabago’s work. Below is a definition of the term “order of convergence” for general Banach spaces, including the complete valued fields and .
Definition 1.1.
Let be a Banach space with norm . Assume that is a sequence in converging to . Then we say that converges with order if
is a bounded sequence in .
Before turning to Jarret’s method, let us briefly review Newton’s and Olver’s methods in the context of -adic numbers. Suppose is a prime and let be a polynomial. Throughout the sequel, we assume the leading coefficient of to be a unit in . Then all solutions of in lie in . Assume that such that but . Newton’s method uses the iteration
(1.1) |
which yields a sequence in which converges to a simple root of in . In the case when , this provides a convenient proof of Hensel’s lemma which, for all , asserts that a solution of the congruence satisfying can be uniquely lifted to a root of the congruence . In fact, one may replace the condition by the weaker condition
(1.2) |
which leads to a generalized version of Hensel’s lemma (see [6]). The order of convergence of Newton’s method is 2.
Olver’s method, an iteration with higher order of convergence, was carried over to -adic numbers by J.F.T. Rabago in 2016 [9]. He proved that if and if satisfies and , then the sequence defined iteratively as
(1.3) |
converges to a simple root of in . The order of convergence of this method is 3 for and 4 for .
Our goal is to implement a -adic analogue of a simplified version of Jarratt’s method which has order of convergence 4. We will also weaken the initial condition , replacing it by the condition (1.2) above. This allows us to begin with an which possibly is a multiple root of the congruence . At the end of this article we will briefly discuss Thurston’s method which is useful when the initial condition (1.2) is not satisfied. In a nutshell, Thurston’s method either tells us after finitely many steps that a Hensel lifting of a given root of the congruence to a solution of in does not exist, or it produces a new polynomial in place of which meets the above-mentioned initial conditions. In this case we can perform Jarratt’s method for , which in turn gives us an approximation to a root of in .
2. Simplified Jarratt Method for functions on
Jarratt’s original method is an iterative numerical method for solving equations , where is a real-valued function defined on intervals in . The iteration is defined as
(2.1) |
Precise initial conditions on for a generalized version of this method on Banach spaces were worked out in [1]. The order of convergence of this method is 4, as established in [5].
Instead of using the original method, we here use a simplified version. The idea is as follows. Using a Taylor approximation (if existent), we find
Plugging the right-hand side above into (2.1) and neglecting the error, we arrive at the simpler iteration
(2.2) |
We will refer to this iteration as Simplified Jarratt Method (SJM) in the rest of this article. Below we will prove that SJM has order of convergence 4 over the real numbers. If , this result carries over to , as we will see later.
Theorem 2.1.
Assume is an open interval in and is a five times differentiable function. Assume that satisfies and . Then there exists such that if , the sequence defined by (2.2) converges to with order . Moreover,
(2.3) |
Proof.
Let . Taylor’s theorem provides us with the approximations
(2.4) |
where
Dividing the first two equations in (2.4), we get
(2.5) |
after a short calculation. We further write (2.2) in the form
(2.6) |
Combining (2.4) and (2.5), we approximate the denominator in the last term on the right-hand side of (2.6) by
(2.7) |
From the first line in (2.4) and (2.7), we deduce that
(2.8) |
which may be calculated using a suitable computer algebra system. Now plugging (2.5) and (2.8) into (2.6) and simplifying, we arrive at
and hence
(2.9) |
with
(2.10) |
Therefore, if is sufficiently close to , then the sequence converges to with order 4, and we have
which implies (2.3). This completes the proof. ∎
3. -adic analogue of SJM
Now we apply SJM to polynomials in subject to certain initial conditions which we work out in detail. We shall prove the following theorem, which is our main result.
Theorem 3.1.
Let be a prime and be a polynomial with leading coefficent a unit in . Suppose that satisfies the condition
(3.1) |
(In particular, and and thus .) Then the sequence defined iteratively as
converges to a simple root of with order of convergence 4.
As noted in the introduction, in [9], a -adic analogue of Olver’s method was worked out. This method converges with order 3 and was formulated in [9] under the stronger initial condition that and , which means that is a simple root of the congruence . In contrast, our condition (3.1) allows us to take as a multiple root of the said congruence. We point out that (3.1) matches the condition in the following theorem which is a version of the Generalized Hensel Lemma (see [6]).
Theorem 3.2 (Generalised Hensel Lemma).
Let and satisfying . Then there is a unique such that and . Moreover, .
This shows that a root of the congruence can be lifted uniquely to a simple root of in , provided that . Hence, if satisfies this condition, then the SJM iteration gives a sequence which converges to precisely this unique lifting of .
4. Preliminaries
For the proof of Theorem 3.1, we will use the lemma below.
Lemma 4.1.
Let be a prime, and such that . Suppose that
(4.1) |
Define
where
Then we have
5. Proof of the main result
Now we turn to the
Proof of Theorem 3.1.
We set
and recall our assumption . We shall proceed similarly as in the proof of the Generalized Hensel Lemma in [2] and aim to show the following statements by induction over :
-
(1)
,
-
(2)
,
-
(3)
.
For , these conditions are clearly true. So let us assume the conditions to be true for . Now for , we have
(5.1) |
where . From Lemma 4.1 we know that
(5.2) |
where we use statement (3) together with our assumption above. Further, using (2), we deduce that , and from (3), we then get
(5.3) |
Finally, from (1), we know that and hence we have by appealing to (5.1). Thus, we have established (1) for in place of . To prove (2) for , we use the fact that if , then
(5.4) |
for some polynomial and hence
for any . Applying this to , we have
where we use (5.2) and (3). Now if , then we have
(5.5) |
Hence we arrive at a contradiction. So .
To prove (3) for , we use Taylor’s formula and recall that , obtaining
(5.6) |
for some .
(5.7) |
where
and
with , , , . The above expressions may be calculated conveniently using a suitable computer algebra system.
We note that by (2) and by (2) and (3). It follows that and , and hence
where for the second inequality above, we use (5.2) again, and for the last inequality, we use (2) and (3). Hence we are done with the induction.
Using (5.3) and , it is clear that is a Cauchy sequence in and hence converges to some -adic integer . Further, from (2) and (3), we get and . Finally, using arguments parallel to those in the proof of (2.9), we see that
where and as defined in (2.10). Thus we get
(5.8) |
and hence, converges with order 4. ∎
6. Brief concluding remarks
Two questions remain to be answered.
(A) Can every zero in of be approximated using SJM, provided we choose suitably?
(B) What happens if the initial condition (3.1) on in Theorem 3.1 is not satisfied? Is there a method which produces a suitable satisfying this condition?
Regarding question (A), we note that SJM only approximates simple roots of . If is a simple root, then clearly there is satisfying the initial condition in Theorem 3.1, giving rise to a sequence converging to : In particular, we may just take . If has multiple roots, then it is easy to produce a polynomial whose roots are all simple and coincide with the roots of : Take , where is a greatest common divisor (unique up to multiplication by a unit in ) of and in . Such a greatest common divisor can be conveniently calculated using the Euclidean algorithm in .
To address question (B), we may employ Thurston’s method. Here we only describe briefly how this method works. For the details, we refer the interested reader to [13]. Let
(6.1) |
be a root of in . In particular, . Let
Now Thurston’s method produces a chain of polynomials , starting from , where satisfies the equation . In particular, . Hence, we may proceed as follows to solve in : First, we find such that . Then we produce and find such that , if existent. This may not be unique. Then we produce and find (a not necessarily unique) such that , if existent. We continue this process. If we can continue this process ad infinitum, this gives rise to a sequence which converges to a root in . Generally, this sequence converges only with order 1, though.
Thurston proved that if has only simple roots in , then for every satisfying , the above process either stops at some point in which case cannot be lifted to a root of in , or, after finitely many steps, we get an index for which there exists with and . In this case, we are in the situation of Hensel’s Lemma, and therefore lifts uniquely to a root of . To apply SJM, we only need the weaker condition from the Generalized Hensel Lemma. Therefore, if has only simple roots in , then we can get all roots of in as follows: We apply Thurston’s method initially, starting with an satisfying and, as soon as we reach the said index , apply the faster SJM with and in place of . This gives rise to a sequence which converges to a root of . From this, we retrieve a root
of .
One last brief remark on practical applications of SJM. This method produces a sequence whose elements lie in . For practical applications, however, only sequences in are useful since we cannot encode an infinite -adic representation using a computer. Therefore, to implement SJM in practice, one needs to cut off the -adic representation of at an appropriate point, getting an integer . Then one calculates (and hence ) using in place of . This will give a sequence of integers converging to our root of .
References
- [1] I.K. Argyros, Dong Chen, and Qingshan Qian. The jarratt method in banach space setting. Journal of Computational and Applied Mathematics, 51(1):103–106, May 1994.
- [2] Keith Conrad. Hensel’s lemma. Hensel’s Lemma by K.Conrad.
- [3] Walter Gander. On halley’s iteration method. The American Mathematical Monthly, 92(2):131–134, 1985.
- [4] Fernando Q. Gouvêa. p-adic Numbers. Springer International Publishing, 2020.
- [5] P. Jarratt. Some fourth order multipoint iterative methods for solving equations. Mathematics of Computation, 20(95):434–437, 1966.
- [6] Svetlana Katok. p-adic analysis compared with real. American Mathematical Society Mathematics Advanced Study Semesters, Providence, R.I. University Park, PA, 2007.
- [7] Neal Koblitz. p-adic Numbers, p-adic Analysis, and Zeta-Functions. Springer New York, 1984.
- [8] F. W. J. Olver. The evaluation of zeros of high-degree polynomials. Philosophical Transactions of the Royal Society of London. Series A, Mathematical and Physical Sciences, 244(885):385–415, 1952.
- [9] Julius Fergy Rabago. Olver’s method for solving roots of p-adic polynomial equations. Italian Journal of Pure and Applied Mathematics, 36:739, 08 2016.
- [10] B. Surender Reddy. On equivalence of p-adic 2-norms in p-adic linear 2-normed spaces. 2010.
- [11] Alain M. Robert. A Course in p-adic Analysis. Springer New York, 2000.
- [12] W. H. Schikhof. Ultrametric Calculus: An Introduction to p-Adic Analysis. Cambridge Studies in Advanced Mathematics. Cambridge University Press, 1985.
- [13] H. S. Thurston. The solution of p-adic equations. The American Mathematical Monthly, 50(3):142–148, 1943.
- [14] Axel G. R. Turnquist. p-adic numbers and solving p-adic equations. 2012.
- [15] Xiuhua Wang, Jisheng Kou, and Yitian Li. Modified jarratt method with sixth-order convergence. Applied Mathematics Letters, 22(12):1798–1802, 2009.