Removal of 5 Terms from a Degree 21 Polynomial
Abstract
In 1683 Tschirnhaus claimed to have developed an algebraic method to determine the roots of any degree polynomial. His argument was flawed, but it spurred a great deal of work by mathematicians including Bring, Jerrard, Hamilton, Sylvester, and Hilbert. Many of the problems they considered can be framed in terms of the geometric notion of resolvent degree, introduced by Farb and Wolfson. Roughly speaking, we have if a general degree polynomial can be put into an -parameter form. In the present note we discuss a method introduced by Sylvester for solving systems of polynomial equations, and apply it to finding bounds on resolvent degree. In particular, we prove the bound . This bound has been independently established by Sutherland using the classical theory of polarity.
1 Introduction
Consider a degree polynomial with roots :
A Tschirnhaus transformation is a polynomial transformation of the roots of . That is, given a polynomial
and setting
we can form a new polynomial with roots :
Tschirnhaus’s original idea was to transform to a solvable form – specifically, to choose such that – as part of a proposed method for determining in radicals the roots of . This method is not successful because in general it is not possible to determine in radicals the coefficients of the necessary . On the other hand, a number of interesting results have since been proved involving the use of Tschirnhaus transformations to reduce families of polynomials to certain canonical forms; in particular, the problem of finding such that has the -parameter form
with has been widely considered. We provide a partial review of the historical development of this theory in Section 2.
Recently progress has been made by casting the problem in terms of the geometric notion of resolvent degree, as introduced by Farb and Wolfson.[7] We review the precise definitions in Section 4, but roughly speaking the idea is as follows. We say a generically finite dominant map has essential dimension at most if there is a pullback square
with . We then say has resolvent degree at most if it factors as a composition of maps each of essential dimension at most . (This is somewhat stronger than what is required in the precise definition; for instance, it is sufficient for to be birationally equivalent to such a composition.)
In particular, we are interested in the resolvent degree of the root cover of the family of degree polynomials. That is, let be the space of monic degree polynomials, and the space of pairs of polynomials with a choice of root . Then there is a generically finite dominant map by “forgetting the root”, and we are interested in bounds on
Intuitively, we have if the roots of any can be determined by solving algebraic functions of at most variables. For example, for , the quadratic formula implies that the root cover is a pullback of the map
Similarly, the existence of “formulas in radicals” for and implies that and factor into towers of pullbacks of cyclic self-covers of , so that .
Many classical results concerning Tschirnhaus transformations can be translated into statements about resolvent degree: if any can be reduced to an -parameter form by means of a Tschirnhaus transformation , we have
provided that can itself always be determined by solving algebraic functions of at most variables. For example, a result of Bring shows that any degree polynomial can be put into the one-parameter form .[4] This is sufficient to imply ; in Section 4 we treat this example in detail. The bounds so-obtained are generally not sharp, as the improvements obtained by Wolfson and Sutherland have demonstrated.[19, 15]
On the other hand, this formulation of the problem implies different constraints on which Tschirnhaus transformations are permissible than were generally assumed by classical authors. For example, in considering the problem of transforming a general degree polynomial into an -parameter form by means of a Tschirnhaus transformation , Sylvester only considers those transformations whose coefficients can be determined without solving any equation of degree higher than . This is much more restrictive than necessary if one’s goal is to show
For example, for , Sylvester obtains the following:
Proposition 1 (Sylvester).
For , a general polynomial of degree can be put into an -parameter form by means of a Tschirnhaus transformation whose coefficients can be determined without solving any equation of degree higher than 5.
As a corollary, one obtains the resolvent degree bound for ; on the other hand, finding the necessary Tschirnhaus transformation is a resolvent degree problem, since only equations of degree at most 5 are involved. Sylvester’s was nonetheless the best known bound for prior to Wolfson’s 2020 proof that for . [19] In Wolfson’s argument, finding the necessary Tschirnhaus transformation is shown to be at worst a resolvent degree 35 problem. It has been conjectured by Wiman and (separately) Chebotarev that for , though their proposed arguments contain gaps. [18, 3] Recently, Sutherland has provided a rigorous proof of this bound using the classical theory of polarity.[15]
In the current note we use ideas from Sylvester to provide an alternative proof of the for bound. In particular, we show
Theorem (Main Theorem).
For , a general polynomial of degree can be put into an -parameter form by means of a Tschirnhaus transformation whose coefficients can be determined without solving an equation of degree higher than 20.
This implies for .
We now give an outline of the remaining sections of the paper. In Section 2 we give a brief historical overview of the classical theory of Tschirnhaus transformations and some of the important results therein. In Section 3 we turn to Sylvester’s work in particular and give a description of his “method of obliteration”: a technique for finding solutions of systems of polynomial equations when the number of variables is large relative to the number of equations, which lies at the heart of his work on Tschirnhaus transformations. In Section 4 we discuss resolvent degree and consider the case in detail as an illustration of the relation between resolvent degree and the classical theory of Tschirnhaus transformations. Section 5 describes a key ingredient of our main result: the use of the “method of obliteration” to determine linear subspaces of hypersurfaces. Finally, in Section 6 we give the proof of our main theorem.
2 Some Background on Tschirnhaus Transformations
2.1 Preliminaries
Let be a polynomial over a (not necessarily closed) field . Then we have
for some coefficients and roots . Fixing some algebraic extension of , a Tschirnhaus transformation is a polynomial ,
which we apply to the roots of to obtain a new polynomial
Note that the coefficients are symmetric functions of the transformed roots , and hence are symmetric in . The coefficients generate the algebra of symmetric polynomials in , so we can determine the polynomially in terms of and . In particular, this implies . The precise constraints on the field from which the coefficients of may be taken vary from author to author but generally one considers towers of extensions of bounded degree. Tschirnhaus’s original goal was to find a formula in radicals for the roots of . More precisely, taking to be the solvable closure of , his strategy was to determine a Tschirnhaus transformation which transforms into the solvable form . If is a root of , we can find the corresponding root(s) of (i.e., those such that ) as the roots of
If there are roots of which map to , this will be a degree polynomial. In particular, if has distinct roots, then the roots of can be recovered rationally from the roots of . Thus if is solvable and has coefficients in a solvable extension of , then will in general be solvable. (And at worst recovering the roots of will require the solution of an equation of degree .) This strategy fails because it is not generally possible to determine solvably such that all the intermediate terms of vanish. On the other hand, there are weaker versions of Tschirnhaus’s problem which are tractable. More precisely, we can ask: what conditions on and are sufficient for there to exist in transforming the degree polynomial to a polynomial with ? That is, we consider the problem of setting some (rather than all) of the intermediate coefficients to zero (thus reducing the number of parameters of the family of polynomials), and we relax the requirement that necessarily be a solvable extension.
We now consider in more detail what is necessary to determine such that . As we remarked above, is a polynomial function of and . Treating the ’s as unknowns to be determined, we further observe that is homogeneous of degree in the variables . For example,
we can ensure as long as the coefficients of are chosen to satisfy a single homogeneous linear equation. (The coefficients of this equation are symmetric in the , so again can be expressed in terms of the coefficients of ; it is not necessary to know the roots .) More generally, vanishes if and only if the th symmetric function of vanishes, so to find a Tschirnhaus transformation such that in the transformed polynomial we need to solve a degree polynomial in whose coefficients are polynomials in . Thus, to determine we must find a point in on the intersection of the hypersurfaces , which are of degree , respectively.
2.2 Some Results on Tschirnhaus Transformations from 1683 to Present
Tschirnhaus, in his 1683 paper introducing these transformations, claimed to be able to solve any degree polynomial by removing all intermediate terms.[17] That is, he claimed one could find a transformation such that
and the transformed polynomial has the solvable form
The problem with this strategy is that finding the necessary requires the solution of a system of polynomial equations of degrees . In general this leads to an equation of degree , so for finding is a priori more complicated than finding the roots of the original degree polynomial. Tschirnhaus did show explicitly how to find in the case, and his proof readily generalizes to removing the first two intermediate terms from any degree polynomial.
Finding a Tschirnhaus transformation yielding requires solving a system of equations in variables, of degree . In the worst case this leads to an equation of degree , but when the system is sufficiently undetermined (i.e., much larger than ), a solution can be found without solving any polynomial of degree greater than . The first result of this kind was given in 1786 by Bring, who showed that when , we can find such that without solving any equation of degree higher than three.[4]
In 1834, Jerrard recovered independently Bring’s result that removal of three terms from a degree 5 polynomial was possible by means of a Tschirnhaus transformation.[10] In fact, Jerrard went on to claim that his methods could yield a reduction of a general quintic to a solvable form.[11] (Jerrard was aware of, but did not accept, Abel’s 1824 work on the insolvability of the quintic.)
Shortly thereafter Hamilton was commissioned by the British Association for the Advancement of Science to investigate the validity of Jerrard’s methods, issuing his report in 1837.[8] He showed that Jerrard’s reductions were in many cases “illusory”. Jerrard allowed Tschirnhaus transformations of degree potentially as high or higher than the original polynomial , and Hamilton demonstrated that in the “illusory” cases the transformation found by Jerrard was a multiple of . In this case the transformed polynomial is always (the same as if the “transformation” were permitted) for any degree polynomial , so there is no hope of determining the roots of by studying . On the other hand, Hamilton showed that Jerrard’s methods can be a made to work –and the “illusory” cases avoided – in higher degrees. More precisely, for any fixed , one can find a (nonzero, degree ) Tschirnhaus transformation yielding without solving any equation of degree higher than , provided the degree of the original polynomial is large enough relative to . In particular, for , Hamilton showed suffices, and for , .
In 1886, Sylvester published a geometric explanation of the Hamilton/Jerrard method, sharpened some of the bounds (finding, in particular, suffices for , and for ).[16] Sylvester claimed that these sharpened bounds are optimal if no “elevation of degree” is allowed – that is, if no equations of degree higher than are to be solved. This claim requires careful interpretation – Sylvester does not consider the possibility of higher degree polynomials arising but factoring into lower degree terms. This is known to happen for : if one attempts to remove three terms from a degree polynomial, the system of equations of degree 1, 2, and 3 leads in general to an equation for degree 6. Sylvester’s method is able to avoid this elevation of degree only when . For , Sylvester’s method is inapplicable, but Lagrange had shown (in 1771, more than 100 years prior) that the degree 6 equation that arises in this case always factors into a pair of cubics, so that the necessary Tschirnhaus transformation can in fact be determined without solving any equation of degree higher than 3.[12]
Further reductions have been achieved, but all loosen or ignore the “elevation of degree” restriction in one way or another. Viewing the degree 9 polynomial
as a multi-valued “algebraic function” sending to the set of roots of , Hilbert asked whether this could be written as a composition of algebraic functions of at most 4 variables.[9] As part of his investigation, Hilbert sketched a method for finding a Tschirnhaus transformation such that when . Hilbert’s method requires finding a line on a cubic surface. This in turn requires the solution of an equation of degree 27. However, by putting the equation of the cubic surface into a four-parameter canonical form (the “pentahedral form”, originally suggested by Sylvester), Hilbert was able to show that the coefficients of the necessary line were themselves algebraic functions of four variables, so this was sufficient for his purposes.
In 1945, B. Segre strengthened Hilbert’s result, describing an algorithm for finding a Tschirnhaus transformation removing 4 terms from a degree polynomial, without solving any equation of degree higher than 5.[13] The same result was later proven by Dixmier using different methods.[6] Also building on Hilbert’s work were Wiman, who sketched an alternative proof of the bound for , and G. Chebotarev, who extended Wiman’s ideas to sketch a proof that for .[3, 18, 14] In 1975, Brauer showed that, when , a degree polynomial can be written as a composition of algebraic functions of variables; in this framing of the problem it is permissible to simply solve the degree to find a Tschirnhaus transformation removing terms, since this will be an algebraic function of fewer variables than the original degree polynomial.[1] For this bound is lower than the corresponding bounds for removal of terms found by Sylvester, though it should again be emphasized that this ignores the “no elevation of degree“ constraint which is explicit in Sylvester, and implicit in pre-Sylvester work on the problem.
In 2020, Wolfson described a function such that a degree polynomial can be put in an -parameter form whenever , and which improved significantly on Brauer’s bounds.[19] Wolfson frames the problem in terms of the theory resolvent degree, which allows bounds on the number of necessary parameters for a family of algebraic functions to be computed based on the dimensions of certain spaces. For , Wolfson shows , which also improves on Sylvester’s bound of . To find the Tschirnhaus transformation accomplishing the reduction in this case requires, among other things, finding a 2-plane inside of a cubic hypersurface in . Informally, since the dimension of the moduli space of cubic hypersurfaces in is 35, finding a 2-plane inside such a hypersurface is at worst a 35-parameter problem, so is permissible when putting a degree 41 polynomial into 35-parameter form. This approach does not produce the degrees of the equations which must be solved to find the Tschirnhaus transformation (which may be higher than , as in Hilbert’s proof that suffices for , where it was necessary to solve a degree 27 equation to find a line on a cubic surface).
Most recently, Sutherland used a combination of ideas from the classical theory of polarity together with optimizations to the moduli-dimension-counting method to improve on the bound for and all .[15] In particular, for , Sutherland shows that suffices, giving the first rigorous proof of the bound first claimed by Chebotarev.
In the current note, we show that the result can be recovered from Sylvester’s method if partial elevation of degree is allowed. More precisely, one can find a Tschirnhaus transformation removing 5 terms (i.e., such that the coefficients of the transformed polynomial satisfy ) from a degree polynomial whenever , by solving no equation of degree higher than 20. A key ingredient is that there exists an explicit algorithm to find a 2-plane on a cubic hypersurface in , without solving any equation of degree higher than 5.
3 Sylvester’s Obliteration Formula
Though motivated by the problem of finding Tschirnhaus transformations, the procedure Sylvester describes in [16] is remarkably general, allowing for a solution to be found to any sufficiently underdetermined system, without elevation of degree. More precisely, given a system of equations of polynomial equations in variables of degree at most , with the number of equations of degree , Sylvester shows there is a function such that, if , then a solution to can be found without solving any equation of degree greater than .
The method is as follows: Pick any of maximal degree from the system . We can find a solution to by first finding a line contained in the solution set of the subsystem , then intersecting this line with the vanishing locus of . To find , first find a solution to . Then to get the required line it suffices to find a point such that is a solution to for all . For any equation , then, view as a polynomial in ; if , the coefficients of must vanish identically. In fact the coefficient of is just , so vanishes since we chose to be a solution of . The vanishing of the remaining coefficients imposes polynomial conditions on the of degrees . Ranging over all , we see that must be a solution to a system with equations of degree , where
Now to find this needed point solution, we proceed inductively: again hold aside some polynomial of highest degree (say ), and find a linear solution to the subsystem , then intersect that line with , and so on. The number of equations of maximal degree decreases by 1 with each step, so eventually we are left with a (possibly very large) system of linear equations; a line in the solution set to this system can be found provided the number of variables is (at least) one greater than the number of equations. Then to backtrack to a linear solution to the original system requires only for equations of degree to be solved one at a time.
The core reduction is succinctly summarized by Sylvester’s formula of obliteration, which we will refer to repeatedly:
Proposition 2 (Sylvester).
Given let
denote the minimum number of variables such that a linear solution to any system of equations with exactly equations of degree can be found without elevation of degree. Then
where
Note that applying the obliteration formula times yields a system with no equations of degree , so all equations of the highest degree can be removed. This can be repeated until only (a large number of) linear equations remain, in which case the minimum number of variables needed is easy to determine. For example, consider the problem of finding a line on a cubic hypersurface. We have a system of equations with 1 equation of degree 3, 0 equations of degree 2, and 0 of degree 1. By repeated application of the obliteration formula,
so it is possible to find a line solvably (in fact, by solving no equation of degree greater than 3) on a cubic hypersurface in .
4 Resolvent Degree
4.1 Definitions
Let be an algebraically closed field and let be a generically finite dominant map of -varieties. Following Buhler and Reichstein [2], we define the essential dimension of this map, , to be the minimal such that there exists a pullback square
with . The idea of resolvent degree is to extend this notion to allow towers of pullbacks; essentially, a map is of resolvent degree at most if it can be written as a composition of maps each of essential dimension at most . More precisely, following Farb and Wolfson [7], the resolvent degree, , is defined to be the minimal such that there exists a tower
where is an open subset of , factors through a dominant map , and for each . It follows from these definitions that
It is convenient, in proving results on resolvent degree, to construct towers of maps of bounded resolvent degree (rather than essential dimension). Lemma 2.7 in Farb and Wolfson [7] shows that this is sufficient.
The connection to polynomials and their roots is as follows. Let denote the space of monic degree polynomials with coefficients in and let
the space of monic polynomials together with a choice of root. We then define
where the map is “forgetting the root”. captures the complexity of the root cover in the sense that if there is a formula in terms of algebraic functions of at most variables for finding the roots of a degree polynomial in terms of its coefficients, then .
4.2 Resolvent Degree of a Dominant Map
It is necessary to extend the definition of resolvent degree to dominant maps of -varieties which are not necessarily generically finite. Although we are principally interested in , which is generically finite, a number of maps which arise naturally in the study of Tschirnhaus transformations are not. For example, a key step in Bring’s proof that a general quintic can be reduced to a one-parameter form involves finding a line on a quadric surface in . Thus we would like to study
where is the parameter space of quadric surfaces in and is the space of such quadrics together with a choice of line on the surface, and the map is “forgetting the line”. Since any quadric surface contains an infinite family of lines, this map is not generically finite. We would like to extend the notion of resolvent degree so that captures the complexity of finding lines on quadric surfaces in the same way that captures the complexity of finding roots.
This is done in [19] in the following way: given a dominant map , define the resolvent degree to be the minimum for which there exists a dense collection of subvarieties with
a generically finite dominant map and for all (Such a subvariety is called a rational multi-section for .)
Requiring the collection of multisections to be dense in is necessary to ensure that resolvent degree behaves well with respect to composition of dominant maps: given and dominant, for the composition we have by [19, Lemma 4.10]. Having multi-sections for and for is not generally enough to product a multisection for (the range of could entirely miss ). On the other hand, if is surjective, then we do obtain a multi-section of .
Returning to the example of lines on quadrics, an algorithm for determining a line (or finite set of lines) on each quadric surface in terms of algebraic functions of degree gives a multi-section with , but this is somewhat less than what is required to show that . On the other hand, as we shall see in the next section, determining any one line is sufficient to yield a Tschirnhaus transformation reducing the quintic to a one-parameter form, so that .
4.3 Resolvent Degree of the Quintic
As an illustrative example, we now give a detailed proof that (as observed in [7]) Bring’s work on Tschirnhaus transformations of quintic polynomials implies . (Recall that by definition, , where is the parameter space of monic degree 5 polynomials and is the space of such polynomials together with a choice of root.)
Recall that given a polynomial in and a Tschirnhaus transformation of the form , we can set and eliminate to obtain a polynomial
where the coefficient is a degree homogeneous polynomial in , whose coefficients are integral functions of .
Our strategy will be to construct a tower of maps
such that the resolvent degree of each map is 1 and there is a dominant rational map . Informally, in there will be associated to each quintic polynomial the equation of a hyperplane determining the set of Tschirnhaus transformations such that . In , we will further have the information of a quadric contained in this hyperplane, corresponding to , and the equation of a line contained in . In we will further have a choice of Tschirnhaus transformation which lies on while also satisfying . Using this information we can construct a map from to the one-dimensional space of quintic polynomials of the form , and we will take to be the pullback of the root cover of this space, so that associates to a root of the polynomial obtained by applying to . Finally, in constructing , we recover the roots of itself from the roots of the transformed polynomial. Each step of this process is of bounded resolvent degree.
More precisely: viewing the as homogeneous coordinates on , there is associated to each polynomial a hyperplane consisting of all Tschirnhaus transformations that send to a polynomial with . Let denote the space of ordered pairs with and as just described. Then the map is birational, hence has resolvent degree 1.
Next, we consider the set of all Tschirnhaus transformations such that and in the transformed polynomial. The latter is a degree 2 homogeneous polynomial in , so to each polynomial there is associated a quadric surface whose points are Tschirnhaus transformations satisfying both conditions. This defines a map
where is the parameter space of quadric surfaces in . Letting denote the space of such surfaces together with a choice of line, we consider the map
An algorithm exists for finding a line on a quadric surface that requires solving only a single quadratic equation, so there is a rational multi-section with
(In fact there is a dense collection of such multi-sections, so , but this is stronger than we require.)
We define via the pullback square
so that . Points of are ordered tuples , where is a quintic polynomial, is the space of all Tschirnhaus transformations that send to a polynomial with , is the quadric surface whose points are Tschirnhaus transformations such that and , and is a line contained in .
Finally, to find a Tschirnhaus transformation satisfying , , and requires intersecting the cubic hypersurface defined by with the line . To find the three points of intersection requires only the solution of a cubic equation. Defining to be the space of tuples , with as above and a Tschirnhaus transformation yielding , the projection map
has resolvent degree .
Now let denote the space of quintic polynomials of the form . Given a point in , we can apply the transformation to to obtain a polynomial
Applying a change of variables and dividing by yields
so this procedure defines a map . Now, the root cover
has resolvent degree
so defining via the pullback square
we have . Points of are of the form , where and is a root of the transformed polynomial . Let be the space whose points are tuples , where is a root of . There is a map
defined by
On the other hand, given a root of the transformed polynomial, we can find the corresponding roots of by solving the (at worst degree 4) polynomial equation
so .
In all, we have constructed a tower of maps
each of which has resolvent degree 1. Since the root cover factors through the projection , , we have
5 Linear subspaces of hypersurfaces
To prove we needed to show one can always find a Tschirnhaus transformation which eliminates the first three intermediate terms of a quintic polynomial. This required finding a solution of a system of three equations of degree 1, 2, and 3, respectively. A key geometric fact which made this tractable was that any quadric surface in contains a line, and that an equation for such a line can be found by solving a quadratic equation; this allows the degree 2 equation (which determines the quadric surface) to be replaced by a degree 1 equation (which determines the line), so that to find a solution of the system requires only the solution of a cubic equation.
Similarly, in Hilbert’s proof that , a Tschirnhaus transformation removing four intermediate terms is required, so a system of equations of degrees 1, 2, 3, and 4 must be solved. Informally, Hilbert’s idea is to first find a 3-plane inside the hypersurface determined by the equation of degree 2 (which is possible as long as the ambient dimension is high enough), then to find a line on the cubic surface determined inside this 3-plane by the equation of degree 3. If this can be done, all that remains is to intersect the line with the remaining equation of degree 4 and a solution can be found.
In general, one can consider the problem of finding a -plane inside a degree hypersurface in . We can ask several questions:
-
Q1:
In terms of and , how large must the ambient dimension be to guarantee that any degree hypersurface contains a -plane?
-
Q2:
What is the “resolvent degree of finding the -plane”? That is, if is a moduli space of degree hypersurfaces in and is the space of such hypersurfaces together with a choice of -plane, what is the resolvent degree of the map
which forgets the choice of plane?
-
Q3:
When is there a constructive algorithm to determine the -plane? What are the degrees of the equations that must be solved? How large must be if no equation of degree higher than some given bound is to be permitted?
As the and examples above illustrate, the answers to these questions have implications for the problem of finding Tschirnhaus transformations, since replacing a hypersurface with a linear subspace of that hypersurface allows us to replace a degree equation with one or more equations of degree 1, reducing the total degree of the system to be solved.
An answer to Q1 is given in Debarre and Manivel [5]: for , any degree hypersurface in contains a -plane if
In this case, the map is surjective, and an upper bound on its resolvent degree is given by the dimension of the codomain:
For example,
corresponding to Hilbert’s observation that finding a line on a cubic surface requires the solution of at most an algebraic function of four variables.
These dimension-counting arguments do not provide enough information to address Q3. For this we return to Sylvester’s ideas. First, for the problem of finding a line on a degree hypersurface, repeated application of the obliteration formula suffices to compute an ambient dimension such that the desired line may be found without solving any equation of degree higher than . Sylvester’s methods can be readily adapted to the problem of finding a -plane on a degree hypersurface without solving any equation of degree higher than . Note that, when is large enough for this to be possible,
giving a bound on resolvent degree which is often sharper than what one obtains from dimension-counting alone (though at the price of requiring a larger ambient dimension than that given by Waldron’s theorem), so Sylvester’s ideas are of interest even if one is only concerned with finding bounds on resolvent degree. In the remainder of this section we look at the case in detail.
5.1 Linear subspace of cubic hypersurfaces
Let be the moduli space of smooth cubic hypersurfaces in and let be the moduli space of pairs where is a smooth cubic hypersurface and is an -plane contained in . We can then consider the “resolvent degree of finding an -plane”, i.e.,
where the map forgets the choice of plane. For example, for the problem of finding a line on a cubic surface, we have
so that finding a line on a cubic surface in requires the solution of algebraic equations of no more than 4 variables. Farb and Wolfson [7], using an argument due to Klein, show this bound can be improved to
In higher dimensions it is easier to find a line. In particular, by Sylvester’s obliteration method, as discussed in section 3, finding a line on a cubic surface in requires the solution of no equation of degree higher than 3, and so is a resolvent degree 1 problem.
Proposition 3 (Sylvester).
Given a cubic hypersurface in , we can find a line contained in by solving equations of degree no higher than 3 provided Hence
Proof.
We wish to find a linear solution to a system of equations consisting of 1 equation of degree and no equations of lower degree. By Sylvester’s obliteration formula, the ambient dimension required is
∎
Sylvester’s method also extends to finding higher-dimensional linear subspaces of hypersurfaces, when the dimension of the ambient space is large enough. For example, for cubic hypersurfaces in , finding a 2-plane can also be done solvably.
Proposition 4 (Sylvester).
Given a cubic hypersurface in , we can find a 2-plane contained in by solving equations of degree no higher than 3, provided . Hence
Proof.
Since , we can find a line contained in solvably. Let and be distinct points of such that for all . To find a plane contained in , we look for a point
such that for all . Expanding this as a polynomial in and , the coefficients of , , , , , and , must vanish identically.
This gives a system of equations in variables with 1 equation of degree 3, 2 equations of degree 2, and 3 equations of degree 1. To solve it, we look for a linear solution to the subsystem of equations of degree , then intersect this line with the remaining degree 3 equation. Using Sylvester’s formula of obliteration, we have
so we can find the needed linear solution when . Thus we can find a line on a cubic hypersurface in when . ∎
The ambient dimension can be reduced slightly if some elevation of degree is allowed. From Segre [1945, pg 295, Sec 12], it is possible to determine a line on the intersection of two given quadrics by solving equations of degree no higher than 5, provided the ambient dimension is at least 4. Thus to find a linear solution of a system with 2 equations of degree 2, and 3 equations of degree 1, an ambient dimension of 7 is sufficient. Comparing to the last paragraph in the proof above we have
Proposition 5.
Given a cubic hypersurface in , we can find a 2-plane contained in by solving equations of degree no higher than 5, provided . Hence
6 Removing 5 Terms from a Polynomial
Given a degree polynomial
we wish to find a Tschirnhaus transformation
such that after setting the transformed polynomial
satisfies . To determine the coefficients of then requires the solution of a system of equations with degrees 1, 2, 3, 4, 5, in . In general, to find a solution to such a system requires solving a polynomial of degree , but when is large enough this elevation of degree can be partially avoided.
We first informally sketch the geometry underlying Wolfson’s bound of . The general idea is that by finding linear subspaces of the hypersurfaces corresponding to the polynomials of our system, the total degree of the system can be reduced. In this case, to find the necessary Tschirnhaus transformation one first finds a -plane inside a quadric surface in (this only requires solving degree 2 equations, so is resolvent degree 1). Then, intersecting the degree 3 equation with the 6-plane yields a cubic hypersurface in . If we can then find a 2-plane inside this cubic hypersurface, then by intersecting the degree 4 and 5 equations with this plane, we are left with a system of total degree 20 to solve. In summary, we have the chain
This gives an algorithm for finding the desired Tschirnhaus transformation provided one is able to find the necessary 2-plane inside the cubic hypersurface in . Wolfson uses a dimension count to show that
and so is able to use this Tschirnhaus transformation to show whenever .
By increasing the ambient dimension in which the the cubic hypersurface lives, we can use Sylvester’s ideas to find a plane inside a cubic hypersurface in by solving only equations of degree 5 or less. (i.e., in a resolvent degree 1 way). This allows the bound for removing 5 terms from a degree polynomial to be reduced to , with simpler irrationalities involved in finding the necesary Tschirnhaus transformation.
Theorem (Main Theorem).
Let . Given a degree polynomial
we can find a Tschirnhaus transformation
such that after setting the transformed polynomial
satisfies . The coefficients of can be determined by solving equations of degree at most .
Proof.
The equations , , , , impose polynomial conditions of degree 1, 2, 3, 4, and 5, respectively, on the point to be determined. Using the degree 1 equation we can eliminate one variable. The degree 2 equation then determines a quadric hypersurface in . By the classical theory of linear subspaces of quadrics, we can find a contained in this hypersurface provided .
Next, we consider the cubic hypersurface determined by this and the degree 3 equation. By Proposition 5 of the previous section, we can find a contained in this hypersurface by solving equations of degree at worst 5. Finally, intersecting the remaining equations of degree 4 and 5 determine two curves in this , whose points of intersection are governed by an equation of degree at most 20. Each such point then satisfies all 5 polynomial conditions, by construction, and so yields a Tschirnhaus transformation of the desired form. ∎
In the same way that the reduction of the quintic to one-parameter form can be translated into the language of resolvent degree to yield , this theorem implies for .
References
- [1] Richard Brauer. On the resolvent problem. Annali di Matematica Pura ed Applicata, 102(1):45–55, 1975.
- [2] Joe Buhler and Zinovy Reichstein. On tschirnhaus transformations. In Topics in Number theory, pages 127–142. Springer, 1999.
- [3] Grigorii Nikolaevich Chebotarev. On the problem of resolvents. Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, 114(2):189–193, 1954.
- [4] Alexander Chen, Yang-Hui He, and John McKay. Erland Samuel Bring’s “transformation of algebraic equations”. arXiv preprint arXiv:1711.09253, 2017.
- [5] Olivier Debarre and Laurent Manivel. Sur la vari et e des espaces lin eaires contenus dans une intersection compl ete. Math. Ann, 312:549–574, 1998.
- [6] Jacques Dixmier. Histoire du 13e probleme de hilbert. Cahiers du séminaire d’histoire des mathématiques, 3:85–94, 1993.
- [7] Benson Farb and Jesse Wolfson. Resolvent degree, hilbert’s 13th problem and geometry. L’Enseignement Mathématique, 65(3):303–376, 2020.
- [8] Sir William Rowan Hamilton. Inquiry Into the Validity of a Method Recently Proposed by George B. Jerrard, Esq. for Transforming and Resolving Equations of Elevated Degrees Undertaken at the Request of the Association. Richard and John E. Taylor, 1836.
- [9] David Hilbert. Über die gleichung neunten grades. Mathematische Annalen, 97(1):243–250, 1927.
- [10] George Birch Jerrard. Mathematical Researches. Longman, Bristol and London, 1834.
- [11] George Birch Jerrard. Xxiv. on certain transformations connected with the finite solution of equations of the fifth degree: To the editors of the philosophical magazine and journal. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, 7(39):202–203, 1835.
- [12] Joseph Lagrange. Réflexions sur la résolution algébrique des équations. Nouveaux Mémoires de l’Académie Royale des Sciences et Belles-Lettres de Berlin, 1771.
- [13] Beniamino Segre. The algebraic equations of degrees 5, 9, 157,…, and the arithmetic upon an algebraic variety. Annals of Mathematics, pages 287–301, 1945.
- [14] Alexander J. Sutherland. GN Chebotarev’s “on the problem of resolvents”. arXiv preprint arXiv:2107.01006, 2021.
- [15] Alexander J Sutherland. Upper bounds on resolvent degree and its growth rate. arXiv preprint arXiv:2107.08139, 2021.
- [16] James Joseph Sylvester. On the so-called tschirnhausen transformation. 1887.
- [17] Ehrenfried Walther von Tschirnhaus. Methodus auferendi omnes terminos intermedios ex data aeqvatione (method of eliminating all intermediate terms from a given equation). Acta Eruditorum (1683), pages 204–207.
- [18] Anders Wiman. Über die Anwendung der Tschirnhausentransformation auf die Reduktion algebraischer Gleichungen. Almquist & Wiksells Boktr., 1927.
- [19] Jesse Wolfson. Tschirnhaus transformations after hilbert. L’Enseignement Mathématique, 66(3):489–540, 2021.