This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Removal of 5 Terms from a Degree 21 Polynomial

Curtis Heberle
Abstract

In 1683 Tschirnhaus claimed to have developed an algebraic method to determine the roots of any degree nn polynomial. His argument was flawed, but it spurred a great deal of work by mathematicians including Bring, Jerrard, Hamilton, Sylvester, and Hilbert. Many of the problems they considered can be framed in terms of the geometric notion of resolvent degree, introduced by Farb and Wolfson. Roughly speaking, we have RD(n)nkRD(n)\leq n-k if a general degree nn polynomial can be put into an (nk)(n-k)-parameter form. In the present note we discuss a method introduced by Sylvester for solving systems of polynomial equations, and apply it to finding bounds on resolvent degree. In particular, we prove the bound RD(21)15RD(21)\leq 15. This bound has been independently established by Sutherland using the classical theory of polarity.

1 Introduction

Consider a degree nn polynomial p(x)p(x) with roots x1,,xnx_{1},\ldots,x_{n}:

p(x)=xn+a1xn1++anx+an=i=1n(xxi).p(x)=x^{n}+a_{1}x^{n-1}+\ldots+a_{n}x+a_{n}=\prod_{i=1}^{n}(x-x_{i}).

A Tschirnhaus transformation is a polynomial transformation of the roots of pp. That is, given a polynomial

T(x)=bn1xn1+bn2xn2++b1x+b0,T(x)=b_{n-1}x^{n-1}+b_{n-2}x^{n-2}+\ldots+b_{1}x+b_{0},

and setting

y=T(x)y=T(x)

we can form a new polynomial with roots T(x1),,T(xn)T(x_{1}),\ldots,T(x_{n}):

q(y)=yn+A1yn1++An1y+An=i=1n(yT(xi)).q(y)=y^{n}+A_{1}y^{n-1}+\ldots+A_{n-1}y+A_{n}=\prod_{i=1}^{n}(y-T(x_{i})).

Tschirnhaus’s original idea was to transform pp to a solvable form – specifically, to choose TT such that q(y)=yn+Anq(y)=y^{n}+A_{n} – as part of a proposed method for determining in radicals the roots of pp. This method is not successful because in general it is not possible to determine in radicals the coefficients of the necessary TT. On the other hand, a number of interesting results have since been proved involving the use of Tschirnhaus transformations to reduce families of polynomials to certain canonical forms; in particular, the problem of finding TT such that qq has the (nk)(n-k)-parameter form

q(y)=yn+Ak+1ynk1++An1y+Anq(y)=y^{n}+A_{k+1}y^{n-k-1}+\ldots+A_{n-1}y+A_{n}

with A1=A2==Ak=0A_{1}=A_{2}=\ldots=A_{k}=0 has been widely considered. We provide a partial review of the historical development of this theory in Section 2.

Recently progress has been made by casting the problem in terms of the geometric notion of resolvent degree, as introduced by Farb and Wolfson.[7] We review the precise definitions in Section 4, but roughly speaking the idea is as follows. We say a generically finite dominant map XYX\to Y has essential dimension at most dd if there is a pullback square

X{X}W{W}Y{Y}Z{Z}

with dim(Z)d\dim(Z)\leq d. We then say XYX\to Y has resolvent degree at most dd if it factors as a composition of maps each of essential dimension at most dd. (This is somewhat stronger than what is required in the precise definition; for instance, it is sufficient for XYX\to Y to be birationally equivalent to such a composition.)

In particular, we are interested in the resolvent degree of the root cover of the family of degree nn polynomials. That is, let 𝒫n\mathcal{P}_{n} be the space of monic degree nn polynomials, and 𝒫n~\widetilde{\mathcal{P}_{n}} the space of pairs (p,λ)(p,\lambda) of polynomials p𝒫np\in\mathcal{P}_{n} with a choice of root λ\lambda. Then there is a generically finite dominant map 𝒫n~𝒫n\widetilde{\mathcal{P}_{n}}\to\mathcal{P}_{n} by “forgetting the root”, and we are interested in bounds on

RD(n):=RD(𝒫n~𝒫n).\mathrm{RD}(n):=\mathrm{RD}(\widetilde{\mathcal{P}_{n}}\to\mathcal{P}_{n}).

Intuitively, we have RD(n)d\mathrm{RD}(n)\leq d if the roots of any p𝒫np\in\mathcal{P}_{n} can be determined by solving algebraic functions of at most dd variables. For example, for n=2n=2, the quadratic formula implies that the root cover 𝒫2~𝒫2\widetilde{\mathcal{P}_{2}}\to\mathcal{P}_{2} is a pullback of the map

11zz2.\displaystyle\mathbb{P}^{1}\to\mathbb{P}^{1}z\mapsto z^{2}.

Similarly, the existence of “formulas in radicals” for n=3n=3 and n=4n=4 implies that 𝒫3~𝒫3\widetilde{\mathcal{P}_{3}}\to\mathcal{P}_{3} and 𝒫4~𝒫4\widetilde{\mathcal{P}_{4}}\to\mathcal{P}_{4} factor into towers of pullbacks of cyclic self-covers of 1\mathbb{P}^{1}, so that RD(3)=RD(4)=1\mathrm{RD}(3)=\mathrm{RD}(4)=1.

Many classical results concerning Tschirnhaus transformations can be translated into statements about resolvent degree: if any p𝒫np\in\mathcal{P}_{n} can be reduced to an (nk)(n-k)-parameter form by means of a Tschirnhaus transformation TT, we have

RD(n)nk,\mathrm{RD}(n)\leq n-k,

provided that TT can itself always be determined by solving algebraic functions of at most nkn-k variables. For example, a result of Bring shows that any degree 55 polynomial can be put into the one-parameter form y5+A5y+1y^{5}+A_{5}y+1.[4] This is sufficient to imply RD(5)=1\mathrm{RD}(5)=1; in Section 4 we treat this example in detail. The bounds so-obtained are generally not sharp, as the improvements obtained by Wolfson and Sutherland have demonstrated.[19, 15]

On the other hand, this formulation of the problem implies different constraints on which Tschirnhaus transformations TT are permissible than were generally assumed by classical authors. For example, in considering the problem of transforming a general degree nn polynomial p(x)p(x) into an (nk)(n-k)-parameter form by means of a Tschirnhaus transformation TT, Sylvester only considers those transformations whose coefficients can be determined without solving any equation of degree higher than kk. This is much more restrictive than necessary if one’s goal is to show

RD(n)nk.\mathrm{RD}(n)\leq n-k.

For example, for k=6k=6, Sylvester obtains the following:

Proposition 1 (Sylvester).

For n44n\geq 44, a general polynomial of degree nn can be put into an (n6)(n-6)-parameter form by means of a Tschirnhaus transformation whose coefficients can be determined without solving any equation of degree higher than 5.

As a corollary, one obtains the resolvent degree bound RD(n)n6\mathrm{RD}(n)\leq n-6 for n44n\geq 44; on the other hand, finding the necessary Tschirnhaus transformation is a resolvent degree RD(5)=1\mathrm{RD}(5)=1 problem, since only equations of degree at most 5 are involved. Sylvester’s was nonetheless the best known bound for k=6k=6 prior to Wolfson’s 2020 proof that RD(n)n6\mathrm{RD}(n)\leq n-6 for n41n\geq 41. [19] In Wolfson’s argument, finding the necessary Tschirnhaus transformation is shown to be at worst a resolvent degree 35 problem. It has been conjectured by Wiman and (separately) Chebotarev that RD(n)15\mathrm{RD}(n)\leq 15 for n21n\geq 21, though their proposed arguments contain gaps. [18, 3] Recently, Sutherland has provided a rigorous proof of this bound using the classical theory of polarity.[15]

In the current note we use ideas from Sylvester to provide an alternative proof of the RD(n)n6\mathrm{RD}(n)\leq n-6 for n21n\geq 21 bound. In particular, we show

Theorem (Main Theorem).

For n21n\geq 21, a general polynomial of degree nn can be put into an (n6)(n-6)-parameter form by means of a Tschirnhaus transformation whose coefficients can be determined without solving an equation of degree higher than 20.

This implies RD(n)n6\mathrm{RD}(n)\leq n-6 for n21n\leq 21.

We now give an outline of the remaining sections of the paper. In Section 2 we give a brief historical overview of the classical theory of Tschirnhaus transformations and some of the important results therein. In Section 3 we turn to Sylvester’s work in particular and give a description of his “method of obliteration”: a technique for finding solutions of systems of polynomial equations when the number of variables is large relative to the number of equations, which lies at the heart of his work on Tschirnhaus transformations. In Section 4 we discuss resolvent degree and consider the n=5n=5 case in detail as an illustration of the relation between resolvent degree and the classical theory of Tschirnhaus transformations. Section 5 describes a key ingredient of our main result: the use of the “method of obliteration” to determine linear subspaces of hypersurfaces. Finally, in Section 6 we give the proof of our main theorem.

2 Some Background on Tschirnhaus Transformations

2.1 Preliminaries

Let p(x)p(x) be a polynomial over a (not necessarily closed) field KK. Then we have

p(x)=xn+a1xn1++an1x+an=i=1n(xxi)p(x)=x^{n}+a_{1}x^{n-1}+\ldots+a_{n-1}x+a_{n}=\prod_{i=1}^{n}(x-x_{i})

for some coefficients a1,,anka_{1},\ldots,a_{n}\in k and roots x1,,xnK¯x_{1},\ldots,x_{n}\in\overline{K}. Fixing some algebraic extension LL of KK, a Tschirnhaus transformation is a polynomial TL[x]T\in L[x],

T(x)=b0+b1x++bn1xn1T(x)=b_{0}+b_{1}x+\ldots+b_{n-1}x^{n-1}

which we apply to the roots of pp to obtain a new polynomial

q(y)=i=1n(yT(xi))=yn+A1yn1++A1y+A0.q(y)=\prod_{i=1}^{n}(y-T(x_{i}))=y^{n}+A_{1}y^{n-1}+\ldots+A_{1}y+A_{0}.

Note that the coefficients A0,,AnA_{0},\ldots,A_{n} are symmetric functions of the transformed roots T(x1),,T(xn)T(x_{1}),\ldots,T(x_{n}), and hence are symmetric in x1,,xnx_{1},\ldots,x_{n}. The coefficients a0,,ana_{0},\ldots,a_{n} generate the algebra of symmetric polynomials in x1,,xnx_{1},\ldots,x_{n}, so we can determine the AiA_{i} polynomially in terms of a0,,ana_{0},\ldots,a_{n} and b0,,bn1b_{0},\ldots,b_{n-1}. In particular, this implies qL[y]q\in L[y]. The precise constraints on the field LL from which the coefficients of TT may be taken vary from author to author but generally one considers towers of extensions of bounded degree. Tschirnhaus’s original goal was to find a formula in radicals for the roots of pp. More precisely, taking LL to be the solvable closure of KK, his strategy was to determine a Tschirnhaus transformation TL[x]T\in L[x] which transforms pp into the solvable form q(y)=yn+Anq(y)=y^{n}+A_{n}. If yiy_{i} is a root of qq, we can find the corresponding root(s) of pp (i.e., those xjx_{j} such that T(xj)=yiT(x_{j})=y_{i}) as the roots of

GCD(p(x),T(x)yi).\mathrm{GCD}(p(x),T(x)-y_{i}).

If there are dd roots of pp which map to yiy_{i}, this will be a degree dd polynomial. In particular, if qq has distinct roots, then the roots of pp can be recovered rationally from the roots of qq. Thus if qq is solvable and TT has coefficients in a solvable extension of KK, then pp will in general be solvable. (And at worst recovering the roots of pp will require the solution of an equation of degree n1n-1.) This strategy fails because it is not generally possible to determine TT solvably such that all the intermediate terms A1,A2,,An1A_{1},A_{2},\ldots,A_{n-1} of qq vanish. On the other hand, there are weaker versions of Tschirnhaus’s problem which are tractable. More precisely, we can ask: what conditions on nn and LL are sufficient for there to exist TT in L[x]L[x] transforming the degree nn polynomial pK[x]p\in K[x] to a polynomial qL[y]q\in L[y] with A1==Ak=0A_{1}=\ldots=A_{k}=0? That is, we consider the problem of setting some (rather than all) of the intermediate coefficients to zero (thus reducing the number of parameters of the family of polynomials), and we relax the requirement that LL necessarily be a solvable extension.

We now consider in more detail what is necessary to determine TT such that A1==Ak=0A_{1}=\ldots=A_{k}=0. As we remarked above, AiA_{i} is a polynomial function of a1,,ana_{1},\ldots,a_{n} and b0,,bn1b_{0},\ldots,b_{n-1}. Treating the bib_{i}’s as unknowns to be determined, we further observe that AiA_{i} is homogeneous of degree ii in the variables b0,,bn1b_{0},\ldots,b_{n-1}. For example,

A1\displaystyle A_{1} =i=1nT(xi)\displaystyle=-\sum_{i=1}^{n}T(x_{i})
=i=1n(bn1xin1+bn2xin2++b1xi+b0)\displaystyle=-\sum_{i=1}^{n}(b_{n-1}x_{i}^{n-1}+b_{n-2}x_{i}^{n-2}+\ldots+b_{1}x_{i}+b_{0})
=(i=1nxin1)bn1+(i=1nxin2)bn2++(i=1nxi)b1nb0\displaystyle=\left(-\sum_{i=1}^{n}x_{i}^{n-1}\right)b_{n-1}+\left(-\sum_{i=1}^{n}x_{i}^{n-2}\right)b_{n-2}+\ldots+\left(-\sum_{i=1}^{n}x_{i}\right)b_{1}-nb_{0}

we can ensure A1=0A_{1}=0 as long as the coefficients b0,,bn1b_{0},\ldots,b_{n-1} of TT are chosen to satisfy a single homogeneous linear equation. (The coefficients of this equation are symmetric in the xix_{i}, so again can be expressed in terms of the coefficients a1,,ana_{1},\ldots,a_{n} of pp; it is not necessary to know the roots xix_{i}.) More generally, AiA_{i} vanishes if and only if the iith symmetric function of T(x1),,T(xn)T(x_{1}),\ldots,T(x_{n}) vanishes, so to find a Tschirnhaus transformation TT such that Ai=0A_{i}=0 in the transformed polynomial we need to solve a degree ii polynomial in b0,,bn1b_{0},\ldots,b_{n-1} whose coefficients are polynomials in a1,,ana_{1},\ldots,a_{n}. Thus, to determine TT we must find a point in 𝐛n1\mathbb{P}^{n-1}_{\mathbf{b}} on the intersection of the hypersurfaces V(A1),V(A2),,V(Ak)V(A_{1}),V(A_{2}),\ldots,V(A_{k}), which are of degree 1,2,,k1,2,\ldots,k, respectively.

2.2 Some Results on Tschirnhaus Transformations from 1683 to Present

Tschirnhaus, in his 1683 paper introducing these transformations, claimed to be able to solve any degree nn polynomial by removing all intermediate terms.[17] That is, he claimed one could find a transformation TT such that

A1=A2==An1=0A_{1}=A_{2}=\ldots=A_{n-1}=0

and the transformed polynomial has the solvable form

yn+An=0.y^{n}+A_{n}=0.

The problem with this strategy is that finding the necessary TT requires the solution of a system of polynomial equations of degrees 1,2,,n11,2,\ldots,n-1. In general this leads to an equation of degree (n1)!(n-1)!, so for n>3n>3 finding TT is a priori more complicated than finding the roots of the original degree nn polynomial. Tschirnhaus did show explicitly how to find TT in the n=3n=3 case, and his proof readily generalizes to removing the first two intermediate terms from any degree nn polynomial.

Finding a Tschirnhaus transformation TT yielding A1=A2==Ak=0A_{1}=A_{2}=\ldots=A_{k}=0 requires solving a system of kk equations in nn variables, of degree 1,2,3,,k1,2,3,\ldots,k. In the worst case this leads to an equation of degree k!k!, but when the system is sufficiently undetermined (i.e., nn much larger than kk), a solution can be found without solving any polynomial of degree greater than kk. The first result of this kind was given in 1786 by Bring, who showed that when n5n\geq 5, we can find TT such that A1=A2=A3=0A_{1}=A_{2}=A_{3}=0 without solving any equation of degree higher than three.[4]

In 1834, Jerrard recovered independently Bring’s result that removal of three terms from a degree 5 polynomial was possible by means of a Tschirnhaus transformation.[10] In fact, Jerrard went on to claim that his methods could yield a reduction of a general quintic to a solvable form.[11] (Jerrard was aware of, but did not accept, Abel’s 1824 work on the insolvability of the quintic.)

Shortly thereafter Hamilton was commissioned by the British Association for the Advancement of Science to investigate the validity of Jerrard’s methods, issuing his report in 1837.[8] He showed that Jerrard’s reductions were in many cases “illusory”. Jerrard allowed Tschirnhaus transformations of degree potentially as high or higher than the original polynomial pp, and Hamilton demonstrated that in the “illusory” cases the transformation TT found by Jerrard was a multiple of pp. In this case the transformed polynomial is always q(y)=ynq(y)=y^{n} (the same as if the “transformation” y=T(x)=0y=T(x)=0 were permitted) for any degree nn polynomial pp, so there is no hope of determining the roots of pp by studying qq. On the other hand, Hamilton showed that Jerrard’s methods can be a made to work –and the “illusory” cases avoided – in higher degrees. More precisely, for any fixed kk, one can find a (nonzero, degree n1\leq n-1) Tschirnhaus transformation TT yielding A1=A2=Ak=0A_{1}=A_{2}\ldots=A_{k}=0 without solving any equation of degree higher than kk, provided the degree nn of the original polynomial pp is large enough relative to kk. In particular, for k=4k=4, Hamilton showed n11n\geq 11 suffices, and for k=5k=5, n47n\geq 47.

In 1886, Sylvester published a geometric explanation of the Hamilton/Jerrard method, sharpened some of the bounds (finding, in particular, n10n\geq 10 suffices for k=4k=4, and n44n\geq 44 for k=5k=5).[16] Sylvester claimed that these sharpened bounds are optimal if no “elevation of degree” is allowed – that is, if no equations of degree higher than kk are to be solved. This claim requires careful interpretation – Sylvester does not consider the possibility of higher degree polynomials arising but factoring into lower degree terms. This is known to happen for k=3k=3: if one attempts to remove three terms from a degree nn polynomial, the system of equations of degree 1, 2, and 3 leads in general to an equation for degree 6. Sylvester’s method is able to avoid this elevation of degree only when n5n\geq 5. For n=4n=4, Sylvester’s method is inapplicable, but Lagrange had shown (in 1771, more than 100 years prior) that the degree 6 equation that arises in this case always factors into a pair of cubics, so that the necessary Tschirnhaus transformation can in fact be determined without solving any equation of degree higher than 3.[12]

Further reductions have been achieved, but all loosen or ignore the “elevation of degree” restriction in one way or another. Viewing the degree 9 polynomial

p(x)=a0x9+a1x8++a8x+a9p(x)=a_{0}x^{9}+a_{1}x^{8}+\ldots+a_{8}x+a_{9}

as a multi-valued “algebraic function” sending (a0,,a9)(a_{0},\ldots,a_{9}) to the set of roots of pp, Hilbert asked whether this could be written as a composition of algebraic functions of at most 4 variables.[9] As part of his investigation, Hilbert sketched a method for finding a Tschirnhaus transformation such that A1=A2=A3=A4=0A_{1}=A_{2}=A_{3}=A_{4}=0 when n9n\geq 9. Hilbert’s method requires finding a line on a cubic surface. This in turn requires the solution of an equation of degree 27. However, by putting the equation of the cubic surface into a four-parameter canonical form (the “pentahedral form”, originally suggested by Sylvester), Hilbert was able to show that the coefficients of the necessary line were themselves algebraic functions of four variables, so this was sufficient for his purposes.

In 1945, B. Segre strengthened Hilbert’s result, describing an algorithm for finding a Tschirnhaus transformation removing 4 terms from a degree n9n\geq 9 polynomial, without solving any equation of degree higher than 5.[13] The same result was later proven by Dixmier using different methods.[6] Also building on Hilbert’s work were Wiman, who sketched an alternative proof of the n=9n=9 bound for k=4k=4, and G. Chebotarev, who extended Wiman’s ideas to sketch a proof that n=21n=21 for k=5k=5.[3, 18, 14] In 1975, Brauer showed that, when n>k!n>k!, a degree nn polynomial can be written as a composition of algebraic functions of (nk1)(n-k-1) variables; in this framing of the problem it is permissible to simply solve the degree k!k! to find a Tschirnhaus transformation removing kk terms, since this will be an algebraic function of fewer variables than the original degree nn polynomial.[1] For k7,k\geq 7, this n>k!n>k! bound is lower than the corresponding bounds for removal of kk terms found by Sylvester, though it should again be emphasized that this ignores the “no elevation of degree“ constraint which is explicit in Sylvester, and implicit in pre-Sylvester work on the problem.

In 2020, Wolfson described a function F(k)F(k) such that a degree nn polynomial can be put in an (nk1)(n-k-1)-parameter form whenever nF(k)n\geq F(k), and which improved significantly on Brauer’s bounds.[19] Wolfson frames the problem in terms of the theory resolvent degree, which allows bounds on the number of necessary parameters for a family of algebraic functions to be computed based on the dimensions of certain spaces. For k=5k=5, Wolfson shows n41n\geq 41, which also improves on Sylvester’s bound of n44n\geq 44. To find the Tschirnhaus transformation accomplishing the reduction in this case requires, among other things, finding a 2-plane inside of a cubic hypersurface in 6\mathbb{P}^{6}. Informally, since the dimension of the moduli space of cubic hypersurfaces in 6\mathbb{P}^{6} is 35, finding a 2-plane inside such a hypersurface is at worst a 35-parameter problem, so is permissible when putting a degree 41 polynomial into 35-parameter form. This approach does not produce the degrees of the equations which must be solved to find the Tschirnhaus transformation (which may be higher than nn, as in Hilbert’s proof that n=9n=9 suffices for k=4k=4, where it was necessary to solve a degree 27 equation to find a line on a cubic surface).

Most recently, Sutherland used a combination of ideas from the classical theory of polarity together with optimizations to the moduli-dimension-counting method to improve on the F(k)F(k) bound for k=5,,15k=5,\ldots,15 and all k18k\geq 18.[15] In particular, for k=5k=5, Sutherland shows that n=21n=21 suffices, giving the first rigorous proof of the bound first claimed by Chebotarev.

In the current note, we show that the n=21n=21 result can be recovered from Sylvester’s method if partial elevation of degree is allowed. More precisely, one can find a Tschirnhaus transformation removing 5 terms (i.e., such that the coefficients of the transformed polynomial satisfy A1=A2=A3=A4=A5=0A_{1}=A_{2}=A_{3}=A_{4}=A_{5}=0) from a degree nn polynomial whenever n21n\geq 21, by solving no equation of degree higher than 20. A key ingredient is that there exists an explicit algorithm to find a 2-plane on a cubic hypersurface in 9\mathbb{P}^{9}, without solving any equation of degree higher than 5.

3 Sylvester’s Obliteration Formula

Though motivated by the problem of finding Tschirnhaus transformations, the procedure Sylvester describes in [16] is remarkably general, allowing for a solution to be found to any sufficiently underdetermined system, without elevation of degree. More precisely, given a system of equations SS of polynomial equations in NN variables of degree at most kk, with nin_{i} the number of equations of degree ii, Sylvester shows there is a function l(n1,,nk)l(n_{1},\ldots,n_{k}) such that, if N>l(n1,,nk)N>l(n_{1},\ldots,n_{k}), then a solution to SS can be found without solving any equation of degree greater than kk.

The method is as follows: Pick any ff of maximal degree kk from the system SS. We can find a solution to SS by first finding a line LL contained in the solution set of the subsystem S=S{f}S^{\prime}=S\setminus\{f\}, then intersecting this line with the vanishing locus of ff. To find LL, first find a solution Q=(q0,,qN)Q=(q_{0},\ldots,q_{N}) to SS^{\prime}. Then to get the required line it suffices to find a point P=(p0,,pN)P=(p_{0},\ldots,p_{N}) such that P+λQP+\lambda Q is a solution to SS^{\prime} for all λ\lambda\in\mathbb{C}. For any equation gSg\in S^{\prime}, then, view g(P+λQ)g(P+\lambda Q) as a polynomial in λ\lambda; if deg(g)=d\deg(g)=d, the coefficients of 1,λ,,λd1,λd1,\lambda,\ldots,\lambda^{d-1},\lambda^{d} must vanish identically. In fact the coefficient of λd\lambda^{d} is just g(Q)g(Q), so vanishes since we chose QQ to be a solution of SS^{\prime}. The vanishing of the remaining coefficients imposes polynomial conditions on the p0,,pNp_{0},\ldots,p_{N} of degrees 1,2,,d1,2,\ldots,d. Ranging over all gSg\in S^{\prime}, we see that PP must be a solution to a system S′′S^{\prime\prime} with mim_{i} equations of degree ii, where

mk\displaystyle m_{k} =nk1\displaystyle=n_{k}-1
mk1\displaystyle m_{k-1} =nk+nk1\displaystyle=n_{k}+n_{k-1}
mk2\displaystyle m_{k-2} =nk+nk1+nk2\displaystyle=n_{k}+n_{k-1}+n_{k-2}
\displaystyle\ \vdots
m1\displaystyle m_{1} =nk+nk1++n1.\displaystyle=n_{k}+n_{k-1}+\ldots+n_{1}.

Now to find this needed point solution, we proceed inductively: again hold aside some polynomial of highest degree (say hh), and find a linear solution to the subsystem S′′{h}S^{\prime\prime}\setminus\{h\}, then intersect that line with hh, and so on. The number of equations of maximal degree decreases by 1 with each step, so eventually we are left with a (possibly very large) system of linear equations; a line in the solution set to this system can be found provided the number of variables is (at least) one greater than the number of equations. Then to backtrack to a linear solution to the original system requires only for equations of degree k\leq k to be solved one at a time.

The core reduction is succinctly summarized by Sylvester’s formula of obliteration, which we will refer to repeatedly:

Proposition 2 (Sylvester).

Given n1,,nk,n_{1},\ldots,n_{k}\in\mathbb{N}, let

[nk,nk1,,n2,n1][n_{k},n_{k-1},\ldots,n_{2},n_{1}]

denote the minimum number of variables such that a linear solution to any system of equations with exactly nin_{i} equations of degree ii can be found without elevation of degree. Then

[nk,nk1,,n2,n1]1+[mk,mk1,,m2,m1][n_{k},n_{k-1},\ldots,n_{2},n_{1}]\leq 1+[m_{k},m_{k-1},\ldots,m_{2},m_{1}]

where

mi={j=iknjiknk1i=k.m_{i}=\begin{cases}\sum_{j=i}^{k}n_{j}&i\neq k\\ n_{k}-1&i=k.\end{cases}

Note that applying the obliteration formula nkn_{k} times yields a system with no equations of degree kk, so all equations of the highest degree can be removed. This can be repeated until only (a large number of) linear equations remain, in which case the minimum number of variables needed is easy to determine. For example, consider the problem of finding a line on a cubic hypersurface. We have a system of equations with 1 equation of degree 3, 0 equations of degree 2, and 0 of degree 1. By repeated application of the obliteration formula,

[1,0,0]1+[1,1]2+[2]=2+3=5,[1,0,0]\leq 1+[1,1]\leq 2+[2]=2+3=5,

so it is possible to find a line solvably (in fact, by solving no equation of degree greater than 3) on a cubic hypersurface in 5\mathbb{P}^{5}.

4 Resolvent Degree

4.1 Definitions

Let kk be an algebraically closed field and let XYX\to Y be a generically finite dominant map of kk-varieties. Following Buhler and Reichstein [2], we define the essential dimension of this map, ed(XY)\mathrm{ed}(X\to Y), to be the minimal dd such that there exists a pullback square

X{X}W{W}Y{Y}Z{Z}

with dimZd\dim Z\leq d. The idea of resolvent degree is to extend this notion to allow towers of pullbacks; essentially, a map is of resolvent degree at most dd if it can be written as a composition of maps each of essential dimension at most dd. More precisely, following Farb and Wolfson [7], the resolvent degree, RD(XY)\mathrm{RD}(X\to Y), is defined to be the minimal dd such that there exists a tower

ErE1E0=U,E_{r}\to\cdots\to E_{1}\to E_{0}=U,

where UU is an open subset of YY, ErUE_{r}\to U factors through a dominant map ErXE_{r}\to X, and ed(EiEi1)d\mathrm{ed}(E_{i}\to E_{i-1})\leq d for each ii. It follows from these definitions that

RD(XY)ed(XY)dimY.\mathrm{RD}(X\to Y)\leq\mathrm{ed}(X\to Y)\leq\dim Y.

It is convenient, in proving results on resolvent degree, to construct towers of maps of bounded resolvent degree (rather than essential dimension). Lemma 2.7 in Farb and Wolfson [7] shows that this is sufficient.

The connection to polynomials and their roots is as follows. Let 𝒫n\mathcal{P}_{n} denote the space of monic degree nn polynomials with coefficients in kk and let

𝒫n~={(p,r)𝒫n×𝔸k1p(r)=0},\widetilde{\mathcal{P}_{n}}=\{(p,r)\in\mathcal{P}_{n}\times\mathbb{A}^{1}_{k}\mid p(r)=0\},

the space of monic polynomials together with a choice of root. We then define

RD(n):=RD(𝒫n~𝒫n)\mathrm{RD}(n):=\mathrm{RD}(\widetilde{\mathcal{P}_{n}}\to\mathcal{P}_{n})

where the map is “forgetting the root”. RD(n)\mathrm{RD}(n) captures the complexity of the root cover in the sense that if there is a formula in terms of algebraic functions of at most dd variables for finding the roots of a degree nn polynomial in terms of its coefficients, then RD(n)d\mathrm{RD}(n)\leq d.

4.2 Resolvent Degree of a Dominant Map

It is necessary to extend the definition of resolvent degree to dominant maps of kk-varieties which are not necessarily generically finite. Although we are principally interested in RD(n)=𝒫n~𝒫n\mathrm{RD}(n)=\widetilde{\mathcal{P}_{n}}\to\mathcal{P}_{n}, which is generically finite, a number of maps which arise naturally in the study of Tschirnhaus transformations are not. For example, a key step in Bring’s proof that a general quintic can be reduced to a one-parameter form involves finding a line on a quadric surface in 3\mathbb{P}^{3}. Thus we would like to study

2,312,3\mathcal{H}_{2,3}^{1}\to\mathcal{H}_{2,3}

where 2,3\mathcal{H}_{2,3} is the parameter space of quadric surfaces in 3\mathbb{P}^{3} and 2,31\mathcal{H}_{2,3}^{1} is the space of such quadrics together with a choice of line on the surface, and the map is “forgetting the line”. Since any quadric surface contains an infinite family of lines, this map is not generically finite. We would like to extend the notion of resolvent degree so that RD(2,312,3)\mathrm{RD}(\mathcal{H}_{2,3}^{1}\to\mathcal{H}_{2,3}) captures the complexity of finding lines on quadric surfaces in the same way that RD(𝒫n~𝒫n)\mathrm{RD}(\widetilde{\mathcal{P}_{n}}\to\mathcal{P}_{n}) captures the complexity of finding roots.

This is done in [19] in the following way: given a dominant map π:XY\pi:X\to Y, define the resolvent degree RD(XY)\mathrm{RD}(X\to Y) to be the minimum dd for which there exists a dense collection of subvarieties {UαX}\{U_{\alpha}\subset X\} with

π|Uα:UαY\pi|_{U_{\alpha}}:U_{\alpha}\to Y

a generically finite dominant map and RD(UαY)d\mathrm{RD}(U_{\alpha}\to Y)\leq d for all α.\alpha. (Such a subvariety UαU_{\alpha} is called a rational multi-section for π\pi.)

Requiring the collection of multisections to be dense in XX is necessary to ensure that resolvent degree behaves well with respect to composition of dominant maps: given XYX\to Y and YZY\to Z dominant, for the composition XZX\to Z we have RD(XZ)max{RD(XY),RD(YZ)}\mathrm{RD}(X\to Z)\leq\max\{\mathrm{RD}(X\to Y),\mathrm{RD}(Y\to Z)\} by [19, Lemma 4.10]. Having multi-sections VYV\to Y for XYX\to Y and UZU\to Z for YZY\to Z is not generally enough to product a multisection for XZX\to Z (the range of VYV\to Y could entirely miss UU). On the other hand, if VYV\to Y is surjective, then we do obtain a multi-section of XZX\to Z.

Returning to the example of lines on quadrics, an algorithm for determining a line (or finite set of lines) on each quadric surface in terms of algebraic functions of degree dd gives a multi-section U2,3U\to\mathcal{H}_{2,3} with RD(U2,3)RD(d)\mathrm{RD}(U\to\mathcal{H}_{2,3})\leq\mathrm{RD}(d), but this is somewhat less than what is required to show that RD(2,312,3)RD(d)\mathrm{RD}(\mathcal{H}_{2,3}^{1}\to\mathcal{H}_{2,3})\leq\mathrm{RD}(d). On the other hand, as we shall see in the next section, determining any one line is sufficient to yield a Tschirnhaus transformation reducing the quintic to a one-parameter form, so that RD(5)=1\mathrm{RD}(5)=1.

4.3 Resolvent Degree of the Quintic

As an illustrative example, we now give a detailed proof that (as observed in [7]) Bring’s work on Tschirnhaus transformations of quintic polynomials implies RD(5)=1\mathrm{RD}(5)=1. (Recall that by definition, RD(5)=RD(𝒫5~𝒫5)\mathrm{RD}(5)=\mathrm{RD}(\widetilde{\mathcal{P}_{5}}\to\mathcal{P}_{5}), where 𝒫5\mathcal{P}_{5} is the parameter space of monic degree 5 polynomials and 𝒫5~\widetilde{\mathcal{P}_{5}} is the space of such polynomials together with a choice of root.)

Recall that given a polynomial p(x)=x5+a1x4+a2x3+a3x2+a4x+a5p(x)=x^{5}+a_{1}x^{4}+a_{2}x^{3}+a_{3}x^{2}+a_{4}x+a_{5} in 𝒫5\mathcal{P}_{5} and a Tschirnhaus transformation of the form T(x)=b0x4+b1x3+b2x2+b3x+b4T(x)=b_{0}x^{4}+b_{1}x^{3}+b_{2}x^{2}+b_{3}x+b_{4}, we can set y=T(x)y=T(x) and eliminate xx to obtain a polynomial

q(y)=y5+A1y4+A2y3+A3y2+A4y+A5,q(y)=y^{5}+A_{1}y^{4}+A_{2}y^{3}+A_{3}y^{2}+A_{4}y+A_{5},

where the coefficient AiA_{i} is a degree ii homogeneous polynomial in b0,b1,b2,b3,b4b_{0},b_{1},b_{2},b_{3},b_{4}, whose coefficients are integral functions of a0,a1,a2,a3,a4,a5a_{0},a_{1},a_{2},a_{3},a_{4},a_{5}.

Our strategy will be to construct a tower of maps

X5X4X3X2X1𝒫5X_{5}\to X_{4}\to X_{3}\to X_{2}\to X_{1}\to\mathcal{P}_{5}

such that the resolvent degree of each map is 1 and there is a dominant rational map X5𝒫5~X_{5}\to\widetilde{\mathcal{P}_{5}}. Informally, in X1X_{1} there will be associated to each quintic polynomial pp the equation of a hyperplane determining the set of Tschirnhaus transformations such that A1=0A_{1}=0. In X2X_{2}, we will further have the information of a quadric QQ contained in this hyperplane, corresponding to A1=A2=0A_{1}=A_{2}=0, and the equation of a line ll contained in QQ. In X3X_{3} we will further have a choice of Tschirnhaus transformation TT which lies on ll while also satisfying A3=0A_{3}=0. Using this information we can construct a map from X3X_{3} to the one-dimensional space of quintic polynomials of the form z5+Az+1z^{5}+Az+1, and we will take X4X_{4} to be the pullback of the root cover of this space, so that X4X_{4} associates to pp a root of the polynomial obtained by applying TT to pp. Finally, in constructing X5X_{5}, we recover the roots of pp itself from the roots of the transformed polynomial. Each step of this process is of bounded resolvent degree.

More precisely: viewing the bib_{i} as homogeneous coordinates on 4\mathbb{P}^{4}, there is associated to each polynomial p𝒫5p\in\mathcal{P}_{5} a hyperplane H𝐛4H\subset\mathbb{P}^{4}_{\mathbf{b}} consisting of all Tschirnhaus transformations that send pp to a polynomial q(y)q(y) with A1=0A_{1}=0. Let X1X_{1} denote the space of ordered pairs (p,H)(p,H) with pp and HH as just described. Then the map X1𝒫5X_{1}\to\mathcal{P}_{5} is birational, hence has resolvent degree 1.

Next, we consider the set of all Tschirnhaus transformations such that A1=0A_{1}=0 and A2=0A_{2}=0 in the transformed polynomial. The latter is a degree 2 homogeneous polynomial in b0,b1,b2,b3,b4b_{0},b_{1},b_{2},b_{3},b_{4}, so to each polynomial pp there is associated a quadric surface QH3Q\subset H\cong\mathbb{P}^{3} whose points are Tschirnhaus transformations satisfying both conditions. This defines a map

X12,3X_{1}\to\mathcal{H}_{2,3}

where 2,3\mathcal{H}_{2,3} is the parameter space of quadric surfaces in 3\mathbb{P}^{3}. Letting 2,31\mathcal{H}_{2,3}^{1} denote the space of such surfaces together with a choice of line, we consider the map

2,312,3.\mathcal{H}_{2,3}^{1}\to\mathcal{H}_{2,3}.

An algorithm exists for finding a line on a quadric surface that requires solving only a single quadratic equation, so there is a rational multi-section U2,3U\to\mathcal{H}_{2,3} with

RD(U2,3)RD(2)=1.\mathrm{RD}(U\to\mathcal{H}_{2,3})\leq\mathrm{RD}(2)=1.

(In fact there is a dense collection of such multi-sections, so RD(2,312,3)=1\mathrm{RD}(\mathcal{H}_{2,3}^{1}\to\mathcal{H}_{2,3})=1, but this is stronger than we require.)

We define X2X_{2} via the pullback square

X2{X_{2}}X1{X_{1}}U{U}2,3{\mathcal{H}_{2,3}}

so that RD(X2X1)RD(U2,3)=1\mathrm{RD}(X_{2}\to X_{1})\leq\mathrm{RD}(U\to\mathcal{H}_{2,3})=1. Points of X2X_{2} are ordered tuples (p,H,Q,l)(p,H,Q,l), where pp is a quintic polynomial, HH is the space of all Tschirnhaus transformations that send pp to a polynomial with A1=0A_{1}=0, QHQ\subset H is the quadric surface whose points are Tschirnhaus transformations such that A1=0A_{1}=0 and A2=0A_{2}=0, and ll is a line contained in QQ.

Finally, to find a Tschirnhaus transformation TT satisfying A1=0A_{1}=0, A2=0A_{2}=0, and A3=0A_{3}=0 requires intersecting the cubic hypersurface defined by A3=0A_{3}=0 with the line ll. To find the three points of intersection requires only the solution of a cubic equation. Defining X3X_{3} to be the space of tuples (p,H,Q,l,T)(p,H,Q,l,T), with p,H,Q,lp,H,Q,l as above and TT a Tschirnhaus transformation yielding A1=A2=A3=0A_{1}=A_{2}=A_{3}=0, the projection map

X3X2X_{3}\to X_{2}

has resolvent degree RD(3)=1RD(3)=1.

Now let 𝒫5\mathcal{P}_{5}^{\prime} denote the space of quintic polynomials of the form z5+Az+1z^{5}+Az+1. Given a point (p,H,Q,l,T)(p,H,Q,l,T) in X3X_{3}, we can apply the transformation TT to pp to obtain a polynomial

q(y)=y5+A4y+A5.q(y)=y^{5}+A_{4}y+A_{5}.

Applying a change of variables y=A55zy=\sqrt[5]{A_{5}}z and dividing by A5A_{5} yields

z5+A4A55A5z+1z^{5}+\frac{A_{4}\sqrt[5]{A_{5}}}{A_{5}}z+1

so this procedure defines a map X3𝒫5X_{3}\to\mathcal{P}_{5}^{\prime}. Now, the root cover

𝒫5~𝒫5\widetilde{\mathcal{P}_{5}^{\prime}}\to\mathcal{P}_{5}^{\prime}

has resolvent degree

RD(𝒫5~𝒫5)dim(𝒫5)=1,\mathrm{RD}(\widetilde{\mathcal{P}_{5}^{\prime}}\to\mathcal{P}_{5}^{\prime})\leq\dim\left(\mathcal{P}_{5}^{\prime}\right)=1,

so defining X4X_{4} via the pullback square

X4{X_{4}}X3{X_{3}}𝒫5~{\widetilde{\mathcal{P}_{5}^{\prime}}}𝒫5{\mathcal{P}_{5}^{\prime}}

we have RD(X4X3)=1\mathrm{RD}(X_{4}\to X_{3})=1. Points of X4X_{4} are of the form (p,H,Q,l,T,λ)(p,H,Q,l,T,\lambda), where (p,H,Q,l,T)X3(p,H,Q,l,T)\in X_{3} and λ\lambda is a root of the transformed polynomial z5+A4A55A5z+1z^{5}+\frac{A_{4}\sqrt[5]{A_{5}}}{A_{5}}z+1. Let X5X_{5} be the space whose points are tuples (p,H,Q,l,T,μ)(p,H,Q,l,T,\mu), where μ\mu is a root of pp. There is a map

X5X4X_{5}\to X_{4}

defined by

(p,H,Q,l,T,μ)(p,H,Q,l,T,T(μ)A55).(p,H,Q,l,T,\mu)\mapsto\left(p,H,Q,l,T,\frac{T(\mu)}{\sqrt[5]{A_{5}}}\right).

On the other hand, given a root λ\lambda of the transformed polynomial, we can find the corresponding roots of pp by solving the (at worst degree 4) polynomial equation

GCD(p(x),A55T(x)λ)=0\mathrm{GCD}(p(x),\sqrt[5]{A_{5}}T(x)-\lambda)=0

so RD(X5X4)=RD(4)=1\mathrm{RD}(X_{5}\to X_{4})=\mathrm{RD}(4)=1.

In all, we have constructed a tower of maps

X5X4X3X2X1𝒫5X_{5}\to X_{4}\to X_{3}\to X_{2}\to X_{1}\to\mathcal{P}_{5}

each of which has resolvent degree 1. Since the root cover 𝒫5~𝒫5\widetilde{\mathcal{P}_{5}}\to\mathcal{P}_{5} factors through the projection X5𝒫5~X_{5}\to\widetilde{\mathcal{P}_{5}}, (p,H,Q,l,T,μ)(p,μ)(p,H,Q,l,T,\mu)\mapsto(p,\mu), we have

RD(5)=RD(𝒫5~𝒫5)=1.\mathrm{RD}(5)=\mathrm{RD}(\widetilde{\mathcal{P}_{5}}\to\mathcal{P}_{5})=1.

5 Linear subspaces of hypersurfaces

To prove RD(5)=1\mathrm{RD}(5)=1 we needed to show one can always find a Tschirnhaus transformation which eliminates the first three intermediate terms of a quintic polynomial. This required finding a solution of a system of three equations of degree 1, 2, and 3, respectively. A key geometric fact which made this tractable was that any quadric surface in 3\mathbb{P}^{3} contains a line, and that an equation for such a line can be found by solving a quadratic equation; this allows the degree 2 equation (which determines the quadric surface) to be replaced by a degree 1 equation (which determines the line), so that to find a solution of the system requires only the solution of a cubic equation.

Similarly, in Hilbert’s proof that RD(9)=4\mathrm{RD}(9)=4, a Tschirnhaus transformation removing four intermediate terms is required, so a system of equations of degrees 1, 2, 3, and 4 must be solved. Informally, Hilbert’s idea is to first find a 3-plane inside the hypersurface determined by the equation of degree 2 (which is possible as long as the ambient dimension is high enough), then to find a line on the cubic surface determined inside this 3-plane by the equation of degree 3. If this can be done, all that remains is to intersect the line with the remaining equation of degree 4 and a solution can be found.

In general, one can consider the problem of finding a kk-plane inside a degree dd hypersurface in N\mathbb{P}^{N}. We can ask several questions:

  1. Q1:

    In terms of dd and kk, how large must the ambient dimension NN be to guarantee that any degree dd hypersurface contains a kk-plane?

  2. Q2:

    What is the “resolvent degree of finding the kk-plane”? That is, if d,N\mathcal{M}_{d,N} is a moduli space of degree dd hypersurfaces in N\mathbb{P}^{N} and d,Nk\mathcal{M}_{d,N}^{k} is the space of such hypersurfaces together with a choice of kk-plane, what is the resolvent degree of the map

    d,Nkd,N\mathcal{M}_{d,N}^{k}\to\mathcal{M}_{d,N}

    which forgets the choice of plane?

  3. Q3:

    When is there a constructive algorithm to determine the kk-plane? What are the degrees of the equations that must be solved? How large must NN be if no equation of degree higher than some given bound is to be permitted?

As the RD(5)=1\mathrm{RD}(5)=1 and RD(9)=4\mathrm{RD}(9)=4 examples above illustrate, the answers to these questions have implications for the problem of finding Tschirnhaus transformations, since replacing a hypersurface with a linear subspace of that hypersurface allows us to replace a degree dd equation with one or more equations of degree 1, reducing the total degree of the system to be solved.

An answer to Q1 is given in Debarre and Manivel [5]: for d>3d>3, any degree dd hypersurface in N\mathbb{P}^{N} contains a kk-plane if

N(d+kk)k+1+k.N\geq\frac{{d+k\choose k}}{k+1}+k.

In this case, the map d,Nkd,N\mathcal{M}_{d,N}^{k}\to\mathcal{M}_{d,N} is surjective, and an upper bound on its resolvent degree is given by the dimension of the codomain:

RD(d,Nkd,N)dim(d,N).\mathrm{RD}\left(\mathcal{M}_{d,N}^{k}\to\mathcal{M}_{d,N}\right)\leq\dim\left(\mathcal{M}_{d,N}\right).

For example,

RD(3,313,3)dim(3,3)=4,\mathrm{RD}\left(\mathcal{M}_{3,3}^{1}\to\mathcal{M}_{3,3}\right)\leq\dim\left(\mathcal{M}_{3,3}\right)=4,

corresponding to Hilbert’s observation that finding a line on a cubic surface requires the solution of at most an algebraic function of four variables.

These dimension-counting arguments do not provide enough information to address Q3. For this we return to Sylvester’s ideas. First, for the problem of finding a line on a degree dd hypersurface, repeated application of the obliteration formula suffices to compute an ambient dimension NN such that the desired line may be found without solving any equation of degree higher than dd. Sylvester’s methods can be readily adapted to the problem of finding a kk-plane on a degree dd hypersurface without solving any equation of degree higher than dd. Note that, when NN is large enough for this to be possible,

RD(d,Nkd,N)RD(d)\mathrm{RD}\left(\mathcal{M}_{d,N}^{k}\to\mathcal{M}_{d,N}\right)\leq\mathrm{RD}(d)

giving a bound on resolvent degree which is often sharper than what one obtains from dimension-counting alone (though at the price of requiring a larger ambient dimension NN than that given by Waldron’s theorem), so Sylvester’s ideas are of interest even if one is only concerned with finding bounds on resolvent degree. In the remainder of this section we look at the d=3d=3 case in detail.

5.1 Linear subspace of cubic hypersurfaces

Let 3,N\mathcal{M}_{3,N} be the moduli space of smooth cubic hypersurfaces in N\mathbb{P}^{N} and let 3,Nr\mathcal{M}_{3,N}^{r} be the moduli space of pairs (C,P)(C,P) where CC is a smooth cubic hypersurface and PP is an rr-plane contained in CC. We can then consider the “resolvent degree of finding an rr-plane”, i.e.,

RD(3,Nr3,N),\mathrm{RD}(\mathcal{M}_{3,N}^{r}\to\mathcal{M}_{3,N}),

where the map forgets the choice of plane. For example, for the problem of finding a line on a cubic surface, we have

RD(3,313,3)dim(3,3)=4,\mathrm{RD}(\mathcal{M}_{3,3}^{1}\to\mathcal{M}_{3,3})\leq\dim(\mathcal{M}_{3,3})=4,

so that finding a line on a cubic surface in 3\mathbb{P}^{3} requires the solution of algebraic equations of no more than 4 variables. Farb and Wolfson [7], using an argument due to Klein, show this bound can be improved to

RD(3,313,3.)3.\mathrm{RD}(\mathcal{M}_{3,3}^{1}\to\mathcal{M}_{3,3}.)\leq 3.

In higher dimensions it is easier to find a line. In particular, by Sylvester’s obliteration method, as discussed in section 3, finding a line on a cubic surface in 5\mathbb{P}^{5} requires the solution of no equation of degree higher than 3, and so is a resolvent degree 1 problem.

Proposition 3 (Sylvester).

Given a cubic hypersurface VV in n\mathbb{P}^{n}, we can find a line contained in VV by solving equations of degree no higher than 3 provided n5.n\geq 5. Hence

RD(3,513,5)RD(3)=1.\mathrm{RD}(\mathcal{M}_{3,5}^{1}\to\mathcal{M}_{3,5})\leq\mathrm{RD}(3)=1.
Proof.

We wish to find a linear solution to a system of equations consisting of 1 equation of degree 33 and no equations of lower degree. By Sylvester’s obliteration formula, the ambient dimension required is

[1,0,0]1+[1,1]2+[2]=2+3=5.[1,0,0]\leq 1+[1,1]\leq 2+[2]=2+3=5.

Sylvester’s method also extends to finding higher-dimensional linear subspaces of hypersurfaces, when the dimension of the ambient space is large enough. For example, for cubic hypersurfaces in 9\mathbb{P}^{9}, finding a 2-plane can also be done solvably.

Proposition 4 (Sylvester).

Given a cubic hypersurface V=V(f)V=V(f) in n\mathbb{P}^{n}, we can find a 2-plane contained in VV by solving equations of degree no higher than 3, provided n11n\geq 11. Hence

RD(3,1123,11)RD(3)=1.\mathrm{RD}(\mathcal{M}_{3,11}^{2}\to\mathcal{M}_{3,11})\leq\mathrm{RD}(3)=1.
Proof.

Since n5n\geq 5, we can find a line ll contained in VnV\subset\mathbb{P}^{n} solvably. Let qq and rr be distinct points of ll such that q+λrVq+\lambda r\in V for all λ𝕍\lambda\in\mathbb{V}. To find a plane contained in VV, we look for a point

p=n2nlp=\mathbb{P}^{n-2}\subset\mathbb{P}^{n}\setminus l

such that f(p+μq+λr)=0f(p+\mu q+\lambda r)=0 for all μ,λ𝕍\mu,\lambda\in\mathbb{V}. Expanding this as a polynomial in μ\mu and λ\lambda, the coefficients of 11, λ\lambda, μ\mu, λμ\lambda\mu, λ2\lambda^{2}, and μ2\mu^{2}, must vanish identically.

This gives a system of equations in n2n-2 variables with 1 equation of degree 3, 2 equations of degree 2, and 3 equations of degree 1. To solve it, we look for a linear solution to the subsystem of equations of degree <3<3, then intersect this line with the remaining degree 3 equation. Using Sylvester’s formula of obliteration, we have

[2,3]1+[1,5]2+[6]=2+7=9,[2,3]\leq 1+[1,5]\leq 2+[6]=2+7=9,

so we can find the needed linear solution when n29n-2\geq 9. Thus we can find a line on a cubic hypersurface in n\mathbb{P}^{n} when n11n\geq 11. ∎

The ambient dimension can be reduced slightly if some elevation of degree is allowed. From Segre [1945, pg 295, Sec 12], it is possible to determine a line on the intersection of two given quadrics by solving equations of degree no higher than 5, provided the ambient dimension is at least 4. Thus to find a linear solution of a system with 2 equations of degree 2, and 3 equations of degree 1, an ambient dimension of 7 is sufficient. Comparing to the last paragraph in the proof above we have

Proposition 5.

Given a cubic hypersurface V=V(f)V=V(f) in n\mathbb{P}^{n}, we can find a 2-plane contained in VV by solving equations of degree no higher than 5, provided n9n\geq 9. Hence

RD(3,923,9)RD(5)=1.\mathrm{RD}(\mathcal{M}_{3,9}^{2}\to\mathcal{M}_{3,9})\leq\mathrm{RD}(5)=1.

6 Removing 5 Terms from a Polynomial

Given a degree nn polynomial

xn+a1xn1++anx+anx^{n}+a_{1}x^{n-1}+\ldots+a_{n}x+a_{n}

we wish to find a Tschirnhaus transformation

T(x)=bn1xn1+bn2xn2++b1x+b0T(x)=b_{n-1}x^{n-1}+b_{n-2}x^{n-2}+\ldots+b_{1}x+b_{0}

such that after setting y=T(x)y=T(x) the transformed polynomial

yn+A1yn1++An1y+Any^{n}+A_{1}y^{n-1}+\ldots+A_{n-1}y+A_{n}

satisfies A1=A2=A3=A4=A5=0A_{1}=A_{2}=A_{3}=A_{4}=A_{5}=0. To determine the coefficients b0,,bn1b_{0},\ldots,b_{n-1} of TT then requires the solution of a system of equations with degrees 1, 2, 3, 4, 5, in n1\mathbb{P}^{n-1}. In general, to find a solution to such a system requires solving a polynomial of degree 5!=1205!=120, but when nn is large enough this elevation of degree can be partially avoided.

We first informally sketch the geometry underlying Wolfson’s bound of n=41n=41. The general idea is that by finding linear subspaces of the hypersurfaces corresponding to the polynomials of our system, the total degree of the system can be reduced. In this case, to find the necessary Tschirnhaus transformation one first finds a 66-plane inside a quadric surface in 13\mathbb{P}^{13} (this only requires solving degree 2 equations, so is resolvent degree 1). Then, intersecting the degree 3 equation with the 6-plane yields a cubic hypersurface in 6\mathbb{P}^{6}. If we can then find a 2-plane inside this cubic hypersurface, then by intersecting the degree 4 and 5 equations with this plane, we are left with a system of total degree 20 to solve. In summary, we have the chain

V4V52V36V213=V114.V_{4}\cap V_{5}\subset\mathbb{P}^{2}\subset V_{3}\subset\mathbb{P}^{6}\subset V_{2}\subset\mathbb{P}^{13}=V_{1}\subset\mathbb{P}^{14}.

This gives an algorithm for finding the desired Tschirnhaus transformation provided one is able to find the necessary 2-plane inside the cubic hypersurface in 6\mathbb{P}^{6}. Wolfson uses a dimension count to show that

RD(3,623,6)dim(3,6)=35,RD(\mathcal{M}_{3,6}^{2}\to\mathcal{M}_{3,6})\leq\dim(\mathcal{M}_{3,6})=35,

and so is able to use this Tschirnhaus transformation to show RD(n)n6\mathrm{RD}(n)\leq n-6 whenever n635n-6\geq 35.

By increasing the ambient dimension in which the the cubic hypersurface lives, we can use Sylvester’s ideas to find a plane inside a cubic hypersurface in 9\mathbb{P}^{9} by solving only equations of degree 5 or less. (i.e., in a resolvent degree 1 way). This allows the n=41n=41 bound for removing 5 terms from a degree nn polynomial to be reduced to n=21n=21, with simpler irrationalities involved in finding the necesary Tschirnhaus transformation.

Theorem (Main Theorem).

Let n21n\geq 21. Given a degree nn polynomial

xn+a1xn1++anx+anx^{n}+a_{1}x^{n-1}+\ldots+a_{n}x+a_{n}

we can find a Tschirnhaus transformation

T(x)=bn1xn1+bn2xn2++b1x+b0T(x)=b_{n-1}x^{n-1}+b_{n-2}x^{n-2}+\ldots+b_{1}x+b_{0}

such that after setting y=T(x)y=T(x) the transformed polynomial

yn+A1yn1++An1y+Any^{n}+A_{1}y^{n-1}+\ldots+A_{n-1}y+A_{n}

satisfies A1=A2=A3=A4=A5=0A_{1}=A_{2}=A_{3}=A_{4}=A_{5}=0. The coefficients b0,bn1b_{0},\ldots b_{n-1} of TT can be determined by solving equations of degree at most 2020.

Proof.

The equations A1=0A_{1}=0, A2=0A_{2}=0, A3=0A_{3}=0, A4=0A_{4}=0, A5=0A_{5}=0 impose polynomial conditions of degree 1, 2, 3, 4, and 5, respectively, on the point (b0,,bn1)n1(b_{0},\ldots,b_{n-1})\in\mathbb{P}^{n-1} to be determined. Using the degree 1 equation we can eliminate one variable. The degree 2 equation then determines a quadric hypersurface in n2\mathbb{P}^{n-2}. By the classical theory of linear subspaces of quadrics, we can find a 9\mathbb{P}^{9} contained in this hypersurface provided n219n-2\geq 19.

Next, we consider the cubic hypersurface determined by this 9\mathbb{P}^{9} and the degree 3 equation. By Proposition 5 of the previous section, we can find a 2\mathbb{P}^{2} contained in this hypersurface by solving equations of degree at worst 5. Finally, intersecting the remaining equations of degree 4 and 5 determine two curves in this 2\mathbb{P}^{2}, whose points of intersection are governed by an equation of degree at most 20. Each such point then satisfies all 5 polynomial conditions, by construction, and so yields a Tschirnhaus transformation of the desired form. ∎

In the same way that the reduction of the quintic to one-parameter form can be translated into the language of resolvent degree to yield RD(5)=1\mathrm{RD}(5)=1, this theorem implies RD(n)nk\mathrm{RD}(n)\leq n-k for n21n\geq 21.

References

  • [1] Richard Brauer. On the resolvent problem. Annali di Matematica Pura ed Applicata, 102(1):45–55, 1975.
  • [2] Joe Buhler and Zinovy Reichstein. On tschirnhaus transformations. In Topics in Number theory, pages 127–142. Springer, 1999.
  • [3] Grigorii Nikolaevich Chebotarev. On the problem of resolvents. Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, 114(2):189–193, 1954.
  • [4] Alexander Chen, Yang-Hui He, and John McKay. Erland Samuel Bring’s “transformation of algebraic equations”. arXiv preprint arXiv:1711.09253, 2017.
  • [5] Olivier Debarre and Laurent Manivel. Sur la vari et e des espaces lin eaires contenus dans une intersection compl ete. Math. Ann, 312:549–574, 1998.
  • [6] Jacques Dixmier. Histoire du 13e probleme de hilbert. Cahiers du séminaire d’histoire des mathématiques, 3:85–94, 1993.
  • [7] Benson Farb and Jesse Wolfson. Resolvent degree, hilbert’s 13th problem and geometry. L’Enseignement Mathématique, 65(3):303–376, 2020.
  • [8] Sir William Rowan Hamilton. Inquiry Into the Validity of a Method Recently Proposed by George B. Jerrard, Esq. for Transforming and Resolving Equations of Elevated Degrees Undertaken at the Request of the Association. Richard and John E. Taylor, 1836.
  • [9] David Hilbert. Über die gleichung neunten grades. Mathematische Annalen, 97(1):243–250, 1927.
  • [10] George Birch Jerrard. Mathematical Researches. Longman, Bristol and London, 1834.
  • [11] George Birch Jerrard. Xxiv. on certain transformations connected with the finite solution of equations of the fifth degree: To the editors of the philosophical magazine and journal. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, 7(39):202–203, 1835.
  • [12] Joseph Lagrange. Réflexions sur la résolution algébrique des équations. Nouveaux Mémoires de l’Académie Royale des Sciences et Belles-Lettres de Berlin, 1771.
  • [13] Beniamino Segre. The algebraic equations of degrees 5, 9, 157,…, and the arithmetic upon an algebraic variety. Annals of Mathematics, pages 287–301, 1945.
  • [14] Alexander J. Sutherland. GN Chebotarev’s “on the problem of resolvents”. arXiv preprint arXiv:2107.01006, 2021.
  • [15] Alexander J Sutherland. Upper bounds on resolvent degree and its growth rate. arXiv preprint arXiv:2107.08139, 2021.
  • [16] James Joseph Sylvester. On the so-called tschirnhausen transformation. 1887.
  • [17] Ehrenfried Walther von Tschirnhaus. Methodus auferendi omnes terminos intermedios ex data aeqvatione (method of eliminating all intermediate terms from a given equation). Acta Eruditorum (1683), pages 204–207.
  • [18] Anders Wiman. Über die Anwendung der Tschirnhausentransformation auf die Reduktion algebraischer Gleichungen. Almquist & Wiksells Boktr., 1927.
  • [19] Jesse Wolfson. Tschirnhaus transformations after hilbert. L’Enseignement Mathématique, 66(3):489–540, 2021.