Removal of 5 Terms from a Degree 21 Polynomial

Curtis Heberle

Abstract

In 1683 Tschirnhaus claimed to have developed an algebraic method to determine the roots of any degree $n$ polynomial. His argument was flawed, but it spurred a great deal of work by mathematicians including Bring, Jerrard, Hamilton, Sylvester, and Hilbert. Many of the problems they considered can be framed in terms of the geometric notion of resolvent degree, introduced by Farb and Wolfson. Roughly speaking, we have $RD(n)\leq n-k$ if a general degree $n$ polynomial can be put into an $(n-k)$ -parameter form. In the present note we discuss a method introduced by Sylvester for solving systems of polynomial equations, and apply it to finding bounds on resolvent degree. In particular, we prove the bound $RD(21)\leq 15$ . This bound has been independently established by Sutherland using the classical theory of polarity.

1 Introduction

Consider a degree $n$ polynomial $p(x)$ with roots $x_{1},\ldots,x_{n}$ :

p(x)=x^{n}+a_{1}x^{n-1}+\ldots+a_{n}x+a_{n}=\prod_{i=1}^{n}(x-x_{i}).

A Tschirnhaus transformation is a polynomial transformation of the roots of $p$ . That is, given a polynomial

T(x)=b_{n-1}x^{n-1}+b_{n-2}x^{n-2}+\ldots+b_{1}x+b_{0},

and setting

y=T(x)

we can form a new polynomial with roots $T(x_{1}),\ldots,T(x_{n})$ :

q(y)=y^{n}+A_{1}y^{n-1}+\ldots+A_{n-1}y+A_{n}=\prod_{i=1}^{n}(y-T(x_{i})).

Tschirnhaus’s original idea was to transform $p$ to a solvable form – specifically, to choose $T$ such that $q(y)=y^{n}+A_{n}$ – as part of a proposed method for determining in radicals the roots of $p$ . This method is not successful because in general it is not possible to determine in radicals the coefficients of the necessary $T$ . On the other hand, a number of interesting results have since been proved involving the use of Tschirnhaus transformations to reduce families of polynomials to certain canonical forms; in particular, the problem of finding $T$ such that $q$ has the $(n-k)$ -parameter form

q(y)=y^{n}+A_{k+1}y^{n-k-1}+\ldots+A_{n-1}y+A_{n}

with $A_{1}=A_{2}=\ldots=A_{k}=0$ has been widely considered. We provide a partial review of the historical development of this theory in Section 2.

Recently progress has been made by casting the problem in terms of the geometric notion of resolvent degree, as introduced by Farb and Wolfson.[7] We review the precise definitions in Section 4, but roughly speaking the idea is as follows. We say a generically finite dominant map $X\to Y$ has essential dimension at most $d$ if there is a pullback square

with $\dim(Z)\leq d$ . We then say $X\to Y$ has resolvent degree at most $d$ if it factors as a composition of maps each of essential dimension at most $d$ . (This is somewhat stronger than what is required in the precise definition; for instance, it is sufficient for $X\to Y$ to be birationally equivalent to such a composition.)

In particular, we are interested in the resolvent degree of the root cover of the family of degree $n$ polynomials. That is, let $\mathcal{P}_{n}$ be the space of monic degree $n$ polynomials, and $\widetilde{\mathcal{P}_{n}}$ the space of pairs $(p,\lambda)$ of polynomials $p\in\mathcal{P}_{n}$ with a choice of root $\lambda$ . Then there is a generically finite dominant map $\widetilde{\mathcal{P}_{n}}\to\mathcal{P}_{n}$ by “forgetting the root”, and we are interested in bounds on

\mathrm{RD}(n):=\mathrm{RD}(\widetilde{\mathcal{P}_{n}}\to\mathcal{P}_{n}).

Intuitively, we have $\mathrm{RD}(n)\leq d$ if the roots of any $p\in\mathcal{P}_{n}$ can be determined by solving algebraic functions of at most $d$ variables. For example, for $n=2$ , the quadratic formula implies that the root cover $\widetilde{\mathcal{P}_{2}}\to\mathcal{P}_{2}$ is a pullback of the map

\displaystyle\mathbb{P}^{1}\to\mathbb{P}^{1}z\mapsto z^{2}.

Similarly, the existence of “formulas in radicals” for $n=3$ and $n=4$ implies that $\widetilde{\mathcal{P}_{3}}\to\mathcal{P}_{3}$ and $\widetilde{\mathcal{P}_{4}}\to\mathcal{P}_{4}$ factor into towers of pullbacks of cyclic self-covers of $\mathbb{P}^{1}$ , so that $\mathrm{RD}(3)=\mathrm{RD}(4)=1$ .

Many classical results concerning Tschirnhaus transformations can be translated into statements about resolvent degree: if any $p\in\mathcal{P}_{n}$ can be reduced to an $(n-k)$ -parameter form by means of a Tschirnhaus transformation $T$ , we have

\mathrm{RD}(n)\leq n-k,

provided that $T$ can itself always be determined by solving algebraic functions of at most $n-k$ variables. For example, a result of Bring shows that any degree $5$ polynomial can be put into the one-parameter form $y^{5}+A_{5}y+1$ .[4] This is sufficient to imply $\mathrm{RD}(5)=1$ ; in Section 4 we treat this example in detail. The bounds so-obtained are generally not sharp, as the improvements obtained by Wolfson and Sutherland have demonstrated.[19, 15]

On the other hand, this formulation of the problem implies different constraints on which Tschirnhaus transformations $T$ are permissible than were generally assumed by classical authors. For example, in considering the problem of transforming a general degree $n$ polynomial $p(x)$ into an $(n-k)$ -parameter form by means of a Tschirnhaus transformation $T$ , Sylvester only considers those transformations whose coefficients can be determined without solving any equation of degree higher than $k$ . This is much more restrictive than necessary if one’s goal is to show

\mathrm{RD}(n)\leq n-k.

For example, for $k=6$ , Sylvester obtains the following:

Proposition 1 (Sylvester).

For $n\geq 44$ , a general polynomial of degree $n$ can be put into an $(n-6)$ -parameter form by means of a Tschirnhaus transformation whose coefficients can be determined without solving any equation of degree higher than 5.

As a corollary, one obtains the resolvent degree bound $\mathrm{RD}(n)\leq n-6$ for $n\geq 44$ ; on the other hand, finding the necessary Tschirnhaus transformation is a resolvent degree $\mathrm{RD}(5)=1$ problem, since only equations of degree at most 5 are involved. Sylvester’s was nonetheless the best known bound for $k=6$ prior to Wolfson’s 2020 proof that $\mathrm{RD}(n)\leq n-6$ for $n\geq 41$ . [19] In Wolfson’s argument, finding the necessary Tschirnhaus transformation is shown to be at worst a resolvent degree 35 problem. It has been conjectured by Wiman and (separately) Chebotarev that $\mathrm{RD}(n)\leq 15$ for $n\geq 21$ , though their proposed arguments contain gaps. [18, 3] Recently, Sutherland has provided a rigorous proof of this bound using the classical theory of polarity.[15]

In the current note we use ideas from Sylvester to provide an alternative proof of the $\mathrm{RD}(n)\leq n-6$ for $n\geq 21$ bound. In particular, we show

Theorem (Main Theorem).

For $n\geq 21$ , a general polynomial of degree $n$ can be put into an $(n-6)$ -parameter form by means of a Tschirnhaus transformation whose coefficients can be determined without solving an equation of degree higher than 20.

This implies $\mathrm{RD}(n)\leq n-6$ for $n\leq 21$ .

We now give an outline of the remaining sections of the paper. In Section 2 we give a brief historical overview of the classical theory of Tschirnhaus transformations and some of the important results therein. In Section 3 we turn to Sylvester’s work in particular and give a description of his “method of obliteration”: a technique for finding solutions of systems of polynomial equations when the number of variables is large relative to the number of equations, which lies at the heart of his work on Tschirnhaus transformations. In Section 4 we discuss resolvent degree and consider the $n=5$ case in detail as an illustration of the relation between resolvent degree and the classical theory of Tschirnhaus transformations. Section 5 describes a key ingredient of our main result: the use of the “method of obliteration” to determine linear subspaces of hypersurfaces. Finally, in Section 6 we give the proof of our main theorem.

2 Some Background on Tschirnhaus Transformations

2.1 Preliminaries

Let $p(x)$ be a polynomial over a (not necessarily closed) field $K$ . Then we have

p(x)=x^{n}+a_{1}x^{n-1}+\ldots+a_{n-1}x+a_{n}=\prod_{i=1}^{n}(x-x_{i})

for some coefficients $a_{1},\ldots,a_{n}\in k$ and roots $x_{1},\ldots,x_{n}\in\overline{K}$ . Fixing some algebraic extension $L$ of $K$ , a Tschirnhaus transformation is a polynomial $T\in L[x]$ ,

T(x)=b_{0}+b_{1}x+\ldots+b_{n-1}x^{n-1}

which we apply to the roots of $p$ to obtain a new polynomial

q(y)=\prod_{i=1}^{n}(y-T(x_{i}))=y^{n}+A_{1}y^{n-1}+\ldots+A_{1}y+A_{0}.

Note that the coefficients $A_{0},\ldots,A_{n}$ are symmetric functions of the transformed roots $T(x_{1}),\ldots,T(x_{n})$ , and hence are symmetric in $x_{1},\ldots,x_{n}$ . The coefficients $a_{0},\ldots,a_{n}$ generate the algebra of symmetric polynomials in $x_{1},\ldots,x_{n}$ , so we can determine the $A_{i}$ polynomially in terms of $a_{0},\ldots,a_{n}$ and $b_{0},\ldots,b_{n-1}$ . In particular, this implies $q\in L[y]$ . The precise constraints on the field $L$ from which the coefficients of $T$ may be taken vary from author to author but generally one considers towers of extensions of bounded degree. Tschirnhaus’s original goal was to find a formula in radicals for the roots of $p$ . More precisely, taking $L$ to be the solvable closure of $K$ , his strategy was to determine a Tschirnhaus transformation $T\in L[x]$ which transforms $p$ into the solvable form $q(y)=y^{n}+A_{n}$ . If $y_{i}$ is a root of $q$ , we can find the corresponding root(s) of $p$ (i.e., those $x_{j}$ such that $T(x_{j})=y_{i}$ ) as the roots of

\mathrm{GCD}(p(x),T(x)-y_{i}).

If there are $d$ roots of $p$ which map to $y_{i}$ , this will be a degree $d$ polynomial. In particular, if $q$ has distinct roots, then the roots of $p$ can be recovered rationally from the roots of $q$ . Thus if $q$ is solvable and $T$ has coefficients in a solvable extension of $K$ , then $p$ will in general be solvable. (And at worst recovering the roots of $p$ will require the solution of an equation of degree $n-1$ .) This strategy fails because it is not generally possible to determine $T$ solvably such that all the intermediate terms $A_{1},A_{2},\ldots,A_{n-1}$ of $q$ vanish. On the other hand, there are weaker versions of Tschirnhaus’s problem which are tractable. More precisely, we can ask: what conditions on $n$ and $L$ are sufficient for there to exist $T$ in $L[x]$ transforming the degree $n$ polynomial $p\in K[x]$ to a polynomial $q\in L[y]$ with $A_{1}=\ldots=A_{k}=0$ ? That is, we consider the problem of setting some (rather than all) of the intermediate coefficients to zero (thus reducing the number of parameters of the family of polynomials), and we relax the requirement that $L$ necessarily be a solvable extension.

We now consider in more detail what is necessary to determine $T$ such that $A_{1}=\ldots=A_{k}=0$ . As we remarked above, $A_{i}$ is a polynomial function of $a_{1},\ldots,a_{n}$ and $b_{0},\ldots,b_{n-1}$ . Treating the $b_{i}$ ’s as unknowns to be determined, we further observe that $A_{i}$ is homogeneous of degree $i$ in the variables $b_{0},\ldots,b_{n-1}$ . For example,

	$\displaystyle A_{1}$	$\displaystyle=-\sum_{i=1}^{n}T(x_{i})$
		$\displaystyle=-\sum_{i=1}^{n}(b_{n-1}x_{i}^{n-1}+b_{n-2}x_{i}^{n-2}+\ldots+b_{1}x_{i}+b_{0})$
		$\displaystyle=\left(-\sum_{i=1}^{n}x_{i}^{n-1}\right)b_{n-1}+\left(-\sum_{i=1}^{n}x_{i}^{n-2}\right)b_{n-2}+\ldots+\left(-\sum_{i=1}^{n}x_{i}\right)b_{1}-nb_{0}$

we can ensure $A_{1}=0$ as long as the coefficients $b_{0},\ldots,b_{n-1}$ of $T$ are chosen to satisfy a single homogeneous linear equation. (The coefficients of this equation are symmetric in the $x_{i}$ , so again can be expressed in terms of the coefficients $a_{1},\ldots,a_{n}$ of $p$ ; it is not necessary to know the roots $x_{i}$ .) More generally, $A_{i}$ vanishes if and only if the $i$ th symmetric function of $T(x_{1}),\ldots,T(x_{n})$ vanishes, so to find a Tschirnhaus transformation $T$ such that $A_{i}=0$ in the transformed polynomial we need to solve a degree $i$ polynomial in $b_{0},\ldots,b_{n-1}$ whose coefficients are polynomials in $a_{1},\ldots,a_{n}$ . Thus, to determine $T$ we must find a point in $\mathbb{P}^{n-1}_{\mathbf{b}}$ on the intersection of the hypersurfaces $V(A_{1}),V(A_{2}),\ldots,V(A_{k})$ , which are of degree $1,2,\ldots,k$ , respectively.

2.2 Some Results on Tschirnhaus Transformations from 1683 to Present

Tschirnhaus, in his 1683 paper introducing these transformations, claimed to be able to solve any degree $n$ polynomial by removing all intermediate terms.[17] That is, he claimed one could find a transformation $T$ such that

A_{1}=A_{2}=\ldots=A_{n-1}=0

and the transformed polynomial has the solvable form

y^{n}+A_{n}=0.

The problem with this strategy is that finding the necessary $T$ requires the solution of a system of polynomial equations of degrees $1,2,\ldots,n-1$ . In general this leads to an equation of degree $(n-1)!$ , so for $n>3$ finding $T$ is a priori more complicated than finding the roots of the original degree $n$ polynomial. Tschirnhaus did show explicitly how to find $T$ in the $n=3$ case, and his proof readily generalizes to removing the first two intermediate terms from any degree $n$ polynomial.

Finding a Tschirnhaus transformation $T$ yielding $A_{1}=A_{2}=\ldots=A_{k}=0$ requires solving a system of $k$ equations in $n$ variables, of degree $1,2,3,\ldots,k$ . In the worst case this leads to an equation of degree $k!$ , but when the system is sufficiently undetermined (i.e., $n$ much larger than $k$ ), a solution can be found without solving any polynomial of degree greater than $k$ . The first result of this kind was given in 1786 by Bring, who showed that when $n\geq 5$ , we can find $T$ such that $A_{1}=A_{2}=A_{3}=0$ without solving any equation of degree higher than three.[4]

In 1834, Jerrard recovered independently Bring’s result that removal of three terms from a degree 5 polynomial was possible by means of a Tschirnhaus transformation.[10] In fact, Jerrard went on to claim that his methods could yield a reduction of a general quintic to a solvable form.[11] (Jerrard was aware of, but did not accept, Abel’s 1824 work on the insolvability of the quintic.)

Shortly thereafter Hamilton was commissioned by the British Association for the Advancement of Science to investigate the validity of Jerrard’s methods, issuing his report in 1837.[8] He showed that Jerrard’s reductions were in many cases “illusory”. Jerrard allowed Tschirnhaus transformations of degree potentially as high or higher than the original polynomial $p$ , and Hamilton demonstrated that in the “illusory” cases the transformation $T$ found by Jerrard was a multiple of $p$ . In this case the transformed polynomial is always $q(y)=y^{n}$ (the same as if the “transformation” $y=T(x)=0$ were permitted) for any degree $n$ polynomial $p$ , so there is no hope of determining the roots of $p$ by studying $q$ . On the other hand, Hamilton showed that Jerrard’s methods can be a made to work –and the “illusory” cases avoided – in higher degrees. More precisely, for any fixed $k$ , one can find a (nonzero, degree $\leq n-1$ ) Tschirnhaus transformation $T$ yielding $A_{1}=A_{2}\ldots=A_{k}=0$ without solving any equation of degree higher than $k$ , provided the degree $n$ of the original polynomial $p$ is large enough relative to $k$ . In particular, for $k=4$ , Hamilton showed $n\geq 11$ suffices, and for $k=5$ , $n\geq 47$ .

In 1886, Sylvester published a geometric explanation of the Hamilton/Jerrard method, sharpened some of the bounds (finding, in particular, $n\geq 10$ suffices for $k=4$ , and $n\geq 44$ for $k=5$ ).[16] Sylvester claimed that these sharpened bounds are optimal if no “elevation of degree” is allowed – that is, if no equations of degree higher than $k$ are to be solved. This claim requires careful interpretation – Sylvester does not consider the possibility of higher degree polynomials arising but factoring into lower degree terms. This is known to happen for $k=3$ : if one attempts to remove three terms from a degree $n$ polynomial, the system of equations of degree 1, 2, and 3 leads in general to an equation for degree 6. Sylvester’s method is able to avoid this elevation of degree only when $n\geq 5$ . For $n=4$ , Sylvester’s method is inapplicable, but Lagrange had shown (in 1771, more than 100 years prior) that the degree 6 equation that arises in this case always factors into a pair of cubics, so that the necessary Tschirnhaus transformation can in fact be determined without solving any equation of degree higher than 3.[12]

Further reductions have been achieved, but all loosen or ignore the “elevation of degree” restriction in one way or another. Viewing the degree 9 polynomial

p(x)=a_{0}x^{9}+a_{1}x^{8}+\ldots+a_{8}x+a_{9}

as a multi-valued “algebraic function” sending $(a_{0},\ldots,a_{9})$ to the set of roots of $p$ , Hilbert asked whether this could be written as a composition of algebraic functions of at most 4 variables.[9] As part of his investigation, Hilbert sketched a method for finding a Tschirnhaus transformation such that $A_{1}=A_{2}=A_{3}=A_{4}=0$ when $n\geq 9$ . Hilbert’s method requires finding a line on a cubic surface. This in turn requires the solution of an equation of degree 27. However, by putting the equation of the cubic surface into a four-parameter canonical form (the “pentahedral form”, originally suggested by Sylvester), Hilbert was able to show that the coefficients of the necessary line were themselves algebraic functions of four variables, so this was sufficient for his purposes.

In 1945, B. Segre strengthened Hilbert’s result, describing an algorithm for finding a Tschirnhaus transformation removing 4 terms from a degree $n\geq 9$ polynomial, without solving any equation of degree higher than 5.[13] The same result was later proven by Dixmier using different methods.[6] Also building on Hilbert’s work were Wiman, who sketched an alternative proof of the $n=9$ bound for $k=4$ , and G. Chebotarev, who extended Wiman’s ideas to sketch a proof that $n=21$ for $k=5$ .[3, 18, 14] In 1975, Brauer showed that, when $n>k!$ , a degree $n$ polynomial can be written as a composition of algebraic functions of $(n-k-1)$ variables; in this framing of the problem it is permissible to simply solve the degree $k!$ to find a Tschirnhaus transformation removing $k$ terms, since this will be an algebraic function of fewer variables than the original degree $n$ polynomial.[1] For $k\geq 7,$ this $n>k!$ bound is lower than the corresponding bounds for removal of $k$ terms found by Sylvester, though it should again be emphasized that this ignores the “no elevation of degree“ constraint which is explicit in Sylvester, and implicit in pre-Sylvester work on the problem.

In 2020, Wolfson described a function $F(k)$ such that a degree $n$ polynomial can be put in an $(n-k-1)$ -parameter form whenever $n\geq F(k)$ , and which improved significantly on Brauer’s bounds.[19] Wolfson frames the problem in terms of the theory resolvent degree, which allows bounds on the number of necessary parameters for a family of algebraic functions to be computed based on the dimensions of certain spaces. For $k=5$ , Wolfson shows $n\geq 41$ , which also improves on Sylvester’s bound of $n\geq 44$ . To find the Tschirnhaus transformation accomplishing the reduction in this case requires, among other things, finding a 2-plane inside of a cubic hypersurface in $\mathbb{P}^{6}$ . Informally, since the dimension of the moduli space of cubic hypersurfaces in $\mathbb{P}^{6}$ is 35, finding a 2-plane inside such a hypersurface is at worst a 35-parameter problem, so is permissible when putting a degree 41 polynomial into 35-parameter form. This approach does not produce the degrees of the equations which must be solved to find the Tschirnhaus transformation (which may be higher than $n$ , as in Hilbert’s proof that $n=9$ suffices for $k=4$ , where it was necessary to solve a degree 27 equation to find a line on a cubic surface).

Most recently, Sutherland used a combination of ideas from the classical theory of polarity together with optimizations to the moduli-dimension-counting method to improve on the $F(k)$ bound for $k=5,\ldots,15$ and all $k\geq 18$ .[15] In particular, for $k=5$ , Sutherland shows that $n=21$ suffices, giving the first rigorous proof of the bound first claimed by Chebotarev.

In the current note, we show that the $n=21$ result can be recovered from Sylvester’s method if partial elevation of degree is allowed. More precisely, one can find a Tschirnhaus transformation removing 5 terms (i.e., such that the coefficients of the transformed polynomial satisfy $A_{1}=A_{2}=A_{3}=A_{4}=A_{5}=0$ ) from a degree $n$ polynomial whenever $n\geq 21$ , by solving no equation of degree higher than 20. A key ingredient is that there exists an explicit algorithm to find a 2-plane on a cubic hypersurface in $\mathbb{P}^{9}$ , without solving any equation of degree higher than 5.

3 Sylvester’s Obliteration Formula

Though motivated by the problem of finding Tschirnhaus transformations, the procedure Sylvester describes in [16] is remarkably general, allowing for a solution to be found to any sufficiently underdetermined system, without elevation of degree. More precisely, given a system of equations $S$ of polynomial equations in $N$ variables of degree at most $k$ , with $n_{i}$ the number of equations of degree $i$ , Sylvester shows there is a function $l(n_{1},\ldots,n_{k})$ such that, if $N>l(n_{1},\ldots,n_{k})$ , then a solution to $S$ can be found without solving any equation of degree greater than $k$ .

The method is as follows: Pick any $f$ of maximal degree $k$ from the system $S$ . We can find a solution to $S$ by first finding a line $L$ contained in the solution set of the subsystem $S^{\prime}=S\setminus\{f\}$ , then intersecting this line with the vanishing locus of $f$ . To find $L$ , first find a solution $Q=(q_{0},\ldots,q_{N})$ to $S^{\prime}$ . Then to get the required line it suffices to find a point $P=(p_{0},\ldots,p_{N})$ such that $P+\lambda Q$ is a solution to $S^{\prime}$ for all $\lambda\in\mathbb{C}$ . For any equation $g\in S^{\prime}$ , then, view $g(P+\lambda Q)$ as a polynomial in $\lambda$ ; if $\deg(g)=d$ , the coefficients of $1,\lambda,\ldots,\lambda^{d-1},\lambda^{d}$ must vanish identically. In fact the coefficient of $\lambda^{d}$ is just $g(Q)$ , so vanishes since we chose $Q$ to be a solution of $S^{\prime}$ . The vanishing of the remaining coefficients imposes polynomial conditions on the $p_{0},\ldots,p_{N}$ of degrees $1,2,\ldots,d$ . Ranging over all $g\in S^{\prime}$ , we see that $P$ must be a solution to a system $S^{\prime\prime}$ with $m_{i}$ equations of degree $i$ , where

	$\displaystyle m_{k}$	$\displaystyle=n_{k}-1$
	$\displaystyle m_{k-1}$	$\displaystyle=n_{k}+n_{k-1}$
	$\displaystyle m_{k-2}$	$\displaystyle=n_{k}+n_{k-1}+n_{k-2}$
		$\displaystyle\ \vdots$
	$\displaystyle m_{1}$	$\displaystyle=n_{k}+n_{k-1}+\ldots+n_{1}.$

Now to find this needed point solution, we proceed inductively: again hold aside some polynomial of highest degree (say $h$ ), and find a linear solution to the subsystem $S^{\prime\prime}\setminus\{h\}$ , then intersect that line with $h$ , and so on. The number of equations of maximal degree decreases by 1 with each step, so eventually we are left with a (possibly very large) system of linear equations; a line in the solution set to this system can be found provided the number of variables is (at least) one greater than the number of equations. Then to backtrack to a linear solution to the original system requires only for equations of degree $\leq k$ to be solved one at a time.

The core reduction is succinctly summarized by Sylvester’s formula of obliteration, which we will refer to repeatedly:

Proposition 2 (Sylvester).

Given $n_{1},\ldots,n_{k}\in\mathbb{N},$ let

[n_{k},n_{k-1},\ldots,n_{2},n_{1}]

denote the minimum number of variables such that a linear solution to any system of equations with exactly $n_{i}$ equations of degree $i$ can be found without elevation of degree. Then

[n_{k},n_{k-1},\ldots,n_{2},n_{1}]\leq 1+[m_{k},m_{k-1},\ldots,m_{2},m_{1}]

where

m_{i}=\begin{cases}\sum_{j=i}^{k}n_{j}&i\neq k\\ n_{k}-1&i=k.\end{cases}

Note that applying the obliteration formula $n_{k}$ times yields a system with no equations of degree $k$ , so all equations of the highest degree can be removed. This can be repeated until only (a large number of) linear equations remain, in which case the minimum number of variables needed is easy to determine. For example, consider the problem of finding a line on a cubic hypersurface. We have a system of equations with 1 equation of degree 3, 0 equations of degree 2, and 0 of degree 1. By repeated application of the obliteration formula,

[1,0,0]\leq 1+[1,1]\leq 2+[2]=2+3=5,

so it is possible to find a line solvably (in fact, by solving no equation of degree greater than 3) on a cubic hypersurface in $\mathbb{P}^{5}$ .

4 Resolvent Degree

4.1 Definitions

Let $k$ be an algebraically closed field and let $X\to Y$ be a generically finite dominant map of $k$ -varieties. Following Buhler and Reichstein [2], we define the essential dimension of this map, $\mathrm{ed}(X\to Y)$ , to be the minimal $d$ such that there exists a pullback square

with $\dim Z\leq d$ . The idea of resolvent degree is to extend this notion to allow towers of pullbacks; essentially, a map is of resolvent degree at most $d$ if it can be written as a composition of maps each of essential dimension at most $d$ . More precisely, following Farb and Wolfson [7], the resolvent degree, $\mathrm{RD}(X\to Y)$ , is defined to be the minimal $d$ such that there exists a tower

E_{r}\to\cdots\to E_{1}\to E_{0}=U,

where $U$ is an open subset of $Y$ , $E_{r}\to U$ factors through a dominant map $E_{r}\to X$ , and $\mathrm{ed}(E_{i}\to E_{i-1})\leq d$ for each $i$ . It follows from these definitions that

\mathrm{RD}(X\to Y)\leq\mathrm{ed}(X\to Y)\leq\dim Y.

It is convenient, in proving results on resolvent degree, to construct towers of maps of bounded resolvent degree (rather than essential dimension). Lemma 2.7 in Farb and Wolfson [7] shows that this is sufficient.

The connection to polynomials and their roots is as follows. Let $\mathcal{P}_{n}$ denote the space of monic degree $n$ polynomials with coefficients in $k$ and let

\widetilde{\mathcal{P}_{n}}=\{(p,r)\in\mathcal{P}_{n}\times\mathbb{A}^{1}_{k}\mid p(r)=0\},

the space of monic polynomials together with a choice of root. We then define

\mathrm{RD}(n):=\mathrm{RD}(\widetilde{\mathcal{P}_{n}}\to\mathcal{P}_{n})

where the map is “forgetting the root”. $\mathrm{RD}(n)$ captures the complexity of the root cover in the sense that if there is a formula in terms of algebraic functions of at most $d$ variables for finding the roots of a degree $n$ polynomial in terms of its coefficients, then $\mathrm{RD}(n)\leq d$ .

4.2 Resolvent Degree of a Dominant Map

It is necessary to extend the definition of resolvent degree to dominant maps of $k$ -varieties which are not necessarily generically finite. Although we are principally interested in $\mathrm{RD}(n)=\widetilde{\mathcal{P}_{n}}\to\mathcal{P}_{n}$ , which is generically finite, a number of maps which arise naturally in the study of Tschirnhaus transformations are not. For example, a key step in Bring’s proof that a general quintic can be reduced to a one-parameter form involves finding a line on a quadric surface in $\mathbb{P}^{3}$ . Thus we would like to study

\mathcal{H}_{2,3}^{1}\to\mathcal{H}_{2,3}

where $\mathcal{H}_{2,3}$ is the parameter space of quadric surfaces in $\mathbb{P}^{3}$ and $\mathcal{H}_{2,3}^{1}$ is the space of such quadrics together with a choice of line on the surface, and the map is “forgetting the line”. Since any quadric surface contains an infinite family of lines, this map is not generically finite. We would like to extend the notion of resolvent degree so that $\mathrm{RD}(\mathcal{H}_{2,3}^{1}\to\mathcal{H}_{2,3})$ captures the complexity of finding lines on quadric surfaces in the same way that $\mathrm{RD}(\widetilde{\mathcal{P}_{n}}\to\mathcal{P}_{n})$ captures the complexity of finding roots.

This is done in [19] in the following way: given a dominant map $\pi:X\to Y$ , define the resolvent degree $\mathrm{RD}(X\to Y)$ to be the minimum $d$ for which there exists a dense collection of subvarieties $\{U_{\alpha}\subset X\}$ with

\pi|_{U_{\alpha}}:U_{\alpha}\to Y

a generically finite dominant map and $\mathrm{RD}(U_{\alpha}\to Y)\leq d$ for all $\alpha.$ (Such a subvariety $U_{\alpha}$ is called a rational multi-section for $\pi$ .)

Requiring the collection of multisections to be dense in $X$ is necessary to ensure that resolvent degree behaves well with respect to composition of dominant maps: given $X\to Y$ and $Y\to Z$ dominant, for the composition $X\to Z$ we have $\mathrm{RD}(X\to Z)\leq\max\{\mathrm{RD}(X\to Y),\mathrm{RD}(Y\to Z)\}$ by [19, Lemma 4.10]. Having multi-sections $V\to Y$ for $X\to Y$ and $U\to Z$ for $Y\to Z$ is not generally enough to product a multisection for $X\to Z$ (the range of $V\to Y$ could entirely miss $U$ ). On the other hand, if $V\to Y$ is surjective, then we do obtain a multi-section of $X\to Z$ .

Returning to the example of lines on quadrics, an algorithm for determining a line (or finite set of lines) on each quadric surface in terms of algebraic functions of degree $d$ gives a multi-section $U\to\mathcal{H}_{2,3}$ with $\mathrm{RD}(U\to\mathcal{H}_{2,3})\leq\mathrm{RD}(d)$ , but this is somewhat less than what is required to show that $\mathrm{RD}(\mathcal{H}_{2,3}^{1}\to\mathcal{H}_{2,3})\leq\mathrm{RD}(d)$ . On the other hand, as we shall see in the next section, determining any one line is sufficient to yield a Tschirnhaus transformation reducing the quintic to a one-parameter form, so that $\mathrm{RD}(5)=1$ .

4.3 Resolvent Degree of the Quintic

As an illustrative example, we now give a detailed proof that (as observed in [7]) Bring’s work on Tschirnhaus transformations of quintic polynomials implies $\mathrm{RD}(5)=1$ . (Recall that by definition, $\mathrm{RD}(5)=\mathrm{RD}(\widetilde{\mathcal{P}_{5}}\to\mathcal{P}_{5})$ , where $\mathcal{P}_{5}$ is the parameter space of monic degree 5 polynomials and $\widetilde{\mathcal{P}_{5}}$ is the space of such polynomials together with a choice of root.)

Recall that given a polynomial $p(x)=x^{5}+a_{1}x^{4}+a_{2}x^{3}+a_{3}x^{2}+a_{4}x+a_{5}$ in $\mathcal{P}_{5}$ and a Tschirnhaus transformation of the form $T(x)=b_{0}x^{4}+b_{1}x^{3}+b_{2}x^{2}+b_{3}x+b_{4}$ , we can set $y=T(x)$ and eliminate $x$ to obtain a polynomial

q(y)=y^{5}+A_{1}y^{4}+A_{2}y^{3}+A_{3}y^{2}+A_{4}y+A_{5},

where the coefficient $A_{i}$ is a degree $i$ homogeneous polynomial in $b_{0},b_{1},b_{2},b_{3},b_{4}$ , whose coefficients are integral functions of $a_{0},a_{1},a_{2},a_{3},a_{4},a_{5}$ .

Our strategy will be to construct a tower of maps

X_{5}\to X_{4}\to X_{3}\to X_{2}\to X_{1}\to\mathcal{P}_{5}

such that the resolvent degree of each map is 1 and there is a dominant rational map $X_{5}\to\widetilde{\mathcal{P}_{5}}$ . Informally, in $X_{1}$ there will be associated to each quintic polynomial $p$ the equation of a hyperplane determining the set of Tschirnhaus transformations such that $A_{1}=0$ . In $X_{2}$ , we will further have the information of a quadric $Q$ contained in this hyperplane, corresponding to $A_{1}=A_{2}=0$ , and the equation of a line $l$ contained in $Q$ . In $X_{3}$ we will further have a choice of Tschirnhaus transformation $T$ which lies on $l$ while also satisfying $A_{3}=0$ . Using this information we can construct a map from $X_{3}$ to the one-dimensional space of quintic polynomials of the form $z^{5}+Az+1$ , and we will take $X_{4}$ to be the pullback of the root cover of this space, so that $X_{4}$ associates to $p$ a root of the polynomial obtained by applying $T$ to $p$ . Finally, in constructing $X_{5}$ , we recover the roots of $p$ itself from the roots of the transformed polynomial. Each step of this process is of bounded resolvent degree.

More precisely: viewing the $b_{i}$ as homogeneous coordinates on $\mathbb{P}^{4}$ , there is associated to each polynomial $p\in\mathcal{P}_{5}$ a hyperplane $H\subset\mathbb{P}^{4}_{\mathbf{b}}$ consisting of all Tschirnhaus transformations that send $p$ to a polynomial $q(y)$ with $A_{1}=0$ . Let $X_{1}$ denote the space of ordered pairs $(p,H)$ with $p$ and $H$ as just described. Then the map $X_{1}\to\mathcal{P}_{5}$ is birational, hence has resolvent degree 1.

Next, we consider the set of all Tschirnhaus transformations such that $A_{1}=0$ and $A_{2}=0$ in the transformed polynomial. The latter is a degree 2 homogeneous polynomial in $b_{0},b_{1},b_{2},b_{3},b_{4}$ , so to each polynomial $p$ there is associated a quadric surface $Q\subset H\cong\mathbb{P}^{3}$ whose points are Tschirnhaus transformations satisfying both conditions. This defines a map

X_{1}\to\mathcal{H}_{2,3}

where $\mathcal{H}_{2,3}$ is the parameter space of quadric surfaces in $\mathbb{P}^{3}$ . Letting $\mathcal{H}_{2,3}^{1}$ denote the space of such surfaces together with a choice of line, we consider the map

\mathcal{H}_{2,3}^{1}\to\mathcal{H}_{2,3}.

An algorithm exists for finding a line on a quadric surface that requires solving only a single quadratic equation, so there is a rational multi-section $U\to\mathcal{H}_{2,3}$ with

\mathrm{RD}(U\to\mathcal{H}_{2,3})\leq\mathrm{RD}(2)=1.

(In fact there is a dense collection of such multi-sections, so $\mathrm{RD}(\mathcal{H}_{2,3}^{1}\to\mathcal{H}_{2,3})=1$ , but this is stronger than we require.)

We define $X_{2}$ via the pullback square

so that $\mathrm{RD}(X_{2}\to X_{1})\leq\mathrm{RD}(U\to\mathcal{H}_{2,3})=1$ . Points of $X_{2}$ are ordered tuples $(p,H,Q,l)$ , where $p$ is a quintic polynomial, $H$ is the space of all Tschirnhaus transformations that send $p$ to a polynomial with $A_{1}=0$ , $Q\subset H$ is the quadric surface whose points are Tschirnhaus transformations such that $A_{1}=0$ and $A_{2}=0$ , and $l$ is a line contained in $Q$ .

Finally, to find a Tschirnhaus transformation $T$ satisfying $A_{1}=0$ , $A_{2}=0$ , and $A_{3}=0$ requires intersecting the cubic hypersurface defined by $A_{3}=0$ with the line $l$ . To find the three points of intersection requires only the solution of a cubic equation. Defining $X_{3}$ to be the space of tuples $(p,H,Q,l,T)$ , with $p,H,Q,l$ as above and $T$ a Tschirnhaus transformation yielding $A_{1}=A_{2}=A_{3}=0$ , the projection map

X_{3}\to X_{2}

has resolvent degree $RD(3)=1$ .

Now let $\mathcal{P}_{5}^{\prime}$ denote the space of quintic polynomials of the form $z^{5}+Az+1$ . Given a point $(p,H,Q,l,T)$ in $X_{3}$ , we can apply the transformation $T$ to $p$ to obtain a polynomial

q(y)=y^{5}+A_{4}y+A_{5}.

Applying a change of variables $y=\sqrt[5]{A_{5}}z$ and dividing by $A_{5}$ yields

z^{5}+\frac{A_{4}\sqrt[5]{A_{5}}}{A_{5}}z+1

so this procedure defines a map $X_{3}\to\mathcal{P}_{5}^{\prime}$ . Now, the root cover

\widetilde{\mathcal{P}_{5}^{\prime}}\to\mathcal{P}_{5}^{\prime}

has resolvent degree

\mathrm{RD}(\widetilde{\mathcal{P}_{5}^{\prime}}\to\mathcal{P}_{5}^{\prime})\leq\dim\left(\mathcal{P}_{5}^{\prime}\right)=1,

so defining $X_{4}$ via the pullback square

we have $\mathrm{RD}(X_{4}\to X_{3})=1$ . Points of $X_{4}$ are of the form $(p,H,Q,l,T,\lambda)$ , where $(p,H,Q,l,T)\in X_{3}$ and $\lambda$ is a root of the transformed polynomial $z^{5}+\frac{A_{4}\sqrt[5]{A_{5}}}{A_{5}}z+1$ . Let $X_{5}$ be the space whose points are tuples $(p,H,Q,l,T,\mu)$ , where $\mu$ is a root of $p$ . There is a map

X_{5}\to X_{4}

defined by

(p,H,Q,l,T,\mu)\mapsto\left(p,H,Q,l,T,\frac{T(\mu)}{\sqrt[5]{A_{5}}}\right).

On the other hand, given a root $\lambda$ of the transformed polynomial, we can find the corresponding roots of $p$ by solving the (at worst degree 4) polynomial equation

\mathrm{GCD}(p(x),\sqrt[5]{A_{5}}T(x)-\lambda)=0

so $\mathrm{RD}(X_{5}\to X_{4})=\mathrm{RD}(4)=1$ .

In all, we have constructed a tower of maps

X_{5}\to X_{4}\to X_{3}\to X_{2}\to X_{1}\to\mathcal{P}_{5}

each of which has resolvent degree 1. Since the root cover $\widetilde{\mathcal{P}_{5}}\to\mathcal{P}_{5}$ factors through the projection $X_{5}\to\widetilde{\mathcal{P}_{5}}$ , $(p,H,Q,l,T,\mu)\mapsto(p,\mu)$ , we have

\mathrm{RD}(5)=\mathrm{RD}(\widetilde{\mathcal{P}_{5}}\to\mathcal{P}_{5})=1.

5 Linear subspaces of hypersurfaces

To prove $\mathrm{RD}(5)=1$ we needed to show one can always find a Tschirnhaus transformation which eliminates the first three intermediate terms of a quintic polynomial. This required finding a solution of a system of three equations of degree 1, 2, and 3, respectively. A key geometric fact which made this tractable was that any quadric surface in $\mathbb{P}^{3}$ contains a line, and that an equation for such a line can be found by solving a quadratic equation; this allows the degree 2 equation (which determines the quadric surface) to be replaced by a degree 1 equation (which determines the line), so that to find a solution of the system requires only the solution of a cubic equation.

Similarly, in Hilbert’s proof that $\mathrm{RD}(9)=4$ , a Tschirnhaus transformation removing four intermediate terms is required, so a system of equations of degrees 1, 2, 3, and 4 must be solved. Informally, Hilbert’s idea is to first find a 3-plane inside the hypersurface determined by the equation of degree 2 (which is possible as long as the ambient dimension is high enough), then to find a line on the cubic surface determined inside this 3-plane by the equation of degree 3. If this can be done, all that remains is to intersect the line with the remaining equation of degree 4 and a solution can be found.

In general, one can consider the problem of finding a $k$ -plane inside a degree $d$ hypersurface in $\mathbb{P}^{N}$ . We can ask several questions:

Q1:

In terms of $d$ and $k$ , how large must the ambient dimension $N$ be to guarantee that any degree $d$ hypersurface contains a $k$ -plane?
Q2:

What is the “resolvent degree of finding the $k$ -plane”? That is, if $\mathcal{M}_{d,N}$ is a moduli space of degree $d$ hypersurfaces in $\mathbb{P}^{N}$ and $\mathcal{M}_{d,N}^{k}$ is the space of such hypersurfaces together with a choice of $k$ -plane, what is the resolvent degree of the map

$\mathcal{M}_{d,N}^{k}\to\mathcal{M}_{d,N}$

which forgets the choice of plane?
Q3:

When is there a constructive algorithm to determine the $k$ -plane? What are the degrees of the equations that must be solved? How large must $N$ be if no equation of degree higher than some given bound is to be permitted?

As the $\mathrm{RD}(5)=1$ and $\mathrm{RD}(9)=4$ examples above illustrate, the answers to these questions have implications for the problem of finding Tschirnhaus transformations, since replacing a hypersurface with a linear subspace of that hypersurface allows us to replace a degree $d$ equation with one or more equations of degree 1, reducing the total degree of the system to be solved.

An answer to Q1 is given in Debarre and Manivel [5]: for $d>3$ , any degree $d$ hypersurface in $\mathbb{P}^{N}$ contains a $k$ -plane if

N\geq\frac{{d+k\choose k}}{k+1}+k.

In this case, the map $\mathcal{M}_{d,N}^{k}\to\mathcal{M}_{d,N}$ is surjective, and an upper bound on its resolvent degree is given by the dimension of the codomain:

\mathrm{RD}\left(\mathcal{M}_{d,N}^{k}\to\mathcal{M}_{d,N}\right)\leq\dim\left(\mathcal{M}_{d,N}\right).

For example,

\mathrm{RD}\left(\mathcal{M}_{3,3}^{1}\to\mathcal{M}_{3,3}\right)\leq\dim\left(\mathcal{M}_{3,3}\right)=4,

corresponding to Hilbert’s observation that finding a line on a cubic surface requires the solution of at most an algebraic function of four variables.

These dimension-counting arguments do not provide enough information to address Q3. For this we return to Sylvester’s ideas. First, for the problem of finding a line on a degree $d$ hypersurface, repeated application of the obliteration formula suffices to compute an ambient dimension $N$ such that the desired line may be found without solving any equation of degree higher than $d$ . Sylvester’s methods can be readily adapted to the problem of finding a $k$ -plane on a degree $d$ hypersurface without solving any equation of degree higher than $d$ . Note that, when $N$ is large enough for this to be possible,

\mathrm{RD}\left(\mathcal{M}_{d,N}^{k}\to\mathcal{M}_{d,N}\right)\leq\mathrm{RD}(d)

giving a bound on resolvent degree which is often sharper than what one obtains from dimension-counting alone (though at the price of requiring a larger ambient dimension $N$ than that given by Waldron’s theorem), so Sylvester’s ideas are of interest even if one is only concerned with finding bounds on resolvent degree. In the remainder of this section we look at the $d=3$ case in detail.

5.1 Linear subspace of cubic hypersurfaces

Let $\mathcal{M}_{3,N}$ be the moduli space of smooth cubic hypersurfaces in $\mathbb{P}^{N}$ and let $\mathcal{M}_{3,N}^{r}$ be the moduli space of pairs $(C,P)$ where $C$ is a smooth cubic hypersurface and $P$ is an $r$ -plane contained in $C$ . We can then consider the “resolvent degree of finding an $r$ -plane”, i.e.,

\mathrm{RD}(\mathcal{M}_{3,N}^{r}\to\mathcal{M}_{3,N}),

where the map forgets the choice of plane. For example, for the problem of finding a line on a cubic surface, we have

\mathrm{RD}(\mathcal{M}_{3,3}^{1}\to\mathcal{M}_{3,3})\leq\dim(\mathcal{M}_{3,3})=4,

so that finding a line on a cubic surface in $\mathbb{P}^{3}$ requires the solution of algebraic equations of no more than 4 variables. Farb and Wolfson [7], using an argument due to Klein, show this bound can be improved to

\mathrm{RD}(\mathcal{M}_{3,3}^{1}\to\mathcal{M}_{3,3}.)\leq 3.

In higher dimensions it is easier to find a line. In particular, by Sylvester’s obliteration method, as discussed in section 3, finding a line on a cubic surface in $\mathbb{P}^{5}$ requires the solution of no equation of degree higher than 3, and so is a resolvent degree 1 problem.

Proposition 3 (Sylvester).

Given a cubic hypersurface $V$ in $\mathbb{P}^{n}$ , we can find a line contained in $V$ by solving equations of degree no higher than 3 provided $n\geq 5.$ Hence

\mathrm{RD}(\mathcal{M}_{3,5}^{1}\to\mathcal{M}_{3,5})\leq\mathrm{RD}(3)=1.

Proof.

We wish to find a linear solution to a system of equations consisting of 1 equation of degree $3$ and no equations of lower degree. By Sylvester’s obliteration formula, the ambient dimension required is

[1,0,0]\leq 1+[1,1]\leq 2+[2]=2+3=5.

∎

Sylvester’s method also extends to finding higher-dimensional linear subspaces of hypersurfaces, when the dimension of the ambient space is large enough. For example, for cubic hypersurfaces in $\mathbb{P}^{9}$ , finding a 2-plane can also be done solvably.

Proposition 4 (Sylvester).

Given a cubic hypersurface $V=V(f)$ in $\mathbb{P}^{n}$ , we can find a 2-plane contained in $V$ by solving equations of degree no higher than 3, provided $n\geq 11$ . Hence

\mathrm{RD}(\mathcal{M}_{3,11}^{2}\to\mathcal{M}_{3,11})\leq\mathrm{RD}(3)=1.

Proof.

Since $n\geq 5$ , we can find a line $l$ contained in $V\subset\mathbb{P}^{n}$ solvably. Let $q$ and $r$ be distinct points of $l$ such that $q+\lambda r\in V$ for all $\lambda\in\mathbb{V}$ . To find a plane contained in $V$ , we look for a point

p=\mathbb{P}^{n-2}\subset\mathbb{P}^{n}\setminus l

such that $f(p+\mu q+\lambda r)=0$ for all $\mu,\lambda\in\mathbb{V}$ . Expanding this as a polynomial in $\mu$ and $\lambda$ , the coefficients of $1$ , $\lambda$ , $\mu$ , $\lambda\mu$ , $\lambda^{2}$ , and $\mu^{2}$ , must vanish identically.

This gives a system of equations in $n-2$ variables with 1 equation of degree 3, 2 equations of degree 2, and 3 equations of degree 1. To solve it, we look for a linear solution to the subsystem of equations of degree $<3$ , then intersect this line with the remaining degree 3 equation. Using Sylvester’s formula of obliteration, we have

[2,3]\leq 1+[1,5]\leq 2+[6]=2+7=9,

so we can find the needed linear solution when $n-2\geq 9$ . Thus we can find a line on a cubic hypersurface in $\mathbb{P}^{n}$ when $n\geq 11$ . ∎

The ambient dimension can be reduced slightly if some elevation of degree is allowed. From Segre [1945, pg 295, Sec 12], it is possible to determine a line on the intersection of two given quadrics by solving equations of degree no higher than 5, provided the ambient dimension is at least 4. Thus to find a linear solution of a system with 2 equations of degree 2, and 3 equations of degree 1, an ambient dimension of 7 is sufficient. Comparing to the last paragraph in the proof above we have

Proposition 5.

Given a cubic hypersurface $V=V(f)$ in $\mathbb{P}^{n}$ , we can find a 2-plane contained in $V$ by solving equations of degree no higher than 5, provided $n\geq 9$ . Hence

\mathrm{RD}(\mathcal{M}_{3,9}^{2}\to\mathcal{M}_{3,9})\leq\mathrm{RD}(5)=1.

6 Removing 5 Terms from a Polynomial

Given a degree $n$ polynomial

x^{n}+a_{1}x^{n-1}+\ldots+a_{n}x+a_{n}

we wish to find a Tschirnhaus transformation

T(x)=b_{n-1}x^{n-1}+b_{n-2}x^{n-2}+\ldots+b_{1}x+b_{0}

such that after setting $y=T(x)$ the transformed polynomial

y^{n}+A_{1}y^{n-1}+\ldots+A_{n-1}y+A_{n}

satisfies $A_{1}=A_{2}=A_{3}=A_{4}=A_{5}=0$ . To determine the coefficients $b_{0},\ldots,b_{n-1}$ of $T$ then requires the solution of a system of equations with degrees 1, 2, 3, 4, 5, in $\mathbb{P}^{n-1}$ . In general, to find a solution to such a system requires solving a polynomial of degree $5!=120$ , but when $n$ is large enough this elevation of degree can be partially avoided.

We first informally sketch the geometry underlying Wolfson’s bound of $n=41$ . The general idea is that by finding linear subspaces of the hypersurfaces corresponding to the polynomials of our system, the total degree of the system can be reduced. In this case, to find the necessary Tschirnhaus transformation one first finds a $6$ -plane inside a quadric surface in $\mathbb{P}^{13}$ (this only requires solving degree 2 equations, so is resolvent degree 1). Then, intersecting the degree 3 equation with the 6-plane yields a cubic hypersurface in $\mathbb{P}^{6}$ . If we can then find a 2-plane inside this cubic hypersurface, then by intersecting the degree 4 and 5 equations with this plane, we are left with a system of total degree 20 to solve. In summary, we have the chain

V_{4}\cap V_{5}\subset\mathbb{P}^{2}\subset V_{3}\subset\mathbb{P}^{6}\subset V_{2}\subset\mathbb{P}^{13}=V_{1}\subset\mathbb{P}^{14}.

This gives an algorithm for finding the desired Tschirnhaus transformation provided one is able to find the necessary 2-plane inside the cubic hypersurface in $\mathbb{P}^{6}$ . Wolfson uses a dimension count to show that

RD(\mathcal{M}_{3,6}^{2}\to\mathcal{M}_{3,6})\leq\dim(\mathcal{M}_{3,6})=35,

and so is able to use this Tschirnhaus transformation to show $\mathrm{RD}(n)\leq n-6$ whenever $n-6\geq 35$ .

By increasing the ambient dimension in which the the cubic hypersurface lives, we can use Sylvester’s ideas to find a plane inside a cubic hypersurface in $\mathbb{P}^{9}$ by solving only equations of degree 5 or less. (i.e., in a resolvent degree 1 way). This allows the $n=41$ bound for removing 5 terms from a degree $n$ polynomial to be reduced to $n=21$ , with simpler irrationalities involved in finding the necesary Tschirnhaus transformation.

Theorem (Main Theorem).

Let $n\geq 21$ . Given a degree $n$ polynomial

x^{n}+a_{1}x^{n-1}+\ldots+a_{n}x+a_{n}

we can find a Tschirnhaus transformation

T(x)=b_{n-1}x^{n-1}+b_{n-2}x^{n-2}+\ldots+b_{1}x+b_{0}

such that after setting $y=T(x)$ the transformed polynomial

y^{n}+A_{1}y^{n-1}+\ldots+A_{n-1}y+A_{n}

satisfies $A_{1}=A_{2}=A_{3}=A_{4}=A_{5}=0$ . The coefficients $b_{0},\ldots b_{n-1}$ of $T$ can be determined by solving equations of degree at most $20$ .

Proof.

The equations $A_{1}=0$ , $A_{2}=0$ , $A_{3}=0$ , $A_{4}=0$ , $A_{5}=0$ impose polynomial conditions of degree 1, 2, 3, 4, and 5, respectively, on the point $(b_{0},\ldots,b_{n-1})\in\mathbb{P}^{n-1}$ to be determined. Using the degree 1 equation we can eliminate one variable. The degree 2 equation then determines a quadric hypersurface in $\mathbb{P}^{n-2}$ . By the classical theory of linear subspaces of quadrics, we can find a $\mathbb{P}^{9}$ contained in this hypersurface provided $n-2\geq 19$ .

Next, we consider the cubic hypersurface determined by this $\mathbb{P}^{9}$ and the degree 3 equation. By Proposition 5 of the previous section, we can find a $\mathbb{P}^{2}$ contained in this hypersurface by solving equations of degree at worst 5. Finally, intersecting the remaining equations of degree 4 and 5 determine two curves in this $\mathbb{P}^{2}$ , whose points of intersection are governed by an equation of degree at most 20. Each such point then satisfies all 5 polynomial conditions, by construction, and so yields a Tschirnhaus transformation of the desired form. ∎

In the same way that the reduction of the quintic to one-parameter form can be translated into the language of resolvent degree to yield $\mathrm{RD}(5)=1$ , this theorem implies $\mathrm{RD}(n)\leq n-k$ for $n\geq 21$ .

References

[1] Richard Brauer. On the resolvent problem. Annali di Matematica Pura ed Applicata, 102(1):45–55, 1975.
[2] Joe Buhler and Zinovy Reichstein. On tschirnhaus transformations. In Topics in Number theory, pages 127–142. Springer, 1999.
[3] Grigorii Nikolaevich Chebotarev. On the problem of resolvents. Uchenye Zapiski Kazanskogo Universiteta. Seriya Fiziko-Matematicheskie Nauki, 114(2):189–193, 1954.
[4] Alexander Chen, Yang-Hui He, and John McKay. Erland Samuel Bring’s “transformation of algebraic equations”. arXiv preprint arXiv:1711.09253, 2017.
[5] Olivier Debarre and Laurent Manivel. Sur la vari et e des espaces lin eaires contenus dans une intersection compl ete. Math. Ann, 312:549–574, 1998.
[6] Jacques Dixmier. Histoire du 13e probleme de hilbert. Cahiers du séminaire d’histoire des mathématiques, 3:85–94, 1993.
[7] Benson Farb and Jesse Wolfson. Resolvent degree, hilbert’s 13th problem and geometry. L’Enseignement Mathématique, 65(3):303–376, 2020.
[8] Sir William Rowan Hamilton. Inquiry Into the Validity of a Method Recently Proposed by George B. Jerrard, Esq. for Transforming and Resolving Equations of Elevated Degrees Undertaken at the Request of the Association. Richard and John E. Taylor, 1836.
[9] David Hilbert. Über die gleichung neunten grades. Mathematische Annalen, 97(1):243–250, 1927.
[10] George Birch Jerrard. Mathematical Researches. Longman, Bristol and London, 1834.
[11] George Birch Jerrard. Xxiv. on certain transformations connected with the finite solution of equations of the fifth degree: To the editors of the philosophical magazine and journal. The London, Edinburgh, and Dublin Philosophical Magazine and Journal of Science, 7(39):202–203, 1835.
[12] Joseph Lagrange. Réflexions sur la résolution algébrique des équations. Nouveaux Mémoires de l’Académie Royale des Sciences et Belles-Lettres de Berlin, 1771.
[13] Beniamino Segre. The algebraic equations of degrees 5, 9, 157,…, and the arithmetic upon an algebraic variety. Annals of Mathematics, pages 287–301, 1945.
[14] Alexander J. Sutherland. GN Chebotarev’s “on the problem of resolvents”. arXiv preprint arXiv:2107.01006, 2021.
[15] Alexander J Sutherland. Upper bounds on resolvent degree and its growth rate. arXiv preprint arXiv:2107.08139, 2021.
[16] James Joseph Sylvester. On the so-called tschirnhausen transformation. 1887.
[17] Ehrenfried Walther von Tschirnhaus. Methodus auferendi omnes terminos intermedios ex data aeqvatione (method of eliminating all intermediate terms from a given equation). Acta Eruditorum (1683), pages 204–207.
[18] Anders Wiman. Über die Anwendung der Tschirnhausentransformation auf die Reduktion algebraischer Gleichungen. Almquist & Wiksells Boktr., 1927.
[19] Jesse Wolfson. Tschirnhaus transformations after hilbert. L’Enseignement Mathématique, 66(3):489–540, 2021.