This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Greatly improved higher-order product formulae for quantum simulation

Mauro E. S. Morales Centre for Quantum Computation and Communication Technology, University of Technology Sydney, Sydney, NSW 2007, AU Centre for Quantum Software and Information, University of Technology Sydney, Sydney, NSW 2007, AU Joint Center for Quantum Information and Computer Science (QuICS), University of Maryland, College Park, Maryland 20742, USA    Pedro C. S. Costa School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW 2109, AU Quantum for New South Wales, Sydney, NSW 2000, AU    Giacomo Pantaleoni School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW 2109, AU    Daniel K. Burgarth School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW 2109, AU Department Physik, Friedrich-Alexander-Universität Erlangen-Nürnberg, Staudtstraße 7, 91058 Erlangen, Germany    Yuval R. Sanders Centre for Quantum Software and Information, University of Technology Sydney, Sydney, NSW 2007, AU School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW 2109, AU    Dominic W. Berry School of Mathematical and Physical Sciences, Macquarie University, Sydney, NSW 2109, AU
Abstract

Quantum algorithms for simulation of Hamiltonian evolution are often based on product formulae. The fractal method of Suzuki gives a systematic way to find arbitrarily high-order product formulae, but results in a large number of exponentials. On the other hand, product formulae with fewer exponentials can be found by numerical solution of simultaneous nonlinear equations. It is also possible to reduce the cost of long-time simulations by processing, where a kernel is repeated and a processor need only be applied at the beginning and end of the simulation. In this work, we found thousands of new product formulae of both 8th and 10th order, and numerically tested these formulae, together with many formulae from prior literature. We provide methods to fairly compare product formulae of different lengths and different orders. We have found a new 8th order processed product formula with exceptional performance, that outperforms all other tested product formulae for about eight orders of magnitude in system parameters TT (time) and ϵ\epsilon (allowable error). That includes most reasonable combinations of parameters to be used in quantum algorithms.

I Introduction

The Lie-Trotter product formula is commonly used in quantum algorithms for Hamiltonian simulation, where one can approximate the Hamiltonian evolution of H=A+BH=A+B in terms of exponentials of AA and BB when these operators do not commute. For short time, a standard approximation is the second-order symmetric formula S2(t)=eiAt/2eiBteiAt/2S_{2}(t)=e^{-iAt/2}e^{-iBt}e^{-iAt/2}, which satisfies eiHt=S2(t)+𝒪(t3)e^{-iHt}=S_{2}(t)+\mathcal{O}(t^{3}). More generally, the error in an order kk formula is 𝒪(tk+1)\mathcal{O}(t^{k+1}). Longer times are simulated by breaking the evolution into many repetitions of a short time. The number of repetitions needed is reduced with the order, motivating the search for higher-order product formulae. A systematic method to produce arbitrarily high order formulae is the fractal method of Suzuki [1, 2], which has found applications in Hamiltonian simulation [3]. The first explicit use of product formulae for quantum simulation was given in [4], applying it for quantum evolution under local Hamiltonians. Later work considered the broader class of sparse Hamiltonians [5] and higher orders [3], as well as methods beyond product formulae [6, 7, 8, 9].

Recent work has shown that despite its simplicity, the Lie-Trotter product formula can compete with other Hamiltonian simulation algorithms due to the low error that it achieves in practice [10]. Methods based on linear combinations of unitaries [11, 12] or quantum signal processing [13] give complexity logarithmic in the inverse error, but the error is not required to be extremely small, meaning those methods do not provide as large an advantage as might be expected. Product formula error bounds can be further improved by considering particular physical systems [14, 15, 16] or leveraging randomisation [17, 18, 19]. Moreover, Trotter formulae are expected to be relevant for both noisy intermediate-scale quantum computation and fault-tolerant computation. It is then of great importance to seek efficient implementations of product formulae as it can have a great impact on the efficiency of Hamiltonian simulation algorithms in practice.

The downside of the Suzuki method to generate higher-order formulae is that the number of exponential operators to implement it grows very rapidly. Suzuki’s product formulae are usually assumed in quantum computing, but they can be greatly improved upon. An alternative method by Yoshida [20] can be used to obtain product formulae with a smaller number of exponentials. Similar to Suzuki’s formulae, they are given as a product of S2S_{2} for different time intervals, but in contrast to Suzuki’s approach there is not an explicit analytic form for the higher-order formulae. Instead one needs to derive and solve a complicated set of simultaneous nonlinear polynomial equations.

Yoshida gives what appears to be all 6th order solutions but only some 8th order integrators, and did not consider any orders beyond that. More recent work [21, 22, 23, 24] has pushed the search to higher orders. In Ref. [23] a summary of product formulae in the literature is given, including results from orders 4 to 10. The terminology used in that work is stages for the number of S2S_{2} in the product formula. In Ref. [21] the authors provide 8th order product formulae with 15 and 17 stages which improve over those of Suzuki and Yoshida. Other product formulae based on the ansatz of Yoshida are also presented in Ref. [22]. For 8th order, Ref. [22] replicates the findings of [21] for product formulae with 15 and 17 stages, and finds new solutions with 19 and 21 stages. For 10th order, Ref. [22] provides solutions with 31 and 33 stages.

Another approach is that of processed product formulae [25, 26, 27, 28, 29, 30, 24, 31]. Instead of the product formula being symmetric, it is of the form PΣP1P\Sigma P^{-1} for kernel Σ\Sigma and processor PP. The PP and P1P^{-1} cancel when using the product of many of these for evolution over a long time, so the cost is dominated by the number of stages in the kernel. Since the kernel has fewer restrictions on it, it can have fewer stages than a normal symmetric product formula.

The objective of this work is to find improved product formulae and compare their performance in a consistent manner. We show that the performance of product formulae is better quantified by the error in the eigenvalues, rather than the spectral-norm error as usually considered in prior work. This is because it is the eigenvalue error that dominates the error for evolution at longer times. We also derive a method to consistently compare the performance of product formulae with different numbers of exponentials.

We use these methods to compare between the performance of our product formulae, as well as to compare to prior product formulae in the literature. By our numerical search, we found hundreds of thousands of solutions at 8th order, including some that outperform those previously reported in the literature, with our best being a processed product formula. For 10th order we have found thousands of new solutions including some with extremely low error, though one given in Ref. [22] provides even better performance. A detailed list of the product formulae considered in this work can be found in Section VI.

When comparing product formulae of different orders, it is better to use higher-order formulae for larger values of T/ϵT/\epsilon, where TT is the total evolution time and ϵ\epsilon is the required precision. For smaller T/ϵT/\epsilon it is better to use lower-order formulae, but as it is increased there are threshold values beyond which it is optimal to use higher-order formulae. We derive a methods for determining these thresholds, and show that our 8th order product formula is best for T/ϵT/\epsilon from about 10610^{6} to 101410^{14}, which includes the range of typical values for quantum chemistry applications. This means that the best 8th order product formula we have found in this work will be best suited to real applications. It is possible to reduce the lower threshold by an order of magnitude by adjusting the product formula to provide low error for larger time steps.

The organisation of this work is as follows. In Section II we introduce the background necessary for the rest of the paper. First, we define product formulae and introduce the Suzuki method for generating higher-order product formulae in Section II.1, then in Section II.2 we give a summary of Yoshida’s method, then we explain processors for product formulae in Section II.3. In Section III we present the methods used for solution, including Subsection III.2 where we introduce the Taylor series method to find new product formulae, which is distinct from Yoshida’s method. In Sections IV.1 and IV.2 we present the results regarding new 8th and 10th order product formulae, and then in Section VI we give a comprehensive comparison of the product formulae.

II Background

In this section, we give a summary of the background for our work. We begin by defining product formulae and the Baker-Campbell-Hausdorff formula, then we introduce the Suzuki method and Yoshida method to obtain higher-order formulae, and describe the processed product formulae.

II.1 Product formulae

It is well known that, for any non-commuting operators XX and YY,

exp((X+Y)t)=exp(Xt)exp(Yt)+(t2).\exp((X+Y)t)=\exp(Xt)\exp(Yt)+\order{t^{2}}. (1)

where we have absorbed the i-i factor used in quantum simulation into XX and YY. The above equation demonstrates that the exponential of a sum of two operators is, to first order, equal to the product of the exponential of those operators. The above equation is often referred as a ‘first-order product formula’. Higher-order terms can be computed via the Baker-Campbell-Haussdorff (BCH) formula.

Theorem 1 (Baker-Campbell-Haussdorff formula [32]).

Let XX and YY be any operators such that X+Y<ln(2)\norm{X}+\norm{Y}<\ln{2}. We have for an operator ZZ that exp(X)exp(Y)=exp(Z)\exp(X)\exp(Y)=\exp(Z), with

Z=n=1(1)n1nr1+s1>0rn+sn>0[Xr1,Ys1,Xrn,Ysn](j=1nri+si)i=1nri!si!,Z=\sum_{n=1}^{\infty}\frac{(-1)^{n-1}}{n}\sum_{\begin{subarray}{c}r_{1}+s_{1}>0\\ \vdots\\ r_{n}+s_{n}>0\end{subarray}}\frac{[X^{r_{1}},Y^{s_{1}},\cdots X^{r_{n}},Y^{s_{n}}]}{\left(\sum_{j=1}^{n}r_{i}+s_{i}\right)\prod_{i=1}^{n}r_{i}!s_{i}!}, (2)

where

[Xr1,Ys1,Xrn,Ysn]=[X,[X,[Xr1,[Y,[Y,[Ys1,[X,[X,[Xrn,[Y,[Y,Ysn]]]].[X^{r_{1}},Y^{s_{1}},\dotsm X^{r_{n}},Y^{s_{n}}]=[\underbrace{X,[X,\dotsm[X}_{r_{1}},[\underbrace{Y,[Y,\dotsm[Y}_{s_{1}},\,\dotsm\,[\underbrace{X,[X,\dotsm[X}_{r_{n}},[\underbrace{Y,[Y,\dotsm Y}_{s_{n}}]]\dotsm]].

The standard second-order symmetric product formula is as given in the definition below.

Definition 2 (Symmetric product formula of order two).

Let XX and YY be non-commuting operators and let tt be a real variable. Define

S2(t):=exp(12Xt)exp(Yt)exp(12Xt).S_{2}(t):=\exp(\frac{1}{2}Xt)\exp(Yt)\exp(\frac{1}{2}Xt). (3)

The operators XX and YY used in the definition of S2S_{2} should always be clear from context. More generally, when there is a sum of XjX_{j}, the product formula is

S2(t):=[j=1Jexp(12Xjt)][j=J1exp(12Xjt)].S_{2}(t):=\left[\prod_{j=1}^{J}\exp(\frac{1}{2}X_{j}t)\right]\left[\prod_{j=J}^{1}\exp(\frac{1}{2}X_{j}t)\right]. (4)

Products are ordered with the starting index on the right and the final one on the left, so for J=2J=2 terms the expression in the definition is obtained. Note that S2S_{2}, and all product formulae considered in this work, have an order independent of JJ, so there may be any number of terms in the sum. The corollary below describes the form of the error terms in the symmetric product formula, and also implies that it is second order.

Corollary 3 (Symmetric BCH formula [20]).

Let XX and YY be any operators such that X+Y<ln(2)\norm{X}+\norm{Y}<\ln{2} and let tt be a real variable. Define ZZ such that S2(t)=exp(Z)S_{2}(t)=\exp(Z). Then there is a sequence α\alpha_{\ell} consisting of linear combinations of \ell-term commutators in XX and YY such that

Z==1αt.Z=\sum_{\ell=1}^{\infty}\alpha_{\ell}t^{\ell}. (5)

Moreover, α0\alpha_{\ell}\equiv 0 whenever \ell is even.

Reference [20] also shows that even terms are zero for more general symmetric product formulae. The first three non-zero α\alpha_{\ell} terms from above are

α1\displaystyle\alpha_{1} =X+Y,\displaystyle=X+Y, (6)
α3\displaystyle\alpha_{3} =112[Y,[Y,X]]124[X,[X,Y]],\displaystyle=\frac{1}{12}[Y,[Y,X]]-\frac{1}{24}[X,[X,Y]], (7)
α5\displaystyle\alpha_{5} =75760[X,X,X,X,Y]1720[Y,Y,Y,Y,X]+1360[X,Y,Y,Y,X]+1360[Y,X,X,X,Y]1480[X,X,Y,Y,X]\displaystyle=\frac{7}{5760}[X,X,X,X,Y]-\frac{1}{720}[Y,Y,Y,Y,X]+\frac{1}{360}[X,Y,Y,Y,X]+\frac{1}{360}[Y,X,X,X,Y]-\frac{1}{480}[X,X,Y,Y,X]
+1120[Y,Y,X,X,Y].\displaystyle\quad+\frac{1}{120}[Y,Y,X,X,Y]. (8)

Here the square brackets are used to indicate multicommutator expressions similar to the notation in Theorem 1, for example

[Y,Y,X,X,Y][Y,[Y,[X,[X,Y]]]].[Y,Y,X,X,Y]\equiv[Y,[Y,[X,[X,Y]]]]. (9)

Expressions for each α\alpha_{\ell} can be derived from two applications of the usual BCH formula.

Definition 4 (Product formula).

Let XX and YY be any non-commuting operators. Given a natural number nn, a product formula of order nn is a pair (c,d)(\vec{c},\vec{d}) with c,d\vec{c},\vec{d}\in\mathbb{R}^{\ell} and \ell a natural number such that for all tt\in\mathbb{R}

exp((X+Y)t)=j=1exp(cjXt)exp(djYt)+(tn+1).\exp((X+Y)t)=\prod_{j=1}^{\ell}\exp(c_{j}Xt)\exp(d_{j}Yt)+\order{t^{n+1}}. (10)

We refer to the number of non-zero coefficients in (c,d)\left(\vec{c},\vec{d}\right) as the length of the product formula.

Hence S2S_{2} is a length-33 product formula. The number of exponentials used in the product formula is a crucial measure of its efficiency, and in quantum simulation it is proportional to the required number of gates.

Suzuki method for generating higher-order product formulae.

Here we describe Suzuki’s fractal methods from [1, 2] to obtain higher-order product formulae. Starting from the symmetrised product formula in Eq. (3), the fractal method produces product formulae at all even orders. Suzuki’s first fractal method to generate a product formula of order k=2κk=2\kappa is [1]

S2κ(t)=S2κ2(sκt)S2κ2((12sκ)t)S2κ2(sκt),S_{2\kappa}(t)=S_{2\kappa-2}(s_{\kappa}t)S_{2\kappa-2}((1-2s_{\kappa})t)S_{2\kappa-2}(s_{\kappa}t), (11)

where sκ=1/(221/(2κ1))s_{\kappa}=1/(2-2^{1/(2\kappa-1)}). This method can be used to generate even orders starting at S2S_{2}. A drawback to this method is that both sks_{k} and 12sκ1-2s_{\kappa} are greater than 1, so the coefficients in the higher-order methods are large, causing greater error.

Alternatively to Eq. 11, an order 2κ2\kappa product formula can be obtained via Suzuki’s second fractal method [2]

S~2κ(t)=S~2κ2(uκt)2S~2κ2((14uκ)t)S~2κ2(uκt)2,\tilde{S}_{2\kappa}(t)=\tilde{S}_{2\kappa-2}(u_{\kappa}t)^{2}\tilde{S}_{2\kappa-2}((1-4u_{\kappa})t)\tilde{S}_{2\kappa-2}(u_{\kappa}t)^{2}, (12)

where uκ=1/(441/(2κ1))u_{\kappa}=1/(4-4^{1/(2\kappa-1)}). This method has the advantage that both uκu_{\kappa} and 14uκ1-4u_{\kappa} are less than 1, so the coefficients of higher-order formulae are small resulting in small error. The drawback is that far more exponentials are required. Each iteration uses 5 copies of the lower-order formula, whereas the previous one uses 3 copies. The virtue of these fractal methods is that they allow one to generate arbitrarily high-order product formulae easily, albeit at the expense of a large number of exponentials.

Exponential length scaling of the Suzuki method.

For Suzuki’s first method, the total number of exponentials for a given order 2κ=4,6,8,2\kappa={4,6,8,...} in the product formula S2κS_{2\kappa} is given by

2(J1)3κ1+1.2(J-1)3^{\kappa-1}+1\,. (13)

For example J=2J=2 for X+YX+Y and κ=1\kappa=1 just corresponds to S2S_{2}, and the expression gives 3 as expected. For Suzuki’s second method S~2κ\tilde{S}_{2\kappa} the number of exponentials is

2(J1)5κ1+1.2(J-1)5^{\kappa-1}+1. (14)

The number of exponential operators in both cases of the Suzuki method grows very rapidly. Thus one may be interested in alternative method to obtain product formulae with a lower count, such as the method of Yoshida in the next section.

II.2 Yoshida’s method for deriving 6th order product formulae

Approach.

Rather than using Eqs. 11 and 12, Yoshida uses the general procedure

S(m)(t)=(j=1mS2(wmj+1t))S2(w0t)(j=1mS2(wjt)),\displaystyle S^{(m)}(t)=\bigg{(}\prod_{j=1}^{m}S_{2}(w_{m-j+1}t)\bigg{)}S_{2}(w_{0}t)\bigg{(}\prod_{j=1}^{m}S_{2}(w_{j}t)\bigg{)}, (15)

where wjw_{j}\in\mathbb{R} for j=0,1,,mj=0,1,\cdots,m are parameters to be determined. Note the number of exponentials in this product is given by (4m+2)(J1)+1(4m+2)(J-1)+1. Given this ansatz, the task becomes to find mm and wiw_{i} such that S(m)S^{(m)} is an order kk product formula. We will illustrate Yoshida’s method by deriving the result for 6th order.

Expand Yoshida product using Baker-Campbell-Haussdorf formula.

The method is to expand Eq. 15 using the BCH formula from Theorem 1 recursively as follows. First, note that from Corollary 3, S2(t)=et2XetYet2X=etα1+t3α3S_{2}(t)=e^{\frac{t}{2}X}e^{tY}e^{\frac{t}{2}X}=e^{t\alpha_{1}+t^{3}\alpha_{3}\cdots}, where α\alpha_{\ell} is a commutator of \ell operators as described below Corollary 3. We are for the moment considering sums of two terms X+YX+Y. Define C=i=1t2i1w12i1α2i1C=\sum_{i=1}^{\infty}t^{2i-1}w_{1}^{2i-1}\alpha_{2i-1} and D=i=1t2i1w02i1α2i1D=\sum_{i=1}^{\infty}t^{2i-1}w_{0}^{2i-1}\alpha_{2i-1}. Then,

S2(w1t)S2(w0t)S2(w1t)\displaystyle S_{2}(w_{1}t)S_{2}(w_{0}t)S_{2}(w_{1}t)
=eCeDeC\displaystyle=e^{C}e^{D}e^{C}
=exp{tw1α1+t3w13α3+t5w15α5+(t7)}exp{tw0α1+t3w03α3+tw05α5+(t7)}\displaystyle=\exp\{tw_{1}\alpha_{1}+t^{3}w_{1}^{3}\alpha_{3}+t^{5}w_{1}^{5}\alpha_{5}+\order{t^{7}}\bigg{\}}\exp\{tw_{0}\alpha_{1}+t^{3}w_{0}^{3}\alpha_{3}+tw_{0}^{5}\alpha_{5}+\order{t^{7}}\bigg{\}}
×exp{tw1α1+t3w13α3+t5w15α5+(t7)}\displaystyle\quad\times\exp\{tw_{1}\alpha_{1}+t^{3}w_{1}^{3}\alpha_{3}+t^{5}w_{1}^{5}\alpha_{5}+\order{t^{7}}\bigg{\}}
=exp{t(2w1+w0)α1+t3(2w13+w03)α3+t5(2w15+w05)α5+16([D,D,C][C,C,D])+(t7)}.\displaystyle=\exp\{t(2w_{1}+w_{0})\alpha_{1}+t^{3}(2w_{1}^{3}+w_{0}^{3})\alpha_{3}+t^{5}(2w_{1}^{5}+w_{0}^{5})\alpha_{5}+\frac{1}{6}([D,D,C]-[C,C,D])+\order{t^{7}}\bigg{\}}. (16)

A simple computation shows

[D,D,C][C,C,D]\displaystyle[D,D,C]-[C,C,D] =t5(w02w13w12w03+w14w0w04w1)[α1,α1,α3]+(t7).\displaystyle=t^{5}(w_{0}^{2}w_{1}^{3}-w_{1}^{2}w_{0}^{3}+w_{1}^{4}w_{0}-w_{0}^{4}w_{1})[\alpha_{1},\alpha_{1},\alpha_{3}]+\order{t^{7}}. (17)

Define β5=[α1,α1,α3]\beta_{5}=[\alpha_{1},\alpha_{1},\alpha_{3}] so

S2(w1t)S2(w0τ)S2(w1t)\displaystyle S_{2}(w_{1}t)S_{2}(w_{0}\tau)S_{2}(w_{1}t) =exp{t(2w1+w0)α1+t3(2w13+w03)α3+t5(2w15+w05)α5\displaystyle=\exp\{t(2w_{1}+w_{0})\alpha_{1}+t^{3}(2w_{1}^{3}+w_{0}^{3})\alpha_{3}+t^{5}(2w_{1}^{5}+w_{0}^{5})\alpha_{5}
+t516(w02w13w12w03+w0w14w04w1)β5+(t7)}.\displaystyle\quad+t^{5}\frac{1}{6}(w_{0}^{2}w_{1}^{3}-w_{1}^{2}w_{0}^{3}+w_{0}w_{1}^{4}-w_{0}^{4}w_{1})\beta_{5}+\order{t^{7}}\bigg{\}}. (18)

By an induction argument Yoshida shows that

S(m)(t)=exp{tA1,mα1+t3A3,mα3+t5(A5,mα5+B5,mβ5)+(t7)},\displaystyle S^{(m)}(t)=\exp\{tA_{1,m}\alpha_{1}+t^{3}A_{3,m}\alpha_{3}+t^{5}(A_{5,m}\alpha_{5}+B_{5,m}\beta_{5})+\order{t^{7}}\bigg{\}}, (19)

where Aj,mA_{j,m} and B5,mB_{5,m} are polynomials on the variables w0,,wmw_{0},\dots,w_{m}.

The case m=0m=0 is just the symmetric BCH formula, so it is clear that Eq. (19) holds with

A1,0\displaystyle A_{1,0} =w0,\displaystyle=w_{0},
A3,0\displaystyle A_{3,0} =w03,\displaystyle=w_{0}^{3},
A5,0\displaystyle A_{5,0} =w05,\displaystyle=w_{0}^{5},
B5,0\displaystyle B_{5,0} =0.\displaystyle=0. (20)

To prove Eq. (19) for m>0m>0, one needs to show that the exponential is of the form with operator α1\alpha_{1} for first order in tt, operator α3\alpha_{3} for third order in tt, and operators α5\alpha_{5} and β5\beta_{5} for fifth order in tt. This result may be shown using

S(m+1)(t)\displaystyle S^{(m+1)}(t) =S2(wm+1t)S(m)(t)S2(wm+1t)\displaystyle=S_{2}(w_{m+1}t)S^{(m)}(t)S_{2}(w_{m+1}t)
=exp{twm+1α1+t3wm+13α3+t5wm+15α5+(t7)}\displaystyle=\exp\{tw_{m+1}\alpha_{1}+t^{3}w_{m+1}^{3}\alpha_{3}+t^{5}w_{m+1}^{5}\alpha_{5}+\order{t^{7}}\bigg{\}}
×exp{tA1,mα1+t3A3,mα3+t5(A5,mα3+B5,mβ5)+(t7)}\displaystyle\quad\times\exp\{tA_{1,m}\alpha_{1}+t^{3}A_{3,m}\alpha_{3}+t^{5}(A_{5,m}\alpha_{3}+B_{5,m}\beta_{5})+\order{t^{7}}\bigg{\}}
×exp{twm+1α1+t3wm+13α3+t5wm+15α5+(t7)}\displaystyle\quad\times\exp\{tw_{m+1}\alpha_{1}+t^{3}w_{m+1}^{3}\alpha_{3}+t^{5}w_{m+1}^{5}\alpha_{5}+\order{t^{7}}\bigg{\}}
=exp{2twm+1α1+tA1,mα1+2t3wm+13α3+t3A3,mα3+2t5wm+15α5+t5A5,mα5+t5B5,mβ5\displaystyle=\exp\{2tw_{m+1}\alpha_{1}+tA_{1,m}\alpha_{1}+2t^{3}w_{m+1}^{3}\alpha_{3}+t^{3}A_{3,m}\alpha_{3}+2t^{5}w_{m+1}^{5}\alpha_{5}+t^{5}A_{5,m}\alpha_{5}+t^{5}B_{5,m}\beta_{5}
+16t5(A1,m2wm+13A1,mA3,mwm+1wm+12A3,m+wm+14A1,m)β5+(t7)}.\displaystyle\quad+\frac{1}{6}t^{5}(A_{1,m}^{2}w_{m+1}^{3}-A_{1,m}A_{3,m}w_{m+1}-w_{m+1}^{2}A_{3,m}+w_{m+1}^{4}A_{1,m})\beta_{5}+\order{t^{7}}\bigg{\}}. (21)

Hence, if the product formula can be expressed as in the form (19) for S(m)(t)S^{(m)}(t), it can again be expressed in this form for S(m+1)(t)S^{(m+1)}(t), establishing it for all m0m\geq 0 by induction. This expression also shows that the scalar coefficients can be determined from the formulae

A1,m+1\displaystyle A_{1,m+1} =2wm+1+A1,m,\displaystyle=2w_{m+1}+A_{1,m},
A3,m+1\displaystyle A_{3,m+1} =2wm+13+A3,m,\displaystyle=2w_{m+1}^{3}+A_{3,m},
A5,m+1\displaystyle A_{5,m+1} =2wm+15+A5,m,\displaystyle=2w_{m+1}^{5}+A_{5,m},
B5,m+1\displaystyle B_{5,m+1} =B5,m+16(A1,m2wm+13A1,mA3,mwm+1wm+12A3,m+wm+14A1,m).\displaystyle=B_{5,m}+\frac{1}{6}(A_{1,m}^{2}w_{m+1}^{3}-A_{1,m}A_{3,m}w_{m+1}-w_{m+1}^{2}A_{3,m}+w_{m+1}^{4}A_{1,m}). (22)

See Appendix A for an explanation of how to extend this approach to 10th order.

Constraints to satisfy in order to derive 6th order formula.

To derive a 6th order formula, the lower-order terms in the exponential in Eq. 19 must be zero (see also Eq. (5.16) of [20]), which gives the four conditions

A1,m=1,A3,m=0,A5,m=0,B5,m=0.A_{1,m}=1,\quad A_{3,m}=0,\quad A_{5,m}=0,\quad B_{5,m}=0. (23)

For m=3m=3 there are four unknowns w0w_{0} to w3w_{3}, and it can be expected there are solutions because there are the same number of equations as unknowns. In practice A1,m=1A_{1,m}=1 is satisfied by taking w0=12jwjw_{0}=1-2\sum_{j}w_{j}, so there are then three equations for three unknowns w1,w2,w3w_{1},w_{2},w_{3}. Whereas it is possible to solve the equations using the Newton-Raphson method, the expression for the appropriate Jacobian matrix is complicated, so Yoshida instead uses the Brent method. Yoshida produced three m=3m=3 solutions in this way, and states “It seems that there is no other solution.” We have performed an extensive search and also found no more solutions.

The product formulae obtained through the Yoshida method also work for exponentials of sums of more operators. The S2S_{2} product formula can again be used as the building block for the product formula, and we can write

S2(t)=exp(=0α~t),S_{2}(t)=\exp\left(\sum_{\ell=0}^{\infty}\tilde{\alpha}_{\ell}t^{\ell}\right), (24)

where α~\tilde{\alpha}_{\ell} are now order-\ell multicommutator expressions on the JJ terms. The reasoning for finding the product formula is entirely based on the construction with α{\alpha}_{\ell}, but without taking advantage of its particular form, so exactly the same reasoning holds for α~\tilde{\alpha}_{\ell}. This immediately implies that the higher-order product formulae work in general. This is an advantage of constructing product formulae as products of S2S_{2}, because deriving coefficients separately for exponentials of XX and YY would not generalise.

II.3 Processed product formulae

Another technique to obtain higher-order product formulae is that of processing [24, 25, 26, 27, 28, 29, 30]. In this technique a product formula SkS_{k} of order kk is generated by the composition of a kernel Σ\Sigma conjugated by a processor operator PP as

Sk=PΣP1.\displaystyle S_{k}=P\Sigma P^{-1}. (25)

The advantage of this method is that usually Σ\Sigma requires fewer stages than other methods, and due to the construction, we have that Skn=PΣnP1S_{k}^{n}=P\Sigma^{n}P^{-1}. This means that the cost of using the product formula is effectively that of the kernel in the usual application where a long time evolution is partitioned into many repetitions of the product formula for short times.

Typically, one chooses a kernel Σ\Sigma as a product formula that has a smaller order than kk, but conjugating by processors gives an order kk integrator. A table of available kernels that can be combined with processors is given in Ref. [24], and recently new kernels were reported in [31]. Another advantage of this method is that the number of nonlinear equations required to be solved to find new product formulae is reduced, as one only needs to find new parameters for the kernel Σ\Sigma, and then conjugate it by known processors in the literature to give an order kk integrator.

To be more specific, Ref. [24] gives a set of conditions for the kernels and processors in Table 4 of that work. Reference [24] uses the notation YjY_{j} rather than αj\alpha_{j}, so S2(t)=etY1+t3Y3S_{2}(t)=e^{tY_{1}+t^{3}Y_{3}\cdots}. It then uses the notation Ej,iE_{j,i} for multicommutator expressions, with Ej,1=YjαjE_{j,1}=Y_{j}\equiv\alpha_{j} and for example

E5,2\displaystyle E_{5,2} =[Y1,Y1,Y3]β5,\displaystyle=[Y_{1},Y_{1},Y_{3}]\equiv\beta_{5}\,, (26)
E7,2\displaystyle E_{7,2} =[Y3,Y1,Y3]γ7,\displaystyle=[Y_{3},Y_{1},Y_{3}]\equiv-\gamma_{7}\,, (27)

where γ7\gamma_{7} is given in Appendix A. In Ref. [24] the quantities fj,if_{j,i} are the coefficients of Ej,iE_{j,i}, so the equivalent of Eq. (19) in that notation is

S(m)(t)=exp{tf1,1E1,1+t3f3,1E3,1+t5(f5,1E5,1+f5,2E5,2)+(t7)}.\displaystyle S^{(m)}(t)=\exp\!\bigg{\{}tf_{1,1}E_{1,1}+t^{3}f_{3,1}E_{3,1}+t^{5}(f_{5,1}E_{5,1}+f_{5,2}E_{5,2})+\order{t^{7}}\bigg{\}}\,. (28)

To obtain the conditions for a 6th order kernel from Ref. [24], one should use the conditions for fj,if_{j,i} up to q=5q=5 in Table 4, which give f1,1=1f_{1,1}=1, f3,1=0f_{3,1}=0, f5,1=0f_{5,1}=0, which in the above notation are equivalent to

A1,m=1,A3,m=0,A5,m=0.A_{1,m}=1,\quad A_{3,m}=0,\quad A_{5,m}=0. (29)

The conditions for the kernel are the same as for the complete product formula, but missing B5,m=0B_{5,m}=0. That is a general feature of the kernel order conditions; they are a subset of the conditions for the complete product formula, enabling shorter kernels, or more flexibility to choose kernels of the same length but lower error. For 4th order there are the same number of conditions, so the kernel is a valid product formula of that order.

For the processors, Ref. [24] uses the notation pj,ip_{j,i} for the coefficients of Ej,iE_{j,i} in the expansion of PP. These need to be determined from the BCH formula rather than symmetric BCH, and instead of only odd-order terms there need to be only even-order terms. Some symmetries mean that low-order odd terms automatically cancel, but higher-order ones need to be made zero by the choice of PP. Table 4 in Ref. [24] gives the conditions on the even-order terms. The first processor condition depending on fj,if_{j,i} is for q=5q=5, which means that for 6th order and higher the processors are dependent on the kernel. The 4th order the kernel is already 4th order, and the processors will yield another 4th order product formula, but can be chosen to reduce the error (which arises from higher-order terms). See Appendix B for more detail on the method of determining processors.

III Method of solution

III.1 Numerical methods

The size of the product formula we consider as in Eq. (15) is giverned by the value of mm, so there is a product of M=2m+1M=2m+1 integrators. The number of free parameters is mm, and it needs to be a least a large as the number of independent equations in order for there to be a solution. Yoshida [20] finds that m=7m=7 is minimal for 8th order, and similarly to prior work [22] we find that m=15m=15 is minimal for 10th order; see Appendix A. Choosing the minimal mm typically yields product formulae with large error, and for that reason we also consider larger values of mm.

For the numerical solution of the simultaneous nonlinear equations, we found that Matlab’s fsolve (using the Levenberg-Marquardt algorithm) provided rapid results using the vector of errors. The method was to repeatedly solve from random starting vectors w\vec{w} generated according to the normal distribution. The best solutions were those with smaller values for the coefficients; for much of the calculations we selected standard deviations of 0.60.6 for 8th order and 0.90.9 for 10th order. We filtered the large number of solutions by numerically checking the error for example Hamiltonians, and selecting those that provided the best performance. We then perform tests for larger numbers of samples, to better select the best performing product formulae. We also further refined the solutions to give the results to extended precision, which enables testing of the error with smaller values of tt. We were also able to further reduce the error by using an alternative solution method using a Taylor series.

III.2 Solution using Taylor expansion

Another solution method we use is based on computing the Taylor expansion of both the exact exponential and its product formula approximation with given wjw_{j}. This Taylor expansion is performed on both sides up to the desired order of approximation for the integrator. By imposing equality on terms of the same order we obtain a series of equations for wjw_{j} which can be solved. For higher orders, a large number of simultaneous equations are obtained, so we need an automated way of generating them.

To make precise the problem we are solving, denote as 𝒯k[]\mathcal{T}_{k}[\cdot] the map giving the Taylor expansion in tt around 0, truncated at order kk, so

𝒯k[et(X+Y)]\displaystyle\mathcal{T}_{k}[e^{t(X+Y)}] =p=0ktpp!(X+Y)p\displaystyle=\sum_{p=0}^{k}\frac{t^{p}}{p!}(X+Y)^{p}
=p=0ktpp!r1,,rp=01Xr1Y1r1XrpY1rp.\displaystyle=\sum_{p=0}^{k}\frac{t^{p}}{p!}\sum_{r_{1},\cdots,r_{p}=0}^{1}X^{r_{1}}Y^{1-r_{1}}\cdots X^{r_{p}}Y^{1-r_{p}}. (30)

We consider a sum of two operators X+YX+Y, but this approach for solving for wjw_{j} will also be sufficient to provide product formulae for sums of arbitrary numbers of terms. That is because the solutions must also be solutions of the equations derived using Yoshida’s method, and as explained above those equations will be the same when considering sums of arbitrary numbers of terms. Now consider the ansatz operator of Yoshida from Eq. 15

S(m)(t,w1,,wm)\displaystyle S^{(m)}(t,w_{1},\cdots,w_{m}) =etwmX/2etwmYet(wm+wm1)X/2etwm1Yet(wm1+wm2)X/2\displaystyle=e^{tw_{m}X/2}e^{tw_{m}Y}e^{t(w_{m}+w_{m-1})X/2}e^{tw_{m-1}Y}e^{t(w_{m-1}+w_{m-2})X/2}
etw0Yet(w1+w0)X/2etw1YetwmX/2\displaystyle\quad\cdots e^{tw_{0}Y}e^{t(w_{1}+w_{0})X/2}e^{tw_{1}Y}\cdots e^{tw_{m}X/2}
=etc1Xetc2Yetc4m+3X,\displaystyle=e^{tc_{1}X}e^{tc_{2}Y}\cdots e^{tc_{4m+3}X}, (31)

where in the last line we have defined the constants c1=wm/2c_{1}=w_{m}/2, c2=wmc_{2}=w_{m}, c3=(wm+wm1)/2,c_{3}=(w_{m}+w_{m-1})/2,\cdots c4m+3=wm/2c_{4m+3}=w_{m}/2. We Taylor expand this ansatz up order kk, noting that the total number of exponentials in Yoshida’s ansatz is 4m+34m+3

𝒯k[S(m)(t,w1,,wm)]\displaystyle\mathcal{T}_{k}[S^{(m)}(t,w_{1},\dots,w_{m})] =r1,,r4m+3=0r1++r4m+3kktr1++r4m+3r1!r4m+3!c1r1c4m+3r4m+3Xr1Yr2Xrk1Yr4m+3.\displaystyle=\sum_{\begin{subarray}{c}r_{1},\cdots,r_{4m+3}=0\\ r_{1}+\dots+r_{4m+3}\leq k\end{subarray}}^{k}\frac{t^{r_{1}+\dots+r_{4m+3}}}{r_{1}!\cdots r_{4m+3}!}c_{1}^{r_{1}}\dots c_{4m+3}^{r_{4m+3}}X^{r_{1}}Y^{r_{2}}\dots X^{r_{k-1}}Y^{r_{4m+3}}. (32)

We require that the product formula obtained from our solution procedure be independent of the choice of XX and YY, so need to match the coefficients for each sequence of products of XX and YY. Because XX and YY are non-commuting, we need to record coefficients for each ordered sequence. To do this in an automated way we construct a data structure to store the coefficients.

Given operators of the form ecX=I+cX+c22!X2+c33!X3+e^{cX}=I+cX+\frac{c^{2}}{2!}X^{2}+\frac{c^{3}}{3!}X^{3}+\cdots and edY=I+dY+d22!Y2+d33!Y3+e^{dY}=I+dY+\frac{d^{2}}{2!}Y^{2}+\frac{d^{3}}{3!}Y^{3}+\cdots with c,dc,d scalar coefficients and X,YX,Y general operators, we describe this Taylor expansion up to order kk using an array encoding. First, write monomials composed of XX and YY operators in lexicographical order and note that these operators can be mapped to binary numbers:

IXYXXXYYXYYXXXXXY1101110010111011110001001123456789\begin{matrix}I&X&Y&XX&XY&YX&YY&XXX&XXY&\cdots\\ 1&10&11&100&101&110&111&1000&1001&\cdots\\ 1&2&3&4&5&6&7&8&9&\cdots\end{matrix} (33)

To construct a bit string, we map each XX to 0 and each YY to 1, then place a 1 on the left to flag the length of the string, as shown in the second line of Eq. 33. Then, to obtain the operator products, simply remove the leading 1 and then map 0 to XX and 11 to YY. The empty string corresponds to the identity II. Now, to store the coefficients in a sum of products of XX and YY, convert each product to a binary integer as above, then store the coefficient in the corresponding location in a vector.

By way of illustration, consider the polynomial 10I+3X+2Y+2X2+YX10I+3X+2Y+2X^{2}+YX. This operator would be stored in an array as [10,3,2,2,0,1,0,][10,3,2,2,0,1,0,\cdots]. In this way, any polynomial of XX and YY can be efficiently stored in a vector. We denote this vector storing the coefficients of operators of order up to kk as veck[p(X,Y)]\mathrm{vec}_{k}[p(X,Y)], where kk denotes that the vector will only store the coefficients of the corresponding operators up to order kk (so a vector of length 2k+112^{k+1}-1) and p(X,Y)p(X,Y) is the polynomial in terms of XX and YY.

This approach can be used to solve for product formulae, but is also very effective at further improving the performance of solutions that have already been found. If a solution is of 8th order (for example), then we can consider the error using the Taylor expansion up to and including the 9th order. This will give nonzero error for the 8th order product formula, so minimising the error gives a new product formula. That can be used as a starting point to again solve for an 8th order formula, and often it is found that the product formula adjusted in that way has reduced error.

IV Product formula search

IV.1 Improved 8th order

Following the method outlined in Section III, we have searched for new product formulae and found thousands of product formulae of 8th order. The case m=7m=7 yields product formulae of minimal length, because the number of equations is equal to the number of unknowns. We have found over 600 solutions, and the search now finds almost only repeated solutions and very few new solutions. This indicates that we have found nearly all solutions, but it is also possible that there are many more solutions with large values of w\vec{w}. Indeed, the most recent new solutions we found have significantly larger values of w\vec{w}. We find that large values of w\vec{w} correspond to worse product formulae with larger error. Therefore, even if there are many more solutions with large w\vec{w}, they likely will not give improved performance over those we have already found.

Once we have obtained the potential solutions, we generate random Hamiltonians and compute the product formula errors as a function of time. For 8th order product formulae we know that the product formula error is (t9)\order{t^{9}}. We check the error scaling by picking two times t1t_{1} and t2t_{2} and computing errors δ1\delta_{1} and δ2\delta_{2} at these times, then we compute log(δ1/δ2)/log(t1/t2)\log(\delta_{1}/\delta_{2})/\log(t_{1}/t_{2}) and check that it is close to 99. As the error for our product formulae is given by δ(t)=χt9\delta(t)=\chi t^{9} where χ\chi is a constant factor, we can compute χ\chi for each of them by from χ=δ/t9\chi=\delta/t^{9}. For each product formula, we compute an average constant factor error; this average corresponds to the geometric mean of the constant factors computed for each random Hamiltonian. This method of comparing the performance of product formulae through the estimation of the constant factor in the error has been used before (see for example [33]) and is considered a good approximation to the performance of the product formula in practice.

The best performing 8th order product formula of minimal length m=7m=7 is shown in Table 1. This has average constant factor χ=5.8×106\chi=5.8\times 10^{-6}, where we are measuring error using the spectral norm. We evaluated all the 8th order solutions of Yoshida, and found Solution D was best, with a constant χ=9.7×104\chi=9.7\times 10^{-4}. See Section VI for more extensive comparisons other product formulae. The best solution we found for m=7m=7 corresponds to that reported as s15odr8 in [21], where the relation between the coefficients given is wj=δ8jw_{j}=\delta_{8-j} (so they list the coefficients in reverse order to ours). The extensive nature of our search indicates that the product formula reported in Ref. [21] is optimal for minimal-length 8th order product formulae.

Best 8th order with m=7m=7 Best 8th order with m=8m=8
w1w_{1} 0.3152930923967665966320566638110.315293092396766596632056663811 0.291373847679866630965285009680490.29137384767986663096528500968049
w2w_{2} 0.334624918245298183784957979882180.33462491824529818378495797988218 0.260203942349041502773166677098640.26020394234904150277316667709864
w3w_{3} 0.29906418130365592384446354068860.2990641813036559238444635406886 0.186696481495406875498319029999110.18669648149540687549831902999911
w4w_{4} 0.57386247111608226665638772663554-0.57386247111608226665638772663554 0.40049110428180105319963667975074-0.40049110428180105319963667975074
w5w_{5} 0.190754710296238379953876256450370.19075471029623837995387625645037 0.159827622086099232173901661272560.15982762208609923217390166127256
w6w_{6} 0.40910082580003159399730009589356-0.40910082580003159399730009589356 0.38400573301491401473462588779099-0.38400573301491401473462588779099
w7w_{7} 0.741670364350612953448227801783810.74167036435061295344822780178381 0.561488452663564468935907295728080.56148845266356446893590729572808
w8w_{8} NA 0.127833609862841108378575549504430.12783360986284110837857554950443
Table 1: Our best-performing 8th order solutions with m=7m=7 and 8.
Best 8th order for spectral-norm error Best 8th order for eigenvalue error
w1w_{1} 0.593580604008506258635140592652240.59358060400850625863514059265224 0.104676365322458952523407325798530.10467636532245895252340732579853
w2w_{2} 0.46916012347004197296293264921328-0.46916012347004197296293264921328 0.57896999331780988041471955125778-0.57896999331780988041471955125778
w3w_{3} 0.27435664258984679072282428781460.2743566425898467907228242878146 0.575033501600617859461415632798910.57503350160061785946141563279891
w4w_{4} 0.171938794846567730599190749653770.17193879484656773059919074965377 0.122310118687070297865613975426630.12231011868707029786561397542663
w5w_{5} 0.234398744825413844154305787475410.23439874482541384415430578747541 0.277931499990395248167339033017470.27793149999039524816733903301747
w6w_{6} 0.48616424480326193899617759997914-0.48616424480326193899617759997914 0.37349605088056728482635987352576-0.37349605088056728482635987352576
w7w_{7} 0.496173673881146603548717570449060.49617367388114660354871757044906 0.115755665894804632206165439724030.11575566589480463220616543972403
w8w_{8} 0.32660218948439130114501815323814-0.32660218948439130114501815323814 0.14646456109758006187125692303260.1464645610975800618712569230326
w9w_{9} 0.232716793493698576794454102705570.23271679349369857679445410270557 0.39443578322284085764474498594073-0.39443578322284085764474498594073
w10w_{10} 0.0982495574147085332734719061806430.098249557414708533273471906180643 0.443702287260212189231971411831960.44370228726021218923197141183196
Table 2: Our best-performing 8th order solutions when setting m=10m=10.

We have also conducted a search for 8th order solutions with m=8m=8, finding over 600 solutions. Using m=8m=8 results in an underdetermined system of equations with continuous sets of solutions, and nearly half of these solutions were close (with a norm less than 0.01). The underdetermined nature of the solutions also gives the flexibility to adjust the solution to reduce the error by further using the method described in Section III. The best solution found is given in Table 1, with an average constant factor of χ=5.7×107\chi=5.7\times 10^{-7}, which is an order of magnitude improvement with only a slight increase in the number of exponentials. This solution is close to solutions s17odr8a and s17odr8b given in Ref. [21], suggesting that the difference is due to the system of equations being underdetermined.

Extending our search further to m=10m=10 we found over 100,000 solutions, including solutions improving over those in prior work. Our best solution is given in Table 2 and has a constant of χ=4.9×108\chi=4.9\times 10^{-8}. This new 8th order product formula for m=10m=10 improves 40,000 times over the ones provided by Yoshida. An alternative measure of the performance of the product formulae is the accuracy of the eigenvalues. When considering this error, the product formula has a somewhat smaller constant factor ζ=1.1×108\zeta=1.1\times 10^{-8}. It is also possible to select product formulae to provide the minimum error in the eigenvaules. When we do this, we obtain the second solution given in Table 2, which has the significantly smaller constant factor in the error ζ=1.6×109\zeta=1.6\times 10^{-9}. That is better than any prior product formulae we have tested when accounting for the number of exponentials.

A further improvement in product formulae can be obtained by using processing [24]. We have performed a search for high-accuracy 8th order product formulae using this method with m=8m=8, and found one particular example that provides excellent performance. In our testing this solution is found repeatedly, and provides significantly lower error than any other solutions. Because the equations are underdetermined, the solution may be adjusted to provide best performance.

Best 8th order kernel Processor for kernel
w1w_{1} 0.217841766817310060746819691865130.21784176681731006074681969186513 0.44324901019570126590495430949294-0.44324901019570126590495430949294 γ1\gamma_{1}
w2w_{2} 0.19470177060539032240224563429070.1947017706053903224022456342907 0.254598571920037728506223770669440.25459857192003772850622377066944 γ2\gamma_{2}
w3w_{3} 0.183724132811455899442616421803630.18372413281145589944261642180363 0.73862036266779261573694538099739-0.73862036266779261573694538099739 γ3\gamma_{3}
w4w_{4} 0.37307499512657736825709230652023-0.37307499512657736825709230652023 0.00024139614958652134370419495289618-0.00024139614958652134370419495289618 γ4\gamma_{4}
w5w_{5} 0.157576442575691463730336620604610.15757644257569146373033662060461 0.738734603541253657393797538749640.73873460354125365739379753874964 γ5\gamma_{5}
w6w_{6} 0.33342207567391682979227850551172-0.33342207567391682979227850551172 0.20285971152536085519251666906017-0.20285971152536085519251666906017 γ6\gamma_{6}
w7w_{7} 0.517886496829879242817871422268030.51788649682987924281787142226803 0.449895216896768695718276374240460.44989521689676869571827637424046 γ7\gamma_{7}
w8w_{8} 0.214564754998977669863812196217610.21456475499897766986381219621761 0.295383980078768711840267475056570.29538398007876871184026747505657 γ8\gamma_{8}
0.3364996155865700091428329802017-0.3364996155865700091428329802017 γ9\gamma_{9}
Table 3: Our best-performing 8th order kernel and processor when setting m=8m=8. The value of γ10\gamma_{10} for the processor is chosen such that the sum is zero.

Our best solution is given in Table 3. For the kernel there are many solutions for the processor, but we list the one with the best performance of the solutions we have found. This one was found by solving for many (hundreds of thousands) of processors, picking the one with the best performance, and further optimising. This kernel provides a constant of ζ=2.2×109\zeta=2.2\times 10^{-9}, only slightly larger than the best length m=10m=10 product formula, while providing better performance due to the shorter length. Moreover, using the processor the constant for the spectral-norm error is χ=5.4×108\chi=5.4\times 10^{-8}. That is only about 10% more than the best length m=10m=10 formula, while again providing better performance due to the shorter length.

IV.2 Finding 10th order

We have also used our solution procedure to find new 1010th order product formulae. We generalised Yoshida’s method, and find 1515 independent equations to be solved (see Appendix A). We performed searches for solutions with m=15m=15 (the minimal number) to m=18m=18. Again the larger values of mm give the flexibility to adjust the solution to reduce the error. We report our 10th order product formulae for m=15m=15, 1616 and 1818 in Table 4 that are best for spectral-norm error. We don’t report values for m=17m=17 as the performance of the product formulae found were worse than for m=16m=16. In Table 5 we report product formulae for m=17m=17 and m=18m=18 with our best results for eigenvalue error.

As in Section IV.1, we compare the performance of product formulae of 1010th order by computing the constant factors χ\chi and ζ\zeta for random Hamiltonians. For the best solution with m=16m=16 we have a constant factor of χ=1.9×108\chi=1.9\times 10^{-8}, and the best solution with m=15m=15 has χ=4.5×107\chi=4.5\times 10^{-7}, which is about a factor of 24 times worse. The far better constant factor for m=16m=16 is far more significant than the slightly larger number of exponentials in the product formula.

Further increasing mm to 18 gives a product formula with χ=3.1×109\chi=3.1\times 10^{-9}; a significant further improvement. Again we can consider the error in terms of the eigenvalues; in this case the constant for eigenvalue error is ζ=2.2×109\zeta=2.2\times 10^{-9}, only slightly less. Selecting the best performing product formula for eigenvalue error gives the significantly smaller constant ζ=4.2×1010\zeta=4.2\times 10^{-10}. However, out of our solutions the best eigenvalue performance is given by a m=17m=17 solution, which yields ζ=1.8×1010\zeta=1.8\times 10^{-10}. These solutions are listed in Table 5.

In the search for 10th order product formulae, unlike in the case of 8th order, we find that almost all new solutions found are different from those found before. That indicates that there is an extremely large number of solutions, and we have only found a very small proportion of them. Indeed, the solutions are distinct from those in prior work.

Best 10th order solution with m=15m=15 Best 10th order solution with m=16m=16 Best 10th order solution with m=18m=18
w1w_{1} 0.145528599554994297390881355966180.14552859955499429739088135596618 0.4945013179955571856347147977644-0.4945013179955571856347147977644 0.019042478645106035261914181501875
w2w_{2} 0.48773512068133537309419933740564-0.48773512068133537309419933740564 0.29043172229701214798784142920930.2904317222970121479878414292093 0.48337326409346903272186302946692-0.48337326409346903272186302946692
w3w_{3} 0.127620112424295359097273423016560.12762011242429535909727342301656 0.347815410687053309379138902810030.34781541068705330937913890281003 0.0350609617418791924512981026252190.035060961741879192451298102625219
w4w_{4} 0.702254500194857512201430805879590.70225450019485751220143080587959 0.98828132118546184603769781410676-0.98828132118546184603769781410676 0.206904753315059920818840483197250.20690475331505992081884048319725
w5w_{5} 0.62035679146761710925756521405042-0.62035679146761710925756521405042 0.988551875327564052357339573056130.98855187532756405235733957305613 0.0395543422698003833122129598795870.039554342269800383312212959879587
w6w_{6} 0.390991524127861781336888693731140.39099152412786178133688869373114 0.34622976933123177430694714630668-0.34622976933123177430694714630668 0.0620108373564010489971199186373920.062010837356401048997119918637392
w7w_{7} 0.178602536043554658077910413670450.17860253604355465807791041367045 0.202189526190731175547142803670180.20218952619073117554714280367018 0.46961231983086041266381539270133-0.46961231983086041266381539270133
w8w_{8} 0.80455783177921776295588528272593-0.80455783177921776295588528272593 0.130642730697862477872088954714610.13064273069786247787208895471461 0.15137223243888068391593992998235-0.15137223243888068391593992998235
w9w_{9} 0.0530872164427582421186873856462830.053087216442758242118687385646283 0.26441199183146805554735845490359-0.26441199183146805554735845490359 0.131862227457093955766755947637840.13186222745709395576675594763784
w10w_{10} 0.868363079102755562586870309047530.86836307910275556258687030904753 0.0609991405592104088690969922915310.060999140559210408869096992291531 0.446286633031363751451227850148950.44628663303136375145122785014895
w11w_{11} 0.85326297197907834671536254437991-0.85326297197907834671536254437991 0.6855442489606141359108973267028-0.6855442489606141359108973267028 0.31721379667717916478350053562451-0.31721379667717916478350053562451
w12w_{12} 0.11732457198874083224967699358383-0.11732457198874083224967699358383 0.15843692473786584550599206557006-0.15843692473786584550599206557006 0.443135886497766937051542310638710.44313588649776693705154231063871
w13w_{13} 0.038273454941860566324069477720470.03827345494186056632406947772047 0.154146917799582991502864522155750.15414691779958299150286452215575 0.168870075841530915113951194341710.16887007584153091511395119434171
w14w_{14} 0.748435290295324982339977933053570.74843529029532498233997793305357 0.667152058272143203710618392970550.66715205827214320371061839297055 0.22652658662557993653900899346103-0.22652658662557993653900899346103
w15w_{15} 0.302087156219757737124109480259060.30208715621975773712410948025906 0.204118744746965982896036776935110.20411874474696598289603677693511 0.130537362971372324831814273840480.13053736297137232483181427384048
w16w_{16} NA 0.0812073182102725932250877114416840.081207318210272593225087711441684 0.113373010502856510538193091878020.11337301050285651053819309187802
w17w_{17} NA NA 0.0561995576601481087980289602381240.056199557660148108798028960238124
w18w_{18} NA NA 0.0389183231157940120698689898639520.038918323115794012069868989863952
Table 4: Our best performing 10th order solutions for spectral-norm error with m=15m=15, m=16m=16 and m=18m=18.
Best 10th order solution with m=17m=17 Best 10th order solution with m=18m=18
w1w_{1} 0.28371232689144296279654621726493-0.28371232689144296279654621726493 0.0257225546230064804937263083965860.025722554623006480493726308396586
w2w_{2} 0.0467795047781473816053310002782230.046779504778147381605331000278223 0.0246739230893921545351006435103440.024673923089392154535100643510344
w3w_{3} 0.368458923827977706196575042175390.36845892382797770619657504217539 0.40545153312882551694596948883526-0.40545153312882551694596948883526
w4w_{4} 0.191862040946745147397604081974610.19186204094674514739760408197461 0.0868703233642572821810730619151680.086870323364257282181073061915168
w5w_{5} 0.53123134392680669702873064192428-0.53123134392680669702873064192428 0.123688993477720196561372765419420.12368899347772019656137276541942
w6w_{6} 0.0081253242720827266680816105600661-0.0081253242720827266680816105600661 0.345995910690833611017910996186560.34599591069083361101791099618656
w7w_{7} 0.16389450414378567860032917538393-0.16389450414378567860032917538393 0.0467656785177405507055480614868110.046765678517740550705548061486811
w8w_{8} 0.185147661192914050325286478810.18514766119291405032528647881 0.27103335145245847800657868572535-0.27103335145245847800657868572535
w9w_{9} 0.53835846947546819891746688065050.5383584694754681989174668806505 0.133985944712009432612550655678660.13398594471200943261255065567866
w10w_{10} 0.30583981835573485697292316732177-0.30583981835573485697292316732177 0.45010365706956744617357917877887-0.45010365706956744617357917877887
w11w_{11} 0.431999356095233012892954737744880.43199935609523301289295473774488 0.336998581130233993975879063628810.33699858113023399397587906362881
w12w_{12} 0.15105023016317868530201246128130.1510502301631786853020124612813 0.142864790240772765059292639270290.14286479024077276505929263927029
w13w_{13} 0.35051099204829676098801520498121-0.35051099204829676098801520498121 0.30679647776174213774450994020067-0.30679647776174213774450994020067
w14w_{14} 0.10329711258442916745115130076610.1032971125844291674511513007661 0.0487858611989213843225723809488580.048785861198921384322572380948858
w15w_{15} 0.150439369438171526973719468062290.15043936943817152697371946806229 0.0352584836310526203048822071894390.035258483631052620304882207189439
w16w_{16} 0.121184694986507365114104915868460.12118469498650736511410491586846 0.22380268023236595677874655821875-0.22380268023236595677874655821875
w17w_{17} 0.104377427795478263582966815574440.10437742779547826358296681557444 0.423464497594125058720945262324330.42346449759412505872094526232433
w18w_{18} NA 0.148887054638054557024546293537630.14888705463805455702454629353763
Table 5: Our best performing 10th order solutions for eigenvalue error with m=17m=17 and m=18m=18.

V Methods for comparison of product formulae

V.1 Same order comparison

We make a fair comparison between product formulae of different length in the following way. An order kk integrator for time tt will have an error δ=χtk+1\delta=\chi t^{k+1} where χ\chi is a real constant. Let TT be the total evolution time for an integrator of order kk, and ε\varepsilon be the maximum allowable error. Subdivide the evolution time TT into rr subintervals, so t=T/rt={T}/{r} is the length of each time subinterval. We thus have χ(T/r)k+1ε/r\chi\left({T}/{r}\right)^{k+1}\approx{\varepsilon}/{r}, which gives

r(χTϵ)1/kT.\displaystyle r\approx\left(\frac{\chi T}{\epsilon}\right)^{{1}/{k}}T. (34)

As explained above, the number of exponentials in the product is (4m+2)(J1)+1(4m+2)(J-1)+1. When applying products of these product formulae, two exponentials can be combined, so the effective number for each is (4m+2)(J1)(4m+2)(J-1). As a result, the total number of exponentials can be given as proportional to

M(χTϵ)1/kT,\displaystyle M\left(\frac{\chi T}{\epsilon}\right)^{{1}/{k}}T\,, (35)

where we have ignored a common factor of 2(J1)2(J-1), and M=2m+1M=2m+1. If we wish to compare product formulae of the same order, then we need only compare the values of Mχ1/kM\chi^{1/k}, and the one with the smaller value is the more efficient product formula. Similarly, if we consider eigenvalue error then we should compare Mζ1/kM\zeta^{1/k} between the formulae.

V.2 Thresholds for different order

If we wish to compare product formulae of different order, then we need to take account of the values of TT and ε\varepsilon. Assume we have two integrators of order k1k_{1} and k2k_{2}, with corresponding constants χ1\chi_{1}, χ2\chi_{2}. Given TT and ε\varepsilon, when the two integrators use the same number of exponentials we have M1r1=M2r2M_{1}r_{1}=M_{2}r_{2}, thus

M1(χ1Tε)1/k1T\displaystyle M_{1}\left(\frac{\chi_{1}T}{\varepsilon}\right)^{{1}/{k_{1}}}T =M2(χ2Tε)1/k2T\displaystyle=M_{2}\left(\frac{\chi_{2}T}{\varepsilon}\right)^{{1}/{k_{2}}}T (36)
Tε\displaystyle\implies\quad\frac{T}{\varepsilon} =(M2χ21/k2M1χ11/k1)11k11k2.\displaystyle=\left(\frac{M_{2}\chi_{2}^{1/k_{2}}}{M_{1}\chi_{1}^{1/k_{1}}}\right)^{\frac{1}{\frac{1}{k_{1}}-\frac{1}{k_{2}}}}. (37)

This gives the threshold beyond which the higher-order product formula should be used for improved performance. Again, when considering eigenvalue error, χ\chi would be replaced with ζ\zeta.

A limitation of this analysis it is assumed that the time step (ϵ/(χT))1/k(\epsilon/(\chi T))^{1/k} is small enough for the scaling law for the error to hold. To adjust the threshold for cases where the time step is large, we can consider a more general functional dependence of the error on the time interval as f(t)f(t). Then we require for time TT and error ϵ\epsilon that

f(t)×T/t=ϵ.f(t)\times T/t=\epsilon\,. (38)

The cost is then proportional to M×r=MT/tM\times r=MT/t. For the threshold between two product formulae, we require

M1T/t1=M2T/t2.M_{1}T/t_{1}=M_{2}T/t_{2}\,. (39)

That implies t2=t1M2/M1t_{2}=t_{1}M_{2}/M_{1}. For the total error to be equal to ϵ\epsilon in each case, we then require

f1(t1)×T/t1=f2(t2)×T/t2.f_{1}(t_{1})\times T/t_{1}=f_{2}(t_{2})\times T/t_{2}\,. (40)

That can be rearranged to give

f1(t1)=f2(t1M2/M1)×M1/M2.f_{1}(t_{1})=f_{2}(t_{1}M_{2}/M_{1})\times M_{1}/M_{2}\,. (41)

This is an expression we can solve for t1t_{1}, which may then be used to determine the T/ϵT/\epsilon threshold from

T/ϵ=t1/f1(t1).T/\epsilon=t_{1}/f_{1}(t_{1})\,. (42)

This means that the threshold is still for the ratio T/ϵT/\epsilon, rather than having separate dependence on TT and ϵ\epsilon.

V.3 Motivation for considering eigenvalue error

A further question is the measure of the error to be used. If the goal is to estimate eigenvalues of the Hamiltonian (as is often the case in quantum chemistry), then the measure of the error should be that in the eigenvalues. That is, how close are the eigenvalues of the product formula to those of the exact Hamiltonian evolution. In the case that the goal is to accurately reproduce the final state after the evolution, then an appropriate measure of the error should be the spectral norm of the difference of the unitary operators. This error accounts for both error in the eigenvalues and basis.

That error will upper bound the 2-norm error in the generated quantum state, but using the triangle inequality to bound the error for long evolution times overestimates the error. This is because the error beyond that in the eigenvalues cancels when using a product of many short time intervals. Let us denote the exact evolution operator for short time UU, and the approximate evolution operator provided by the product formula as U~\widetilde{U}. In the basis of the Hamiltonian’s eigenstates, UU is diagonal, and we can diagonalise U~\widetilde{U} in this basis as U~=VDV\widetilde{U}=VDV^{\dagger}. Then the difference between DD and UU describes the eigenvalue error, since these are diagonal matrices with the eigenvalues on the diagonal. The matrix VV describes the basis error.

It is convenient to write V=eiH~V=e^{i\tilde{H}} for some Hermitian matrix H~\tilde{H}. Then the error can be given as

U~U\displaystyle\|\widetilde{U}-U\| =eiH~DeiH~U\displaystyle=\|e^{i\tilde{H}}De^{-i\tilde{H}}-U\|
=(I+iH~+𝒪(H~2))D(IiH~+𝒪(H~2))U\displaystyle=\|(I+i\tilde{H}+\mathcal{O}(\tilde{H}^{2}))D(I-i\tilde{H}+\mathcal{O}(\tilde{H}^{2}))-U\|
=i[H~,D]+DU+𝒪(H~2)\displaystyle=\|i[\tilde{H},D]+D-U+\mathcal{O}(\tilde{H}^{2})\|
[H~,DI]+DU+𝒪(H~2).\displaystyle\leq\|[\tilde{H},D-I]\|+\|D-U\|+\mathcal{O}(\tilde{H}^{2})\,. (43)

That is, the error can be split up between a part [H~,DI][\tilde{H},D-I] corresponding to basis error, and a part DUD-U corresponding to eigenvalue error. Note that DID-I is proportional to tt, so the error in the operator due to the basis error is one order higher than the error in the basis.

Then, for the case of a large number of time steps rr, we can bound the overall error as

U~rUr\displaystyle\|\widetilde{U}^{r}-U^{r}\| =VDrVUr\displaystyle=\|VD^{r}V^{\dagger}-U^{r}\|
=(VDrVDrV)+(DrVUrV)+(UrVUr)\displaystyle=\|(VD^{r}V^{\dagger}-D^{r}V^{\dagger})+(D^{r}V^{\dagger}-U^{r}V^{\dagger})+(U^{r}V^{\dagger}-U^{r})\|
VI+DrUr+VI\displaystyle\leq\|V-I\|+\|D^{r}-U^{r}\|+\|V^{\dagger}-I\|
2VI+rDU.\displaystyle\leq 2\|V-I\|+r\|D-U\|\,. (44)

That is, the error due to the eigenvalues is multiplied by a factor of rr, but the error due to the basis is not. The error in the basis can be larger than for a single time step, but only by a factor of 1/t\sim 1/t. That factor arises because for a single time step the basis error is commuted with DID-I which is proportional to tt, so the expression here without DID-I can be a factor 1/t\propto 1/t larger.

In practice we find that in many cases the error in the basis scales as 𝒪(tk)\mathcal{O}(t^{k}), resulting in the expected contribution to the spectral-norm error 𝒪(tk+1)\mathcal{O}(t^{k+1}). However, in other cases we find scaling of the error in the basis as 𝒪(tk+1)\mathcal{O}(t^{k+1}), resulting in a higher-order contribution to the spectral-norm error. Let us give the error in the eigenvalues as ζtk+1\zeta t^{k+1}, and the error in the basis as μtk+ν\mu t^{k+\nu} where ν[0,1]\nu\in[0,1] to account for the various scalings found numerically.

Then the choice of rr to make the error in the eigenvalues smaller than ϵ\epsilon is

r(ζTϵ)1/kT.r\approx\left(\frac{\zeta T}{\epsilon}\right)^{1/k}T. (45)

The contribution to the total error from the basis error is

2μ(T/r)k+ν2μ(ϵζT)1+ν/k.2\mu(T/r)^{k+\nu}\approx 2\mu\left(\frac{\epsilon}{\zeta T}\right)^{1+\nu/k}\,. (46)

That gives the ratio of the basis error to the eigenvalue error as approximately

2μϵ(ϵζT)1+ν/k=2μζT(ϵζT)ν/k.\frac{2\mu}{\epsilon}\left(\frac{\epsilon}{\zeta T}\right)^{1+\nu/k}=\frac{2\mu}{\zeta T}\left(\frac{\epsilon}{\zeta T}\right)^{\nu/k}\,. (47)

Hence, for large TT the eigenvalue error should be dominant. This means that the eigenvalue error for a single step is the relevant measure to estimate the spectral-norm error for long-time evolution.

VI Numerical comparison of product formulae

VI.1 Comparison for general matrices

In what follows, we report on the comparison between product formulae of 4th, 6th, 8th and 10th order based on the threshold provided above. We analyse those found in this work, and those from prior work including processed product formulae. When giving the number of stages of a processed product formula, we refer to the number of stages in the kernel (not PP), because those are the ones that would be repeated for a simulation over an extended time.

We list 4th order product formulae in Table 6, 6th order in Table 7, 8th order in Table 8, and 10th order in Table 9, and give the constant factors in the error scaling χ,ζ\chi,\zeta (spectral norm or eigenvalue error). These product formulae include some constructed with even MM rather than M=2m+1M=2m+1 as for our product formulae. We generated 10,000 random Hamiltonians for each product formula, and then estimated the constants χ\chi and ζ\zeta by taking the geometric mean. Each random Hamiltonian is generated as a sum of two random Hamiltonians H=A+BH=A+B, where AA and BB are random Hermitian matrices of dimension 6×66\times 6 and norm 1.

label MM processing reference χ\chi Mχ1/kM\chi^{1/k} ζ\zeta Mζ1/kM\zeta^{1/k}
S4m1 33 no first Suzuki product [1] 4.5×1024.5\times 10^{-2} 1.38 3.0×1023.0\times 10^{-2} 1.25
S4m2 55 no second Suzuki product [2] 2.6×1032.6\times 10^{-3} 1.13 4.2×1044.2\times 10^{-4} 0.72
O4M5 55 no Ostmeyer Eq. (40) of [34] 2.9×1042.9\times 10^{-4} 0.65 1.2×1041.2\times 10^{-4} 0.52
BM4M6 66 no S6S_{6} Table 2 of [35] 1.5×1041.5\times 10^{-4} 0.67 3.6×1053.6\times 10^{-5} 0.47
BCE4m6b 1212 no BM[4]6{}_{6}^{[4]} Table 1 of [31] 4.5×1054.5\times 10^{-5} 0.99 2.8×1052.8\times 10^{-5} 0.87
PPBCM4m6 1212 yes P64P_{6}4 Table 5 of [24] 5.0×1055.0\times 10^{-5} 1.01 1.4×1051.4\times 10^{-5} 0.74
BCE4m3 66 yes ψ3[4]\psi_{3}^{[4]} Table 6 of [31] 1.5×1021.5\times 10^{-2} 2.09 1.8×1031.8\times 10^{-3} 1.23
BCE4m4 88 yes ψ4[4]\psi_{4}^{[4]} Table 6 of [31] 5.0×1045.0\times 10^{-4} 1.19 1.3×1041.3\times 10^{-4} 0.85
BCE4m5 1010 yes ψ5[4]\psi_{5}^{[4]} Table 6 of [31] 5.9×1055.9\times 10^{-5} 0.88 2.7×1052.7\times 10^{-5} 0.72
BCE4m6 1212 yes ψ6[4]\psi_{6}^{[4]} Table 6 of [31] 2.3×1052.3\times 10^{-5} 0.83 8.9×1068.9\times 10^{-6} 0.66
BCE4m7 1414 yes ψ7[4]\psi_{7}^{[4]} Table 6 of [31] 1.2×1051.2\times 10^{-5} 0.82 3.9×1063.9\times 10^{-6} 0.62
BCE4m8 1616 yes ψ8[4]\psi_{8}^{[4]} Table 6 of [31] 6.8×1066.8\times 10^{-6} 0.82 2.0×1062.0\times 10^{-6} 0.60
BCE4m9 1818 yes ψ9[4]\psi_{9}^{[4]} Table 6 of [31] 4.5×1064.5\times 10^{-6} 0.83 1.1×1061.1\times 10^{-6} 0.58
Table 6: The list of 4th order product formulae. The column labelled “processing” indicates whether the formula uses processor. The label is the name we will refer to the formula by. The constant factor in the error is denoted χ\chi for spectral-norm error and ζ\zeta for eigenvalue error, and the corresponding quantities Mχ1/kM\chi^{1/k} and Mζ1/kM\zeta^{1/k} are given. The best results are highlighted in bold. Results for product formulae that are not products of S2S_{2} are given in italics.
label MM processing reference χ\chi Mχ1/kM\chi^{1/k} ζ\zeta Mζ1/kM\zeta^{1/k}
S6m1 99 no first Suzuki product [1] 4.0×1024.0\times 10^{-2} 5.26 3.2×1023.2\times 10^{-2} 5.08
S6m2 2525 no second Suzuki product [2] 1.0×1051.0\times 10^{-5} 3.68 2.6×1072.6\times 10^{-7} 1.99
Y6m3a 77 no Solution A in Table 1 of [20] 1.7×1031.7\times 10^{-3} 2.42 1.3×1031.3\times 10^{-3} 2.32
KL6s9a 99 no s9odr6a in Appendix A of [21] 2.5×1042.5\times 10^{-4} 2.26 2.0×1042.0\times 10^{-4} 2.18
KL6s9b 99 no s9odr6b in Appendix A of [21] 2.5×1042.5\times 10^{-4} 2.25 2.0×1042.0\times 10^{-4} 2.18
SS6s11 1111 no Section 4.2 of [22] 3.4×1053.4\times 10^{-5} 1.98 1.7×1051.7\times 10^{-5} 1.77
SS6s13 1313 no Section 4.2 of [22] 1.7×1051.7\times 10^{-5} 2.08 4.0×1064.0\times 10^{-6} 1.64
BM6M10 1010 no S10S_{10} Table 2 of [35] 5.5×1065.5\times 10^{-6} 1.33 2.0×1062.0\times 10^{-6} 1.12
BCE6m10b 2020 no BM[6]10{}_{10}^{[6]} Table 1 of [31] 5.4×1065.4\times 10^{-6} 2.65 7.8×1077.8\times 10^{-7} 1.92
PPBCM6m9 99 yes P96P_{9}6 Table 5 of [24] 2.0×1062.0\times 10^{-6} 2.02 1.2×1071.2\times 10^{-7} 1.26
PPBCM6m5 1111 yes P116P_{11}6 in Table 6 of [24] 1.6×1061.6\times 10^{-6} 1.19 9.6×1079.6\times 10^{-7} 1.09
PPBCM6m6 1313 yes P136P_{13}6 in Table 6 of [24] 4.2×1074.2\times 10^{-7} 1.13 2.6×1072.6\times 10^{-7} 1.04
BCE6m5 1010 yes ψ5[6]\psi_{5}^{[6]} Table 8 of [31] 5.1×1035.1\times 10^{-3} 4.15 4.0×1034.0\times 10^{-3} 3.98
BCE6m6 1212 yes ψ6[6]\psi_{6}^{[6]} Table 8 of [31] 6.5×1056.5\times 10^{-5} 2.41 2.6×1052.6\times 10^{-5} 2.07
BCE6m7 1414 yes ψ7[6]\psi_{7}^{[6]} Table 8 of [31] 1.3×1051.3\times 10^{-5} 2.14 3.0×1063.0\times 10^{-6} 1.69
BCE6m8 1616 yes ψ8[6]\psi_{8}^{[6]} Table 8 of [31] 3.6×1063.6\times 10^{-6} 1.98 4.3×1074.3\times 10^{-7} 1.39
BCE6m9 1818 yes ψ9[6]\psi_{9}^{[6]} Table 8 of [31] 5.7×1065.7\times 10^{-6} 2.41 1.7×1071.7\times 10^{-7} 1.33
BCE6m10 2020 yes ψ10[6]\psi_{10}^{[6]} Table 8 of [31] 2.4×1062.4\times 10^{-6} 2.32 9.9×1099.9\times 10^{-9} 0.93
BCE6m11 2222 yes ψ11[6]\psi_{11}^{[6]} Table 8 of [31] 6.1×1066.1\times 10^{-6} 2.97 1.5×1081.5\times 10^{-8} 1.10
Table 7: The list of 6th order product formulae with their average errors. The best results are highlighted in bold. Results for product formulae that are not products of S2S_{2} are given in italics.
label MM processing reference χ\chi Mχ1/kM\chi^{1/k} ζ\zeta Mζ1/kM\zeta^{1/k}
S8m1 2727 no first Suzuki product [1] 4.8×1024.8\times 10^{-2} 18.5 2.3×1022.3\times 10^{-2} 16.9
S8m2 125125 no second Suzuki product [2] 4.8×1094.8\times 10^{-9} 11.4 5.0×10135.0\times 10^{-13} 3.62
Y8m7d 1515 no Solution D in Table 2 of [20] 9.7×1049.7\times 10^{-4} 6.30 1.9×1041.9\times 10^{-4} 5.15
KL8s15 1515 no s15odr8 in [21] (our Table 1) 5.9×1065.9\times 10^{-6} 3.33 2.7×1062.7\times 10^{-6} 3.02
KL8s17a 1717 no s17odr8a in [21] 5.9×1075.9\times 10^{-7} 2.83 2.3×1072.3\times 10^{-7} 2.52
KL8s17b 1717 no s17odr8b in [21] 5.8×1075.8\times 10^{-7} 2.83 2.1×1072.1\times 10^{-7} 2.49
SS8s19 1919 no Section 4.3 of [22] 1.6×1071.6\times 10^{-7} 2.68 7.2×1087.2\times 10^{-8} 2.43
SS8s21 2121 no Section 4.3 of [22] 2.6×1072.6\times 10^{-7} 3.16 8.1×1088.1\times 10^{-8} 2.73
PP8s13 1313 yes P138P_{13}8 in Table 6 of [24] 1.1×1061.1\times 10^{-6} 2.33 8.6×1078.6\times 10^{-7} 2.27
PP8s19 1919 yes P198P_{19}8 in Table 6 of [24] NA NA 2.6×1072.6\times 10^{-7} 2.85
Y8m10 2121 no Table 2 (our new result) 4.9×1084.9\times 10^{-8} 2.56 1.1×1081.1\times 10^{-8} 2.13
Y8m10b 2121 no Table 2 (our new result) 5.4×1075.4\times 10^{-7} 3.45 1.6×1091.6\times 10^{-9} 1.67
YP8m8 1717 yes Table 3 (our new result) 5.4×1085.4\times 10^{-8} 2.10 2.2×1092.2\times 10^{-9} 1.41
Table 8: The list of 8th order product formulae with their average errors. The best results are highlighted in bold.
label MM processing reference χ\chi Mχ1/kM\chi^{1/k} ζ\zeta Mζ1/kM\zeta^{1/k}
S10m1 8181 no first Suzuki product [1] 7.5×1027.5\times 10^{-2} 62.5 8.1×1038.1\times 10^{-3} 50.0
S10m2 625625 no second Suzuki product [2] 2.6×10132.6\times 10^{-13} 34.5 5.9×10195.9\times 10^{-19} 9.39
KL10s31a 3131 no s31odr10a in Appendix A of [21] 6.1×1066.1\times 10^{-6} 9.33 5.4×1065.4\times 10^{-6} 9.21
KL10s31b 3131 no s31odr10b in Appendix A of [21] 6.4×1056.4\times 10^{-5} 11.88 4.2×1054.2\times 10^{-5} 11.3
SS10s31 3131 no Section 4.4 of [22] 3.4×1083.4\times 10^{-8} 5.55 2.7×1082.7\times 10^{-8} 5.43
SS10s33 3333 no Section 4.4 of [22] 1.0×1081.0\times 10^{-8} 5.25 8.1×1098.1\times 10^{-9} 5.12
SS10s35 3535 no Section 4.4 of [22] 8.0×10108.0\times 10^{-10} 4.31 4.3×10114.3\times 10^{-11} 3.22
Alberdi31 3131 no Appendix A of [36] 1.1×1071.1\times 10^{-7} 6.23 1.0×1071.0\times 10^{-7} 6.18
Alberdi33 3333 no Appendix A of [36] 5.9×1085.9\times 10^{-8} 6.25 5.2×1085.2\times 10^{-8} 6.17
Alberdi35 3535 no Appendix A of [36] 1.1×1081.1\times 10^{-8} 5.62 9.3×1099.3\times 10^{-9} 5.51
PP10s19 1919 yes P1910P_{19}10 in Table 6 of [24] NA NA 5.7×1065.7\times 10^{-6} 5.68
PP10s23 2323 yes P2310P_{23}10 in Table 6 of [24] 2.7×1052.7\times 10^{-5} 8.04 3.2×1083.2\times 10^{-8} 4.09
Y10m15 3131 no Table 4 4.5×1074.5\times 10^{-7} 7.19 4.1×1074.1\times 10^{-7} 7.12
Y10m16 3333 no Table 4 1.9×1081.9\times 10^{-8} 5.57 7.5×1097.5\times 10^{-9} 5.08
Y10m17 3535 no Table 5 1.4×1081.4\times 10^{-8} 5.75 1.8×10101.8\times 10^{-10} 3.71
Y10m18 3737 no Table 4 3.1×1093.1\times 10^{-9} 5.22 2.2×1092.2\times 10^{-9} 5.05
Y10m18b 3737 no Table 5 2.6×1082.6\times 10^{-8} 6.46 4.2×10104.2\times 10^{-10} 4.27
Table 9: The list of 10th order product formulae with their average errors. The best results are highlighted in bold, and the second-best are underlined.

To compare among product formulae of the same order, we compare Mχ1/kM\chi^{1/k} for spectral norm error, or Mζ1/kM\zeta^{1/k} for eigenvalue error, and the best results in each table are indicated by the values in bold. First we consider the comparison of χ\chi for spectral-norm error. For 4th order, the lowest value for Mχ1/kM\chi^{1/k} is that of the processed formulae BCE4m7 and BCE4m8, which yield very similar values. That is out of product formulae that are products of S2S_{2}, we will discuss the others below. In the 6th order case we find that the best is PPBCM6m6. Among the 8th order, the best performing is our processed formula YP8m8. The best 10th order is given by SS10s35, but the second-best is given by our Y10m18.

For eigenvalue error, we compare the quantity Mζ1/kM\zeta^{1/k} to establish the performance of the product formulae when they have the same order. The best performing 4th order formula for eigenvalue error is BCE4m9 for product formulae based on S2S_{2}. For 6th order, the best one is BCE6m10. Among the 8th order product formulae, our product formula YP8m8 is still the best performing. The best 10th order is still SS10s35, though the second-best is our Y10m17. Therefore, in both cases the best 10th order is SS10s35 from Ref. [22], though we have found the second-best product formulae. More importantly, our new 8th order product formula provides the best performance both in terms of spectral norm and eigenvalue error.

In Table 6 and Table 7 we also list results for some product formulae that are specific to Hamiltonians that are a sum of only two terms, and are not constructed as a product of S2S_{2}. In the case of fourth order, these give better performance than any of the other product formulae listed. In particular, for eigenvalue error the best performance is given by the formula BM4M6 of Blanes and Moan [35], whereas for spectral-norm error, a recently proposed formula O4M5 of Ostmeyer [34] gives the best performance. In the case of 6th order, the formula BM6M10 of Blanes and Moan gives very good performance, but still not better than some of the processed product formulae.

The Suzuki product formulae give poor performance in all cases as compared to the best product formulae, and Suzuki’s second formula always outperforms the first. In the case of 6th order, Suzuki’s second formula provides better performance than the best 6th order product formula of Yoshida in terms of eigenvalue error, but not spectral norm error. Suzuki’s second product formula also outperforms the best 8th order product formula of Yoshida. A surprising result is that Suzuki’s second product formula provides much smaller eigenvalue error than spectral-norm error, particularly for higher orders. For 10th order it is smaller by nearly 6 orders of magnitude. Nevertheless, despite that extremely small error, the large number of factors in the product mean that it has significantly worse performance than most other product formulae in the list.

To consider the threshold to use a higher-order product formula over a lower-order one, we first consider the asymptotic expression using Mζ1/kM\zeta^{1/k} for eigenvalue error. For 4th order the value of this quantity is 0.58 (BCE4m9) and for 6th order is 0.93 (BCE6m10), which gives a threshold of 290. The value for the best 8th order is 1.41 (YP8m8), which gives a threshold of around 1,200 when compared with 4th order and 22,000 when compared with 6th order. The thresholds from 4th order are larger using BM4M6, which is specific to Hamiltonians that are a sum of two parts. Then the threshold from 4th to 6th is 3800, and from 4th to 8th is 7000. At 10th order the best value is 3.22 (SS10s35), which when compared with 8th order gives a threshold of 2.2×10142.2\times 10^{14}. This is far greater than can be expected for realistic simulations.

Most of these thresholds based on asymptotic expressions are inaccurate, because the thresholds correspond to large time steps where the asymptotic expressions break down. An exception is the threshold for 8th to 10th order. That corresponds to a time interval around 0.20.2 that is sufficiently small that the threshold estimate is accurate. In contrast, the calculated threshold between 6th and 8th order would correspond to a time step larger than 3, which is far too large for the scaling of the error to be accurate.

In this case we find numerically that the threshold between these 6th and 8th order formulae (BCE6m10 and YP8m8) is about 70007000, which is less than the threshold calculated above using the asymptotic scaling. On the other hand, the threshold for the 6th order formula PPBCM6m6 to our 8th order formula YP8m8 is 940,000. That corresponds to about 590,000 time steps, with a time step size of about 1.61.6 (for the 8th order formula).

It is possible to optimise our kernel for the larger time step size in order to provide better performance. This optimisation provides a kernel (see Table 10) such that the threshold from PPBCM6m6 is about 82,000. This now corresponds to 36,000 time steps with a time step size of about 2.32.3. Thus the optimisation provides another order of magnitude where 8th order product formulae are optimal, albeit with a product formula tailored to the time step size.

8th order kernel for large time steps
w1w_{1} 0.172929777115435075761568461867510.17292977711543507576156846186751
w2w_{2} 0.271703024386100826297809953034550.27170302438610082629780995303455
w3w_{3} 0.219095482361175846717324736117640.21909548236117584671732473611764
w4w_{4} 0.37248751509173994928726577188503-0.37248751509173994928726577188503
w5w_{5} 0.123712152428295622845526629060280.12371215242829562284552662906028
w6w_{6} 0.38248795584080401246916516638535-0.38248795584080401246916516638535
w7w_{7} 0.544581169396945516083785197118770.54458116939694551608378519711877
w8w_{8} 0.217032194945120289230532518242150.21703219494512028923053251824215
Table 10: The 8th order kernel tailored for large time step size to improve the threshold between 6th and 8th order.

One must also be careful in considering thresholds based on eigenvalue error, because there needs to be a large number of time steps in order for the spectral-norm error in the complete evolution time to be dominated by the eigenvalue error. There is a factor of about 24 between the spectral-norm error and the eigenvalue error for the 8th order product formula YP8m8. For the threshold between 6th order and 8th order the number of time steps is orders of magnitude larger than this, which indicates that the eigenvalue error is an appropriate measure.

These threshold calculations are for Hamiltonians that are sums of two terms, each of which is normalised. In the case where the Hamiltonian is not normalised, the threshold should be scaled by the norm. In the simple case where the Hamiltonian is a sum of two terms with equal norms A=B\|A\|=\|B\|, then the threshold will be for AT/ϵ\|A\|T/\epsilon rather than T/ϵT/\epsilon. More generally the Hamiltonian may be a sum of any number of terms with different norms, which would change the threshold and make it unclear what quantity the threshold should be considered in terms of. There is also the possibility that the threshold can be changed for Hamiltonians that have structure to them instead of being random, for example those for quantum chemistry.

VI.2 Comparison for fermionic Hamiltonians

Fermionic Hamiltonians encountered in quantum chemistry often have the form

p,q=1dτpqapaq+p,q=1dνpqapapaqaq\displaystyle\sum_{p,q=1}^{d}\tau_{pq}a^{{\dagger}}_{p}a_{q}+\sum_{p,q=1}^{d}\nu_{pq}a^{{\dagger}}_{p}a_{p}a^{{\dagger}}_{q}a_{q} (48)

where apa^{\dagger}_{p} and apa_{p} are the fermionic creation and annihilation operators acting on orbital pp, and there are a total of dd orbitals. Each entry τpq\tau_{pq}, νpq\nu_{pq} is real and there is symmetry in exchanging indices. The behaviour of the error as the size of system is changed can be predicted based on the result in Theorem 4 of [37].

Theorem 5 (Theorem 4 in [37]).

Let H=T+V=p,qdτpqapaq+p,qdνpqapapaqaqH=T+V=\sum_{p,q}^{d}\tau_{pq}a^{{\dagger}}_{p}a_{q}+\sum_{p,q}^{d}\nu_{pq}a^{{\dagger}}_{p}a_{p}a^{{\dagger}}_{q}a_{q} be an interacting-electronic Hamiltonian, and Sk(t)S_{k}(t) be a kkth order product formula splitting the evolutions under TT and VV. Then

Sk(t)eitHWη=((τ1+ν1,[η])k1τ1ν1,[η]ηtk+1),\norm{S_{k}(t)-e^{-itH}}_{W_{\eta}}=\order{(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]})^{k-1}\norm{\tau}_{1}\norm{\nu}_{1,[\eta]}\eta t^{k+1}}\,, (49)

where Wη\norm{\cdot}_{W_{\eta}} corresponds to the operator norm on the operator acting in the η\eta-electron subspace, τ1=maxpq|τpq|\norm{\tau}_{1}=\max_{p}\sum_{q}\absolutevalue{\tau_{pq}} and ν1,[η]=maxpmaxq1<q1<<qη(|νpq1|++|νpqη|)\norm{\nu}_{1,[\eta]}=\max_{p}\max_{q_{1}<q_{1}<...<q_{\eta}}(\absolutevalue{\nu_{pq_{1}}}+\cdots+\absolutevalue{\nu_{pq_{\eta}}}).

To fairly compare product formulae as applied to quantum chemistry, we can define ξ,ω\xi,\omega so that

χ\displaystyle\chi =ξ(τ1+ν1,[η])k1τ1ν1,[η]η,\displaystyle=\xi\,(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]})^{k-1}\norm{\tau}_{1}\norm{\nu}_{1,[\eta]}\eta\,, (50)
ζ\displaystyle\zeta =ω(τ1+ν1,[η])k1τ1ν1,[η]η.\displaystyle=\omega\,(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]})^{k-1}\norm{\tau}_{1}\norm{\nu}_{1,[\eta]}\eta\,. (51)

That is, χ,ξ\chi,\xi are for the spectral-norm error, and ζ,ω\zeta,\omega are for the eigenvalue error, with ξ,ω\xi,\omega being the quantities defined for fermionic Hamiltonians. Note that the expression in Ref. [37] was derived for error in the spectral norm, but that also provides an upper bound on the error in the eigenvalues, so it is reasonable to consider the constant defined for eigenvalue error. Then the formula for the threshold T/εT/\varepsilon becomes, for ω\omega (eigenvalue error)

Tε\displaystyle\frac{T}{\varepsilon} =(M2[ω2(τ1+ν1,[η])k21τ1ν1,[η]η]1/k2M1[ω1(τ1+ν1,[η])k11τ1ν1,[η]η]1/k1)11k11k2\displaystyle=\left(\frac{M_{2}[\omega_{2}(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]})^{k_{2}-1}\norm{\tau}_{1}\norm{\nu}_{1,[\eta]}\eta]^{1/k_{2}}}{M_{1}[\omega_{1}(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]})^{k_{1}-1}\norm{\tau}_{1}\norm{\nu}_{1,[\eta]}\eta]^{1/k_{1}}}\right)^{\frac{1}{\frac{1}{k_{1}}-\frac{1}{k_{2}}}}
=(M2[ω2τ1ν1,[η]η/(τ1+ν1,[η])]1/k2M1[ω1τ1ν1,[η]η/(τ1+ν1,[η])]1/k1)11k11k2\displaystyle=\left(\frac{M_{2}[\omega_{2}\norm{\tau}_{1}\norm{\nu}_{1,[\eta]}\eta/(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]})]^{1/k_{2}}}{M_{1}[\omega_{1}\norm{\tau}_{1}\norm{\nu}_{1,[\eta]}\eta/(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]})]^{1/k_{1}}}\right)^{\frac{1}{\frac{1}{k_{1}}-\frac{1}{k_{2}}}}
τ1ν1,[η]ητ1+ν1,[η]Tε\displaystyle\frac{\norm{\tau}_{1}\norm{\nu}_{1,[\eta]}\eta}{\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}}\frac{T}{\varepsilon} =(M2ω21/k2M1ω11/k1)11k11k2.\displaystyle=\left(\frac{M_{2}\omega_{2}^{1/k_{2}}}{M_{1}\omega_{1}^{1/k_{1}}}\right)^{\frac{1}{\frac{1}{k_{1}}-\frac{1}{k_{2}}}}. (52)

Thus we see that the ratio τ1ν1,[η]η/(τ1+ν1,[η]){\norm{\tau}_{1}\norm{\nu}_{1,[\eta]}\eta}/(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}) governs the threshold where a 10th-order product formula will improve over an 8th-order product formula (and similarly for other orders). This is somewhat different from the factor of the norm of the Hamiltonian that might otherwise be expected for unstructured Hamiltonians.

To analyse the non-asymptotic regime, we rewrite the error to be proportional to

τ1ν1,[η]ητ1+ν1,[η]t×[t(τ1+ν1,[η])]k.\frac{\norm{\tau}_{1}\norm{\nu}_{1,[\eta]}\eta}{\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}}t\times\left[{t}\left(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}\right)\right]^{k}. (53)

Following the derivation of Ref. [37], it is easily seen that the contributions to the error from each of the orders higher than k+1k+1 will be of a similar form. These higher-order contributions to the error can be used to determine the error in the non-asymptotic regime where the order k+1k+1 term is not dominant.

To see why that is the case, note that Lemma 2 of Ref. [37] describes the error at order k+1k+1 (p+1p+1 in the notation of that work) in terms of a sum over norms of multicommutators of k+1k+1 terms in the Hamiltonian. It is easily seen that the error at each of the higher orders will also be given by similar multicommutator forms. The method of proof of Theorem 4 in Ref. [37] is based on simplifying the multicommutator expressions from Lemma 2, and proceeds in exactly the same way for the higher orders.

Therefore, the sum over errors of all orders would give a total error of the form

τ1ν1,[η]ητ1+ν1,[η]t×j=k+1ξj[t(τ1+ν1,[η])]j.\frac{\norm{\tau}_{1}\norm{\nu}_{1,[\eta]}\eta}{\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}}t\times\sum_{j=k+1}^{\infty}\xi_{j}\left[t\left(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}\right)\right]^{j}. (54)

This sum would then give the non-asymptotic form for the error

τ1ν1,[η]ητ1+ν1,[η]t×g(t(τ1+ν1,[η])),\frac{\norm{\tau}_{1}\norm{\nu}_{1,[\eta]}\eta}{\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}}t\times g\!\left({t}\left(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}\right)\right)\,, (55)

for some function gg.

Then following the same procedure to determine the threshold as before, we will have a total error given by

τ1ν1,[η]ητ1+ν1,[η]T×g(t(τ1+ν1,[η])).\frac{\norm{\tau}_{1}\norm{\nu}_{1,[\eta]}\eta}{\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}}T\times g\!\left({t}\left(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}\right)\right)\,. (56)

That would imply that comparing two product formulae, the threshold will be obtained for

g1(t1(τ1+ν1,[η]))=g2(t2(τ1+ν1,[η])).g_{1}\!\left({t_{1}}\left(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}\right)\right)=g_{2}\!\left({t_{2}}\left(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}\right)\right)\,. (57)

Using t2=t1M2/M1t_{2}=t_{1}M_{2}/M_{1} and solving for t1t_{1} as before, the T/ϵT/\epsilon threshold would then be given by substituting the solution for t1t_{1} into

τ1ν1,[η]ητ1+ν1,[η]Tϵ=1/g1(t1(τ1+ν1,[η])).\frac{\norm{\tau}_{1}\norm{\nu}_{1,[\eta]}\eta}{\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}}\frac{T}{\epsilon}=1/g_{1}\!\left({t_{1}}\left(\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}\right)\right)\,. (58)

The threshold can be calculated for Hamiltonians with a given value of τ1+ν1,[η]\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}, then for some new Hamiltonian with different values of the norms, the threshold should still be for the same value of t1(τ1+ν1,[η]){t_{1}}\big{(}\norm{\tau}_{1}+\norm{\nu}_{1,[\eta]}\big{)}. This implies that the right-hand-side of Eq. (58) should be the same for the threshold with the new Hamiltonian. As a result, we expect that the threshold should be in terms of the left-hand-side of Eq. (58), which is the same expression as found in the asymptotic regime.

In our numerical testing, the coefficients τpq\tau_{pq}, νpq\nu_{pq} were chosen uniformly from the interval [1,1][-1,1]. We computed ζ\zeta for a selection of the product formulae with the best performance for random Hamiltonians. With the errors computed numerically, we can compute ω\omega. In Table 11, Table 12 and Table 13 we give the computed result for d=6d=6 orbitals, assuming half-filling of the orbitals. Our numerics indicate that the error is roughly proportional to the bound in Eq. 49, independent of the number of orbitals, though the constant factors are small. In Appendix C we give the constant factors for the case d=4d=4, showing that the computed constant does not change much with a different dd.

label ω\omega Mω1/kM\omega^{1/k}
BCE6m10 3.4×10113.4\times 10^{-11} 0.360.36
PPBCM6m6 1.2×1091.2\times 10^{-9} 0.420.42
Table 11: Comparison of constant factors ω\omega for the best product formulae for 6th order. We generate 10001000 random Hamiltonians with d=6d=6 orbitals as in Eq. 48 and compute the average ω\omega.
label ω\omega Mω1/kM\omega^{1/k}
SS8s19 3.5×10113.5\times 10^{-11} 0.940.94
Y8m10 8.5×10128.5\times 10^{-12} 0.870.87
Y8m10b 1.3×10121.3\times 10^{-12} 0.680.68
PP8s13 4.4×10104.4\times 10^{-10} 0.880.88
YP8m8 1.7×10121.7\times 10^{-12} 0.570.57
Table 12: Comparison of constant factors ω\omega for the best product formulae for 8th order. We generate 10001000 random Hamiltonians with d=6d=6 orbitals as in Eq. 48 and compute the average ω\omega.
label ω\omega Mω1/kM\omega^{1/k}
SS10s35 3.0×10153.0\times 10^{-15} 1.241.24
Y10m17 2.6×10142.6\times 10^{-14} 1.53
Y10m18b 5.1×10145.1\times 10^{-14} 1.74
PP10s23 2.3×10122.3\times 10^{-12} 1.58
Table 13: Comparison of constant factors ω\omega for the best product formulae for 10th order. We generate 10001000 random Hamiltonians with d=6d=6 orbitals as in Eq. 48 and compute the average ω\omega.

For calculating the thresholds based on the asymptotic formula in Eq. 52, we should compare our best 8th order formula YP8m8 to SS10s35 in the 10th order case, and BCE6m10 in the 6th order case. Then we obtain a threshold of 3×10133\times 10^{13} for 8th to 10th order, and 6×1046\times 10^{4} for 6th to 8th order, indicating about 9 orders of magnitude for T/ϵT/\epsilon where our 8th order product formula is optimal.

Similar to the case for general matrices, we expect that the threshold for 8th to 10th order is sufficiently large that the time steps are small and the asymptotic formulae are accurate, but the threshold between 6th and 8th order may be unreliable. In that case we solve Eq. 57 for t1t_{1} with t2=t1M2/M2t_{2}=t_{1}M_{2}/M_{2}, and find the threshold using Eq. 58 is increased to 5×1055\times 10^{5}. Similar to the case with general matrices, we expect that the 6th order product formula PPBCM6m6 can give better performance for larger time steps, and indeed we find that the threshold is further increased to 8×1068\times 10^{6}. By using our 8th order kernel optimized for large time step in Table 10 we can improve the threshold between PPBCM6m6 and our 8th order formula in the non-asymptotic case. In this case we obtain a threshold of around 2×1062\times 10^{6}, which is an improvement by a factor of 4 times; not as significant as the improvement for general random matrices.

Next we compare the performance of the product formulae for specific systems. From Ref. [37] the norms can be expected to scale as

ν1,[η]=𝒪(η2/3N1/3Ω1/3),τ1=𝒪(N2/3Ω2/3),\norm{\nu}_{1,[\eta]}=\mathcal{O}\left(\frac{\eta^{2/3}N^{1/3}}{\Omega^{1/3}}\right),\qquad\norm{\tau}_{1}=\mathcal{O}\left(\frac{N^{2/3}}{\Omega^{2/3}}\right), (59)

where NN is the number of orbitals and Ω\Omega is the volume (denoted nn and ω\omega in [37]). The constants for the scaling of these norms are derived in Ref. [38] as

ν1,[η]π1/3(3/4)2/3η2/3N1/3Ω1/3,τ13π2N2/32Ω2/3.\norm{\nu}_{1,[\eta]}\approx\pi^{1/3}(3/4)^{2/3}\frac{\eta^{2/3}N^{1/3}}{\Omega^{1/3}},\qquad\norm{\tau}_{1}\approx\frac{3\pi^{2}N^{2/3}}{2\Omega^{2/3}}. (60)

For an initial order of magnitude estimate, the chemical accuracy required for the phase estimation is about 0.0010.001 Hartree, which implies TT of about 1000π1000\pi, and ε\varepsilon can be taken to be of order 1. With high estimates η100\eta\approx 100 and N/Ω109N/\Omega\approx 10^{9}, the left-hand-side (LHS) of Eq. (52) would be on the order of 101010^{10}, which means our 8th order should be preferred to 10th order. Therefore, in this case either 8th order or 10th order could be chosen depending on the details of the system

For comparison, we also consider the Alpha + Hydrogen system from Ref. [38]. The parameters η\eta, NN and Ω\Omega are given in Table II of that work. Some of the most challenging values of T,ϵT,\epsilon are given in Figure 8 of that work, with for example T=40T=40 and ϵ=104\epsilon=10^{-4}. With these parameters the LHS of Eq. (52) is approximately 8.6×1098.6\times 10^{9}, which is below the threshold (between 8th and 10th order) by a few orders of magnitude. The smallest values of T/ϵT/\epsilon from Ref. [38] are obtained with T=10T=10 and ϵ=0.01\epsilon=0.01 (see Table IV). That results in a value for the LHS of Eq. (52) of 2.2×1072.2\times 10^{7}, which is above the threshold between 6th and 8th order, so our 8th order product formula would still be optimal. This indicates that the new 8th order product formula found here should be optimal for the full range of parameter values used in Ref. [38]. Nonetheless, for smaller values of T/ϵT/\epsilon it may be advantageous to further customise the product formula to provide better performance at larger time steps, as described above to improve the threshold between 6th order and 8th order.

Note also that the fact that the threshold here is different than for random Hamiltonians means that the threshold will depend on the class of Hamiltonians. Selecting a different type of Hamiltonians can change the threshold. Similarly, it will be expected that choosing a particular class of fermionic Hamiltonians (such as those arising from plane waves) rather than random fermionic Hamiltonians can give a different threshold. Nevertheless, changing the class of Hamiltonians still resulted in an extremely large threshold for 10th order product formulae to be optimal. This suggests that the 8th order formula will be optimal for realistic simulations regardless of the class of Hamiltonians.

VII Conclusion

In this work we have developed a method of fairly comparing product formulae with different numbers of factors and different orders, as well as searched for improved higher-order product formulae. We find that it is better to compare the eigenvalue error, rather than the spectral-norm error as in prior work, because it is the eigenvalue error in a single step which dominates the error over longer time evolutions. The optimal order of integrator to use depends primarily on the ratio T/ϵT/\epsilon, the ratio of the total evolution time to the required error.

This is because, as T/ϵT/\epsilon becomes larger, the error allowed for each individual time step becomes smaller, and the time steps must become shorter to make the error sufficiently small. That results in a larger overall number of time steps, and so a larger complexity. The higher-order integrators better reduce the error, with shorter time steps, and so for large T/ϵT/\epsilon that improvement more than compensates for the larger number of exponentials for the higher-order product formulae.

In our search for higher-order product formulae we found very large numbers of solutions at both 8th and 10th order. Our solution at 8th order improves over all prior 8th order product formulae that we found in prior work. The best 8th order product formula we have found is a processed product formula, which means that in a long time simulation the kernel would be repeated many times, and the processor would only be used at the beginning and end of the simulation. For applications in eigenvalue estimation the processor would not be needed at all. We found highly accurate solutions at 10th order, though they are outperformed by one result found in prior work that is extremely accurate. These product formulae greatly outperform the fractal product formulae of Suzuki, which are commonly considered in quantum simulation but require a large number of exponentials.

We show via numerical testing for random Hamiltonians that there is a range of about eight orders of magnitude for T/ϵT/\epsilon where our new 8th-order product formula outperforms all other known product formulae. Moreover, our 8th order product formula can be further adjusted for smaller values of T/ϵT/\epsilon to provide another order of magnitude where it is optimal. This range includes reasonable combinations of parameters that one would use in applications. A lower-order formula would only be optimal for small simulations that are likely to be classically tractable. A 10th order formula would only be optimal for simulations well beyond the scale considered for quantum algorithms.

A particularly important class of Hamiltonians is those corresponding to fermions (rather than just random), because those are relevant to simulations in quantum chemistry. Using the formula from Ref. [37], those give a threshold for T/ϵT/\epsilon combined with norms of the matrices describing the fermionic system. As an example we consider parameters for the simulations from Ref. [38], and estimate that the threshold for 10th order product formulae to outperform our 8th order product formula is about three orders of magnitude larger than the largest simulations considered there. Those simulations are already expected to require more than 101310^{13} Toffolis, so even larger simulations where 10th order product formulae are optimal would be well beyond the scale that could be realistically implemented on foreseeable quantum computers.

A potential topic for future work is further customising product formulae for large time steps. We have customised the 8th order product formula to improve the threshold, but it would also be possible to customise the 6th and 4th order product formulae. It would also be possible to customise the product formulae for larger time steps with the fermionic Hamiltonians. So far we have only customised the 8th order product formula for general matrices.

VIII Acknowledgements

MESM was supported by the ARC Centre of Excellence for Quantum Computation and Communication Technology (CQC2T), project number CE170100012 and a scholarship top-up and extension from the Sydney Quantum Academy. MESM and YRS were supported by the Defense Advanced Research Projects Agency under Contract No. HR001122C0074. Any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not necessarily reflect the views of the Defense Advanced Research Projects Agency. DKB acknowledges funding by the Australian Research Council (project numbers FT190100106, DP210101367, CE170100009). DWB worked on this project under a sponsored research agreement with Google Quantum AI. DWB is also supported by Australian Research Council Discovery Projects DP210101367 and DP220101602.

References

Appendix A Extending Yoshida’s method to 10th order

Here we explain how to extend the method of Yoshida to obtain the equations for a 10th order integrator. See Section II.2 for an introductory explanation of how the method is used for 6th order. The general principle used there was to provide an expression for S(m)(τ)S^{(m)}(\tau) in Eq. (19) with the expression in the exponential given up to 6th order. Then the coefficients of the multicommutators of operators needed to be made equal to zero in order to obtain a 6th order approximation. Here we apply the same principle, except now we need to derive the terms up to 10th order.

In order to do this, we start by expressing S2(t)S_{2}(t) up to 10th order as

S2(t)=etα1+t3α3+t5α5+t7α7+t9α9+𝒪(t11)S_{2}(t)=e^{t\alpha_{1}+t^{3}\alpha_{3}+t^{5}\alpha_{5}+t^{7}\alpha_{7}+t^{9}\alpha_{9}+\mathcal{O}(t^{11})} (61)

where we follow the notation from Corollary 3 with αj\alpha_{j} defined as commutators of operators. We will then iteratively apply the symmetric BCH expansion for ZZ in eZ=eCeDeCe^{Z}=e^{C}e^{D}e^{C} in order to obtain an expression for S(m)(τ)S^{(m)}(\tau) up to 10th order. It will be helpful to consider the following notation for commutators. For any operators X1,X2,,XLX_{1},X_{2},\cdots,X_{L} we define

[X1n1,X2n1,,XLnL]:=[X1,,X1n_1 times ,X2,,X2n_2 times ,,XL,,XLn_L times ],[X_{1}^{n_{1}},X_{2}^{n_{1}},\cdots,X_{L}^{n_{L}}]:=[\underbrace{X_{1},\cdots,X_{1}}_{\text{{\hbox{n_1}} times }},\underbrace{X_{2},\cdots,X_{2}}_{\text{{\hbox{n_2}} times }},\cdots,\underbrace{X_{L},\cdots,X_{L}}_{\text{{\hbox{n_L}} times }}], (62)

where the commutator on the right hand should be understood as in Eq. 9. To express the results it will be useful to define the following commutators

β9\displaystyle\beta_{9} =[α1,α1,α7],\displaystyle=[\alpha_{1},\alpha_{1},\alpha_{7}], (63)
γ9(1)\displaystyle\gamma^{(1)}_{9} =[α1,α3,α5],\displaystyle=[\alpha_{1},\alpha_{3},\alpha_{5}], (64)
γ9(2)\displaystyle\gamma^{(2)}_{9} =[α3,α1,α5],\displaystyle=[\alpha_{3},\alpha_{1},\alpha_{5}], (65)
γ9(3)\displaystyle\gamma^{(3)}_{9} =[α5,α1,α3],\displaystyle=[\alpha_{5},\alpha_{1},\alpha_{3}], (66)
δ9(1)\displaystyle\delta^{(1)}_{9} =[α14,α5],\displaystyle=[\alpha_{1}^{4},\alpha_{5}], (67)
δ9(2)\displaystyle\delta^{(2)}_{9} =[α3,α13,α3],\displaystyle=[\alpha_{3},\alpha_{1}^{3},\alpha_{3}], (68)
δ9(3)\displaystyle\delta^{(3)}_{9} =[α1,α3,α12,α3],\displaystyle=[\alpha_{1},\alpha_{3},\alpha_{1}^{2},\alpha_{3}], (69)
ϵ9\displaystyle\epsilon_{9} =[α16,α3].\displaystyle=[\alpha_{1}^{6},\alpha_{3}]. (70)

We also use the notation of Yoshida [20] for the commutators

β5\displaystyle\beta_{5} =[α1,α1,α3]E5,2,\displaystyle=[\alpha_{1},\alpha_{1},\alpha_{3}]\equiv E_{5,2}, (71)
β7\displaystyle\beta_{7} =[α1,α1,α5]E7,3,\displaystyle=[\alpha_{1},\alpha_{1},\alpha_{5}]\equiv E_{7,3}, (72)
γ7\displaystyle\gamma_{7} =[α3,α3,α1]E7,2,\displaystyle=[\alpha_{3},\alpha_{3},\alpha_{1}]\equiv-E_{7,2}, (73)
δ7\displaystyle\delta_{7} =[α1,α1,α1,α1,α3]E7,4,\displaystyle=[\alpha_{1},\alpha_{1},\alpha_{1},\alpha_{1},\alpha_{3}]\equiv E_{7,4}, (74)

where we have indicated the equivalence to the multicommutators in the notation of Ref. [24].

Note that here we have only defined commutators of up to 7 of the αj\alpha_{j} operators. In the following we will be expanding expressions up to 9th order in tt, because the symmetry of the formulae means that 10th order terms (and all even-order terms) must be zero. The only way to obtain 9th order in tt with commutators of tα1t\alpha_{1}, t3α3t^{3}\alpha_{3}, etc., is to have commutators of all α1\alpha_{1} as [α1,α1,,α1][\alpha_{1},\alpha_{1},\cdots,\alpha_{1}]. But that expression must be zero, because [α1,α1]=0[\alpha_{1},\alpha_{1}]=0. Then when we express the expansion of ZZ in eZ=eCeDeCe^{Z}=e^{C}e^{D}e^{C}, the first-order terms in CC and DD will be proportional to α1\alpha_{1}. That means the only 9th order terms coming from commutators of 9 of C,DC,D would correspond to [α1,α1,,α1][\alpha_{1},\alpha_{1},\cdots,\alpha_{1}] and therefore be zero. This means we will only need commutators of up to 7 operators when expressing the expansion of ZZ in eZ=eCeDeCe^{Z}=e^{C}e^{D}e^{C} as well.

To obtain the coefficients multiplying the commutators with up to 7 operators in the symmetric BCH expansion for eZ=eCeDeCe^{Z}=e^{C}e^{D}e^{C}, we first use the algorithm defined in Section V of Ref. [39]. The algorithm in that work generates the scalar coefficients multiplying products of operators rather than their commutators, as we need here. In order to derive the corresponding expressions in terms of commutators, we express the symmetric BCH expansion in the Ph. Hall basis, which is a basis for writing Lie monomials consisting of commutators of the generators of the Lie algebra. For example, the elements up to 4th order in C,DC,D are the operators C,DC,D themselves, as well as

[C,D],[C,C,D],[D,C,D],[C,C,C,D],[D,C,C,D],[D,D,C,D].[C,D],\quad[C,C,D],\quad[D,C,D],\quad[C,C,C,D],\quad[D,C,C,D],\quad[D,D,C,D]. (75)

For a list of operators in this basis up to 7th order, see Table 1 in [40].

We obtain the coefficients for the Ph. Hall basis by solving the corresponding linear problem of changing from one basis to another. As an example, consider the term with 33 operators in the symmetric BCH expansion from Corollary 3, given as α3=112[Y,[Y,X]]124[X,[X,Y]]\alpha_{3}=\frac{1}{12}[Y,[Y,X]]-\frac{1}{24}[X,[X,Y]]. We can also express the commutators as products by expanding out the commutators, which gives α~3=124(2Y2X4YXY2XY2X2Y+2XYX+YX2)\tilde{\alpha}_{3}=\frac{1}{24}\left(2Y^{2}X-4YXY-2XY^{2}-X^{2}Y+2XYX+YX^{2}\right). The algorithm in [39] outputs expressions with the commutators expanded out as in α~3\tilde{\alpha}_{3}. In order to obtain the original expression α3\alpha_{3}, we write α~3=a[Y,[Y,X]]+b[X,[X,Y]]\tilde{\alpha}_{3}=a[Y,[Y,X]]+b[X,[X,Y]] with a,ba,b\in\mathbb{R} and expand the commutators [Y,[Y,X]][Y,[Y,X]] and [X,[X,Y]][X,[X,Y]]. This gives several linear equations that can be written in terms of a matrix. By inverting this matrix, we obtain the coefficients aa and bb.

By using this method, we obtain the symmetric BCH expansion for eZ=eCeDeCe^{Z}=e^{C}e^{D}e^{C} up to 7th order as

Z\displaystyle Z =2C+D+16([D,D,C][C,C,D])\displaystyle=2C+D+\frac{1}{6}([D,D,C]-[C,C,D])
+7360[C,C,C,C,D]1360[D,D,D,D,C]\displaystyle\quad+\frac{7}{360}[C,C,C,C,D]-\frac{1}{360}[D,D,D,D,C]
+190[C,D,D,D,C]+145[D,C,C,C,D]\displaystyle\quad+\frac{1}{90}[C,D,D,D,C]+\frac{1}{45}[D,C,C,C,D]
160[C,C,D,D,C]+130[D,D,C,C,D]\displaystyle\quad-\frac{1}{60}[C,C,D,D,C]+\frac{1}{30}[D,D,C,C,D]
3115120[C,C,C,C,C,C,D]315040[D,C,C,C,C,C,D]\displaystyle\quad-\frac{31}{15120}[C,C,C,C,C,C,D]-\frac{31}{5040}[D,C,C,C,C,C,D]
131890[D,D,C,C,C,C,D]5315120[D,D,D,C,C,C,D]\displaystyle\quad-\frac{13}{1890}[D,D,C,C,C,C,D]-\frac{53}{15120}[D,D,D,C,C,C,D]
11260[D,D,D,D,C,C,D]115120[D,D,D,D,D,C,D]+(9).\displaystyle\quad-\frac{1}{1260}[D,D,D,D,C,C,D]-\frac{1}{15120}[D,D,D,D,D,C,D]+\mathcal{R}_{(9\leq)}. (76)

Here (9)\mathcal{R}_{(9\leq)} is an infinite sum with commutators of an odd number of operators equal to or higher than 99. Now we prove the following Lemma which gives us the expansion up to 10th order of S(m)(τ)S^{(m)}(\tau). This will allow us to derive the equations for 10th order product formulae.

Lemma 6.

Using the definition for S(m)(τ)S^{(m)}(\tau) as given in Eq. 15, we have that for all mm\in\mathbb{N}

S(m)(τ)\displaystyle S^{(m)}(\tau) =exp{τA1,mα1+τ3A3,mα3+τ5(A5,mα5+B5,mβ5)\displaystyle=\exp\{\tau A_{1,m}\alpha_{1}+\tau^{3}A_{3,m}\alpha_{3}+\tau^{5}(A_{5,m}\alpha_{5}+B_{5,m}\beta_{5})
+τ7(A7,mα7+B7,mβ7+C7,mγ7+D7,mδ7)\displaystyle\quad+\tau^{7}(A_{7,m}\alpha_{7}+B_{7,m}\beta_{7}+C_{7,m}\gamma_{7}+D_{7,m}\delta_{7})
+τ9(A9,mα9+B9,mβ9+C9,m(1)γ9(1)+C9,m(2)γ9(2)+C9,m(3)γ9(3)\displaystyle\quad+\tau^{9}(A_{9,m}\alpha_{9}+B_{9,m}\beta_{9}+C^{(1)}_{9,m}\gamma^{(1)}_{9}+C^{(2)}_{9,m}\gamma^{(2)}_{9}+C^{(3)}_{9,m}\gamma^{(3)}_{9}
+D9,m(1)δ9(1)+D9,m(2)δ9(2)+D9,m(3)δ9(3)+E9,mϵ9)+(τ11)},\displaystyle\quad+D^{(1)}_{9,m}\delta^{(1)}_{9}+D^{(2)}_{9,m}\delta^{(2)}_{9}+D^{(3)}_{9,m}\delta^{(3)}_{9}+E_{9,m}\epsilon_{9})+\order{\tau^{11}}\bigg{\}}, (77)

where the variables in upper case denote polynomials in the variables (w1,,wm)(w_{1},\cdots,w_{m}).

Proof.

We proceed by induction. First, note that the statement is true for the case m=0m=0,

S(m=0)(τ)\displaystyle S^{(m=0)}(\tau) =S2(w0τ),\displaystyle=S_{2}(w_{0}\tau), (78)
=exp(tw0α1+t3w03α3+t5w05α5+t7w07α7+t9w09α9+𝒪(t11)).\displaystyle=\exp{tw_{0}\alpha_{1}+t^{3}w_{0}^{3}\alpha_{3}+t^{5}w_{0}^{5}\alpha_{5}+t^{7}w_{0}^{7}\alpha_{7}+t^{9}w_{0}^{9}\alpha_{9}+\mathcal{O}(t^{11})}. (79)

This clearly has the form of Lemma 6 by taking Aj,m=0=w0jA_{j,m=0}=w_{0}^{j} and all other scalar variables as 0.

Assume now that Lemma 6 is correct, we want to derive an expression for S(m+1)S^{(m+1)}. We then have S(m+1)(τ)=S2(wm+1τ)S(m)(τ)S2(wm+1τ)S^{(m+1)}(\tau)=S_{2}(w_{m+1}\tau)S^{(m)}(\tau)S_{2}(w_{m+1}\tau) and thus

S2(wm+1τ)S(m)(τ)S2(wm+1τ)\displaystyle S_{2}(w_{m+1}\tau)S^{(m)}(\tau)S_{2}(w_{m+1}\tau) =exp{τwm+1α1+τ3wm+13α3+τ5wm+15α5+τ7wm+17α7+τ9wm+19α9+𝒪(τ11)}\displaystyle=\exp\{\tau w_{m+1}\alpha_{1}+\tau^{3}w_{m+1}^{3}\alpha_{3}+\tau^{5}w_{m+1}^{5}\alpha_{5}+\tau^{7}w_{m+1}^{7}\alpha_{7}+\tau^{9}w_{m+1}^{9}\alpha_{9}+\mathcal{O}(\tau^{11})\bigg{\}}
×exp{τA1,mα1+τ3A3,mα3+τ5(A5,mα5+B5,mβ5)\displaystyle\quad\times\exp\{\tau A_{1,m}\alpha_{1}+\tau^{3}A_{3,m}\alpha_{3}+\tau^{5}(A_{5,m}\alpha_{5}+B_{5,m}\beta_{5})
+τ7(A7,mα7+B7,mβ7+C7,mγ7+D7,mδ7)\displaystyle\quad+\tau^{7}(A_{7,m}\alpha_{7}+B_{7,m}\beta_{7}+C_{7,m}\gamma_{7}+D_{7,m}\delta_{7})
+τ9(A9,mα9+B9,mβ9+C9,m(1)γ9(1)+C9,m(2)γ9(2)+C9,m(3)γ9(3)\displaystyle\quad+\tau^{9}(A_{9,m}\alpha_{9}+B_{9,m}\beta_{9}+C^{(1)}_{9,m}\gamma^{(1)}_{9}+C^{(2)}_{9,m}\gamma^{(2)}_{9}+C^{(3)}_{9,m}\gamma^{(3)}_{9}
+D9,m(1)δ9(1)+D9,m(2)δ9(2)+D9,m(3)δ9(3)+E9,mϵ9)+(τ11)}\displaystyle\quad+D^{(1)}_{9,m}\delta^{(1)}_{9}+D^{(2)}_{9,m}\delta^{(2)}_{9}+D^{(3)}_{9,m}\delta^{(3)}_{9}+E_{9,m}\epsilon_{9})+\order{\tau^{11}}\bigg{\}}
×exp{τwm+1α1+τ3wm+13α3+τ5wm+15α5+τ7wm+17α7+τ9wm+19α9+(τ11)}.\displaystyle\quad\times\exp\{\tau w_{m+1}\alpha_{1}+\tau^{3}w_{m+1}^{3}\alpha_{3}+\tau^{5}w_{m+1}^{5}\alpha_{5}+\tau^{7}w_{m+1}^{7}\alpha_{7}+\tau^{9}w_{m+1}^{9}\alpha_{9}+\order{\tau^{11}}\bigg{\}}. (80)

We compute the right-hand-side (RHS) of Appendix A applying the symmetric BCH formula from Corollary 3. Writing the RHS as eCeDeCe^{C}e^{D}e^{C}, we have that

C\displaystyle C =τwm+1α1+τ3wm+13α3+τ5wm+15α5+τ7wm+17α7+τ9wm+19α9+𝒪(τ11)\displaystyle=\tau w_{m+1}\alpha_{1}+\tau^{3}w_{m+1}^{3}\alpha_{3}+\tau^{5}w_{m+1}^{5}\alpha_{5}+\tau^{7}w_{m+1}^{7}\alpha_{7}+\tau^{9}w_{m+1}^{9}\alpha_{9}+\mathcal{O}(\tau^{11}) (81)
D\displaystyle D =τA1,mα1+τ3A3,mα3+τ5(A5,mα5+B5,mβ5)+τ7(A7,mα7+B7,mβ7+C7,mγ7+D7,mδ7)\displaystyle=\tau A_{1,m}\alpha_{1}+\tau^{3}A_{3,m}\alpha_{3}+\tau^{5}(A_{5,m}\alpha_{5}+B_{5,m}\beta_{5})+\tau^{7}(A_{7,m}\alpha_{7}+B_{7,m}\beta_{7}+C_{7,m}\gamma_{7}+D_{7,m}\delta_{7})
+τ9(A9,mα9+B9,mβ9+C9,m(1)γ9(1)+C9,m(2)γ9(2)+C9,m(3)γ9(3)+D9,m(1)δ9(1)+D9,m(2)δ9(2)+D9,m(3)δ9(3)+E9,mϵ9)\displaystyle\quad+\tau^{9}(A_{9,m}\alpha_{9}+B_{9,m}\beta_{9}+C^{(1)}_{9,m}\gamma^{(1)}_{9}+C^{(2)}_{9,m}\gamma^{(2)}_{9}+C^{(3)}_{9,m}\gamma^{(3)}_{9}+D^{(1)}_{9,m}\delta^{(1)}_{9}+D^{(2)}_{9,m}\delta^{(2)}_{9}+D^{(3)}_{9,m}\delta^{(3)}_{9}+E_{9,m}\epsilon_{9})
+(τ11).\displaystyle\quad+\order{\tau^{11}}. (82)

We then compute the commutators of CC and DD that appear in the symmetric BCH formula, here we give the resulting 9th order operators after applying the commutators. When we write [C,D,,C]9[C,D,\cdots,C]_{9}, the subscript indicates that we are only keeping the 99th order terms when expanding the commutator. We will explain in detail how to compute the commutator 𝒞=[D,D,C]9\mathcal{C}=[D,D,C]_{9}, the other commutators are computed in a similar way. Since we only need to consider terms of 9th order when computing 𝒞\mathcal{C}, each term will have contributions from each operator inside 𝒞\mathcal{C} (in this case two operators DD and one CC) which is comprised of odd numbers that sum up to 99 such that the commutator is non-zero. We then have that

[D,D,C]9\displaystyle[D,D,C]_{9} =τ9(A1,m2wm+17A1,mA7,mwm+1)[α1,α1,α7]\displaystyle=\tau^{9}(A^{2}_{1,m}w_{m+1}^{7}-A_{1,m}A_{7,m}w_{m+1})[\alpha_{1},\alpha_{1},\alpha_{7}]
+τ9A1,mB7,mwm+1[α1,β7,α1]\displaystyle\quad+\tau^{9}A_{1,m}B_{7,m}w_{m+1}[\alpha_{1},\beta_{7},\alpha_{1}]
+τ9A1,mC7,mwm+1[α1,γ7,α1]\displaystyle\quad+\tau^{9}A_{1,m}C_{7,m}w_{m+1}[\alpha_{1},\gamma_{7},\alpha_{1}]
+τ9A1,mD7,mwm+1[α1,δ7,α1]\displaystyle\quad+\tau^{9}A_{1,m}D_{7,m}w_{m+1}[\alpha_{1},\delta_{7},\alpha_{1}]
+τ9(A1,mA3,mwm+1A1,mA5,mwm+13)[α1,α3,α5]\displaystyle\quad+\tau^{9}(A_{1,m}A_{3,m}w_{m+1}-A_{1,m}A_{5,m}w_{m+1}^{3})[\alpha_{1},\alpha_{3},\alpha_{5}]
+τ9A1,mB5,mwm+13[α1,β5,α3]\displaystyle\quad+\tau^{9}A_{1,m}B_{5,m}w_{m+1}^{3}[\alpha_{1},\beta_{5},\alpha_{3}]
+τ9(A3,mA1,mwm+15A3,mA5,mwm+1)[α3,α1,α5]\displaystyle\quad+\tau^{9}(A_{3,m}A_{1,m}w_{m+1}^{5}-A_{3,m}A_{5,m}w_{m+1})[\alpha_{3},\alpha_{1},\alpha_{5}]
+τ9A1,mB5,mwm+1[α3,β5,α1]\displaystyle\quad+\tau^{9}A_{1,m}B_{5,m}w_{m+1}[\alpha_{3},\beta_{5},\alpha_{1}]
+τ9(A5,mA1,mwm+13A5,mA3,mwm+1)[α5,α1,α3]\displaystyle\quad+\tau^{9}(A_{5,m}A_{1,m}w_{m+1}^{3}-A_{5,m}A_{3,m}w_{m+1})[\alpha_{5},\alpha_{1},\alpha_{3}]
+τ9B5,mA3,mwm+1[β5,α3,α1].\displaystyle\quad+\tau^{9}B_{5,m}A_{3,m}w_{m+1}[\beta_{5},\alpha_{3},\alpha_{1}]. (83)

Given how we have defined the commutator, we have then

[D,D,C]9\displaystyle[D,D,C]_{9} =τ9(A1,m2wm+17A1,mA7,mwm+1)β9τ9A1,mB7,mwm+1δ9(1)\displaystyle=\tau^{9}(A_{1,m}^{2}w_{m+1}^{7}-A_{1,m}A_{7,m}w_{m+1})\beta_{9}-\tau^{9}A_{1,m}B_{7,m}w_{m+1}\delta^{(1)}_{9}
+τ9A1,mC7,mwm+1δ9(3)τ9A1,mD7,mwm+1ϵ9\displaystyle\quad+\tau^{9}A_{1,m}C_{7,m}w_{m+1}\delta^{(3)}_{9}-\tau^{9}A_{1,m}D_{7,m}w_{m+1}\epsilon_{9}
+τ9(A1,mA3,mwm+15A1,mA5,mw13)γ9(1)τ9A1,mB5,mwm+13δ9(3)\displaystyle\quad+\tau^{9}(A_{1,m}A_{3,m}w_{m+1}^{5}-A_{1,m}A_{5,m}w_{1}^{3})\gamma^{(1)}_{9}-\tau^{9}A_{1,m}B_{5,m}w_{m+1}^{3}\delta^{(3)}_{9}
+τ9(A3,mA1,mwm+15A3,mA5,mwm+1)γ9(2)τ9A3,mB5,mwm+1δ9(2)\displaystyle\quad+\tau^{9}(A_{3,m}A_{1,m}w_{m+1}^{5}-A_{3,m}A_{5,m}w_{m+1})\gamma^{(2)}_{9}-\tau^{9}A_{3,m}B_{5,m}w_{m+1}\delta^{(2)}_{9}
+τ9(A5,mA1,mwm+13A5,mA3,mwm+12)γ9(3)\displaystyle\quad+\tau^{9}(A_{5,m}A_{1,m}w_{m+1}^{3}-A_{5,m}A_{3,m}w_{m+1}^{2})\gamma^{(3)}_{9}
+τ9(B5,mA1,mwm+13B5,mA3,mwm+1)(δ9(2)δ9(3))\displaystyle\quad+\tau^{9}(B_{5,m}A_{1,m}w_{m+1}^{3}-B_{5,m}A_{3,m}w_{m+1})(\delta^{(2)}_{9}-\delta^{(3)}_{9}) (84)
[C,C,D]9\displaystyle[C,C,D]_{9} =τ9(wm+12A7,mwm+18A1,m)β9+τ9wm+12B7,mδ9(1)\displaystyle=\tau^{9}(w_{m+1}^{2}A_{7,m}-w_{m+1}^{8}A_{1,m})\beta_{9}+\tau^{9}w_{m+1}^{2}B_{7,m}\delta^{(1)}_{9}
τ9wm+12C7,mδ9(3)+τ9wm+12D7,mϵ9\displaystyle\quad-\tau^{9}w_{m+1}^{2}C_{7,m}\delta^{(3)}_{9}+\tau^{9}w_{m+1}^{2}D_{7,m}\epsilon_{9}
+τ9(wm+14A5,mwm+16A3,m)γ9(1)+τ9wm+14B5,mδ9(3)\displaystyle\quad+\tau^{9}(w_{m+1}^{4}A_{5,m}-w_{m+1}^{6}A_{3,m})\gamma^{(1)}_{9}+\tau^{9}w_{m+1}^{4}B_{5,m}\delta^{(3)}_{9}
+τ9(wm+14A5,mwm+18A1,m)γ9(2)+τ9wm+14B5,mδ9(2)\displaystyle\quad+\tau^{9}(w_{m+1}^{4}A_{5,m}-w_{m+1}^{8}A_{1,m})\gamma^{(2)}_{9}+\tau^{9}w_{m+1}^{4}B_{5,m}\delta^{(2)}_{9}
+τ9(wm+16A3,mwm+18A1,m)γ9(3)\displaystyle\quad+\tau^{9}(w_{m+1}^{6}A_{3,m}-w_{m+1}^{8}A_{1,m})\gamma^{(3)}_{9} (85)
[C,C,C,C,D]9\displaystyle[C,C,C,C,D]_{9} =τ9(wm+14A5,mwm+18A1,m)δ9(1)+τ9A1,m3B5,mwm+1ϵ9\displaystyle=\tau^{9}(w_{m+1}^{4}A_{5,m}-w_{m+1}^{8}A_{1,m})\delta^{(1)}_{9}+\tau^{9}A_{1,m}^{3}B_{5,m}w_{m+1}\epsilon_{9}
+τ9(wm+16A3,mwm+18A1,m)δ9(2)\displaystyle\quad+\tau^{9}(w_{m+1}^{6}A_{3,m}-w_{m+1}^{8}A_{1,m})\delta^{(2)}_{9}
+τ92(wm+16A3,mwm+18A1,m)δ9(3)\displaystyle\quad+\tau^{9}2(w_{m+1}^{6}A_{3,m}-w_{m+1}^{8}A_{1,m})\delta^{(3)}_{9} (86)
[D,D,D,D,C]9\displaystyle[D,D,D,D,C]_{9} =τ9(A1,m4wm+15A1,m3A5,mwm+1)δ9(1)τ9A1,m3B5,mwm+1ϵ9\displaystyle=\tau^{9}(A_{1,m}^{4}w_{m+1}^{5}-A_{1,m}^{3}A_{5,m}w_{m+1})\delta^{(1)}_{9}-\tau^{9}A_{1,m}^{3}B_{5,m}w_{m+1}\epsilon_{9}
+τ9(A3,mA1,m3wm+13A3,m2A1,m2wm+1)δ9(2)\displaystyle\quad+\tau^{9}(A_{3,m}A_{1,m}^{3}w_{m+1}^{3}-A_{3,m}^{2}A_{1,m}^{2}w_{m+1})\delta^{(2)}_{9}
+τ92(A1,m3A3,mwm+13A1,m2A3,m2wm+1)δ9(3)\displaystyle\quad+\tau^{9}2(A_{1,m}^{3}A_{3,m}w_{m+1}^{3}-A_{1,m}^{2}A_{3,m}^{2}w_{m+1})\delta^{(3)}_{9} (87)
[C,D,D,D,C]9\displaystyle[C,D,D,D,C]_{9} =τ9(wm+16A1,m3wm+12A1,m2A5,m)δ9(1)τ9A1,m2B5,mwm+12ϵ9\displaystyle=\tau^{9}(w_{m+1}^{6}A_{1,m}^{3}-w_{m+1}^{2}A_{1,m}^{2}A_{5,m})\delta^{(1)}_{9}-\tau^{9}A_{1,m}^{2}B_{5,m}w_{m+1}^{2}\epsilon_{9}
+τ9(wm+16A1,m3wm+14A1,m2A3,m)δ9(2)\displaystyle\quad+\tau^{9}(w_{m+1}^{6}A_{1,m}^{3}-w_{m+1}^{4}A_{1,m}^{2}A_{3,m})\delta^{(2)}_{9}
+τ92(wm+14A3,mA1,m2wm+12A3,m2A1,m)δ9(3)\displaystyle\quad+\tau^{9}2(w_{m+1}^{4}A_{3,m}A_{1,m}^{2}-w_{m+1}^{2}A_{3,m}^{2}A_{1,m})\delta^{(3)}_{9} (88)
[D,C,C,C,D]9\displaystyle[D,C,C,C,D]_{9} =τ9(A1,mA5,mwm+13A1,m2wm+17)δ9(1)+τ9wm+13A1,mB5,mϵ9\displaystyle=\tau^{9}(A_{1,m}A_{5,m}w_{m+1}^{3}-A_{1,m}^{2}w_{m+1}^{7})\delta^{(1)}_{9}+\tau^{9}w_{m+1}^{3}A_{1,m}B_{5,m}\epsilon_{9}
+τ9(A3,m2wm+13A1,mA3,mwm+15)δ(2)\displaystyle\quad+\tau^{9}(A_{3,m}^{2}w_{m+1}^{3}-A_{1,m}A_{3,m}w_{m+1}^{5})\delta^{(2)}
+τ92(A1,mA3,mwm+15A1,m2wm+17)δ9(3)\displaystyle\quad+\tau^{9}2(A_{1,m}A_{3,m}w_{m+1}^{5}-A_{1,m}^{2}w_{m+1}^{7})\delta^{(3)}_{9} (89)
[C,C,D,D,C]9\displaystyle[C,C,D,D,C]_{9} =τ9(wm+17A1,m2wm+13A1,mA5,m)δ9(1)wm+13A1,mB5,mϵ9\displaystyle=\tau^{9}(w_{m+1}^{7}A_{1,m}^{2}-w_{m+1}^{3}A_{1,m}A_{5,m})\delta^{(1)}_{9}-w_{m+1}^{3}A_{1,m}B_{5,m}\epsilon_{9}
+τ9(wm+17A1,m2wm+15A1,mA3,m)δ9(2)\displaystyle\quad+\tau^{9}(w_{m+1}^{7}A_{1,m}^{2}-w_{m+1}^{5}A_{1,m}A_{3,m})\delta^{(2)}_{9}
+τ9(wm+15A1,mA3,mwm+13A3,m2)δ9(3)\displaystyle\quad+\tau^{9}(w_{m+1}^{5}A_{1,m}A_{3,m}-w_{m+1}^{3}A_{3,m}^{2})\delta^{(3)}_{9} (90)
[D,D,C,C,D]9\displaystyle[D,D,C,C,D]_{9} =τ9(A1,m2A5,mwm+12A1,m3wm+16)δ9(1)+A1,m2B5,mwm+12ϵ9\displaystyle=\tau^{9}(A_{1,m}^{2}A_{5,m}w_{m+1}^{2}-A_{1,m}^{3}w_{m+1}^{6})\delta^{(1)}_{9}+A_{1,m}^{2}B_{5,m}w_{m+1}^{2}\epsilon_{9}
+(A3,m2A1,mwm+12A3,mA1,m2wm+14)δ9(2)\displaystyle\quad+(A_{3,m}^{2}A_{1,m}w_{m+1}^{2}-A_{3,m}A_{1,m}^{2}w_{m+1}^{4})\delta^{(2)}_{9}
+τ9(A3,m2A1,mwm+12A3,mA1,m2wm+14)δ9(3)+τ9(A1,m2A3,mwm+14A1,m3wm+16)δ9(3)\displaystyle\quad+\tau^{9}(A_{3,m}^{2}A_{1,m}w_{m+1}^{2}-A_{3,m}A_{1,m}^{2}w_{m+1}^{4})\delta^{(3)}_{9}+\tau^{9}(A_{1,m}^{2}A_{3,m}w_{m+1}^{4}-A_{1,m}^{3}w_{m+1}^{6})\delta^{(3)}_{9} (91)
[C,C,C,C,C,C,D]9\displaystyle[C,C,C,C,C,C,D]_{9} =τ9(wm+16A3,mwm+18A1,m)ϵ9\displaystyle=\tau^{9}(w_{m+1}^{6}A_{3,m}-w_{m+1}^{8}A_{1,m})\epsilon_{9}
[D,C,C,C,C,C,D]9\displaystyle[D,C,C,C,C,C,D]_{9} =τ9(wm+15A1,mA3,mwm+17A1,m2)ϵ9\displaystyle=\tau^{9}(w_{m+1}^{5}A_{1,m}A_{3,m}-w_{m+1}^{7}A_{1,m}^{2})\epsilon_{9} (92)
[D,D,C,C,C,C,D]9\displaystyle[D,D,C,C,C,C,D]_{9} =τ9(A1,m2A3,mwm+14A1,m3wm+16)ϵ9\displaystyle=\tau^{9}(A_{1,m}^{2}A_{3,m}w_{m+1}^{4}-A_{1,m}^{3}w_{m+1}^{6})\epsilon_{9} (93)
[D,D,D,C,C,C,D]9\displaystyle[D,D,D,C,C,C,D]_{9} =τ9(A1,m3A3,mwm+13A1,m4wm+15)ϵ9\displaystyle=\tau^{9}(A_{1,m}^{3}A_{3,m}w_{m+1}^{3}-A_{1,m}^{4}w_{m+1}^{5})\epsilon_{9} (94)
[D,D,D,D,C,C,D]9\displaystyle[D,D,D,D,C,C,D]_{9} =τ9(A1,m4A3,mwm+12A1,m5wm+14)ϵ9\displaystyle=\tau^{9}(A_{1,m}^{4}A_{3,m}w_{m+1}^{2}-A_{1,m}^{5}w_{m+1}^{4})\epsilon_{9} (95)
[D,D,D,D,D,C,D]9\displaystyle[D,D,D,D,D,C,D]_{9} =τ9(A1,m5A3,mwm+1A1,m6wm+13)ϵ9\displaystyle=\tau^{9}(A_{1,m}^{5}A_{3,m}w_{m+1}-A_{1,m}^{6}w_{m+1}^{3})\epsilon_{9} (96)

Note that all the terms previously computed have terms that can be written as in Lemma 6, thus proving that S(m+1)S^{(m+1)} can also be written in this way. ∎

Having proved Lemma 6, we can now compute the polynomials in Lemma 6. The polynomials are obtained from the recursion in Appendix A, the left hand side corresponds to S(m+1)S^{(m+1)} and can can be written as a single exponential, the same is true of the right side which is written as a single exponential. We have then the following polynomials:

A9,m+1\displaystyle A_{9,m+1} =A9,m+2wm+19\displaystyle=A_{9,m}+2w_{m+1}^{9} (97)
B9,m+1\displaystyle B_{9,m+1} =B9,m+16(A1,m2wm+17A1,mA7,mwm+1)\displaystyle=B_{9,m}+\frac{1}{6}(A_{1,m}^{2}w_{m+1}^{7}-A_{1,m}A_{7,m}w_{m+1})
16(A7,mwm+12A1,mwm+18)\displaystyle\quad-\frac{1}{6}(A_{7,m}w_{m+1}^{2}-A_{1,m}w_{m+1}^{8}) (98)
C9,m+1(1)\displaystyle C_{9,m+1}^{(1)} =C9,m(1)+16(A3,m2A1,mwm+15A1,mA5,mwm+13)\displaystyle=C_{9,m}^{(1)}+\frac{1}{6}(A_{3,m}^{2}A_{1,m}w_{m+1}^{5}-A_{1,m}A_{5,m}w_{m+1}^{3})
16(A5,mwm+14A3,mwm+16)\displaystyle\quad-\frac{1}{6}(A_{5,m}w_{m+1}^{4}-A_{3,m}w_{m+1}^{6}) (99)
C9,m+1(2)\displaystyle C_{9,m+1}^{(2)} =C9,m(2)+16(A3,m2A1,mwm+15A3,mA5,mwm+1)\displaystyle=C_{9,m}^{(2)}+\frac{1}{6}(A_{3,m}^{2}A_{1,m}w_{m+1}^{5}-A_{3,m}A_{5,m}w_{m+1})
16(A5,mwm+14A1,mwm+18)\displaystyle\quad-\frac{1}{6}(A_{5,m}w_{m+1}^{4}-A_{1,m}w_{m+1}^{8}) (100)
C9,m+1(3)\displaystyle C_{9,m+1}^{(3)} =C9,m(3)+16(A5,mA1,mwm+13A3,mA5,mwm+1)\displaystyle=C_{9,m}^{(3)}+\frac{1}{6}(A_{5,m}A_{1,m}w_{m+1}^{3}-A_{3,m}A_{5,m}w_{m+1})
16(A3,mwm+16A1,mwm+18)\displaystyle\quad-\frac{1}{6}(A_{3,m}w_{m+1}^{6}-A_{1,m}w_{m+1}^{8}) (101)
D9,m+1(1)\displaystyle D_{9,m+1}^{(1)} =D9,m(1)16(A1,mB7,mwm+1+wm+12B7,m)\displaystyle=D_{9,m}^{(1)}-\frac{1}{6}(A_{1,m}B_{7,m}w_{m+1}+w_{m+1}^{2}B_{7,m})
+7360(A5,mwm+14wm+18A1,m)\displaystyle\quad+\frac{7}{360}(A_{5,m}w_{m+1}^{4}-w_{m+1}^{8}A_{1,m})
1360(A1,m4wm+15A1,m3A5,mwm+1)\displaystyle\quad-\frac{1}{360}(A_{1,m}^{4}w_{m+1}^{5}-A_{1,m}^{3}A_{5,m}w_{m+1})
+190(A1,m3wm+16A1,m2A5,mwm+12)\displaystyle\quad+\frac{1}{90}(A_{1,m}^{3}w_{m+1}^{6}-A_{1,m}^{2}A_{5,m}w_{m+1}^{2})
+145(A1,mA5,mwm+13A1,m2wm+17)\displaystyle\quad+\frac{1}{45}(A_{1,m}A_{5,m}w_{m+1}^{3}-A_{1,m}^{2}w_{m+1}^{7})
160(A1,m2wm+17A1,mA5,mwm+13)\displaystyle\quad-\frac{1}{60}(A_{1,m}^{2}w_{m+1}^{7}-A_{1,m}A_{5,m}w_{m+1}^{3})
+130(A1,m2A5,mwm+12A1,m3wm+16)\displaystyle\quad+\frac{1}{30}(A_{1,m}^{2}A_{5,m}w_{m+1}^{2}-A_{1,m}^{3}w_{m+1}^{6}) (102)
D9,m+1(2)\displaystyle D_{9,m+1}^{(2)} =D9,m(2)16(A3,mB5,mwm+1+wm+14B5,m)\displaystyle=D_{9,m}^{(2)}-\frac{1}{6}(A_{3,m}B_{5,m}w_{m+1}+w_{m+1}^{4}B_{5,m})
+7360(A3,mwm+16wm+18A1,m)\displaystyle\quad+\frac{7}{360}(A_{3,m}w_{m+1}^{6}-w_{m+1}^{8}A_{1,m})
1360(A1,m3A3,mwm+13A1,m2A3,m2wm+1)\displaystyle\quad-\frac{1}{360}(A_{1,m}^{3}A_{3,m}w_{m+1}^{3}-A_{1,m}^{2}A_{3,m}^{2}w_{m+1})
+190(A1,m3wm+16A1,m2A3,mwm+14)\displaystyle\quad+\frac{1}{90}(A_{1,m}^{3}w_{m+1}^{6}-A_{1,m}^{2}A_{3,m}w_{m+1}^{4})
+145(A3,m2wm+13A1,mA3,mwm+15)\displaystyle\quad+\frac{1}{45}(A_{3,m}^{2}w_{m+1}^{3}-A_{1,m}A_{3,m}w_{m+1}^{5})
160(A1,m2wm+17A1,mA3,mwm+15)\displaystyle\quad-\frac{1}{60}(A_{1,m}^{2}w_{m+1}^{7}-A_{1,m}A_{3,m}w_{m+1}^{5})
+130(A3,m2A1,mwm+12A1,m2A3,mwm+14)\displaystyle\quad+\frac{1}{30}(A_{3,m}^{2}A_{1,m}w_{m+1}^{2}-A_{1,m}^{2}A_{3,m}w_{m+1}^{4}) (103)
D9,m+1(3)\displaystyle D_{9,m+1}^{(3)} =D9,m(3)16(A1,mB5,mwm+13+wm+14B5,m)\displaystyle=D_{9,m}^{(3)}-\frac{1}{6}(A_{1,m}B_{5,m}w_{m+1}^{3}+w_{m+1}^{4}B_{5,m})
+14360(A3,mwm+16wm+18A1,m)\displaystyle\quad+\frac{14}{360}(A_{3,m}w_{m+1}^{6}-w_{m+1}^{8}A_{1,m})
2360(A1,m3A3,mwm+13A1,m2A3,m2wm+1)\displaystyle\quad-\frac{2}{360}(A_{1,m}^{3}A_{3,m}w_{m+1}^{3}-A_{1,m}^{2}A_{3,m}^{2}w_{m+1})
+290(A1,m2A3,mwm+14A1,mA3,m2wm+12)\displaystyle\quad+\frac{2}{90}(A_{1,m}^{2}A_{3,m}w_{m+1}^{4}-A_{1,m}A_{3,m}^{2}w_{m+1}^{2})
+245(A3,mA1,mwm+15A1,m2wm+17)\displaystyle\quad+\frac{2}{45}(A_{3,m}A_{1,m}w_{m+1}^{5}-A_{1,m}^{2}w_{m+1}^{7})
160(A1,m2wm+17A1,mA3,mwm+15)\displaystyle\quad-\frac{1}{60}(A_{1,m}^{2}w_{m+1}^{7}-A_{1,m}A_{3,m}w_{m+1}^{5})
+130(A3,m2A1,mwm+12A1,m2A3,mwm+14)\displaystyle\quad+\frac{1}{30}(A_{3,m}^{2}A_{1,m}w_{m+1}^{2}-A_{1,m}^{2}A_{3,m}w_{m+1}^{4})
+16(A1,mC7,mwm+1+wm+12c7,m)\displaystyle\quad+\frac{1}{6}(A_{1,m}C_{7,m}w_{m+1}+w_{m+1}^{2}c_{7,m})
160(wm+15A1,mA3,mwm+13A3,m2)\displaystyle\quad-\frac{1}{60}(w_{m+1}^{5}A_{1,m}A_{3,m}-w_{m+1}^{3}A_{3,m}^{2})
+130(A1,m2A3,mwm+14A1,m3wm+16)\displaystyle\quad+\frac{1}{30}(A_{1,m}^{2}A_{3,m}w_{m+1}^{4}-A_{1,m}^{3}w_{m+1}^{6})
16(B5,mA1,mwm+13B5,mA3,mwm+1)\displaystyle\quad-\frac{1}{6}(B_{5,m}A_{1,m}w_{m+1}^{3}-B_{5,m}A_{3,m}w_{m+1}) (104)
E9,m+1\displaystyle E_{9,m+1} =E9,m16(A1,mD7,mwm+12D7,m)\displaystyle=E_{9,m}-\frac{1}{6}(A_{1,m}D_{7,m}-w_{m+1}^{2}D_{7,m})
+7360wm+14B5,m\displaystyle\quad+\frac{7}{360}w_{m+1}^{4}B_{5,m}
+1360A1,m3B5,mwm+1\displaystyle\quad+\frac{1}{360}A_{1,m}^{3}B_{5,m}w_{m+1}
190A1,m2B5,mwm+12\displaystyle\quad-\frac{1}{90}A_{1,m}^{2}B_{5,m}w_{m+1}^{2}
+145A1,mB5,mwm+13\displaystyle\quad+\frac{1}{45}A_{1,m}B_{5,m}w_{m+1}^{3}
+160A1,mB5,mwm+13\displaystyle\quad+\frac{1}{60}A_{1,m}B_{5,m}w_{m+1}^{3}
+130A1,m2B5,mwm+12\displaystyle\quad+\frac{1}{30}A_{1,m}^{2}B_{5,m}w_{m+1}^{2}
3115120(wm+16A3,mwm+18A1,m)\displaystyle\quad-\frac{31}{15120}(w_{m+1}^{6}A_{3,m}-w_{m+1}^{8}A_{1,m})
315040(wm+15A1,mA3,mwm+17A1,m2)\displaystyle\quad-\frac{31}{5040}(w_{m+1}^{5}A_{1,m}A_{3,m}-w_{m+1}^{7}A_{1,m}^{2})
131890(A1,m2A3,mwm+14A1,m3wm+16)\displaystyle\quad-\frac{13}{1890}(A_{1,m}^{2}A_{3,m}w_{m+1}^{4}-A_{1,m}^{3}w_{m+1}^{6})
5315120(A1,m3A3,mwm+13A1,m4wm+15)\displaystyle\quad-\frac{53}{15120}(A_{1,m}^{3}A_{3,m}w_{m+1}^{3}-A_{1,m}^{4}w_{m+1}^{5})
11260(A1,m4A3,mwm+12A1,m5wm+14)\displaystyle\quad-\frac{1}{1260}(A_{1,m}^{4}A_{3,m}w_{m+1}^{2}-A_{1,m}^{5}w_{m+1}^{4})
115120(A1,m5A3,mwm+1A1,m6wm+13).\displaystyle\quad-\frac{1}{15120}(A_{1,m}^{5}A_{3,m}w_{m+1}-A_{1,m}^{6}w_{m+1}^{3}). (105)

We obtain the polynomial equations for the tenth order product formula by imposing that A1,m=1A_{1,m}=1 and all other terms are equal to zero. Because C9,m(2)=C9,m(1)+C9,m(3)C^{(2)}_{9,m}=C^{(1)}_{9,m}+C^{(3)}_{9,m}, one equation is eliminated, there are 15 equations to solve.

Appendix B Method for determining processors

A processed formula is composed of two elements: a kernel, Σ(t)\Sigma(t), and a processor, P(t)P(t). The effective order kk captures up to which order in tt the full product formula, including the processor, reproduces the the target dynamics, P(t)Σ(t)P(t)1=e(X+Y)t+(tk+1)P(t)\Sigma(t)P(t)^{-1}=e^{(X+Y)t}+\order{t^{k+1}}. In this work, we use processors that are constructed with the same procedure as Ref. [24]. This type of processors are products of S2(τwi)S_{2}(\tau w_{i}) arranged as Q(m)(τ)Q(m)(τ)Q^{(m)}(\tau)Q^{(m)}(-\tau), where Q(m)(τ)=S2(τwm)S2(τwm1)S2(τw0)Q^{(m)}(\tau)=S_{2}(\tau w_{m})S_{2}(\tau w_{m-1})\cdots S_{2}(\tau w_{0}). In the language of Ref. [24], Q(m)(±τ)Q^{(m)}(\pm\tau) is an element of the group of integrators 𝒢3\mathcal{G}_{3}. The same reference gives a basis for the generating algebra, which we used to perform the following calculations. The basis can be found in Table 2 of Ref. [24], and we use the same naming scheme for the algebra elements.

We can obtain recursive formulas for Q(m)(τ)=S2(τwm)Q(m1)(τ)Q^{(m)}(\tau)=S_{2}(\tau w_{m})Q^{(m-1)}(\tau) via successive applications of the BCH formula (as opposed to the symmetric BCH formula in the case of the kernel). After taking the product Q(m)(τ)Q(m)(τ)Q^{(m)}(\tau)Q^{(m)}(-\tau), the processor simplifies compared to Q(m)Q^{(m)}. In particular, the logarithm of P(τ)P(\tau) is zero up to terms of order t3t^{3}. We illustrate the procedure that gives an iterative expression for the processor up to order 8, omitting the calculations. First, we write the product S2(τw1)S2(τw0)S_{2}(\tau w_{1})S_{2}(\tau w_{0}) in the Ei,jE_{i,j}-basis. We have, for the logarithm of Q(1)(τ)=S2(w1τ)S2(w0τ)Q^{(1)}(\tau)=S_{2}(w_{1}\tau)S_{2}(w_{0}\tau),

log(Q(1)(τ))\displaystyle\log(Q^{(1)}(\tau)) =p1,1(1)Y1t+p3,1(1)Y3t3+p4,1(1)E4,1t4+(p5,1(1)E5,1t5+p5,2(1)E5,2)+(p6,1(1)E6,1+p6,2(1)E6,2)t6+(t7),\displaystyle=p_{1,1}^{(1)}Y_{1}t+p_{3,1}^{(1)}Y_{3}t^{3}+p_{4,1}^{(1)}E_{4,1}t^{4}+\left(p_{5,1}^{(1)}E_{5,1}t^{5}+p_{5,2}^{(1)}E_{5,2}\right)+\left(p_{6,1}^{(1)}E_{6,1}+p_{6,2}^{(1)}E_{6,2}\right)t^{6}+\order{t^{7}}\,, (106)

where each coefficient is a function of w0,w1w_{0},w_{1}. Using an inductive argument analogous to the derivation in the case of the symmetric BCH formula, this expression will give a recursive expression for Q(m)(τ)=S2(τwm)Q(m1)(τ)Q^{(m)}(\tau)=S_{2}(\tau w_{m})Q^{(m-1)}(\tau), whose logarithm is an expansion in terms of pi,j(m)(w0,,wm)p^{(m)}_{i,j}(w_{0},\dots,w_{m}). The resulting expression is identical to the one above, with the superscripts (1)(1) replaced by (m)(m), and each coefficient is a polynomial of w0,,wmw_{0},\dots,w_{m}. In fact, it is not necessary to compute all the pp-coefficients to find P(m)(τ)P^{(m)}(\tau). For example, only three of them (out of seven) are necessary to find the processor up to order 66, since

log(P(m)(τ))=2p4,1(m)E4,1t4+p1,1(m)p4,1(m)E5,3t5+13((p1,1(m))2p4,1(m)E6,1+6p6,2(m)E6,2)t6+(t7).\displaystyle\log(P^{(m)}(\tau))=2p^{(m)}_{4,1}E_{4,1}t^{4}+p^{(m)}_{1,1}p^{(m)}_{4,1}E_{5,3}t^{5}+\frac{1}{3}\left(\left(p^{(m)}_{1,1}\right)^{2}p^{(m)}_{4,1}E_{6,1}+6p^{(m)}_{6,2}E_{6,2}\right)t^{6}+\order{t^{7}}\,. (107)

Similar cancellations occur at any order. In our simulations, terms up to eighth order are required. The seventh-order term in the expansion is (omitting the dependency on mm)

12p3,1p4,1E7,2+p1,1p6,1E7,3+(112p1,13p4,1+p1,1p6,2)E7,4,\displaystyle\frac{1}{2}p_{3,1}p_{4,1}E_{7,2}+p_{1,1}p_{6,1}E_{7,3}+\left(\frac{1}{12}p_{1,1}^{3}p_{4,1}+p_{1,1}p_{6,2}\right)E_{7,4}\,, (108)

while the eighth-order term is

13E8,4p1,12p6,1+2E8,1p8,1+2E8,2p8,2+12E8,3(p1,1p3,1p4,1p3,1p5,2+4p8,3)\displaystyle\frac{1}{3}E_{8,4}p_{1,1}^{2}p_{6,1}+2E_{8,1}p_{8,1}+2E_{8,2}p_{8,2}+\frac{1}{2}E_{8,3}\left(p_{1,1}p_{3,1}p_{4,1}-p_{3,1}p_{5,2}+4p_{8,3}\right)
+2E8,4p8,4+E8,5(160P1,14p4,1+13p1,12p6,2+2p8,5).\displaystyle+2E_{8,4}p_{8,4}+E_{8,5}\left(\frac{1}{60}P_{1,1}^{4}p_{4,1}+\frac{1}{3}p_{1,1}^{2}p_{6,2}+2p_{8,5}\right)\,. (109)

Note that not all the coefficients that appear in the expansion of Q(m)(τ)Q^{(m)}(\tau) are necessary.

Appendix C Error constants for fermionic Hamiltonians with 4 orbitals

In this appendix we provide the average error constant ω\omega for the eigenvalue error for the evolution of fermionic Hamiltonians in the case that the number of orbitals is d=4d=4. We find that the ω\omega obtained are close to those we obtain when d=6d=6.

label ω\omega Mω1/kM\omega^{1/k}
BCE6m10 3.3×10113.3\times 10^{-11} 0.360.36
PPBCM6m6 1.4×1091.4\times 10^{-9} 0.430.43
Table 14: Comparison of constant factors ω\omega for the best product formulae for 6th order. We generate 10001000 random Hamiltonians with d=4d=4 orbitals as in Eq. 48 and compute the average ω\omega.
label ω\omega Mω1/kM\omega^{1/k}
SS8s19 3.4×10113.4\times 10^{-11} 0.930.93
Y8m10 8.7×10128.7\times 10^{-12} 0.870.87
Y8m10b 1.5×10121.5\times 10^{-12} 0.680.68
PP8s13 4.2×10104.2\times 10^{-10} 0.870.87
YP8m8 2.3×10122.3\times 10^{-12} 0.590.59
Table 15: Comparison of constant factors ω\omega for the best product formulae for 8th order. We generate 10001000 random Hamiltonians with d=4d=4 orbitals as in Eq. 48 and compute the average ω\omega.
label ω\omega Mω1/kM\omega^{1/k}
SS10s35 3.0×10153.0\times 10^{-15} 1.241.24
Y10m17 2.5×10142.5\times 10^{-14} 1.53
Y10m18b 3.8×10143.8\times 10^{-14} 1.69
PP10s23 1.5×10121.5\times 10^{-12} 1.51
Table 16: Comparison of constant factors ω\omega for the best product formulae for 10th order. We generate 10001000 random Hamiltonians with d=4d=4 orbitals as in Eq. 48 and compute the average ω\omega.