This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Quantum algorithms for linear and non-linear fractional reaction-diffusion equations

Dong An1,  Konstantina Trivisa2,3
1 Joint Center for Quantum Information and Computer Science, University of Maryland, MD 20742, USA
2 Department of Mathematics, University of Maryland, MD 20742, USA
3 Institute for Physical Science and Technology, University of Maryland, MD 20742, USA
Abstract

High-dimensional fractional reaction-diffusion equations have numerous applications in the fields of biology, chemistry, and physics, and exhibit a range of rich phenomena. While classical algorithms have an exponential complexity in the spatial dimension, a quantum computer can produce a quantum state that encodes the solution with only polynomial complexity, provided that suitable input access is available. In this work, we investigate efficient quantum algorithms for linear and nonlinear fractional reaction-diffusion equations with periodic boundary conditions. For linear equations, we analyze and compare the complexity of various methods, including the second-order Trotter formula, time-marching method, and truncated Dyson series method. We also present a novel algorithm that combines the linear combination of Hamiltonian simulation technique with the interaction picture formalism, resulting in optimal scaling in the spatial dimension. For nonlinear equations, we employ the Carleman linearization method and propose a block-encoding version that is appropriate for the dense matrices that arise from the spatial discretization of fractional reaction-diffusion equations.

1 Introduction

Reaction–diffusion equations arise in many areas in science and engineering [33, 9, 35, 17, 22]. In population dynamics models in biology, the reaction term typically accounts for growth, whereas the diffusion term accounts for migration [24, 31]. The classical diffusion term has its origin in a model in physics. Recent research investigations indicate that the classical diffusion equation is inadequate to model many real situations, where a particle plume spreads faster than that predicted by the classical model, and may exhibit significant asymmetry [32]. In a fractional diffusion equation, the classical Laplace operator in the spatial variable is replaced by a fractional Laplacian of order less than two (Δ)α/2(-\Delta)^{\alpha/2} where 0<α20<\alpha\leq 2. The fundamental solutions of these equations still exhibit useful scaling properties that make them attractive for applications.

The present article deals with

tu(t,x)=(Δ)α/2u(t,x)c(t,x)u(t,x)+au(t,x)(1u(t,x)),t[0,T],x[0,1]d,u(0,x)=u0(x).\begin{split}\partial_{t}u(t,x)&=-(-\Delta)^{\alpha/2}u(t,x)-c(t,x)u(t,x)+au(t,x)(1-u(t,x)),\quad t\in[0,T],x\in[0,1]^{d},\\ u(0,x)&=u_{0}(x).\end{split} (1)

Here (Δ)α/2(-\Delta)^{\alpha/2} is the fractional Laplacian where 0<α20<\alpha\leq 2, and c(t,x)c(t,x) is the potential function. For the nonlinear term, in this work we only consider the quadratic non-linearity which yields Fisher’s equation, but our results can be generalized to equations with high-order polynomial non-linearity. For the fractional Laplacian operator, there are several different definitions of (Δ)α/2(-\Delta)^{\alpha/2} on bounded domain, including spectral definition and Riesz definition [28]. This works focuses on spectral fractional Laplacian with periodic boundary condition. Classical numerical algorithms for solving Equation 1 typically require exponential computational resources when the spatial variable is in high dimension: suppose that we use NN grid points or basis functions for spatial discretization in each dimension, then the dimension of semi-discretized differential equations becomes as large as NdN^{d}. Therefore we would like to explore the power of quantum algorithms for fractional reaction-diffusion equations and whether quantum algorithms can be efficient in high-dimensional case.

Quantum algorithms for differential equations aim at preparing a quantum state encoding the solutions at discrete grid points in its amplitudes. The first quantum differential equation algorithm was proposed in [6], which transforms differential equations into a linear system of equations using multi-step discretization and then applies quantum linear system algorithms such as HHL algorithm [21] or advanced ones [10, 36, 3, 27, 16]. Since then, there have been remarkable progresses on designing better quantum algorithms, for linear differential equations based on refined discretization [7, 12, 13, 26, 8], time-marching strategy [18], Schrödingerization [25], linear combination of unitaries technique [4], and for nonlinear differential equations using linearization techniques [29, 1]. With the caveat that the output is a quantum state encoding solutions in its amplitudes, these quantum algorithms can achieve exponential speedup in the system size compared to classical algorithms.

However, when directly applied to fractional reaction-diffusion equations, existing generic quantum algorithms are not as efficient as expected due to two major difficulties. First, similar as the standard Laplacian operator, fractional Laplacian operator is an unbounded operator, so its spatially discretized version has a huge spectral norm as the spatial dimension and the number of grid points increase. Most existing quantum differential equation algorithms scale at least linearly on the spectral norm of the coefficient matrix, and thus can be computationally expensive in high-dimensional case for accurate simulation. Second, when the equation is genuinely fractional (i.e., α<2\alpha<2), the coefficient matrix for the linear part after spatial discretization is unavoidably dense, because fractional differential operators are global operators that depends on the function evaluated in the entire space. The dense coefficient matrix poses computational difficulties in solving nonlinear equations. This is because all the existing quantum Carleman linearization algorithms require the coefficient matrix to be sparse in order to bypass the difficulty caused by the enlarged Carleman matrix, which is a direct sum of matrices in different dimensions [29].

We remark that the (fractional- or integer-order) Laplacian operator, and more general spatial differential operators, widely appear in various types of partial differential equations. In the contexts other than fractional reaction-diffusion equations, there have been several work managing to overcome the computational difficulty brought by its large spectral norm after spatial discretization, but unfortunately those techniques do not apply to the fractional reaction-diffusion equations. For example, in real-space Schrödinger equation, a poly-logarithmic dependence on the spectral norm of the Laplacian operator can be achieved by simulating the Hamiltonian in the interaction picture [30, 11], which simulates the transformed wavefunction under the rotation associated with the Laplacian operator. The resulting interaction picture Hamiltonian becomes bounded and thus can be efficiently simulated. To avoid the large spectral norm dependence in the rotations, the algorithm takes advantage of an important feature of the Laplacian operator: it can be diagonalized by the quantum Fourier transform (QFT) circuit and thus can be fastforwarded (i.e., Hamiltonian simulation governed by the Laplacian operator can be implemented with cost independent of evolution time and its spectral norm). However, such a technique does not directly work for fractional reaction-diffusion equations. The interaction picture transformation requires both forward and backward time evolution of the Laplacian operator. This is efficient for Schrödinger equations because the dynamics is reversible and both forward and backward time evolution operators are unitary, but fractional reaction-diffusion equation is a dissipative system and implementing its backward time evolution can be prohibitively expensive. The recent work [5] proposes efficient quantum algorithms for various partial differential equations beyond Schrödinger equations. The key technique there is a generalization of the fastforwarding simulation of the Laplacian operator. However, the algorithms in [5] require the entire coefficient matrix of the linear part to be fast-forwardable (i.e., its spectral decomposition has quantumly implementable eigenstates and classically computable eigenvalues), which cannot be satisfied even by linear fractional reaction-diffusion equations with the presence of the potential c(t,x)c(t,x).

Method Queries to the matrices
dd ϵ\epsilon TT Norm
Second-order Trotter (12) 𝒪~(dα(1/2+σ/2))\widetilde{\mathcal{O}}(d^{\alpha(1/2+\sigma/2)}) 𝒪~(ϵ1/2)\widetilde{\mathcal{O}}(\epsilon^{-1/2}) 𝒪~(T3/2)\widetilde{\mathcal{O}}(T^{3/2}) 𝒪~((g(T))3/2)\widetilde{\mathcal{O}}((g(T))^{3/2})
Time-marching (14) 𝒪~(dα(1+2σ))\widetilde{\mathcal{O}}(d^{\alpha(1+2\sigma)}) 𝒪(polylog(1/ϵ))\mathcal{O}(\text{poly}\log(1/\epsilon)) 𝒪~(T2)\widetilde{\mathcal{O}}(T^{2}) 𝒪~(Q)\widetilde{\mathcal{O}}(Q)
Dyson series (16) 𝒪~(dα(1/2+σ))\widetilde{\mathcal{O}}(d^{\alpha(1/2+\sigma)}) 𝒪(polylog(1/ϵ))\mathcal{O}(\text{poly}\log(1/\epsilon)) 𝒪~(T)\widetilde{\mathcal{O}}(T) 𝒪~(g(T))\widetilde{\mathcal{O}}(g(T))
LCHS-IP (18) 𝒪~(polylog(d))\widetilde{\mathcal{O}}(\text{poly}\log(d)) 𝒪~(ϵ1)\widetilde{\mathcal{O}}(\epsilon^{-1}) 𝒪~(T)\widetilde{\mathcal{O}}(T) 𝒪~(g(T)2)\widetilde{\mathcal{O}}(g(T)^{2})
Method Queries to the state preparation
dd ϵ\epsilon TT Norm
Second-order Trotter (12) 𝒪(1)\mathcal{O}(1) 𝒪(1)\mathcal{O}(1) 𝒪(1)\mathcal{O}(1) 𝒪(g(T))\mathcal{O}(g(T))
Time-marching (14) 𝒪(1)\mathcal{O}(1) 𝒪(1)\mathcal{O}(1) 𝒪(1)\mathcal{O}(1) 𝒪(Q)\mathcal{O}(Q)
Dyson series (16) 𝒪~(dα(1/2+σ))\widetilde{\mathcal{O}}(d^{\alpha(1/2+\sigma)}) 𝒪(polylog(1/ϵ))\mathcal{O}(\text{poly}\log(1/\epsilon)) 𝒪~(T)\widetilde{\mathcal{O}}(T) 𝒪~(g(T))\widetilde{\mathcal{O}}(g(T))
LCHS-IP (18) 𝒪(1)\mathcal{O}(1) 𝒪(1)\mathcal{O}(1) 𝒪(1)\mathcal{O}(1) 𝒪(g(T))\mathcal{O}(g(T))
Table 1: Query complexities of differential methods for linear fractional reaction-diffusion equations. Here dd is the spatial dimension, ϵ\epsilon is the tolerated error in 22-norm, TT is the evolution time, α\alpha is the half order of the fractional Laplacian operator ranging in (0,2](0,2] and σ\sigma is the parameter of the Gevrey class defined in Equation 40. The function g(T)u(0)/u(T)g(T)\geq\|\vec{u}(0)\|/\|\vec{u}(T)\| describes the decay of the spatially discretized solution, QQ is the decay corrected by the spectral norm of infinitesimal evolution operators as defined in Equation 65, and we always have Qg(T)Q\leq g(T).

In this work, we investigate efficient quantum algorithms for fractional reaction-diffusion equations. The majority of our work is devoted to linear fractional reaction-diffusion equations. We numerically treat the equations by the method of lines, i.e., first discretizing the spatial variable to obtain a system of ODEs

ddtu=BuC(t)u,\frac{d}{dt}\vec{u}=-B\vec{u}-C(t)\vec{u}, (2)

and then solving the resulting ODE system with different quantum ODE algorithms. Here u\vec{u} represents the solution evaluated at different spatial grid points, BB is the discretized fractional Laplacian operator, and C(t)C(t) is the potential matrix. For time evolution, we consider four different algorithms: second-order Trotter formula [14], time-marching method [18], truncated Dyson series method [8], and linear combination of Hamiltonian simulation in the interaction picture (LCHS-IP). We analyze the complexity of these four methods and the results are shown in Table 1. Our main results and contributions are summarized as follows:

  1. 1.

    Second-order Trotter formula: unlike Hamiltonian simulation, Equation 2 is not a unitary dynamics, and in the Trotter formula we need to implement non-unitary operators eBse^{-Bs} and eC(t)se^{-C(t)s}. We discuss efficient construction of these operators via controlled rotations, and implementing their multiplication through a generalization of the compression gadget technique [30, 18] with only poly-logarithmic many ancilla qubits. For complexity analysis, we derive an improved Trotter error bound that avoids the exponential factor in [14] and generalizes to the time-dependent case. Compared to time-marching and truncated Dyson series methods, second-order Trotter has better dependence on the dimension dd thanks to its commutator scalings, but has worse dependence on the precision.

  2. 2.

    Time-marching method: we directly apply the standard time-marching method in [18] and analyze its complexity for fractional reaction-diffusion equations. It has low state preparation cost and poly-logarithmic dependence on precision, but has worse scalings in the dimension dd and the evolution time TT.

  3. 3.

    Truncated Dyson series method: we directly apply the standard truncated Dyson series method in [8] and analyze its complexity for fractional reaction-diffusion equations. It still depends polynomially on the dimension due to its spectral norm dependence and has high state preparation cost due to the usage of quantum linear system algorithms, but can achieve poly-logarithmic scaling in the precision and linear scaling in time simultaneously.

  4. 4.

    LCHS-IP: The LCHS-IP method is a novel method that combines the linear combination of Hamiltonian simulation (LCHS) [4] technique with the interaction picture Hamiltonian simulation [30]. The LCHS method first represents the evolution operator of Equation 2 as a linear combination of several Hamiltonian simulation problems associated with the matrices BB and C(t)C(t). To avoid the computational overhead brought by the discretized fractional Laplacian BB, we implement each Hamiltonian simulation in the interaction picture by rotating the Hamiltonian with respect to BB. Therefore, the resulting algorithm only has poly-logarithm dependence on the dimension dd and thus is the most preferable algorithm in the high-dimensional case. It also has low state preparation cost, but only linear scaling in precision.

For nonlinear fractional reaction-diffusion equations, we discuss a block-encoding version of the Carleman linearization technique to deal with the dense coefficient matrix. Let u(t)\vec{u}(t) denote the solution vector of the fractional reaction-diffusion equation after spatial discretization. Standard quantum Carleman linearization algorithm [29] considers the dynamics of the enlarged vector [u(t);u(t)2;;u(t)M][\vec{u}(t);\vec{u}(t)^{\otimes 2};\cdots;\vec{u}(t)^{\otimes M}], which approximately satisfies a linear system of ODEs governed by a so-called Carleman matrix. Since u(t)m\vec{u}(t)^{\otimes m}’s are in different sizes for different powers, the Carleman matrix is a direct sum of matrices in different dimensions. In our work, we propose a simple generalization of the Carleman linearization, by extending the Carleman matrix to even higher dimension such that it becomes the direct sum of matrices in the same dimension and the corresponding solution vector [w1;w2;;wM][\vec{w}_{1};\vec{w}_{2};\cdots;\vec{w}_{M}] can be exactly mapped to the solution [u(t);u(t)2;;u(t)M][\vec{u}(t);\vec{u}(t)^{\otimes 2};\cdots;\vec{u}(t)^{\otimes M}] via discarding all the zero entries. Then the extended Carleman matrix can be easily block-encoded through the block-encoding of the original coefficient matrix, and thus the corresponding extended linearized system can be solved via quantum linear differential algorithms with block-encoding input model (e.g., [18, 8, 4]). This makes the quantum Carleman linearization algorithm applicable to the equations with dense coefficient matrices, including the fractional reaction-diffusion equations.

The rest of this paper is organized as follows. Section 2 first discusses the mathematical setup of the fractional reaction-diffusion equations, notations being used throughout the paper and some preliminary results. The main results of this paper start by Section 3 with the simplest case where the equations only involve a single fractional Laplacian operator without potential or nonlinear term, then Section 4 considers the general linear equations with potential. Quantum algorithm for nonlinear equations is presented in Section 5, followed by conclusion and open questions in Section 6.

2 Preliminaries

We start with a more rigorous setup of the fractional reaction-diffusion equation we are interested in and a summary of theoretical tools and technical lemmas being used in our analysis.

2.1 Setup

We consider the spatial fractional reaction-diffusion equations Equation 1. Let NN be a positive integer and we discretize the spatial variable xx using equi-distant nodes (j/N)(j/N), j[N]dj\in[N]^{d}. Quantum algorithms for solving Equation 1 aim at preparing a quantum state approximately encoding u(T,j/N)u(T,j/N) in its amplitude, i.e., 1(u(T,j/N))j[N]dj[N]du(T,j/N)|j\frac{1}{\|(u(T,j/N))_{j\in[N]^{d}}\|}\sum_{j\in[N]^{d}}u(T,j/N)\ket{j}.

There are several different definitions of (Δ)α/2(-\Delta)^{\alpha/2} on bounded domain (see the recent paper [28] for a comprehensive review). The spectral fractional Laplacian is defined using the eigenvalues and eigenfunctions of the original Laplacian (Δ)(-\Delta). Suppose that λj\lambda_{j}’s are the eigenvalues of (Δ)(-\Delta) and ej(x)e_{j}(x)’s are the corresponding eigenfunctions, then the spectral fractional Laplacian is defined to be

(Δ)α/2v(x)=jλjα/2(v,ej)L2ej(x)(-\Delta)^{\alpha/2}v(x)=\sum_{j}\lambda_{j}^{\alpha/2}(v,e_{j})_{L^{2}}e_{j}(x) (3)

where (,)L2(\cdot,\cdot)_{L^{2}} denotes the L2L^{2} inner product on [0,1]d[0,1]^{d}. An alternative definition is the Riesz fractional Laplacian

(Δ)α/2v(x)=2αΓ(α2+d2)πd/2|Γ(α2)|p.v.dv(x)v(y)|xy|d+α𝑑y.(-\Delta)^{\alpha/2}v(x)=\frac{2^{\alpha}\Gamma(\frac{\alpha}{2}+\frac{d}{2})}{\pi^{d/2}|\Gamma(-\frac{\alpha}{2})|}\text{p.v.}\int_{\mathbb{R}^{d}}\frac{v(x)-v(y)}{|x-y|^{d+\alpha}}dy. (4)

Here Γ(s)\Gamma(s) denotes the gamma function and p.v. refers to the principle value integral. Notice that the definition Equation 4 is not closed yet since the integral is over the entire space d\mathbb{R}^{d} while v(x)v(x) is confined in the cube [0,1]d[0,1]^{d}. Therefore we must enforce boundary condition on d[0,1]d\mathbb{R}^{d}\setminus[0,1]^{d}, e.g., the homogeneous Dirichlet boundary condition v(x)=0v(x)=0 for all xd[0,1]dx\in\mathbb{R}^{d}\setminus[0,1]^{d}. For periodic boundary condition, the definition is unambiguous that we will always follow Equation 3, while either definition is commonly used with other boundary conditions. In this work, we will focus on the periodic boundary condition and use the spectral definition for the fractional Laplacian operator.

2.2 Notations

We use a\vec{a} (a letter with an array above) to denote a possibly unnormalized vector. When u(t,x)u(t,x) refers to a scalar-valued function with arguments tt\in\mathbb{R} and x[0,1]dx\in[0,1]^{d}, we use (u(t,j/N))j[N]d(u(t,j/N))_{j\in[N]^{d}} or simply (u(t,j/N))(u(t,j/N)) to denote the NdN^{d}-dimensional vector with entries u(t,j/N)u(t,j/N), where j[N]dj\in[N]^{d} and [N]={0,1,,N1}[N]=\left\{0,1,\cdots,N-1\right\} for a positive integer NN. In our analysis, we also use u0\vec{u}_{0} as a shorthand notation of the initial condition vector (u(0,j/N))(u(0,j/N)), and u(t)\vec{u}(t) as (u(t,j/N))(u(t,j/N)) for a fixed time tt.

For a vector a\vec{a}, we use a\|\vec{a}\| without subscript for the standard vector 2-norm of a\vec{a}, and ap\|\vec{a}\|_{p} for vector pp-norm. The notation |a\ket{\vec{a}} with ket notation denotesthe corresponding quantum state, i.e., the normalized vector a/a\vec{a}/\|\vec{a}\|.

Let AA and BB be two matrices. We use [A,B][A,B] to denote the commutator between AA and BB, defined as ABBAAB-BA. For a matrix AA, A\|A\| denotes its spectral norm or equivalently the matrix 2-norm.

2.3 Quantum linear algebra

To design and analyze quantum algorithms for the spatial fractional reaction-diffusion equations, we need to frequently implement linear algebra operations, including matrix-vector multiplication, matrix-matrix addition and multiplication. To this end, we briefly introduce the concept and properties of block-encoding, which is a widely-used quantum input model for possibly non-unitary matrix.

We start with the definition of block-encoding.

Definition 1 (Block-encoding).

Suppose that AA is a 2s2^{s}-dimensional matrix, then we say that the (s+n)(s+n)-qubit unitary UU is an (α,n,ϵ)(\alpha,n,\epsilon)-block-encoding of AA, if

Aα((0|nI)U(|0nI))ϵ.\left\|A-\alpha\left((\bra{0}^{\otimes n}\otimes I)U(\ket{0}^{\otimes n}\otimes I)\right)\right\|\leq\epsilon. (5)

Intuitively, in the block-encoding, a matrix is represented as the upper-left block of a unitary matrix as

U(A/α).U\approx\left(\begin{array}[]{cc}A/\alpha&*\\ &*\end{array}\right). (6)

α\alpha is called the block-encoding factor such that αA\alpha\geq\|A\|, since the larger matrix UU is supposed to be unitary and the norm of its sub-block should be bounded by 11.

With block-encoding structure, we may implement matrix linear algebra operations. For example, matrix addition can be directly implemented by the linear combination of unitaries (LCU) technique, which has become an important subroutine in designing various quantum algorithms including Hamiltonian simulation [15], solving linear systems [10] and differential equations [4]. Here we present the LCU lemma from [19].

Lemma 2.

Let A=j=0m1yjAjA=\sum_{j=0}^{m-1}y_{j}A_{j} and y1β\|y\|_{1}\leq\beta. Suppose that (PL,PR)(P_{L},P_{R}) is a pair of bb-qubit unitaries such that PL|0j=0m1cj|jP_{L}\ket{0}\sum_{j=0}^{m-1}c_{j}\ket{j} and PR|0j=0m1dj|jP_{R}\ket{0}\sum_{j=0}^{m-1}d_{j}\ket{j} with j=0m1|β(cjdjyj)|<ϵ1\sum_{j=0}^{m-1}|\beta(c_{j}^{*}d_{j}-y_{j})|<\epsilon_{1}, and W=j=0m1|jj|UjW=\sum_{j=0}^{m-1}\ket{j}\bra{j}\otimes U_{j} where UjU_{j} is a (α,a,ϵ2)(\alpha,a,\epsilon_{2})-block-encoding of AjA_{j}. Then we can construct a (αβ,a+b,αϵ1+αβϵ2)(\alpha\beta,a+b,\alpha\epsilon_{1}+\alpha\beta\epsilon_{2})-block-encoding of AA with a single use of WW, PLP_{L}^{\dagger} and PRP_{R}.

Matrix multiplication can also be implemented via block-encodings. For two arbitrary matrices AA and BB, a straightforward approach of constructing the block-encoding of ABAB is to multiply together the block-encodings of AA and BB, as shown in the following lemma [19].

Lemma 3.

If UAU_{A} is an (αA,nA,ϵA)(\alpha_{A},n_{A},\epsilon_{A})-block-encoding of AA, and UBU_{B} is an (αB,nB,ϵB)(\alpha_{B},n_{B},\epsilon_{B})-block-encoding of BB, then (InBUA)(InAUB)(I_{n_{B}}\otimes U_{A})(I_{n_{A}}\otimes U_{B}) is an (αAαB,nA+nB,αAϵB+αBϵA)(\alpha_{A}\alpha_{B},n_{A}+n_{B},\alpha_{A}\epsilon_{B}+\alpha_{B}\epsilon_{A})-block-encoding of ABAB.

Despite its simplicity, such a straightforward approach for matrix multiplication may incur large space overhead. This is because we need to enlarge the ancilla register at each step of the multiplication, so the multiplication of JJ many matrices will in general require 𝒪(J)\mathcal{O}(J) ancilla qubits. To overcome this issue, [30] introduces a technique called compression gadget, which is further simplified in [18]. The circuit is given in Figure 1, and the idea of compression gadget is to use a counter register to keep track of the multiplication in a coherent way. Specifically, let ADD implements addition by 11 modulo the 2-power of the number of the qubits in the counter register. One can first apply ADD to lift the counter register from 0 to L1L-1. Then we sequentially apply the block-encodings using the same ancilla register, but after each block-encoding we apply a controlled ADD\text{ADD}^{\dagger} to reduce the counter by 11 if the corresponding block-encoding was successfully implemented. Therefore, a 0 final outcome of the counter register implies successful application of all the block-encodings. We present the result in [18], in which interested readers may find more details on its proof.

\Qcircuit@R=1em @C=1em Counter      & \gateADD^J \gateADD^† \qw \gateADD^† \qw ⋯   \qw \gateADD^† \qw
Ancilla       \multigate1U_0 \ctrlo-1 \multigate1U_1 \ctrlo-1 \qw ⋯   \multigate1U_J-1 \ctrlo-1 \qw
System       \ghostU_0 \qw \ghostU_1 \qw \qw ⋯   \ghostU_J-1 \qw \qw

Figure 1: Quantum circuit for compression gadget to block encode AJ1A1A0A_{J-1}\cdots A_{1}A_{0}. Here the Counter register contains log2J+1\lceil\log_{2}J\rceil+1 qubits and the ancilla register contains maxnj\max n_{j} qubits. ADD implements addition by 11 modulo 2log2J+12^{\lceil\log_{2}J\rceil+1}, and UjU_{j} is an (αj,nj,0)(\alpha_{j},n_{j},0)-block-encoding of the matrix AjA_{j}.
Lemma 4.

For 0jJ10\leq j\leq J-1, let UjU_{j} be an (αj,nj,0)(\alpha_{j},n_{j},0)-block-encoding of AjA_{j}. Then an (α,n,0)(\alpha,n,0)-block-encoding of AJ1A1A0A_{J-1}\cdots A_{1}A_{0} can be constructed using one application of each UjU_{j}, where α=α0α1αJ1\alpha=\alpha_{0}\alpha_{1}\cdots\alpha_{J-1} and n=maxnj+log2(J)+1n=\max n_{j}+\lceil\log_{2}(J)\rceil+1.

3 Linear equations without potential

We start with the simplest case where c(t,x)=0c(t,x)=0 and a=0a=0. In this case, the right hand side of Equation 1 only involves a fractional Laplacian operator. By the spectral definition, this operator has closed-form eigenvalues and known eigenfunctions, so the corresponding time-evolution operator can be implemented fast-forwardly [5] (i.e., query complexity is independent of the spectral norm and the evolution time). Here we present our algorithm in general high dimension and establish rigorous complexity estimate taking into consideration the spatial discretization errors.

We consider general dd-dimensional case. The eigenvalues and eigenfunctions of (Δ)(-\Delta) are 4π2(k02++kd12)4\pi^{2}(k_{0}^{2}+\cdots+k_{d-1}^{2}) and e2πi(k0x0++kd1xd1)e^{2\pi i(k_{0}x_{0}+\cdots+k_{d-1}x_{d-1})}, where x=(x0,,xd1)x=(x_{0},\cdots,x_{d-1}) denotes the spatial coordinate and k=(k0,,kd1)k=(k_{0},\cdots,k_{d-1}) denotes a set of integers. Let the Fourier series of u0(x)u_{0}(x) be

u0(x)=kdu^ke2πi(k0x0++kd1xd1),u_{0}(x)=\sum_{k\in\mathbb{Z}^{d}}\hat{u}_{k}e^{2\pi i(k_{0}x_{0}+\cdots+k_{d-1}x_{d-1})}, (7)

where u^k\hat{u}_{k} denotes the Fourier coefficients. Then the solution of Equation 1 has the form

u(T,x)=kdu^ke(4π2(k02++kd12))α/2Te2πi(k0x0++kd1xd1).u(T,x)=\sum_{k\in\mathbb{Z}^{d}}\hat{u}_{k}e^{-(4\pi^{2}(k_{0}^{2}+\cdots+k_{d-1}^{2}))^{\alpha/2}T}e^{2\pi i(k_{0}x_{0}+\cdots+k_{d-1}x_{d-1})}. (8)

Numerical solutions can be obtained by truncating the Fourier series at a finite order.

Equation 8 can be quantumly implemented using quantum Fourier transform (QFT) and controlled rotations. Recall that our goal is to prepare an approximation of the quantum state encoding the normalized solution at discrete spatial grid points (j/N)j[N]d(j/N)_{j\in[N]^{d}}. Let NN be the number of the grid points in each spatial dimension. Suppose that we are given the oracle Ou0O_{u_{0}} that prepares the normalized initial condition

|u0=1u0n[N]du0(n0/N,,nd1/N)|n0|nd1,\ket{u_{0}}=\frac{1}{\|\vec{u}_{0}\|}\sum_{n\in[N]^{d}}u_{0}(n_{0}/N,\cdots,n_{d-1}/N)\ket{n_{0}}\cdots\ket{n_{d-1}}, (9)

where

u0=n[N]du0(n0/N,,nd1/N)|n0|nd1.\vec{u}_{0}=\sum_{n\in[N]^{d}}u_{0}(n_{0}/N,\cdots,n_{d-1}/N)\ket{n_{0}}\cdots\ket{n_{d-1}}. (10)

We first compute a quantum state encoding the Fourier coefficients u^k\hat{u}_{k} by QFT. Specifically, let ωN=e2πi/N\omega_{N}=e^{2\pi i/N} and \mathcal{F} denote the one-dimensional QFT, i.e., for any computational basis state |j\ket{j},

|j=1Nl=0N1ωNjl|l.\mathcal{F}\ket{j}=\frac{1}{\sqrt{N}}\sum_{l=0}^{N-1}\omega_{N}^{jl}\ket{l}. (11)

Then

(1)du0\displaystyle(\mathcal{F}^{-1})^{\otimes d}\vec{u}_{0} =1Nd/2n[N]du0(n0/N,,nd1/N)(m0=0N1ωNn0m0|m0)(md1=0N1ωNnd1md1|md1)\displaystyle=\frac{1}{N^{d/2}}\sum_{n\in[N]^{d}}u_{0}(n_{0}/N,\cdots,n_{d-1}/N)\left(\sum_{m_{0}=0}^{N-1}\omega_{N}^{-n_{0}m_{0}}\ket{m_{0}}\right)\cdots\left(\sum_{m_{d-1}=0}^{N-1}\omega_{N}^{-n_{d-1}m_{d-1}}\ket{m_{d-1}}\right) (12)
=1Nd/2m[N]dn[N]du0(n0/N,,nd1/N)ωNn0m0nd1md1|m0|md1\displaystyle=\frac{1}{N^{d/2}}\sum_{m\in[N]^{d}}\sum_{n\in[N]^{d}}u_{0}(n_{0}/N,\cdots,n_{d-1}/N)\omega_{N}^{-n_{0}m_{0}-\cdots-n_{d-1}m_{d-1}}\ket{m_{0}}\cdots\ket{m_{d-1}} (13)
=1Nd/2m[N]dn[N]dkdu^ke2πi(k0n0/N++kd1nd1/N)ωNn0m0nd1md1|m0|md1\displaystyle=\frac{1}{N^{d/2}}\sum_{m\in[N]^{d}}\sum_{n\in[N]^{d}}\sum_{k\in\mathbb{Z}^{d}}\hat{u}_{k}e^{2\pi i(k_{0}n_{0}/N+\cdots+k_{d-1}n_{d-1}/N)}\omega_{N}^{-n_{0}m_{0}-\cdots-n_{d-1}m_{d-1}}\ket{m_{0}}\cdots\ket{m_{d-1}} (14)
=1Nd/2m[N]dkdu^k(n0=0N1ωNn0(k0m0))(nd1=0N1ωNnd1(kd1md1))|m0|md1\displaystyle=\frac{1}{N^{d/2}}\sum_{m\in[N]^{d}}\sum_{k\in\mathbb{Z}^{d}}\hat{u}_{k}\left(\sum_{n_{0}=0}^{N-1}\omega_{N}^{n_{0}(k_{0}-m_{0})}\right)\cdots\left(\sum_{n_{d-1}=0}^{N-1}\omega_{N}^{n_{d-1}(k_{d-1}-m_{d-1})}\right)\ket{m_{0}}\cdots\ket{m_{d-1}} (15)
=Nd/2m[N]djdu^m+jN|m0|md1.\displaystyle=N^{d/2}\sum_{m\in[N]^{d}}\sum_{j\in\mathbb{Z}^{d}}\hat{u}_{m+jN}\ket{m_{0}}\cdots\ket{m_{d-1}}. (16)

When the function u0(x)u_{0}(x) satisfies certain regularity assumption (to be specified later), the summation jdu^m+jN\sum_{j\in\mathbb{Z}^{d}}\hat{u}_{m+jN} is dominated by the index with smallest absolute value, because the Fourier coefficients decay rapidly with respect to its frequency. Therefore

(1)d|u0Nd/2u0m[N]du^i(m)|m0|md1,(\mathcal{F}^{-1})^{\otimes d}\ket{\vec{u}_{0}}\approx\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}\ket{m_{0}}\cdots\ket{m_{d-1}}, (17)

where i(m)=(i0(m),,id1(m))i(m)=(i_{0}(m),\cdots,i_{d-1}(m)) represents an dd-dimensional vector with the jj-th index

ij(m)={mj, if 0mjN/2mjN, if N/2+1mjN1.i_{j}(m)=\begin{cases}m_{j},&\text{ if }0\leq m_{j}\leq N/2\\ m_{j}-N,&\text{ if }N/2+1\leq m_{j}\leq N-1.\end{cases} (18)

Now we append three ancilla registers to Equation 17 and get the (approximate) quantum state

Nd/2u0m[N]du^i(m)|m0|md1|0|0|0.\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{0}\ket{0}\ket{0}. (19)

The first two ancilla registers are used to binarily encode e(4π2(i0(m)2++id1(m)2))α/2Te^{-(4\pi^{2}(i_{0}(m)^{2}+\cdots+i_{d-1}(m)^{2}))^{\alpha/2}T} and the third register is used for rotation. Specifically, suppose that we are given the oracle that encodes the eigenvalues as

O1:|m0|md1|0|m0|md1|(2πi(m))α,O_{1}:\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{0}\rightarrow\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{(2\pi\|i(m)\|)^{\alpha}}, (20)

and the oracle for computing an exponential function as

Oexp,1:|x|0|x|exT.O_{\exp,1}:\ket{x}\ket{0}\rightarrow\ket{x}\ket{e^{-xT}}. (21)

Notice that both functions have closed-form expression so the oracles can be efficiently constructed using classical arithmetic [34]. Applying O1O_{1} on the index and first ancilla registers and then Oexp,1O_{\exp,1} on the first and second ancilla registers yields

Nd/2u0m[N]du^i(m)|m0|md1|i(m)|e(2πi(m))αT|0.\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{\|i(m)\|}\ket{e^{-(2\pi\|i(m)\|)^{\alpha}T}}\ket{0}. (22)

Let c-R\text{c-}R denote the controlled rotation

c-R:|θ|0|θ(θ|0+1|θ|2|1).\text{c-}R:\ket{\theta}\ket{0}\rightarrow\ket{\theta}\left(\theta\ket{0}+\sqrt{1-|\theta|^{2}}\ket{1}\right). (23)

Then, by applying c-R\text{c-}R on the last two registers in Equation 22, we obtain

Nd/2u0m[N]du^i(m)e(2πi(m))αT|m0|md1|i(m)|e(2πi(m))αT|0+|,\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}e^{-(2\pi\|i(m)\|)^{\alpha}T}\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{\|i(m)\|}\ket{e^{-(2\pi\|i(m)\|)^{\alpha}T}}\ket{0}+\ket{\perp}, (24)

where |\ket{\perp} represents the orthogonal part with 11 in the last ancilla register. Uncomputing the first two ancilla registers yields

Nd/2u0m[N]du^i(m)e(2πi(m))αT|m0|md1|0|0|0+|,\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}e^{-(2\pi\|i(m)\|)^{\alpha}T}\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{0}\ket{0}\ket{0}+\ket{\perp}, (25)

Notice that, by replacing u0(x)u_{0}(x) with u(T,x)u(T,x) in Equation 17 and using Equation 8, we have

(1)du(T,x)Nd/2m[N]du^i(m)e(2πi(m))αT|m0|md1.(\mathcal{F}^{-1})^{\otimes d}\vec{u}(T,x)\approx N^{d/2}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}e^{-(2\pi\|i(m)\|)^{\alpha}T}\ket{m_{0}}\cdots\ket{m_{d-1}}. (26)

Therefore, applying the QFT d\mathcal{F}^{\otimes d} to Equation 25 approximately gives

1u0n[N]du(T,n/N)|n0|nd1|0|0|0+|.\frac{1}{\|\vec{u}_{0}\|}\sum_{n\in[N]^{d}}u(T,n/N)\ket{n_{0}}\cdots\ket{n_{d-1}}\ket{0}\ket{0}\ket{0}+\ket{\perp}. (27)

Measuring the ancilla registers to get all 0’s yields an approximation of |u(T)\ket{u(T)}, and the averaged number of repeats for success after amplitude amplification scales 𝒪(u0/u(T))\mathcal{O}(\|\vec{u}_{0}\|/\|\vec{u}(T)\|).

\Qcircuit@R=1em @C=1em —u_0⟩    & \gate(F^-1)^⊗d \multigate1O_1 \qw \qw \qw \multigate1O_1^† \gateF^⊗d \qw
—0⟩     \qw \ghostO_1 \multigate1O_exp,1 \qw \multigate1O_exp,1^† \ghostO_1^† \qw \meter
—0⟩     \qw \qw \ghostO_exp,1 \ctrl1 \ghostO_exp,1^† \qw \qw \meter
—0⟩     \qw \qw \qw \gateR \qw \qw \qw \meter

Figure 2: Quantum circuit for solving linear fractional reaction-diffusion equations without potential. Here O1O_{1}, Oexp,1O_{\exp,1} are the oracles defined in Equation 20 and Equation 21, and RR is the single-qubit rotation gate defined in Equation 23.

We summarize the quantum algorithm in Figure 2 and present its error and complexity estimates in the following result. The main result is that the overall error will decrease as the number of the grid points increases, since more grid points imply larger truncation order in the Fourier series. Detailed proof can be found in Appendix B.

Theorem 5.

Consider solving Equation 1 with a=c=0a=c=0. Suppose that we are given the state preparation oracle Ou0O_{u_{0}} and the oracles O1O_{1}, Oexp,1O_{\exp,1} as defined in Equation 20 and Equation 21. Furthermore, suppose that there exists an integer pd+2p\geq d+2 such that u(t,x)u(t,x) is pp-th order spatially continuously differentiable. Then there exists a quantum algorithm that prepares an approximation of the quantum state |u(T)\ket{\vec{u}(T)} with success probability 1Ω(1)1-\Omega(1) and 22-norm error at most

4maxj,t{0,T}xjpu(t,x)L1u(T)(π/2)pNpd,\frac{4\max_{j,t\in\left\{0,T\right\}}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\|\vec{u}(T)\|(\pi/2)^{p}N^{p-d}}, (28)

using 𝒪(u0/u(T))\mathcal{O}(\|\vec{u}_{0}\|/\|\vec{u}(T)\|) queries to Ou0,O1,Oexp,1O_{u_{0}},O_{1},O_{\exp,1}, their inverse and controlled versions, and 𝒪((u0/u(T))dlog2(N))\mathcal{O}((\|\vec{u}_{0}\|/\|\vec{u}(T)\|)d\log^{2}(N)) additional gates.

4 Linear equations with potential

Consider the linear spatial fractional reaction-diffusion equations

tu(t,x)=(Δ)α/2u(t,x)c(t,x)u(t,x),t[0,T],x[0,1]d,u(0,x)=u0(x).\begin{split}\partial_{t}u(t,x)&=-(-\Delta)^{\alpha/2}u(t,x)-c(t,x)u(t,x),\quad t\in[0,T],x\in[0,1]^{d},\\ u(0,x)&=u_{0}(x).\end{split} (29)

Throughout this section, we assume the potential function c(t,x)c(t,x) to be non-negative. We will first discuss the validity of this assumption by so-called shifting equivalence. The equation Equation 29 can then be treated by the method of lines, i.e., first discretizing the spatial variable to obtain a system of ODEs, and then solving the ODE with different quantum algorithms. We consider four different quantum algorithms: second-order Trotter formula, time-marching method, truncated Dyson series method, and linear combination of Hamiltonian simulation in the interaction picture.

4.1 Shifting equivalence

We consider the shifted PDE

tv(t,x)=(Δ)α/2v(t,x)c~(t,x)v(t,x),t[0,T],x[0,1]d,v(0,x)=u0(x).\begin{split}\partial_{t}v(t,x)&=-(-\Delta)^{\alpha/2}v(t,x)-\widetilde{c}(t,x)v(t,x),\quad t\in[0,T],x\in[0,1]^{d},\\ v(0,x)&=u_{0}(x).\end{split} (30)

Here

c~(t,x)=c(t,x)γ(t)\widetilde{c}(t,x)=c(t,x)-\gamma(t) (31)

where γ(t)\gamma(t) is a real scalar-valued function. An important observation is that Equation 29 and Equation 30 are quantumly equivalent in the sense that the normalized solutions are the same. Specifically, let uu denote the solution of Equation 29, then

t(e0tγ(s)𝑑su)=e0tγ(s)𝑑stu+γ(t)e0tγ(s)𝑑su=e0tγ(s)𝑑s((Δ)α/2uc(t,x)u)+γ(t)e0tγ(s)𝑑su=(Δ)α/2(e0tγ(s)𝑑su)c~(t,x)(e0tγ(s)𝑑su).\begin{split}\partial_{t}(e^{\int_{0}^{t}\gamma(s)ds}u)&=e^{\int_{0}^{t}\gamma(s)ds}\partial_{t}u+\gamma(t)e^{\int_{0}^{t}\gamma(s)ds}u\\ &=e^{\int_{0}^{t}\gamma(s)ds}\left(-(-\Delta)^{\alpha/2}u-c(t,x)u\right)+\gamma(t)e^{\int_{0}^{t}\gamma(s)ds}u\\ &=-(-\Delta)^{\alpha/2}(e^{\int_{0}^{t}\gamma(s)ds}u)-\widetilde{c}(t,x)(e^{\int_{0}^{t}\gamma(s)ds}u).\end{split} (32)

Therefore e0tγ(s)𝑑sue^{\int_{0}^{t}\gamma(s)ds}u solves Equation 30 and only differs uu by a multiplicative constant factor at fixed time, which implies that they are the same in a quantum state representation, i.e.,

1un[N]du(t,n/N)|n=1e0tγ(s)𝑑sun[N]de0tγ(s)𝑑su(t,n/N)|n.\frac{1}{\|\vec{u}\|}\sum_{n\in[N]^{d}}u(t,n/N)\ket{n}=\frac{1}{\|e^{\int_{0}^{t}\gamma(s)ds}\vec{u}\|}\sum_{n\in[N]^{d}}e^{\int_{0}^{t}\gamma(s)ds}u(t,n/N)\ket{n}. (33)

We will choose the shifting parameter to be

γ(t)=minxc(t,x).\gamma(t)=\min_{x}c(t,x). (34)

Therefore the function c~(t,x)\widetilde{c}(t,x) is non-negative. For notation simplicity, we will directly assume that the original c(t,x)c(t,x) in Equation 29 to be non-negative instead of introducing new notations with tildes.

4.2 Spatial discretization

Our goal is to prepare a quantum state encoding u(T,x)u(T,x) at equi-distant grid points (n0/N,,nd1/N)(n_{0}/N,\cdots,n_{d-1}/N) where nj[N]n_{j}\in[N]. Motivated by the spectral decomposition, we define

B=()dD(1)dB=(\mathcal{F})^{\otimes d}D(\mathcal{F}^{-1})^{\otimes d} (35)

where DD is an NdN^{d}-dimensional diagonal matrix

D=n[N]d2απαi(n)α|n0|nd1n0|nd1|.D=\sum_{n\in[N]^{d}}2^{\alpha}\pi^{\alpha}\|i(n)\|^{\alpha}\ket{n_{0}}\cdots\ket{n_{d-1}}\bra{n_{0}}\cdots\bra{n_{d-1}}. (36)

For each tt, let C(t)C(t) be an NdN^{d}-dimensional diagonal matrix

C(t)=n[N]dc(t,n/N)|n0|nd1n0|nd1|.C(t)=\sum_{n\in[N]^{d}}c(t,n/N)\ket{n_{0}}\cdots\ket{n_{d-1}}\bra{n_{0}}\cdots\bra{n_{d-1}}. (37)

Then we consider the spatially discretized equation

ddtu=BuC(t)u,t[0,T]u(0)=u0.\begin{split}\frac{d}{dt}\vec{u}&=-B\vec{u}-C(t)\vec{u},\quad t\in[0,T]\\ \vec{u}(0)&=\vec{u}_{0}.\end{split} (38)

In order to bound the spatial discretization error by ϵ\epsilon, we need to choose a sufficiently large NN. In the following, we derive an error bound in terms of NN. The proof of this result can be found in Appendix C.

Lemma 6.

Let uu be the exact solution of Equation 29 and u\vec{u} be the solution of Equation 38. Suppose u(t,x)u(t,x) is pp-th order spatially continuously differentialable where pd+α+2p\geq d+\alpha+2, then

(u(T,n/N))n[N]du(T)T2p+1dα/2maxt,jxjpu(t,x)L1πpαNpdα.\|(u(T,n/N))_{n\in[N]^{d}}-\vec{u}(T)\|\leq T\frac{2^{p+1}d^{\alpha/2}\max_{t,j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-d-\alpha}}. (39)

6 tells that, similar to the case without potential, the order of the spatial discretization error convergence depends on the smoothness of the solution. In particular, it can be exponential convergence if the solution is within the Gevrey class. The Gevrey class includes infinitely differentiable functions whose pp-th order derivative grows polynomially in p!p!. Notice that Gevrey class greatly enlarges the class of real analytic functions, since the Taylor series will only have convergence radius 0 if the pp-th order derivative scales super-linearly in p!p!. We give the explicit error bounds and the choice of the grid for bounded errors in the next two results. Their proof can be found in Appendix C as well.

Corollary 7.

Consider solving Equation 29 on discrete grid points (T,n/N)(T,n/N) where n[N]dn\in[N]^{d}. Suppose that the exact solution u(t,x)u(t,x) is in the Gevrey class GσG^{\sigma} in the sense that u(t,x)u(t,x) is smooth and there exist constants Λ>0\Lambda>0 and σ0\sigma\geq 0 such that

supj[d],t[0,T],x[0,1]d|xjpu(t,x)|Λp+1(p!)σ.\sup_{j\in[d],t\in[0,T],x\in[0,1]^{d}}|\partial_{x_{j}}^{p}u(t,x)|\leq\Lambda^{p+1}(p!)^{\sigma}. (40)

Then, for any N(2Λ/π)(d+α+2)σN\geq(2\Lambda/\pi)(d+\alpha+2)^{\sigma}, we have

(u(T,n/N))n[N]du(T)c1T(c2d)c3ddα/2exp(c4N1/σ),\|(u(T,n/N))_{n\in[N]^{d}}-\vec{u}(T)\|\leq c_{1}T(c_{2}d)^{c_{3}d}d^{\alpha/2}\exp\left(-c_{4}N^{1/\sigma}\right), (41)

where cjc_{j}’s are constants only depending on σ,α\sigma,\alpha and Λ\Lambda.

Corollary 8.

Consider solving Equation 29 on discrete grid points (T,n/N)(T,n/N) where n[N]dn\in[N]^{d}. Suppose that the exact solution u(t,x)u(t,x) is in the Gevrey class GσG^{\sigma} as in 7. Then, in order to bound the spatial discretization error in the quantum state (i.e., |(u(T,n/N))n[N]d|u(T)\|\ket{(u(T,n/N))_{n\in[N]^{d}}}-\ket{\vec{u}(T)}\|) by ϵ\epsilon, it suffices to choose

N=𝒪((dlogd+log(T(u(T,n/N))n[N]d)+log(1ϵ))σ).N=\mathcal{O}\left(\left(d\log d+\log\left(\frac{T}{\|(u(T,n/N))_{n\in[N]^{d}}\|}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\sigma}\right). (42)

4.3 Second-order Trotter formula

We solve Equation 38 by Trotter formula. In particular, we divide [0,T][0,T] into rr equi-length segments and let h=T/rh=T/r. Consider

𝒯e0T(BC(t))𝑑tj=0r1S2((j+1)h,jh)\mathcal{T}e^{\int_{0}^{T}(-B-C(t))dt}\approx\prod_{j=0}^{r-1}S_{2}((j+1)h,jh) (43)

where S2(jh,(j+1)h)S_{2}(jh,(j+1)h) is the second-order time-dependent Trotter method, aiming at approximating 𝒯ejh(j+1)h(BC(t))𝑑t\mathcal{T}e^{\int_{jh}^{(j+1)h}(-B-C(t))dt} and defined as

S2((j+1)h,jh)=eBh/2eC((j+1/2)h)heBh/2.S_{2}((j+1)h,jh)=e^{-Bh/2}e^{-C((j+1/2)h)h}e^{-Bh/2}. (44)

In this subsection, we first derive a bound of the Trotter error and show the choice of the number of the segments for bounded errors. Then we discuss how to quantumly implement the numerical scheme and estimate its complexity.

4.3.1 Error bound, commutator scalings, and the number of the time steps

We first bound the distance between the exact evolution operator and the numerical integrators.

Lemma 9.

Consider solving Equation 38 using second-order Trotter formula j=0r1S2((j+1)h,jh)\prod_{j=0}^{r-1}S_{2}((j+1)h,jh) with time step size h=T/rh=T/r, where the local integrator S2S_{2} is defined in Equation 44. Then

𝒯e0T(BC(t))𝑑tj=0r1S2((j+1)h,jh)Th2(124maxC′′+14(B+maxC)maxC+16max[B,[B,C]]+14max[B,C]maxC+13maxC3),\begin{split}&\quad\left\|\mathcal{T}e^{\int_{0}^{T}(-B-C(t))dt}-\prod_{j=0}^{r-1}S_{2}((j+1)h,jh)\right\|\\ &\leq Th^{2}\left(\frac{1}{24}\max\|C^{\prime\prime}\|+\frac{1}{4}(\|B\|+\max\|C\|)\max\|C^{\prime}\|\right.\\ &\quad\quad\quad\quad\quad\left.+\frac{1}{6}\max\|[B,[B,C]]\|+\frac{1}{4}\max\|[B,{C}]\|\max\|{C}\|+\frac{1}{3}\max\|{C}\|^{3}\right),\end{split} (45)

where all the maximums are taken over t[0,T]t\in[0,T].

The proof of 9 can be found in Appendix D, which contains two parts. First, we deal with the time-ordering operator by bounding the distance between 𝒯e0h(BC(t))𝑑t\mathcal{T}e^{\int_{0}^{h}(-B-C(t))dt} and e(BC(h/2))he^{(-B-C(h/2))h} using the variation of parameters formula. This part contributes to the first two terms in the error bound (the second line of Equation 45) involving time derivatives of the potential matrix C(t)C(t). Then we can bound the error between e(BC(h/2))he^{(-B-C(h/2))h} and S2S_{2} by time-independent Trotter error bounds. Notice that the proof for the second part is different from that for Hamiltonian simulation [14], since in our case the parameters β\beta and γ\gamma in the exponentials eβBe^{-\beta B} and eγCe^{-\gamma C} are restricted to non-negative to avoid exponential overhead, while for Hamiltonian simulation there is no such restriction. Instead, we mostly follow the procedure in [23] to establish the second part of the error bound. This contributes to the last three terms in the error bound (the third line of Equation 45) involving the commutators and the spectral norm of CC explicitly.

Now we compute the norm of the commutators [B,C(t)][B,{C}(t)] and [B,[B,C(t)]][B,[B,{C}(t)]]. All the results are for a fixed tt, so we will omit the explicit tt dependence in our notation for now. Naive bounds are [B,C]𝒪(B)=𝒪(dα/2Nα)\|[B,{C}]\|\leq\mathcal{O}(\|B\|)=\mathcal{O}(d^{\alpha/2}N^{\alpha}) and [B,[B,C]]𝒪(B2)=𝒪(dαN2α)\|[B,[B,{C}]]\|\leq\mathcal{O}(\|B\|^{2})=\mathcal{O}(d^{\alpha}N^{2\alpha}). However, these naive bounds are improvable, because the order of the commutator of the Laplacian operator can be reduced. Such a phenomenon has been observed in [23, 2]. To see this, let us take α=2\alpha=2 and consider its continuous analog. Then for any smooth function f(x)f(x),

[Δ,c]f=Δ(cf)c(Δf)=(Δc)f2(c)(f)cΔf+cΔf=(Δc)f2(c)(f).\begin{split}[-\Delta,{c}]f&=-\Delta({c}f)-{c}(-\Delta f)\\ &=-(\Delta{c})f-2(\nabla{c})\cdot(\nabla f)-{c}\Delta f+{c}\Delta f\\ &=-(\Delta{c})f-2(\nabla{c})\cdot(\nabla f).\end{split} (46)

So we expect, in the discrete setting, the first commutator is bounded by the discretized divergence operator, whose norm is only 𝒪(dN)\mathcal{O}(dN).

Now we state the improved bound in the discrete setting. The proof can be found in Appendix E, which is a generalization of [23].

Lemma 10.

Suppose that c(t,x){c}(t,x) is a bounded C5+dC^{5+d} function in xx. Then we have

[B,C(t)]𝒪(dα/2Nα/2)\|[B,{C}(t)]\|\leq\mathcal{O}(d^{\alpha/2}N^{\alpha/2}) (47)

and

[B,[B,C(t)]]𝒪(dαNα).\|[B,[B,{C}(t)]]\|\leq\mathcal{O}(d^{\alpha}N^{\alpha}). (48)

The choice of the number of time steps is a direct consequence of 9 and 10.

Corollary 11.

Consider solving Equation 38 using second-order Trotter formula j=0r1S2((j+1)h,jh)\prod_{j=0}^{r-1}S_{2}((j+1)h,jh) with time step size h=T/rh=T/r, where the local integrator S2S_{2} is defined in Equation 44. Suppose that C(k)(t)\|{C}^{(k)}(t)\|’s are uniformly bounded in tt for kd+5k\leq d+5. Then, in order to bound the operator splitting error by ϵ\epsilon, it suffices to choose

r=𝒪(dα/2Nα/2T3/2ϵ1/2).r=\mathcal{O}\left(d^{\alpha/2}N^{\alpha/2}\frac{T^{3/2}}{\epsilon^{1/2}}\right). (49)

4.3.2 Quantum implementation and complexity estimate

Now we discuss quantum implementation of this method. The main idea is to construct the block-encodings of eBh/2e^{-Bh/2} and eC((j+1/2)h)he^{-{C}((j+1/2)h)h} and multiply them together using 4. The block-encodings can be constructed using controlled rotations since each evolution operator is unitarily equivalent to a diagonal matrix and the corresponding unitary transformation matrix is efficiently implementable. The nuance is that the operator eC((j+1/2)h)he^{-{C}((j+1/2)h)h} still depends on the specific time, so we use the counter register in the compression gadget as the time clock as well.

Suppose that we are given the oracle O1O_{1}, defined in Equation 20, that encodes the eigenvalues of BB, and the oracle O2O_{2} that gives the element of C(t)C(t) as

O2:|t|n0|nd1|0|t|n0|nd1|c(t,n/N).O_{2}:\ket{t}\ket{n_{0}}\cdots\ket{n_{d-1}}\ket{0}\rightarrow\ket{t}\ket{n_{0}}\cdots\ket{n_{d-1}}\ket{c(t,n/N)}. (50)

Here, with an abuse of notations, |t\ket{t} represents some specific way of encoding the information of tt, i.e., (2j+1)(2j+1) for the time point (j+1/2)h(j+1/2)h after time discretization (the reason why it is (2j+1)(2j+1) will be clear soon).

As discussed before, we may use the circuit in Figure 2 with a replacement of Oexp,1O_{\exp,1} by Oexp,2:|x|0|x|exh/2O_{\exp,2}:\ket{x}\ket{0}\rightarrow\ket{x}\ket{e^{-xh/2}} to construct a (1,,0)(1,*,0)-block-encoding of eBh/2e^{-Bh/2}. Here the number of the ancilla qubits depends on those for encoding the eigenvalues of BB and their exponentials, on which we do not keep track for technical simplicity. We denote this block-encoding by VB,h/2V_{B,h/2}. Similarly, a (1,,0)(1,*,0)-block-encoding of eBhe^{-Bh} can be constructed with Oexp,3:|x|0|x|exhO_{\exp,3}:\ket{x}\ket{0}\rightarrow\ket{x}\ket{e^{-xh}}. We denote it by VB,hV_{B,h}. Furthermore, in Figure 2, if we discard the QFT steps, and replace Oexp,1O_{\exp,1} by Oexp,3O_{\exp,3} and the oracle O1O_{1} by O2O_{2} controlled by an extra counter register, we can construct a (controlled version of) (1,,0)(1,*,0)-block-encoding of eC((j+1/2)h)he^{-C((j+1/2)h)h}, denoted by VC,jV_{C,j}. Notice that the constructions of block-encodings VB,h/2V_{B,h/2}, VB,hV_{B,h} and VC,jV_{C,j} only require 𝒪(1)\mathcal{O}(1) queries to the aforementioned oracles and QFT.

Now we construct the numerical integrator. Mathematically we need to block encode the operator

j=0r1S2((j+1)h,jh)=eBh/2(j=1r1eC((j+1/2)h)heBh)eC(h/2)heBh/2.\prod_{j=0}^{r-1}S_{2}((j+1)h,jh)=e^{-Bh/2}\left(\prod_{j=1}^{r-1}e^{-C((j+1/2)h)h}e^{-Bh}\right)e^{-C(h/2)h}e^{-Bh/2}. (51)

This can be done by the circuit in Figure 3, which can be viewed as a variant of the compression gadget in 4. The idea is to use an extra counter register with log2(2r+1)+1\lceil\log_{2}(2r+1)\rceil+1 qubits for both keeping track of the success/failure of the multiplication and indicating the index of the time step. If all the applications of the block-encodings of the local exponentials are successful, then the value of the counter register at each step suggests the correct time step index, and the final value is reset to be 0. Notice that the block-encoding factor of VB,hV_{B,h} and VC,jV_{C,j} are 11. Then the circuit in Figure 3 gives a (1,,0)(1,*,0)-block-encoding of j=0r1S2((j+1)h,jh)\prod_{j=0}^{r-1}S_{2}((j+1)h,jh), as desired.

\Qcircuit@R=1em @C=1em Counter      & \qw \gateADD \ctrl1 \gateADD \qw \gateADD \ctrl1 \gateADD \qw
Ancilla       \multigate1V_B,h/2 \ctrlo-1 \multigate1V_C,0 \ctrlo-1 \multigate1V_B,h \ctrlo-1 \multigate1V_C,1 \ctrlo-1 \qw
System       \ghostV_B,h/2 \qw \ghostV_C,0 \qw \ghostV_B,h \qw \ghostV_C,1 \qw \qw

Counter       ⋯   \qw \gateADD \ctrl1 \gateADD \qw \gateADD \gate(ADD^†)^2r+1 \qw
Ancilla       ⋯   \multigate1V_B,h \ctrlo-1 \multigate1V_C,j-1 \ctrlo-1 \multigate1V_B,h/2 \ctrlo-1 \qw \qw
System       ⋯   \ghostV_B,h \qw \ghostV_C,j-1 \qw \ghostV_B,h/2 \qw \qw \qw

Figure 3: Quantum circuit for implementing second-order Trotter method. Here the Counter register contains log2(2r+1)+1\lceil\log_{2}(2r+1)\rceil+1 qubits. ADD implements addition by 11 modulo 2log2(2r+1)+12^{\lceil\log_{2}(2r+1)\rceil+1}.

We apply this block-encoding to the input state |u0\ket{u_{0}}, and the final state is

1u0n[N]d(ur)n|n0|nd1|0+|,\frac{1}{\|\vec{u}_{0}\|}\sum_{n\in[N]^{d}}(\vec{u}_{r})_{n}\ket{n_{0}}\cdots\ket{n_{d-1}}\ket{0}+\ket{\perp}, (52)

where

ur=j=0r1S2((j+1)h,jh)u0u(T),\begin{split}\vec{u}_{r}=\prod_{j=0}^{r-1}S_{2}((j+1)h,jh)\vec{u}_{0}\approx\vec{u}(T),\end{split} (53)

and |\ket{\perp} represents the junk state with ancilla register not equal to 0. The final step is to measure the ancilla registers of Equation 52. If all the ancilla registers are 0, then we get a good approximation of |u(T)\ket{u(T)}. The averaged number of repeats after amplitude amplification is 𝒪(u0/u(T))\mathcal{O}(\|\vec{u}_{0}\|/\|\vec{u}(T)\|).

We summarize the overall complexity as follows.

Theorem 12.

Consider solving Equation 29 on discrete grid points (T,n/N)(T,n/N) where n[N]dn\in[N]^{d}. Let u(t,x)u(t,x) denote the solution of the equation Equation 29, and u(t)\vec{u}(t) denote the solution of the spatially discretized Equation 38. Suppose that

  1. 1.

    we are given oracles O1O_{1} and O2O_{2} defined in Equation 20 and Equation 50, and the state preparation oracle Ou:|0|u0O_{u}:\ket{0}\rightarrow\ket{\vec{u}_{0}},

  2. 2.

    u(t,x)u(t,x) is in the Gevrey class GσG^{\sigma} in the sense that u(t,x)u(t,x) is smooth and there exist constants Λ>0\Lambda>0 and σ0\sigma\geq 0 such that

    supj[d],t[0,T],x[0,1]d|xjpu(t,x)|Λp+1(p!)σ.\sup_{j\in[d],t\in[0,T],x\in[0,1]^{d}}|\partial_{x_{j}}^{p}u(t,x)|\leq\Lambda^{p+1}(p!)^{\sigma}. (54)
  3. 3.

    (u(T,n/N))n[N]dg~(T)\|(u(T,n/N))_{n\in[N]^{d}}\|\geq\widetilde{g}(T) for a function g~\widetilde{g}.

  4. 4.

    u(0)/u(T)g(T)\|\vec{u}(0)\|/\|\vec{u}(T)\|\leq g(T) for a function gg.

Then, with second-order operator splitting method for time propagation, an ϵ\epsilon-approximation of |(u(T,n/N))n[N]d\ket{(u(T,n/N))_{n\in[N]^{d}}} can be obtained by choosing

N=𝒪((dlogd+log(Tg~(T))+log(1ϵ))σ).N=\mathcal{O}\left(\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\sigma}\right). (55)

and using

  1. 1.
    𝒪((g(T))3/2T3/2ϵ1/2dα/2(dlogd+log(Tg~(T))+log(1ϵ))ασ/2)\mathcal{O}\left((g(T))^{3/2}\frac{T^{3/2}}{\epsilon^{1/2}}d^{\alpha/2}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\alpha\sigma/2}\right) (56)

    queries to O1O_{1},O2O_{2}, their inverses and controlled versions,

  2. 2.
    𝒪(g(T))\mathcal{O}\left(g(T)\right) (57)

    queries to the state preparation oracle OuO_{u} and its inverse.

  3. 3.
    𝒪((g(T))3/2T3/2ϵ1/2dα/2+1(dlogd+log(Tg~(T))+log(1ϵ))ασ/2log2(dlogd+log(Tg~(T))+log(1ϵ)))\mathcal{O}\left((g(T))^{3/2}\frac{T^{3/2}}{\epsilon^{1/2}}d^{\alpha/2+1}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\alpha\sigma/2}\log^{2}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)\right) (58)

    additional elementary gates.

Proof.

It suffices to bound both spatial and time discretization errors by ϵ\epsilon. The choice of NN directly follows 8. We now count the overall complexity. Each run of the algorithm requires an application of Figure 2 which implements eBh/2e^{-Bh/2} and eC(t)he^{-{C}(t)h} for 𝒪(r)\mathcal{O}(r) times. In each block-encoding of eBh/2e^{-Bh/2} and eC(t)he^{-{C}(t)h}, we need to use 𝒪(1)\mathcal{O}(1) queries to O1O_{1} and O2O_{2}, and 𝒪(dlog2(N))\mathcal{O}(d\log^{2}(N)) additional gates mainly due to the QFT. According to 11, it suffices to choose r=𝒪(g(T)1/2dα/2Nα/2T3/2ϵ1/2)r=\mathcal{O}\left(g(T)^{1/2}d^{\alpha/2}N^{\alpha/2}\frac{T^{3/2}}{\epsilon^{1/2}}\right). Here the extra factor g(T)1/2g(T)^{1/2} is because 11 only bounds the operator norm, and in order to bound the error in the quantum state by ϵ\epsilon, the operator norm error bound needs to be bounded by 𝒪(ϵu(T)/u(0))\mathcal{O}(\epsilon\|\vec{u}(T)\|/\|\vec{u}(0)\|) according to 19. The averaged number of repeats to succeed after amplitude amplification is 𝒪(u(0)/u(T))=𝒪(g(T))\mathcal{O}(\|\vec{u}(0)\|/\|\vec{u}(T)\|)=\mathcal{O}(g(T)). Multiplying these together gives the overall query complexity and additional gates required. Notice that in each run we only need one query to the state preparation oracle, so the overall number of state preparation is only g(T)\sim g(T). ∎

4.4 Time-marching method

Now we consider an alternative algorithm proposed in [18] called time-marching method. The method is designed for general ODE

ddtϕ(t)=A(t)ϕ(t),ϕ(0)=ϕ0\frac{d}{dt}\phi(t)=A(t)\phi(t),\quad\phi(0)=\phi_{0} (59)

with a time-dependent matrix-valued function A(t)A(t), and is a quantum analog of the classical exponential propagation methods. It first divides the time interval [0,T][0,T] into small segments with mesh 0=t0<t1<<tL=T0=t_{0}<t_{1}<\cdots<t_{L}=T and applies the short-time evolution operator sequentially. While naive applications of a sequence of non-unitary operators may incur an exponential overhead in the number of the operators, the time-marching method avoids such overhead by a technique that combines the uniform singular value transformation and the amplitude amplification. We refer to [18] for more details and only roughly summarize the main result here.

Lemma 13 (Theorem 8 of [18]).

Consider the ODE Equation 59. Suppose that we are given the prepare oracle OϕO_{\phi} of |ϕ0\ket{\phi_{0}} such that Oϕ|0=|ϕ0O_{\phi}\ket{0}=\ket{\phi_{0}} and an input model of A(t)A(t), denoted by MATA\text{MAT}_{A}, that simultaneously block encodes A(tk)A(t_{k}^{\prime}) at some refined mesh points tkt_{k}^{\prime}. Then, with the time-marching method, an ϵ\epsilon-approximation of |ϕ(T)\ket{\phi(T)} can be prepared using

𝒪(η2T2Qlog(ηTQ)log(ηTQ/ϵ)loglog(ηTQ/ϵ))\mathcal{O}\left(\eta^{2}T^{2}Q\log(\eta TQ)\frac{\log(\eta TQ/\epsilon)}{\log\log(\eta TQ/\epsilon)}\right) (60)

queries to MATA\text{MAT}_{A} and 𝒪(Q)\mathcal{O}(Q) queries to OϕO_{\phi}, its inverse and controlled version. Here η\eta is the block-encoding factor of MATA\text{MAT}_{A} such that ηA(t)\eta\geq\|A(t)\| for all t[0,T]t\in[0,T], and

Q=ϕ(0)l=1L𝒯etl1tlA(t)𝑑tϕ(T).Q=\frac{\|\phi(0)\|\prod_{l=1}^{L}\left\|\mathcal{T}e^{\int_{t_{l-1}}^{t_{l}}A(t)dt}\right\|}{\|\phi(T)\|}. (61)

Now we discuss the complexity of applying the time-marching method to spatially discretized equation Equation 38. For technical simplicity, we only estimate the query complexities.

Theorem 14.

Consider solving Equation 29 on discrete grid points (T,n/N)(T,n/N) where n[N]dn\in[N]^{d}. Let u(t,x)u(t,x) denote the solution of the equation Equation 29, and u(t)\vec{u}(t) denote the solution of the spatially discretized Equation 38. Suppose that

  1. 1.

    we are given oracles encoding the eigenvalues of BB and the diagonal entries of C(t){C}(t), i.e., O1O_{1} and O2O_{2} defined in Equation 20 and Equation 50, and the state preparation oracle Ou:|0|u0O_{u}:\ket{0}\rightarrow\ket{\vec{u}_{0}},

  2. 2.

    v(t,x)v(t,x) is in the Gevrey class GσG^{\sigma} in the sense that v(t,x)v(t,x) is smooth and there exist constants Λ>0\Lambda>0 and σ0\sigma\geq 0 such that

    supj[d],t[0,T],x[0,1]d|xjpv(t,x)|Λp+1(p!)σ.\sup_{j\in[d],t\in[0,T],x\in[0,1]^{d}}|\partial_{x_{j}}^{p}v(t,x)|\leq\Lambda^{p+1}(p!)^{\sigma}. (62)
  3. 3.

    (v(T,n/N))n[N]dg~(T)\|(v(T,n/N))_{n\in[N]^{d}}\|\geq\widetilde{g}(T) for a function g~\widetilde{g}.

Then, with the time-marching method for time propagation, an ϵ\epsilon-approximation of |(u(T,n/N))n[N]d\ket{(u(T,n/N))_{n\in[N]^{d}}} can be obtained by choosing

N=𝒪((dlogd+log(Tg~(T))+log(1ϵ))σ).N=\mathcal{O}\left(\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\sigma}\right). (63)

and using

  1. 1.
    𝒪~(QT2dα(dlogd+log(Tg~(T))+log(1ϵ))2ασlog(1ϵ))\widetilde{\mathcal{O}}\left(QT^{2}d^{\alpha}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{2\alpha\sigma}\log\left(\frac{1}{\epsilon}\right)\right) (64)

    queries to O1{O}_{1}, O2{O}_{2}, the dd-dimensional QFT circuit, their inverses and controlled versions, where

    Q=u(0)l=1L𝒯etl1tl(BC(t))𝑑tu(T),Q=\frac{\|\vec{u}(0)\|\prod_{l=1}^{L}\left\|\mathcal{T}e^{\int_{t_{l-1}}^{t_{l}}(-B-{C}(t))dt}\right\|}{\|\vec{u}(T)\|}, (65)
  2. 2.
    𝒪(Q)\mathcal{O}\left(Q\right) (66)

    queries to the state preparation oracle OuO_{u} or its inverse.

Proof.

First, the oracle MATA\text{MAT}_{A} mentioned in 13 can be constructed through the given O1{O}_{1} and O2{O}_{2} and additional 𝒪(dlog2(N))\mathcal{O}(d\log^{2}(N)) gates. The idea is as follows. We can first construct the block-encoding of the diagonal matrix DD with O1{O}_{1}, then construct the block-encoding of B=()dD(1)dB=(\mathcal{F})^{\otimes d}D(\mathcal{F}^{-1})^{\otimes d} using QFT. Meanwhile we may construct the simultaneous block-encoding of C(t){C}(t) with O2{O}_{2}. Finally, we can linearly combine these two block-encodings to obtain the desired MATA\text{MAT}_{A} that block encodes A(t)=BC(t)A(t)=-B-{C}(t). According to [19, Lemma 48 & Lemma 52 & Lemma53], such approach requires 𝒪(1)\mathcal{O}(1) queries to O1{O}_{1} and O2{O}_{2}, so we may directly estimate the number of queries to MATA\text{MAT}_{A} as that of queries to O1{O}_{1} and O2{O}_{2}. Note that there are additional 𝒪(dlog2(N))\mathcal{O}(d\log^{2}(N)) gates required to construct MATA\text{MAT}_{A} from O1{O}_{1} and O2{O}_{2}, but we only focus on the query complexities here for technical simplicity.

Now we may directly use 13 to estimate the complexity, and it suffices to write down explicit scalings of the block-encoding factor η\eta in the example of Equation 38. Under the assumption that C(t){C}(t) is uniformly bounded, the spectral norm BC(t)=𝒪(dα/2Nα)\|-B-{C}(t)\|=\mathcal{O}(d^{\alpha/2}N^{\alpha}), so the parameter η=𝒪(dα/2Nα)\eta=\mathcal{O}(d^{\alpha/2}N^{\alpha}). Using the choice of NN estimated in 6, we have

η=𝒪(dα/2(dlogd+log(Tg~(T))+log(1ϵ))ασ).\eta=\mathcal{O}\left(d^{\alpha/2}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\alpha\sigma}\right). (67)

Plugging this parameter back to the scalings in 13 yields the desired estimates. ∎

4.5 Truncated Dyson series method

Now we discuss the query complexity of applying the state-of-the-art generic quantum ODE solvers to our linear fractional reaction-diffusion equation. We consider the method proposed in [8], which is based on the truncated Dyson series. It first expands the solution via truncated Dyson series and encode it into a linear system of equations, then solve it using the optimal quantum linear system algorithm. It works for the most general linear ODE with time dependent coefficient matrix and possible inhomogeneous term. In our case, we are only interested in the homogeneous equation, so here we only summarize their main result for the homogeneous equation Equation 59.

Lemma 15 (Theorem 1 of [8]).

Consider the ODE Equation 59 where the coefficient matrix A(t)A(t) has non-positive logarithmic norm for all tt. Suppose that we are given the prepare oracle OϕO_{\phi} of |ϕ0\ket{\phi_{0}} such that Oϕ|0=|ϕ0O_{\phi}\ket{0}=\ket{\phi_{0}} and an input model of A(t)A(t), denoted by MATA\text{MAT}_{A}, that simultaneously block-encoding A(tk)A(t_{k}^{\prime}) at some refined mesh points tkt_{k}^{\prime}. Then, with the truncated Dyson series method, an ϵ\epsilon-approximation of |ϕ(T)\ket{\phi(T)} can be prepared using

𝒪(maxt[0,T]ϕ(t)ϕ(T)ηTlog(1/ϵ)log(ηT/ϵ))\mathcal{O}\left(\frac{\max_{t\in[0,T]}\|\phi(t)\|}{\|\phi(T)\|}\eta T\log(1/\epsilon)\log(\eta T/\epsilon)\right) (68)

queries to MATA\text{MAT}_{A} and

𝒪(maxt[0,T]ϕ(t)ϕ(T)ηTlog(1/ϵ))\mathcal{O}\left(\frac{\max_{t\in[0,T]}\|\phi(t)\|}{\|\phi(T)\|}\eta T\log(1/\epsilon)\right) (69)

queries to OϕO_{\phi}. Here η\eta is the block-encoding factor of MATA\text{MAT}_{A} such that ηA(t)\eta\geq\|A(t)\| for all t[0,T]t\in[0,T].

Theorem 16.

Consider solving Equation 29 on discrete grid points (T,n/N)(T,n/N) where n[N]dn\in[N]^{d}. Let u(t,x)u(t,x) denote the solution of the equation Equation 29, and u(t)\vec{u}(t) denote the solution of the spatially discretized Equation 38. Suppose that

  1. 1.

    we are given oracles encoding the eigenvalues of BB and the diagonal entries of C(t){C}(t), i.e., O1O_{1} and O2O_{2} defined in Equation 20 and Equation 50, and the state preparation oracle Ou:|0|u0O_{u}:\ket{0}\rightarrow\ket{\vec{u}_{0}},

  2. 2.

    u(t,x)u(t,x) is in the Gevrey class GσG^{\sigma} in the sense that u(t,x)u(t,x) is smooth and there exist constants Λ>0\Lambda>0 and σ0\sigma\geq 0 such that

    supj[d],t[0,T],x[0,1]d|xjpu(t,x)|Λp+1(p!)σ.\sup_{j\in[d],t\in[0,T],x\in[0,1]^{d}}|\partial_{x_{j}}^{p}u(t,x)|\leq\Lambda^{p+1}(p!)^{\sigma}. (70)
  3. 3.

    (u(T,n/N))n[N]dg~(T)\|(u(T,n/N))_{n\in[N]^{d}}\|\geq\widetilde{g}(T) for a function g~\widetilde{g},

  4. 4.

    u(0)/u(T)g(T)\|\vec{u}(0)\|/\|\vec{u}(T)\|\leq g(T) for a function gg.

Then, with the truncated Dyson series method for time propagation, an ϵ\epsilon-approximation of |(u(T,n/N))n[N]d\ket{(u(T,n/N))_{n\in[N]^{d}}} can be obtained by choosing

N=𝒪((dlogd+log(Tg~(T))+log(1ϵ))σ).N=\mathcal{O}\left(\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\sigma}\right). (71)

and using

  1. 1.
    𝒪~(g(T)Tdα/2(dlogd+log(Tg~(T))+log(1ϵ))ασ(log(1ϵ))2)\widetilde{\mathcal{O}}\left(g(T)Td^{\alpha/2}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\alpha\sigma}\left(\log\left(\frac{1}{\epsilon}\right)\right)^{2}\right) (72)

    queries to O1{O}_{1} and O2{O}_{2}, dd-dimensional QFT, their inverses and controlled versions,

  2. 2.
    𝒪(g(T)Tdα/2(dlogd+log(Tg~(T))+log(1ϵ))ασlog(1ϵ))\mathcal{O}\left(g(T)Td^{\alpha/2}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\alpha\sigma}\log\left(\frac{1}{\epsilon}\right)\right) (73)

    queries to the state preparation oracle OuO_{u} or its inverse.

Proof.

As shown in the proof of 14, we may directly estimate the number of queries to Oj{O}_{j} as that of queries to MATA\text{MAT}_{A}, and the block-encoding factor η\eta can be bounded as

η=𝒪(dα/2(dlogd+log(Tg~(T))+log(1ϵ))ασ).\eta=\mathcal{O}\left(d^{\alpha/2}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\alpha\sigma}\right). (74)

Since the coefficient matrix (BC(t))(-B-{C}(t)) is always negative semi-definite, the norm of the solution ϕ(t)\|\phi(t)\| is non-increasing over tt, so

maxt[0,T]u(t)u(T)=u(0)u(T)g(T).\frac{\max_{t\in[0,T]}\|\vec{u}(t)\|}{\|\vec{u}(T)\|}=\frac{\|\vec{u}(0)\|}{\|\vec{u}(T)\|}\leq g(T). (75)

Plugging these parameters back to 15 completes the proof. ∎

4.6 Linear combination of Hamiltonian simulation in the interaction picture

We have discussed and analyzed the second-order operator splitting method, the time-marching method, and the truncated Dyson series method. All the methods have extra polynomial dependence on the dimension dd, which comes from the dependence on the spectral norm of the discrete fractional Laplacian operator BB, although the operator splitting method may partially benefit from its commutator scalings.

In quantum dynamics, a common technique to avoid the explicit dependence on B\|B\| is to simulate the dynamics in the interaction picture. Specifically, if we consider the fractional Schrödinger equation

iddt|ψ=(BC(t))|ψ,i\frac{d}{dt}\ket{\psi}=(-B-{C}(t))\ket{\psi}, (76)

then, by defining the interaction picture Hamiltonian HI(t)=eiBtC(t)eiBtH_{I}(t)=-e^{-iBt}{C}(t)e^{iBt} and |ψI=eiBt|ψ\ket{\psi_{I}}=e^{-iBt}\ket{\psi}, we may obtain the transformed solution |ψI\ket{\psi_{I}} by simulating iddt|ψI=HI(t)|ψIi\frac{d}{dt}\ket{\psi_{I}}=H_{I}(t)\ket{\psi_{I}}. Here HI(t)H_{I}(t) is bounded independently of B\|B\|, and its oscillations depend on B\|B\|. Therefore we may efficiently simulate the interaction picture Hamiltonian using truncated Dyson series method which has linearly dependence on HI\|H_{I}\| but only poly-logarithmically depends on its derivatives. We refer to [30] for more details.

The success of the interaction picture Hamiltonian simulation relies on the fast-forwarded implementation of eiBse^{iBs}, that is, eiBse^{iBs} can be implemented for any real number ss with cost independent of B\|B\| and |s||s|. However, in our reaction-diffusion equation, which can be viewed as the imaginary time evolution of the Schrödinger equation, we cannot directly apply similar technique. This is because eBse^{-Bs} is only fast-forwardable when s0s\geq 0, while the transformation into the analog of interaction picture requires both forward and backward time evolution.

4.6.1 Representation

To take advantage of the interaction picture technique, we can relate the reaction-diffusion equation with the Hamiltonian simulation problem. A recent work [4] shows that any linear ODE can be represented as a linear combination of Hamiltonian simulation. In particular, we may write the evolution operator of our fractional reaction-diffusion equation as

𝒯e0T(B+C(s))𝑑s=1π(1+ξ2)𝒯ei0Tξ(B+C(s))𝑑s𝑑ξ.\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}=\int_{\mathbb{R}}\frac{1}{\pi(1+\xi^{2})}\mathcal{T}e^{-i\int_{0}^{T}\xi(B+{C}(s))ds}d\xi. (77)

Here 𝒯\mathcal{T} is the time-ordering operator. The proof of Equation 77 can be found in [4]. Now we use the interaction picture simulation. Let

U(t)=𝒯ei0tξ(B+C(s))𝑑s,U(t)=\mathcal{T}e^{-i\int_{0}^{t}\xi(B+{C}(s))ds}, (78)

then U(t)U(t) satisfies the time-dependent Hamiltonian simulation problem as

dUdt=i(ξB+ξC(t))U(t),U(0)=I.\frac{dU}{dt}=-i(\xi B+\xi{C}(t))U(t),\quad U(0)=I. (79)

Let

UI(t)=eiξBtU(t).U_{I}(t)=e^{i\xi Bt}U(t). (80)

We may compute that

dUIdt=iξBeiξBtU(t)ieiξBt(ξB+ξC(t))U(t)=ieiξBtξC(t)eiξBtUI(t).\begin{split}\frac{dU_{I}}{dt}&=i\xi Be^{i\xi Bt}U(t)-ie^{i\xi Bt}(\xi B+\xi{C}(t))U(t)\\ &=-ie^{i\xi Bt}\xi{C}(t)e^{-i\xi Bt}U_{I}(t).\end{split} (81)

Define

HI(t;ξ)=eiξBtξC(t)eiξBt.H_{I}(t;\xi)=e^{i\xi Bt}\xi{C}(t)e^{-i\xi Bt}. (82)

Then

UI(t)=𝒯ei0tHI(s;ξ)𝑑s,U_{I}(t)=\mathcal{T}e^{-i\int_{0}^{t}H_{I}(s;\xi)ds}, (83)

and we may write Equation 77 as

𝒯e0T(B+C(s))𝑑s=1π(1+ξ2)eiξBT𝒯ei0THI(s;ξ)𝑑s𝑑ξ.\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}=\int_{\mathbb{R}}\frac{1}{\pi(1+\xi^{2})}e^{-i\xi BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi)ds}d\xi. (84)

4.6.2 Numerical quadrature

We can truncate Equation 84 over a finite interval [Ξ,Ξ][-\Xi,\Xi] and write it as

𝒯e0T(B+C(s))𝑑sΞΞ1π(1+ξ2)eiξBT𝒯ei0THI(s;ξ)𝑑s𝑑ξ.\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}\approx\int_{-\Xi}^{\Xi}\frac{1}{\pi(1+\xi^{2})}e^{-i\xi BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi)ds}d\xi. (85)

The resulting integral can be discretized using standard numerical quadrature. Here we use the simplest Riemann sum formula with MM grid points. For 0jM10\leq j\leq M-1, let ξj=Ξ+2jΞ/M\xi_{j}=-\Xi+2j\Xi/M and wj=1π(1+ξj2)2ΞMw_{j}=\frac{1}{\pi(1+\xi_{j}^{2})}\frac{2\Xi}{M}. Then

𝒯e0T(B+C(s))𝑑sj=0M1wjeiξjBT𝒯ei0THI(s;ξj)𝑑s.\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}\approx\sum_{j=0}^{M-1}w_{j}e^{-i\xi_{j}BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi_{j})ds}. (86)

The discretization error can be bounded as follows, and its proof is given in Appendix F.

Lemma 17.

We have

𝒯e0T(B+C(s))𝑑sj=0M1wjeiξjBT𝒯ei0THI(s;ξj)𝑑s2πΞ+2Ξ2πM(1+T(B+maxC)).\left\|\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}-\sum_{j=0}^{M-1}w_{j}e^{-i\xi_{j}BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi_{j})ds}\right\|\leq\frac{2}{\pi\Xi}+\frac{2\Xi^{2}}{\pi M}\left(1+T(\|B\|+\max\|{C}\|)\right). (87)

In order to bound the discretization error by 𝒪(ϵ)\mathcal{O}(\epsilon), it suffices to choose

Ξ=𝒪(1ϵ),M=𝒪(TBϵ3).\Xi=\mathcal{O}\left(\frac{1}{\epsilon}\right),\quad M=\mathcal{O}\left(\frac{T\|B\|}{\epsilon^{3}}\right). (88)

4.6.3 Implementation and complexity

Suppose that we are given the same input oracles as in previous algorithms, encoding the eigenvalues of BB and the diagonal entries of C(t){C}(t), i.e., O1O_{1} and O2O_{2} defined in Equation 20 and Equation 50. The sketch of the algorithm is to simulate the interaction picture Hamiltonian using truncated Dyson series method [30] and then apply the LCU technique to compute Equation 86.

We start with the HAM-T encoding of the matrix HIH_{I}. For a fixed time step size hh and an integer such that [mh,(m+1)h][0,T][mh,(m+1)h]\subset[0,T], we may first construct the HAM-T encoding of C{C} from its sparse input oracle following the approach in [19]. The resulting HAM-T encoding satisfies

0|aHAM-TC,m|0a=l=0MH1|ll|C(mh+lh/MH)αC.\bra{0}_{a}\text{HAM-T}_{C,m}\ket{0}_{a}=\sum_{l=0}^{M_{H}-1}\ket{l}\bra{l}\otimes\frac{{C}(mh+lh/M_{H})}{\alpha_{C}}. (89)

Here αC\alpha_{C} is the upper bound of C\|{C}\|, and MHM_{H} is the number of grid points used in the truncated Dyson series method. By appending an ancilla register |j\ket{j} encoding the index for ξ\xi and applying the controlled rotation |j|0|j(ξjΞ|0+1ξj2Ξ2|1)\ket{j}\ket{0}\rightarrow\ket{j}\left(\frac{\xi_{j}}{\Xi}\ket{0}+\sqrt{1-\frac{\xi_{j}^{2}}{\Xi^{2}}}\ket{1}\right) with the help of an additional ancilla qubit, we obtain the HAM-T encoding of ξC\xi{C} such that

0|aHAM-TξC,m|0a=j=0M1l=0MH1|jj||ll|ξjC(mh+lh/MH)ΞαC.\bra{0}_{a}\text{HAM-T}_{\xi C,m}\ket{0}_{a}=\sum_{j=0}^{M-1}\sum_{l=0}^{M_{H}-1}\ket{j}\bra{j}\otimes\ket{l}\bra{l}\otimes\frac{\xi_{j}{C}(mh+lh/M_{H})}{\Xi\alpha_{C}}. (90)

To construct the block-encoding of eiξBte^{-i\xi Bt}, we write

eiξjB(mh+lh/MH)=ei(Ξ+2jΞ/M)B(mh+lh/MH)=eiΞBmh(ei(2Ξ/M)Bmh)j(eiΞBh/MH)l((ei(2Ξ/M)Bh/MH)j)l.\begin{split}e^{-i\xi_{j}B(mh+lh/M_{H})}&=e^{-i(-\Xi+2j\Xi/M)B(mh+lh/M_{H})}\\ &=e^{i\Xi Bmh}\left(e^{-i(2\Xi/M)Bmh}\right)^{j}\left(e^{i\Xi Bh/M_{H}}\right)^{l}\left(\left(e^{-i(2\Xi/M)Bh/M_{H}}\right)^{j}\right)^{l}.\end{split} (91)

Noticing that the matrix BB can be diagonalized with QFT and the diagonal components are given through the oracle O1O_{1}, we can implement eiBse^{-iBs} for any real number ss fast-forwardly with 𝒪(1)\mathcal{O}(1) uses of O1O_{1}, controlled phase gate and QFT. Then, according to the binary encoding of 0jM10\leq j\leq M-1, we can use the controlled version of a total of log2(M)\log_{2}(M) operators ei(2Ξ/M)Bmhe^{-i(2\Xi/M)Bmh}, ei2(2Ξ/M)Bmhe^{-i2(2\Xi/M)Bmh}, ei22(2Ξ/M)Bmhe^{-i2^{2}(2\Xi/M)Bmh}, \cdots, ei2log2(M)(2Ξ/M)Bmhe^{-i2^{\log_{2}(M)}(2\Xi/M)Bmh} to implement the controlled evolution j=0M1|jj|(ei(2Ξ/M)Bmh)j\sum_{j=0}^{M-1}\ket{j}\bra{j}\otimes\left(e^{-i(2\Xi/M)Bmh}\right)^{j}. Similarly, we can construct the evolution l=0MH1|ll|(eiΞBh/MH)l\sum_{l=0}^{M_{H}-1}\ket{l}\bra{l}\otimes\left(e^{i\Xi Bh/M_{H}}\right)^{l} and j=0M1l=0MH1|jj||ll|((ei(2Ξ/M)Bh/MH)j)l\sum_{j=0}^{M-1}\sum_{l=0}^{M_{H}-1}\ket{j}\bra{j}\otimes\ket{l}\bra{l}\otimes\left(\left(e^{-i(2\Xi/M)Bh/M_{H}}\right)^{j}\right)^{l} with logarithmic cost as well. Multiplying them together gives the select oracle

SELB,m=j=0M1l=0MH1|jj||ll|eiξjB(mh+lh/MH).\text{SEL}_{B,m}=\sum_{j=0}^{M-1}\sum_{l=0}^{M_{H}-1}\ket{j}\bra{j}\otimes\ket{l}\bra{l}\otimes e^{-i\xi_{j}B(mh+lh/M_{H})}. (92)

Then

HAM-THI,m:=(InaSELB,m)HAM-TξC,m(InaSELB,m)\text{HAM-T}_{H_{I},m}:=(I_{n_{a}}\otimes\text{SEL}_{B,m}^{\dagger})\text{HAM-T}_{\xi C,m}(I_{n_{a}}\otimes\text{SEL}_{B,m}) (93)

gives the HAM-T encoding of HIH_{I} that

0|aHAM-THI,m|0a=j=0M1l=0MH1|jj||ll|HI(mh+lh/MH;ξj)ΞαC.\bra{0}_{a}\text{HAM-T}_{H_{I},m}\ket{0}_{a}=\sum_{j=0}^{M-1}\sum_{l=0}^{M_{H}-1}\ket{j}\bra{j}\otimes\ket{l}\bra{l}\otimes\frac{H_{I}(mh+lh/M_{H};\xi_{j})}{\Xi\alpha_{C}}. (94)

The HAM-THI,m\text{HAM-T}_{H_{I},m} serves as the Hamiltonian input oracle in the truncated Dyson series method. Therefore, the method in [30] gives the select oracle

SELW=j=0M1|jj|Wj,\text{SEL}_{W}=\sum_{j=0}^{M-1}\ket{j}\bra{j}\otimes W_{j}, (95)

where WjW_{j} is an approximation of UI(T;ξj)U_{I}(T;\xi_{j}). We then multiply it on the left by j=0M1|jj|eiξjBT\sum_{j=0}^{M-1}\ket{j}\bra{j}\otimes e^{-i\xi_{j}BT} (which again can be efficiently constructed according to the binary representation of jj) and obtain

SELU=j=0M1|jj|Uj.\text{SEL}_{U}=\sum_{j=0}^{M-1}\ket{j}\bra{j}\otimes U_{j}. (96)

Here UjU_{j} is an approximation of eiξjBT𝒯ei0THI(s;ξj)𝑑se^{-i\xi_{j}BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi_{j})ds}. The SELU\text{SEL}_{U} operator serves as the select oracle in the LCU subroutine. Hence the formula Equation 86 can be directly implemented by the LCU technique (2).

The overall complexity of the algorithm is given as follows.

Theorem 18.

Consider the spatially discretized equation Equation 38. Suppose that

  1. 1.

    we are given oracles encoding the eigenvalues of BB and the diagonal entries of C(t){C}(t), i.e., O1O_{1} and O2O_{2} defined in Equation 20 and Equation 50, and the state preparation oracle Ou:|0|u0O_{u}:\ket{0}\rightarrow\ket{\vec{u}_{0}},

  2. 2.

    u(t,x)u(t,x) is in the Gevrey class GσG^{\sigma} in the sense that u(t,x)u(t,x) is smooth and there exist constants Λ>0\Lambda>0 and σ0\sigma\geq 0 such that

    supj[d],t[0,T],x[0,1]d|xjpu(t,x)|Λp+1(p!)σ,\sup_{j\in[d],t\in[0,T],x\in[0,1]^{d}}|\partial_{x_{j}}^{p}u(t,x)|\leq\Lambda^{p+1}(p!)^{\sigma}, (97)
  3. 3.

    (u(T,n/N))n[N]dg~(T)\|(u(T,n/N))_{n\in[N]^{d}}\|\geq\widetilde{g}(T) for a function g~\widetilde{g},

  4. 4.

    u(0)/u(T)g(T)\|\vec{u}(0)\|/\|\vec{u}(T)\|\leq g(T) for a function gg.

Then, the linear combination of Hamiltonian simulation in the interaction picture can prepare an ϵ\epsilon-approximation of |(u(T,n/N))n[N]d\ket{(u(T,n/N))_{n\in[N]^{d}}} using

  1. 1.

    queries to O1O_{1} and O2O_{2} a total number of times

    𝒪(g(T)2Tϵlog3(g(T)Tdlogdϵlog(1g~(T)))),\mathcal{O}\left(g(T)^{2}\frac{T}{\epsilon}\log^{3}\left(\frac{g(T)Td\log d}{\epsilon}\log\left(\frac{1}{\widetilde{g}(T)}\right)\right)\right), (98)
  2. 2.

    queries to OuO_{u} for 𝒪(g(T))\mathcal{O}(g(T)) times,

  3. 3.

    additional elementary gates for

    𝒪(g(T)2Tϵlog(g(T)Tϵ)(dlog2(d+log(Tg~(T))+log(1ϵ))+log(g(T)Tϵ))).\mathcal{O}\left(g(T)^{2}\frac{T}{\epsilon}\log\left(\frac{g(T)T}{\epsilon}\right)\left(d\log^{2}\left(d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)+\log\left(\frac{g(T)T}{\epsilon}\right)\right)\right). (99)
Proof.

As discussed before, the oracle HAM-TξC,m\text{HAM-T}_{\xi C,m} can be implemented with 𝒪(1)\mathcal{O}(1) queries to O2O_{2} and 𝒪(dlog(N))\mathcal{O}(d\log(N)) additional gates [19, Lemma 48], and the construction of SELB,m\text{SEL}_{B,m} requires 𝒪(log(M)log(MH))\mathcal{O}(\log(M)\log(M_{H})) O1O_{1} and QFT (which requires 𝒪(dlog2(N))\mathcal{O}(d\log^{2}(N)) gates). So each HAM-THI,m\text{HAM-T}_{H_{I},m} can be constructed with 𝒪(log(M)log(MH))\mathcal{O}(\log(M)\log(M_{H})) queries to O1O_{1}, O2O_{2} and 𝒪(dlog2(N))\mathcal{O}(d\log^{2}(N)) additional gates. According to [30, Corolllary 4], by choosing

MH=𝒪(TΞαCϵ(BΞ2αC+ΞmaxC+Ξ2αC2))=𝒪(Tϵ(BΞ+maxC)),\begin{split}M_{H}&=\mathcal{O}\left(\frac{T}{\Xi\alpha_{C}\epsilon^{\prime}}\left(\|B\|\Xi^{2}\alpha_{C}+\Xi\max\|{C}^{\prime}\|+\Xi^{2}\alpha_{C}^{2}\right)\right)\\ &=\mathcal{O}\left(\frac{T}{\epsilon^{\prime}}\left(\|B\|\Xi+\max\|{C}^{\prime}\|\right)\right),\end{split} (100)

we may implement the select oracle SELW=j=0M1j||jWj\text{SEL}_{W}=\sum_{j=0}^{M-1}\bra{j}\ket{j}\otimes W_{j} such that

Wj𝒯ei0THI(s;ξj)𝑑sϵ\left\|W_{j}-\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi_{j})ds}\right\|\leq\epsilon^{\prime} (101)

with

𝒪(ΞαCTlog(ΞαCT/ϵ)loglog(ΞαCT/ϵ))=𝒪(ΞTlog(ΞT/ϵ)loglog(ΞT/ϵ))\mathcal{O}\left(\Xi\alpha_{C}T\frac{\log(\Xi\alpha_{C}T/\epsilon^{\prime})}{\log\log(\Xi\alpha_{C}T/\epsilon^{\prime})}\right)=\mathcal{O}\left(\Xi T\frac{\log(\Xi T/\epsilon^{\prime})}{\log\log(\Xi T/\epsilon^{\prime})}\right) (102)

queries to HAM-THI,m\text{HAM-T}_{H_{I},m} and

𝒪(ΞαCTlog(ΞαCT/ϵ)loglog(ΞαCT/ϵ)(dlog(N)+log(MH)))=𝒪(ΞTlog(ΞT/ϵ)loglog(ΞT/ϵ)(dlog(N)+log(MH)))\begin{split}&\quad\mathcal{O}\left(\Xi\alpha_{C}T\frac{\log(\Xi\alpha_{C}T/\epsilon^{\prime})}{\log\log(\Xi\alpha_{C}T/\epsilon^{\prime})}\left(d\log(N)+\log(M_{H})\right)\right)\\ &=\mathcal{O}\left(\Xi T\frac{\log(\Xi T/\epsilon^{\prime})}{\log\log(\Xi T/\epsilon^{\prime})}\left(d\log(N)+\log(M_{H})\right)\right)\end{split} (103)

additional gates. Hence the SELU\text{SEL}_{U} can be implemented with asymptotically the same cost, and, taking into account the cost of constructing HAM-THI,m\text{HAM-T}_{H_{I},m}, this step needs

𝒪(ΞTlog(ΞT/ϵ)loglog(ΞT/ϵ)log(M)log(MH))\mathcal{O}\left(\Xi T\frac{\log(\Xi T/\epsilon^{\prime})}{\log\log(\Xi T/\epsilon^{\prime})}\log(M)\log(M_{H})\right) (104)

queries to O1O_{1} and O2O_{2}, and

𝒪(ΞTlog(ΞT/ϵ)loglog(ΞT/ϵ)(dlog2(N)+log(MH)))\mathcal{O}\left(\Xi T\frac{\log(\Xi T/\epsilon^{\prime})}{\log\log(\Xi T/\epsilon^{\prime})}\left(d\log^{2}(N)+\log(M_{H})\right)\right) (105)

additional gates.

The LCU algorithm requires a single application of the select oracle and two applications of the prepare oracle for 1w1j=0M1wj|j\frac{1}{\sqrt{\|w\|_{1}}}\sum_{j=0}^{M-1}\sqrt{w_{j}}\ket{j}. Noticing that ww represents the discretized Cauchy distribution, we can implement this prepare oracle with 𝒪(log(M))\mathcal{O}(\log(M)) gates [20] and w1=𝒪(1)\|w\|_{1}=\mathcal{O}(1). The output of the LCU step can be written as 1wu0|0v~+|\frac{1}{\|w\|\|\vec{u}_{0}\|}\ket{0}\widetilde{v}+\ket{\perp}, where

v~=j=0M1wjUju0.\widetilde{v}=\sum_{j=0}^{M-1}w_{j}U_{j}\vec{u}_{0}. (106)

Using the inequality a/ab/b2ab/a\|\vec{a}/\|a\|-\vec{b}/\|b\|\|\leq 2\|\vec{a}-\vec{b}\|/\|\vec{a}\|, we can bound the error in the quantum state as

|u(T)|v~2u(T)u(T)|v~2u0u(T)𝒯e0T(B+C(s))𝑑sj=0M1wjeiξjBT𝒯ei0THI(s;ξj)𝑑s+2u0u(T)j=0M1wjeiξjBT𝒯ei0THI(s;ξj)𝑑sj=0M1wjUj2u0u(T)𝒯e0T(B+C(s))𝑑sj=0M1wjeiξjBT𝒯ei0THI(s;ξj)𝑑s+2u0u(T)w1ϵ.\begin{split}\|\ket{\vec{u}(T)}-\ket{\widetilde{v}}\|&\leq\frac{2}{\|\vec{u}(T)\|}\|\vec{u}(T)-\ket{\widetilde{v}}\|\\ &\leq\frac{2\|\vec{u}_{0}\|}{\|\vec{u}(T)\|}\left\|\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}-\sum_{j=0}^{M-1}w_{j}e^{-i\xi_{j}BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi_{j})ds}\right\|\\ &\quad\quad+\frac{2\|\vec{u}_{0}\|}{\|\vec{u}(T)\|}\left\|\sum_{j=0}^{M-1}w_{j}e^{-i\xi_{j}BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi_{j})ds}-\sum_{j=0}^{M-1}w_{j}U_{j}\right\|\\ &\leq\frac{2\|\vec{u}_{0}\|}{\|\vec{u}(T)\|}\left\|\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}-\sum_{j=0}^{M-1}w_{j}e^{-i\xi_{j}BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi_{j})ds}\right\|+\frac{2\|\vec{u}_{0}\|}{\|\vec{u}(T)\|}\|w\|_{1}\epsilon^{\prime}.\end{split} (107)

To bound the error by ϵ\epsilon, according to 17, it suffices to choose

Ξ=𝒪(u0u(T)1ϵ),M=𝒪((u0u(T))3TBϵ3),ϵ=𝒪(u(T)u0ϵ).\Xi=\mathcal{O}\left(\frac{\|\vec{u}_{0}\|}{\|\vec{u}(T)\|}\frac{1}{\epsilon}\right),\quad M=\mathcal{O}\left(\left(\frac{\|\vec{u}_{0}\|}{\|\vec{u}(T)\|}\right)^{3}\frac{T\|B\|}{\epsilon^{3}}\right),\quad\epsilon^{\prime}=\mathcal{O}\left(\frac{\|\vec{u}(T)\|}{\|\vec{u}_{0}\|}\epsilon\right). (108)

With these and by B=𝒪(dα/2Nα)\|B\|=\mathcal{O}(d^{\alpha/2}N^{\alpha}) and NN given in 8, in each run of the algorithm we need queries to OuO_{u} for 𝒪(1)\mathcal{O}(1) times, queries to O1O_{1} and O2O_{2} for

𝒪(ΞTlog(ΞT/ϵ)loglog(ΞT/ϵ)log(M)log(MH))=𝒪(g(T)Tϵlog3(g(T)Tdlogdϵlog(1g~(T)))),\begin{split}&\quad\mathcal{O}\left(\Xi T\frac{\log(\Xi T/\epsilon^{\prime})}{\log\log(\Xi T/\epsilon^{\prime})}\log(M)\log(M_{H})\right)\\ &=\mathcal{O}\left(g(T)\frac{T}{\epsilon}\log^{3}\left(\frac{g(T)Td\log d}{\epsilon}\log\left(\frac{1}{\widetilde{g}(T)}\right)\right)\right),\end{split} (109)

and additional gates for a total number of

𝒪(log(M)+ΞTlog(ΞT/ϵ)loglog(ΞT/ϵ)(dlog2(N)+log(MH)))=𝒪(g(T)Tϵlog(g(T)Tϵ)(dlog2(d+log(Tg~(T))+log(1ϵ))+log(g(T)Tϵ))).\begin{split}&\quad\mathcal{O}\left(\log(M)+\Xi T\frac{\log(\Xi T/\epsilon^{\prime})}{\log\log(\Xi T/\epsilon^{\prime})}\left(d\log^{2}(N)+\log(M_{H})\right)\right)\\ &=\mathcal{O}\left(g(T)\frac{T}{\epsilon}\log\left(\frac{g(T)T}{\epsilon}\right)\left(d\log^{2}\left(d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)+\log\left(\frac{g(T)T}{\epsilon}\right)\right)\right).\end{split} (110)

With amplitude amplification, the average number of repeats to get a success is 𝒪(wu0/v~)=𝒪(g(T))\mathcal{O}(\|w\|\|\vec{u}_{0}\|/\|\widetilde{v}\|)=\mathcal{O}(g(T)), so the overall complexity should be multiplied by this factor. ∎

5 Nonlinear equations

We now discuss the full non-linear fractional reaction-diffusion equation as in Equation 1. For simplicity we only consider time-independent potential function c(t,x)c(t)c(t,x)\equiv c(t). After spatial discretization, Equation 1 becomes a system of nonlinear ODEs with quadratic nonlinear term, which can be tackled by the Carleman linearization technique [29]. Existing algorithms based on Carleman linearization assume the sparsity of the coefficient matrices in the nonlinear ODEs. This facilities the construction of the Carleman matrix, which is a direct sum of matrices in different dimensions. However, in the fractional reaction-diffusion equation, the coefficient matrices are unavoidably dense, so we will discuss a block-encoding implementation of the Carleman linearization technique.

5.1 Spatial discretization

We use the same spatial discretization as in the linear case. Let u(t)\vec{u}(t) be a NdN^{d}-dimensional vector approximating the exact solution u(t,x)u(t,x) at equi-distant grid points (n1/N.,nd/N)(n_{1}/N.\cdots,n_{d}/N) where nj[N]n_{j}\in[N]. The spatially discretized equation can be written as

ddtu=F1u+F2u2.\frac{d}{dt}\vec{u}=F_{1}\vec{u}+F_{2}\vec{u}^{\otimes 2}. (111)

Here u2=uu\vec{u}^{\otimes 2}=\vec{u}\otimes\vec{u} is a N2dN^{2d}-dimensional vector. F1=BC+aIF_{1}=-B-C+aI. F2F_{2} is a Nd×N2dN^{d}\times N^{2d} dimensional matrix that maps u2\vec{u}^{\otimes 2} to au-a\vec{u}, i.e., each row of F2F_{2} only has one non-zero entry to be a-a at its ((j1)Nd+j)((j-1)N^{d}+j)-th column (for the jj-th row).

5.2 Carleman linearization

The idea of Carleman linearization for Equation 111 is to convert the nonlinear ODE to an equivalent infinite-dimensional linear ODE. Specifically, for any positive integer mm, the tensor product um\vec{u}^{\otimes m} satisfies the ODE

ddt(um)=Ammum+Am+1mu(m+1),\frac{d}{dt}(\vec{u}^{\otimes m})=A_{m}^{m}\vec{u}^{\otimes m}+A_{m+1}^{m}\vec{u}^{\otimes(m+1)}, (112)

where (II represents the identity matrix of dimension Nd×NdN^{d}\times N^{d})

Amm=j=1mI(j1)F1I(mj),A_{m}^{m}=\sum_{j=1}^{m}I^{\otimes(j-1)}\otimes F_{1}\otimes I^{\otimes(m-j)}, (113)
Am+1m=j=1mI(j1)F2I(mj).A_{m+1}^{m}=\sum_{j=1}^{m}I^{\otimes(j-1)}\otimes F_{2}\otimes I^{\otimes(m-j)}. (114)

So the infinite-dimensional vector [u;u2;u3;][\vec{u};\vec{u}^{\otimes 2};\vec{u}^{\otimes 3};\cdots] satisfies a system of homogeneous linear ODE with coefficient matrix

(A11A21A22A32AMMAM+1M).\left(\begin{array}[]{cccccc}A_{1}^{1}&A_{2}^{1}&&&&\\ &A_{2}^{2}&A_{3}^{2}&&&\\ &&\ddots&\ddots&&\\ &&&A_{M}^{M}&A_{M+1}^{M}&\\ &&&&\ddots&\ddots\\ \end{array}\right). (115)

To implement the Carleman linearization numerically, we truncate the infinite-dimensional ODE at a specific order MM and consider the ODE

ddtw=Aw.\frac{d}{dt}\vec{w}=A\vec{w}. (116)

Here w=[w1;w2;;wM]\vec{w}=[\vec{w}_{1};\vec{w}_{2};\cdots;\vec{w}_{M}] and each wj\vec{w}_{j} is an NjdN^{jd}-dimensional vector expected to approximate uj\vec{u}^{\otimes j}. The matrix AA can be represented as

A=(A11A21A22A32AM1M1AMM1AMM).A=\left(\begin{array}[]{ccccc}A_{1}^{1}&A_{2}^{1}&&&\\ &A_{2}^{2}&A_{3}^{2}&&\\ &&\ddots&\ddots&\\ &&&A_{M-1}^{M-1}&A_{M}^{M-1}\\ &&&&A_{M}^{M}\\ \end{array}\right). (117)

When the linear part F1F_{1} has negative eigenvalue and the nonlinear part F2F_{2} is relatively small compared to the decay rate of the linear part, w1\vec{w}_{1} can be a good approximation of u\vec{u}, and the truncation order MM is only logarithmic in the precision. [1] establishes rigorous analysis for the convergence of such procedure for integer-order reaction-diffusion equation, and we expect the same convergence for the fractional order as well.

5.3 Block-encoded implementation

The coefficient matrix AA is expressed as a partitioned matrix with blocks in different dimension. When all the blocks are sparse, as assumed in existing quantum algorithm based on Carleman linearization, the entire matrix AA is still sparse and one can directly implement it from sparse input model. This is also true for (integer-order) reaction-diffusion equation when α=2\alpha=2. However, when 0<α<20<\alpha<2, while the off-diagonal blocks Am+1mA_{m+1}^{m} are still sparse, the diagonal blocks AmmA_{m}^{m} are unavoidably dense. Though the block-encoding of each block AmmA_{m}^{m} is still construable, it is somewhat cumbersome to assemble them together to the block-encoding of the entire AA.

Inspired by the technique of [29] for state preparation, we further enlarge the dimension of the linearized ODE. The purpose is to make the resulting enlarged coefficient matrix expressed as a partitioned matrix with blocks in the same dimension, and meanwhile a subspace of the enlarged solution is still the solution of the original ODE. Specifically, let us consider the ODE

ddty=A~y.\frac{d}{dt}\vec{y}=\widetilde{A}\vec{y}. (118)

Here y=[y1;;yM]\vec{y}=[\vec{y}_{1};\cdots;\vec{y}_{M}] and each yj\vec{y}_{j} is an NMdN^{Md}-dimensional vector. The initial value y(0)\vec{y}(0) is chosen to be [|0M1|u(0);|0M2|u(0)2;;|u(0)M][\ket{0}^{\otimes M-1}\otimes\ket{\vec{u}(0)};\ket{0}^{\otimes M-2}\otimes\ket{\vec{u}(0)}^{\otimes 2};\cdots;\ket{\vec{u}(0)}^{\otimes M}] where |0=(0,,0,1)T\ket{0}=(0,\cdots,0,1)^{T} in NdN^{d} dimension. The coefficient matrix is

A~=(A~11A~21A~22A~32A~M1M1A~MM1A~MM).\widetilde{A}=\left(\begin{array}[]{ccccc}\widetilde{A}_{1}^{1}&\widetilde{A}_{2}^{1}&&&\\ &\widetilde{A}_{2}^{2}&\widetilde{A}_{3}^{2}&&\\ &&\ddots&\ddots&\\ &&&\widetilde{A}_{M-1}^{M-1}&\widetilde{A}_{M}^{M-1}\\ &&&&\widetilde{A}_{M}^{M}\\ \end{array}\right). (119)

Here each A~mm\widetilde{A}_{m}^{m} and A~m+1m\widetilde{A}_{m+1}^{m} is an NMdN^{Md}-dimensional square matrix. A~mm=I(Mm)Amm\widetilde{A}_{m}^{m}=I^{\otimes(M-m)}\otimes A_{m}^{m}. A~m+1m\widetilde{A}_{m+1}^{m} contains Am+1mA_{m+1}^{m} at its most bottom right and otherwise 0.

We now show that the non-zero entries of y\vec{y} exactly form w\vec{w}. To this end, for each ym\vec{y}_{m}, we write it as [ym,1;;ym,N(Mm)d][\vec{y}_{m,1};\cdots;\vec{y}_{m,N^{(M-m)d}}], where each ym,j\vec{y}_{m,j} is an NmdN^{md}-dimensional vector. By definition of y(0)\vec{y}(0), we have that for every mm, all but the last component ym,N(Mm)d(0)\vec{y}_{m,N^{(M-m)d}}(0) of ym(0)\vec{y}_{m}(0) are zero. Furthermore, by the definition of A~\widetilde{A}, the variables ym,j\vec{y}_{m,j} for jN(Mm)dj\neq N^{(M-m)d} does not interact with other variables outside. Therefore, ym,j(t)\vec{y}_{m,j}(t) is always 0 for all tt, mm and jN(Mm)dj\neq N^{(M-m)d}. Now, if we only focus on the ODEs that ym,N(Mm)d\vec{y}_{m,N^{(M-m)d}}’s satisfy, it is exactly the original ODE Equation 116 with the same initial condition. So we have [y1,N(M1)d(t);y2,N(M2)d(t);;yM,1(t)]=w(t)[\vec{y}_{1,N^{(M-1)d}}(t);\vec{y}_{2,N^{(M-2)d}}(t);\cdots;\vec{y}_{M,1}(t)]=\vec{w}(t) for all tt. This implies that, instead of solving Equation 116, we can focus on its equivalent formalism Equation 118.

Equation 118 can be solved by standard quantum algorithm for linear ODEs. For example, we can use the method based on truncated Dyson series [8] discussed in the previous section. To use this method, we need the state preparation for y(0)\vec{y}(0) and the block-encoding of A~\widetilde{A}. The state preparation oracle can be constructed in a similar manner as in [29]. To construct the block-encoding of A~\widetilde{A}, we decompose it as A~=D~+R~\widetilde{A}=\widetilde{D}+\widetilde{R}, where

D~=(A~11A~22A~M1M1A~MM),R~=(0A~210A~320A~MM10).\widetilde{D}=\left(\begin{array}[]{ccccc}\widetilde{A}_{1}^{1}&&&&\\ &\widetilde{A}_{2}^{2}&&&\\ &&\ddots&&\\ &&&\widetilde{A}_{M-1}^{M-1}&\\ &&&&\widetilde{A}_{M}^{M}\\ \end{array}\right),\quad\widetilde{R}=\left(\begin{array}[]{ccccc}0&\widetilde{A}_{2}^{1}&&&\\ &0&\widetilde{A}_{3}^{2}&&\\ &&\ddots&\ddots&\\ &&&0&\widetilde{A}_{M}^{M-1}\\ &&&&0\\ \end{array}\right). (120)

Notice that R~\widetilde{R} is a 𝒪(M)\mathcal{O}(M)-sparse matrix. According to [19, Lemma 48], we may implement a block-encoding of \mathcal{R} with 𝒪(1)\mathcal{O}(1) query complexity and 𝒪(M)\mathcal{O}(M) block-encoding factor. For D~\widetilde{D}, we start with the block-encoding of F1F_{1}, which can be constructed from the linear combination of B,CB,C and aIaI. According to 2, the block-encoding factor is 𝒪(dαN2α)\mathcal{O}(d^{\alpha}N^{2\alpha}), corresponding to the spectral norm of BB, and the query complexity for block encoding F1F_{1} is 𝒪(1)\mathcal{O}(1). Denote this block-encoding by UF1U_{F_{1}}. Then a block-encoding of I(j1)F1I(mj)I^{\otimes(j-1)}\otimes F_{1}\otimes I^{\otimes(m-j)} can be constructed by applying UF1U_{F_{1}} on the correct register. Since AmmA_{m}^{m} is the summation of I(j1)F1I(mj)I^{\otimes(j-1)}\otimes F_{1}\otimes I^{\otimes(m-j)}, according to 2, we may further construct the block-encoding of AmmA_{m}^{m}, denoted by UAmmU_{A_{m}^{m}}, with 𝒪(MdαN2α)\mathcal{O}(Md^{\alpha}N^{2\alpha}) block-encoding factor. Here we choose the block-encoding factor for all AmmA_{m}^{m} to be the same and corresponds to the worst case AMMA_{M}^{M} with most summation terms in order to facilitate later construction for bigger matrix. Then, for each mm, a block-encoding of A~mm\widetilde{A}_{m}^{m}, denoted by UA~mmU_{\widetilde{A}_{m}^{m}}, can be constructed with a single use of UAmmU_{A_{m}^{m}}, and the block-encoding of D~\widetilde{D} is given by

m=0M1|mm|UA~mm.\sum_{m=0}^{M-1}\ket{m}\bra{m}\otimes U_{\widetilde{A}_{m}^{m}}. (121)

This can be implemented by controlled version of UAmmU_{A_{m}^{m}} and requires 𝒪(M)\mathcal{O}(M) query complexity, and the block-encoding factor is 𝒪(MdαN2α)\mathcal{O}(Md^{\alpha}N^{2\alpha}). The final step is to use 2 again and, from the block-encoding of D~\widetilde{D} and R~\widetilde{R}, we may construct the desired block-encoding of A~\widetilde{A} with 𝒪(M)\mathcal{O}(M) query complexity and 𝒪(MdαN2α)\mathcal{O}(Md^{\alpha}N^{2\alpha}) block-encoding factor. Notice that both query complexity and the block-encoding factor does not involve exponential dependence on dd.

With the state preparation for y(0)\vec{y}(0) and the block-encoding of A~\widetilde{A}, we can solve Equation 118 efficiently using truncated Dyson series method [8]. Notice that the method requires A~\widetilde{A} to have non-positive logarithmic norm, which can be guaranteed if all the eigenvalues of F1F_{1} are negative and F2F_{2} is bounded, which corresponds to the standard assumption on the boundedness of the nonlinearity (namely the condition RD<1R_{D}<1 in [1]).

6 Conclusion

In this paper, we study efficient quantum algorithms for linear and nonlinear fractional reaction-diffusion equations. For linear equations, we improve and analyze the complexity of four different methods: second-order Trotter formula, time-marching method, truncated Dyson series method, and the linear combination of Hamiltonian simulation with the interaction picture formalism (LCHS-IP). Among all the methods, the LCHS-IP method achieves best scaling in the spatial dimension and thus is most suiable for high-dimensional linear fractional reaction-diffusion equations. For nonlinear equations, we generalize the quantum Carleman linearization algorithm to the case with block-encoding input oracle, making the algorithm applicable to dense coefficient matrices.

A natural extension of this work is to design better quantum algorithm for linear reaction-diffusion equations with near-optimal scalings in all parameters. A desired algorithm is expected to simultaneously scale poly-logarithmically in dimension (as LCHS-IP), poly-logarithmically in precision (as time-marching and truncated Dyson series methods), linearly in evolution time (as truncated Dyson series and LCHS-IP), and have low state preparation cost (as Trotter, time-marching and LCHS-IP). Among all the methods being considered in this paper, the LCHS-IP method is the closest one as it only misses the poly-logarithmic dependence on precision, so it is interesting to explore whether the LCHS-IP method can be further exponentially improved in terms of precision. Another possibility is to use higher-order product formula, whose asymptotic scaling tends to be T1+o(1)/ϵo(1)T^{1+o(1)}/\epsilon^{o(1)}. However, as proved in [14], there might be extra exponential overhead in Trotter errors when we deal with non-unitary dynamics. Therefore a tailored design of the product formula and an improved error analysis would be necessary. Furthermore, to obtain a poly-logarithmic dependence on dimension, we may need to take advantage of the vector-norm scaling of the product formula, which states that the Trotter errors may be independent of the spectral norm of the Hamiltonians if the quantum states are within a more regular subspace with better smoothness assumption. Such a vector-norm scaling has been proved for first- and second-order Trotter applied to Hamiltonian simulation in [2], and it remains open to establish similar error bounds for higher-order product formula and the cases beyond Hamiltonian simulation. This is our ongoing work.

For non-linear equations, a natural next step is to establish a rigorous analysis with detailed computational costs. We expect the analysis presented in [1] to work with suitable modifications. However, the complexity estimate in [1] still depends polynomially on the dimension dd. It is interesting to explore whether the quantum Carleman algorithm can also avoid such a polynomial overhead with the help of tighter error bounds, or new techniques are necessary to achieve this task.

Throughout this paper, we focus on the spectral fractional Laplacian with periodic boundary condition, which facilitates its spatial discretization and quantum implementation of its time evolution through the QFT circuit. Our future work will be focusing on the fractional Laplacian operator with Riesz definition and exploring the efficiency of quantum algorithms.

Acknowledgments

DA acknowledges the support by the Department of Defense through the Hartree Postdoctoral Fellowship at QuICS, and the seed grant at the NSF Quantum Leap Challenge Institute for Robust Quantum Simulation (QLCI grant OMA-2120757). KT gratefully acknowledges the support by the National Science Foundation under the grants DMS-2231533 and DMS-2008568.

References

  • [1] D. An, D. Fang, S. Jordan, J.-P. Liu, G. H. Low, and J. Wang. Efficient quantum algorithm for nonlinear reaction-diffusion equations and energy estimation, 2022.
  • [2] D. An, D. Fang, and L. Lin. Time-dependent unbounded hamiltonian simulation with vector norm scaling. Quantum, 5:459, 2021.
  • [3] D. An and L. Lin. Quantum linear system solver based on time-optimal adiabatic quantum computing and quantum approximate optimization algorithm. ACM Transactions on Quantum Computing, 3(2):1–28, mar 2022.
  • [4] D. An, J.-P. Liu, and L. Lin. Linear combination of hamiltonian simulation for non-unitary dynamics with optimal state preparation cost, 2023.
  • [5] D. An, J.-P. Liu, D. Wang, and Q. Zhao. A theory of quantum differential equation solvers: limitations and fast-forwarding, 2023.
  • [6] D. W. Berry. High-order quantum algorithm for solving linear differential equations. Journal of Physics A: Mathematical and Theoretical, 47(10):105301, 2014.
  • [7] D. W. Berry, A. M. Childs, A. Ostrander, and G. Wang. Quantum algorithm for linear differential equations with exponentially improved dependence on precision. Communications in Mathematical Physics, 356(3):1057–1081, 2017.
  • [8] D. W. Berry and P. C. S. Costa. Quantum algorithm for time-dependent differential equations using dyson series, 2022.
  • [9] C. C. Cantrell R.S. Spatial ecology via reaction–diffusion equations. John Wiley & Sons Ltd., Chichester, 2003, 2003.
  • [10] A. M. Childs, R. Kothari, and R. D. Somma. Quantum algorithm for systems of linear equations with exponentially improved dependence on precision. SIAM J. Comput., 46:1920–1950, 2017.
  • [11] A. M. Childs, J. Leng, T. Li, J.-P. Liu, and C. Zhang. Quantum simulation of real-space dynamics. Quantum, 6:860, nov 2022.
  • [12] A. M. Childs and J.-P. Liu. Quantum spectral methods for differential equations. Communications in Mathematical Physics, 375(2):1427–1457, 2020.
  • [13] A. M. Childs, J.-P. Liu, and A. Ostrander. High-precision quantum algorithms for partial differential equations. Quantum, 5:574, nov 2021.
  • [14] A. M. Childs, Y. Su, M. C. Tran, N. Wiebe, and S. Zhu. Theory of trotter error with commutator scaling. Phys. Rev. X, 11:011020, 2021.
  • [15] A. M. Childs and N. Wiebe. Hamiltonian simulation using linear combinations of unitary operations. Quantum Information and Computation, 12:901–924, 2012.
  • [16] P. C. Costa, D. An, Y. R. Sanders, Y. Su, R. Babbush, and D. W. Berry. Optimal scaling quantum linear-systems solver via discrete adiabatic theorem. PRX Quantum, 3:040303, Oct 2022.
  • [17] R. F. Global solutions of reaction–diffusion systems, in: Lecture notes in mathematics. Lecture Notes in Mathematics, 1072, 1984.
  • [18] D. Fang, L. Lin, and Y. Tong. Time-marching based quantum solvers for time-dependent linear differential equations, 2022.
  • [19] A. Gilyén, Y. Su, G. H. Low, and N. Wiebe. Quantum singular value transformation and beyond: exponential improvements for quantum matrix arithmetics. In Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing, pages 193–204, 2019.
  • [20] L. Grover and T. Rudolph. Creating superpositions that correspond to efficiently integrable probability distributions, 2002.
  • [21] A. W. Harrow, A. Hassidim, and S. Lloyd. Quantum algorithm for linear systems of equations. Phys. Rev. Lett., 103:150502, 2009.
  • [22] S. J. Shock waves and reaction–diffusion equations, 2nd edn, in: Grundlehren der mathematischen wissenschaften [fundamental principles of mathematical. Lecture Notes in Mathematics, 258, 1994.
  • [23] T. Jahnke and C. Lubich. Error bounds for exponential operator splittings. BIT Numerical Mathematics, 40(4):735–744, 2000.
  • [24] M. J.D. Mathematical Biology. I,II, 3rd edn, in: Interdisciplinary Applied Mathematics, volume 17, 18. Springer-Verlag, New York, 2002.
  • [25] S. Jin, N. Liu, and Y. Yu. Quantum simulation of partial differential equations via schrodingerisation, 2022.
  • [26] H. Krovi. Improved quantum algorithms for linear and nonlinear differential equations, 2022.
  • [27] L. Lin and Y. Tong. Optimal polynomial based quantum eigenstate filtering with application to solving quantum linear systems. Quantum, 4:361, nov 2020.
  • [28] A. Lischke, G. Pang, M. Gulian, F. Song, C. Glusa, X. Zheng, Z. Mao, W. Cai, M. M. Meerschaert, M. Ainsworth, and G. E. Karniadakis. What is the fractional laplacian? a comparative review with new results. Journal of Computational Physics, 404:109009, 2020.
  • [29] J.-P. Liu, H. Ø. Kolden, H. K. Krovi, N. F. Loureiro, K. Trivisa, and A. M. Childs. Efficient quantum algorithm for dissipative nonlinear differential equations. Proceedings of the National Academy of Sciences, 118(35), 2021.
  • [30] G. H. Low and N. Wiebe. Hamiltonian simulation in the interaction picture. arXiv:1805.00675, 2019.
  • [31] C. H. Neubert M. Demography and dispersal: Calculation and sensitivity analysis of invasion speed for structured populations. Ecology, 81, 2000.
  • [32] C. H. Neubert M. From diffusion to anomalous diffusion: A century after einstein’s brownian motion. Chaos, 15:26–103, 2005.
  • [33] B. N.F. Reaction–Diffusion Equations and Their Applications to Biology,. Academic Press Inc. [Harcourt Brace Jovanovich Publishers], London, 1986, 1986.
  • [34] M. A. Nielsen and I. Chuang. Quantum computation and quantum information, 2000.
  • [35] G. P. The theory and applications of reaction–diffusion equations. Oxford Applied Mathematics and Computing Science Series, 2003.
  • [36] Y. Subaşı, R. D. Somma, and D. Orsucci. Quantum algorithms for systems of linear equations inspired by adiabatic quantum computing. Phys. Rev. Lett., 122:060504, 2019.

Appendix A Technical lemmas

Lemma 19.

Let a\vec{a} and b\vec{b} be two non-zero vectors, possibly unnormalized. Then

aabb2aba.\left\|\frac{\vec{a}}{\|\vec{a}\|}-\frac{\vec{b}}{\|\vec{b}\|}\right\|\leq\frac{2\|\vec{a}-\vec{b}\|}{\|\vec{a}\|}. (122)
Lemma 20.

Suppose A(t)A(t) is a matrix-valued continuous function and the operator S(t,s)S(t,s) solves the differential equation

ddtS(t,s)=A(t)S(t,s),S(s,s)=I.\frac{d}{dt}S(t,s)=A(t)S(t,s),\quad S(s,s)=I. (123)

Then,

  1. 1.

    for any matrix-valued continuous function R(t)R(t), the solution of the differential equation

    ddtS~(t,0)=A(t)S~(t,0)+R(t),S~(0,0)=I\frac{d}{dt}\widetilde{S}(t,0)=A(t)\widetilde{S}(t,0)+R(t),\quad\widetilde{S}(0,0)=I (124)

    can be represented as

    S~(t,0)=S(t,0)+0tS(t,s)R(s)𝑑s.\widetilde{S}(t,0)=S(t,0)+\int_{0}^{t}S(t,s)R(s)ds. (125)
  2. 2.

    for any vector-valued continuous function r(t)\vec{r}(t), the solution of the differential equation

    ddtψ(t)=A(t)ψ(t)+r(t),ψ(0)=ψ0\frac{d}{dt}\vec{\psi}(t)=A(t)\vec{\psi}(t)+\vec{r}(t),\quad\vec{\psi}(0)=\vec{\psi}_{0} (126)

    can be represented as

    ψ(t)=S(t,0)ψ0+0tS(t,s)r(s)𝑑s.\vec{\psi}(t)=S(t,0)\vec{\psi}_{0}+\int_{0}^{t}S(t,s)\vec{r}(s)ds. (127)

Appendix B Proof of 5

Proof.

For any function ff defined on [0,1]d[0,1]^{d}, its Fourier coefficient is defined to be

f^k=[0,1]df(x)e2πi(k0x0++kd1xd1)𝑑x.\hat{f}_{k}=\int_{[0,1]^{d}}f(x)e^{-2\pi i(k_{0}x_{0}+\cdots+k_{d-1}x_{d-1})}dx. (128)

Using integration by parts for pp times, we may obtain

|f^k|maxjxjpf(x)L1(2πk)p.|\hat{f}_{k}|\leq\frac{\max_{j}\|\partial_{x_{j}}^{p}f(x)\|_{L^{1}}}{(2\pi\|k\|_{\infty})^{p}}. (129)

As a result, for each fixed mm,

|jdu^m+jNu^i(m)|l=1j=l|u^i(m)+jN|l=1((l+1)dld)maxjxjpu0(x)L1(π(2l1)N)p.\begin{split}\left|\sum_{j\in\mathbb{Z}^{d}}\hat{u}_{m+jN}-\hat{u}_{i(m)}\right|&\leq\sum_{l=1}^{\infty}\sum_{\|j\|_{\infty}=l}|\hat{u}_{i(m)+jN}|\\ &\leq\sum_{l=1}^{\infty}((l+1)^{d}-l^{d})\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}}{(\pi(2l-1)N)^{p}}.\end{split} (130)

Suppose that pd+2p\geq d+2. Using the inequality that 2l1l+12l-1\geq l+1 for all l2l\geq 2, we have

|jdu^m+jNu^i(m)|(2d1+l=2(l+1)dld(2l1)p)maxjxjpu0(x)L1πpNp(2d1+l=21(2l1)pd)maxjxjpu0(x)L1πpNp(2d1+1dx(2x1)pd)maxjxjpu0(x)L1πpNp=(2d1+12(pd1))maxjxjpu0(x)L1πpNpmaxjxjpu0(x)L1(π/2)pNp.\begin{split}\left|\sum_{j\in\mathbb{Z}^{d}}\hat{u}_{m+jN}-\hat{u}_{i(m)}\right|&\leq\left(2^{d}-1+\sum_{l=2}^{\infty}\frac{(l+1)^{d}-l^{d}}{(2l-1)^{p}}\right)\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}}{\pi^{p}N^{p}}\\ &\leq\left(2^{d}-1+\sum_{l=2}^{\infty}\frac{1}{(2l-1)^{p-d}}\right)\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}}{\pi^{p}N^{p}}\\ &\leq\left(2^{d}-1+\int_{1}^{\infty}\frac{dx}{(2x-1)^{p-d}}\right)\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}}{\pi^{p}N^{p}}\\ &=\left(2^{d}-1+\frac{1}{2(p-d-1)}\right)\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}}{\pi^{p}N^{p}}\\ &\leq\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}}{(\pi/2)^{p}N^{p}}.\end{split} (131)

Therefore,

(1)du0Nd/2m[N]du^i(m)|m0|md1=Nd/2m[N]d(jdu^m+jNu^i(m))|m0|md1maxjxjpu0(x)L1(π/2)pNpd.\begin{split}&\quad\left\|(\mathcal{F}^{-1})^{\otimes d}\vec{u}_{0}-N^{d/2}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &=N^{d/2}\left\|\sum_{m\in[N]^{d}}\left(\sum_{j\in\mathbb{Z}^{d}}\hat{u}_{m+jN}-\hat{u}_{i(m)}\right)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &\leq\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}}{(\pi/2)^{p}N^{p-d}}.\end{split} (132)

Similarly,

(1)du(T,x)Nd/2m[N]du^i(m)e(2πi(m))αT|m0|md1maxjxjpu(T,x)L1(π/2)pNpd.\left\|(\mathcal{F}^{-1})^{\otimes d}\vec{u}(T,x)-N^{d/2}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}e^{-(2\pi\|i(m)\|)^{\alpha}T}\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\leq\frac{\max_{j}\|\partial_{x_{j}}^{p}u(T,x)\|_{L^{1}}}{(\pi/2)^{p}N^{p-d}}. (133)

Let ~=()dII\widetilde{\mathcal{F}}=(\mathcal{F})^{\otimes d}\otimes I\otimes I and UU denote the composition of O1O_{1}, Oexp,1O_{\exp,1} and c-RR specified in Figure 2, then

(Id0|3)~U~1|u0|031u0u(T)(Id0|3)~U~1|u0|03(Id0|3)~UNd/2u0m[N]du^i(m)|m0|md1|03+(Id0|3)~(Nd/2u0m[N]du^i(m)e(2πi(m))αT|m0|md1|03+|)1u0m[N]du(T,m/N)|m0|md1(1)d|u0Nd/2u0m[N]du^i(m)|m0|md1+dNd/2u0m[N]du^i(m)e(2πi(m))αT|m0|md11u0m[N]du(T,m/N)|m0|md11u0maxjxjpu0(x)L1+maxjxjpu(T,x)L1(π/2)pNpd.\begin{split}&\quad\left\|(I^{\otimes d}\otimes\bra{0}^{\otimes 3})\widetilde{\mathcal{F}}U\widetilde{\mathcal{F}}^{-1}\ket{u_{0}}\ket{0}^{\otimes 3}-\frac{1}{\|\vec{u}_{0}\|}\vec{u}(T)\right\|\\ &\leq\left\|(I^{\otimes d}\otimes\bra{0}^{\otimes 3})\widetilde{\mathcal{F}}U\widetilde{\mathcal{F}}^{-1}\ket{u_{0}}\ket{0}^{\otimes 3}-(I^{\otimes d}\otimes\bra{0}^{\otimes 3})\widetilde{\mathcal{F}}U\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{0}^{\otimes 3}\right\|\\ &\quad+\left\|(I^{\otimes d}\otimes\bra{0}^{\otimes 3})\widetilde{\mathcal{F}}\left(\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}e^{-(2\pi\|i(m)\|)^{\alpha}T}\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{0}^{\otimes 3}+\ket{\perp}\right)\right.\\ &\quad\quad\left.-\frac{1}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}u(T,m/N)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &\leq\left\|(\mathcal{F}^{-1})^{\otimes d}\ket{u_{0}}-\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &\quad+\left\|\mathcal{F}^{\otimes d}\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}e^{-(2\pi\|i(m)\|)^{\alpha}T}\ket{m_{0}}\cdots\ket{m_{d-1}}-\frac{1}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}u(T,m/N)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &\leq\frac{1}{\|\vec{u}_{0}\|}\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}+\max_{j}\|\partial_{x_{j}}^{p}u(T,x)\|_{L^{1}}}{(\pi/2)^{p}N^{p-d}}.\end{split} (134)

By 19, the overall 22-norm error of the output quantum state after successful measurement can be bounded by

4u0maxj,t{0,T}xjpu(t,x)L1(π/2)pNpdu0u(T)=4maxj,t{0,T}xjpu(t,x)L1u(T)(π/2)pNpd.\frac{4}{\|\vec{u}_{0}\|}\frac{\max_{j,t\in\left\{0,T\right\}}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{(\pi/2)^{p}N^{p-d}}\frac{\|\vec{u}_{0}\|}{\|\vec{u}(T)\|}=\frac{4\max_{j,t\in\left\{0,T\right\}}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\|\vec{u}(T)\|(\pi/2)^{p}N^{p-d}}. (135)

The claimed complexity can be shown by noticing that in each run of Figure 2, we need 𝒪(1)\mathcal{O}(1) queries to the aforementioned oracles, and 𝒪(dlog2(N))\mathcal{O}(d\log^{2}(N)) gates due to QFT and controlled rotations, and under amplitude amplification, the averaged repeats to get a success is 𝒪(u0/u(T))\mathcal{O}(\|\vec{u}_{0}\|/\|\vec{u}(T)\|). ∎

Appendix C Bounding the spatial discretization errors

Here we present the detailed proofs of the spatial discretization results (67 and 8).

Proof of 6.

Let ue(t)=(u(t,n/N))n[N]d\vec{u}_{e}(t)=(u(t,n/N))_{n\in[N]^{d}} denote the exact solution evaluated at discrete grid points. Using Equation 29, for any n[N]dn\in[N]^{d},

tu(t,n/N)=(Δ)α/2u(t,n/N)c(t,n/N)u(t,n/N)=(Bue)nc(t,n/N)u(t,n/N)+(Bue)n(Δ)α/2u(t,n/N).\begin{split}\partial_{t}u(t,n/N)&=-(-\Delta)^{\alpha/2}u(t,n/N)-c(t,n/N)u(t,n/N)\\ &=\left(-B\vec{u}_{e}\right)_{n}-c(t,n/N)u(t,n/N)+\left(B\vec{u}_{e}\right)_{n}-(-\Delta)^{\alpha/2}u(t,n/N).\end{split} (136)

Therefore the vector ue(t)\vec{u}_{e}(t) solves the differential equation

ddtue=BueC(t)ue+r(t),\frac{d}{dt}\vec{u}_{e}=-B\vec{u}_{e}-C(t)\vec{u}_{e}+\vec{r}(t), (137)

where

(r(t))n=(Bue)n(Δ)α/2u(t,n/N).\left(\vec{r}(t)\right)_{n}=\left(B\vec{u}_{e}\right)_{n}-(-\Delta)^{\alpha/2}u(t,n/N). (138)

Therefore, the equations ue\vec{u}_{e} satisfies can be viewed as a perturbation (by r\vec{r}) of the equations u\vec{u} satisfies. By 20, we have

ue(t)=u(t)+0t𝒯est(BC(τ))𝑑τr(s)𝑑s,\vec{u}_{e}(t)=\vec{u}(t)+\int_{0}^{t}\mathcal{T}e^{\int_{s}^{t}(-B-C(\tau))d\tau}\vec{r}(s)ds, (139)

and thus

ue(T)u(T)Tmaxt[0,T]r(t).\|\vec{u}_{e}(T)-\vec{u}(T)\|\leq T\max_{t\in[0,T]}\|\vec{r}(t)\|. (140)

It remains to bound r(t)\|\vec{r}(t)\|, which can be done similarly to the proof of 5. Suppose that the Fourier series of u(t,x)u(t,x) is

u(t,x)=kdu^k(t)e2πi(k0x0++kd1xd1),u(t,x)=\sum_{k\in\mathbb{Z}^{d}}\hat{u}_{k}(t)e^{2\pi i(k_{0}x_{0}+\cdots+k_{d-1}x_{d-1})}, (141)

where

u^k(t)=[0,1]du(t,x)e2πi(k0x0++kd1xd1)𝑑x.\hat{u}_{k}(t)=\int_{[0,1]^{d}}u(t,x)e^{-2\pi i(k_{0}x_{0}+\cdots+k_{d-1}x_{d-1})}dx. (142)

By the same reasoning of Equation 132,

(1)due(t)Nd/2m[N]du^i(m)(t)|m0|md1maxjxjpu(t,x)L1(π/2)pNpd.\left\|(\mathcal{F}^{-1})^{\otimes d}\vec{u}_{e}(t)-N^{d/2}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}(t)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\leq\frac{\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{(\pi/2)^{p}N^{p-d}}. (143)

Noticing that DπαNαdα/2\|D\|\leq\pi^{\alpha}N^{\alpha}d^{\alpha/2}, we have

D(1)due(t)Nd/2m[N]d2απαi(m)αu^i(m)(t)|m0|md12pdα/2maxjxjpu(t,x)L1πpαNpdα.\left\|D(\mathcal{F}^{-1})^{\otimes d}\vec{u}_{e}(t)-N^{d/2}\sum_{m\in[N]^{d}}2^{\alpha}\pi^{\alpha}\|i(m)\|^{\alpha}\hat{u}_{i(m)}(t)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\leq\frac{2^{p}d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-d-\alpha}}. (144)

Again using the derivation of Equation 132 and noting that (Δ)α/2u(t,x)=kd2απαkαu^k(t)e2πi(k0x0++kd1xd1)(-\Delta)^{\alpha/2}u(t,x)=\sum_{k\in\mathbb{Z}^{d}}2^{\alpha}\pi^{\alpha}\|k\|^{\alpha}\hat{u}_{k}(t)e^{2\pi i(k_{0}x_{0}+\cdots+k_{d-1}x_{d-1})}, we have

(1)d((Δ)α/2u(t,n/N))n[N]dNd/2m[N]d2απαi(m)αu^i(m)(t)|m0|md1=Nd/2m[N]d(jd2απαm+jNαu^m+jN(t)2απαi(m)αu^i(m)(t))|m0|md1.\begin{split}&\quad\left\|(\mathcal{F}^{-1})^{\otimes d}\left((-\Delta)^{\alpha/2}u(t,n/N)\right)_{n\in[N]^{d}}-N^{d/2}\sum_{m\in[N]^{d}}2^{\alpha}\pi^{\alpha}\|i(m)\|^{\alpha}\hat{u}_{i(m)}(t)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &=N^{d/2}\left\|\sum_{m\in[N]^{d}}\left(\sum_{j\in\mathbb{Z}^{d}}2^{\alpha}\pi^{\alpha}\|m+jN\|^{\alpha}\hat{u}_{m+jN}(t)-2^{\alpha}\pi^{\alpha}\|i(m)\|^{\alpha}\hat{u}_{i(m)}(t)\right)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|.\end{split} (145)

Using Equation 129, for each fixed mm, we have

|jd2απαm+jNαu^m+jN(t)2απαi(m)αu^i(m)(t)|l=1j=l2απαi(m)+jNα|u^i(m)+jN(t)|l=1j=l2απαi(m)+jNαmaxjxjpu(t,x)L1(2πi(m)+jN)pl=1((l+1)dld)πα(2l+1)αNαdα/2maxjxjpu(t,x)L1(π(2l1)N)p\begin{split}&\quad\left|\sum_{j\in\mathbb{Z}^{d}}2^{\alpha}\pi^{\alpha}\|m+jN\|^{\alpha}\hat{u}_{m+jN}(t)-2^{\alpha}\pi^{\alpha}\|i(m)\|^{\alpha}\hat{u}_{i(m)}(t)\right|\\ &\leq\sum_{l=1}^{\infty}\sum_{\|j\|_{\infty}=l}2^{\alpha}\pi^{\alpha}\|i(m)+jN\|^{\alpha}|\hat{u}_{i(m)+jN}(t)|\\ &\leq\sum_{l=1}^{\infty}\sum_{\|j\|_{\infty}=l}2^{\alpha}\pi^{\alpha}\|i(m)+jN\|^{\alpha}\frac{\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{(2\pi\|i(m)+jN\|_{\infty})^{p}}\\ &\leq\sum_{l=1}^{\infty}((l+1)^{d}-l^{d})\pi^{\alpha}(2l+1)^{\alpha}N^{\alpha}d^{\alpha/2}\frac{\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{(\pi(2l-1)N)^{p}}\end{split} (146)

Therefore, for any pd+α+2p\geq d+\alpha+2,

|jd2απαm+jNαu^m+jN(t)2απαi(m)αu^i(m)(t)|(l=1((l+1)dld)(2l+1)α(2l1)p)dα/2maxjxjpu(t,x)L1πpαNpα((2d1)3α+l=2((l+1)dld)(2l+1)α(2l1)p)dα/2maxjxjpu(t,x)L1πpαNpα((2d1)3α+2αl=21(2l1)pdα)dα/2maxjxjpu(t,x)L1πpαNpα((2d1)3α+2α12(pdα1))dα/2maxjxjpu(t,x)L1πpαNpα2d3αdα/2maxjxjpu(t,x)L1πpαNpα.\begin{split}&\quad\left|\sum_{j\in\mathbb{Z}^{d}}2^{\alpha}\pi^{\alpha}\|m+jN\|^{\alpha}\hat{u}_{m+jN}(t)-2^{\alpha}\pi^{\alpha}\|i(m)\|^{\alpha}\hat{u}_{i(m)}(t)\right|\\ &\leq\left(\sum_{l=1}^{\infty}\frac{((l+1)^{d}-l^{d})(2l+1)^{\alpha}}{(2l-1)^{p}}\right)\frac{d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-\alpha}}\\ &\leq\left((2^{d}-1)3^{\alpha}+\sum_{l=2}^{\infty}\frac{((l+1)^{d}-l^{d})(2l+1)^{\alpha}}{(2l-1)^{p}}\right)\frac{d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-\alpha}}\\ &\leq\left((2^{d}-1)3^{\alpha}+2^{\alpha}\sum_{l=2}^{\infty}\frac{1}{(2l-1)^{p-d-\alpha}}\right)\frac{d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-\alpha}}\\ &\leq\left((2^{d}-1)3^{\alpha}+2^{\alpha}\frac{1}{2(p-d-\alpha-1)}\right)\frac{d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-\alpha}}\\ &\leq\frac{2^{d}3^{\alpha}d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-\alpha}}.\end{split} (147)

As a result,

(1)d((Δ)α/2u(t,n/N))n[N]dNd/2m[N]d2απαi(m)αu^i(m)(t)|m0|md1=Nd/2m[N]d(jd2απαm+jNαu^m+jN(t)2απαi(m)αu^i(m)(t))|m0|md12d3αdα/2maxjxjpu(t,x)L1πpαNpdα.\begin{split}&\quad\left\|(\mathcal{F}^{-1})^{\otimes d}\left((-\Delta)^{\alpha/2}u(t,n/N)\right)_{n\in[N]^{d}}-N^{d/2}\sum_{m\in[N]^{d}}2^{\alpha}\pi^{\alpha}\|i(m)\|^{\alpha}\hat{u}_{i(m)}(t)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &=N^{d/2}\left\|\sum_{m\in[N]^{d}}\left(\sum_{j\in\mathbb{Z}^{d}}2^{\alpha}\pi^{\alpha}\|m+jN\|^{\alpha}\hat{u}_{m+jN}(t)-2^{\alpha}\pi^{\alpha}\|i(m)\|^{\alpha}\hat{u}_{i(m)}(t)\right)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &\leq\frac{2^{d}3^{\alpha}d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-d-\alpha}}.\end{split} (148)

Combining Equation 144 and Equation 148, we have

D(1)due(t)(1)d((Δ)α/2u(t,n/N))n[N]d2p+1dα/2maxjxjpu(t,x)L1πpαNpdα,\left\|D(\mathcal{F}^{-1})^{\otimes d}\vec{u}_{e}(t)-(\mathcal{F}^{-1})^{\otimes d}\left((-\Delta)^{\alpha/2}u(t,n/N)\right)_{n\in[N]^{d}}\right\|\leq\frac{2^{p+1}d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-d-\alpha}}, (149)

and

r(t)2p+1dα/2maxjxjpu(t,x)L1πpαNpdα.\|r(t)\|\leq\frac{2^{p+1}d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-d-\alpha}}. (150)

Plugging this back into Equation 140 and we obtain

ue(T)u(T)T2p+1dα/2maxt,jxjpu(t,x)L1πpαNpdα.\|\vec{u}_{e}(T)-\vec{u}(T)\|\leq T\frac{2^{p+1}d^{\alpha/2}\max_{t,j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-d-\alpha}}. (151)

Proof of 7.

According to 6, the spatial discretization error can be bounded by

(u(T,n/N))n[N]du(T)T2p+1dα/2maxt,jxjpu(t,x)L1πpαNpdα2Λπαdα/2T(2Λ/π)p(p!)σNpdα21+σπα+σ/2Λdα/2Tpσ/2(2Λ/π)p(p/e)pσNpdα\begin{split}\|(u(T,n/N))_{n\in[N]^{d}}-\vec{u}(T)\|&\leq T\frac{2^{p+1}d^{\alpha/2}\max_{t,j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-d-\alpha}}\\ &\leq 2\Lambda\pi^{\alpha}d^{\alpha/2}T\frac{(2\Lambda/\pi)^{p}(p!)^{\sigma}}{N^{p-d-\alpha}}\\ &\leq 2^{1+\sigma}\pi^{\alpha+\sigma/2}\Lambda d^{\alpha/2}T\frac{p^{\sigma/2}(2\Lambda/\pi)^{p}(p/e)^{p\sigma}}{N^{p-d-\alpha}}\end{split} (152)

where in the second line we use p!2πp(p/e)pe1/(12p)2πp(p/e)pp!\leq\sqrt{2\pi p}(p/e)^{p}e^{1/(12p)}\leq 2\sqrt{\pi p}(p/e)^{p}. We choose p=pp=\lfloor p_{*}\rfloor where

p=(π2Λ)1/σN1/σ.p_{*}=\left(\frac{\pi}{2\Lambda}\right)^{1/\sigma}N^{1/\sigma}. (153)

We remark that here we need pd+α+2p_{*}\geq d+\alpha+2 and thus N(2Λ/π)(d+α+2)σN\geq(2\Lambda/\pi)(d+\alpha+2)^{\sigma}. Then,

(u(T,n/N))n[N]du(T)22+σπα+σ/2Λdα/2Tpσ/2(2Λ/π)p(p/e)pσNpdα=23/2+σπα+σ/2+1/2Λ1/2Tdα/2Nd+α+1/2exp(σ(π2Λ)1/σN1/σ).\begin{split}\|(u(T,n/N))_{n\in[N]^{d}}-\vec{u}(T)\|&\leq 2^{2+\sigma}\pi^{\alpha+\sigma/2}\Lambda d^{\alpha/2}T\frac{p_{*}^{\sigma/2}(2\Lambda/\pi)^{p_{*}}(p_{*}/e)^{p_{*}\sigma}}{N^{p_{*}-d-\alpha}}\\ &=2^{3/2+\sigma}\pi^{\alpha+\sigma/2+1/2}\Lambda^{1/2}Td^{\alpha/2}N^{d+\alpha+1/2}\exp\left(-\sigma\left(\frac{\pi}{2\Lambda}\right)^{1/\sigma}N^{1/\sigma}\right).\end{split} (154)

By the Taylor expansion ex=k1k!xke^{x}=\sum_{k}\frac{1}{k!}x^{k} and thus ex1k!xke^{x}\geq\frac{1}{k!}x^{k} for x0x\geq 0, we have

exp(σ2(π2Λ)1/σN1/σ)1σ(d+α+1/2)!(σ2(π2Λ)1/σ)σ(d+α+1/2)Nd+α+1/2(σ2σ(d+α+1/2)(π2Λ)1/σ)σ(d+α+1/2)Nd+α+1/2.\begin{split}\exp\left(\frac{\sigma}{2}\left(\frac{\pi}{2\Lambda}\right)^{1/\sigma}N^{1/\sigma}\right)&\geq\frac{1}{\lceil\sigma(d+\alpha+1/2)\rceil!}\left(\frac{\sigma}{2}\left(\frac{\pi}{2\Lambda}\right)^{1/\sigma}\right)^{\lceil\sigma(d+\alpha+1/2)\rceil}N^{d+\alpha+1/2}\\ &\geq\left(\frac{\sigma}{2\lceil\sigma(d+\alpha+1/2)\rceil}\left(\frac{\pi}{2\Lambda}\right)^{1/\sigma}\right)^{\lceil\sigma(d+\alpha+1/2)\rceil}N^{d+\alpha+1/2}.\end{split} (155)

Therefore

(u(T,n/N))n[N]du(T)23/2+σπα+σ/2+1/2Λ1/2Tdα/2(2σ(d+α+1/2)σ(2Λπ)1/σ)σ(d+α+1/2)exp(σ2(π2Λ)1/σN1/σ)c1T(c2d)c3ddα/2exp(c4N1/σ),\begin{split}&\quad\|(u(T,n/N))_{n\in[N]^{d}}-\vec{u}(T)\|\\ &\leq 2^{3/2+\sigma}\pi^{\alpha+\sigma/2+1/2}\Lambda^{1/2}Td^{\alpha/2}\left(\frac{2\lceil\sigma(d+\alpha+1/2)\rceil}{\sigma}\left(\frac{2\Lambda}{\pi}\right)^{1/\sigma}\right)^{\lceil\sigma(d+\alpha+1/2)\rceil}\exp\left(-\frac{\sigma}{2}\left(\frac{\pi}{2\Lambda}\right)^{1/\sigma}N^{1/\sigma}\right)\\ &\leq c_{1}T(c_{2}d)^{c_{3}d}d^{\alpha/2}\exp\left(-c_{4}N^{1/\sigma}\right),\end{split} (156)

where we may choose

c1\displaystyle c_{1} =23/2+σπα+σ/2+1/2Λ1/2,\displaystyle=2^{3/2+\sigma}\pi^{\alpha+\sigma/2+1/2}\Lambda^{1/2}, (157)
c2\displaystyle c_{2} =max{2(2+α+1/σ)(2Λπ)1/σ,1},\displaystyle=\max\left\{2(2+\alpha+1/\sigma)\left(\frac{2\Lambda}{\pi}\right)^{1/\sigma},1\right\}, (158)
c3\displaystyle c_{3} =σ(3+α),\displaystyle=\sigma(3+\alpha), (159)
c4\displaystyle c_{4} =σ2(π2Λ)1/σ.\displaystyle=\frac{\sigma}{2}\left(\frac{\pi}{2\Lambda}\right)^{1/\sigma}. (160)

Proof of 8.

By 7 and 19, we have

|(u(T,n/N))n[N]d|u(T)2c1T(u(T,n/N))n[N]d(c2d)c3ddα/2exp(c4N1/σ).\|\ket{(u(T,n/N))_{n\in[N]^{d}}}-\ket{\vec{u}(T)}\|\leq\frac{2c_{1}T}{\|(u(T,n/N))_{n\in[N]^{d}}\|}(c_{2}d)^{c_{3}d}d^{\alpha/2}\exp\left(-c_{4}N^{1/\sigma}\right). (161)

The choice of NN can be solved by letting the error bound smaller than ϵ\epsilon. ∎

Appendix D Bounding the Trotter error

In this section we present the detailed proof of 9. For s[t0,t0+h]s\in[t_{0},t_{0}+h] and t[0,h]t\in[0,h], we consider three operators

S(s+t,s)=𝒯ess+t(BC(τ))𝑑τ,S(s+t,s)=\mathcal{T}e^{\int_{s}^{s+t}(-B-{C}(\tau))d\tau}, (162)
S~(s+t,s)=e(BC(t0+h/2))t,\widetilde{S}(s+t,s)=e^{(-B-{C}(t_{0}+h/2))t}, (163)

and

S2(s+t,s)=eBt/2eC(t0+h/2)teBt/2.S_{2}(s+t,s)=e^{-Bt/2}e^{-{C}(t_{0}+h/2)t}e^{-Bt/2}. (164)

The proof follows the steps that we first bound SI\|S-I\| and S~I\|\widetilde{S}-I\|, then separately bound SS~\|S-\widetilde{S}\| and S~S2\|\widetilde{S}-S_{2}\| using variation of parameters formula.

D.1 Local error due to the time dependency

First, S(s+t,s)S(s+t,s) satisfies the differential equation

ddtS(s+t,s)=(BC(s+t))S(s+t,s).\frac{d}{dt}S(s+t,s)=(-B-{C}(s+t))S(s+t,s). (165)

Integrate this differential equation and get

S(s+t,s)I=ss+t(BC(s+τ))S(s+τ,s)𝑑τ,S(s+t,s)-I=\int_{s}^{s+t}(-B-{C}(s+\tau))S(s+\tau,s)d\tau, (166)

and thus we may write

S(s+t,s)=I+tR1(s+t,s)S(s+t,s)=I+tR_{1}(s+t,s) (167)

where

R1(s+t,s)(B+maxτC(τ)).\|R_{1}(s+t,s)\|\leq\left(\|B\|+\max_{\tau}\|{C}(\tau)\|\right). (168)

Similarly,

S~(s+t,s)=I+tR2(s+t,s)\widetilde{S}(s+t,s)=I+tR_{2}(s+t,s) (169)

where

R2(s+t,s)(B+maxτC(τ)).\|R_{2}(s+t,s)\|\leq\left(\|B\|+\max_{\tau}\|{C}(\tau)\|\right). (170)

Now we bound the distance between SS and S~\widetilde{S}. Notice that S~\widetilde{S} satisfies the differential equation

ddtS~(t0+t,t0)=(BC(t0+h/2))S~(t0+t,t0)=(BC(t0+t))S~(t0+t,t0)+(C(t0+t)C(t0+h/2))S~(t0+t,t0).\begin{split}\frac{d}{dt}\widetilde{S}(t_{0}+t,t_{0})&=(-B-{C}(t_{0}+h/2))\widetilde{S}(t_{0}+t,t_{0})\\ &=(-B-{C}(t_{0}+t))\widetilde{S}(t_{0}+t,t_{0})+({C}(t_{0}+t)-{C}(t_{0}+h/2))\widetilde{S}(t_{0}+t,t_{0}).\end{split} (171)

By Taylor expansion, we can write

C(t0+t)=C(t0+h/2)+(th/2)C(t0+h/2)+R3(t),{C}(t_{0}+t)={C}(t_{0}+h/2)+(t-h/2){C}^{\prime}(t_{0}+h/2)+R_{3}(t), (172)

where

R3(t)(th/2)22maxτC′′(τ).\|R_{3}(t)\|\leq\frac{(t-h/2)^{2}}{2}\max_{\tau}\|{C}^{\prime\prime}(\tau)\|. (173)

Therefore the differential equation becomes

ddtS~(t0+t,t0)=(BC(t0+t))S~(t0+t,t0)+((th/2)C(t0+h/2)+R3(t))S~(t0+t,t0).\frac{d}{dt}\widetilde{S}(t_{0}+t,t_{0})=(-B-{C}(t_{0}+t))\widetilde{S}(t_{0}+t,t_{0})+((t-h/2){C}^{\prime}(t_{0}+h/2)+R_{3}(t))\widetilde{S}(t_{0}+t,t_{0}). (174)

Comparing this with Equation 165 and using 20, we obtain

S~(t0+h,t0)=S(t0+h,t0)+0hS(t0+h,t0+t)((th/2)C(t0+h/2)+R3(t))S~(t0+t,t0)𝑑t.\widetilde{S}(t_{0}+h,t_{0})=S(t_{0}+h,t_{0})+\int_{0}^{h}S(t_{0}+h,t_{0}+t)((t-h/2){C}^{\prime}(t_{0}+h/2)+R_{3}(t))\widetilde{S}(t_{0}+t,t_{0})dt. (175)

Plugging in Equation 167 and Equation 169 yields

S~(t0+h,t0)=S(t0+h,t0)+0h(I+(ht)R1(t0+h,t0+t))(th/2)C(t0+h/2)(I+tR2(t0+t,t0))𝑑t+0hS(t0+h,t0+t)R3(t)S~(t0+t,t0)𝑑t=S(t0+h,t0)+0h(th/2)C(t0+h/2)𝑑t+0h(ht)(th/2)R1(t0+h,t0+t)C(t0+h/2)S~(t0+t,t0)𝑑t+0ht(th/2)C(t0+h/2)R2(t0+t,t0)𝑑t+0hS(t0+h,t0+t)R3(t)S~(t0+t,t0)𝑑t=S(t0+h,t0)+0h(ht)(th/2)R1(t0+h,t0+t)C(t0+h/2)S~(t0+t,t0)𝑑t+0ht(th/2)C(t0+h/2)R2(t0+t,t0)𝑑t+0hS(t0+h,t0+t)R3(t)S~(t0+t,t0)𝑑t.\begin{split}\widetilde{S}(t_{0}+h,t_{0})&=S(t_{0}+h,t_{0})+\int_{0}^{h}(I+(h-t)R_{1}(t_{0}+h,t_{0}+t))(t-h/2){C}^{\prime}(t_{0}+h/2)(I+tR_{2}(t_{0}+t,t_{0}))dt\\ &\quad\quad\quad\quad\quad+\int_{0}^{h}S(t_{0}+h,t_{0}+t)R_{3}(t)\widetilde{S}(t_{0}+t,t_{0})dt\\ &=S(t_{0}+h,t_{0})+\int_{0}^{h}(t-h/2){C}^{\prime}(t_{0}+h/2)dt\\ &\quad\quad\quad\quad\quad+\int_{0}^{h}(h-t)(t-h/2)R_{1}(t_{0}+h,t_{0}+t){C}^{\prime}(t_{0}+h/2)\widetilde{S}(t_{0}+t,t_{0})dt\\ &\quad\quad\quad\quad\quad+\int_{0}^{h}t(t-h/2){C}^{\prime}(t_{0}+h/2)R_{2}(t_{0}+t,t_{0})dt+\int_{0}^{h}S(t_{0}+h,t_{0}+t)R_{3}(t)\widetilde{S}(t_{0}+t,t_{0})dt\\ &=S(t_{0}+h,t_{0})+\int_{0}^{h}(h-t)(t-h/2)R_{1}(t_{0}+h,t_{0}+t){C}^{\prime}(t_{0}+h/2)\widetilde{S}(t_{0}+t,t_{0})dt\\ &\quad\quad\quad\quad\quad+\int_{0}^{h}t(t-h/2){C}^{\prime}(t_{0}+h/2)R_{2}(t_{0}+t,t_{0})dt+\int_{0}^{h}S(t_{0}+h,t_{0}+t)R_{3}(t)\widetilde{S}(t_{0}+t,t_{0})dt.\end{split} (176)

Therefore

S~(t0+h,t0)S(t0+h,t0)0ht|th/2|𝑑t(max(R1C)+max(CR2))+0hR3(t)𝑑t0ht|th/2|𝑑t(max(R1C)+max(CR2))+0h(th/2)22𝑑tmaxC′′=h3(14(B+maxC)maxC+124maxC′′).\begin{split}&\quad\|\widetilde{S}(t_{0}+h,t_{0})-S(t_{0}+h,t_{0})\|\\ &\leq\int_{0}^{h}t|t-h/2|dt\left(\max(\|R_{1}\|\|{C}^{\prime}\|)+\max(\|{C}^{\prime}\|\|R_{2}\|)\right)+\int_{0}^{h}\|R_{3}(t)\|dt\\ &\leq\int_{0}^{h}t|t-h/2|dt\left(\max(\|R_{1}\|\|{C}^{\prime}\|)+\max(\|{C}^{\prime}\|\|R_{2}\|)\right)+\int_{0}^{h}\frac{(t-h/2)^{2}}{2}dt\max\|{C}^{\prime\prime}\|\\ &=h^{3}\left(\frac{1}{4}(\|B\|+\max\|{C}\|)\max\|{C}^{\prime}\|+\frac{1}{24}\max\|{C}^{\prime\prime}\|\right).\end{split} (177)

D.2 Local time-independent Trotter error

Next, we bound the distance between S~\widetilde{S} and S2S_{2}. For notation simplicity, throughout this subsection we let A1=BA_{1}=-B and A2=C(t0+h/2)A_{2}=-{C}(t_{0}+h/2). Differentiating S~(t0+t,t0)\widetilde{S}(t_{0}+t,t_{0}) and eA1(ts)e^{A_{1}(t-s)} yields

ddtS~(t0+t,t0)=A1S~(t0+t,t0)+A2S~(t0+t,t0),\frac{d}{dt}\widetilde{S}(t_{0}+t,t_{0})=A_{1}\widetilde{S}(t_{0}+t,t_{0})+A_{2}\widetilde{S}(t_{0}+t,t_{0}), (178)

and

ddt(eA1(ts))=A1eA1(ts).\frac{d}{dt}(e^{A_{1}(t-s)})=A_{1}e^{A_{1}(t-s)}. (179)

By 20, for any t[0,h]t\in[0,h], we have

S~(t0+t,t0)=eA1t+0teA1(ts)A2S~(t0+s,t0)𝑑s.\widetilde{S}(t_{0}+t,t_{0})=e^{A_{1}t}+\int_{0}^{t}e^{A_{1}(t-s)}A_{2}\widetilde{S}(t_{0}+s,t_{0})ds. (180)

We use this equation iteratively for three times and get

S~(t0+h,t0)=eA1h+0heA1(ht)A2S~(t0+t,t0)𝑑t=eA1h+0heA1(ht)A2eA1t𝑑t+0h0teA1(ht)A2eA1(ts)A2S~(t0+s,t0)𝑑s𝑑t=eA1h+0heA1(ht)A2eA1t𝑑t+0h0teA1(ht)A2eA1(ts)A2eA1s𝑑s𝑑t+R4(h),\begin{split}\widetilde{S}(t_{0}+h,t_{0})&=e^{A_{1}h}+\int_{0}^{h}e^{A_{1}(h-t)}A_{2}\widetilde{S}(t_{0}+t,t_{0})dt\\ &=e^{A_{1}h}+\int_{0}^{h}e^{A_{1}(h-t)}A_{2}e^{A_{1}t}dt+\int_{0}^{h}\int_{0}^{t}e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}\widetilde{S}(t_{0}+s,t_{0})dsdt\\ &=e^{A_{1}h}+\int_{0}^{h}e^{A_{1}(h-t)}A_{2}e^{A_{1}t}dt+\int_{0}^{h}\int_{0}^{t}e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}e^{A_{1}s}dsdt+R_{4}(h),\end{split} (181)

where

R4(h)=0h0t0seA1(ht)A2eA1(ts)A2eA1(sτ)A2S~(t0+τ,t0)𝑑τ𝑑s𝑑t.R_{4}(h)=\int_{0}^{h}\int_{0}^{t}\int_{0}^{s}e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}e^{A_{1}(s-\tau)}A_{2}\widetilde{S}(t_{0}+\tau,t_{0})d\tau dsdt. (182)

Meanwhile, using the Taylor expansion of eA2he^{A_{2}h}, we may obtain

S2(t0+h,t0)=eA1h/2eA2heA1h/2=eA1h+heA1h/2A2eA1h/2+h22eA1h/2A22eA1h/2+R5(h),\begin{split}S_{2}(t_{0}+h,t_{0})&=e^{A_{1}h/2}e^{A_{2}h}e^{A_{1}h/2}\\ &=e^{A_{1}h}+he^{A_{1}h/2}A_{2}e^{A_{1}h/2}+\frac{h^{2}}{2}e^{A_{1}h/2}A_{2}^{2}e^{A_{1}h/2}+R_{5}(h),\end{split} (183)

where

R5(h)=eA1h/2(0h(ht)22A23eA2t𝑑t)eA1h/2.R_{5}(h)=e^{A_{1}h/2}\left(\int_{0}^{h}\frac{(h-t)^{2}}{2}A_{2}^{3}e^{A_{2}t}dt\right)e^{A_{1}h/2}. (184)

Therefore,

S~(t0+h,t0)S2(t0+h,t0)=(0heA1(ht)A2eA1t𝑑theA1h/2A2eA1h/2)+(0h0teA1(ht)A2eA1(ts)A2eA1s𝑑s𝑑th22eA1h/2A22eA1h/2)+R4(h)R5(h).\begin{split}&\quad\widetilde{S}(t_{0}+h,t_{0})-S_{2}(t_{0}+h,t_{0})\\ &=\left(\int_{0}^{h}e^{A_{1}(h-t)}A_{2}e^{A_{1}t}dt-he^{A_{1}h/2}A_{2}e^{A_{1}h/2}\right)\\ &\quad+\left(\int_{0}^{h}\int_{0}^{t}e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}e^{A_{1}s}dsdt-\frac{h^{2}}{2}e^{A_{1}h/2}A_{2}^{2}e^{A_{1}h/2}\right)\\ &\quad+R_{4}(h)-R_{5}(h).\end{split} (185)

It remains to bound each term in the right hand side of Equation 185. Let f(t)=eA1(ht)A2eA1tf(t)=e^{A_{1}(h-t)}A_{2}e^{A_{1}t}, then

0heA1(ht)A2eA1t𝑑theA1h/2A2eA1h/2=0hf(t)𝑑thf(h/2)=0h(f(h/2)+(th/2)f(h/2)+0t(ts)f′′(s)𝑑s)𝑑thf(h/2)=0h0t(ts)f′′(s)𝑑s𝑑t.\begin{split}&\quad\int_{0}^{h}e^{A_{1}(h-t)}A_{2}e^{A_{1}t}dt-he^{A_{1}h/2}A_{2}e^{A_{1}h/2}\\ &=\int_{0}^{h}f(t)dt-hf(h/2)\\ &=\int_{0}^{h}\left(f(h/2)+(t-h/2)f^{\prime}(h/2)+\int_{0}^{t}(t-s)f^{\prime\prime}(s)ds\right)dt-hf(h/2)\\ &=\int_{0}^{h}\int_{0}^{t}(t-s)f^{\prime\prime}(s)dsdt.\end{split} (186)

Notice that f′′(t)=eA1(ht)[[A2,A1],A1]eA1tf^{\prime\prime}(t)=e^{A_{1}(h-t)}[[A_{2},A_{1}],A_{1}]e^{A_{1}t}, so we bound

S~(t0+h,t0)S2(t0+h,t0)0h0t|ts|𝑑s𝑑tmaxτ[0,h]f′′(τ)16h3[[A2,A1],A1].\left\|\widetilde{S}(t_{0}+h,t_{0})-S_{2}(t_{0}+h,t_{0})\right\|\leq\int_{0}^{h}\int_{0}^{t}|t-s|dsdt\max_{\tau\in[0,h]}\|f^{\prime\prime}(\tau)\|\leq\frac{1}{6}h^{3}\|[[A_{2},A_{1}],A_{1}]\|. (187)

Similarly, let g(t,s)=eA1(ht)A2eA1(ts)A2eA1sg(t,s)=e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}e^{A_{1}s}, then

0h0teA1(ht)A2eA1(ts)A2eA1s𝑑s𝑑th22eA1h/2A22eA1h/2=0h0tg(t,s)𝑑s𝑑th22g(h/2,h/2)=0h0t(g(h/2,h/2)+01g(h/2+α(th/2),h/2+α(sh/2))(th/2,sh/2)𝑑α)𝑑s𝑑th22g(h/2,h/2)=0h0t01g(h/2+α(th/2),h/2+α(sh/2))(th/2,sh/2)𝑑α𝑑s𝑑t.\begin{split}&\quad\int_{0}^{h}\int_{0}^{t}e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}e^{A_{1}s}dsdt-\frac{h^{2}}{2}e^{A_{1}h/2}A_{2}^{2}e^{A_{1}h/2}\\ &=\int_{0}^{h}\int_{0}^{t}g(t,s)dsdt-\frac{h^{2}}{2}g(h/2,h/2)\\ &=\int_{0}^{h}\int_{0}^{t}\left(g(h/2,h/2)+\int_{0}^{1}\nabla g(h/2+\alpha(t-h/2),h/2+\alpha(s-h/2))\cdot\left(t-h/2,s-h/2\right)d\alpha\right)dsdt-\frac{h^{2}}{2}g(h/2,h/2)\\ &=\int_{0}^{h}\int_{0}^{t}\int_{0}^{1}\nabla g(h/2+\alpha(t-h/2),h/2+\alpha(s-h/2))\cdot\left(t-h/2,s-h/2\right)d\alpha dsdt.\end{split} (188)

For 0α10\leq\alpha\leq 1, 0st0\leq s\leq t and 0th0\leq t\leq h, we always have 0h/2+α(sh/2)h/2+α(th/2)0\leq h/2+\alpha(s-h/2)\leq h/2+\alpha(t-h/2) and 0h/2+α(th/2)h0\leq h/2+\alpha(t-h/2)\leq h, so

0h0teA1(ht)A2eA1(ts)A2eA1s𝑑s𝑑th22eA1h/2A22eA1h/20h0t01(|th/2|+|sh/2|)𝑑α𝑑s𝑑t(max{max0sthg(t,s)t,max0sthg(t,s)s})=14h3(max{max0sthg(t,s)t,max0sthg(t,s)s}).\begin{split}&\quad\left\|\int_{0}^{h}\int_{0}^{t}e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}e^{A_{1}s}dsdt-\frac{h^{2}}{2}e^{A_{1}h/2}A_{2}^{2}e^{A_{1}h/2}\right\|\\ &\leq\int_{0}^{h}\int_{0}^{t}\int_{0}^{1}\left(|t-h/2|+|s-h/2|\right)d\alpha dsdt\left(\max\left\{\max_{0\leq s^{\prime}\leq t^{\prime}\leq h}\left\|\frac{\partial g(t^{\prime},s^{\prime})}{\partial t}\right\|,\max_{0\leq s^{\prime}\leq t^{\prime}\leq h}\left\|\frac{\partial g(t^{\prime},s^{\prime})}{\partial s}\right\|\right\}\right)\\ &=\frac{1}{4}h^{3}\left(\max\left\{\max_{0\leq s^{\prime}\leq t^{\prime}\leq h}\left\|\frac{\partial g(t^{\prime},s^{\prime})}{\partial t}\right\|,\max_{0\leq s^{\prime}\leq t^{\prime}\leq h}\left\|\frac{\partial g(t^{\prime},s^{\prime})}{\partial s}\right\|\right\}\right).\end{split} (189)

We may compute that

gt=eA1(ht)[A2,A1]eA1(ts)A2eA1s,\frac{\partial g}{\partial t}=e^{A_{1}(h-t)}[A_{2},A_{1}]e^{A_{1}(t-s)}A_{2}e^{A_{1}s}, (190)

and

gs=eA1(ht)A2eA1(ts)[A2,A1]eA1s,\frac{\partial g}{\partial s}=e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}[A_{2},A_{1}]e^{A_{1}s}, (191)

so for 0sth0\leq s\leq t\leq h, both g/t\|\partial g/\partial t\| and g/s\|\partial g/\partial s\| are bounded by [A2,A1]A2\|[A_{2},A_{1}]\|\|A_{2}\|. Then

0h0teA1(ht)A2eA1(ts)A2eA1s𝑑s𝑑th22eA1h/2A22eA1h/214h3[A2,A1]A2.\left\|\int_{0}^{h}\int_{0}^{t}e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}e^{A_{1}s}dsdt-\frac{h^{2}}{2}e^{A_{1}h/2}A_{2}^{2}e^{A_{1}h/2}\right\|\leq\frac{1}{4}h^{3}\|[A_{2},A_{1}]\|\|A_{2}\|. (192)

The bounds for R4R_{4} and R5R_{5} are straightforward from Equation 182 and Equation 184 that

R4(h)16h3A23\|R_{4}(h)\|\leq\frac{1}{6}h^{3}\|A_{2}\|^{3} (193)

and

R5(h)16h3A23.\|R_{5}(h)\|\leq\frac{1}{6}h^{3}\|A_{2}\|^{3}. (194)

Plug Equations 187, 192, 193 and 194 back to Equation 185, and we get

S~(t0+h,t0)S2(t0+h,t0)h3(16[A1,[A1,A2]]+14[A1,A2]A2+13A23)h3(16max[B,[B,C]]+14max[B,C]maxC+13maxC3).\begin{split}\left\|\widetilde{S}(t_{0}+h,t_{0})-S_{2}(t_{0}+h,t_{0})\right\|&\leq h^{3}\left(\frac{1}{6}\|[A_{1},[A_{1},A_{2}]]\|+\frac{1}{4}\|[A_{1},A_{2}]\|\|A_{2}\|+\frac{1}{3}\|A_{2}\|^{3}\right)\\ &\leq h^{3}\left(\frac{1}{6}\max\|[B,[B,{C}]]\|+\frac{1}{4}\max\|[B,{C}]\|\max\|{C}\|+\frac{1}{3}\max\|{C}\|^{3}\right).\end{split} (195)

D.3 Global error

Combining Equation 177 and Equation 195, we obtain the local error bound as

S(t0+h,t0)S2(t0+h,t0)h3(124maxC′′+14(B+maxC)maxC+16max[B,[B,C]]+14max[B,C]maxC+13maxC3).\begin{split}&\quad\|S(t_{0}+h,t_{0})-S_{2}(t_{0}+h,t_{0})\|\\ &\leq h^{3}\left(\frac{1}{24}\max\|{C}^{\prime\prime}\|+\frac{1}{4}(\|B\|+\max\|{C}\|)\max\|{C}^{\prime}\|\right.\\ &\quad\quad\quad\quad\quad\left.+\frac{1}{6}\max\|[B,[B,{C}]]\|+\frac{1}{4}\max\|[B,{C}]\|\max\|{C}\|+\frac{1}{3}\max\|{C}\|^{3}\right).\end{split} (196)

Finally we bound the global error. Notice that, for any t0t\geq 0, S2(t+s,t)1\|S_{2}(t+s,t)\|\leq 1 and S(s+t,t)1\|S(s+t,t)\|\leq 1, then the global error accumulates linearly as

S(T,0)j=0r1S2((j+1)h,jh)=j=0r1S((j+1)h,jh)j=0r1S2((j+1)h,jh)k=0r1j=k+1r1S2((j+1)h,jh)(S((k+1)h,kh)S2((k+1)h,kh))j=0k1S((j+1)h,jh)k=0r1S((k+1)h,kh)S2((k+1)h,kh)Th2(124maxC′′+14(B+maxC)maxC+16max[B,[B,C]]+14max[B,C]maxC+13maxC3).\begin{split}&\quad\|S(T,0)-\prod_{j=0}^{r-1}S_{2}((j+1)h,jh)\|\\ &=\left\|\prod_{j=0}^{r-1}S((j+1)h,jh)-\prod_{j=0}^{r-1}S_{2}((j+1)h,jh)\right\|\\ &\leq\sum_{k=0}^{r-1}\left\|\prod_{j=k+1}^{r-1}S_{2}((j+1)h,jh)\left(S((k+1)h,kh)-S_{2}((k+1)h,kh)\right)\prod_{j=0}^{k-1}S((j+1)h,jh)\right\|\\ &\leq\sum_{k=0}^{r-1}\left\|S((k+1)h,kh)-S_{2}((k+1)h,kh)\right\|\\ &\leq Th^{2}\left(\frac{1}{24}\max\|{C}^{\prime\prime}\|+\frac{1}{4}(\|B\|+\max\|{C}\|)\max\|{C}^{\prime}\|\right.\\ &\quad\quad\quad\quad\quad\left.+\frac{1}{6}\max\|[B,[B,{C}]]\|+\frac{1}{4}\max\|[B,{C}]\|\max\|{C}\|+\frac{1}{3}\max\|{C}\|^{3}\right).\end{split} (197)

Appendix E Bounding the commutators

Proof of 10.

We shall consider a fixed tt and omit the explicit time dependence in our notation. We first consider [B,C][B,{C}] and write

[B,C]=[()dD(1)d,C]=()d[D,(1)dC()d](1)d=[D,(1)dC()d].\begin{split}\|[B,{C}]\|&=\|[(\mathcal{F})^{\otimes d}D(\mathcal{F}^{-1})^{\otimes d},{C}]\|\\ &=\|(\mathcal{F})^{\otimes d}[D,(\mathcal{F}^{-1})^{\otimes d}{C}(\mathcal{F})^{\otimes d}](\mathcal{F}^{-1})^{\otimes d}\|\\ &=\|[D,(\mathcal{F}^{-1})^{\otimes d}{C}(\mathcal{F})^{\otimes d}]\|.\end{split} (198)

By the definition of the matrices, we have

C()d=1Nd/2j,k[N]dc(j/N)ωNjk|j0jd1k0kd1|\begin{split}{C}(\mathcal{F})^{\otimes d}&=\frac{1}{N^{d/2}}\sum_{j,k\in[N]^{d}}{c}(j/N)\omega_{N}^{j\cdot k}\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}\end{split} (199)

and

(1)dC()d=1Ndj,k[N]d(l[N]dωNjlc(l/N)ωNlk)|j0jd1k0kd1|=1Ndj,k[N]d(l[N]d(mdc^mωNlm)ωNl(kj))|j0jd1k0kd1|=1Ndj,k[N]d(mdc^m(l[N]dωNl(m(jk))))|j0jd1k0kd1|=j,k[N]d(mdc^jk+Nm)|j0jd1k0kd1|.\begin{split}(\mathcal{F}^{-1})^{\otimes d}{C}(\mathcal{F})^{\otimes d}&=\frac{1}{N^{d}}\sum_{j,k\in[N]^{d}}\left(\sum_{l\in[N]^{d}}\omega_{N}^{-j\cdot l}{c}(l/N)\omega_{N}^{l\cdot k}\right)\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}\\ &=\frac{1}{N^{d}}\sum_{j,k\in[N]^{d}}\left(\sum_{l\in[N]^{d}}\left(\sum_{m\in\mathbb{Z}^{d}}\hat{c}_{m}\omega_{N}^{l\cdot m}\right)\omega_{N}^{l\cdot(k-j)}\right)\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}\\ &=\frac{1}{N^{d}}\sum_{j,k\in[N]^{d}}\left(\sum_{m\in\mathbb{Z}^{d}}\hat{c}_{m}\left(\sum_{l\in[N]^{d}}\omega_{N}^{l\cdot(m-(j-k))}\right)\right)\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}\\ &=\sum_{j,k\in[N]^{d}}\left(\sum_{m\in\mathbb{Z}^{d}}\hat{c}_{j-k+Nm}\right)\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}.\end{split} (200)

Let b^l=mdc^l+Nm\hat{b}_{l}=\sum_{m\in\mathbb{Z}^{d}}\hat{c}_{l+Nm}, and we can compute that

[D,(1)dC()d]=j,k[N]d(i(j)αi(k)α)b^jk|j0jd1k0kd1|.[D,(\mathcal{F}^{-1})^{\otimes d}{C}(\mathcal{F})^{\otimes d}]=\sum_{j,k\in[N]^{d}}\left(\|i(j)\|^{\alpha}-\|i(k)\|^{\alpha}\right)\hat{b}_{j-k}\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}. (201)

Notice the decomposition

i(j)αi(k)α=(i(j)α/2i(k)α/2)2+2(i(j)α/2i(k)α/2)i(k)α/2.\|i(j)\|^{\alpha}-\|i(k)\|^{\alpha}=(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2})^{2}+2(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2})\|i(k)\|^{\alpha/2}. (202)

We may write

[D,(1)dC()d]=A1+2A2A3[D,(\mathcal{F}^{-1})^{\otimes d}{C}(\mathcal{F})^{\otimes d}]=A_{1}+2A_{2}A_{3} (203)

where

A1=j,k[N]d(i(j)α/2i(k)α/2)2b^jk|j0jd1k0kd1|,A_{1}=\sum_{j,k\in[N]^{d}}\left(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2}\right)^{2}\hat{b}_{j-k}\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}, (204)
A2=j,k[N]d(i(j)α/2i(k)α/2)b^jk|j0jd1k0kd1|A_{2}=\sum_{j,k\in[N]^{d}}\left(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2}\right)\hat{b}_{j-k}\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}} (205)

and

A3=k[N]di(k)α/2|k0kd1k0kd1|.A_{3}=\sum_{k\in[N]^{d}}\|i(k)\|^{\alpha/2}\ket{k_{0}\cdots k_{d-1}}\bra{k_{0}\cdots k_{d-1}}. (206)

Now we bound the norms of AjA_{j}’s. The norm of A3A_{3} is clearly

A3=𝒪(dα/4Nα/2).\|A_{3}\|=\mathcal{O}(d^{\alpha/4}N^{\alpha/2}). (207)

For A2A_{2}, let q=argmax||i(jq)||i(kq)||q=\text{argmax}||i(j_{q^{\prime}})|-|i(k_{q^{\prime}})||. We use the inequality

|i(j)α/2i(k)α/2||i(j)i(k)|α/2|i(j)||i(k)|α/2dα/4||i(jq)||i(kq)||α/2.\begin{split}\left|\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2}\right|&\leq\left|\|i(j)\|-\|i(k)\|\right|^{\alpha/2}\\ &\leq\||i(j)|-|i(k)|\|^{\alpha/2}\\ &\leq d^{\alpha/4}||i(j_{q})|-|i(k_{q})||^{\alpha/2}.\end{split} (208)

Here the second line is the triangle inequality (and |v||v| for a vector vv denotes its entrywise absolute), the third line is due to the definition of qq, and the first line can be proved by the following arguments: let ba0b\geq a\geq 0 be two real number and f(x)=xα/2f(x)=x^{\alpha/2}, then bα/2aα/2=abf(x)𝑑x0baf(x)𝑑x=(ba)α/2b^{\alpha/2}-a^{\alpha/2}=\int_{a}^{b}f^{\prime}(x)dx\leq\int_{0}^{b-a}f^{\prime}(x)dx=(b-a)^{\alpha/2} because f(x)=α2xα/21f^{\prime}(x)=\frac{\alpha}{2}x^{\alpha/2-1} is monotonically decreasing for 0<α20<\alpha\leq 2. Notice that the Fourier coefficient |c^l|=𝒪(|lq|(2+d))|\hat{c}_{l}|=\mathcal{O}(|l_{q^{\prime}}|^{-(2+d)}) for any index qq^{\prime}, and when |i(jq)i(kq)|N/2|i(j_{q})-i(k_{q})|\leq N/2 (c.f., i(jq)i(kq)>N/2i(j_{q})-i(k_{q})>N/2 or i(jq)i(kq)<N/2i(j_{q})-i(k_{q})<-N/2), c^i(j)i(k)\hat{c}_{i(j)-i(k)} (c.f., c^i(j)i(k)N\hat{c}_{i(j)-i(k)-N} or c^i(j)i(k)+N\hat{c}_{i(j)-i(k)+N}) is the term with lowest frequency in b^jk\hat{b}_{j-k}, We have

k[N]d|i(j)α/2i(k)α/2||b^jk|dα/4k[N]d||i(jq)||i(kq)||α/2|b^jk|dα/4k[N]d,m[N]djk+Nmα/2|c^jk+Nm|=𝒪(dα/4).\begin{split}\sum_{k\in[N]^{d}}\left|\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2}\right||\hat{b}_{j-k}|&\leq d^{\alpha/4}\sum_{k\in[N]^{d}}||i(j_{q})|-|i(k_{q})||^{\alpha/2}|\hat{b}_{j-k}|\\ &\leq d^{\alpha/4}\sum_{k\in[N]^{d},m\in[N]^{d}}\|j-k+Nm\|_{\infty}^{\alpha/2}|\hat{c}_{j-k+Nm}|\\ &=\mathcal{O}(d^{\alpha/4}).\end{split} (209)

Similarly, the summation over jj is also bounded by 𝒪(dα/4)\mathcal{O}(d^{\alpha/4}). These imply A21=𝒪(dα/4)\|A_{2}\|_{1}=\mathcal{O}(d^{\alpha/4}) and A2=𝒪(dα/4)\|A_{2}\|_{\infty}=\mathcal{O}(d^{\alpha/4}), which further implies that

A2=𝒪(dα/4)\|A_{2}\|=\mathcal{O}(d^{\alpha/4}) (210)

since A2A1A\|A\|^{2}\leq\|A\|_{1}\|A\|_{\infty} for any matrix AA. For A1A_{1}, we use Equation 208 again that

(i(j)α/2i(k)α/2)2dα/2||i(jq)||i(kq)||α,\left(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2}\right)^{2}\leq d^{\alpha/2}||i(j_{q})|-|i(k_{q})||^{\alpha}, (211)

and

A1=𝒪(dα/2)\|A_{1}\|=\mathcal{O}(d^{\alpha/2}) (212)

by the same argument for A2A_{2} (with the only difference that we need the Fourier coefficient to decay as |c^l|=𝒪(|lq|(3+d))|\hat{c}_{l}|=\mathcal{O}(|l_{q^{\prime}}|^{-(3+d)})). Plugging Equations 212, 210 and 207 back to Equation 203 yields

[B,C]=𝒪(dα/2Nα/2).\|[B,{C}]\|=\mathcal{O}(d^{\alpha/2}N^{\alpha/2}). (213)

The second estimate [B,[B,C]]=𝒪(dαNα)\|[B,[B,{C}]]\|=\mathcal{O}(d^{\alpha}N^{\alpha}) can be proved in the same way by using |c^l|=𝒪(|lq|(5+d))|\hat{c}_{l}|=\mathcal{O}(|l_{q^{\prime}}|^{-(5+d)}) and noticing that

[B,[B,C]]=[D,[D,(1)dC()d]],\|[B,[B,{C}]]\|=\|[D,[D,(\mathcal{F}^{-1})^{\otimes d}{C}(\mathcal{F})^{\otimes d}]]\|, (214)
[D,[D,(1)dC()d]]=j,k[N]d(i(j)αi(k)α)2b^jk|j0jd1k0kd1|,[D,[D,(\mathcal{F}^{-1})^{\otimes d}{C}(\mathcal{F})^{\otimes d}]]=\sum_{j,k\in[N]^{d}}\left(\|i(j)\|^{\alpha}-\|i(k)\|^{\alpha}\right)^{2}\hat{b}_{j-k}\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}, (215)

and

(i(j)αi(k)α)2=(i(j)α/2i(k)α/2)4+4(i(j)α/2i(k)α/2)3i(k)α/2+4(i(j)α/2i(k)α/2)2i(k)α.\begin{split}&\quad(\|i(j)\|^{\alpha}-\|i(k)\|^{\alpha})^{2}\\ &=(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2})^{4}+4(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2})^{3}\|i(k)\|^{\alpha/2}+4(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2})^{2}\|i(k)\|^{\alpha}.\end{split} (216)

Appendix F Discretization error in the LCHS-IP method

Proof of 17.

Let U(t;ξ)=𝒯ei0tξ(B+C(s))𝑑s=eiξBt𝒯ei0tHI(s;ξ)𝑑sU(t;\xi)=\mathcal{T}e^{-i\int_{0}^{t}\xi(B+{C}(s))ds}=e^{-i\xi Bt}\mathcal{T}e^{-i\int_{0}^{t}H_{I}(s;\xi)ds} and V(t;ξ)=1π(1+ξ2)U(t;ξ)V(t;\xi)=\frac{1}{\pi(1+\xi^{2})}U(t;\xi). The truncation error can be bounded by

𝒯e0T(B+C(s))𝑑sΞΞ1π(1+ξ2)U(T;ξ)𝑑ξ2Ξdξπ(1+ξ2)=2π(π2arctan(Ξ))2πΞ.\left\|\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}-\int_{-\Xi}^{\Xi}\frac{1}{\pi(1+\xi^{2})}U(T;\xi)d\xi\right\|\leq 2\int_{\Xi}^{\infty}\frac{d\xi}{\pi(1+\xi^{2})}=\frac{2}{\pi}(\frac{\pi}{2}-\arctan(\Xi))\leq\frac{2}{\pi\Xi}. (217)

According to the standard quadrature error bound, we have

𝒯e0T(B+C(s))𝑑sj=0M1wjU(T;ξj)2Ξ2Msupξ[Ξ,Ξ]Vξ.\left\|\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}-\sum_{j=0}^{M-1}w_{j}U(T;\xi_{j})\right\|\leq\frac{2\Xi^{2}}{M}\sup_{\xi\in[-\Xi,\Xi]}\left\|\frac{\partial V}{\partial\xi}\right\|. (218)

To estimate V/ξ\partial V/\partial\xi, we first compute U/ξ\partial U/\partial\xi. Differentiating Equation 79 yields

tUξ=i(B+C(t))Ui(ξB+ξC(t))Uξ,Uξ(0;ξ)=0.\frac{\partial}{\partial t}\frac{\partial U}{\partial\xi}=-i(B+{C}(t))U-i(\xi B+\xi{C}(t))\frac{\partial U}{\partial\xi},\quad\frac{\partial U}{\partial\xi}(0;\xi)=0. (219)

By the variation of constants formula, we have

Uξ(t;ξ)=0t(𝒯eist(ξB+ξC(τ))𝑑τ)(i)(B+C(s))U(s;ξ)𝑑s,\frac{\partial U}{\partial\xi}(t;\xi)=\int_{0}^{t}\left(\mathcal{T}e^{-i\int_{s}^{t}(\xi B+\xi{C}(\tau))d\tau}\right)(-i)(B+{C}(s))U(s;\xi)ds, (220)

and thus

UξT(B+maxC).\left\|\frac{\partial U}{\partial\xi}\right\|\leq T(\|B\|+\max\|{C}\|). (221)

Therefore, by the product rule, we have

Vξ2π|ξ|π2(1+ξ2)+1π(1+ξ2)Uξ1π(1+T(B+maxC)).\begin{split}\left\|\frac{\partial V}{\partial\xi}\right\|&\leq\frac{2\pi|\xi|}{\pi^{2}(1+\xi^{2})}+\frac{1}{\pi(1+\xi^{2})}\left\|\frac{\partial U}{\partial\xi}\right\|\\ &\leq\frac{1}{\pi}\left(1+T(\|B\|+\max\|{C}\|)\right).\end{split} (222)

Plugging this back into Equation 218 and Equation 217 gives the desired error bound, and the choices of Ξ\Xi and MM directly follow from the error bound. ∎