Quantum algorithms for linear and non-linear fractional reaction-diffusion equations

Dong An¹, Konstantina Trivisa^2,3
¹ Joint Center for Quantum Information and Computer Science, University of Maryland, MD 20742, USA
² Department of Mathematics, University of Maryland, MD 20742, USA
³ Institute for Physical Science and Technology, University of Maryland, MD 20742, USA

Abstract

High-dimensional fractional reaction-diffusion equations have numerous applications in the fields of biology, chemistry, and physics, and exhibit a range of rich phenomena. While classical algorithms have an exponential complexity in the spatial dimension, a quantum computer can produce a quantum state that encodes the solution with only polynomial complexity, provided that suitable input access is available. In this work, we investigate efficient quantum algorithms for linear and nonlinear fractional reaction-diffusion equations with periodic boundary conditions. For linear equations, we analyze and compare the complexity of various methods, including the second-order Trotter formula, time-marching method, and truncated Dyson series method. We also present a novel algorithm that combines the linear combination of Hamiltonian simulation technique with the interaction picture formalism, resulting in optimal scaling in the spatial dimension. For nonlinear equations, we employ the Carleman linearization method and propose a block-encoding version that is appropriate for the dense matrices that arise from the spatial discretization of fractional reaction-diffusion equations.

1 Introduction

Reaction–diffusion equations arise in many areas in science and engineering [33, 9, 35, 17, 22]. In population dynamics models in biology, the reaction term typically accounts for growth, whereas the diffusion term accounts for migration [24, 31]. The classical diffusion term has its origin in a model in physics. Recent research investigations indicate that the classical diffusion equation is inadequate to model many real situations, where a particle plume spreads faster than that predicted by the classical model, and may exhibit significant asymmetry [32]. In a fractional diffusion equation, the classical Laplace operator in the spatial variable is replaced by a fractional Laplacian of order less than two $(-\Delta)^{\alpha/2}$ where $0<\alpha\leq 2$ . The fundamental solutions of these equations still exhibit useful scaling properties that make them attractive for applications.

The present article deals with

\begin{split}\partial_{t}u(t,x)&=-(-\Delta)^{\alpha/2}u(t,x)-c(t,x)u(t,x)+au(t,x)(1-u(t,x)),\quad t\in[0,T],x\in[0,1]^{d},\\ u(0,x)&=u_{0}(x).\end{split}

(1)

Here $(-\Delta)^{\alpha/2}$ is the fractional Laplacian where $0<\alpha\leq 2$ , and $c(t,x)$ is the potential function. For the nonlinear term, in this work we only consider the quadratic non-linearity which yields Fisher’s equation, but our results can be generalized to equations with high-order polynomial non-linearity. For the fractional Laplacian operator, there are several different definitions of $(-\Delta)^{\alpha/2}$ on bounded domain, including spectral definition and Riesz definition [28]. This works focuses on spectral fractional Laplacian with periodic boundary condition. Classical numerical algorithms for solving Equation 1 typically require exponential computational resources when the spatial variable is in high dimension: suppose that we use $N$ grid points or basis functions for spatial discretization in each dimension, then the dimension of semi-discretized differential equations becomes as large as $N^{d}$ . Therefore we would like to explore the power of quantum algorithms for fractional reaction-diffusion equations and whether quantum algorithms can be efficient in high-dimensional case.

Quantum algorithms for differential equations aim at preparing a quantum state encoding the solutions at discrete grid points in its amplitudes. The first quantum differential equation algorithm was proposed in [6], which transforms differential equations into a linear system of equations using multi-step discretization and then applies quantum linear system algorithms such as HHL algorithm [21] or advanced ones [10, 36, 3, 27, 16]. Since then, there have been remarkable progresses on designing better quantum algorithms, for linear differential equations based on refined discretization [7, 12, 13, 26, 8], time-marching strategy [18], Schrödingerization [25], linear combination of unitaries technique [4], and for nonlinear differential equations using linearization techniques [29, 1]. With the caveat that the output is a quantum state encoding solutions in its amplitudes, these quantum algorithms can achieve exponential speedup in the system size compared to classical algorithms.

However, when directly applied to fractional reaction-diffusion equations, existing generic quantum algorithms are not as efficient as expected due to two major difficulties. First, similar as the standard Laplacian operator, fractional Laplacian operator is an unbounded operator, so its spatially discretized version has a huge spectral norm as the spatial dimension and the number of grid points increase. Most existing quantum differential equation algorithms scale at least linearly on the spectral norm of the coefficient matrix, and thus can be computationally expensive in high-dimensional case for accurate simulation. Second, when the equation is genuinely fractional (i.e., $\alpha<2$ ), the coefficient matrix for the linear part after spatial discretization is unavoidably dense, because fractional differential operators are global operators that depends on the function evaluated in the entire space. The dense coefficient matrix poses computational difficulties in solving nonlinear equations. This is because all the existing quantum Carleman linearization algorithms require the coefficient matrix to be sparse in order to bypass the difficulty caused by the enlarged Carleman matrix, which is a direct sum of matrices in different dimensions [29].

We remark that the (fractional- or integer-order) Laplacian operator, and more general spatial differential operators, widely appear in various types of partial differential equations. In the contexts other than fractional reaction-diffusion equations, there have been several work managing to overcome the computational difficulty brought by its large spectral norm after spatial discretization, but unfortunately those techniques do not apply to the fractional reaction-diffusion equations. For example, in real-space Schrödinger equation, a poly-logarithmic dependence on the spectral norm of the Laplacian operator can be achieved by simulating the Hamiltonian in the interaction picture [30, 11], which simulates the transformed wavefunction under the rotation associated with the Laplacian operator. The resulting interaction picture Hamiltonian becomes bounded and thus can be efficiently simulated. To avoid the large spectral norm dependence in the rotations, the algorithm takes advantage of an important feature of the Laplacian operator: it can be diagonalized by the quantum Fourier transform (QFT) circuit and thus can be fastforwarded (i.e., Hamiltonian simulation governed by the Laplacian operator can be implemented with cost independent of evolution time and its spectral norm). However, such a technique does not directly work for fractional reaction-diffusion equations. The interaction picture transformation requires both forward and backward time evolution of the Laplacian operator. This is efficient for Schrödinger equations because the dynamics is reversible and both forward and backward time evolution operators are unitary, but fractional reaction-diffusion equation is a dissipative system and implementing its backward time evolution can be prohibitively expensive. The recent work [5] proposes efficient quantum algorithms for various partial differential equations beyond Schrödinger equations. The key technique there is a generalization of the fastforwarding simulation of the Laplacian operator. However, the algorithms in [5] require the entire coefficient matrix of the linear part to be fast-forwardable (i.e., its spectral decomposition has quantumly implementable eigenstates and classically computable eigenvalues), which cannot be satisfied even by linear fractional reaction-diffusion equations with the presence of the potential $c(t,x)$ .

Method	Queries to the matrices
Method	$d$	$\epsilon$	$T$	Norm
Second-order Trotter (12)	$\widetilde{\mathcal{O}}(d^{\alpha(1/2+\sigma/2)})$	$\widetilde{\mathcal{O}}(\epsilon^{-1/2})$	$\widetilde{\mathcal{O}}(T^{3/2})$	$\widetilde{\mathcal{O}}((g(T))^{3/2})$
Time-marching (14)	$\widetilde{\mathcal{O}}(d^{\alpha(1+2\sigma)})$	$\mathcal{O}(\text{poly}\log(1/\epsilon))$	$\widetilde{\mathcal{O}}(T^{2})$	$\widetilde{\mathcal{O}}(Q)$
Dyson series (16)	$\widetilde{\mathcal{O}}(d^{\alpha(1/2+\sigma)})$	$\mathcal{O}(\text{poly}\log(1/\epsilon))$	$\widetilde{\mathcal{O}}(T)$	$\widetilde{\mathcal{O}}(g(T))$
LCHS-IP (18)	$\widetilde{\mathcal{O}}(\text{poly}\log(d))$	$\widetilde{\mathcal{O}}(\epsilon^{-1})$	$\widetilde{\mathcal{O}}(T)$	$\widetilde{\mathcal{O}}(g(T)^{2})$
Method	Queries to the state preparation
Method	$d$	$\epsilon$	$T$	Norm
Second-order Trotter (12)	$\mathcal{O}(1)$	$\mathcal{O}(1)$	$\mathcal{O}(1)$	$\mathcal{O}(g(T))$
Time-marching (14)	$\mathcal{O}(1)$	$\mathcal{O}(1)$	$\mathcal{O}(1)$	$\mathcal{O}(Q)$
Dyson series (16)	$\widetilde{\mathcal{O}}(d^{\alpha(1/2+\sigma)})$	$\mathcal{O}(\text{poly}\log(1/\epsilon))$	$\widetilde{\mathcal{O}}(T)$	$\widetilde{\mathcal{O}}(g(T))$
LCHS-IP (18)	$\mathcal{O}(1)$	$\mathcal{O}(1)$	$\mathcal{O}(1)$	$\mathcal{O}(g(T))$

Table 1: Query complexities of differential methods for linear fractional reaction-diffusion equations. Here

d

is the spatial dimension,

\epsilon

is the tolerated error in

2

-norm,

T

is the evolution time,

\alpha

is the half order of the fractional Laplacian operator ranging in

(0,2]

and

\sigma

is the parameter of the Gevrey class defined in Equation 40. The function

g(T)\geq\|\vec{u}(0)\|/\|\vec{u}(T)\|

describes the decay of the spatially discretized solution,

Q

is the decay corrected by the spectral norm of infinitesimal evolution operators as defined in Equation 65, and we always have

Q\leq g(T)

In this work, we investigate efficient quantum algorithms for fractional reaction-diffusion equations. The majority of our work is devoted to linear fractional reaction-diffusion equations. We numerically treat the equations by the method of lines, i.e., first discretizing the spatial variable to obtain a system of ODEs

\frac{d}{dt}\vec{u}=-B\vec{u}-C(t)\vec{u},

(2)

and then solving the resulting ODE system with different quantum ODE algorithms. Here $\vec{u}$ represents the solution evaluated at different spatial grid points, $B$ is the discretized fractional Laplacian operator, and $C(t)$ is the potential matrix. For time evolution, we consider four different algorithms: second-order Trotter formula [14], time-marching method [18], truncated Dyson series method [8], and linear combination of Hamiltonian simulation in the interaction picture (LCHS-IP). We analyze the complexity of these four methods and the results are shown in Table 1. Our main results and contributions are summarized as follows:

1.

Second-order Trotter formula: unlike Hamiltonian simulation, Equation 2 is not a unitary dynamics, and in the Trotter formula we need to implement non-unitary operators $e^{-Bs}$ and $e^{-C(t)s}$ . We discuss efficient construction of these operators via controlled rotations, and implementing their multiplication through a generalization of the compression gadget technique [30, 18] with only poly-logarithmic many ancilla qubits. For complexity analysis, we derive an improved Trotter error bound that avoids the exponential factor in [14] and generalizes to the time-dependent case. Compared to time-marching and truncated Dyson series methods, second-order Trotter has better dependence on the dimension $d$ thanks to its commutator scalings, but has worse dependence on the precision.
2.

Time-marching method: we directly apply the standard time-marching method in [18] and analyze its complexity for fractional reaction-diffusion equations. It has low state preparation cost and poly-logarithmic dependence on precision, but has worse scalings in the dimension $d$ and the evolution time $T$ .
3.

Truncated Dyson series method: we directly apply the standard truncated Dyson series method in [8] and analyze its complexity for fractional reaction-diffusion equations. It still depends polynomially on the dimension due to its spectral norm dependence and has high state preparation cost due to the usage of quantum linear system algorithms, but can achieve poly-logarithmic scaling in the precision and linear scaling in time simultaneously.
4.

LCHS-IP: The LCHS-IP method is a novel method that combines the linear combination of Hamiltonian simulation (LCHS) [4] technique with the interaction picture Hamiltonian simulation [30]. The LCHS method first represents the evolution operator of Equation 2 as a linear combination of several Hamiltonian simulation problems associated with the matrices $B$ and $C(t)$ . To avoid the computational overhead brought by the discretized fractional Laplacian $B$ , we implement each Hamiltonian simulation in the interaction picture by rotating the Hamiltonian with respect to $B$ . Therefore, the resulting algorithm only has poly-logarithm dependence on the dimension $d$ and thus is the most preferable algorithm in the high-dimensional case. It also has low state preparation cost, but only linear scaling in precision.

For nonlinear fractional reaction-diffusion equations, we discuss a block-encoding version of the Carleman linearization technique to deal with the dense coefficient matrix. Let $\vec{u}(t)$ denote the solution vector of the fractional reaction-diffusion equation after spatial discretization. Standard quantum Carleman linearization algorithm [29] considers the dynamics of the enlarged vector $[\vec{u}(t);\vec{u}(t)^{\otimes 2};\cdots;\vec{u}(t)^{\otimes M}]$ , which approximately satisfies a linear system of ODEs governed by a so-called Carleman matrix. Since $\vec{u}(t)^{\otimes m}$ ’s are in different sizes for different powers, the Carleman matrix is a direct sum of matrices in different dimensions. In our work, we propose a simple generalization of the Carleman linearization, by extending the Carleman matrix to even higher dimension such that it becomes the direct sum of matrices in the same dimension and the corresponding solution vector $[\vec{w}_{1};\vec{w}_{2};\cdots;\vec{w}_{M}]$ can be exactly mapped to the solution $[\vec{u}(t);\vec{u}(t)^{\otimes 2};\cdots;\vec{u}(t)^{\otimes M}]$ via discarding all the zero entries. Then the extended Carleman matrix can be easily block-encoded through the block-encoding of the original coefficient matrix, and thus the corresponding extended linearized system can be solved via quantum linear differential algorithms with block-encoding input model (e.g., [18, 8, 4]). This makes the quantum Carleman linearization algorithm applicable to the equations with dense coefficient matrices, including the fractional reaction-diffusion equations.

The rest of this paper is organized as follows. Section 2 first discusses the mathematical setup of the fractional reaction-diffusion equations, notations being used throughout the paper and some preliminary results. The main results of this paper start by Section 3 with the simplest case where the equations only involve a single fractional Laplacian operator without potential or nonlinear term, then Section 4 considers the general linear equations with potential. Quantum algorithm for nonlinear equations is presented in Section 5, followed by conclusion and open questions in Section 6.

2 Preliminaries

We start with a more rigorous setup of the fractional reaction-diffusion equation we are interested in and a summary of theoretical tools and technical lemmas being used in our analysis.

2.1 Setup

We consider the spatial fractional reaction-diffusion equations Equation 1. Let $N$ be a positive integer and we discretize the spatial variable $x$ using equi-distant nodes $(j/N)$ , $j\in[N]^{d}$ . Quantum algorithms for solving Equation 1 aim at preparing a quantum state approximately encoding $u(T,j/N)$ in its amplitude, i.e., $\frac{1}{\|(u(T,j/N))_{j\in[N]^{d}}\|}\sum_{j\in[N]^{d}}u(T,j/N)\ket{j}$ .

There are several different definitions of $(-\Delta)^{\alpha/2}$ on bounded domain (see the recent paper [28] for a comprehensive review). The spectral fractional Laplacian is defined using the eigenvalues and eigenfunctions of the original Laplacian $(-\Delta)$ . Suppose that $\lambda_{j}$ ’s are the eigenvalues of $(-\Delta)$ and $e_{j}(x)$ ’s are the corresponding eigenfunctions, then the spectral fractional Laplacian is defined to be

(-\Delta)^{\alpha/2}v(x)=\sum_{j}\lambda_{j}^{\alpha/2}(v,e_{j})_{L^{2}}e_{j}(x)

(3)

where $(\cdot,\cdot)_{L^{2}}$ denotes the $L^{2}$ inner product on $[0,1]^{d}$ . An alternative definition is the Riesz fractional Laplacian

(-\Delta)^{\alpha/2}v(x)=\frac{2^{\alpha}\Gamma(\frac{\alpha}{2}+\frac{d}{2})}{\pi^{d/2}|\Gamma(-\frac{\alpha}{2})|}\text{p.v.}\int_{\mathbb{R}^{d}}\frac{v(x)-v(y)}{|x-y|^{d+\alpha}}dy.

(4)

Here $\Gamma(s)$ denotes the gamma function and p.v. refers to the principle value integral. Notice that the definition Equation 4 is not closed yet since the integral is over the entire space $\mathbb{R}^{d}$ while $v(x)$ is confined in the cube $[0,1]^{d}$ . Therefore we must enforce boundary condition on $\mathbb{R}^{d}\setminus[0,1]^{d}$ , e.g., the homogeneous Dirichlet boundary condition $v(x)=0$ for all $x\in\mathbb{R}^{d}\setminus[0,1]^{d}$ . For periodic boundary condition, the definition is unambiguous that we will always follow Equation 3, while either definition is commonly used with other boundary conditions. In this work, we will focus on the periodic boundary condition and use the spectral definition for the fractional Laplacian operator.

2.2 Notations

We use $\vec{a}$ (a letter with an array above) to denote a possibly unnormalized vector. When $u(t,x)$ refers to a scalar-valued function with arguments $t\in\mathbb{R}$ and $x\in[0,1]^{d}$ , we use $(u(t,j/N))_{j\in[N]^{d}}$ or simply $(u(t,j/N))$ to denote the $N^{d}$ -dimensional vector with entries $u(t,j/N)$ , where $j\in[N]^{d}$ and $[N]=\left\{0,1,\cdots,N-1\right\}$ for a positive integer $N$ . In our analysis, we also use $\vec{u}_{0}$ as a shorthand notation of the initial condition vector $(u(0,j/N))$ , and $\vec{u}(t)$ as $(u(t,j/N))$ for a fixed time $t$ .

For a vector $\vec{a}$ , we use $\|\vec{a}\|$ without subscript for the standard vector 2-norm of $\vec{a}$ , and $\|\vec{a}\|_{p}$ for vector $p$ -norm. The notation $\ket{\vec{a}}$ with ket notation denotesthe corresponding quantum state, i.e., the normalized vector $\vec{a}/\|\vec{a}\|$ .

Let $A$ and $B$ be two matrices. We use $[A,B]$ to denote the commutator between $A$ and $B$ , defined as $AB-BA$ . For a matrix $A$ , $\|A\|$ denotes its spectral norm or equivalently the matrix 2-norm.

2.3 Quantum linear algebra

To design and analyze quantum algorithms for the spatial fractional reaction-diffusion equations, we need to frequently implement linear algebra operations, including matrix-vector multiplication, matrix-matrix addition and multiplication. To this end, we briefly introduce the concept and properties of block-encoding, which is a widely-used quantum input model for possibly non-unitary matrix.

We start with the definition of block-encoding.

Definition 1 (Block-encoding).

Suppose that $A$ is a $2^{s}$ -dimensional matrix, then we say that the $(s+n)$ -qubit unitary $U$ is an $(\alpha,n,\epsilon)$ -block-encoding of $A$ , if

\left\|A-\alpha\left((\bra{0}^{\otimes n}\otimes I)U(\ket{0}^{\otimes n}\otimes I)\right)\right\|\leq\epsilon.

(5)

Intuitively, in the block-encoding, a matrix is represented as the upper-left block of a unitary matrix as

U\approx\left(\begin{array}[]{cc}A/\alpha&*\\ &*\end{array}\right).

(6)

$\alpha$ is called the block-encoding factor such that $\alpha\geq\|A\|$ , since the larger matrix $U$ is supposed to be unitary and the norm of its sub-block should be bounded by $1$ .

With block-encoding structure, we may implement matrix linear algebra operations. For example, matrix addition can be directly implemented by the linear combination of unitaries (LCU) technique, which has become an important subroutine in designing various quantum algorithms including Hamiltonian simulation [15], solving linear systems [10] and differential equations [4]. Here we present the LCU lemma from [19].

Lemma 2.

Let $A=\sum_{j=0}^{m-1}y_{j}A_{j}$ and $\|y\|_{1}\leq\beta$ . Suppose that $(P_{L},P_{R})$ is a pair of $b$ -qubit unitaries such that $P_{L}\ket{0}\sum_{j=0}^{m-1}c_{j}\ket{j}$ and $P_{R}\ket{0}\sum_{j=0}^{m-1}d_{j}\ket{j}$ with $\sum_{j=0}^{m-1}|\beta(c_{j}^{*}d_{j}-y_{j})|<\epsilon_{1}$ , and $W=\sum_{j=0}^{m-1}\ket{j}\bra{j}\otimes U_{j}$ where $U_{j}$ is a $(\alpha,a,\epsilon_{2})$ -block-encoding of $A_{j}$ . Then we can construct a $(\alpha\beta,a+b,\alpha\epsilon_{1}+\alpha\beta\epsilon_{2})$ -block-encoding of $A$ with a single use of $W$ , $P_{L}^{\dagger}$ and $P_{R}$ .

Matrix multiplication can also be implemented via block-encodings. For two arbitrary matrices $A$ and $B$ , a straightforward approach of constructing the block-encoding of $AB$ is to multiply together the block-encodings of $A$ and $B$ , as shown in the following lemma [19].

Lemma 3.

If $U_{A}$ is an $(\alpha_{A},n_{A},\epsilon_{A})$ -block-encoding of $A$ , and $U_{B}$ is an $(\alpha_{B},n_{B},\epsilon_{B})$ -block-encoding of $B$ , then $(I_{n_{B}}\otimes U_{A})(I_{n_{A}}\otimes U_{B})$ is an $(\alpha_{A}\alpha_{B},n_{A}+n_{B},\alpha_{A}\epsilon_{B}+\alpha_{B}\epsilon_{A})$ -block-encoding of $AB$ .

Despite its simplicity, such a straightforward approach for matrix multiplication may incur large space overhead. This is because we need to enlarge the ancilla register at each step of the multiplication, so the multiplication of $J$ many matrices will in general require $\mathcal{O}(J)$ ancilla qubits. To overcome this issue, [30] introduces a technique called compression gadget, which is further simplified in [18]. The circuit is given in Figure 1, and the idea of compression gadget is to use a counter register to keep track of the multiplication in a coherent way. Specifically, let ADD implements addition by $1$ modulo the 2-power of the number of the qubits in the counter register. One can first apply ADD to lift the counter register from $0$ to $L-1$ . Then we sequentially apply the block-encodings using the same ancilla register, but after each block-encoding we apply a controlled $\text{ADD}^{\dagger}$ to reduce the counter by $1$ if the corresponding block-encoding was successfully implemented. Therefore, a $0$ final outcome of the counter register implies successful application of all the block-encodings. We present the result in [18], in which interested readers may find more details on its proof.

\Qcircuit@R=1em @C=1em Counter & \gateADD^J \gateADD^† \qw \gateADD^† \qw ⋯ \qw \gateADD^† \qw
Ancilla \multigate1U_0 \ctrlo-1 \multigate1U_1 \ctrlo-1 \qw ⋯ \multigate1U_J-1 \ctrlo-1 \qw
System \ghostU_0 \qw \ghostU_1 \qw \qw ⋯ \ghostU_J-1 \qw \qw

Figure 1: Quantum circuit for compression gadget to block encode

A_{J-1}\cdots A_{1}A_{0}

. Here the Counter register contains

\lceil\log_{2}J\rceil+1

qubits and the ancilla register contains

\max n_{j}

qubits. ADD implements addition by

1

modulo

2^{\lceil\log_{2}J\rceil+1}

, and

U_{j}

is an

(\alpha_{j},n_{j},0)

-block-encoding of the matrix

A_{j}

Lemma 4.

For $0\leq j\leq J-1$ , let $U_{j}$ be an $(\alpha_{j},n_{j},0)$ -block-encoding of $A_{j}$ . Then an $(\alpha,n,0)$ -block-encoding of $A_{J-1}\cdots A_{1}A_{0}$ can be constructed using one application of each $U_{j}$ , where $\alpha=\alpha_{0}\alpha_{1}\cdots\alpha_{J-1}$ and $n=\max n_{j}+\lceil\log_{2}(J)\rceil+1$ .

3 Linear equations without potential

We start with the simplest case where $c(t,x)=0$ and $a=0$ . In this case, the right hand side of Equation 1 only involves a fractional Laplacian operator. By the spectral definition, this operator has closed-form eigenvalues and known eigenfunctions, so the corresponding time-evolution operator can be implemented fast-forwardly [5] (i.e., query complexity is independent of the spectral norm and the evolution time). Here we present our algorithm in general high dimension and establish rigorous complexity estimate taking into consideration the spatial discretization errors.

We consider general $d$ -dimensional case. The eigenvalues and eigenfunctions of $(-\Delta)$ are $4\pi^{2}(k_{0}^{2}+\cdots+k_{d-1}^{2})$ and $e^{2\pi i(k_{0}x_{0}+\cdots+k_{d-1}x_{d-1})}$ , where $x=(x_{0},\cdots,x_{d-1})$ denotes the spatial coordinate and $k=(k_{0},\cdots,k_{d-1})$ denotes a set of integers. Let the Fourier series of $u_{0}(x)$ be

u_{0}(x)=\sum_{k\in\mathbb{Z}^{d}}\hat{u}_{k}e^{2\pi i(k_{0}x_{0}+\cdots+k_{d-1}x_{d-1})},

(7)

where $\hat{u}_{k}$ denotes the Fourier coefficients. Then the solution of Equation 1 has the form

u(T,x)=\sum_{k\in\mathbb{Z}^{d}}\hat{u}_{k}e^{-(4\pi^{2}(k_{0}^{2}+\cdots+k_{d-1}^{2}))^{\alpha/2}T}e^{2\pi i(k_{0}x_{0}+\cdots+k_{d-1}x_{d-1})}.

(8)

Numerical solutions can be obtained by truncating the Fourier series at a finite order.

Equation 8 can be quantumly implemented using quantum Fourier transform (QFT) and controlled rotations. Recall that our goal is to prepare an approximation of the quantum state encoding the normalized solution at discrete spatial grid points $(j/N)_{j\in[N]^{d}}$ . Let $N$ be the number of the grid points in each spatial dimension. Suppose that we are given the oracle $O_{u_{0}}$ that prepares the normalized initial condition

\ket{u_{0}}=\frac{1}{\|\vec{u}_{0}\|}\sum_{n\in[N]^{d}}u_{0}(n_{0}/N,\cdots,n_{d-1}/N)\ket{n_{0}}\cdots\ket{n_{d-1}},

(9)

where

\vec{u}_{0}=\sum_{n\in[N]^{d}}u_{0}(n_{0}/N,\cdots,n_{d-1}/N)\ket{n_{0}}\cdots\ket{n_{d-1}}.

(10)

We first compute a quantum state encoding the Fourier coefficients $\hat{u}_{k}$ by QFT. Specifically, let $\omega_{N}=e^{2\pi i/N}$ and $\mathcal{F}$ denote the one-dimensional QFT, i.e., for any computational basis state $\ket{j}$ ,

\mathcal{F}\ket{j}=\frac{1}{\sqrt{N}}\sum_{l=0}^{N-1}\omega_{N}^{jl}\ket{l}.

(11)

Then

$\displaystyle(\mathcal{F}^{-1})^{\otimes d}\vec{u}_{0}$	$\displaystyle=\frac{1}{N^{d/2}}\sum_{n\in[N]^{d}}u_{0}(n_{0}/N,\cdots,n_{d-1}/N)\left(\sum_{m_{0}=0}^{N-1}\omega_{N}^{-n_{0}m_{0}}\ket{m_{0}}\right)\cdots\left(\sum_{m_{d-1}=0}^{N-1}\omega_{N}^{-n_{d-1}m_{d-1}}\ket{m_{d-1}}\right)$	(12)
	$\displaystyle=\frac{1}{N^{d/2}}\sum_{m\in[N]^{d}}\sum_{n\in[N]^{d}}u_{0}(n_{0}/N,\cdots,n_{d-1}/N)\omega_{N}^{-n_{0}m_{0}-\cdots-n_{d-1}m_{d-1}}\ket{m_{0}}\cdots\ket{m_{d-1}}$	(13)
	$\displaystyle=\frac{1}{N^{d/2}}\sum_{m\in[N]^{d}}\sum_{n\in[N]^{d}}\sum_{k\in\mathbb{Z}^{d}}\hat{u}_{k}e^{2\pi i(k_{0}n_{0}/N+\cdots+k_{d-1}n_{d-1}/N)}\omega_{N}^{-n_{0}m_{0}-\cdots-n_{d-1}m_{d-1}}\ket{m_{0}}\cdots\ket{m_{d-1}}$	(14)
	$\displaystyle=\frac{1}{N^{d/2}}\sum_{m\in[N]^{d}}\sum_{k\in\mathbb{Z}^{d}}\hat{u}_{k}\left(\sum_{n_{0}=0}^{N-1}\omega_{N}^{n_{0}(k_{0}-m_{0})}\right)\cdots\left(\sum_{n_{d-1}=0}^{N-1}\omega_{N}^{n_{d-1}(k_{d-1}-m_{d-1})}\right)\ket{m_{0}}\cdots\ket{m_{d-1}}$	(15)
	$\displaystyle=N^{d/2}\sum_{m\in[N]^{d}}\sum_{j\in\mathbb{Z}^{d}}\hat{u}_{m+jN}\ket{m_{0}}\cdots\ket{m_{d-1}}.$	(16)

When the function $u_{0}(x)$ satisfies certain regularity assumption (to be specified later), the summation $\sum_{j\in\mathbb{Z}^{d}}\hat{u}_{m+jN}$ is dominated by the index with smallest absolute value, because the Fourier coefficients decay rapidly with respect to its frequency. Therefore

(\mathcal{F}^{-1})^{\otimes d}\ket{\vec{u}_{0}}\approx\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}\ket{m_{0}}\cdots\ket{m_{d-1}},

(17)

where $i(m)=(i_{0}(m),\cdots,i_{d-1}(m))$ represents an $d$ -dimensional vector with the $j$ -th index

i_{j}(m)=\begin{cases}m_{j},&\text{ if }0\leq m_{j}\leq N/2\\ m_{j}-N,&\text{ if }N/2+1\leq m_{j}\leq N-1.\end{cases}

(18)

Now we append three ancilla registers to Equation 17 and get the (approximate) quantum state

\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{0}\ket{0}\ket{0}.

(19)

The first two ancilla registers are used to binarily encode $e^{-(4\pi^{2}(i_{0}(m)^{2}+\cdots+i_{d-1}(m)^{2}))^{\alpha/2}T}$ and the third register is used for rotation. Specifically, suppose that we are given the oracle that encodes the eigenvalues as

O_{1}:\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{0}\rightarrow\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{(2\pi\|i(m)\|)^{\alpha}},

(20)

and the oracle for computing an exponential function as

O_{\exp,1}:\ket{x}\ket{0}\rightarrow\ket{x}\ket{e^{-xT}}.

(21)

Notice that both functions have closed-form expression so the oracles can be efficiently constructed using classical arithmetic [34]. Applying $O_{1}$ on the index and first ancilla registers and then $O_{\exp,1}$ on the first and second ancilla registers yields

\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{\|i(m)\|}\ket{e^{-(2\pi\|i(m)\|)^{\alpha}T}}\ket{0}.

(22)

Let $\text{c-}R$ denote the controlled rotation

\text{c-}R:\ket{\theta}\ket{0}\rightarrow\ket{\theta}\left(\theta\ket{0}+\sqrt{1-|\theta|^{2}}\ket{1}\right).

(23)

Then, by applying $\text{c-}R$ on the last two registers in Equation 22, we obtain

\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}e^{-(2\pi\|i(m)\|)^{\alpha}T}\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{\|i(m)\|}\ket{e^{-(2\pi\|i(m)\|)^{\alpha}T}}\ket{0}+\ket{\perp},

(24)

where $\ket{\perp}$ represents the orthogonal part with $1$ in the last ancilla register. Uncomputing the first two ancilla registers yields

\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}e^{-(2\pi\|i(m)\|)^{\alpha}T}\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{0}\ket{0}\ket{0}+\ket{\perp},

(25)

Notice that, by replacing $u_{0}(x)$ with $u(T,x)$ in Equation 17 and using Equation 8, we have

(\mathcal{F}^{-1})^{\otimes d}\vec{u}(T,x)\approx N^{d/2}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}e^{-(2\pi\|i(m)\|)^{\alpha}T}\ket{m_{0}}\cdots\ket{m_{d-1}}.

(26)

Therefore, applying the QFT $\mathcal{F}^{\otimes d}$ to Equation 25 approximately gives

\frac{1}{\|\vec{u}_{0}\|}\sum_{n\in[N]^{d}}u(T,n/N)\ket{n_{0}}\cdots\ket{n_{d-1}}\ket{0}\ket{0}\ket{0}+\ket{\perp}.

(27)

Measuring the ancilla registers to get all $0$ ’s yields an approximation of $\ket{u(T)}$ , and the averaged number of repeats for success after amplitude amplification scales $\mathcal{O}(\|\vec{u}_{0}\|/\|\vec{u}(T)\|)$ .

\Qcircuit@R=1em @C=1em —u_0⟩ & \gate(F^-1)^⊗d \multigate1O_1 \qw \qw \qw \multigate1O_1^† \gateF^⊗d \qw
—0⟩ \qw \ghostO_1 \multigate1O_exp,1 \qw \multigate1O_exp,1^† \ghostO_1^† \qw \meter
—0⟩ \qw \qw \ghostO_exp,1 \ctrl1 \ghostO_exp,1^† \qw \qw \meter
—0⟩ \qw \qw \qw \gateR \qw \qw \qw \meter

Figure 2: Quantum circuit for solving linear fractional reaction-diffusion equations without potential. Here

O_{1}

O_{\exp,1}

are the oracles defined in Equation 20 and Equation 21, and

R

is the single-qubit rotation gate defined in Equation 23.

We summarize the quantum algorithm in Figure 2 and present its error and complexity estimates in the following result. The main result is that the overall error will decrease as the number of the grid points increases, since more grid points imply larger truncation order in the Fourier series. Detailed proof can be found in Appendix B.

Theorem 5.

Consider solving Equation 1 with $a=c=0$ . Suppose that we are given the state preparation oracle $O_{u_{0}}$ and the oracles $O_{1}$ , $O_{\exp,1}$ as defined in Equation 20 and Equation 21. Furthermore, suppose that there exists an integer $p\geq d+2$ such that $u(t,x)$ is $p$ -th order spatially continuously differentiable. Then there exists a quantum algorithm that prepares an approximation of the quantum state $\ket{\vec{u}(T)}$ with success probability $1-\Omega(1)$ and $2$ -norm error at most

\frac{4\max_{j,t\in\left\{0,T\right\}}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\|\vec{u}(T)\|(\pi/2)^{p}N^{p-d}},

(28)

using $\mathcal{O}(\|\vec{u}_{0}\|/\|\vec{u}(T)\|)$ queries to $O_{u_{0}},O_{1},O_{\exp,1}$ , their inverse and controlled versions, and $\mathcal{O}((\|\vec{u}_{0}\|/\|\vec{u}(T)\|)d\log^{2}(N))$ additional gates.

4 Linear equations with potential

Consider the linear spatial fractional reaction-diffusion equations

\begin{split}\partial_{t}u(t,x)&=-(-\Delta)^{\alpha/2}u(t,x)-c(t,x)u(t,x),\quad t\in[0,T],x\in[0,1]^{d},\\ u(0,x)&=u_{0}(x).\end{split}

(29)

Throughout this section, we assume the potential function $c(t,x)$ to be non-negative. We will first discuss the validity of this assumption by so-called shifting equivalence. The equation Equation 29 can then be treated by the method of lines, i.e., first discretizing the spatial variable to obtain a system of ODEs, and then solving the ODE with different quantum algorithms. We consider four different quantum algorithms: second-order Trotter formula, time-marching method, truncated Dyson series method, and linear combination of Hamiltonian simulation in the interaction picture.

4.1 Shifting equivalence

We consider the shifted PDE

\begin{split}\partial_{t}v(t,x)&=-(-\Delta)^{\alpha/2}v(t,x)-\widetilde{c}(t,x)v(t,x),\quad t\in[0,T],x\in[0,1]^{d},\\ v(0,x)&=u_{0}(x).\end{split}

(30)

Here

\widetilde{c}(t,x)=c(t,x)-\gamma(t)

(31)

where $\gamma(t)$ is a real scalar-valued function. An important observation is that Equation 29 and Equation 30 are quantumly equivalent in the sense that the normalized solutions are the same. Specifically, let $u$ denote the solution of Equation 29, then

\begin{split}\partial_{t}(e^{\int_{0}^{t}\gamma(s)ds}u)&=e^{\int_{0}^{t}\gamma(s)ds}\partial_{t}u+\gamma(t)e^{\int_{0}^{t}\gamma(s)ds}u\\ &=e^{\int_{0}^{t}\gamma(s)ds}\left(-(-\Delta)^{\alpha/2}u-c(t,x)u\right)+\gamma(t)e^{\int_{0}^{t}\gamma(s)ds}u\\ &=-(-\Delta)^{\alpha/2}(e^{\int_{0}^{t}\gamma(s)ds}u)-\widetilde{c}(t,x)(e^{\int_{0}^{t}\gamma(s)ds}u).\end{split}

(32)

Therefore $e^{\int_{0}^{t}\gamma(s)ds}u$ solves Equation 30 and only differs $u$ by a multiplicative constant factor at fixed time, which implies that they are the same in a quantum state representation, i.e.,

\frac{1}{\|\vec{u}\|}\sum_{n\in[N]^{d}}u(t,n/N)\ket{n}=\frac{1}{\|e^{\int_{0}^{t}\gamma(s)ds}\vec{u}\|}\sum_{n\in[N]^{d}}e^{\int_{0}^{t}\gamma(s)ds}u(t,n/N)\ket{n}.

(33)

We will choose the shifting parameter to be

\gamma(t)=\min_{x}c(t,x).

(34)

Therefore the function $\widetilde{c}(t,x)$ is non-negative. For notation simplicity, we will directly assume that the original $c(t,x)$ in Equation 29 to be non-negative instead of introducing new notations with tildes.

4.2 Spatial discretization

Our goal is to prepare a quantum state encoding $u(T,x)$ at equi-distant grid points $(n_{0}/N,\cdots,n_{d-1}/N)$ where $n_{j}\in[N]$ . Motivated by the spectral decomposition, we define

B=(\mathcal{F})^{\otimes d}D(\mathcal{F}^{-1})^{\otimes d}

(35)

where $D$ is an $N^{d}$ -dimensional diagonal matrix

D=\sum_{n\in[N]^{d}}2^{\alpha}\pi^{\alpha}\|i(n)\|^{\alpha}\ket{n_{0}}\cdots\ket{n_{d-1}}\bra{n_{0}}\cdots\bra{n_{d-1}}.

(36)

For each $t$ , let $C(t)$ be an $N^{d}$ -dimensional diagonal matrix

C(t)=\sum_{n\in[N]^{d}}c(t,n/N)\ket{n_{0}}\cdots\ket{n_{d-1}}\bra{n_{0}}\cdots\bra{n_{d-1}}.

(37)

Then we consider the spatially discretized equation

\begin{split}\frac{d}{dt}\vec{u}&=-B\vec{u}-C(t)\vec{u},\quad t\in[0,T]\\ \vec{u}(0)&=\vec{u}_{0}.\end{split}

(38)

In order to bound the spatial discretization error by $\epsilon$ , we need to choose a sufficiently large $N$ . In the following, we derive an error bound in terms of $N$ . The proof of this result can be found in Appendix C.

Lemma 6.

Let $u$ be the exact solution of Equation 29 and $\vec{u}$ be the solution of Equation 38. Suppose $u(t,x)$ is $p$ -th order spatially continuously differentialable where $p\geq d+\alpha+2$ , then

\|(u(T,n/N))_{n\in[N]^{d}}-\vec{u}(T)\|\leq T\frac{2^{p+1}d^{\alpha/2}\max_{t,j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-d-\alpha}}.

(39)

6 tells that, similar to the case without potential, the order of the spatial discretization error convergence depends on the smoothness of the solution. In particular, it can be exponential convergence if the solution is within the Gevrey class. The Gevrey class includes infinitely differentiable functions whose $p$ -th order derivative grows polynomially in $p!$ . Notice that Gevrey class greatly enlarges the class of real analytic functions, since the Taylor series will only have convergence radius $0$ if the $p$ -th order derivative scales super-linearly in $p!$ . We give the explicit error bounds and the choice of the grid for bounded errors in the next two results. Their proof can be found in Appendix C as well.

Corollary 7.

Consider solving Equation 29 on discrete grid points $(T,n/N)$ where $n\in[N]^{d}$ . Suppose that the exact solution $u(t,x)$ is in the Gevrey class $G^{\sigma}$ in the sense that $u(t,x)$ is smooth and there exist constants $\Lambda>0$ and $\sigma\geq 0$ such that

\sup_{j\in[d],t\in[0,T],x\in[0,1]^{d}}|\partial_{x_{j}}^{p}u(t,x)|\leq\Lambda^{p+1}(p!)^{\sigma}.

(40)

Then, for any $N\geq(2\Lambda/\pi)(d+\alpha+2)^{\sigma}$ , we have

\|(u(T,n/N))_{n\in[N]^{d}}-\vec{u}(T)\|\leq c_{1}T(c_{2}d)^{c_{3}d}d^{\alpha/2}\exp\left(-c_{4}N^{1/\sigma}\right),

(41)

where $c_{j}$ ’s are constants only depending on $\sigma,\alpha$ and $\Lambda$ .

Corollary 8.

Consider solving Equation 29 on discrete grid points $(T,n/N)$ where $n\in[N]^{d}$ . Suppose that the exact solution $u(t,x)$ is in the Gevrey class $G^{\sigma}$ as in 7. Then, in order to bound the spatial discretization error in the quantum state (i.e., $\|\ket{(u(T,n/N))_{n\in[N]^{d}}}-\ket{\vec{u}(T)}\|$ ) by $\epsilon$ , it suffices to choose

N=\mathcal{O}\left(\left(d\log d+\log\left(\frac{T}{\|(u(T,n/N))_{n\in[N]^{d}}\|}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\sigma}\right).

(42)

4.3 Second-order Trotter formula

We solve Equation 38 by Trotter formula. In particular, we divide $[0,T]$ into $r$ equi-length segments and let $h=T/r$ . Consider

\mathcal{T}e^{\int_{0}^{T}(-B-C(t))dt}\approx\prod_{j=0}^{r-1}S_{2}((j+1)h,jh)

(43)

where $S_{2}(jh,(j+1)h)$ is the second-order time-dependent Trotter method, aiming at approximating $\mathcal{T}e^{\int_{jh}^{(j+1)h}(-B-C(t))dt}$ and defined as

S_{2}((j+1)h,jh)=e^{-Bh/2}e^{-C((j+1/2)h)h}e^{-Bh/2}.

(44)

In this subsection, we first derive a bound of the Trotter error and show the choice of the number of the segments for bounded errors. Then we discuss how to quantumly implement the numerical scheme and estimate its complexity.

4.3.1 Error bound, commutator scalings, and the number of the time steps

We first bound the distance between the exact evolution operator and the numerical integrators.

Lemma 9.

Consider solving Equation 38 using second-order Trotter formula $\prod_{j=0}^{r-1}S_{2}((j+1)h,jh)$ with time step size $h=T/r$ , where the local integrator $S_{2}$ is defined in Equation 44. Then

\begin{split}&\quad\left\|\mathcal{T}e^{\int_{0}^{T}(-B-C(t))dt}-\prod_{j=0}^{r-1}S_{2}((j+1)h,jh)\right\|\\ &\leq Th^{2}\left(\frac{1}{24}\max\|C^{\prime\prime}\|+\frac{1}{4}(\|B\|+\max\|C\|)\max\|C^{\prime}\|\right.\\ &\quad\quad\quad\quad\quad\left.+\frac{1}{6}\max\|[B,[B,C]]\|+\frac{1}{4}\max\|[B,{C}]\|\max\|{C}\|+\frac{1}{3}\max\|{C}\|^{3}\right),\end{split}

(45)

where all the maximums are taken over $t\in[0,T]$ .

The proof of 9 can be found in Appendix D, which contains two parts. First, we deal with the time-ordering operator by bounding the distance between $\mathcal{T}e^{\int_{0}^{h}(-B-C(t))dt}$ and $e^{(-B-C(h/2))h}$ using the variation of parameters formula. This part contributes to the first two terms in the error bound (the second line of Equation 45) involving time derivatives of the potential matrix $C(t)$ . Then we can bound the error between $e^{(-B-C(h/2))h}$ and $S_{2}$ by time-independent Trotter error bounds. Notice that the proof for the second part is different from that for Hamiltonian simulation [14], since in our case the parameters $\beta$ and $\gamma$ in the exponentials $e^{-\beta B}$ and $e^{-\gamma C}$ are restricted to non-negative to avoid exponential overhead, while for Hamiltonian simulation there is no such restriction. Instead, we mostly follow the procedure in [23] to establish the second part of the error bound. This contributes to the last three terms in the error bound (the third line of Equation 45) involving the commutators and the spectral norm of $C$ explicitly.

Now we compute the norm of the commutators $[B,{C}(t)]$ and $[B,[B,{C}(t)]]$ . All the results are for a fixed $t$ , so we will omit the explicit $t$ dependence in our notation for now. Naive bounds are $\|[B,{C}]\|\leq\mathcal{O}(\|B\|)=\mathcal{O}(d^{\alpha/2}N^{\alpha})$ and $\|[B,[B,{C}]]\|\leq\mathcal{O}(\|B\|^{2})=\mathcal{O}(d^{\alpha}N^{2\alpha})$ . However, these naive bounds are improvable, because the order of the commutator of the Laplacian operator can be reduced. Such a phenomenon has been observed in [23, 2]. To see this, let us take $\alpha=2$ and consider its continuous analog. Then for any smooth function $f(x)$ ,

\begin{split}[-\Delta,{c}]f&=-\Delta({c}f)-{c}(-\Delta f)\\ &=-(\Delta{c})f-2(\nabla{c})\cdot(\nabla f)-{c}\Delta f+{c}\Delta f\\ &=-(\Delta{c})f-2(\nabla{c})\cdot(\nabla f).\end{split}

(46)

So we expect, in the discrete setting, the first commutator is bounded by the discretized divergence operator, whose norm is only $\mathcal{O}(dN)$ .

Now we state the improved bound in the discrete setting. The proof can be found in Appendix E, which is a generalization of [23].

Lemma 10.

Suppose that ${c}(t,x)$ is a bounded $C^{5+d}$ function in $x$ . Then we have

\|[B,{C}(t)]\|\leq\mathcal{O}(d^{\alpha/2}N^{\alpha/2})

(47)

and

\|[B,[B,{C}(t)]]\|\leq\mathcal{O}(d^{\alpha}N^{\alpha}).

(48)

The choice of the number of time steps is a direct consequence of 9 and 10.

Corollary 11.

Consider solving Equation 38 using second-order Trotter formula $\prod_{j=0}^{r-1}S_{2}((j+1)h,jh)$ with time step size $h=T/r$ , where the local integrator $S_{2}$ is defined in Equation 44. Suppose that $\|{C}^{(k)}(t)\|$ ’s are uniformly bounded in $t$ for $k\leq d+5$ . Then, in order to bound the operator splitting error by $\epsilon$ , it suffices to choose

r=\mathcal{O}\left(d^{\alpha/2}N^{\alpha/2}\frac{T^{3/2}}{\epsilon^{1/2}}\right).

(49)

4.3.2 Quantum implementation and complexity estimate

Now we discuss quantum implementation of this method. The main idea is to construct the block-encodings of $e^{-Bh/2}$ and $e^{-{C}((j+1/2)h)h}$ and multiply them together using 4. The block-encodings can be constructed using controlled rotations since each evolution operator is unitarily equivalent to a diagonal matrix and the corresponding unitary transformation matrix is efficiently implementable. The nuance is that the operator $e^{-{C}((j+1/2)h)h}$ still depends on the specific time, so we use the counter register in the compression gadget as the time clock as well.

Suppose that we are given the oracle $O_{1}$ , defined in Equation 20, that encodes the eigenvalues of $B$ , and the oracle $O_{2}$ that gives the element of $C(t)$ as

O_{2}:\ket{t}\ket{n_{0}}\cdots\ket{n_{d-1}}\ket{0}\rightarrow\ket{t}\ket{n_{0}}\cdots\ket{n_{d-1}}\ket{c(t,n/N)}.

(50)

Here, with an abuse of notations, $\ket{t}$ represents some specific way of encoding the information of $t$ , i.e., $(2j+1)$ for the time point $(j+1/2)h$ after time discretization (the reason why it is $(2j+1)$ will be clear soon).

As discussed before, we may use the circuit in Figure 2 with a replacement of $O_{\exp,1}$ by $O_{\exp,2}:\ket{x}\ket{0}\rightarrow\ket{x}\ket{e^{-xh/2}}$ to construct a $(1,*,0)$ -block-encoding of $e^{-Bh/2}$ . Here the number of the ancilla qubits depends on those for encoding the eigenvalues of $B$ and their exponentials, on which we do not keep track for technical simplicity. We denote this block-encoding by $V_{B,h/2}$ . Similarly, a $(1,*,0)$ -block-encoding of $e^{-Bh}$ can be constructed with $O_{\exp,3}:\ket{x}\ket{0}\rightarrow\ket{x}\ket{e^{-xh}}$ . We denote it by $V_{B,h}$ . Furthermore, in Figure 2, if we discard the QFT steps, and replace $O_{\exp,1}$ by $O_{\exp,3}$ and the oracle $O_{1}$ by $O_{2}$ controlled by an extra counter register, we can construct a (controlled version of) $(1,*,0)$ -block-encoding of $e^{-C((j+1/2)h)h}$ , denoted by $V_{C,j}$ . Notice that the constructions of block-encodings $V_{B,h/2}$ , $V_{B,h}$ and $V_{C,j}$ only require $\mathcal{O}(1)$ queries to the aforementioned oracles and QFT.

Now we construct the numerical integrator. Mathematically we need to block encode the operator

\prod_{j=0}^{r-1}S_{2}((j+1)h,jh)=e^{-Bh/2}\left(\prod_{j=1}^{r-1}e^{-C((j+1/2)h)h}e^{-Bh}\right)e^{-C(h/2)h}e^{-Bh/2}.

(51)

This can be done by the circuit in Figure 3, which can be viewed as a variant of the compression gadget in 4. The idea is to use an extra counter register with $\lceil\log_{2}(2r+1)\rceil+1$ qubits for both keeping track of the success/failure of the multiplication and indicating the index of the time step. If all the applications of the block-encodings of the local exponentials are successful, then the value of the counter register at each step suggests the correct time step index, and the final value is reset to be $0$ . Notice that the block-encoding factor of $V_{B,h}$ and $V_{C,j}$ are $1$ . Then the circuit in Figure 3 gives a $(1,*,0)$ -block-encoding of $\prod_{j=0}^{r-1}S_{2}((j+1)h,jh)$ , as desired.

\Qcircuit@R=1em @C=1em Counter & \qw \gateADD \ctrl1 \gateADD \qw \gateADD \ctrl1 \gateADD \qw ⋯
Ancilla \multigate1V_B,h/2 \ctrlo-1 \multigate1V_C,0 \ctrlo-1 \multigate1V_B,h \ctrlo-1 \multigate1V_C,1 \ctrlo-1 \qw ⋯
System \ghostV_B,h/2 \qw \ghostV_C,0 \qw \ghostV_B,h \qw \ghostV_C,1 \qw \qw ⋯

Counter ⋯ \qw \gateADD \ctrl1 \gateADD \qw \gateADD \gate(ADD^†)^2r+1 \qw
Ancilla ⋯ \multigate1V_B,h \ctrlo-1 \multigate1V_C,j-1 \ctrlo-1 \multigate1V_B,h/2 \ctrlo-1 \qw \qw
System ⋯ \ghostV_B,h \qw \ghostV_C,j-1 \qw \ghostV_B,h/2 \qw \qw \qw

Figure 3: Quantum circuit for implementing second-order Trotter method. Here the Counter register contains

\lceil\log_{2}(2r+1)\rceil+1

qubits. ADD implements addition by

1

modulo

2^{\lceil\log_{2}(2r+1)\rceil+1}

We apply this block-encoding to the input state $\ket{u_{0}}$ , and the final state is

\frac{1}{\|\vec{u}_{0}\|}\sum_{n\in[N]^{d}}(\vec{u}_{r})_{n}\ket{n_{0}}\cdots\ket{n_{d-1}}\ket{0}+\ket{\perp},

(52)

where

\begin{split}\vec{u}_{r}=\prod_{j=0}^{r-1}S_{2}((j+1)h,jh)\vec{u}_{0}\approx\vec{u}(T),\end{split}

(53)

and $\ket{\perp}$ represents the junk state with ancilla register not equal to $0$ . The final step is to measure the ancilla registers of Equation 52. If all the ancilla registers are $0$ , then we get a good approximation of $\ket{u(T)}$ . The averaged number of repeats after amplitude amplification is $\mathcal{O}(\|\vec{u}_{0}\|/\|\vec{u}(T)\|)$ .

We summarize the overall complexity as follows.

Theorem 12.

Consider solving Equation 29 on discrete grid points $(T,n/N)$ where $n\in[N]^{d}$ . Let $u(t,x)$ denote the solution of the equation Equation 29, and $\vec{u}(t)$ denote the solution of the spatially discretized Equation 38. Suppose that

1.

we are given oracles $O_{1}$ and $O_{2}$ defined in Equation 20 and Equation 50, and the state preparation oracle $O_{u}:\ket{0}\rightarrow\ket{\vec{u}_{0}}$ ,
2.

$u(t,x)$ is in the Gevrey class $G^{\sigma}$ in the sense that $u(t,x)$ is smooth and there exist constants $\Lambda>0$ and $\sigma\geq 0$ such that

$\sup_{j\in[d],t\in[0,T],x\in[0,1]^{d}}|\partial_{x_{j}}^{p}u(t,x)|\leq\Lambda^{p+1}(p!)^{\sigma}.$ (54)
3.

$\|(u(T,n/N))_{n\in[N]^{d}}\|\geq\widetilde{g}(T)$ for a function $\widetilde{g}$ .
4.

$\|\vec{u}(0)\|/\|\vec{u}(T)\|\leq g(T)$ for a function $g$ .

Then, with second-order operator splitting method for time propagation, an $\epsilon$ -approximation of $\ket{(u(T,n/N))_{n\in[N]^{d}}}$ can be obtained by choosing

N=\mathcal{O}\left(\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\sigma}\right).

(55)

and using

\mathcal{O}\left((g(T))^{3/2}\frac{T^{3/2}}{\epsilon^{1/2}}d^{\alpha/2}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\alpha\sigma/2}\right)

(56)

queries to $O_{1}$ , $O_{2}$ , their inverses and controlled versions,

2.

$\mathcal{O}\left(g(T)\right)$ (57)

queries to the state preparation oracle $O_{u}$ and its inverse.

\mathcal{O}\left((g(T))^{3/2}\frac{T^{3/2}}{\epsilon^{1/2}}d^{\alpha/2+1}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\alpha\sigma/2}\log^{2}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)\right)

(58)

additional elementary gates.

Proof.

It suffices to bound both spatial and time discretization errors by $\epsilon$ . The choice of $N$ directly follows 8. We now count the overall complexity. Each run of the algorithm requires an application of Figure 2 which implements $e^{-Bh/2}$ and $e^{-{C}(t)h}$ for $\mathcal{O}(r)$ times. In each block-encoding of $e^{-Bh/2}$ and $e^{-{C}(t)h}$ , we need to use $\mathcal{O}(1)$ queries to $O_{1}$ and $O_{2}$ , and $\mathcal{O}(d\log^{2}(N))$ additional gates mainly due to the QFT. According to 11, it suffices to choose $r=\mathcal{O}\left(g(T)^{1/2}d^{\alpha/2}N^{\alpha/2}\frac{T^{3/2}}{\epsilon^{1/2}}\right)$ . Here the extra factor $g(T)^{1/2}$ is because 11 only bounds the operator norm, and in order to bound the error in the quantum state by $\epsilon$ , the operator norm error bound needs to be bounded by $\mathcal{O}(\epsilon\|\vec{u}(T)\|/\|\vec{u}(0)\|)$ according to 19. The averaged number of repeats to succeed after amplitude amplification is $\mathcal{O}(\|\vec{u}(0)\|/\|\vec{u}(T)\|)=\mathcal{O}(g(T))$ . Multiplying these together gives the overall query complexity and additional gates required. Notice that in each run we only need one query to the state preparation oracle, so the overall number of state preparation is only $\sim g(T)$ . ∎

4.4 Time-marching method

Now we consider an alternative algorithm proposed in [18] called time-marching method. The method is designed for general ODE

\frac{d}{dt}\phi(t)=A(t)\phi(t),\quad\phi(0)=\phi_{0}

(59)

with a time-dependent matrix-valued function $A(t)$ , and is a quantum analog of the classical exponential propagation methods. It first divides the time interval $[0,T]$ into small segments with mesh $0=t_{0}<t_{1}<\cdots<t_{L}=T$ and applies the short-time evolution operator sequentially. While naive applications of a sequence of non-unitary operators may incur an exponential overhead in the number of the operators, the time-marching method avoids such overhead by a technique that combines the uniform singular value transformation and the amplitude amplification. We refer to [18] for more details and only roughly summarize the main result here.

Lemma 13 (Theorem 8 of [18]).

Consider the ODE Equation 59. Suppose that we are given the prepare oracle $O_{\phi}$ of $\ket{\phi_{0}}$ such that $O_{\phi}\ket{0}=\ket{\phi_{0}}$ and an input model of $A(t)$ , denoted by $\text{MAT}_{A}$ , that simultaneously block encodes $A(t_{k}^{\prime})$ at some refined mesh points $t_{k}^{\prime}$ . Then, with the time-marching method, an $\epsilon$ -approximation of $\ket{\phi(T)}$ can be prepared using

\mathcal{O}\left(\eta^{2}T^{2}Q\log(\eta TQ)\frac{\log(\eta TQ/\epsilon)}{\log\log(\eta TQ/\epsilon)}\right)

(60)

queries to $\text{MAT}_{A}$ and $\mathcal{O}(Q)$ queries to $O_{\phi}$ , its inverse and controlled version. Here $\eta$ is the block-encoding factor of $\text{MAT}_{A}$ such that $\eta\geq\|A(t)\|$ for all $t\in[0,T]$ , and

Q=\frac{\|\phi(0)\|\prod_{l=1}^{L}\left\|\mathcal{T}e^{\int_{t_{l-1}}^{t_{l}}A(t)dt}\right\|}{\|\phi(T)\|}.

(61)

Now we discuss the complexity of applying the time-marching method to spatially discretized equation Equation 38. For technical simplicity, we only estimate the query complexities.

Theorem 14.

1.

we are given oracles encoding the eigenvalues of $B$ and the diagonal entries of ${C}(t)$ , i.e., $O_{1}$ and $O_{2}$ defined in Equation 20 and Equation 50, and the state preparation oracle $O_{u}:\ket{0}\rightarrow\ket{\vec{u}_{0}}$ ,
2.

$v(t,x)$ is in the Gevrey class $G^{\sigma}$ in the sense that $v(t,x)$ is smooth and there exist constants $\Lambda>0$ and $\sigma\geq 0$ such that

$\sup_{j\in[d],t\in[0,T],x\in[0,1]^{d}}|\partial_{x_{j}}^{p}v(t,x)|\leq\Lambda^{p+1}(p!)^{\sigma}.$ (62)
3.

$\|(v(T,n/N))_{n\in[N]^{d}}\|\geq\widetilde{g}(T)$ for a function $\widetilde{g}$ .

Then, with the time-marching method for time propagation, an $\epsilon$ -approximation of $\ket{(u(T,n/N))_{n\in[N]^{d}}}$ can be obtained by choosing

N=\mathcal{O}\left(\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\sigma}\right).

(63)

and using

\widetilde{\mathcal{O}}\left(QT^{2}d^{\alpha}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{2\alpha\sigma}\log\left(\frac{1}{\epsilon}\right)\right)

(64)

queries to ${O}_{1}$ , ${O}_{2}$ , the $d$ -dimensional QFT circuit, their inverses and controlled versions, where

Q=\frac{\|\vec{u}(0)\|\prod_{l=1}^{L}\left\|\mathcal{T}e^{\int_{t_{l-1}}^{t_{l}}(-B-{C}(t))dt}\right\|}{\|\vec{u}(T)\|},

(65)

2.

$\mathcal{O}\left(Q\right)$ (66)

queries to the state preparation oracle $O_{u}$ or its inverse.

Proof.

First, the oracle $\text{MAT}_{A}$ mentioned in 13 can be constructed through the given ${O}_{1}$ and ${O}_{2}$ and additional $\mathcal{O}(d\log^{2}(N))$ gates. The idea is as follows. We can first construct the block-encoding of the diagonal matrix $D$ with ${O}_{1}$ , then construct the block-encoding of $B=(\mathcal{F})^{\otimes d}D(\mathcal{F}^{-1})^{\otimes d}$ using QFT. Meanwhile we may construct the simultaneous block-encoding of ${C}(t)$ with ${O}_{2}$ . Finally, we can linearly combine these two block-encodings to obtain the desired $\text{MAT}_{A}$ that block encodes $A(t)=-B-{C}(t)$ . According to [19, Lemma 48 & Lemma 52 & Lemma53], such approach requires $\mathcal{O}(1)$ queries to ${O}_{1}$ and ${O}_{2}$ , so we may directly estimate the number of queries to $\text{MAT}_{A}$ as that of queries to ${O}_{1}$ and ${O}_{2}$ . Note that there are additional $\mathcal{O}(d\log^{2}(N))$ gates required to construct $\text{MAT}_{A}$ from ${O}_{1}$ and ${O}_{2}$ , but we only focus on the query complexities here for technical simplicity.

Now we may directly use 13 to estimate the complexity, and it suffices to write down explicit scalings of the block-encoding factor $\eta$ in the example of Equation 38. Under the assumption that ${C}(t)$ is uniformly bounded, the spectral norm $\|-B-{C}(t)\|=\mathcal{O}(d^{\alpha/2}N^{\alpha})$ , so the parameter $\eta=\mathcal{O}(d^{\alpha/2}N^{\alpha})$ . Using the choice of $N$ estimated in 6, we have

\eta=\mathcal{O}\left(d^{\alpha/2}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\alpha\sigma}\right).

(67)

Plugging this parameter back to the scalings in 13 yields the desired estimates. ∎

4.5 Truncated Dyson series method

Now we discuss the query complexity of applying the state-of-the-art generic quantum ODE solvers to our linear fractional reaction-diffusion equation. We consider the method proposed in [8], which is based on the truncated Dyson series. It first expands the solution via truncated Dyson series and encode it into a linear system of equations, then solve it using the optimal quantum linear system algorithm. It works for the most general linear ODE with time dependent coefficient matrix and possible inhomogeneous term. In our case, we are only interested in the homogeneous equation, so here we only summarize their main result for the homogeneous equation Equation 59.

Lemma 15 (Theorem 1 of [8]).

Consider the ODE Equation 59 where the coefficient matrix $A(t)$ has non-positive logarithmic norm for all $t$ . Suppose that we are given the prepare oracle $O_{\phi}$ of $\ket{\phi_{0}}$ such that $O_{\phi}\ket{0}=\ket{\phi_{0}}$ and an input model of $A(t)$ , denoted by $\text{MAT}_{A}$ , that simultaneously block-encoding $A(t_{k}^{\prime})$ at some refined mesh points $t_{k}^{\prime}$ . Then, with the truncated Dyson series method, an $\epsilon$ -approximation of $\ket{\phi(T)}$ can be prepared using

\mathcal{O}\left(\frac{\max_{t\in[0,T]}\|\phi(t)\|}{\|\phi(T)\|}\eta T\log(1/\epsilon)\log(\eta T/\epsilon)\right)

(68)

queries to $\text{MAT}_{A}$ and

\mathcal{O}\left(\frac{\max_{t\in[0,T]}\|\phi(t)\|}{\|\phi(T)\|}\eta T\log(1/\epsilon)\right)

(69)

queries to $O_{\phi}$ . Here $\eta$ is the block-encoding factor of $\text{MAT}_{A}$ such that $\eta\geq\|A(t)\|$ for all $t\in[0,T]$ .

Theorem 16.

1.

we are given oracles encoding the eigenvalues of $B$ and the diagonal entries of ${C}(t)$ , i.e., $O_{1}$ and $O_{2}$ defined in Equation 20 and Equation 50, and the state preparation oracle $O_{u}:\ket{0}\rightarrow\ket{\vec{u}_{0}}$ ,
2.

$u(t,x)$ is in the Gevrey class $G^{\sigma}$ in the sense that $u(t,x)$ is smooth and there exist constants $\Lambda>0$ and $\sigma\geq 0$ such that

$\sup_{j\in[d],t\in[0,T],x\in[0,1]^{d}}|\partial_{x_{j}}^{p}u(t,x)|\leq\Lambda^{p+1}(p!)^{\sigma}.$ (70)
3.

$\|(u(T,n/N))_{n\in[N]^{d}}\|\geq\widetilde{g}(T)$ for a function $\widetilde{g}$ ,
4.

$\|\vec{u}(0)\|/\|\vec{u}(T)\|\leq g(T)$ for a function $g$ .

Then, with the truncated Dyson series method for time propagation, an $\epsilon$ -approximation of $\ket{(u(T,n/N))_{n\in[N]^{d}}}$ can be obtained by choosing

N=\mathcal{O}\left(\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\sigma}\right).

(71)

and using

\widetilde{\mathcal{O}}\left(g(T)Td^{\alpha/2}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\alpha\sigma}\left(\log\left(\frac{1}{\epsilon}\right)\right)^{2}\right)

(72)

queries to ${O}_{1}$ and ${O}_{2}$ , $d$ -dimensional QFT, their inverses and controlled versions,

\mathcal{O}\left(g(T)Td^{\alpha/2}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\alpha\sigma}\log\left(\frac{1}{\epsilon}\right)\right)

(73)

queries to the state preparation oracle $O_{u}$ or its inverse.

Proof.

As shown in the proof of 14, we may directly estimate the number of queries to ${O}_{j}$ as that of queries to $\text{MAT}_{A}$ , and the block-encoding factor $\eta$ can be bounded as

\eta=\mathcal{O}\left(d^{\alpha/2}\left(d\log d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)^{\alpha\sigma}\right).

(74)

Since the coefficient matrix $(-B-{C}(t))$ is always negative semi-definite, the norm of the solution $\|\phi(t)\|$ is non-increasing over $t$ , so

\frac{\max_{t\in[0,T]}\|\vec{u}(t)\|}{\|\vec{u}(T)\|}=\frac{\|\vec{u}(0)\|}{\|\vec{u}(T)\|}\leq g(T).

(75)

Plugging these parameters back to 15 completes the proof. ∎

4.6 Linear combination of Hamiltonian simulation in the interaction picture

We have discussed and analyzed the second-order operator splitting method, the time-marching method, and the truncated Dyson series method. All the methods have extra polynomial dependence on the dimension $d$ , which comes from the dependence on the spectral norm of the discrete fractional Laplacian operator $B$ , although the operator splitting method may partially benefit from its commutator scalings.

In quantum dynamics, a common technique to avoid the explicit dependence on $\|B\|$ is to simulate the dynamics in the interaction picture. Specifically, if we consider the fractional Schrödinger equation

i\frac{d}{dt}\ket{\psi}=(-B-{C}(t))\ket{\psi},

(76)

then, by defining the interaction picture Hamiltonian $H_{I}(t)=-e^{-iBt}{C}(t)e^{iBt}$ and $\ket{\psi_{I}}=e^{-iBt}\ket{\psi}$ , we may obtain the transformed solution $\ket{\psi_{I}}$ by simulating $i\frac{d}{dt}\ket{\psi_{I}}=H_{I}(t)\ket{\psi_{I}}$ . Here $H_{I}(t)$ is bounded independently of $\|B\|$ , and its oscillations depend on $\|B\|$ . Therefore we may efficiently simulate the interaction picture Hamiltonian using truncated Dyson series method which has linearly dependence on $\|H_{I}\|$ but only poly-logarithmically depends on its derivatives. We refer to [30] for more details.

The success of the interaction picture Hamiltonian simulation relies on the fast-forwarded implementation of $e^{iBs}$ , that is, $e^{iBs}$ can be implemented for any real number $s$ with cost independent of $\|B\|$ and $|s|$ . However, in our reaction-diffusion equation, which can be viewed as the imaginary time evolution of the Schrödinger equation, we cannot directly apply similar technique. This is because $e^{-Bs}$ is only fast-forwardable when $s\geq 0$ , while the transformation into the analog of interaction picture requires both forward and backward time evolution.

4.6.1 Representation

To take advantage of the interaction picture technique, we can relate the reaction-diffusion equation with the Hamiltonian simulation problem. A recent work [4] shows that any linear ODE can be represented as a linear combination of Hamiltonian simulation. In particular, we may write the evolution operator of our fractional reaction-diffusion equation as

\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}=\int_{\mathbb{R}}\frac{1}{\pi(1+\xi^{2})}\mathcal{T}e^{-i\int_{0}^{T}\xi(B+{C}(s))ds}d\xi.

(77)

Here $\mathcal{T}$ is the time-ordering operator. The proof of Equation 77 can be found in [4]. Now we use the interaction picture simulation. Let

U(t)=\mathcal{T}e^{-i\int_{0}^{t}\xi(B+{C}(s))ds},

(78)

then $U(t)$ satisfies the time-dependent Hamiltonian simulation problem as

\frac{dU}{dt}=-i(\xi B+\xi{C}(t))U(t),\quad U(0)=I.

(79)

Let

U_{I}(t)=e^{i\xi Bt}U(t).

(80)

We may compute that

\begin{split}\frac{dU_{I}}{dt}&=i\xi Be^{i\xi Bt}U(t)-ie^{i\xi Bt}(\xi B+\xi{C}(t))U(t)\\ &=-ie^{i\xi Bt}\xi{C}(t)e^{-i\xi Bt}U_{I}(t).\end{split}

(81)

Define

H_{I}(t;\xi)=e^{i\xi Bt}\xi{C}(t)e^{-i\xi Bt}.

(82)

Then

U_{I}(t)=\mathcal{T}e^{-i\int_{0}^{t}H_{I}(s;\xi)ds},

(83)

and we may write Equation 77 as

\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}=\int_{\mathbb{R}}\frac{1}{\pi(1+\xi^{2})}e^{-i\xi BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi)ds}d\xi.

(84)

4.6.2 Numerical quadrature

We can truncate Equation 84 over a finite interval $[-\Xi,\Xi]$ and write it as

\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}\approx\int_{-\Xi}^{\Xi}\frac{1}{\pi(1+\xi^{2})}e^{-i\xi BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi)ds}d\xi.

(85)

The resulting integral can be discretized using standard numerical quadrature. Here we use the simplest Riemann sum formula with $M$ grid points. For $0\leq j\leq M-1$ , let $\xi_{j}=-\Xi+2j\Xi/M$ and $w_{j}=\frac{1}{\pi(1+\xi_{j}^{2})}\frac{2\Xi}{M}$ . Then

\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}\approx\sum_{j=0}^{M-1}w_{j}e^{-i\xi_{j}BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi_{j})ds}.

(86)

The discretization error can be bounded as follows, and its proof is given in Appendix F.

Lemma 17.

We have

\left\|\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}-\sum_{j=0}^{M-1}w_{j}e^{-i\xi_{j}BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi_{j})ds}\right\|\leq\frac{2}{\pi\Xi}+\frac{2\Xi^{2}}{\pi M}\left(1+T(\|B\|+\max\|{C}\|)\right).

(87)

In order to bound the discretization error by $\mathcal{O}(\epsilon)$ , it suffices to choose

\Xi=\mathcal{O}\left(\frac{1}{\epsilon}\right),\quad M=\mathcal{O}\left(\frac{T\|B\|}{\epsilon^{3}}\right).

(88)

4.6.3 Implementation and complexity

Suppose that we are given the same input oracles as in previous algorithms, encoding the eigenvalues of $B$ and the diagonal entries of ${C}(t)$ , i.e., $O_{1}$ and $O_{2}$ defined in Equation 20 and Equation 50. The sketch of the algorithm is to simulate the interaction picture Hamiltonian using truncated Dyson series method [30] and then apply the LCU technique to compute Equation 86.

We start with the HAM-T encoding of the matrix $H_{I}$ . For a fixed time step size $h$ and an integer such that $[mh,(m+1)h]\subset[0,T]$ , we may first construct the HAM-T encoding of ${C}$ from its sparse input oracle following the approach in [19]. The resulting HAM-T encoding satisfies

\bra{0}_{a}\text{HAM-T}_{C,m}\ket{0}_{a}=\sum_{l=0}^{M_{H}-1}\ket{l}\bra{l}\otimes\frac{{C}(mh+lh/M_{H})}{\alpha_{C}}.

(89)

Here $\alpha_{C}$ is the upper bound of $\|{C}\|$ , and $M_{H}$ is the number of grid points used in the truncated Dyson series method. By appending an ancilla register $\ket{j}$ encoding the index for $\xi$ and applying the controlled rotation $\ket{j}\ket{0}\rightarrow\ket{j}\left(\frac{\xi_{j}}{\Xi}\ket{0}+\sqrt{1-\frac{\xi_{j}^{2}}{\Xi^{2}}}\ket{1}\right)$ with the help of an additional ancilla qubit, we obtain the HAM-T encoding of $\xi{C}$ such that

\bra{0}_{a}\text{HAM-T}_{\xi C,m}\ket{0}_{a}=\sum_{j=0}^{M-1}\sum_{l=0}^{M_{H}-1}\ket{j}\bra{j}\otimes\ket{l}\bra{l}\otimes\frac{\xi_{j}{C}(mh+lh/M_{H})}{\Xi\alpha_{C}}.

(90)

To construct the block-encoding of $e^{-i\xi Bt}$ , we write

\begin{split}e^{-i\xi_{j}B(mh+lh/M_{H})}&=e^{-i(-\Xi+2j\Xi/M)B(mh+lh/M_{H})}\\ &=e^{i\Xi Bmh}\left(e^{-i(2\Xi/M)Bmh}\right)^{j}\left(e^{i\Xi Bh/M_{H}}\right)^{l}\left(\left(e^{-i(2\Xi/M)Bh/M_{H}}\right)^{j}\right)^{l}.\end{split}

(91)

Noticing that the matrix $B$ can be diagonalized with QFT and the diagonal components are given through the oracle $O_{1}$ , we can implement $e^{-iBs}$ for any real number $s$ fast-forwardly with $\mathcal{O}(1)$ uses of $O_{1}$ , controlled phase gate and QFT. Then, according to the binary encoding of $0\leq j\leq M-1$ , we can use the controlled version of a total of $\log_{2}(M)$ operators $e^{-i(2\Xi/M)Bmh}$ , $e^{-i2(2\Xi/M)Bmh}$ , $e^{-i2^{2}(2\Xi/M)Bmh}$ , $\cdots$ , $e^{-i2^{\log_{2}(M)}(2\Xi/M)Bmh}$ to implement the controlled evolution $\sum_{j=0}^{M-1}\ket{j}\bra{j}\otimes\left(e^{-i(2\Xi/M)Bmh}\right)^{j}$ . Similarly, we can construct the evolution $\sum_{l=0}^{M_{H}-1}\ket{l}\bra{l}\otimes\left(e^{i\Xi Bh/M_{H}}\right)^{l}$ and $\sum_{j=0}^{M-1}\sum_{l=0}^{M_{H}-1}\ket{j}\bra{j}\otimes\ket{l}\bra{l}\otimes\left(\left(e^{-i(2\Xi/M)Bh/M_{H}}\right)^{j}\right)^{l}$ with logarithmic cost as well. Multiplying them together gives the select oracle

\text{SEL}_{B,m}=\sum_{j=0}^{M-1}\sum_{l=0}^{M_{H}-1}\ket{j}\bra{j}\otimes\ket{l}\bra{l}\otimes e^{-i\xi_{j}B(mh+lh/M_{H})}.

(92)

Then

\text{HAM-T}_{H_{I},m}:=(I_{n_{a}}\otimes\text{SEL}_{B,m}^{\dagger})\text{HAM-T}_{\xi C,m}(I_{n_{a}}\otimes\text{SEL}_{B,m})

(93)

gives the HAM-T encoding of $H_{I}$ that

\bra{0}_{a}\text{HAM-T}_{H_{I},m}\ket{0}_{a}=\sum_{j=0}^{M-1}\sum_{l=0}^{M_{H}-1}\ket{j}\bra{j}\otimes\ket{l}\bra{l}\otimes\frac{H_{I}(mh+lh/M_{H};\xi_{j})}{\Xi\alpha_{C}}.

(94)

The $\text{HAM-T}_{H_{I},m}$ serves as the Hamiltonian input oracle in the truncated Dyson series method. Therefore, the method in [30] gives the select oracle

\text{SEL}_{W}=\sum_{j=0}^{M-1}\ket{j}\bra{j}\otimes W_{j},

(95)

where $W_{j}$ is an approximation of $U_{I}(T;\xi_{j})$ . We then multiply it on the left by $\sum_{j=0}^{M-1}\ket{j}\bra{j}\otimes e^{-i\xi_{j}BT}$ (which again can be efficiently constructed according to the binary representation of $j$ ) and obtain

\text{SEL}_{U}=\sum_{j=0}^{M-1}\ket{j}\bra{j}\otimes U_{j}.

(96)

Here $U_{j}$ is an approximation of $e^{-i\xi_{j}BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi_{j})ds}$ . The $\text{SEL}_{U}$ operator serves as the select oracle in the LCU subroutine. Hence the formula Equation 86 can be directly implemented by the LCU technique (2).

The overall complexity of the algorithm is given as follows.

Theorem 18.

Consider the spatially discretized equation Equation 38. Suppose that

1.

we are given oracles encoding the eigenvalues of $B$ and the diagonal entries of ${C}(t)$ , i.e., $O_{1}$ and $O_{2}$ defined in Equation 20 and Equation 50, and the state preparation oracle $O_{u}:\ket{0}\rightarrow\ket{\vec{u}_{0}}$ ,
2.

$u(t,x)$ is in the Gevrey class $G^{\sigma}$ in the sense that $u(t,x)$ is smooth and there exist constants $\Lambda>0$ and $\sigma\geq 0$ such that

$\sup_{j\in[d],t\in[0,T],x\in[0,1]^{d}}|\partial_{x_{j}}^{p}u(t,x)|\leq\Lambda^{p+1}(p!)^{\sigma},$ (97)
3.

$\|(u(T,n/N))_{n\in[N]^{d}}\|\geq\widetilde{g}(T)$ for a function $\widetilde{g}$ ,
4.

$\|\vec{u}(0)\|/\|\vec{u}(T)\|\leq g(T)$ for a function $g$ .

Then, the linear combination of Hamiltonian simulation in the interaction picture can prepare an $\epsilon$ -approximation of $\ket{(u(T,n/N))_{n\in[N]^{d}}}$ using

queries to $O_{1}$ and $O_{2}$ a total number of times

\mathcal{O}\left(g(T)^{2}\frac{T}{\epsilon}\log^{3}\left(\frac{g(T)Td\log d}{\epsilon}\log\left(\frac{1}{\widetilde{g}(T)}\right)\right)\right),

(98)

2.

queries to $O_{u}$ for $\mathcal{O}(g(T))$ times,

additional elementary gates for

\mathcal{O}\left(g(T)^{2}\frac{T}{\epsilon}\log\left(\frac{g(T)T}{\epsilon}\right)\left(d\log^{2}\left(d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)+\log\left(\frac{g(T)T}{\epsilon}\right)\right)\right).

(99)

Proof.

As discussed before, the oracle $\text{HAM-T}_{\xi C,m}$ can be implemented with $\mathcal{O}(1)$ queries to $O_{2}$ and $\mathcal{O}(d\log(N))$ additional gates [19, Lemma 48], and the construction of $\text{SEL}_{B,m}$ requires $\mathcal{O}(\log(M)\log(M_{H}))$ $O_{1}$ and QFT (which requires $\mathcal{O}(d\log^{2}(N))$ gates). So each $\text{HAM-T}_{H_{I},m}$ can be constructed with $\mathcal{O}(\log(M)\log(M_{H}))$ queries to $O_{1}$ , $O_{2}$ and $\mathcal{O}(d\log^{2}(N))$ additional gates. According to [30, Corolllary 4], by choosing

\begin{split}M_{H}&=\mathcal{O}\left(\frac{T}{\Xi\alpha_{C}\epsilon^{\prime}}\left(\|B\|\Xi^{2}\alpha_{C}+\Xi\max\|{C}^{\prime}\|+\Xi^{2}\alpha_{C}^{2}\right)\right)\\ &=\mathcal{O}\left(\frac{T}{\epsilon^{\prime}}\left(\|B\|\Xi+\max\|{C}^{\prime}\|\right)\right),\end{split}

(100)

we may implement the select oracle $\text{SEL}_{W}=\sum_{j=0}^{M-1}\bra{j}\ket{j}\otimes W_{j}$ such that

\left\|W_{j}-\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi_{j})ds}\right\|\leq\epsilon^{\prime}

(101)

with

\mathcal{O}\left(\Xi\alpha_{C}T\frac{\log(\Xi\alpha_{C}T/\epsilon^{\prime})}{\log\log(\Xi\alpha_{C}T/\epsilon^{\prime})}\right)=\mathcal{O}\left(\Xi T\frac{\log(\Xi T/\epsilon^{\prime})}{\log\log(\Xi T/\epsilon^{\prime})}\right)

(102)

queries to $\text{HAM-T}_{H_{I},m}$ and

\begin{split}&\quad\mathcal{O}\left(\Xi\alpha_{C}T\frac{\log(\Xi\alpha_{C}T/\epsilon^{\prime})}{\log\log(\Xi\alpha_{C}T/\epsilon^{\prime})}\left(d\log(N)+\log(M_{H})\right)\right)\\ &=\mathcal{O}\left(\Xi T\frac{\log(\Xi T/\epsilon^{\prime})}{\log\log(\Xi T/\epsilon^{\prime})}\left(d\log(N)+\log(M_{H})\right)\right)\end{split}

(103)

additional gates. Hence the $\text{SEL}_{U}$ can be implemented with asymptotically the same cost, and, taking into account the cost of constructing $\text{HAM-T}_{H_{I},m}$ , this step needs

\mathcal{O}\left(\Xi T\frac{\log(\Xi T/\epsilon^{\prime})}{\log\log(\Xi T/\epsilon^{\prime})}\log(M)\log(M_{H})\right)

(104)

queries to $O_{1}$ and $O_{2}$ , and

\mathcal{O}\left(\Xi T\frac{\log(\Xi T/\epsilon^{\prime})}{\log\log(\Xi T/\epsilon^{\prime})}\left(d\log^{2}(N)+\log(M_{H})\right)\right)

(105)

additional gates.

The LCU algorithm requires a single application of the select oracle and two applications of the prepare oracle for $\frac{1}{\sqrt{\|w\|_{1}}}\sum_{j=0}^{M-1}\sqrt{w_{j}}\ket{j}$ . Noticing that $w$ represents the discretized Cauchy distribution, we can implement this prepare oracle with $\mathcal{O}(\log(M))$ gates [20] and $\|w\|_{1}=\mathcal{O}(1)$ . The output of the LCU step can be written as $\frac{1}{\|w\|\|\vec{u}_{0}\|}\ket{0}\widetilde{v}+\ket{\perp}$ , where

\widetilde{v}=\sum_{j=0}^{M-1}w_{j}U_{j}\vec{u}_{0}.

(106)

Using the inequality $\|\vec{a}/\|a\|-\vec{b}/\|b\|\|\leq 2\|\vec{a}-\vec{b}\|/\|\vec{a}\|$ , we can bound the error in the quantum state as

\begin{split}\|\ket{\vec{u}(T)}-\ket{\widetilde{v}}\|&\leq\frac{2}{\|\vec{u}(T)\|}\|\vec{u}(T)-\ket{\widetilde{v}}\|\\ &\leq\frac{2\|\vec{u}_{0}\|}{\|\vec{u}(T)\|}\left\|\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}-\sum_{j=0}^{M-1}w_{j}e^{-i\xi_{j}BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi_{j})ds}\right\|\\ &\quad\quad+\frac{2\|\vec{u}_{0}\|}{\|\vec{u}(T)\|}\left\|\sum_{j=0}^{M-1}w_{j}e^{-i\xi_{j}BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi_{j})ds}-\sum_{j=0}^{M-1}w_{j}U_{j}\right\|\\ &\leq\frac{2\|\vec{u}_{0}\|}{\|\vec{u}(T)\|}\left\|\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}-\sum_{j=0}^{M-1}w_{j}e^{-i\xi_{j}BT}\mathcal{T}e^{-i\int_{0}^{T}H_{I}(s;\xi_{j})ds}\right\|+\frac{2\|\vec{u}_{0}\|}{\|\vec{u}(T)\|}\|w\|_{1}\epsilon^{\prime}.\end{split}

(107)

To bound the error by $\epsilon$ , according to 17, it suffices to choose

\Xi=\mathcal{O}\left(\frac{\|\vec{u}_{0}\|}{\|\vec{u}(T)\|}\frac{1}{\epsilon}\right),\quad M=\mathcal{O}\left(\left(\frac{\|\vec{u}_{0}\|}{\|\vec{u}(T)\|}\right)^{3}\frac{T\|B\|}{\epsilon^{3}}\right),\quad\epsilon^{\prime}=\mathcal{O}\left(\frac{\|\vec{u}(T)\|}{\|\vec{u}_{0}\|}\epsilon\right).

(108)

With these and by $\|B\|=\mathcal{O}(d^{\alpha/2}N^{\alpha})$ and $N$ given in 8, in each run of the algorithm we need queries to $O_{u}$ for $\mathcal{O}(1)$ times, queries to $O_{1}$ and $O_{2}$ for

\begin{split}&\quad\mathcal{O}\left(\Xi T\frac{\log(\Xi T/\epsilon^{\prime})}{\log\log(\Xi T/\epsilon^{\prime})}\log(M)\log(M_{H})\right)\\ &=\mathcal{O}\left(g(T)\frac{T}{\epsilon}\log^{3}\left(\frac{g(T)Td\log d}{\epsilon}\log\left(\frac{1}{\widetilde{g}(T)}\right)\right)\right),\end{split}

(109)

and additional gates for a total number of

\begin{split}&\quad\mathcal{O}\left(\log(M)+\Xi T\frac{\log(\Xi T/\epsilon^{\prime})}{\log\log(\Xi T/\epsilon^{\prime})}\left(d\log^{2}(N)+\log(M_{H})\right)\right)\\ &=\mathcal{O}\left(g(T)\frac{T}{\epsilon}\log\left(\frac{g(T)T}{\epsilon}\right)\left(d\log^{2}\left(d+\log\left(\frac{T}{\widetilde{g}(T)}\right)+\log\left(\frac{1}{\epsilon}\right)\right)+\log\left(\frac{g(T)T}{\epsilon}\right)\right)\right).\end{split}

(110)

With amplitude amplification, the average number of repeats to get a success is $\mathcal{O}(\|w\|\|\vec{u}_{0}\|/\|\widetilde{v}\|)=\mathcal{O}(g(T))$ , so the overall complexity should be multiplied by this factor. ∎

5 Nonlinear equations

We now discuss the full non-linear fractional reaction-diffusion equation as in Equation 1. For simplicity we only consider time-independent potential function $c(t,x)\equiv c(t)$ . After spatial discretization, Equation 1 becomes a system of nonlinear ODEs with quadratic nonlinear term, which can be tackled by the Carleman linearization technique [29]. Existing algorithms based on Carleman linearization assume the sparsity of the coefficient matrices in the nonlinear ODEs. This facilities the construction of the Carleman matrix, which is a direct sum of matrices in different dimensions. However, in the fractional reaction-diffusion equation, the coefficient matrices are unavoidably dense, so we will discuss a block-encoding implementation of the Carleman linearization technique.

5.1 Spatial discretization

We use the same spatial discretization as in the linear case. Let $\vec{u}(t)$ be a $N^{d}$ -dimensional vector approximating the exact solution $u(t,x)$ at equi-distant grid points $(n_{1}/N.\cdots,n_{d}/N)$ where $n_{j}\in[N]$ . The spatially discretized equation can be written as

\frac{d}{dt}\vec{u}=F_{1}\vec{u}+F_{2}\vec{u}^{\otimes 2}.

(111)

Here $\vec{u}^{\otimes 2}=\vec{u}\otimes\vec{u}$ is a $N^{2d}$ -dimensional vector. $F_{1}=-B-C+aI$ . $F_{2}$ is a $N^{d}\times N^{2d}$ dimensional matrix that maps $\vec{u}^{\otimes 2}$ to $-a\vec{u}$ , i.e., each row of $F_{2}$ only has one non-zero entry to be $-a$ at its $((j-1)N^{d}+j)$ -th column (for the $j$ -th row).

5.2 Carleman linearization

The idea of Carleman linearization for Equation 111 is to convert the nonlinear ODE to an equivalent infinite-dimensional linear ODE. Specifically, for any positive integer $m$ , the tensor product $\vec{u}^{\otimes m}$ satisfies the ODE

\frac{d}{dt}(\vec{u}^{\otimes m})=A_{m}^{m}\vec{u}^{\otimes m}+A_{m+1}^{m}\vec{u}^{\otimes(m+1)},

(112)

where ( $I$ represents the identity matrix of dimension $N^{d}\times N^{d}$ )

A_{m}^{m}=\sum_{j=1}^{m}I^{\otimes(j-1)}\otimes F_{1}\otimes I^{\otimes(m-j)},

(113)

A_{m+1}^{m}=\sum_{j=1}^{m}I^{\otimes(j-1)}\otimes F_{2}\otimes I^{\otimes(m-j)}.

(114)

So the infinite-dimensional vector $[\vec{u};\vec{u}^{\otimes 2};\vec{u}^{\otimes 3};\cdots]$ satisfies a system of homogeneous linear ODE with coefficient matrix

\left(\begin{array}[]{cccccc}A_{1}^{1}&A_{2}^{1}&&&&\\ &A_{2}^{2}&A_{3}^{2}&&&\\ &&\ddots&\ddots&&\\ &&&A_{M}^{M}&A_{M+1}^{M}&\\ &&&&\ddots&\ddots\\ \end{array}\right).

(115)

To implement the Carleman linearization numerically, we truncate the infinite-dimensional ODE at a specific order $M$ and consider the ODE

\frac{d}{dt}\vec{w}=A\vec{w}.

(116)

Here $\vec{w}=[\vec{w}_{1};\vec{w}_{2};\cdots;\vec{w}_{M}]$ and each $\vec{w}_{j}$ is an $N^{jd}$ -dimensional vector expected to approximate $\vec{u}^{\otimes j}$ . The matrix $A$ can be represented as

A=\left(\begin{array}[]{ccccc}A_{1}^{1}&A_{2}^{1}&&&\\ &A_{2}^{2}&A_{3}^{2}&&\\ &&\ddots&\ddots&\\ &&&A_{M-1}^{M-1}&A_{M}^{M-1}\\ &&&&A_{M}^{M}\\ \end{array}\right).

(117)

When the linear part $F_{1}$ has negative eigenvalue and the nonlinear part $F_{2}$ is relatively small compared to the decay rate of the linear part, $\vec{w}_{1}$ can be a good approximation of $\vec{u}$ , and the truncation order $M$ is only logarithmic in the precision. [1] establishes rigorous analysis for the convergence of such procedure for integer-order reaction-diffusion equation, and we expect the same convergence for the fractional order as well.

5.3 Block-encoded implementation

The coefficient matrix $A$ is expressed as a partitioned matrix with blocks in different dimension. When all the blocks are sparse, as assumed in existing quantum algorithm based on Carleman linearization, the entire matrix $A$ is still sparse and one can directly implement it from sparse input model. This is also true for (integer-order) reaction-diffusion equation when $\alpha=2$ . However, when $0<\alpha<2$ , while the off-diagonal blocks $A_{m+1}^{m}$ are still sparse, the diagonal blocks $A_{m}^{m}$ are unavoidably dense. Though the block-encoding of each block $A_{m}^{m}$ is still construable, it is somewhat cumbersome to assemble them together to the block-encoding of the entire $A$ .

Inspired by the technique of [29] for state preparation, we further enlarge the dimension of the linearized ODE. The purpose is to make the resulting enlarged coefficient matrix expressed as a partitioned matrix with blocks in the same dimension, and meanwhile a subspace of the enlarged solution is still the solution of the original ODE. Specifically, let us consider the ODE

\frac{d}{dt}\vec{y}=\widetilde{A}\vec{y}.

(118)

Here $\vec{y}=[\vec{y}_{1};\cdots;\vec{y}_{M}]$ and each $\vec{y}_{j}$ is an $N^{Md}$ -dimensional vector. The initial value $\vec{y}(0)$ is chosen to be $[\ket{0}^{\otimes M-1}\otimes\ket{\vec{u}(0)};\ket{0}^{\otimes M-2}\otimes\ket{\vec{u}(0)}^{\otimes 2};\cdots;\ket{\vec{u}(0)}^{\otimes M}]$ where $\ket{0}=(0,\cdots,0,1)^{T}$ in $N^{d}$ dimension. The coefficient matrix is

\widetilde{A}=\left(\begin{array}[]{ccccc}\widetilde{A}_{1}^{1}&\widetilde{A}_{2}^{1}&&&\\ &\widetilde{A}_{2}^{2}&\widetilde{A}_{3}^{2}&&\\ &&\ddots&\ddots&\\ &&&\widetilde{A}_{M-1}^{M-1}&\widetilde{A}_{M}^{M-1}\\ &&&&\widetilde{A}_{M}^{M}\\ \end{array}\right).

(119)

Here each $\widetilde{A}_{m}^{m}$ and $\widetilde{A}_{m+1}^{m}$ is an $N^{Md}$ -dimensional square matrix. $\widetilde{A}_{m}^{m}=I^{\otimes(M-m)}\otimes A_{m}^{m}$ . $\widetilde{A}_{m+1}^{m}$ contains $A_{m+1}^{m}$ at its most bottom right and otherwise $0$ .

We now show that the non-zero entries of $\vec{y}$ exactly form $\vec{w}$ . To this end, for each $\vec{y}_{m}$ , we write it as $[\vec{y}_{m,1};\cdots;\vec{y}_{m,N^{(M-m)d}}]$ , where each $\vec{y}_{m,j}$ is an $N^{md}$ -dimensional vector. By definition of $\vec{y}(0)$ , we have that for every $m$ , all but the last component $\vec{y}_{m,N^{(M-m)d}}(0)$ of $\vec{y}_{m}(0)$ are zero. Furthermore, by the definition of $\widetilde{A}$ , the variables $\vec{y}_{m,j}$ for $j\neq N^{(M-m)d}$ does not interact with other variables outside. Therefore, $\vec{y}_{m,j}(t)$ is always $0$ for all $t$ , $m$ and $j\neq N^{(M-m)d}$ . Now, if we only focus on the ODEs that $\vec{y}_{m,N^{(M-m)d}}$ ’s satisfy, it is exactly the original ODE Equation 116 with the same initial condition. So we have $[\vec{y}_{1,N^{(M-1)d}}(t);\vec{y}_{2,N^{(M-2)d}}(t);\cdots;\vec{y}_{M,1}(t)]=\vec{w}(t)$ for all $t$ . This implies that, instead of solving Equation 116, we can focus on its equivalent formalism Equation 118.

Equation 118 can be solved by standard quantum algorithm for linear ODEs. For example, we can use the method based on truncated Dyson series [8] discussed in the previous section. To use this method, we need the state preparation for $\vec{y}(0)$ and the block-encoding of $\widetilde{A}$ . The state preparation oracle can be constructed in a similar manner as in [29]. To construct the block-encoding of $\widetilde{A}$ , we decompose it as $\widetilde{A}=\widetilde{D}+\widetilde{R}$ , where

\widetilde{D}=\left(\begin{array}[]{ccccc}\widetilde{A}_{1}^{1}&&&&\\ &\widetilde{A}_{2}^{2}&&&\\ &&\ddots&&\\ &&&\widetilde{A}_{M-1}^{M-1}&\\ &&&&\widetilde{A}_{M}^{M}\\ \end{array}\right),\quad\widetilde{R}=\left(\begin{array}[]{ccccc}0&\widetilde{A}_{2}^{1}&&&\\ &0&\widetilde{A}_{3}^{2}&&\\ &&\ddots&\ddots&\\ &&&0&\widetilde{A}_{M}^{M-1}\\ &&&&0\\ \end{array}\right).

(120)

Notice that $\widetilde{R}$ is a $\mathcal{O}(M)$ -sparse matrix. According to [19, Lemma 48], we may implement a block-encoding of $\mathcal{R}$ with $\mathcal{O}(1)$ query complexity and $\mathcal{O}(M)$ block-encoding factor. For $\widetilde{D}$ , we start with the block-encoding of $F_{1}$ , which can be constructed from the linear combination of $B,C$ and $aI$ . According to 2, the block-encoding factor is $\mathcal{O}(d^{\alpha}N^{2\alpha})$ , corresponding to the spectral norm of $B$ , and the query complexity for block encoding $F_{1}$ is $\mathcal{O}(1)$ . Denote this block-encoding by $U_{F_{1}}$ . Then a block-encoding of $I^{\otimes(j-1)}\otimes F_{1}\otimes I^{\otimes(m-j)}$ can be constructed by applying $U_{F_{1}}$ on the correct register. Since $A_{m}^{m}$ is the summation of $I^{\otimes(j-1)}\otimes F_{1}\otimes I^{\otimes(m-j)}$ , according to 2, we may further construct the block-encoding of $A_{m}^{m}$ , denoted by $U_{A_{m}^{m}}$ , with $\mathcal{O}(Md^{\alpha}N^{2\alpha})$ block-encoding factor. Here we choose the block-encoding factor for all $A_{m}^{m}$ to be the same and corresponds to the worst case $A_{M}^{M}$ with most summation terms in order to facilitate later construction for bigger matrix. Then, for each $m$ , a block-encoding of $\widetilde{A}_{m}^{m}$ , denoted by $U_{\widetilde{A}_{m}^{m}}$ , can be constructed with a single use of $U_{A_{m}^{m}}$ , and the block-encoding of $\widetilde{D}$ is given by

\sum_{m=0}^{M-1}\ket{m}\bra{m}\otimes U_{\widetilde{A}_{m}^{m}}.

(121)

This can be implemented by controlled version of $U_{A_{m}^{m}}$ and requires $\mathcal{O}(M)$ query complexity, and the block-encoding factor is $\mathcal{O}(Md^{\alpha}N^{2\alpha})$ . The final step is to use 2 again and, from the block-encoding of $\widetilde{D}$ and $\widetilde{R}$ , we may construct the desired block-encoding of $\widetilde{A}$ with $\mathcal{O}(M)$ query complexity and $\mathcal{O}(Md^{\alpha}N^{2\alpha})$ block-encoding factor. Notice that both query complexity and the block-encoding factor does not involve exponential dependence on $d$ .

With the state preparation for $\vec{y}(0)$ and the block-encoding of $\widetilde{A}$ , we can solve Equation 118 efficiently using truncated Dyson series method [8]. Notice that the method requires $\widetilde{A}$ to have non-positive logarithmic norm, which can be guaranteed if all the eigenvalues of $F_{1}$ are negative and $F_{2}$ is bounded, which corresponds to the standard assumption on the boundedness of the nonlinearity (namely the condition $R_{D}<1$ in [1]).

6 Conclusion

In this paper, we study efficient quantum algorithms for linear and nonlinear fractional reaction-diffusion equations. For linear equations, we improve and analyze the complexity of four different methods: second-order Trotter formula, time-marching method, truncated Dyson series method, and the linear combination of Hamiltonian simulation with the interaction picture formalism (LCHS-IP). Among all the methods, the LCHS-IP method achieves best scaling in the spatial dimension and thus is most suiable for high-dimensional linear fractional reaction-diffusion equations. For nonlinear equations, we generalize the quantum Carleman linearization algorithm to the case with block-encoding input oracle, making the algorithm applicable to dense coefficient matrices.

A natural extension of this work is to design better quantum algorithm for linear reaction-diffusion equations with near-optimal scalings in all parameters. A desired algorithm is expected to simultaneously scale poly-logarithmically in dimension (as LCHS-IP), poly-logarithmically in precision (as time-marching and truncated Dyson series methods), linearly in evolution time (as truncated Dyson series and LCHS-IP), and have low state preparation cost (as Trotter, time-marching and LCHS-IP). Among all the methods being considered in this paper, the LCHS-IP method is the closest one as it only misses the poly-logarithmic dependence on precision, so it is interesting to explore whether the LCHS-IP method can be further exponentially improved in terms of precision. Another possibility is to use higher-order product formula, whose asymptotic scaling tends to be $T^{1+o(1)}/\epsilon^{o(1)}$ . However, as proved in [14], there might be extra exponential overhead in Trotter errors when we deal with non-unitary dynamics. Therefore a tailored design of the product formula and an improved error analysis would be necessary. Furthermore, to obtain a poly-logarithmic dependence on dimension, we may need to take advantage of the vector-norm scaling of the product formula, which states that the Trotter errors may be independent of the spectral norm of the Hamiltonians if the quantum states are within a more regular subspace with better smoothness assumption. Such a vector-norm scaling has been proved for first- and second-order Trotter applied to Hamiltonian simulation in [2], and it remains open to establish similar error bounds for higher-order product formula and the cases beyond Hamiltonian simulation. This is our ongoing work.

For non-linear equations, a natural next step is to establish a rigorous analysis with detailed computational costs. We expect the analysis presented in [1] to work with suitable modifications. However, the complexity estimate in [1] still depends polynomially on the dimension $d$ . It is interesting to explore whether the quantum Carleman algorithm can also avoid such a polynomial overhead with the help of tighter error bounds, or new techniques are necessary to achieve this task.

Throughout this paper, we focus on the spectral fractional Laplacian with periodic boundary condition, which facilitates its spatial discretization and quantum implementation of its time evolution through the QFT circuit. Our future work will be focusing on the fractional Laplacian operator with Riesz definition and exploring the efficiency of quantum algorithms.

Acknowledgments

DA acknowledges the support by the Department of Defense through the Hartree Postdoctoral Fellowship at QuICS, and the seed grant at the NSF Quantum Leap Challenge Institute for Robust Quantum Simulation (QLCI grant OMA-2120757). KT gratefully acknowledges the support by the National Science Foundation under the grants DMS-2231533 and DMS-2008568.

References

[1] D. An, D. Fang, S. Jordan, J.-P. Liu, G. H. Low, and J. Wang. Efficient quantum algorithm for nonlinear reaction-diffusion equations and energy estimation, 2022.
[2] D. An, D. Fang, and L. Lin. Time-dependent unbounded hamiltonian simulation with vector norm scaling. Quantum, 5:459, 2021.
[3] D. An and L. Lin. Quantum linear system solver based on time-optimal adiabatic quantum computing and quantum approximate optimization algorithm. ACM Transactions on Quantum Computing, 3(2):1–28, mar 2022.
[4] D. An, J.-P. Liu, and L. Lin. Linear combination of hamiltonian simulation for non-unitary dynamics with optimal state preparation cost, 2023.
[5] D. An, J.-P. Liu, D. Wang, and Q. Zhao. A theory of quantum differential equation solvers: limitations and fast-forwarding, 2023.
[6] D. W. Berry. High-order quantum algorithm for solving linear differential equations. Journal of Physics A: Mathematical and Theoretical, 47(10):105301, 2014.
[7] D. W. Berry, A. M. Childs, A. Ostrander, and G. Wang. Quantum algorithm for linear differential equations with exponentially improved dependence on precision. Communications in Mathematical Physics, 356(3):1057–1081, 2017.
[8] D. W. Berry and P. C. S. Costa. Quantum algorithm for time-dependent differential equations using dyson series, 2022.
[9] C. C. Cantrell R.S. Spatial ecology via reaction–diffusion equations. John Wiley & Sons Ltd., Chichester, 2003, 2003.
[10] A. M. Childs, R. Kothari, and R. D. Somma. Quantum algorithm for systems of linear equations with exponentially improved dependence on precision. SIAM J. Comput., 46:1920–1950, 2017.
[11] A. M. Childs, J. Leng, T. Li, J.-P. Liu, and C. Zhang. Quantum simulation of real-space dynamics. Quantum, 6:860, nov 2022.
[12] A. M. Childs and J.-P. Liu. Quantum spectral methods for differential equations. Communications in Mathematical Physics, 375(2):1427–1457, 2020.
[13] A. M. Childs, J.-P. Liu, and A. Ostrander. High-precision quantum algorithms for partial differential equations. Quantum, 5:574, nov 2021.
[14] A. M. Childs, Y. Su, M. C. Tran, N. Wiebe, and S. Zhu. Theory of trotter error with commutator scaling. Phys. Rev. X, 11:011020, 2021.
[15] A. M. Childs and N. Wiebe. Hamiltonian simulation using linear combinations of unitary operations. Quantum Information and Computation, 12:901–924, 2012.
[16] P. C. Costa, D. An, Y. R. Sanders, Y. Su, R. Babbush, and D. W. Berry. Optimal scaling quantum linear-systems solver via discrete adiabatic theorem. PRX Quantum, 3:040303, Oct 2022.
[17] R. F. Global solutions of reaction–diffusion systems, in: Lecture notes in mathematics. Lecture Notes in Mathematics, 1072, 1984.
[18] D. Fang, L. Lin, and Y. Tong. Time-marching based quantum solvers for time-dependent linear differential equations, 2022.
[19] A. Gilyén, Y. Su, G. H. Low, and N. Wiebe. Quantum singular value transformation and beyond: exponential improvements for quantum matrix arithmetics. In Proceedings of the 51st Annual ACM SIGACT Symposium on Theory of Computing, pages 193–204, 2019.
[20] L. Grover and T. Rudolph. Creating superpositions that correspond to efficiently integrable probability distributions, 2002.
[21] A. W. Harrow, A. Hassidim, and S. Lloyd. Quantum algorithm for linear systems of equations. Phys. Rev. Lett., 103:150502, 2009.
[22] S. J. Shock waves and reaction–diffusion equations, 2nd edn, in: Grundlehren der mathematischen wissenschaften [fundamental principles of mathematical. Lecture Notes in Mathematics, 258, 1994.
[23] T. Jahnke and C. Lubich. Error bounds for exponential operator splittings. BIT Numerical Mathematics, 40(4):735–744, 2000.
[24] M. J.D. Mathematical Biology. I,II, 3rd edn, in: Interdisciplinary Applied Mathematics, volume 17, 18. Springer-Verlag, New York, 2002.
[25] S. Jin, N. Liu, and Y. Yu. Quantum simulation of partial differential equations via schrodingerisation, 2022.
[26] H. Krovi. Improved quantum algorithms for linear and nonlinear differential equations, 2022.
[27] L. Lin and Y. Tong. Optimal polynomial based quantum eigenstate filtering with application to solving quantum linear systems. Quantum, 4:361, nov 2020.
[28] A. Lischke, G. Pang, M. Gulian, F. Song, C. Glusa, X. Zheng, Z. Mao, W. Cai, M. M. Meerschaert, M. Ainsworth, and G. E. Karniadakis. What is the fractional laplacian? a comparative review with new results. Journal of Computational Physics, 404:109009, 2020.
[29] J.-P. Liu, H. Ø. Kolden, H. K. Krovi, N. F. Loureiro, K. Trivisa, and A. M. Childs. Efficient quantum algorithm for dissipative nonlinear differential equations. Proceedings of the National Academy of Sciences, 118(35), 2021.
[30] G. H. Low and N. Wiebe. Hamiltonian simulation in the interaction picture. arXiv:1805.00675, 2019.
[31] C. H. Neubert M. Demography and dispersal: Calculation and sensitivity analysis of invasion speed for structured populations. Ecology, 81, 2000.
[32] C. H. Neubert M. From diffusion to anomalous diffusion: A century after einstein’s brownian motion. Chaos, 15:26–103, 2005.
[33] B. N.F. Reaction–Diffusion Equations and Their Applications to Biology,. Academic Press Inc. [Harcourt Brace Jovanovich Publishers], London, 1986, 1986.
[34] M. A. Nielsen and I. Chuang. Quantum computation and quantum information, 2000.
[35] G. P. The theory and applications of reaction–diffusion equations. Oxford Applied Mathematics and Computing Science Series, 2003.
[36] Y. Subaşı, R. D. Somma, and D. Orsucci. Quantum algorithms for systems of linear equations inspired by adiabatic quantum computing. Phys. Rev. Lett., 122:060504, 2019.

Appendix A Technical lemmas

Lemma 19.

Let $\vec{a}$ and $\vec{b}$ be two non-zero vectors, possibly unnormalized. Then

\left\|\frac{\vec{a}}{\|\vec{a}\|}-\frac{\vec{b}}{\|\vec{b}\|}\right\|\leq\frac{2\|\vec{a}-\vec{b}\|}{\|\vec{a}\|}.

(122)

Lemma 20.

Suppose $A(t)$ is a matrix-valued continuous function and the operator $S(t,s)$ solves the differential equation

\frac{d}{dt}S(t,s)=A(t)S(t,s),\quad S(s,s)=I.

(123)

Then,

1.

for any matrix-valued continuous function $R(t)$ , the solution of the differential equation

$\frac{d}{dt}\widetilde{S}(t,0)=A(t)\widetilde{S}(t,0)+R(t),\quad\widetilde{S}(0,0)=I$ (124)

can be represented as

$\widetilde{S}(t,0)=S(t,0)+\int_{0}^{t}S(t,s)R(s)ds.$ (125)
2.

for any vector-valued continuous function $\vec{r}(t)$ , the solution of the differential equation

$\frac{d}{dt}\vec{\psi}(t)=A(t)\vec{\psi}(t)+\vec{r}(t),\quad\vec{\psi}(0)=\vec{\psi}_{0}$ (126)

can be represented as

$\vec{\psi}(t)=S(t,0)\vec{\psi}_{0}+\int_{0}^{t}S(t,s)\vec{r}(s)ds.$ (127)

Appendix B Proof of 5

Proof.

For any function $f$ defined on $[0,1]^{d}$ , its Fourier coefficient is defined to be

\hat{f}_{k}=\int_{[0,1]^{d}}f(x)e^{-2\pi i(k_{0}x_{0}+\cdots+k_{d-1}x_{d-1})}dx.

(128)

Using integration by parts for $p$ times, we may obtain

|\hat{f}_{k}|\leq\frac{\max_{j}\|\partial_{x_{j}}^{p}f(x)\|_{L^{1}}}{(2\pi\|k\|_{\infty})^{p}}.

(129)

As a result, for each fixed $m$ ,

\begin{split}\left|\sum_{j\in\mathbb{Z}^{d}}\hat{u}_{m+jN}-\hat{u}_{i(m)}\right|&\leq\sum_{l=1}^{\infty}\sum_{\|j\|_{\infty}=l}|\hat{u}_{i(m)+jN}|\\ &\leq\sum_{l=1}^{\infty}((l+1)^{d}-l^{d})\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}}{(\pi(2l-1)N)^{p}}.\end{split}

(130)

Suppose that $p\geq d+2$ . Using the inequality that $2l-1\geq l+1$ for all $l\geq 2$ , we have

\begin{split}\left|\sum_{j\in\mathbb{Z}^{d}}\hat{u}_{m+jN}-\hat{u}_{i(m)}\right|&\leq\left(2^{d}-1+\sum_{l=2}^{\infty}\frac{(l+1)^{d}-l^{d}}{(2l-1)^{p}}\right)\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}}{\pi^{p}N^{p}}\\ &\leq\left(2^{d}-1+\sum_{l=2}^{\infty}\frac{1}{(2l-1)^{p-d}}\right)\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}}{\pi^{p}N^{p}}\\ &\leq\left(2^{d}-1+\int_{1}^{\infty}\frac{dx}{(2x-1)^{p-d}}\right)\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}}{\pi^{p}N^{p}}\\ &=\left(2^{d}-1+\frac{1}{2(p-d-1)}\right)\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}}{\pi^{p}N^{p}}\\ &\leq\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}}{(\pi/2)^{p}N^{p}}.\end{split}

(131)

Therefore,

\begin{split}&\quad\left\|(\mathcal{F}^{-1})^{\otimes d}\vec{u}_{0}-N^{d/2}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &=N^{d/2}\left\|\sum_{m\in[N]^{d}}\left(\sum_{j\in\mathbb{Z}^{d}}\hat{u}_{m+jN}-\hat{u}_{i(m)}\right)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &\leq\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}}{(\pi/2)^{p}N^{p-d}}.\end{split}

(132)

Similarly,

\left\|(\mathcal{F}^{-1})^{\otimes d}\vec{u}(T,x)-N^{d/2}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}e^{-(2\pi\|i(m)\|)^{\alpha}T}\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\leq\frac{\max_{j}\|\partial_{x_{j}}^{p}u(T,x)\|_{L^{1}}}{(\pi/2)^{p}N^{p-d}}.

(133)

Let $\widetilde{\mathcal{F}}=(\mathcal{F})^{\otimes d}\otimes I\otimes I$ and $U$ denote the composition of $O_{1}$ , $O_{\exp,1}$ and c- $R$ specified in Figure 2, then

\begin{split}&\quad\left\|(I^{\otimes d}\otimes\bra{0}^{\otimes 3})\widetilde{\mathcal{F}}U\widetilde{\mathcal{F}}^{-1}\ket{u_{0}}\ket{0}^{\otimes 3}-\frac{1}{\|\vec{u}_{0}\|}\vec{u}(T)\right\|\\ &\leq\left\|(I^{\otimes d}\otimes\bra{0}^{\otimes 3})\widetilde{\mathcal{F}}U\widetilde{\mathcal{F}}^{-1}\ket{u_{0}}\ket{0}^{\otimes 3}-(I^{\otimes d}\otimes\bra{0}^{\otimes 3})\widetilde{\mathcal{F}}U\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{0}^{\otimes 3}\right\|\\ &\quad+\left\|(I^{\otimes d}\otimes\bra{0}^{\otimes 3})\widetilde{\mathcal{F}}\left(\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}e^{-(2\pi\|i(m)\|)^{\alpha}T}\ket{m_{0}}\cdots\ket{m_{d-1}}\ket{0}^{\otimes 3}+\ket{\perp}\right)\right.\\ &\quad\quad\left.-\frac{1}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}u(T,m/N)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &\leq\left\|(\mathcal{F}^{-1})^{\otimes d}\ket{u_{0}}-\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &\quad+\left\|\mathcal{F}^{\otimes d}\frac{N^{d/2}}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}e^{-(2\pi\|i(m)\|)^{\alpha}T}\ket{m_{0}}\cdots\ket{m_{d-1}}-\frac{1}{\|\vec{u}_{0}\|}\sum_{m\in[N]^{d}}u(T,m/N)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &\leq\frac{1}{\|\vec{u}_{0}\|}\frac{\max_{j}\|\partial_{x_{j}}^{p}u_{0}(x)\|_{L^{1}}+\max_{j}\|\partial_{x_{j}}^{p}u(T,x)\|_{L^{1}}}{(\pi/2)^{p}N^{p-d}}.\end{split}

(134)

By 19, the overall $2$ -norm error of the output quantum state after successful measurement can be bounded by

\frac{4}{\|\vec{u}_{0}\|}\frac{\max_{j,t\in\left\{0,T\right\}}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{(\pi/2)^{p}N^{p-d}}\frac{\|\vec{u}_{0}\|}{\|\vec{u}(T)\|}=\frac{4\max_{j,t\in\left\{0,T\right\}}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\|\vec{u}(T)\|(\pi/2)^{p}N^{p-d}}.

(135)

The claimed complexity can be shown by noticing that in each run of Figure 2, we need $\mathcal{O}(1)$ queries to the aforementioned oracles, and $\mathcal{O}(d\log^{2}(N))$ gates due to QFT and controlled rotations, and under amplitude amplification, the averaged repeats to get a success is $\mathcal{O}(\|\vec{u}_{0}\|/\|\vec{u}(T)\|)$ . ∎

Appendix C Bounding the spatial discretization errors

Here we present the detailed proofs of the spatial discretization results (6, 7 and 8).

Proof of 6.

Let $\vec{u}_{e}(t)=(u(t,n/N))_{n\in[N]^{d}}$ denote the exact solution evaluated at discrete grid points. Using Equation 29, for any $n\in[N]^{d}$ ,

\begin{split}\partial_{t}u(t,n/N)&=-(-\Delta)^{\alpha/2}u(t,n/N)-c(t,n/N)u(t,n/N)\\ &=\left(-B\vec{u}_{e}\right)_{n}-c(t,n/N)u(t,n/N)+\left(B\vec{u}_{e}\right)_{n}-(-\Delta)^{\alpha/2}u(t,n/N).\end{split}

(136)

Therefore the vector $\vec{u}_{e}(t)$ solves the differential equation

\frac{d}{dt}\vec{u}_{e}=-B\vec{u}_{e}-C(t)\vec{u}_{e}+\vec{r}(t),

(137)

where

\left(\vec{r}(t)\right)_{n}=\left(B\vec{u}_{e}\right)_{n}-(-\Delta)^{\alpha/2}u(t,n/N).

(138)

Therefore, the equations $\vec{u}_{e}$ satisfies can be viewed as a perturbation (by $\vec{r}$ ) of the equations $\vec{u}$ satisfies. By 20, we have

\vec{u}_{e}(t)=\vec{u}(t)+\int_{0}^{t}\mathcal{T}e^{\int_{s}^{t}(-B-C(\tau))d\tau}\vec{r}(s)ds,

(139)

and thus

\|\vec{u}_{e}(T)-\vec{u}(T)\|\leq T\max_{t\in[0,T]}\|\vec{r}(t)\|.

(140)

It remains to bound $\|\vec{r}(t)\|$ , which can be done similarly to the proof of 5. Suppose that the Fourier series of $u(t,x)$ is

u(t,x)=\sum_{k\in\mathbb{Z}^{d}}\hat{u}_{k}(t)e^{2\pi i(k_{0}x_{0}+\cdots+k_{d-1}x_{d-1})},

(141)

where

\hat{u}_{k}(t)=\int_{[0,1]^{d}}u(t,x)e^{-2\pi i(k_{0}x_{0}+\cdots+k_{d-1}x_{d-1})}dx.

(142)

By the same reasoning of Equation 132,

\left\|(\mathcal{F}^{-1})^{\otimes d}\vec{u}_{e}(t)-N^{d/2}\sum_{m\in[N]^{d}}\hat{u}_{i(m)}(t)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\leq\frac{\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{(\pi/2)^{p}N^{p-d}}.

(143)

Noticing that $\|D\|\leq\pi^{\alpha}N^{\alpha}d^{\alpha/2}$ , we have

\left\|D(\mathcal{F}^{-1})^{\otimes d}\vec{u}_{e}(t)-N^{d/2}\sum_{m\in[N]^{d}}2^{\alpha}\pi^{\alpha}\|i(m)\|^{\alpha}\hat{u}_{i(m)}(t)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\leq\frac{2^{p}d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-d-\alpha}}.

(144)

Again using the derivation of Equation 132 and noting that $(-\Delta)^{\alpha/2}u(t,x)=\sum_{k\in\mathbb{Z}^{d}}2^{\alpha}\pi^{\alpha}\|k\|^{\alpha}\hat{u}_{k}(t)e^{2\pi i(k_{0}x_{0}+\cdots+k_{d-1}x_{d-1})}$ , we have

\begin{split}&\quad\left\|(\mathcal{F}^{-1})^{\otimes d}\left((-\Delta)^{\alpha/2}u(t,n/N)\right)_{n\in[N]^{d}}-N^{d/2}\sum_{m\in[N]^{d}}2^{\alpha}\pi^{\alpha}\|i(m)\|^{\alpha}\hat{u}_{i(m)}(t)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &=N^{d/2}\left\|\sum_{m\in[N]^{d}}\left(\sum_{j\in\mathbb{Z}^{d}}2^{\alpha}\pi^{\alpha}\|m+jN\|^{\alpha}\hat{u}_{m+jN}(t)-2^{\alpha}\pi^{\alpha}\|i(m)\|^{\alpha}\hat{u}_{i(m)}(t)\right)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|.\end{split}

(145)

Using Equation 129, for each fixed $m$ , we have

\begin{split}&\quad\left|\sum_{j\in\mathbb{Z}^{d}}2^{\alpha}\pi^{\alpha}\|m+jN\|^{\alpha}\hat{u}_{m+jN}(t)-2^{\alpha}\pi^{\alpha}\|i(m)\|^{\alpha}\hat{u}_{i(m)}(t)\right|\\ &\leq\sum_{l=1}^{\infty}\sum_{\|j\|_{\infty}=l}2^{\alpha}\pi^{\alpha}\|i(m)+jN\|^{\alpha}|\hat{u}_{i(m)+jN}(t)|\\ &\leq\sum_{l=1}^{\infty}\sum_{\|j\|_{\infty}=l}2^{\alpha}\pi^{\alpha}\|i(m)+jN\|^{\alpha}\frac{\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{(2\pi\|i(m)+jN\|_{\infty})^{p}}\\ &\leq\sum_{l=1}^{\infty}((l+1)^{d}-l^{d})\pi^{\alpha}(2l+1)^{\alpha}N^{\alpha}d^{\alpha/2}\frac{\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{(\pi(2l-1)N)^{p}}\end{split}

(146)

Therefore, for any $p\geq d+\alpha+2$ ,

\begin{split}&\quad\left|\sum_{j\in\mathbb{Z}^{d}}2^{\alpha}\pi^{\alpha}\|m+jN\|^{\alpha}\hat{u}_{m+jN}(t)-2^{\alpha}\pi^{\alpha}\|i(m)\|^{\alpha}\hat{u}_{i(m)}(t)\right|\\ &\leq\left(\sum_{l=1}^{\infty}\frac{((l+1)^{d}-l^{d})(2l+1)^{\alpha}}{(2l-1)^{p}}\right)\frac{d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-\alpha}}\\ &\leq\left((2^{d}-1)3^{\alpha}+\sum_{l=2}^{\infty}\frac{((l+1)^{d}-l^{d})(2l+1)^{\alpha}}{(2l-1)^{p}}\right)\frac{d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-\alpha}}\\ &\leq\left((2^{d}-1)3^{\alpha}+2^{\alpha}\sum_{l=2}^{\infty}\frac{1}{(2l-1)^{p-d-\alpha}}\right)\frac{d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-\alpha}}\\ &\leq\left((2^{d}-1)3^{\alpha}+2^{\alpha}\frac{1}{2(p-d-\alpha-1)}\right)\frac{d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-\alpha}}\\ &\leq\frac{2^{d}3^{\alpha}d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-\alpha}}.\end{split}

(147)

As a result,

\begin{split}&\quad\left\|(\mathcal{F}^{-1})^{\otimes d}\left((-\Delta)^{\alpha/2}u(t,n/N)\right)_{n\in[N]^{d}}-N^{d/2}\sum_{m\in[N]^{d}}2^{\alpha}\pi^{\alpha}\|i(m)\|^{\alpha}\hat{u}_{i(m)}(t)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &=N^{d/2}\left\|\sum_{m\in[N]^{d}}\left(\sum_{j\in\mathbb{Z}^{d}}2^{\alpha}\pi^{\alpha}\|m+jN\|^{\alpha}\hat{u}_{m+jN}(t)-2^{\alpha}\pi^{\alpha}\|i(m)\|^{\alpha}\hat{u}_{i(m)}(t)\right)\ket{m_{0}}\cdots\ket{m_{d-1}}\right\|\\ &\leq\frac{2^{d}3^{\alpha}d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-d-\alpha}}.\end{split}

(148)

Combining Equation 144 and Equation 148, we have

\left\|D(\mathcal{F}^{-1})^{\otimes d}\vec{u}_{e}(t)-(\mathcal{F}^{-1})^{\otimes d}\left((-\Delta)^{\alpha/2}u(t,n/N)\right)_{n\in[N]^{d}}\right\|\leq\frac{2^{p+1}d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-d-\alpha}},

(149)

and

\|r(t)\|\leq\frac{2^{p+1}d^{\alpha/2}\max_{j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-d-\alpha}}.

(150)

Plugging this back into Equation 140 and we obtain

\|\vec{u}_{e}(T)-\vec{u}(T)\|\leq T\frac{2^{p+1}d^{\alpha/2}\max_{t,j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-d-\alpha}}.

(151)

∎

Proof of 7.

According to 6, the spatial discretization error can be bounded by

\begin{split}\|(u(T,n/N))_{n\in[N]^{d}}-\vec{u}(T)\|&\leq T\frac{2^{p+1}d^{\alpha/2}\max_{t,j}\|\partial_{x_{j}}^{p}u(t,x)\|_{L^{1}}}{\pi^{p-\alpha}N^{p-d-\alpha}}\\ &\leq 2\Lambda\pi^{\alpha}d^{\alpha/2}T\frac{(2\Lambda/\pi)^{p}(p!)^{\sigma}}{N^{p-d-\alpha}}\\ &\leq 2^{1+\sigma}\pi^{\alpha+\sigma/2}\Lambda d^{\alpha/2}T\frac{p^{\sigma/2}(2\Lambda/\pi)^{p}(p/e)^{p\sigma}}{N^{p-d-\alpha}}\end{split}

(152)

where in the second line we use $p!\leq\sqrt{2\pi p}(p/e)^{p}e^{1/(12p)}\leq 2\sqrt{\pi p}(p/e)^{p}$ . We choose $p=\lfloor p_{*}\rfloor$ where

p_{*}=\left(\frac{\pi}{2\Lambda}\right)^{1/\sigma}N^{1/\sigma}.

(153)

We remark that here we need $p_{*}\geq d+\alpha+2$ and thus $N\geq(2\Lambda/\pi)(d+\alpha+2)^{\sigma}$ . Then,

\begin{split}\|(u(T,n/N))_{n\in[N]^{d}}-\vec{u}(T)\|&\leq 2^{2+\sigma}\pi^{\alpha+\sigma/2}\Lambda d^{\alpha/2}T\frac{p_{*}^{\sigma/2}(2\Lambda/\pi)^{p_{*}}(p_{*}/e)^{p_{*}\sigma}}{N^{p_{*}-d-\alpha}}\\ &=2^{3/2+\sigma}\pi^{\alpha+\sigma/2+1/2}\Lambda^{1/2}Td^{\alpha/2}N^{d+\alpha+1/2}\exp\left(-\sigma\left(\frac{\pi}{2\Lambda}\right)^{1/\sigma}N^{1/\sigma}\right).\end{split}

(154)

By the Taylor expansion $e^{x}=\sum_{k}\frac{1}{k!}x^{k}$ and thus $e^{x}\geq\frac{1}{k!}x^{k}$ for $x\geq 0$ , we have

\begin{split}\exp\left(\frac{\sigma}{2}\left(\frac{\pi}{2\Lambda}\right)^{1/\sigma}N^{1/\sigma}\right)&\geq\frac{1}{\lceil\sigma(d+\alpha+1/2)\rceil!}\left(\frac{\sigma}{2}\left(\frac{\pi}{2\Lambda}\right)^{1/\sigma}\right)^{\lceil\sigma(d+\alpha+1/2)\rceil}N^{d+\alpha+1/2}\\ &\geq\left(\frac{\sigma}{2\lceil\sigma(d+\alpha+1/2)\rceil}\left(\frac{\pi}{2\Lambda}\right)^{1/\sigma}\right)^{\lceil\sigma(d+\alpha+1/2)\rceil}N^{d+\alpha+1/2}.\end{split}

(155)

Therefore

\begin{split}&\quad\|(u(T,n/N))_{n\in[N]^{d}}-\vec{u}(T)\|\\ &\leq 2^{3/2+\sigma}\pi^{\alpha+\sigma/2+1/2}\Lambda^{1/2}Td^{\alpha/2}\left(\frac{2\lceil\sigma(d+\alpha+1/2)\rceil}{\sigma}\left(\frac{2\Lambda}{\pi}\right)^{1/\sigma}\right)^{\lceil\sigma(d+\alpha+1/2)\rceil}\exp\left(-\frac{\sigma}{2}\left(\frac{\pi}{2\Lambda}\right)^{1/\sigma}N^{1/\sigma}\right)\\ &\leq c_{1}T(c_{2}d)^{c_{3}d}d^{\alpha/2}\exp\left(-c_{4}N^{1/\sigma}\right),\end{split}

(156)

where we may choose

$\displaystyle c_{1}$	$\displaystyle=2^{3/2+\sigma}\pi^{\alpha+\sigma/2+1/2}\Lambda^{1/2},$	(157)
$\displaystyle c_{2}$	$\displaystyle=\max\left\{2(2+\alpha+1/\sigma)\left(\frac{2\Lambda}{\pi}\right)^{1/\sigma},1\right\},$	(158)
$\displaystyle c_{3}$	$\displaystyle=\sigma(3+\alpha),$	(159)
$\displaystyle c_{4}$	$\displaystyle=\frac{\sigma}{2}\left(\frac{\pi}{2\Lambda}\right)^{1/\sigma}.$	(160)

∎

Proof of 8.

By 7 and 19, we have

\|\ket{(u(T,n/N))_{n\in[N]^{d}}}-\ket{\vec{u}(T)}\|\leq\frac{2c_{1}T}{\|(u(T,n/N))_{n\in[N]^{d}}\|}(c_{2}d)^{c_{3}d}d^{\alpha/2}\exp\left(-c_{4}N^{1/\sigma}\right).

(161)

The choice of $N$ can be solved by letting the error bound smaller than $\epsilon$ . ∎

Appendix D Bounding the Trotter error

In this section we present the detailed proof of 9. For $s\in[t_{0},t_{0}+h]$ and $t\in[0,h]$ , we consider three operators

S(s+t,s)=\mathcal{T}e^{\int_{s}^{s+t}(-B-{C}(\tau))d\tau},

(162)

\widetilde{S}(s+t,s)=e^{(-B-{C}(t_{0}+h/2))t},

(163)

and

S_{2}(s+t,s)=e^{-Bt/2}e^{-{C}(t_{0}+h/2)t}e^{-Bt/2}.

(164)

The proof follows the steps that we first bound $\|S-I\|$ and $\|\widetilde{S}-I\|$ , then separately bound $\|S-\widetilde{S}\|$ and $\|\widetilde{S}-S_{2}\|$ using variation of parameters formula.

D.1 Local error due to the time dependency

First, $S(s+t,s)$ satisfies the differential equation

\frac{d}{dt}S(s+t,s)=(-B-{C}(s+t))S(s+t,s).

(165)

Integrate this differential equation and get

S(s+t,s)-I=\int_{s}^{s+t}(-B-{C}(s+\tau))S(s+\tau,s)d\tau,

(166)

and thus we may write

S(s+t,s)=I+tR_{1}(s+t,s)

(167)

where

\|R_{1}(s+t,s)\|\leq\left(\|B\|+\max_{\tau}\|{C}(\tau)\|\right).

(168)

Similarly,

\widetilde{S}(s+t,s)=I+tR_{2}(s+t,s)

(169)

where

\|R_{2}(s+t,s)\|\leq\left(\|B\|+\max_{\tau}\|{C}(\tau)\|\right).

(170)

Now we bound the distance between $S$ and $\widetilde{S}$ . Notice that $\widetilde{S}$ satisfies the differential equation

\begin{split}\frac{d}{dt}\widetilde{S}(t_{0}+t,t_{0})&=(-B-{C}(t_{0}+h/2))\widetilde{S}(t_{0}+t,t_{0})\\ &=(-B-{C}(t_{0}+t))\widetilde{S}(t_{0}+t,t_{0})+({C}(t_{0}+t)-{C}(t_{0}+h/2))\widetilde{S}(t_{0}+t,t_{0}).\end{split}

(171)

By Taylor expansion, we can write

{C}(t_{0}+t)={C}(t_{0}+h/2)+(t-h/2){C}^{\prime}(t_{0}+h/2)+R_{3}(t),

(172)

where

\|R_{3}(t)\|\leq\frac{(t-h/2)^{2}}{2}\max_{\tau}\|{C}^{\prime\prime}(\tau)\|.

(173)

Therefore the differential equation becomes

\frac{d}{dt}\widetilde{S}(t_{0}+t,t_{0})=(-B-{C}(t_{0}+t))\widetilde{S}(t_{0}+t,t_{0})+((t-h/2){C}^{\prime}(t_{0}+h/2)+R_{3}(t))\widetilde{S}(t_{0}+t,t_{0}).

(174)

Comparing this with Equation 165 and using 20, we obtain

\widetilde{S}(t_{0}+h,t_{0})=S(t_{0}+h,t_{0})+\int_{0}^{h}S(t_{0}+h,t_{0}+t)((t-h/2){C}^{\prime}(t_{0}+h/2)+R_{3}(t))\widetilde{S}(t_{0}+t,t_{0})dt.

(175)

Plugging in Equation 167 and Equation 169 yields

\begin{split}\widetilde{S}(t_{0}+h,t_{0})&=S(t_{0}+h,t_{0})+\int_{0}^{h}(I+(h-t)R_{1}(t_{0}+h,t_{0}+t))(t-h/2){C}^{\prime}(t_{0}+h/2)(I+tR_{2}(t_{0}+t,t_{0}))dt\\ &\quad\quad\quad\quad\quad+\int_{0}^{h}S(t_{0}+h,t_{0}+t)R_{3}(t)\widetilde{S}(t_{0}+t,t_{0})dt\\ &=S(t_{0}+h,t_{0})+\int_{0}^{h}(t-h/2){C}^{\prime}(t_{0}+h/2)dt\\ &\quad\quad\quad\quad\quad+\int_{0}^{h}(h-t)(t-h/2)R_{1}(t_{0}+h,t_{0}+t){C}^{\prime}(t_{0}+h/2)\widetilde{S}(t_{0}+t,t_{0})dt\\ &\quad\quad\quad\quad\quad+\int_{0}^{h}t(t-h/2){C}^{\prime}(t_{0}+h/2)R_{2}(t_{0}+t,t_{0})dt+\int_{0}^{h}S(t_{0}+h,t_{0}+t)R_{3}(t)\widetilde{S}(t_{0}+t,t_{0})dt\\ &=S(t_{0}+h,t_{0})+\int_{0}^{h}(h-t)(t-h/2)R_{1}(t_{0}+h,t_{0}+t){C}^{\prime}(t_{0}+h/2)\widetilde{S}(t_{0}+t,t_{0})dt\\ &\quad\quad\quad\quad\quad+\int_{0}^{h}t(t-h/2){C}^{\prime}(t_{0}+h/2)R_{2}(t_{0}+t,t_{0})dt+\int_{0}^{h}S(t_{0}+h,t_{0}+t)R_{3}(t)\widetilde{S}(t_{0}+t,t_{0})dt.\end{split}

(176)

Therefore

\begin{split}&\quad\|\widetilde{S}(t_{0}+h,t_{0})-S(t_{0}+h,t_{0})\|\\ &\leq\int_{0}^{h}t|t-h/2|dt\left(\max(\|R_{1}\|\|{C}^{\prime}\|)+\max(\|{C}^{\prime}\|\|R_{2}\|)\right)+\int_{0}^{h}\|R_{3}(t)\|dt\\ &\leq\int_{0}^{h}t|t-h/2|dt\left(\max(\|R_{1}\|\|{C}^{\prime}\|)+\max(\|{C}^{\prime}\|\|R_{2}\|)\right)+\int_{0}^{h}\frac{(t-h/2)^{2}}{2}dt\max\|{C}^{\prime\prime}\|\\ &=h^{3}\left(\frac{1}{4}(\|B\|+\max\|{C}\|)\max\|{C}^{\prime}\|+\frac{1}{24}\max\|{C}^{\prime\prime}\|\right).\end{split}

(177)

D.2 Local time-independent Trotter error

Next, we bound the distance between $\widetilde{S}$ and $S_{2}$ . For notation simplicity, throughout this subsection we let $A_{1}=-B$ and $A_{2}=-{C}(t_{0}+h/2)$ . Differentiating $\widetilde{S}(t_{0}+t,t_{0})$ and $e^{A_{1}(t-s)}$ yields

\frac{d}{dt}\widetilde{S}(t_{0}+t,t_{0})=A_{1}\widetilde{S}(t_{0}+t,t_{0})+A_{2}\widetilde{S}(t_{0}+t,t_{0}),

(178)

and

\frac{d}{dt}(e^{A_{1}(t-s)})=A_{1}e^{A_{1}(t-s)}.

(179)

By 20, for any $t\in[0,h]$ , we have

\widetilde{S}(t_{0}+t,t_{0})=e^{A_{1}t}+\int_{0}^{t}e^{A_{1}(t-s)}A_{2}\widetilde{S}(t_{0}+s,t_{0})ds.

(180)

We use this equation iteratively for three times and get

\begin{split}\widetilde{S}(t_{0}+h,t_{0})&=e^{A_{1}h}+\int_{0}^{h}e^{A_{1}(h-t)}A_{2}\widetilde{S}(t_{0}+t,t_{0})dt\\ &=e^{A_{1}h}+\int_{0}^{h}e^{A_{1}(h-t)}A_{2}e^{A_{1}t}dt+\int_{0}^{h}\int_{0}^{t}e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}\widetilde{S}(t_{0}+s,t_{0})dsdt\\ &=e^{A_{1}h}+\int_{0}^{h}e^{A_{1}(h-t)}A_{2}e^{A_{1}t}dt+\int_{0}^{h}\int_{0}^{t}e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}e^{A_{1}s}dsdt+R_{4}(h),\end{split}

(181)

where

R_{4}(h)=\int_{0}^{h}\int_{0}^{t}\int_{0}^{s}e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}e^{A_{1}(s-\tau)}A_{2}\widetilde{S}(t_{0}+\tau,t_{0})d\tau dsdt.

(182)

Meanwhile, using the Taylor expansion of $e^{A_{2}h}$ , we may obtain

\begin{split}S_{2}(t_{0}+h,t_{0})&=e^{A_{1}h/2}e^{A_{2}h}e^{A_{1}h/2}\\ &=e^{A_{1}h}+he^{A_{1}h/2}A_{2}e^{A_{1}h/2}+\frac{h^{2}}{2}e^{A_{1}h/2}A_{2}^{2}e^{A_{1}h/2}+R_{5}(h),\end{split}

(183)

where

R_{5}(h)=e^{A_{1}h/2}\left(\int_{0}^{h}\frac{(h-t)^{2}}{2}A_{2}^{3}e^{A_{2}t}dt\right)e^{A_{1}h/2}.

(184)

Therefore,

\begin{split}&\quad\widetilde{S}(t_{0}+h,t_{0})-S_{2}(t_{0}+h,t_{0})\\ &=\left(\int_{0}^{h}e^{A_{1}(h-t)}A_{2}e^{A_{1}t}dt-he^{A_{1}h/2}A_{2}e^{A_{1}h/2}\right)\\ &\quad+\left(\int_{0}^{h}\int_{0}^{t}e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}e^{A_{1}s}dsdt-\frac{h^{2}}{2}e^{A_{1}h/2}A_{2}^{2}e^{A_{1}h/2}\right)\\ &\quad+R_{4}(h)-R_{5}(h).\end{split}

(185)

It remains to bound each term in the right hand side of Equation 185. Let $f(t)=e^{A_{1}(h-t)}A_{2}e^{A_{1}t}$ , then

\begin{split}&\quad\int_{0}^{h}e^{A_{1}(h-t)}A_{2}e^{A_{1}t}dt-he^{A_{1}h/2}A_{2}e^{A_{1}h/2}\\ &=\int_{0}^{h}f(t)dt-hf(h/2)\\ &=\int_{0}^{h}\left(f(h/2)+(t-h/2)f^{\prime}(h/2)+\int_{0}^{t}(t-s)f^{\prime\prime}(s)ds\right)dt-hf(h/2)\\ &=\int_{0}^{h}\int_{0}^{t}(t-s)f^{\prime\prime}(s)dsdt.\end{split}

(186)

Notice that $f^{\prime\prime}(t)=e^{A_{1}(h-t)}[[A_{2},A_{1}],A_{1}]e^{A_{1}t}$ , so we bound

\left\|\widetilde{S}(t_{0}+h,t_{0})-S_{2}(t_{0}+h,t_{0})\right\|\leq\int_{0}^{h}\int_{0}^{t}|t-s|dsdt\max_{\tau\in[0,h]}\|f^{\prime\prime}(\tau)\|\leq\frac{1}{6}h^{3}\|[[A_{2},A_{1}],A_{1}]\|.

(187)

Similarly, let $g(t,s)=e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}e^{A_{1}s}$ , then

\begin{split}&\quad\int_{0}^{h}\int_{0}^{t}e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}e^{A_{1}s}dsdt-\frac{h^{2}}{2}e^{A_{1}h/2}A_{2}^{2}e^{A_{1}h/2}\\ &=\int_{0}^{h}\int_{0}^{t}g(t,s)dsdt-\frac{h^{2}}{2}g(h/2,h/2)\\ &=\int_{0}^{h}\int_{0}^{t}\left(g(h/2,h/2)+\int_{0}^{1}\nabla g(h/2+\alpha(t-h/2),h/2+\alpha(s-h/2))\cdot\left(t-h/2,s-h/2\right)d\alpha\right)dsdt-\frac{h^{2}}{2}g(h/2,h/2)\\ &=\int_{0}^{h}\int_{0}^{t}\int_{0}^{1}\nabla g(h/2+\alpha(t-h/2),h/2+\alpha(s-h/2))\cdot\left(t-h/2,s-h/2\right)d\alpha dsdt.\end{split}

(188)

For $0\leq\alpha\leq 1$ , $0\leq s\leq t$ and $0\leq t\leq h$ , we always have $0\leq h/2+\alpha(s-h/2)\leq h/2+\alpha(t-h/2)$ and $0\leq h/2+\alpha(t-h/2)\leq h$ , so

\begin{split}&\quad\left\|\int_{0}^{h}\int_{0}^{t}e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}e^{A_{1}s}dsdt-\frac{h^{2}}{2}e^{A_{1}h/2}A_{2}^{2}e^{A_{1}h/2}\right\|\\ &\leq\int_{0}^{h}\int_{0}^{t}\int_{0}^{1}\left(|t-h/2|+|s-h/2|\right)d\alpha dsdt\left(\max\left\{\max_{0\leq s^{\prime}\leq t^{\prime}\leq h}\left\|\frac{\partial g(t^{\prime},s^{\prime})}{\partial t}\right\|,\max_{0\leq s^{\prime}\leq t^{\prime}\leq h}\left\|\frac{\partial g(t^{\prime},s^{\prime})}{\partial s}\right\|\right\}\right)\\ &=\frac{1}{4}h^{3}\left(\max\left\{\max_{0\leq s^{\prime}\leq t^{\prime}\leq h}\left\|\frac{\partial g(t^{\prime},s^{\prime})}{\partial t}\right\|,\max_{0\leq s^{\prime}\leq t^{\prime}\leq h}\left\|\frac{\partial g(t^{\prime},s^{\prime})}{\partial s}\right\|\right\}\right).\end{split}

(189)

We may compute that

\frac{\partial g}{\partial t}=e^{A_{1}(h-t)}[A_{2},A_{1}]e^{A_{1}(t-s)}A_{2}e^{A_{1}s},

(190)

and

\frac{\partial g}{\partial s}=e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}[A_{2},A_{1}]e^{A_{1}s},

(191)

so for $0\leq s\leq t\leq h$ , both $\|\partial g/\partial t\|$ and $\|\partial g/\partial s\|$ are bounded by $\|[A_{2},A_{1}]\|\|A_{2}\|$ . Then

\left\|\int_{0}^{h}\int_{0}^{t}e^{A_{1}(h-t)}A_{2}e^{A_{1}(t-s)}A_{2}e^{A_{1}s}dsdt-\frac{h^{2}}{2}e^{A_{1}h/2}A_{2}^{2}e^{A_{1}h/2}\right\|\leq\frac{1}{4}h^{3}\|[A_{2},A_{1}]\|\|A_{2}\|.

(192)

The bounds for $R_{4}$ and $R_{5}$ are straightforward from Equation 182 and Equation 184 that

\|R_{4}(h)\|\leq\frac{1}{6}h^{3}\|A_{2}\|^{3}

(193)

and

\|R_{5}(h)\|\leq\frac{1}{6}h^{3}\|A_{2}\|^{3}.

(194)

Plug Equations 187, 192, 193 and 194 back to Equation 185, and we get

\begin{split}\left\|\widetilde{S}(t_{0}+h,t_{0})-S_{2}(t_{0}+h,t_{0})\right\|&\leq h^{3}\left(\frac{1}{6}\|[A_{1},[A_{1},A_{2}]]\|+\frac{1}{4}\|[A_{1},A_{2}]\|\|A_{2}\|+\frac{1}{3}\|A_{2}\|^{3}\right)\\ &\leq h^{3}\left(\frac{1}{6}\max\|[B,[B,{C}]]\|+\frac{1}{4}\max\|[B,{C}]\|\max\|{C}\|+\frac{1}{3}\max\|{C}\|^{3}\right).\end{split}

(195)

D.3 Global error

Combining Equation 177 and Equation 195, we obtain the local error bound as

\begin{split}&\quad\|S(t_{0}+h,t_{0})-S_{2}(t_{0}+h,t_{0})\|\\ &\leq h^{3}\left(\frac{1}{24}\max\|{C}^{\prime\prime}\|+\frac{1}{4}(\|B\|+\max\|{C}\|)\max\|{C}^{\prime}\|\right.\\ &\quad\quad\quad\quad\quad\left.+\frac{1}{6}\max\|[B,[B,{C}]]\|+\frac{1}{4}\max\|[B,{C}]\|\max\|{C}\|+\frac{1}{3}\max\|{C}\|^{3}\right).\end{split}

(196)

Finally we bound the global error. Notice that, for any $t\geq 0$ , $\|S_{2}(t+s,t)\|\leq 1$ and $\|S(s+t,t)\|\leq 1$ , then the global error accumulates linearly as

\begin{split}&\quad\|S(T,0)-\prod_{j=0}^{r-1}S_{2}((j+1)h,jh)\|\\ &=\left\|\prod_{j=0}^{r-1}S((j+1)h,jh)-\prod_{j=0}^{r-1}S_{2}((j+1)h,jh)\right\|\\ &\leq\sum_{k=0}^{r-1}\left\|\prod_{j=k+1}^{r-1}S_{2}((j+1)h,jh)\left(S((k+1)h,kh)-S_{2}((k+1)h,kh)\right)\prod_{j=0}^{k-1}S((j+1)h,jh)\right\|\\ &\leq\sum_{k=0}^{r-1}\left\|S((k+1)h,kh)-S_{2}((k+1)h,kh)\right\|\\ &\leq Th^{2}\left(\frac{1}{24}\max\|{C}^{\prime\prime}\|+\frac{1}{4}(\|B\|+\max\|{C}\|)\max\|{C}^{\prime}\|\right.\\ &\quad\quad\quad\quad\quad\left.+\frac{1}{6}\max\|[B,[B,{C}]]\|+\frac{1}{4}\max\|[B,{C}]\|\max\|{C}\|+\frac{1}{3}\max\|{C}\|^{3}\right).\end{split}

(197)

Appendix E Bounding the commutators

Proof of 10.

We shall consider a fixed $t$ and omit the explicit time dependence in our notation. We first consider $[B,{C}]$ and write

\begin{split}\|[B,{C}]\|&=\|[(\mathcal{F})^{\otimes d}D(\mathcal{F}^{-1})^{\otimes d},{C}]\|\\ &=\|(\mathcal{F})^{\otimes d}[D,(\mathcal{F}^{-1})^{\otimes d}{C}(\mathcal{F})^{\otimes d}](\mathcal{F}^{-1})^{\otimes d}\|\\ &=\|[D,(\mathcal{F}^{-1})^{\otimes d}{C}(\mathcal{F})^{\otimes d}]\|.\end{split}

(198)

By the definition of the matrices, we have

\begin{split}{C}(\mathcal{F})^{\otimes d}&=\frac{1}{N^{d/2}}\sum_{j,k\in[N]^{d}}{c}(j/N)\omega_{N}^{j\cdot k}\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}\end{split}

(199)

and

\begin{split}(\mathcal{F}^{-1})^{\otimes d}{C}(\mathcal{F})^{\otimes d}&=\frac{1}{N^{d}}\sum_{j,k\in[N]^{d}}\left(\sum_{l\in[N]^{d}}\omega_{N}^{-j\cdot l}{c}(l/N)\omega_{N}^{l\cdot k}\right)\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}\\ &=\frac{1}{N^{d}}\sum_{j,k\in[N]^{d}}\left(\sum_{l\in[N]^{d}}\left(\sum_{m\in\mathbb{Z}^{d}}\hat{c}_{m}\omega_{N}^{l\cdot m}\right)\omega_{N}^{l\cdot(k-j)}\right)\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}\\ &=\frac{1}{N^{d}}\sum_{j,k\in[N]^{d}}\left(\sum_{m\in\mathbb{Z}^{d}}\hat{c}_{m}\left(\sum_{l\in[N]^{d}}\omega_{N}^{l\cdot(m-(j-k))}\right)\right)\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}\\ &=\sum_{j,k\in[N]^{d}}\left(\sum_{m\in\mathbb{Z}^{d}}\hat{c}_{j-k+Nm}\right)\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}.\end{split}

(200)

Let $\hat{b}_{l}=\sum_{m\in\mathbb{Z}^{d}}\hat{c}_{l+Nm}$ , and we can compute that

[D,(\mathcal{F}^{-1})^{\otimes d}{C}(\mathcal{F})^{\otimes d}]=\sum_{j,k\in[N]^{d}}\left(\|i(j)\|^{\alpha}-\|i(k)\|^{\alpha}\right)\hat{b}_{j-k}\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}.

(201)

Notice the decomposition

\|i(j)\|^{\alpha}-\|i(k)\|^{\alpha}=(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2})^{2}+2(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2})\|i(k)\|^{\alpha/2}.

(202)

We may write

[D,(\mathcal{F}^{-1})^{\otimes d}{C}(\mathcal{F})^{\otimes d}]=A_{1}+2A_{2}A_{3}

(203)

where

A_{1}=\sum_{j,k\in[N]^{d}}\left(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2}\right)^{2}\hat{b}_{j-k}\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}},

(204)

A_{2}=\sum_{j,k\in[N]^{d}}\left(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2}\right)\hat{b}_{j-k}\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}}

(205)

and

A_{3}=\sum_{k\in[N]^{d}}\|i(k)\|^{\alpha/2}\ket{k_{0}\cdots k_{d-1}}\bra{k_{0}\cdots k_{d-1}}.

(206)

Now we bound the norms of $A_{j}$ ’s. The norm of $A_{3}$ is clearly

\|A_{3}\|=\mathcal{O}(d^{\alpha/4}N^{\alpha/2}).

(207)

For $A_{2}$ , let $q=\text{argmax}||i(j_{q^{\prime}})|-|i(k_{q^{\prime}})||$ . We use the inequality

\begin{split}\left|\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2}\right|&\leq\left|\|i(j)\|-\|i(k)\|\right|^{\alpha/2}\\ &\leq\||i(j)|-|i(k)|\|^{\alpha/2}\\ &\leq d^{\alpha/4}||i(j_{q})|-|i(k_{q})||^{\alpha/2}.\end{split}

(208)

Here the second line is the triangle inequality (and $|v|$ for a vector $v$ denotes its entrywise absolute), the third line is due to the definition of $q$ , and the first line can be proved by the following arguments: let $b\geq a\geq 0$ be two real number and $f(x)=x^{\alpha/2}$ , then $b^{\alpha/2}-a^{\alpha/2}=\int_{a}^{b}f^{\prime}(x)dx\leq\int_{0}^{b-a}f^{\prime}(x)dx=(b-a)^{\alpha/2}$ because $f^{\prime}(x)=\frac{\alpha}{2}x^{\alpha/2-1}$ is monotonically decreasing for $0<\alpha\leq 2$ . Notice that the Fourier coefficient $|\hat{c}_{l}|=\mathcal{O}(|l_{q^{\prime}}|^{-(2+d)})$ for any index $q^{\prime}$ , and when $|i(j_{q})-i(k_{q})|\leq N/2$ (c.f., $i(j_{q})-i(k_{q})>N/2$ or $i(j_{q})-i(k_{q})<-N/2$ ), $\hat{c}_{i(j)-i(k)}$ (c.f., $\hat{c}_{i(j)-i(k)-N}$ or $\hat{c}_{i(j)-i(k)+N}$ ) is the term with lowest frequency in $\hat{b}_{j-k}$ , We have

\begin{split}\sum_{k\in[N]^{d}}\left|\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2}\right||\hat{b}_{j-k}|&\leq d^{\alpha/4}\sum_{k\in[N]^{d}}||i(j_{q})|-|i(k_{q})||^{\alpha/2}|\hat{b}_{j-k}|\\ &\leq d^{\alpha/4}\sum_{k\in[N]^{d},m\in[N]^{d}}\|j-k+Nm\|_{\infty}^{\alpha/2}|\hat{c}_{j-k+Nm}|\\ &=\mathcal{O}(d^{\alpha/4}).\end{split}

(209)

Similarly, the summation over $j$ is also bounded by $\mathcal{O}(d^{\alpha/4})$ . These imply $\|A_{2}\|_{1}=\mathcal{O}(d^{\alpha/4})$ and $\|A_{2}\|_{\infty}=\mathcal{O}(d^{\alpha/4})$ , which further implies that

\|A_{2}\|=\mathcal{O}(d^{\alpha/4})

(210)

since $\|A\|^{2}\leq\|A\|_{1}\|A\|_{\infty}$ for any matrix $A$ . For $A_{1}$ , we use Equation 208 again that

\left(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2}\right)^{2}\leq d^{\alpha/2}||i(j_{q})|-|i(k_{q})||^{\alpha},

(211)

and

\|A_{1}\|=\mathcal{O}(d^{\alpha/2})

(212)

by the same argument for $A_{2}$ (with the only difference that we need the Fourier coefficient to decay as $|\hat{c}_{l}|=\mathcal{O}(|l_{q^{\prime}}|^{-(3+d)})$ ). Plugging Equations 212, 210 and 207 back to Equation 203 yields

\|[B,{C}]\|=\mathcal{O}(d^{\alpha/2}N^{\alpha/2}).

(213)

The second estimate $\|[B,[B,{C}]]\|=\mathcal{O}(d^{\alpha}N^{\alpha})$ can be proved in the same way by using $|\hat{c}_{l}|=\mathcal{O}(|l_{q^{\prime}}|^{-(5+d)})$ and noticing that

\|[B,[B,{C}]]\|=\|[D,[D,(\mathcal{F}^{-1})^{\otimes d}{C}(\mathcal{F})^{\otimes d}]]\|,

(214)

[D,[D,(\mathcal{F}^{-1})^{\otimes d}{C}(\mathcal{F})^{\otimes d}]]=\sum_{j,k\in[N]^{d}}\left(\|i(j)\|^{\alpha}-\|i(k)\|^{\alpha}\right)^{2}\hat{b}_{j-k}\ket{j_{0}\cdots j_{d-1}}\bra{k_{0}\cdots k_{d-1}},

(215)

and

\begin{split}&\quad(\|i(j)\|^{\alpha}-\|i(k)\|^{\alpha})^{2}\\ &=(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2})^{4}+4(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2})^{3}\|i(k)\|^{\alpha/2}+4(\|i(j)\|^{\alpha/2}-\|i(k)\|^{\alpha/2})^{2}\|i(k)\|^{\alpha}.\end{split}

(216)

∎

Appendix F Discretization error in the LCHS-IP method

Proof of 17.

Let $U(t;\xi)=\mathcal{T}e^{-i\int_{0}^{t}\xi(B+{C}(s))ds}=e^{-i\xi Bt}\mathcal{T}e^{-i\int_{0}^{t}H_{I}(s;\xi)ds}$ and $V(t;\xi)=\frac{1}{\pi(1+\xi^{2})}U(t;\xi)$ . The truncation error can be bounded by

\left\|\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}-\int_{-\Xi}^{\Xi}\frac{1}{\pi(1+\xi^{2})}U(T;\xi)d\xi\right\|\leq 2\int_{\Xi}^{\infty}\frac{d\xi}{\pi(1+\xi^{2})}=\frac{2}{\pi}(\frac{\pi}{2}-\arctan(\Xi))\leq\frac{2}{\pi\Xi}.

(217)

According to the standard quadrature error bound, we have

\left\|\mathcal{T}e^{-\int_{0}^{T}(B+{C}(s))ds}-\sum_{j=0}^{M-1}w_{j}U(T;\xi_{j})\right\|\leq\frac{2\Xi^{2}}{M}\sup_{\xi\in[-\Xi,\Xi]}\left\|\frac{\partial V}{\partial\xi}\right\|.

(218)

To estimate $\partial V/\partial\xi$ , we first compute $\partial U/\partial\xi$ . Differentiating Equation 79 yields

\frac{\partial}{\partial t}\frac{\partial U}{\partial\xi}=-i(B+{C}(t))U-i(\xi B+\xi{C}(t))\frac{\partial U}{\partial\xi},\quad\frac{\partial U}{\partial\xi}(0;\xi)=0.

(219)

By the variation of constants formula, we have

\frac{\partial U}{\partial\xi}(t;\xi)=\int_{0}^{t}\left(\mathcal{T}e^{-i\int_{s}^{t}(\xi B+\xi{C}(\tau))d\tau}\right)(-i)(B+{C}(s))U(s;\xi)ds,

(220)

and thus

\left\|\frac{\partial U}{\partial\xi}\right\|\leq T(\|B\|+\max\|{C}\|).

(221)

Therefore, by the product rule, we have

\begin{split}\left\|\frac{\partial V}{\partial\xi}\right\|&\leq\frac{2\pi|\xi|}{\pi^{2}(1+\xi^{2})}+\frac{1}{\pi(1+\xi^{2})}\left\|\frac{\partial U}{\partial\xi}\right\|\\ &\leq\frac{1}{\pi}\left(1+T(\|B\|+\max\|{C}\|)\right).\end{split}

(222)

Plugging this back into Equation 218 and Equation 217 gives the desired error bound, and the choices of $\Xi$ and $M$ directly follow from the error bound. ∎