Module for arbitrary controlled rotation in gate-based quantum algorithms

Shilu Yan School of Automation Science and Engineering, South China University of Technology, Guangzhou 510640, China Tong Dou School of Automation Science and Engineering, South China University of Technology, Guangzhou 510640, China Runqiu Shu School of Automation Science and Engineering, South China University of Technology, Guangzhou 510640, China Wei Cui [email protected] School of Automation Science and Engineering, South China University of Technology, Guangzhou 510640, China

Abstract

To assess whether a gate-based quantum algorithm can be executed successfully on a noisy intermediate-scale quantum (NISQ) device, both complexity and actual value of quantum resources should be considered carefully. Based on quantum phase estimation, we implemente arbitrary controlled rotation of quantum algorithms with a proposed modular method. The proposed method is not limited to be used as a submodule of the HHL algorithm and can be applied to more general quantum machine learning algorithms. Compared with the polynomial-fitting function method, our method only requires the least ancillas and the least quantum gates to maintain the high fidelity of quantum algorithms. The method theoretically will not influence the acceleration of original algorithms. Numerical simulations illustrate the effectiveness of the proposed method. Furthermore, if the corresponding diagonal unitary matrix can be effectively decomposed, the method is also polynomial in time cost.

I Introduction

Since Feynman showed his belief of that physics can be simulated with quantum computersFeynman (1982), quantum computing has developed over the last forty years. A variety of quantum algorithms that potentially give quantum speedups over classical computers have been proposed. Although building a universal quantum computer remains extremely difficult, researches underway at multiple technology companies, among IBM, Google, Rigetti, IonQ, Microsoft, Honeywell, D-wave and Origin Quantum, have led to a series of technological breakthroughs in building quantum computer systems. Nowadays, some platform has been provided for the public with access to cloud-based quantum computers.

Usually a complete gate-based quantum algorithm is modular, where each module has independent functions. Most gate-based quantum algorithms involve a module

\displaystyle\ket{\lambda}\mapsto Cf(\lambda)\ket{\lambda},

(1)

where $\ket{\lambda}\in(\mathbb{C}^{2})^{\otimes m},$ C is a normalizing constant and $f(\lambda)$ is an arbitrary function of $\lambda$ . The module can also be defined as quantum digital-to-analog conversion (QDAC) Mitarai et al. (2019), or the oracle discussed in Sec. V. Eq. (1) can be accomplished by performing a theoretical controlled rotation

\displaystyle U_{cr}:\ket{0}\ket{\lambda}\mapsto Cf(\lambda)\ket{1}\ket{\lambda}+\sqrt{1-(Cf(\lambda))^{2}}\ket{0}\ket{\lambda},

(2)

measuring the first qubit, and repeating the procedure until the measurement results in $\ket{1}$ . The famous example is the Harrow-Hassidim-Lloyd (HHL) algorithm Harrow et al. (2009), which is widely applied in quantum machine learning to solve the classical standard linear system of equations. The HHL algorithm requires a typical controlled rotation with $f(\lambda)=\lambda^{-1}$ . To implement the typical controlled rotation, Ref. Cao et al. (2013) first used the Newton’s iteration to approximate $\lambda^{-1}$ and then evaluated the arcsine function by a bisection search. Ref. Cong and Duan (2016) implemented the inversion and inverse trigonometric function with the Taylor series and Maclaurin series, respectively. Refs. Wang et al. (2020); Li et al. (2020) calculated the rotation angular coefficients with evaluating inverse trigonometric function in a recursive way by a binary expansion method. More efficiently, Refs. Häner et al. (2018); Vazquez et al. (2020) showed how to use piecewise polynomial approximation for arbitrary functions and provided a rigorous performance theory.

However, these polynomial-fitting methods still use a large number of qubits to implement the controlled rotation. If more bits of precision are required, the number can easily stretch to several hundred qubits. This will severely limit the implementation efficiency of the entire quantum algorithm and even inhibit the acceleration. The implementation of HHL algorithms are facing the challenge of limited quantum resources. The core idea of reducing the cost of qubits is combining the function outputs and quantum circuit design, where the Refs. Möttönen et al. (2004); Saeedi and Markov (2013); Shafaei et al. (2013) can be seen as the different concrete implementations of this kind of method. Following the core idea, we propose a modular method to achieve arbitrary controlled rotation based on phase estimation. Under the preconditions of appling the original algorithm, our method theoretically will not influence the acceleration of the original algorithm. Compared with the polynomial-fitting function method, our method only requires the least ancillas and the least quantum gates in some way. Furthermore, if the corresponding diagonal unitary matrix can be effectively decomposed, our method is also polynomial in time cost.

The paper is organized as follows. Section II describes the proposed controlled rotation module based on phase estimation. In Section III, we present different applications of the controlled rotation module. Section IV verifies our methods with numerical simulations. Section V are analysis and discussions. Conclusions are presented in Section VI.

Notation. The bold italic $\boldsymbol{i}$ represents the imaginary unit of the complex field. $FT^{\dagger}$ is the inverse quantum Fourier transform. The $R_{z}(\alpha)$ gate is defined as $R_{z}(\alpha)=\left[\begin{matrix}1&0\\ 0&e^{2\pi\boldsymbol{i}\alpha}\end{matrix}\right]$ .

II The controlled roation module based on phase estimation

Refer to caption — Figure 1: The controlled rotation module based on phase estimation.

In the Eq. (2), $\ket{\lambda}$ stores a binary representation of numbers, specifically, the eigenvalues of a matrix. To find a circuit performing the transformation $U_{cr}$ , it can be divided into two parts: a unitary operator

\displaystyle U_{f}:\ket{\lambda}\ket{0}^{\otimes m}\mapsto\ket{\lambda}\ket{\dfrac{1}{2}arcsin(Cf(\lambda))},{}

(3)

and a standard controlled $R_{y}$ rotation

\displaystyle R_{y}(2\theta)=\left[\begin{matrix}\sqrt{1-{{\left({Cf\left(\lambda\right)}\right)}^{2}}}&-Cf\left(\lambda\right)\\ Cf\left(\lambda\right)&\sqrt{1-{{\left({Cf\left(\lambda\right)}\right)}^{2}}}\end{matrix}\right],{}

(4)

where $\theta=\dfrac{1}{2}{\arcsin\left({Cf\left(\lambda\right)}\right)}$ . The quantum circuit for the proposed controlled rotation module is presented in Fig. 1. The unitary $D$ is a diagonal operator, whose diagonal elements are $e^{{2\pi\boldsymbol{i}}\theta}$ . Note that the unitary operator $D$ has an eigenvector $\ket{\lambda}$ with eigenvalue $e^{{2\pi\boldsymbol{i}}\theta}$ , where the $\theta$ is the most important factor in the generalization of the circuit. Eq. (3) can be seen as the procedure of phase estimation, which is used to approximate the eigenvalues of a discretized matrix and entangle the states encoding the eigenvalues with the corresponding eigenstates Luis and Peřina (1996).

The controlled rotation procedure uses two registers and an ancilla. Both registers contain $m$ qubits with $Reg.E$ in the state $\ket{\lambda}$ obtained by previous step and $Reg.A$ initially in $\ket{0}^{\otimes m}$ . The number $m$ depends on two factors: one is the number of digits of accuracy in the estimation for $\theta$ , and the other is the probability of successful estimation Nielsen and Chuang (2010). The circuit in Fig. 1 begins by applying a Hadamard transform to the $Reg.A$ , followed by an application of controlled- $D^{2^{i}}$ operations, where $i=0,1,\dots,m-1$ . Then the $Reg.A$ will be the state

\frac{1}{{\sqrt{{2^{m}}}}}\sum\limits_{k=0}^{{2^{m}}-1}{{e^{2\pi\boldsymbol{i}\theta k}}\left|k\right\rangle}.

Applying the inverse quantum Fourier transform, $Reg.A$ becomes $\ket{\tilde{\theta}}$ , which is an approximation of $\ket{\theta}$ . After the controlled $R_{y}(2\tilde{\theta})$ rotations acting on the ancilla $\ket{0}$ , we obtain the state

Cf(\lambda)\ket{1}\ket{\lambda}+\sqrt{1-(Cf(\lambda))^{2}}\ket{0}\ket{\lambda}.

Finally if the ancilla is $\ket{1}$ , we have $Cf(\lambda)\ket{\lambda}$ .

III Some Applications

III.1 Applications in HHL algorithm

The problem of linear system can be written as

\displaystyle Ax=b,

(5)

where $A\in R^{N\times N}$ , $x,b\in R^{N},N=2^{n}.$ $A$ can be represented by an $N\times N$ Hermitian matrix with a spectral decomposition of $A=\lambda_{j}\ket{u_{j}}\bra{u_{j}}$ , where $\lambda_{j}$ is the $j$ th eigenvalue of matrix $A$ with the corresponding eigenvector $\ket{u_{j}}$ . For convenience, we can assume that $b$ and $x$ are normalized and map them to the quantum states $\ket{b}$ and $\ket{x}$ respectively. The overview of the quantum circuit for HHL algorithm is presented in Fig. 2 and a simple description is listed as follows. Note that we have assumed that all computations have been exact. The initial state of $Reg.E$ and $Reg.B$ is

\displaystyle\ket{0}^{\otimes m}\ket{b}=\sum_{j=0}^{N-1}b_{j}\ket{0}^{\otimes m}\ket{j}=\sum_{j=0}^{N-1}\beta_{j}\ket{0}^{\otimes m}\ket{u_{j}}.

(6)

(i) Apply phase estimation with

\displaystyle{e^{\boldsymbol{i}2\pi A}}=\sum\limits_{j=0}^{N-1}{{e^{\boldsymbol{i}2\pi{\lambda_{j}}}}\left|{{u_{j}}}\right\rangle\left\langle{{u_{j}}}\right|},

(7)

where we can assume without loss of generality that the evolution time $t=2\pi$ . The registers become the state

\displaystyle\sum\limits_{j=0}^{N-1}{{\beta_{j}}\left|{{\tilde{\lambda}_{j}}}\right\rangle\left|{{u_{j}}}\right\rangle}.

(8)

where $\tilde{\lambda}_{j}$ is an approximation of $\lambda_{j}$ .

(ii) Add an ancilla qubit and perform the operation in Eq. (2), we have

\displaystyle\sum\limits_{j=0}^{N-1}{\left({\sqrt{1-{{\left({C{\tilde{\lambda}_{j}}^{-1}}\right)}^{2}}}\left|0\right\rangle+C{\tilde{\lambda}_{j}}^{-1}\left|1\right\rangle}\right){\beta_{j}}\left|{{\tilde{\lambda}_{j}}}\right\rangle\left|{{u_{j}}}\right\rangle}.

(9)

(iii) Apply the inverse of all operations before $R_{y}$ . If the eigenvalues are perfectly estimated, the result would be

\displaystyle\sum\limits_{j=0}^{N-1}{\left({\sqrt{1-{{\left({C{\tilde{\lambda}_{j}}^{-1}}\right)}^{2}}}\left|0\right\rangle+C{\tilde{\lambda}_{j}}^{-1}\left|1\right\rangle}\right){\beta_{j}}{{\left|{\rm{0}}\right\rangle}^{\otimes m}}\left|{{u_{j}}}\right\rangle}.

(10)

(iv) Perform the measurement on the ancilla qubit, if the ancilla qubit is in the state $\ket{1}$ , then the register $B$ becomes the state

\displaystyle\left|{x^{{}^{\prime}}}\right\rangle=\frac{1}{{\sqrt{\sum{{{\left({C{\tilde{\lambda}_{j}}^{-1}{\beta_{j}}}\right)}^{2}}}}}}\sum\limits_{j=0}^{N-1}{C{\tilde{\lambda}_{j}}^{-1}{\beta_{j}}\left|{{u_{j}}}\right\rangle},

(11)

where $\left|{x^{{}^{\prime}}}\right\rangle$ is an approximation of $\left|x\right\rangle$ . The steps (ii) can be seen as the procedure of controlled rotation implemented by the method in Sec. II. The elements of diagonal operator $D_{f}$ are $e^{{2\pi\boldsymbol{i}}\theta_{j}}$ , with $\theta_{j}=\frac{1}{2}\arcsin\left({C{\lambda_{j}^{-1}}}\right)$ .

III.2 Applications in HHL-based QML algorithms

The HHL algorithm is of great inspiration to the design of quantum machine learning (QML) algorithms. The algorthms that utilize HHL algorithm as a subroutine or inspired by HHL algorithm can be defined as HHL-based QML algorithms. By changing the value of $\theta$ in the diagonal element $e^{{2\pi\boldsymbol{i}}\theta}$ of the unitary diagonal matrix, our proposed controlled rotation can be applied to the following HHL-based QML algorithms.

QSVT:

Based on the singular value decomposition of a matrix, singular value thresholding (SVT) is a fundamental core module in computer vision and machine learning. The quantum SVT (QSVT) algorithm provides an exponential speed improvement over the classical algorithm Duan et al. (2018). The QSVT algorithm involves a controlled rotation

\displaystyle\begin{array}[]{l}U_{QSVT}:\left|0\right\rangle\left|\lambda_{j}\right\rangle\\ \mapsto\left({\frac{{\gamma\left({\sqrt{\lambda}_{j}-\tau}\right)}}{{\sqrt{\lambda}_{j}}}\left|1\right\rangle+\sqrt{1-\frac{{{\gamma^{2}}{{\left({\sqrt{\lambda}_{j}-\tau}\right)}^{2}}}}{\lambda_{j}}}\left|0\right\rangle}\right)\left|\lambda_{j}\right\rangle\end{array}.

(14)

where $\gamma$ and $\tau$ are constants. If the method in Sec. II is applied to implement $U_{QSVT}$ , $\theta_{j}$ in the corresponding diagonal operator included in the phase estimation should be $\dfrac{1}{2}{\arcsin\left(\frac{{\gamma\left({\sqrt{\lambda}_{j}-\tau}\right)}}{{\sqrt{\lambda}_{j}}}\right)}$ .

QAOP:

Ref. Duan et al. (2019) proposed a quantum A-optimal projection (QAOP) algorithm that can be used to speed up the learning process of an important dimensionality reduction algorithm in pattern recognition and machine learning. The QAOP algorithm assumes that $\sigma$ is the singular value of an input matrix $\tilde{X}$ and $\beta^{(i)}$ is the singular value of the projection matrix $A^{(i)}$ , which is computed by the $i$ th iteration of the QAOP algorithm. The QAOP algorithm mainly includes phase estimation and a controlled rotation

{U_{QAOP}}:\left|0\right\rangle\left|{{\sigma_{j}^{2}}}\right\rangle\left|{{{\left({\beta_{j}^{\left({i-1}\right)}}\right)}^{2}}}\right\rangle\mapsto\left[{\rho\left({1+\frac{{{\lambda_{2}}}}{{{{\left({{\sigma_{j}}\beta_{j}^{\left({i-1}\right)}}\right)}^{2}}}}}\right)\left|1\right\rangle+\sqrt{1-{\rho^{2}}{{\left({1+\frac{{{\lambda_{2}}}}{{{{\left({{\sigma_{j}}\beta_{j}^{\left({i-1}\right)}}\right)}^{2}}}}}\right)}^{2}}}\left|0\right\rangle}\right]\left|{\sigma_{j}^{2}}\right\rangle\left|{{{\left({\beta_{j}^{\left({i-1}\right)}}\right)}^{2}}}\right\rangle,

(15)

where $\rho<1$ is used for normalization of the quantum state and $\lambda_{2}$ is a small regularization coefficient. In this case, to implement $U_{QAOP}$ , $\theta_{j}$ of the corresponding diagonal operator should be $\dfrac{1}{2}arcsin\left(\rho\left({1+\frac{{{\lambda_{2}}}}{{{{\left({{\sigma_{j}}\beta_{j}^{\left({i-1}\right)}}\right)}^{2}}}}}\right)\right)$ .

QKPCA:

The Ref. Li et al. (2020) illustrated an architecture to simulate the arbitrary nonlinear kernels and further proposed the quantum kernel principal component analysis (QKPCA) algorithm, which also involved a controlled rotation

{U_{QKPCA}}:\left|0\right\rangle\left|\lambda_{j}\right\rangle\mapsto\left({\frac{1}{{\sqrt{\lambda}_{j}}}\left|1\right\rangle+\sqrt{1-\frac{1}{\lambda_{j}}}\left|0\right\rangle}\right)\left|\lambda_{j}\right\rangle,

(16)

where $\lambda_{j}$ is supposed to be larger than $1$ in this situation. Here $\theta_{j}$ of the corresponding diagnal operator prepared for $U_{QKPCA}$ should be $\dfrac{1}{2}arcsin\left(\frac{1}{{\sqrt{\lambda}_{j}}}\right)$ .

III.3 Other Applications

The encoding of the classical data can be maily categorized into two types: analog encoding (or called amplitude encoding), which encodes data into amplitudes of a state, and digital encoding (or called basis encoding), where the data are stored as qubit strings. The former is the basic condition for quantum acceleration, whereas the latter is required to perform arithmetics on quantum devices. Ref. Mitarai et al. (2019) has proposed algorithms that convert these two encodings to one another, where the proposed algorithms are seriously based on the fact that the transformation

\displaystyle\ket{x}\ket{0}^{\otimes m}\mapsto\ket{x}\ket{f\left(x\right)}

(17)

can be performed effectively. However, Ref. Mitarai et al. (2019) still omits the details of Eq. (17), which is a small but not negligible part of their proposed algorithms. The method in Sec. II can just complement this technical detail without considering the complexity. Thus the controlled rotation module can also be applied in arbitrary nonlinear transformations of amplitudes of a quantum state, data loading and quantum state preparation Mitarai et al. (2019).

Moreover, Eq. (17) has been used as an important part in implementing quantum neurons to merge the nonlinear, dissipative dynamics of neural computing into the linear, unitary quantum system Schuld et al. (2015); de Paula Neto et al. (2020); Yan et al. (2020). The diagonal matrix has stored a set of weights, inner products or activation function values in its diagonal elements.

IV Experiments

As the HHL algorithm is a ”proof of concept”, the implementation of it has become an important task. Although Refs. Cai et al. (2013); Pan et al. (2014); Barz et al. (2014); Subaş ı et al. (2019) has used several qubits to verify the correctness of the HHL algorithm, the stability and practicability of the HHL algorithm running on a universal quantum computer are still unknown. In this section, we will use numerical simulations to illustrate the influence of controlled rotation on the fidelity of implementing the HHL algorithm. We use the controlled variable method to explore the effect of the following four factors on the fidelity of implementing the HHL algorithm: the scaling factor $s$ , the condition number $\kappa$ of $A$ , the dimension $2^{n}$ of $A$ and the number of output qubits $m$ . The process of simulation experiments is listed as follows.

(i) Randomly generate Hermitian matrices with uniform distribution of eigenvalues in $(0,1]$ . Here, we have assumed all the generated Hermitian matrices are row-sparse and row-computable. To ensure the generality of simulations, the eigenvalues of each simulation are re-generated randomly.

(ii) Take $m$ decimal places of the binary eigenvalues, then we can get an approximation $\tilde{\lambda}$ . In the simulation, all $\tilde{\lambda}$ equal to 0 will be changed to $\underbrace{0\cdots 0}_{m-1}1$ as the object of inversion cannot be 0.

(iii) Caculate $\arcsin\left({C{{\tilde{\lambda}}^{-1}}}\right)$ and intercept its binary form by $m$ bits, we have an approximation $2\tilde{\theta}\approx 2\theta=\arcsin\left({C{{\lambda}^{-1}}}\right)$ .

(iv) Caculate $sin\tilde{\theta}$ and intercept its binary form by $m$ bits, we obtian the simulation result $\widetilde{C{\lambda^{-1}}}$ , which is an approximation of the theoretical result ${C{\lambda^{-1}}}$ .

Noting that the largest eigenvalue of $A$ is always $1$ , we have $\kappa=1/\lambda_{min}$ . Since $C\in O(1/\kappa)$ , the scaling factor $s\in(0,1)$ satisfies $C=s\times\lambda_{min}$ . We define the fidelity of implementing the HHL algorithm as $\bra{x^{{}^{\prime}}}x\rangle$ .

Fig. 3(a) shows the result of fidelity varying with $m$ and $s$ under the condition of $n=10$ and $\kappa=10$ . We observe that as the parameter $s$ gets closer to 1,the fidelity increases. Fig. 3(b) shows the result of fidelity varying with $m$ and $n$ under the condition of $s=0.99$ and $\kappa=10$ . We come to a conclusion that smaller $n$ represents better performance in fidelity, although the improvement is subtle. Fig. 3(c) shows the result of fidelity varying with $m$ and $\kappa$ under the condition of $s=0.99$ and $n=10$ . We conclude that the larger the parameter $\kappa$ , the larger the number of qubits $m$ required to maintain high fidelity.

V Analysis and Disscusions

V.1 Complexity

Taking the HHL algorithm as an example, we will illustrate that the controlled rotation will not influence the acceleration of the quantum algorithm. Based on phase estimation and amplitude amplification, the details of the complexity and error analysis of HHL algorithm have been shown in Ref. Harrow et al. (2009). $e^{iAt}$ can be simulated in time $O($ log $(N)d^{2}t)$ Berry et al. (2007), where $d$ is the sparseness of $A$ . If the phase estimation involving $e^{iAt}$ errs by $O(1/t)$ , which translates into a relative error of $O(1/\lambda t)$ in $\lambda^{-1}$ . The HHL algorithm has considered the complexity of the controlled rotation without a specific implementation. If $\lambda\geq 1/\kappa$ taking $t=O(\kappa/\varepsilon)$ induces a final error of $\varepsilon$ . Considering all of these above, the running time of HHL algorithm turns to $O({{\log\left(N\right){d^{2}}{\kappa^{2}}}\mathord{\left/{\vphantom{{\log\left(N\right){d^{2}}{\kappa^{2}}}\varepsilon}}\right.\kern-1.2pt}\varepsilon})$ , which offers an exponential speedup over the fastest classical algorithm: the conjugate gradient method. The conjugate gradient method runs in $O(N\kappa)$ (or $O(N{\sqrt{\kappa}})$ for positive semidefinite matrices). Typically, the condition number $\kappa$ is taken as $O({1\mathord{\left/{\vphantom{1\varepsilon}}\right.\kern-1.2pt}\varepsilon})$ , where $\varepsilon={2^{-m}}$ . In this case, the running time of HHL algorithm increases exponentially with the parameter $m$ , which is one of four caveats for HHL proposed in Ref. Aaronson (2015). These caveats can be crucial in practice. Thus the speed-up of HHL algorithm must be based on that both $\kappa$ and ${1\mathord{\left/{\vphantom{1\varepsilon}}\right.\kern-1.2pt}\varepsilon}$ are poly $\log\left(N\right)$ .

In a similar way, even if the running time of the controlled rotation is $O(2^{m})$ , it would be translated into $O(poly({1\mathord{\left/{\vphantom{1\varepsilon}}\right.\kern-1.2pt}\varepsilon}))$ . The controlled rotation would not influence the acceleration of original algorithms theoretically.

V.2 Comparison

In this section, we analyze the space (circuit width or the qubit complexity) and time (circuit depth or the gate complexity) resources required for different methods of the controlled rotation module. As any unitary operation can be decomposed into elementary gates Barenco et al. (1995), the elementary gate library is universal. We assume that any single- and two-qubit gate can be seen as elementary gates. To analyze the gate complexity, we use elementary gate complexity. While talking about the actual value, we use elementary gates and Toffoli gates. In the previous works, the common used methods that can construct any controlled rotation are mainly divided into two types: polynomial-fitting function method and measurement-based method. Table. 1 shows the comparisons of these different methods.

The polynomial-fitting function method includes Newton iteration,Taylor series, quantum function value binary expansion (QFBE) method, and so on. The basic principle of the polynomial-fitting function method is to transform the function calculation into quantum version of multiplication and addition, which has been studied in Refs. Saeedi and Markov (2013); Vedral et al. (1996); Beckman et al. (1996); Chuang and Modha (2000); Draper (2000); Cuccaro et al. (2004); Takahashi and Kunihiro (2005); Draper et al. (2006); Álvarez-Sánchez et al. (2008); Takahashi et al. (2010); Rieffel and Polak (2011). Reversibility of any unitary operation effects that addition and multiplication cannot be directly deduced from their classical Boolean counterparts. Ref. Vedral et al. (1996) first showed that the original addition of two registers $\ket{a}$ and $\ket{b}$ can be written as $\left|{a,b,0}\right\rangle\mapsto\left|{a,b,a+b}\right\rangle$ or $\left|{a,b}\right\rangle\mapsto\left|{a,a+b}\right\rangle$ . If both $a$ and $b$ are encoded on $m$ qubits, implementing an adder costs $3m+1$ qubits, $20m+2$ CNOT gates and $20m-10$ Toffoli gates. Based on adder, multiplier is computationally more expensive, where a multiplier of $a$ with $b$ can be thought of as adding $a$ to itself $b$ times. In this way, multiplier takes more qubits and quantum gates to complete the multiplicative task. Thus the polynomial-fitting function method is not a resource-saving method at some times. Recently, Ref. Wang et al. (2020) develops the QFBE method to evaluate the transcendental functions in a recursive way. Totally, the QFBE method uses $34{m^{3}}-16{m^{2}}+4m$ quantum gates to implement the controlled rotaion.

The measurement-based method can be divided into two types: one is the RUS arithmetic (or function synthesis) Wiebe and Roetteler (2016) and the other is classical computing Lee et al. (2019). Focused on optimizing space, RUS arithmetic encodes numbers in the amplitudes of a qubit, or more properly as polar angles on the Bloch sphere. The core idea is utilizing measurement to implement nonlinear mappings between sets of input and output rotation angles. However, RUS arithmetic has not discussed an important problem that how many repetitions of these circuits are needed before a successful result is observed with high probability. Classical computing directly analyzes measurement outcomes from $\ket{\lambda}$ by means of classical computers. Based on the analyzed data, a simpler circuit implementation of the original part can be built together with reducing the circuit depth and space. Similarly, classical computing does not solve the problem of the number of measurements. The measurement-based method generally requires polynomial time and space complexity.

The idea of our method in Sec. II is fully combining the function outputs and quantum circuit design. Containing $2^{m}$ parameters, the accurate evaluation of diagonal unitary operations might be the most resourceintensive element. For space optimizing, we need to use the decomposition method to realize the diagonal operation $D$ . Alternating controlled not gates and $R_{z}$ rotations, the best-known compiling algorithm provided circuits of size $2^{m}-3$ for arbitrary $m$ -qubit diagonal unitaries Bullock and Markov (2004). As the result of the power exponent of the diagonal matrix is also diagonal, $2^{m}-3$ Toffoli gates and controlled- $R_{z}$ gates are also enough for implementing controlled $D^{2^{i}}$ gates. An efficient implementation of the phase estimation includes $O(m^{2})$ elementary gates for an inverse quantum Fourier transform and one call to controlled $D^{2^{i}}$ . That is it totally needs $m\left({{2^{m+1}}-3}\right)+m\left({m+1}\right)/2+m$ single- and two-qubit gates and Toffoli gates.

Table 1: Comparisons of different methods.

Qubit complexity

Gate complexity

Newton iteration

O(m^{3})

O(m^{4})

QFBE

O(m^{2})

O(m^{3})

Measurement-based

O(poly(m))

O(poly(m))

Our method

O(m)

O(m2^{m})

Fig. 3(d) shows the total number of quantum gates that are needed to implement the controlled rotation in HHL algorithm with two different methods. When $m$ is less than 11, the number of quantum gates required by our method is less than QFBE. At this time, the HHL algorithm using our method can already make the fidelity of inversing the matrices with $\kappa\leq 1000$ close to 1, where the condition number 1000 already contains most of the matrices that need to be inverted in practice. Thus, our method only requires the least ancillas or the least quantum gates to maintain the high fidelity of quantum algorithms.

We define the diagonal unitary operator as

\displaystyle D=diag\left\{{{e^{2\pi\boldsymbol{i}\cdot{\theta_{0}}}},{{e^{2\pi\boldsymbol{i}\cdot{\theta_{1}}}}},\cdots,{e^{2\pi\boldsymbol{i}\cdot{\theta_{{2^{m}}-1}}}}}\right\},

(18)

where $\theta_{i}\in(0,1]$ and $i\in\{0,1,\cdots,2^{m}-1\}$ . Supposing the values of all $\theta_{i}$ are known in order, then the controlled- $D$ is written as

\displaystyle CD=C^{m}R_{z}(\theta_{0})\otimes\cdots\otimes C^{m}R_{z}(\theta_{2^{m}-1}).

(19)

That is the $CD$ can be implemented with $2^{m}$ different $C^{m}R_{z}$ gates, where a $C^{m}R_{z}$ gate can be simulated by $O(m)$ gates with one auxiliary qubit Barenco et al. (1995). If the number of different $\theta_{i}$ is $O(poly(m))$ , the gate complexity would be $O(poly(m))$ . In this way, our method would be more competitive.

V.3 The Threat of Oracle Expansion

Quantum oracle, or called black box, is an important tool used in quantum computation. Many quantum algorithms would like to use the oracle as a part of themselves, such as phase flip of target state in Grover’s algorithm, modular exponentiation in Shor’s algorithm and contrlled rotation in HHL algorithm. The focuses of designing a quantum algorithm are complexity and error analysis. Most papers deal with a quantum algorithm but omit the details of implementation. Each oracle is implemented by a quantum algorithm itself. When an oracle is used to implemente an algorithm, it often significantly contributes to the depth and width of the corresponding algorithm: the depth of an algorithm is the number of gates to be performed sequentially, and its width is the number of qubits it actually manipulates. Although oracles are usually considered effective, they may affect the efficiency of implementing the algorithm within limited conditions. To evaluate whether a gate-based algorithm can be implemented and executed on a particular NISQ device, we have to carefully consider the exact value and complexity of the width and depth of a circuit.

Ref. Leymann and Barzen (2020) has discussed different impact factors of algorithm implementations and drawn a conclusion: a quantum algorithm using oracles (i.e. black box functions in general) must be modified by expanding these functions as quantum circuits themselves and substituting these functions by the corresponding circuits. For modular gate-based algorithms, each module will affect the implementation of the entire algorithm. Existing generic constructions focus on the complexity of the circuit depth rather than actual value. In theory, the polynomial algorithm is better than the exponential algorithm. However, time cost of the exponential algorithm in a certain range will be less, when the factor of the polynomial is large. The result in Fig. 3(d) validates our point.

Although our method allows to reduce the quantum resources in some way, the resulting circuits are still quite expensive, especially in terms of the number of required quantum gates. At present, there is no satisfactory method to realize arbitrary controlled rotation, then the acceleration of some quantum algorithm may be inhibited.

VI Conclusions

In conclusion, our work implements arbitrary controlled rotation module for HHL algorithm. The proposed method can also be applied in HHL-based quantum algorithms and quantum machine learning. The analysis show that the method theoretically makes the quantum algorithms have high fidelity without influencing the acceleration of original algorithms. Compared with the polynomial-fitting function method or some other methods, our method only requires the least ancillas or the least quantum gates in some way. Numerical simulations illustrate the effectiveness of the proposed method. If the corresponding diagonal unitary matrix can be effectively decomposed, the method is also polynomial in time cost. We highlight a fact that there is still no satisfactory method to realize arbitrary controlled rotation. We also hope that there can be more researches in the implementation of quantum algorithms to further reduce the cost of arbitrary controlled rotation. In the future, we will further optimize the quantum circuit to make the gate-based quantum algorithms more forcefully demonstrate its superiority.

Acknowledgements.

This work was supported by the National Natural Science Foundation of China under Grant 61873317 and in part by the Guangdong Basic and Applied Basic Research Foundation under Grant 2020A1515011375.

References

Feynman (1982) R. P. Feynman, Int. J. Theor. Phys 21 (1982).
Mitarai et al. (2019) K. Mitarai, M. Kitagawa, and K. Fujii, Phys. Rev. A 99, 012301 (2019).
Harrow et al. (2009) A. W. Harrow, A. Hassidim, and S. Lloyd, Phys. Rev. Lett. 103, 150502 (2009).
Cao et al. (2013) Y. Cao, A. Papageorgiou, I. Petras, J. Traub, and S. Kais, New Journal of Physics 15, 013021 (2013).
Cong and Duan (2016) I. Cong and L. Duan, New Journal of Physics 18, 073011 (2016).
Wang et al. (2020) S. Wang, Z. Wang, W. Li, L. Fan, Z. Wei, and Y. Gu, Quantum Information Processing 19, 1 (2020).
Li et al. (2020) Y. Li, R.-G. Zhou, R. Xu, W. Hu, and P. Fan, Quantum Science and Technology 6, 014001 (2020).
Häner et al. (2018) T. Häner, M. Roetteler, and K. M. Svore, arXiv preprint arXiv:1805.12445 (2018).
Vazquez et al. (2020) A. C. Vazquez, R. Hiptmair, and S. Woerner, arXiv preprint arXiv:2009.04484 (2020).
Möttönen et al. (2004) M. Möttönen, J. J. Vartiainen, V. Bergholm, and M. M. Salomaa, Phys. Rev. Lett. 93, 130502 (2004).
Saeedi and Markov (2013) M. Saeedi and I. L. Markov, ACM Computing Surveys (CSUR) 45, 1 (2013).
Shafaei et al. (2013) A. Shafaei, M. Saeedi, and M. Pedram, in 2013 Design, Automation & Test in Europe Conference & Exhibition (DATE) (IEEE, 2013), pp. 1235–1240.
Luis and Peřina (1996) A. Luis and J. Peřina, Phys. Rev. A 54, 4564 (1996).
Nielsen and Chuang (2010) M. A. Nielsen and I. Chuang, Quantum Computation and Quantum Information (Cambridge University Press, Cambridge, UK, 2010).
Duan et al. (2018) B. Duan, J. Yuan, Y. Liu, and D. Li, Phys. Rev. A 98, 012308 (2018).
Duan et al. (2019) B. Duan, J. Yuan, J. Xu, and D. Li, Phys. Rev. A 99, 032311 (2019).
Schuld et al. (2015) M. Schuld, I. Sinayskiy, and F. Petruccione, Physics Letters A 379, 660 (2015).
de Paula Neto et al. (2020) F. M. de Paula Neto, T. B. Ludermir, W. R. de Oliveira, and A. J. da Silva, IEEE Transactions on Neural Networks and Learning Systems 31, 3741 (2020).
Yan et al. (2020) S. Yan, H. Qi, and W. Cui, Phys. Rev. A 102, 052421 (2020).
Cai et al. (2013) X.-D. Cai, C. Weedbrook, Z.-E. Su, M.-C. Chen, M. Gu, M.-J. Zhu, L. Li, N.-L. Liu, C.-Y. Lu, and J.-W. Pan, Phys. Rev. Lett. 110, 230501 (2013).
Pan et al. (2014) J. Pan, Y. Cao, X. Yao, Z. Li, C. Ju, H. Chen, X. Peng, S. Kais, and J. Du, Phys. Rev. A 89, 022313 (2014).
Barz et al. (2014) S. Barz, I. Kassal, M. Ringbauer, Y. O. Lipp, B. Dakić, A. Aspuru-Guzik, and P. Walther, Scientific Reports 4, 6115 (2014).
Subaş ı et al. (2019) Y. b. u. Subaş ı, R. D. Somma, and D. Orsucci, Phys. Rev. Lett. 122, 060504 (2019).
Berry et al. (2007) D. W. Berry, G. Ahokas, R. Cleve, and B. C. Sanders, Communications in Mathematical Physics 270, 359 (2007).
Aaronson (2015) S. Aaronson, Nature Physics 11, 291 (2015).
Barenco et al. (1995) A. Barenco, C. H. Bennett, R. Cleve, D. P. DiVincenzo, N. Margolus, P. Shor, T. Sleator, J. A. Smolin, and H. Weinfurter, Phys. Rev. A 52, 3457 (1995).
Vedral et al. (1996) V. Vedral, A. Barenco, and A. Ekert, Phys. Rev. A 54, 147 (1996).
Beckman et al. (1996) D. Beckman, A. N. Chari, S. Devabhaktuni, and J. Preskill, Phys. Rev. A 54, 1034 (1996).
Chuang and Modha (2000) I. L. Chuang and D. S. Modha, IEEE Transactions on Information Theory 46, 1104 (2000).
Draper (2000) T. G. Draper, arXiv preprint quant-ph/0008033 (2000).
Cuccaro et al. (2004) S. A. Cuccaro, T. G. Draper, S. A. Kutin, and D. P. Moulton, arXiv preprint quant-ph/0410184 (2004).
Takahashi and Kunihiro (2005) Y. Takahashi and N. Kunihiro, Quantum Info. Comput. 5, 440–448 (2005).
Draper et al. (2006) T. G. Draper, S. A. Kutin, E. M. Rains, and K. M. Svore, Quantum Info. Comput. 6, 351–369 (2006).
Álvarez-Sánchez et al. (2008) J. J. Álvarez-Sánchez, J. V. Álvarez-Bravo, and L. M. Nieto, Journal of Physics: Conference Series 128, 012013 (2008).
Takahashi et al. (2010) Y. Takahashi, S. Tani, and N. Kunihiro, 10, 872–890 (2010).
Rieffel and Polak (2011) E. G. Rieffel and W. H. Polak, Quantum computing: A gentle introduction (MIT Press, 2011).
Wiebe and Roetteler (2016) N. Wiebe and M. Roetteler, Quantum Info. Comput. 16, 134 (2016).
Lee et al. (2019) Y. Lee, J. Joo, and S. Lee, Scientific Reports 9, 4778 (2019).
Bullock and Markov (2004) S. S. Bullock and I. L. Markov, Quantum Inf. Comput. 4, 27 (2004).
Leymann and Barzen (2020) F. Leymann and J. Barzen, Quantum Science and Technology 5, 044007 (2020).