Classically-Boosted Variational Quantum Eigensolver

Maxwell D. Radin Peter Johnson

(Zapata Computing, Inc., 100 Federal St., Boston, MA 02110, USA)

Abstract

The ability of near-term quantum computers to represent classically-intractable quantum states has brought much interest in using such devices for estimating the ground and excited state energies of fermionic Hamiltonians. The usefulness of such near-term techniques, generally based on the Variational Quantum Eigensolver (VQE), however, is limited by device noise and the need to perform many circuit repetitions. This paper addresses these challenges by generalizing VQE to consider wavefunctions in a subspace spanned by classically tractable states and states that can be prepared on a quantum computer. The manuscript shows how the ground and excited state energies can be estimated using such “classical-boosting” and how this approach can be combined with VQE Hamiltonian decomposition techniques. Unlike existing VQE approaches, the sensitivity to sampling error and device noise approaches zero in the limit where the classically tractable states are able to describe an eigenstate. A detailed analysis of the measurement requirements in the simplest case, where a single computational basis state is used to boost conventional VQE, shows that the ground-state energy estimation of several closed-shell homonuclear diatomic molecules can be accelerated by a factor of approximately 10-1000. The analysis also shows that the measurement reduction of such single basis state boosting, relative to conventional VQE, can be estimated using only the overlap between the ground state and the computational basis state used for boosting.

1 Introduction

Due to recent advances in gate-model quantum computing, there has been much interest in using near-term quantum computers for modeling electronic structure. This interest has largely focused on the estimation of the ground state energy, mainly through the variational quantum eigensolver (VQE) [22, 4].

Despite great interest in VQE, its utility is limited by the error introduced by device noise as well as the large number of circuit repetitions required to achieve sufficient precision in energy estimation. Error from device noise is exacerbated by the large numbers of gates required by useful problem instances. For example, Kühn et al. found that millions of two-qubit gates may be required to reach chemical accuracy for reactions involving small molecules [14]. Even if devices had sufficient fidelity to perform such a large number of operations coherently, Gonthier et al. [7] found that VQE, as it is conventionally formulated, would still be unlikely to yield an advantage in run time over classical algorithms for typical computational chemistry problems. Advanced estimation techniques such as Quantum Phase Estimation [1] and enhanced sampling [29] can dramatically improve the run time, but only with the availability of high fidelity quantum devices.

A number of studies have recently proposed improvements to VQE based on subspace expansion methods [18, 17, 26, 28, 15]. These methods consider wavefunctions in a subspace derived from a quantum state that can be prepared on a quantum computer. For example, McClean et al. proposed a method considering the subspace spanned by excitation operators applied to the state prepared on a quantum computer and found that this approach could mitigate errors from device noise [18, 17]. This was extended to the virtual quantum subspace expansion, wherein the inclusion of excitations into virtual orbitals allows for a reduction in the number of required qubits [26, 28]. The translation quantum-subspace expansion instead considers the application of translation operators to the state prepared on the quantum computer and can reduce the qubit and fidelity requirements for estimating the ground-state energy of periodic Hamiltonians [15].

In addition to these subspace expansion methods, a number of recent studies have proposed techniques to leverage classical approximations to electronic structure problems to enhance VQE. For example, Kottman et al. explored the use of multiresolution analysis [13] to reduce the qubit requirements by optimizing the corresponding orbitals. Kirby et al. proposed an approach to obtain a correction to classical estimation of the ground-state energy by using VQE to search the Hamiltonian’s contextual subspace [11, 10, 12]. This technique was found to significantly reduce the qubit requirements and number of Hamiltonian terms to be evaluated on the quantum computer when estimating the ground state energy of small molecules to within chemical accuracy.

While these studies show that such techniques (as well as other recent proposals such as entanglement forging [5]) can significantly reduce the qubit and fidelity requirements fro VQE, there is, to our knowledge, no evidence that any proposed near-term quantum algorithm could yield an advantage over classical electronic structure methods. Additionally, few proposals have considered how VQE might augment, rather than replace, commonly used classical electronic structure methods. (While contextual subspace VQE uses VQE to augment a classical energy estimate, the classical estimate is derived from a non-contextual approximation to the Hamiltonian rather than electronic structure methods commonly used in computational chemistry.)

As a step towards addressing this challenge, this study introduces classically-boosted VQE (CB-VQE), a generalization of VQE that allows one to reduce the fidelity and measurement requirements by taking advantage of classical approximate solutions to an electronic structure problem. This approach, building on prior subspace methods, considers the subspace spanned by a set of states of which some are classically tractable and others that can be prepared on a quantum computer. This work derives the CB-VQE method, assesses the sensitivity to device and sampling error, and presents a numerical and analytical estimation of measurement requirements for the simplest version of CB-VQE.

The analysis shows that the sensitivity of estimated eigenvalues to device and sampling error can vanish in the limit that the classically tractable states approach the exact solution. In the simplest version of classical boosting, where a single Slater determinant is used to boost conventional VQE, the number of measurements needed to estimate the ground-state energy of homonuclear closed-shell diatomic molecules is reduced by several orders of magnitude. An analytical expression for this speedup in terms of only the overlap between the exact ground state and the Slater determinant used for boosting is derived, allowing for the utility of single-determinant classical boosting to be readily assessed for larger systems.

2 Theory

This section first introduces a generalized subspace expansion, wherein a quantum computer is used to find the minimum energy within a subspace spanned by states $\left\{\ket{\Phi_{\alpha}}\right\}$ . Next, a sensitivity analysis explores the behavior when some of the states in this set are classically tractable. In the regime where the classically tractable states provide a good approximation of an eigenstate, the sensitivity to errors in quantum estimation is found to vanish under reasonable assumptions.

The ground state energy (as well as excited state energies) are estimated by solving the generalized eigenvalue problem [18]

\displaystyle\overline{H}\vec{v}=\lambda\overline{S}\vec{v},

(1)

where

	$\displaystyle[\overline{H}]_{\alpha,\beta}=H_{\alpha,\beta}$	$\displaystyle=\langle\Phi_{\alpha}\|H\|\Phi_{\beta}\rangle$		(2)
	$\displaystyle[\overline{S}]_{\alpha,\beta}=S_{\alpha,\beta}$	$\displaystyle=\langle\Phi_{\alpha}\|\Phi_{\beta}\rangle,$		(3)

where $\alpha$ and $\beta$ run over the states chosen to span the subspace of interest.

$S_{\alpha,\alpha}$ is equal to unity and $H_{\alpha,\alpha}$ can be determined using the estimation techniques used in conventional VQE [4]. The remaining entries are the $S_{\alpha,\beta}$ and $H_{\alpha,\beta}$ for $\alpha\neq\beta$ . To estimate these entries, let $U_{\alpha}$ be a unitary operator that transforms the all-zero state $\ket{0^{N}}$ to $\ket{\Phi_{\alpha}}$ , i.e. $U_{\alpha}\ket{0^{N}}=\ket{\Phi_{\alpha}}$ where $N$ is the number of qubits. Additionally, let $H=\sum_{\ell}h_{\ell}$ be a decomposition of the Hamiltonian into unitary operators $h_{\ell}$ . The off-diagonal entries can be expressed in terms of expectation values involving these operators:

H_{\alpha,\beta}=\sum_{\ell}\bra{0^{N}}U_{\alpha}^{\dagger}h_{\ell}U_{\beta}\ket{0^{N}}

(4)

and

S_{\alpha,\beta}=\bra{0^{N}}U_{\alpha}^{\dagger}U_{\beta}\ket{0^{N}}

(5)

The real and imaginary parts of these expressions can be estimated using the Hadamard test. Fig. 1 shows the circuits used to estimate the real parts with the state preparation unitaries for $\ket{\Phi_{\alpha}}$ and $\ket{\Phi_{\beta}}$ factorized as $U_{\alpha}=W_{\alpha,\beta}V_{\alpha}$ and $U_{\beta}=W_{\alpha,\beta}V_{\beta}$ to take advantage of common operations $W_{\alpha,\beta}$ at the beginning of the state preparations.

One strategy for implementing controlled versions of the ansatz operations $V_{\alpha}$ and $V_{\beta}$ is to promote each elementary gate of the original compilation into a controlled version of the gate [9]. This introduces an additional cost to the circuit in terms of depth and number of gates. As discussed in B, some simplifications allow for an efficient implementation of these controls for ansatzes based on particle-number conserving blocks. Under reasonable assumptions, these simplifications allow a control to be applied to $V_{\alpha}$ by increasing the number of two-qubit gates by a factor of only $1+5/N$ , where $N$ is the number of qubits.

The outcome likelihoods of the circuits shown in Figure 1 are

\displaystyle\textup{Pr}(\pm)=\frac{1}{2}\left(1\pm\textup{Re}\left(\bra{0^{N}}U_{\alpha}^{\dagger}h_{\ell}U_{\beta}\ket{0^{N}}\right)\right)

(6)

for the Hamiltonian entries and

\displaystyle\textup{Pr}(\pm)=\frac{1}{2}\left(1\pm\textup{Re}\left(\bra{0^{N}}U_{\alpha}^{\dagger}U_{\beta}\ket{0^{N}}\right)\right)

(7)

for the overlap entries. Measurement samples give a statistical estimate of the quantities of interest $\textup{Re}\left(\bra{0^{N}}U_{\alpha}^{\dagger}h_{\ell}U_{\beta}\ket{0^{N}}\right)$ and $\textup{Re}\left(\bra{0^{N}}U_{\alpha}^{\dagger}U_{\beta}\ket{0^{N}}\right)$ . The imaginary component of the overlap can be estimated by instead initializing the ancilla qubit in the state $\ket{-i}\ket{0^{N}}$ , in which case the Re in Eqs. 6 and 7 is replaced with Im. However, as discussed in Appendix A, in typical cases estimating the imaginary components of $\overline{H}$ and $\overline{S}$ is not necessary.

(a)

(b)

Figure 1: Quantum circuits implementing the Hadamard-test for estimating (a)

\textup{Re}\left(\bra{\Phi_{\alpha}}H\ket{\Phi_{\beta}}\right)

and (b)

\textup{Re}\left(\innerproduct{\Phi_{\alpha}}{\Phi_{\beta}}\right)

This estimation, in both the real and imaginary cases, can be accelerated using quantum amplitude estimation or enhanced sampling methods [29]. The estimation process described above yields a root mean squared error in the (unbiased) estimate of $O(1/\sqrt{M})$ where $M$ is the number of samples. For devices with sufficiently low error, quantum amplitude estimation can improve this to $O(1/M)$ . Enhanced sampling methods, in contrast, achieve a scaling of $O(\sqrt{r/M})$ where $r$ is proportional to the error rate of elementary gates of the device.

It may be advantageous to choose some of the $\left\{\ket{\Phi_{\alpha}}\right\}$ basis states to be classically tractable. Here, a set of states is said to be classically tractable if $H_{\alpha,\beta}=\langle\Phi_{\alpha}|H|\Phi_{\beta}\rangle$ and $S_{\alpha,\beta}=\langle\Phi_{\alpha}|\Phi_{\beta}\rangle$ can be calculated classically for all $\ket{\Phi_{\alpha}}$ and $\ket{\Phi_{\beta}}$ in the set. Below, the states for which $H_{\alpha,\beta}$ and $S_{\alpha,\beta}$ are to be calculated classically are referred to as the “classical” states, and the remaining states in the basis as the “quantum” states.

The sensitivity of the generalized eigenvalues to errors in the entries of $\overline{H}$ and $\overline{S}$ can give insight into how such “classical boosting” might reduce the measurement and fidelity requirements. From Eq. 12 of Ref. [6], the sensitivity of a generalized eigenvalue $\lambda$ to $H_{\alpha,\beta}$ and $S_{\alpha,\beta}$ is

\frac{\partial\lambda}{\partial H_{\alpha,\beta}}=v_{\alpha}v_{\beta}\left(2-\delta_{\alpha,\beta}\right)

(8)

and

\frac{\partial\lambda}{\partial S_{\alpha,\beta}}=-\lambda v_{\alpha}v_{\beta}\left(2-\delta_{\alpha,\beta}\right)

(9)

where $\vec{v}$ is taken to be normalized so that $\vec{v}^{T}\overline{S}\vec{v}=1$ and imaginary components of $\overline{H}$ and $\overline{S}$ are neglected, as discussed above. (Note that the factor of two arises from taking advantage of the fact that $\textup{Re}\left(\overline{H}\right)$ and $\textup{Re}\left(\overline{S}\right)$ are symmetric.)

Suppose that the states $\left\{\ket{\Phi_{\alpha}}\right\}$ are mutually orthonormal. In this case $S$ is equal to the identity and the normalization condition on $v$ reduces to $\sum_{\alpha}|v_{\alpha}|^{2}=1$ . Consider the limit where the classical states provide a good approximation of $\vec{v}$ , i.e.

\sum_{\alpha\in C}|v_{\alpha}|^{2}\rightarrow 1

(10)

where the summation runs over indices corresponding to classical states. In this limit, the normalization condition implies that for all quantum states $\alpha$ , $v_{\alpha}\rightarrow 0$ . Inserting this relation into Eqs. 8 and 9 shows that in this limit, the sensitivity to errors in $H_{\alpha,\beta}$ and $S_{\alpha,\beta}$ approaches zero when one or both of $\alpha$ and $\beta$ corresponds to a quantum state.

This asymptotic behavior shows that choosing classical states that, in linear combination, can provide a good approximation of an eigenstate may greatly reduce the measurement and fidelity requirements for estimating the exact eigenvalue to within a desired accuracy. Importantly, as the classical approximation approaches the exact eigenstate, the sensitivity of the estimated energy to device and sampling error vanishes.

The above analysis has assumed that the classical and quantum states are mutually orthogonal. Approximate orthogonality could be enforced by adding a penalty proportional to the sum of square overlaps $|S_{\alpha,\beta}|^{2}$ to the cost function used in optimizing ansatz parameters. Alternatively, one could design quantum circuits that, by construction, yield states that are orthogonal. For example, Nakanishi et al. have suggested the use of mutually orthogonal input states as a method for constructing a set of mutually orthogonal entangled states [19].

The above approach can be implemented with a variety of choices for the quantum states and classical states. Choices for quantum states include any of the ansatzes already proposed for VQE [4], whose parameters could be variationally optimized so as to minimize the ground state energy estimated from the generalized eigenvalue problem in Eq. 1. The simplest choice for classical states are computational basis states, which can be thought of as can be thought of as boosting VQE with a configuration interaction calculation. The case where only a single computational basis state is included is analyzed in more detail below. Other choices for classical states include matrix-product states with low bond order as well as quantum circuits inspired by the Lipkin-Meshkov-Glick model [24].

CB-VQE can also be used in conjunction with many Hamiltonian decomposition $\{h_{\ell}\}$ previously proposed for VQE. One choice for the unitary decomposition would be the conventional decomposition of $H$ into Pauli strings [22]. Others include the decomposition into unitary operators comprised of sets of mutually anti-commuting [9] or commuting [30] Pauli strings, as well as low-rank factorizations of the Hamiltonian yielding Pauli strings conjugated by orbital rotations [23, 8].

3 Measurement analysis of single-determinant boosting

The simplest example of the CB-VQE considers just two states, one quantum and one classically tractable, with the classically tractable state being a computational basis state. Under common encodings, a computational basis state corresponds to a single Slater determinant. Given that the Slater determinant that provides the best approximation to the ground state is the Hartree-Fock state, we will refer to this method as HF-VQE. This section shows how the estimation of the eigenvalue in Eq. 1 can be simplified in this case and derives the relationship between the number of measurements and precision when measurements are optimally allocated to the measurement of the entries of $\overline{H}$ and $\overline{S}$ .

Let classical and quantum states $\ket{\Phi_{\textup{cl}}}$ and $\ket{\Phi_{\textup{q}}}$ correspond to indices $\alpha=1$ and $\alpha=2$ , respectively, in $H_{\alpha,\beta}$ and $S_{\alpha,\beta}$ . $H_{1,1}=\bra{\Phi_{\textup{cl}}}H\ket{\Phi_{\textup{cl}}}$ can be trivially evaluated and $H_{2,2}=\bra{\Phi_{\textup{q}}}H\ket{\Phi_{\textup{q}}}$ estimated using existing VQE approaches. In the evaluation of $H_{1,2}=H_{2,1}^{*}=\bra{\Phi_{\textup{cl}}}H\ket{\Phi_{\textup{q}}}$ , the Hamiltonian can be decomposed into measurable components by inserting a resolution of the identity operator into $\bra{\Phi_{\textup{cl}}}H\ket{\Phi_{\textup{q}}}$ :

H_{1,2}=\bra{\Phi_{\textup{q}}}H\ket{\Phi_{\textup{cl}}}=\sum_{i}y_{i}\bra{i}H\ket{i_{0}}

(11)

where $y_{i}=\textup{Re}(\innerproduct{\Phi_{\textup{q}}}{i})$ and the sum runs over all computational basis states $i$ that the Hamiltonian couples to the computational basis state $\ket{i_{0}}\equiv\ket{\Phi_{\textup{cl}}}$ .

The entries of the overlap matrix $\overline{S}$ are given by

S_{\alpha,\beta}=\begin{cases}1&\text{if $\alpha=\beta$}\\ y_{i_{0}}&\text{otherwise}\end{cases}.

(12)

The overlap between $\ket{\Phi_{\textup{q}}}$ and a given computational basis state $\ket{i}$ can be estimated using the Hadamard test as shown in Fig. 1(b) by choosing $U_{\textup{cl}}$ to be an operator that transforms the $\ket{0^{N}}$ state to the desired computational basis state.

When the generalized eigenvalue problem Eq. 1 is used to estimate $\lambda$ given estimators $\left\{\hat{y}_{i}\right\}$ and $\hat{H}_{2,2}$ for $\left\{y_{i}\right\}$ and $H_{2,2}$ , the rules for propagation of uncertainty show that the variance in the estimator $\hat{\lambda}$ is

\textup{Var}\left(\hat{\lambda}\right)\approx\sum_{i}\left(\frac{\partial\lambda}{\partial y_{i}}\right)^{2}\textup{Var}\left(\hat{y}_{i}\right)+\left(\frac{\partial\lambda}{\partial H_{2,2}}\right)^{2}\textup{Var}\left(\hat{H}_{2,2}\right)

(13)

when the variances $\textup{Var}\left(\hat{y}_{i}\right)$ and $\textup{Var}\left(\hat{H}_{2,2}\right)$ are small.

The chain rule allows the derivative of $\lambda$ with respect to $y_{i}$ to be expressed as

\frac{\partial\lambda}{\partial y_{i}}=\sum_{\alpha,\beta}\left(\frac{\partial\lambda}{\partial H_{\alpha,\beta}}\frac{\partial H_{\alpha,\beta}}{\partial{y_{i}}}+\frac{\partial\lambda}{\partial S_{\alpha,\beta}}\frac{\partial S_{\alpha,\beta}}{\partial{y_{i}}}\right).

(14)

Combining this with Eqs. 8, 9, 11, and 12 and using the fact that $H_{1,1}$ and $H_{2,2}$ do not depend on $y_{i}$ yields

\frac{\partial\lambda}{\partial y_{i}}=2v_{1}v_{2}\left(\bra{i}H\ket{i_{0}}-\lambda\delta_{i,i_{0}}\right).

(15)

Inserting the estimator relations corresponding to above expression and Eq. 8 into 13 shows that the variance in the eigenvalue estimator is given by

\textup{Var}\left(\hat{\lambda}\right)\approx 4v_{1}^{2}v_{2}^{2}\sum_{i}\left(\bra{i}H\ket{i_{0}}-\lambda\delta_{i,i_{0}}\right)^{2}\textup{Var}\left(\hat{y}_{i}\right)+v_{2}^{4}\textup{Var}\left(\hat{H}_{2,2}\right)

(16)

where the true weights $v_{1}$ , $v_{2}$ and true eigenvalue $\lambda$ are used instead of the corresponding estimators because they are equal in the limit that the variances $\textup{Var}\left(\hat{y}_{i}\right)$ and $\textup{Var}\left(\hat{H}_{2,2}\right)$ are small.¹¹1If the eigenvalue is degenerate, then there are additional complications in attributing a sensitivity to eigenvector components. The present analysis therefore applies only if the two generalized eigenvalues are distinct, which will be true in the typical case.

Note that in the case where $\ket{\Phi_{\textup{q}}}$ is orthogonal to $\ket{\Phi_{\textup{cl}}}$ and when the state $v_{1}\ket{\Phi_{\textup{cl}}}+v_{2}\ket{\Phi_{\textup{q}}}$ corresponds to the exact ground state $\ket{\Phi_{\textup{gs}}}$ , then $v_{1}=\innerproduct{\Phi_{\textup{cl}}}{\Phi_{\textup{gs}}}\equiv\alpha$ and $v_{2}=\sqrt{1-\alpha^{2}}$ .

To determine $\textup{Var}\left(\hat{y_{i}}\right)$ , first rewrite Eq. 6 as

\displaystyle y_{i}=2p_{i}-1

(17)

where $p_{i}=\textup{Pr}(+|\innerproduct{\Phi_{\textup{q}}}{i})$ . Let $\hat{p}_{i}$ be the estimator for $p_{i}$ corresponding to the sample mean of measurements on the ancilla qubit in the $\left(+,-\right)$ basis. Eq. 17 can be used to estimate $y_{i}$ , and the corresponding estimator $\hat{y_{i}}$ has variance

\displaystyle\textup{Var}\left(\hat{y_{i}}\right)=4\textup{Var}\left(\hat{p_{i}}\right)

(18)

Since $\hat{p}_{i}$ corresponds to the sample mean of a Bernoulli distribution, its variance is

\displaystyle\textup{Var}\left(\hat{p}_{i}\right)=\frac{p_{i}\left(1-p_{i}\right)}{M_{i}}

(19)

where $M_{i}$ is the number of samples used to estimate $p_{i}$ . Combining Eqs. 17, 18, and 19 allows the variance of the overlap estimator $\hat{y}_{i}$ to be expressed in terms of the overlap $y_{i}$ :

\displaystyle\textup{Var}\left(\hat{y}_{i}\right)=\frac{1-y_{i}^{2}}{M_{i}}

(20)

Similarly, the measurement analysis for conventional VQE shows that if $M^{\prime}$ measurements are used to estimate $H_{2,2}$ , then

\textup{Var}\left(\hat{H}_{2,2}\right)=\frac{K^{\prime}}{M^{\prime}}

(21)

where $K^{\prime}$ depends on the details of the estimation technique (e.g., use of grouping or other Hamiltonian decompositions) [7, 9, 30, 23, 8].

Previous works [25] have shown that given a set of random variables $x_{j}$ with estimators $\hat{x}_{j}$ where

\textup{Var}\left(\hat{x}_{j}\right)=\frac{\sigma_{j}^{2}}{N_{j}}

(22)

then when a function $z\left(\left\{x_{i}\right\}\right)$ such that $\textup{Var}\left(\hat{z}\right)=\sum_{i}a_{i}^{2}\textup{Var}\left(\hat{x}_{i}\right)$ is to be estimated subject to the constraint $\sum_{j}N_{j}=N$ , then the optimal choice of $N_{j}$ yields a variance of

\displaystyle\textup{Var}\left(\hat{z}\right)=\frac{\left(\sum_{j}\left|a_{j}\right|\sigma_{j}\right)^{2}}{N}.

(23)

This relation applied to Eqs. 16, 20, and 21, yields

\textup{Var}\left(\hat{E}\right)=\frac{K}{M}

(24)

where $M=\sum_{i}M_{i}+M^{\prime}$ is the total number of measurements and

K=\left(2\alpha\sqrt{1-\alpha^{2}}\sum_{i}|\bra{i}H\ket{i_{0}}-E\delta_{i,i_{0}}|\sqrt{1-y_{i}^{2}}+\left(1-\alpha^{2}\right)\sqrt{K^{\prime}}\right)^{2}.

(25)

Eqs. 24 and 25 allow one to estimate the number of measurements needed to achieve a desired precision in terms of the overlaps $y_{i}$ and $\alpha$ and the measurement factor $K^{\prime}$ .

Importantly, as the overlap $\alpha$ goes to unity, $K$ approaches zero. This means that in the limit that the classically tractable state $\ket{\Phi_{\textup{cl}}}$ provides a good approximation of the ground state, the number of measurements required to reach a given precision will approach zero.

Note that in the practical implementation of classical boosting, one does not know $\alpha$ , $E$ , and each $y_{i}$ a priori. Therefore one would use approximations for these quantities for the purposes of allocating measurements to the estimation of $H_{2,2}$ and each $y_{i}$ . This is analogous to conventional VQE, where one uses approximate expectation values of each Pauli to allocate measurements to groups [7].

4 Numerical estimates of measurement counts

To explore the speedup provided by classical boosting, this section presents numerical calculations of measurement requirements of HF-VQE as well as conventional VQE for closed-shell homonuclear diatomic molecules. Hamiltonians were generated using Psi4 [21] with cc-pVQZ basis sets. Estimates use canonical orbitals, although the findings of Tubman et al. [27] suggest that HF-VQE may require fewer measurements when orbital rotations are used to obtain a Slater determinant with greater overlap with the ground state. After selecting an active space, OpenFermion [16] was used to apply the Jordan-Wigner transformation and obtain the ground state within that active space. This ground state was used to obtain overlaps $y_{i}$ as well as the term variances needed for estimating measurements for conventional VQE. These calculations were performed using Zapata’s Orquestra™ platform.

For these conventional VQE measurement estimates, co-measurable terms were grouped using a greedy algorithm. Variances in the estimators for each term were computed using the exact ground state while covariances were neglected [7]. The measurement estimations for classically-boosted VQE employ Eqs. 24 and 25. This corresponds to single-determinant boosting with the quantum state orthogonal to the classical state.

Table 1 shows the resulting estimated number of measurements to achieve a precision of 1 mHa with conventional VQE and HF-VQE. As illustrated by Figure 2, classical boosting reduces the number of measurements required by $\sim 1-3$ orders of magnitude for these systems. The speedup, for a given molecule, is relatively consistent across the range of qubit counts considered, although some oscillations are present.

To gain greater insight into the comparison between HF-VQE and conventional VQE in large basis set (i.e. high qubit count) regime, the Appendix shows that in the limit of a large number of spin-orbitals $N_{\textup{orb}}$ (for fixed number of electrons), then the speedup $S$ in terms of number of measurements of HF-VQE relative to conventional VQE (when the Hamiltonian is decomposed into groups of co-measureable terms) is approximately

\lim_{N_{\textup{orb}}\rightarrow\infty}S\approx\frac{1}{\left(1-\alpha^{2}\right)^{2}}.

(26)

Figure 3 shows the speedup obtained numerically relative to this asymptotic approximation. The data shows that as the number of qubits approaches infinity, the speedup approximately aproaches that expected from Eq. 26, corresponding to the $S/\left(1-\alpha\right)^{-2}=1$ position on the vertical axis of Figure 3.

Molecule	Number of qubits	Overlap	Measurements		Speedup
			Conventional VQE	HF-VQE
H₂	4	0.9997	$3.1\times 10^{3}$	$3.2$	$970$
	8	0.9984	$1.9\times 10^{5}$	$3.9\times 10^{2}$	$490$
	12	0.9945	$2.7\times 10^{6}$	$1.4\times 10^{4}$	$190$
	16	0.9944	$1.7\times 10^{7}$	$2.9\times 10^{4}$	$570$
Li₂	4	0.9975	$1.2\times 10^{3}$	$6.4$	$190$
	8	0.9934	$7.2\times 10^{3}$	$2.5\times 10^{2}$	$29$
	12	0.9928	$2.1\times 10^{5}$	$7.6\times 10^{2}$	$280$
	16	0.9870	$1.4\times 10^{6}$	$6.0\times 10^{3}$	$230$
N₂	12	0.9287	$1.3\times 10^{7}$	$3.2\times 10^{6}$	$4.1$
N₂	16	0.9287	$2.6\times 10^{7}$	$4.1\times 10^{6}$	$6.3$
F₂	16	0.9737	$1.7\times 10^{8}$	$1.0\times 10^{6}$	$170$

Table 1: Estimated number of measurements required to reach a precision of 1 mHa with conventional VQE and HF-VQE and the speedup (in terms of number of measurements).

Refer to caption — Figure 2: Speedup, in terms of number of measurements, of HF-VQE relative to conventional VQE for closed-shell homonuclear diatomic molecules.

5 Conclusions

This paper has introduced the notion of classically-boosting VQE calculations to reduce measurement and fidelity requirements in estimating Hamiltonian eigenvalues. The derivation of this method shows that when the classically tractable states used for boosting provide a good approximation of the desired eigenstate, then the sensitivity to sampling and device error vanishes. Although the circuit depth required is in general greater than that required for conventional VQE by a constant factor, the lower sensitivity to device noise may nevertheless result in lower fidelity requirements. Additionally, for ansatzes based on particle-number conserving blocks, classical boosting can be applied with asymptotically negligible overhead.

The simplest version of classical boosting, where a single computational basis state is used to boost a single quantum state, is found to reduce the number of required measurements by a factor of $10-1000$ for selected closed-shell homonuclear diatomic molecules. Furthermore, the data show that the speedup can be well approximated using only the overlap between the ground state and Hartree-Fock state.

This approximation for the speedup from single-determinant boosting can be used to estimate the effectiveness of classical boosting in accelerating calculations on systems of practical interest. FeMoco has been highlighted as an examplary system of practical interest for which quantum computers could provide value. Using the square overlap of 0.77 found by Tubman et al. for this system [27] indicates that, in the limit of large basis size, HF-VQE would reduce the number of shots required for ground-state energy calculations on this system by a factor of approximately 19. A greater speedup potentially could be achieved by using a classical state (or linear combination of classical states) that incorporates electron correlation.

Determining whether classically boosted VQE could outperform classical methods, while beyond the scope of this manuscript, is a crucial question for guiding the direction of future research on this topic. However, one distinguishing feature of CB-VQE compared to other near-term quantum algorithms for electronic structure is that it uses VQE to augment, rather than replace, classical electronic structure methods. This suggests that CB-VQE may be able to perform at least as well as the classical methods that could be used for boosting, such as Hartree-Fock, configuration interaction, or matrix-product state methods. In order to have a more fair comparison to our method, it would be useful to have measurement cost analyses for many of the recently proposed extensions to VQE [18, 17, 26, 28, 15, 13, 12, 5]. The measurement costing formalism introduced in this paper may be used to assess the measurement costs of those techniques that are similarly based on generalized eigenvalue problems [18, 17, 26, 28, 15].

Acknowledgement

The authors acknowledge insightful scientific discussions and suggestions from Jérôme F. Gonthier, Alex Kunitsa, Peter Love, and Alán Aspuru-Guzik.

References

[1] Alán Aspuru-Guzik, Anthony D Dutoi, Peter J Love and Martin Head-Gordon “Simulated quantum computation of molecular energies” In Science 309.5741 American Association for the Advancement of Science, 2005, pp. 1704–1707
[2] Adriano Barenco et al. “Elementary gates for quantum computation” In Physical review A 52.5 APS, 1995, pp. 3457
[3] Panagiotis Kl. Barkoutsos et al. “Quantum algorithms for electronic structure calculations: Particle-hole Hamiltonian and optimized wave-function expansions” In Phys. Rev. A 98 American Physical Society, 2018, pp. 022322 DOI: 10.1103/PhysRevA.98.022322
[4] Yudong Cao et al. “Quantum chemistry in the age of quantum computing” In Chem. Rev. 119.19 ACS Publications, 2019, pp. 10856–10915 DOI: 10.1021/acs.chemrev.8b00803
[5] Andrew Eddins et al. “Doubling the size of quantum simulators by entanglement forging”, 2021 arXiv:2104.10220 [quant-ph]
[6] R.. Fox and M.. Kapoor “Rates of change of eigenvalues and eigenvectors” In AIAA Journal 6.12, 1968, pp. 2426–2429 DOI: 10.2514/3.5008
[7] Jérôme F. Gonthier et al. “Identifying challenges towards practical quantum advantage through resource estimation: the measurement roadblock in the variational quantum eigensolver”, 2020 arXiv:2012.04001 [quant-ph]
[8] William J. Huggins et al. “Efficient and Noise Resilient Measurements for Quantum Chemistry on Near-Term Quantum Computers”, 2019 arXiv:1907.13117
[9] Artur F. Izmaylov, Tzu-Ching Yen, Robert A. Lang and Vladyslav Verteletskyi “Unitary Partitioning Approach to the Measurement Problem in the Variational Quantum Eigensolver Method” In Journal of Chemical Theory and Computation 16.1, 2020, pp. 190–195 DOI: 10.1021/acs.jctc.9b00791
[10] William M Kirby and Peter J Love “Classical simulation of noncontextual Pauli Hamiltonians” In Physical Review A 102.3 APS, 2020, pp. 032418
[11] William M Kirby and Peter J Love “Contextuality test of the nonclassicality of variational quantum eigensolvers” In Physical review letters 123.20 APS, 2019, pp. 200501
[12] William M Kirby, Andrew Tranter and Peter J Love “Contextual Subspace Variational Quantum Eigensolver” In Quantum 5, 2021, pp. 456 arXiv:2011.10027 [quant-ph]
[13] Jakob S Kottmann, Philipp Schleich, Teresa Tamayo-Mendoza and Alán Aspuru-Guzik “Reducing qubit requirements while maintaining numerical precision for the variational quantum eigensolver: A basis-set-free approach” In The Journal of Physical Chemistry Letters 12.1 ACS Publications, 2021, pp. 663–673
[14] Michael Kühn et al. “Accuracy and Resource Estimations for Quantum Chemistry on a Near-Term Quantum Computer” In Journal of Chemical Theory and Computation 15.9, 2019, pp. 4764–4780 DOI: 10.1021/acs.jctc.9b00236
[15] David Zsolt Manrique et al. “Momentum-Space Unitary Coupled Cluster and Translational Quantum Subspace Expansion for Periodic Systems on Quantum Computers”, 2021 arXiv:2008.08694 [quant-ph]
[16] Jarrod R McClean et al. “OpenFermion: the electronic structure package for quantum computers” In Quantum Science and Technology 5.3 IOP Publishing, 2020, pp. 034014
[17] Jarrod R. McClean et al. “Decoding quantum errors with subspace expansions” In Nat. Comm. 11, 2020, pp. 636 DOI: 10.1038/s41467-020-14341-w
[18] Jarrod R. McClean, Mollie E. Kimchi-Schwartz, Jonathan Carter and Wibe A. Jong “Hybrid quantum-classical hierarchy for mitigation of decoherence and determination of excited states” In Phys. Rev. A 95 American Physical Society, 2017, pp. 042308 DOI: 10.1103/PhysRevA.95.042308
[19] Ken M. Nakanishi, Kosuke Mitarai and Keisuke Fujii “Subspace-search variational quantum eigensolver for excited states” In Phys. Rev. Research 1, 2019, pp. 033062 DOI: 10.1103/PhysRevResearch.1.033062
[20] Bryan O’Gorman, William J. Huggins, Eleanor G. Rieffel and K. Whaley “Generalized swap networks for near-term quantum computing”, 2019 arXiv:1905.05118 [quant-ph]
[21] Robert M. Parrish et al. “Psi4 1.1: An Open-Source Electronic Structure Program Emphasizing Automation, Advanced Libraries, and Interoperability” In J. Chem. Theory Comput. 13.7, 2017, pp. 3185–3197 DOI: 10.1021/acs.jctc.7b00174
[22] Alberto Peruzzo et al. “A variational eigenvalue solver on a photonic quantum processor” In Nature Communications 5, 2014, pp. 4213 DOI: 10.1038/ncomms5213
[23] Maxwell D. Radin and Peter Johnson “Measurement Reduction via Orbital Frames Decompositions on Quantum Computers, World patent WO 2020/146794, 2020”, 2020
[24] Kenneth Robbins and Peter J. Love “Benchmarking VQE with the LMG model”, 2021 arXiv:2105.06761 [quant-ph]
[25] Nicholas C. Rubin, Ryan Babbush and Jarrod McClean “Application of fermionic marginal constraints to hybrid quantum algorithms” In New Journal of Physics 20, 2018, pp. 053020 DOI: 10.1088/1367-2630/aab919
[26] Tyler Takeshita et al. “Increasing the Representation Accuracy of Quantum Simulations of Chemistry without Extra Quantum Resources” In Phys. Rev. X 10 American Physical Society, 2020, pp. 011004 DOI: 10.1103/PhysRevX.10.011004
[27] Norm M. Tubman et al. “Postponing the orthogonality catastrophe: Efficient state preparation for electronic structure simulations on quantum devices” In arXiv, 2018, pp. 1–13 arXiv:1809.05523
[28] Miroslav Urbanek, Daan Camps, Roel Van Beeumen and Wibe A. Jong “Chemistry on Quantum Computers with Virtual Quantum Subspace Expansion” In Journal of Chemical Theory and Computation 16.9, 2020, pp. 5425–5431 DOI: 10.1021/acs.jctc.0c00447
[29] Guoming Wang, Dax Enshan Koh, Peter D Johnson and Yudong Cao “Minimizing Estimation Runtime on Noisy Quantum Computers” In PRX Quantum 2.1 APS, 2021, pp. 010346 DOI: 10.1103/PRXQuantum.2.010346
[30] Tzu-Ching Yen, Vladyslav Verteletskyi and Artur F. Izmaylov “Measuring All Compatible Operators in One Series of Single-Qubit Measurements Using Unitary Transformations” In Journal of Chemical Theory and Computation 16.4, 2020, pp. 2400–2409 DOI: 10.1021/acs.jctc.0c00008

Appendix A Imaginary components of the generalized eigenvalue problem

Here we show that ignoring the imaginary components of $\overline{H}$ and $\overline{S}$ still results in a valid ground state energy upper bound. Consider the generalized eigenvalue problem

\displaystyle\textup{Re}\left(\overline{H}\right)\vec{u}=\lambda_{\textup{R}}\textup{Re}\left(\overline{S}\right)\vec{u}.

(27)

Because $\textup{Re}\left(\overline{H}\right)$ and $\textup{Re}\left(\overline{S}\right)$ are real and symmetric, the eigenvalues $\lambda_{\textup{R}}$ and eigenvectors $u$ are real. Let $\lambda_{\textup{R}}^{\left(0\right)}$ be the lowest eigenvalue and $\vec{u}^{\left(0\right)}$ be the corresponding eigenvector.

The lowest eigenvalue of the complex generalized eigenvalue problem (Eq. 1) is the minimum of the Rayleigh quotient

\lambda^{\left(0\right)}\leq R\left(\overline{H},\overline{S},\vec{v}\right)=\frac{\vec{v}^{\dagger}\overline{H}\vec{v}}{\vec{v}^{\dagger}\overline{S}\vec{v}}.

(28)

Consider now

R\left(\overline{H},\overline{S},\vec{u}^{\left(0\right)}\right)=\frac{\left(\vec{u}^{\left(0\right)}\right)^{\dagger}\overline{H}\vec{u}^{\left(0\right)}}{\left(\vec{u}^{\left(0\right)}\right)^{\dagger}\overline{S}\vec{u}^{\left(0\right)}}.

(29)

This expression can be simplified because given a Hermitian matrix $\overline{A}$ and a real vector $\vec{w}$ , the imaginary component of $\overline{A}$ does not contribute to $\vec{w}^{\dagger}A\vec{w}$ . This can be seen by noting that $\vec{w}^{\dagger}A\vec{w}$ is real because $\overline{A}$ is Hermitian; and the imaginary component of each entry of $\overline{A}$ cannot contribute to the real component of $\vec{w}^{\dagger}A\vec{w}$ because $\vec{w}$ is real.

Therefore Eq. 29 can be rewritten as

R\left(\overline{H},\overline{S},\vec{u}^{\left(0\right)}\right)=\frac{\left(\vec{u}^{\left(0\right)}\right)^{\textup{T}}\textup{Re}\left(\overline{H}\right)\vec{u}^{\left(0\right)}}{\left(\vec{u}^{\left(0\right)}\right)^{\textup{T}}\textup{Re}\left(\overline{S}\right)\vec{u}^{\left(0\right)}}.

(30)

The right hand side of this equation represents the Rayleigh quotient corresponding to the real generalized eigenvalue problem of Eq. 27, and so

R\left(\overline{H},\overline{S},\vec{u}^{\left(0\right)}\right)=\lambda_{\textup{R}}^{\left(0\right)}.

(31)

Combining this relation with Eq. 28 shows that the lowest eigenvalue of the complex generalized eigenvalue problem (Eq. 1 is a lower bound on the lowest eigenvalue of the real generalized eigenvalue problem (Eq 27):

\lambda^{\left(0\right)}\leq\lambda_{\textup{R}}^{\left(0\right)}.

(32)

Eq. 32 shows that neglecting the imaginary components of $\overline{H}$ and $\overline{S}$ will never cause the ground state energy to be underestimated.

While the neglect of the imaginary components of $\overline{H}$ and $\overline{S}$ in general will lead to the ground state energy being overestimated, for Hamiltonians with time reversal symmetry (which is the case for typical chemical systems of interest), then, given suitable choices for $\left\{\ket{\Phi_{\textup{cl}}^{\left(j\right)}}\right\}$ and $\left\{\ket{\Phi_{\textup{q}}^{\left(k\right)}}\right\}$ , the exact ground state energy can nevertheless be obtained from the real generalized eigenvalue problem. Time reversal symmetry implies that for each eigenvalue, one can construct an eigenstate wherein the coefficient of each computational basis state is real. This ground state can be obtained as a linear combination of classical and quantum wavefunctions which also have a real coefficient for each computational basis state. In this case, all entries in $\overline{H}$ and $\overline{S}$ will be real and so $\lambda^{\left(0\right)}=\lambda_{\textup{R}}^{\left(0\right)}$ .

In practice, one can choose the $\left\{\ket{\Phi_{\textup{cl}}^{\left(j\right)}}\right\}$ to be real by construction and rely on variational optimization to find the values of $\left\{\ket{\Phi_{\textup{q}}^{\left(k\right)}}\right\}$ that yield the true ground state energy. It may also be possible to design ansatzes such that the quantum states are real by construction.

Appendix B Efficient implementation of controlled operations

This section shows that ansatzes that are comprised of an initial layer of $X$ gates to prepare a single computational basis state $\ket{i}$ followed by particle-number conserving blocks allow for efficient implementation of the controlled operations shown in Figs. 1(a) and 1(b). Ansatzes in this class include the hardware-efficient ansatzes based on $U_{1,\textup{ex}}$ and $U_{2,\textup{ex}}$ gates [3] as well as the swap-network implementation of the $k$ -UpCCGSD ansatz [20].

Consider basis states $\ket{\Phi_{\alpha}}$ that can be prepared by applying the operator $V_{\alpha}$ to the Jordan-Wigner encoding of the Hartree-Fock state, $\ket{i_{0}}=\ket{0^{N-k}1^{k}}=W\ket{0}$ . The estimation of $H_{\alpha,\beta}$ and $S_{\alpha,\beta}$ using the Hadamard test as shown in 1(a) and 1(b) requires controlled versions of the unitary $V_{\alpha}$ and $V_{\beta}$ . A straight-forward approach would promote each of the blocks $V^{(j)}_{\alpha}$ and the Hamiltonian components $h_{\ell}$ to controlled versions, leading to a constant-factor increase in the circuit depth and number of gates.

However, a the circuit depth and number gates needed to implement the overlap estimation can be significantly reduced if $V_{\alpha}$ can be decomposed into particle-number conserving blocks, i.e $V_{\alpha}=V^{(T)}_{\alpha}\ldots V^{(1)}_{\alpha}$ where $V^{(T)}_{\alpha},\ldots V^{(1)}_{\alpha}$ are operations that conserve particle number and act on adjacent qubits. Let $\ket{\psi}$ be an eigenstate of $V$ : $V\ket{\psi}=e^{i\theta}\ket{\psi}$ . For arbitrary unitaries $A$ and $B$ ,

$\displaystyle c-(AVB)\|+\psi>$	$\displaystyle=\frac{1}{\sqrt{2}}\ket{0}\ket{\psi}+\frac{1}{\sqrt{2}}\ket{1}AVB\ket{\psi}$	(33)
	$\displaystyle=\frac{1}{\sqrt{2}}e^{-i\theta}\ket{0}V\ket{\psi}+\frac{1}{\sqrt{2}}\ket{1}AVB\ket{\psi}$	(34)
	$\displaystyle=(c-A)(R_{Z}(-\theta)\otimes V)(c-B)\|+\psi>,$	(35)

where $R_{Z}(\theta)=[[e^{i\theta},0],[0,1]]$ . That is, for any operations in a sequence of controlled-gates that preserve the initial state, we can equivalently implement them as a non-controlled operation in parallel with a $Z$ -rotation on the ancilla qubit. Of the blocks in $V_{q}$ , the only which do not preserve the Hartree-Fock state are those which couple the filled and unfilled spin orbitals. Blocks that act entirely on qubits initialized in $\ket{0}$ or entirely on qubits initialized in $\ket{1}$ therefore need not be promoted to controlled gates. If the blocks are arranged in a brick-layer format with $D$ layers, one “coupling block” occurs on every other layer of blocks, leading to just $D/2$ blocks that require a controlled-implementation.

As an example, Fig. 4 shows the circuit used to estimate $\bra{\Phi_{\textup{q}}}H\ket{\Phi_{\textup{cl}}}$ in such a case with four spin-orbitals (in an interleaved spin-orbital ordering) and two electrons where the classical state $ket{\Phi_{\textup{cl}}}$ is taken to be the Hartree-Fock state. Importantly, two of the three entangling gates $U_{\textup{ent}}$ do not need to have controls applied. The $R_{z}\left(\theta\right)$ gate applies a phase to compensate for the phase that these two gates apply in the subspace where the ancilla is in the $\ket{0}$ state.

Figure 4: Efficient implementation of the circuit for estimating

\bra{\Phi_{\textup{q}}}H\ket{\Phi_{\textup{cl}}}

for an ansatz comprised of particle-number conserving blocks when the classical state is a single computational basis state.

The number of two-qubit gates needed to implement the overlap test in HF-VQE relative to the number needed for estimating an expectation value in conventional VQE can be estimated under reasonable assumptions. Suppose that promotion of each block into a controlled version incurs a factor $F$ increase in the number of elementary gates over the original block. Assuming that the Hamiltonian is decomposed into Pauli strings, a controlled version of each Pauli string operation $h_{\ell}$ can be implemented by promoting each single-qubit Pauli gate to a controlled-version of that gate, requiring at most $N$ two-qubit gates (assuming native controlled-gates and all-to-all connectivity). The initial brick-layer arrangement of operations of $D$ layers of blocks includes approximately $D(N/2)$ blocks. Therefore the factor increase in gates due to the controlled implementation is

\displaystyle\frac{DN/2+FD/2+N}{DN/2}=1+\frac{F}{N}+\frac{2}{D}.

(36)

Assume the ansatz is chosen have depth $D=N$ . Further, assume that the blocks $V^{(j)}$ are compiled into gates with each two-qubit gate being a controlled-NOT; from [2], this can be promoted to a Toffoli using three controlled-NOT gates, leading to $F\leq 3$ . Then, the total factor increase in two-qubit gates is $1+5/N$ .

Appendix C Asymptotic speedup

This section estimates the speedup in terms of measurements of HF-VQE relative to conventional VQE, that is

S=\frac{M^{\left(\textup{VQE}\right)}}{M}

(37)

when

\textup{Var}\left(\hat{E}\right)=\textup{Var}\left(\hat{E}_{\textup{VQE}}\right),

(38)

where $M^{\left(\textup{VQE}\right)}$ is the number of measurements applied to a conventional VQE ground-state energy estimator $\hat{E}_{\textup{VQE}}$ whose variance is given by

\textup{Var}\left(\hat{E}_{\textup{VQE}}\right)=\frac{K^{\left(\textup{VQE}\right)}}{M^{\left(\textup{VQE}\right)}}.

(39)

Combining these relations with Eqs. 24 and 39 shows that $S=K^{\left(\textup{VQE}\right)}/K$ , and applying Eq. 25 yields:

\frac{1}{\sqrt{S}}=\frac{2\alpha\sqrt{1-\alpha^{2}}\sum_{i}\bra{i}H\ket{i_{0}}-E\delta_{i,i_{0}}|\sqrt{1-y_{i}^{2}}+\left(1-\alpha^{2}\right)\sqrt{K^{\prime}}}{\sqrt{K^{\left(\textup{VQE}\right)}}}.

(40)

This expression can be simplified when the conventional VQE energy $E_{\textup{VQE}}=\bra{\Phi}H\ket{\Phi}$ as well as $H_{2,2}=\bra{\Phi_{\textup{q}}}H\ket{\Phi_{\textup{q}}}$ are estimated by the independent measurement of Pauli terms in the Hamiltonian. In this case,

K^{\left(\textup{VQE}\right)}=\left(\sum_{i}|h_{i}|\sqrt{\textup{Var}_{\ket{\Phi}}\left(\hat{P}_{i}\right)}\right)^{2}

(41)

where $\hat{P}_{i}$ is the estimator for the $i^{\textup{th}}$ Pauli term in the Hamiltonian and $h_{i}$ is its coefficient [7]. Similarly,

K^{\prime}=\left(\sum_{i}|h_{i}|\sqrt{\textup{Var}_{\ket{\Phi_{\textup{q}}}}\left(\hat{P}_{i}\right)}\right)^{2}

(42)

These quantities can be approximated by setting the variances to their upper bound of unity, in which case

K^{\left(\textup{VQE}\right)}\approx K^{\prime}\approx\left(\sum_{i}|h_{i}|\right)^{2}.

(43)

Prior studies have found that this approximation yields good estimates for VQE measurement requirements [7]. One can analogously approximate $\sqrt{1-y_{i}^{2}}$ with the upper bound of unity in Eq. 40.

Applying these approximations to Eq. 40 yields a simplified expression for the speedup:

\frac{1}{\sqrt{S}}=\frac{2\alpha\sqrt{1-\alpha^{2}}\sum_{i}|\bra{i}H\ket{i_{0}}-E\delta_{i,i_{0}}|}{\sqrt{K^{\left(\textup{VQE}\right)}}}+\left(1-\alpha^{2}\right).

(44)

Because each Pauli string $P_{i}$ that does not annihilate $\ket{i_{0}}$ will transform it to a single computational basis state,

\sum_{i}|\bra{i}H\ket{i_{0}}|\leq\sum_{i\in T}|h_{j}|

(45)

where $T$ is the set of Pauli terms that do not annihilate $\ket{i_{0}}$ .

For typical electronic structure problems, $T$ includes $O\left(N_{\textup{orb}}^{2}N_{\textup{el}}^{2}\right)$ terms, where $N_{\textup{orb}}$ is the number of spin-orbitals included in the calculation and $N_{\textup{el}}$ the number of electrons. In contrast, the full Hamiltonian contains $O\left(N_{\textup{orb}}^{4}\right)$ terms. Therefore in the limit of large $N_{\textup{orb}}$ (with $N_{\textup{el}}$ fixed), the ratio $\sum_{j\in T}|h_{j}|/\sum_{j}|h_{j}|$ will vanish provided that the ratio of the mean absolute coefficient in $T$ to the mean absolute coefficient of the full Hamiltonian scales subquadratically in $N_{\textup{orb}}$ . In such cases,

\lim_{N_{\textup{orb}}\rightarrow\infty}\frac{\sum_{i}|\bra{i}H\ket{i_{0}}-E\delta_{i,i_{0}}|}{\sum_{i}|h_{i}|}=\frac{|\bra{i_{0}}H\ket{i_{0}}-E|-|\bra{i_{0}}H\ket{i_{0}}|}{\sum_{i}|h_{i}|}

(46)

Note also that for typical electronic structure problems, the numerator on the right-hand side of the above equation will approach a constant in the $N_{\textup{orb}}\rightarrow\infty$ limit, while the denominator will diverge. Therefore

\lim_{N_{\textup{orb}}\rightarrow\infty}\frac{\sum_{i}|\bra{i}H\ket{i_{0}}-E\delta_{i,i_{0}}|}{\sum_{i}|h_{i}|}=0

(47)

Taking the $N_{\textup{orb}}\rightarrow\infty$ limit of Eq. 44, inserting the above relation, and rearranging yields

\lim_{N_{\textup{orb}}\rightarrow\infty}S\approx\frac{1}{\left(1-\alpha^{2}\right)^{2}}.

(48)

Note that this derivation has assumed that the Hamiltonian is decomposed into co-measurable groups [22] for the purpose of estimating $H_{2,2}$ and the conventional VQE energy. Modifications may be required when other Hamiltonian decomposition techniques are used [9, 30, 23, 8] . In particular, Gonthier et al. [7], have shown that for other decomposition techniques the upper bound on variances does not provide a good approximation of the measurement requirements.