Enhancing quantum computer performance via symmetrization

Andrii Maksymov IonQ, College Park, MD 20740, USA Jason Nguyen IonQ, College Park, MD 20740, USA Yunseong Nam IonQ, College Park, MD 20740, USA Department of Physics, University of Maryland, College Park, MD 20742, USA Igor Markov IonQ, College Park, MD 20740, USA

Abstract

Large quantum computers promise to solve some critical problems not solvable otherwise. However, modern quantum technologies suffer various imperfections such as control errors and qubit decoherence, inhibiting their potential utility. The overheads of quantum error correction are too great for near-term quantum computers, whereas error-mitigation strategies that address specific device imperfections may lose relevance as devices improve. To enhance the performance of quantum computers with high-quality qubits, we introduce a strategy based on symmetrization and nonlinear aggregation. On a commercial trapped-ion quantum computer, it improves performance of multiple practical algorithms by 100x with no qubit or gate overhead.

I Introduction

Quantum computers (QCs) are rapidly growing in capacity, but are held back by quantum noise, decoherence, crosstalk and gate control inaccuracies Erhard et al. (2019); Wright et al. (2019); Zhu et al. (2022a); Tomesh et al. (2022). Each qubit technology seeks to suppress such irregularities for individual qubits and gates Maksymov et al. (2021a); Blumel et al. (2019); Blümel et al. (2021); Ballance et al. (2016); Elder et al. (2020); Lao et al. (2021). However, the circuit fidelity provided by these methods falls short by orders of magnitude compared to the needs of large-scale quantum algorithms. This necessitates the development of higher-level strategies that systematically improve performance as observed at the algorithmic level, and we offer such techniques in our work. As in conventional computers, firmware in quantum computers provides necessary low-level control for a device’s specific hardware and orchestrates hardware so that software can run more effectively and efficiently. Firmware can implement quantum error correcting codes (QECC) that mathematically promise to tolerate small-enough irregularities via wide-circuit redundancy. However, for near-term quantum computers, irregularities are often too great for these codes to function properly Nielsen and Chuang (2011). Leading QECC techniques require many additional qubits, gates, measurements, low-latency classical-control interconnects, and exorbitant amounts of supporting nonquantum computation. Although eventually QECC promises attractive scalibility, present-day quantum computers are far too small to benefit from QECC and wide-circuit redundancy ¹¹1The overhead for QECC is typically 5-7x the number of qubits. For NISQ system with 50-70 qubits this leaves very few logical qubits for quantum computation..

To make progress with present-day QCs, researchers have developed alternate firmware approaches known as error mitigation. In leading superconducting QCs, the quality of individual physical qubits varies enough for the result to depend on the mapping of logical to physical qubits. Therefore, researchers try to optimally map qubits Murali et al. (2019); Tannu and Qureshi (2019a); Maksymov et al. (2021b), order gates Nishio et al. (2020), and ensemble-average over circuit mappings to mitigate the effect of correlated errors with minimal overhead Tannu and Qureshi (2019b). A series of techniques is based on first accurately characterizing quantum device irregularities and errors, then suppressing them by adjusting control pulses Sun et al. (2021), probabilistically canceling them via applying extra gates Strikis et al. (2021); Suzuki et al. (2021); Temme et al. (2017); Piveteau et al. (2021), or using machine learning on the quantum computational output Strikis et al. (2021); Czarnik et al. (2021). Another insight is that decorrelated noise accumulates at a smaller rate with the number of gates. Hence, gate-level decorrelation Campbell (2017); Kern et al. (2005); Wallman and Emerson (2016); Cai et al. (2020); Hashim et al. (2020) adds gates to decorrelate noise at the cost of some overhead, which can also add to the noise if significant. Researchers have used this effect to systematically amplify noise, which allows one to extrapolate output states to the zero-noise limit Cai (2021); Temme et al. (2017); Kandala et al. (2019); Endo et al. (2018); Song et al. (2019).

Leading error-mitigation strategies developed and deployed for superconducting QCs address stochastic noise and uneven quality of physical qubits. To improve, superconduting QCs must attain uniformly-high qubit quality and low stochastic noise. After such improvements, the error-mitigation techniques we reviewed above may lose relevance. Such improved technologies can be illustrated by the present-day trapped-ion QCs where practically-identical qubits enjoy long decoherence times and low random noise Wang et al. (2021); Lee et al. (2016); Kielpinski et al. (2002). The remaining adverse effects are due to slowly-drifting control inaccuracies Maksymov et al. (2021a). In this work, we develop and validate novel error mitigation techniques for ion-trap QCs with expectation of broader applicability to present-day and future QCs.

We introduce a firmware-level error mitigation strategy called symmetrization. To avoid qubit- and gate-level overhead, it distinguishes the ideal quantum computation by its invariance under certain symmetries that arise at multiple levels of physical implementation ²²2Such as qubit mappings, circuit compilation, gate decomposition, pulse sequences etc.. Our strategy first uses symmetries to generate variant circuit implementations. These variants run on one or multiple QCs, and collected measurement statistics are aggregated via linear or nonlinear techniques. Subsequently, symmetrized effects of deterministic inaccuracies largely cancel out while random noise does not get amplified.

We validated our strategy on the IonQ Aria commercial QC for quantum algorithms of practical interest Lubinski et al. (2021); Zhu et al. (2022b); IonQ (2022a, b). For quantum ML (QML) circuits Zhu et al. (2022b), linear aggregation gives a 1.5-2 $\times$ performance boost. Nonlinear aggregation by voting provides much greater gains but may distort results if used inappropriately. For a 15-qubit quantum Fourier transform (QFT) adder circuit Lubinski et al. (2021) with voting, we see a 100 $\times$ performance gain without distortion. We explore the choice of aggregation in Section II.2 and provide a guide for future uses in Discussions.

II Results

II.1 Symmetrization strategy

Refer to caption — Figure 1: Symmetrized circuit execution: (a) splitting execution into symmetrized variants illustrated by varying qubit assignments, (b) measuring each result affected by individual inaccuracies, (c) aggregating measurement statistics while (d) compares the difference between averaged results and obtained through plurality voting. First, for each of four selected qubit pairs, a circuit variant produces a superposition state $(\left|00\right>+\left|11\right>)/\sqrt{2}$ (target qubits are marked orange) (a). For each qubit pair, the output state is recreated and measured five times in the computational basis (b). If the measurements are grouped per mapping, their statistics significantly deviate from the ideal, but approach the ideal when averaged; residual erroneous counts are shown as red circles and crosses, while all-zero states are green triangles, all-one are blue diamonds (c). When aggregated with plurality vote taken across variants, erroneous counts are filtered out, whereas componentwise averaging preserves all counts (d).

We consider a set of $n$ -qubit computations $\mathfrak{U}_{n}$ , each including state initialization, some operator from $SU(2^{n})$ , and final measurements. Let $\mathfrak{R}_{n}$ be a set of realizations (of all quantum computations in $\mathfrak{U}_{n}$ ) that represent gate-level quantum circuits with qubit assignment, initialization, measurements, postprocessing, and possibly implementation details such as pulse sequences specified. We define the function $\pi:\mathfrak{R}_{n}\to\mathfrak{U}_{n}$ that finds the computation $u$ performed by a given concrete realization $r$ . We define the general symmetries of $\mathfrak{U}_{n}$ , denoted $\Gamma(\mathfrak{U}_{n})$ , as the set of functions $\gamma:\mathfrak{R}_{n}\to\mathfrak{R}_{n}$ that satisfy

\displaystyle\pi\circ\gamma=\pi.

(1)

That is, $\gamma\in\Gamma(\mathfrak{U}_{n})$ if and only if for all $r\in\mathfrak{R}_{n}$ , whenever $\pi(r)=u$ , $\pi(\gamma(r))=u$ . In other words, applying $\gamma$ to any realization will produce the same quantum computation. We also define computation-specific symmetries, $\Gamma(\mathfrak{U}_{n},r)$ , as the set of functions $\gamma:\mathfrak{R}_{n}\to\mathfrak{R}_{n}$ that satisfy $\pi(\gamma(r))=\pi(r)$ for a particular $r$ . For example, general symmetries could be conjugations (in group-action sense) of gate-level circuits by qubit permutations. Namely, the initial state is replaced by its permutation, the gates are applied on permuted qubits, and the measurement results are permuted back. Examples of computation-specific symmetries are gate decompositions, permutations of commuting gates, the addition of gates that preserve a given state (e.g., before measurement), and changes of gates and measurements compensated by changes in postprocessing. When $\mathfrak{R}_{n}$ specifies pulse sequences, symmetries can replace them with physically equivalent ones.

By distributing the computation over multiple symmetries, we cancel out the effect of control inaccuracies without amplification of random errors. The steps of the procedure, as shown in Fig. 1, are then:

1.

Define symmetries $\Gamma$ and sample $\Gamma^{\prime}\subset\Gamma$ .
2.

Generate circuit variants for $\Gamma^{\prime}$ .
3.

Execute each variant on the QC hardware.
4.

Aggregate all measurement statistics.

II.2 Choice of symmetries and aggregations

We now consider why symmetrization works. Let inaccurate realizations $\tilde{r}_{u}$ be determined by instantaneous parameters of the physical system, such that $\pi(\tilde{r}_{u})=u+\delta u$ . A key example is unitary under- or over-rotations of particular gates Maksymov et al. (2021a).

To mitigate the impact of inaccuracies, we consider $\gamma(\tilde{r}_{u})$ for multiple $\gamma\in\Gamma=\Gamma(\mathfrak{U}_{n},r_{u})$ so as to symmetrize the error term in $\pi(\gamma(\tilde{r}_{u}))=u+\delta u_{\Gamma\text{-inv}}+\delta u_{\gamma}$ . In the absence of errors, all realizations $r_{u}$ of $u$ implement the same computation. In the presence of errors, we rely on symmetrization over multiple $\gamma\in\Gamma$ to produce a computation $u+\delta u_{\Gamma\text{-inv}}+\left<\delta u_{\gamma}\right>_{\Gamma}$ . As long as we select an uncorrelated set of $\gamma$ , $||\left<\delta u_{\gamma}\right>_{\Gamma}||\ll\left<||\delta u_{\gamma}||\right>_{\Gamma}$ , and the cumulative effect of non- $\Gamma$ -invariant errors is much reduced.

In practice, rather than aggregating all $\pi(\gamma(\tilde{r}_{u}))$ , we consider the output states produced by $\pi(\gamma(\tilde{r}_{u}))$ and aggregate their measurement statistics (because, e.g., coherently adding two quantum states would require additional qubits). The impact of inaccuracies on an ideal distribution $\mathbf{h_{u}}\in\mathbb{R}^{+}(2^{n})$ may be expressed as $\mathbf{h_{u}}+\delta h_{\Gamma\text{-inv}}+\delta h_{\gamma}$ . Aggregating measurement statistics can “enhance the contrast” between the target output states and erroneously observed states. The error terms may cancel out, but more typically they would be uncorrelated. For example, if $\mathbf{h_{u}}=(0,\dots,1_{k},\dots,0_{2^{n}})$ and $\Gamma=S_{2^{n}}$ , then the symmetrized result would be

\mathbf{h_{u}}+\delta h_{\Gamma\text{-inv}}+\left<\delta h_{\gamma}\right>_{\Gamma}=\mathbf{h_{u}}+\delta h_{\Gamma\text{-inv}}=\\ =\left(\frac{\varepsilon}{2^{n}-1},\dots,(1-\varepsilon)_{k},\dots,\frac{\varepsilon}{2^{n}-1}\right)

(2)

where $\varepsilon$ is the average error on $k$ and $\Gamma$ is sufficiently large. The probability of output $k$ is no better, but other probabilities (that should ideally be 0) become less pronounced. This decreases the probability that an erroneous output is observed repeatedly by chance and helps find the desired outputs with fewer samples.

The term $\delta h_{\Gamma\text{-inv}}$ in Eq. 2 captures the remaining fully-depolarizing error channel, i.e., the effect of incoherent errors Takagi et al. (2022). This residual error can be reduced with aggregation techniques such as plurality voting, e.g., for $\mathbf{h_{u}}$ with $l$ output states of frequency $\frac{1}{l}$ if $\varepsilon<1-l/2^{n}$ as proven in Supplemental Materials.

As a concrete example, we demonstrate the effect of symmetrization on 4-qubit circuits with six two-qubit gates on different qubit pairs and random single-qubit gates mapped to eight ions (see Methods). We model gate miscalibrations as random under-rotations of multiple two-qubit gates fixed per qubit pair. We assume an average under-rotation across all qubit pairs causing a similar error for all variants (Fig. 2a). Symmetries $\Gamma$ are represented by eight random qubit assignments $\gamma$ . For each qubit assignment, we simulated corresponding inaccurate realizations to obtain vectors $\mathbf{h_{u}}+\delta h_{\Gamma\text{-inv}}+\delta h_{\gamma}$ . In Fig. 2, we illustrate 256-dimensional vectors for ideal, individual, and symmetrized results by plotting their two largest principal components (principal component analysis (PCA) was initially performed on $\{\delta h_{\Gamma\text{-inv}}+\delta h_{\gamma}\}$ vectors). In the first case, since all gates are under-rotated by some amount on average, the variants fail to symmetrize the errors because they are only exploring qubit assignment symmetries. Hence, error effects $\delta h_{\Gamma\text{-inv}}$ remain after aggregation as shown in Fig. 2a. In the second example (Fig. 2b), we use additional symmetries of gate decompositions, to zero out $\delta h_{\Gamma\text{-inv}}$ . The effect of under-rotation in fully-entangling XX gates is addressed using an alternative implementation that combines phase-flipped XX^-1 gates with pairs of X-gates thus implementing the same ideal unitary but reversing the effect of under-rotation.

An aggregation strategy for measurement statistics is a procedure that combines measurement statics from multiple implementations of the same computation. Without errors, all implementations should produce identical statistics in the limit (with infinite repetition count). An aggregation strategy is considered stable for a given type of statistics if, provided a set of identical statistics of this type, it produces another copy. Aggregation by componentwise averaging is trivially stable for statistics of any type. Yet aggregation by voting is not. This can be seen for the probability distribution $(1-\varepsilon,\varepsilon)$ which voting-based aggregation brings closer to $(1,0)$ for $\varepsilon<1/2$ . What makes aggregation strategies useful is that ( $i$ ) they coerce arbitrary statistics to statistics of the desired type, ( $ii$ ) they distill original statistics from multiple erroneous variants of the original. To this end, output probability distributions are analytically characterized for many quantum algorithms including Shor’s and Grover’s. The choice of aggregation is determined by the type of output probability distribution of a given quantum algorithm.

For best performance, we recommend aggregation by plurality voting for quantum algorithms with ideal measurement statistics comprising of $l$ outputs with frequencies $\frac{1}{l}$ . Such algorithms have zero-frequency outputs and a subset of target outputs that needs to be determined. For algorithms with different measurement statistics, aggregation by averaging can be used to avoid distortion.

II.3 Experiment

We evaluate the impact of symmetrization and the choice of aggregation strategy experimentally by comparing the results of unsymmetrized runs to symmetrized runs with componentwise averaging and plurality voting. We use the IonQ Aria trapped-ion quantum computer for these experiments, configured to utilize 20 addressable qubits. See methods for experimental details.

Performance is measured by Hellinger fidelity, defined as a statistical overlap $F_{H}=\left(\sum_{i}\sqrt{p_{i}q_{i}}\right)^{2}$ between the actual output statistics $p_{i}$ and the ideal result $q_{i}$ is computed via an error-free simulator. $F_{H}$ ranges from $0$ to $1$ , with 0 capturing probability distributions that do not overlap, and 1 corresponding to a pair of identical distributions. Also known as the Bhattacharyya coefficient Bhattacharyya (1943), $F_{H}$ is commonly used to measure the discrepancy between probability distributions and is consistent with the definition of fidelity for quantum states.

We demonstrate the impact of symmetrization on a 13-qubit single-output QFT-based adder circuit Lubinski et al. (2021). In Fig. 3a we compare the largest output probabilities out of $2^{13}$ between the unsymmetrized histogram, symmetrized with componentwise averaging, and symmetrized with plurality voting. The first and largest value corresponds to the bitstring with ideal probability 1 while the rest should have otuput probability 0. Fig. 3b shows the change in the error bars with the number of shots. We observe that symmetrization with componentwise averaging does not improve the probability of the target bitstring but does reduce next-largest probabilities, which allows for a dramatic increase in the probability of the target state after plurality voting (from 1.5% to 95%). For a 15-qubit QFT-based adder, the boost exceeds 100 $\times$ .

Next, we examine the performance of symmetrization for several use cases shown in Fig. 4. All jobs had 2500 shots taken with output probability distributions that vary in the number of correct output states with nonzero probability, and thus benefit differently from different aggregation strategies. In Fig. 4a, we evaluate results for QFT-based adders, phase estimation, and amplitude estimation with a single output state Lubinski et al. (2021). We see that symmetrization with plurality voting significantly increases $F_{H}$ while symmetrized runs with componentwise averaging show no improvement. In Fig. 4b, we compare results for amplitude estimation and Monte Carlo sampling circuits before tracing out the ancillary qubits. Symmetrization with plurality voting still shows the strongest improvement in $F_{H}$ but componentwise averaging is also better than no symmetrization because it evens out the errors across the four target states. In Fig. 4c, we evaluate symmetrization on variational quantum eigensolver (VQE) and quantum machine learning (QML) circuits Zhu et al. (2022b). Those circuits have broader, irregular output distribution, so that symmetrization with componentwise averaging shows the best improvement while plurality voting can skew the results. Circuits with more peaked output probability distributions often benefit more from aggregation with plurality voting (see Methods).

III Discussion

To enhance the performance of present-day quantum computers, scientists and engineers devote considerable effort to finding and mitigating error sources. However, device inaccuracies and computational errors tend to persist even after heroic improvements. In particular, coherent errors — which often arise from unintentional mis-calibrations that may drift in time — can significantly degrade performance (error mitigation techniques run into limitations for incoherent errors, as proven in Takagi et al. (2022) via lower bounds). Even without hardware improvement, our strategy boosts QC performance because systematic errors vary between certain symmetric implementations. Symmetrization is the process of creating variant implementations of quantum computation on specific hardware, so as to diminish errors (Fig. 2b) and improve QC performance. In particular, we split a given number of executions of a quantum circuit into batches, and each batch is executed using a different realization that should, by symmetry, give the same outcome in the absence of inaccuracies. To aggregate the measurement statistics of symmetrized runs, we show that appropriately chosen techniques produce strong gains on a commercial QC.

Aggregation by componentwise averaging is stable for measurement statistics of any type. We use it to demonstrate a 2 $\times$ fidelity improvement for QML and VQE algorithms which produce few low-frequency outputs. For the algorithms with many zero-frequency outputs (QFT-based adders, amplitude estimation, phase estimation, Monte Carlo sampling) where the output result is encoded in a small set of target output states, componentwise averaging gives little to no improvement since it cannot recover zero-frequency outputs. Plurality voting is stable for this type of measurement statistics and demonstrates an up to 100 $\times$ performance boost on our 20-qubit commercial QC IonQ (2022a). Our error mitigation strategy appears applicable to multiple qubit technologies and is compatible with prior error-mitigation strategies.

IV Methods

Here, we give additional details on the two steps of symmetrization: the sampling of symmetries and the aggregation of measurement statistics. We also outline several considerations of scalability for these two steps. Details on our experiment and simulation are given as well.

IV.1 Sampling symmetries

Since using all possible symmetries for a given quantum computation is impractical, we need to sample from those symmetries. For an error-free quantum computation, it suffices to use the identity symmetry alone. Assuming a single inaccuracy of a known type, very few symmetries would be sufficient, regardless of the magnitude of inaccuracies or the number of qubits. As the dimensionality of the error space grows, more symmetries must be sampled.

We sample symmetries $\gamma$ to minimize $\left<\delta u_{\gamma}\right>$ . Selecting dissimilar (rather than random Takagi et al. (2022)) symmetries reduces the bias and decorrelates inaccuracies between the variants. If symmetries $\Gamma$ are qubit assignments, one may select assignments that share fewer gates between physical qubits for a given device-specific connectivity.

IV.2 Aggregation strategies

Continuing the discussion in Section II.B, we compare two aggregation strategies for measurement statistics: one represents them by frequency distributions, and the other — by raw output samples.

Componentwise averaging. Our first strategy performs componentwise averaging of frequencies in given histograms Tannu and Qureshi (2019b). It suites computations with few or no zeros in the ideal probability distribution, such as VQE or QML circuits. Fig. 2b represents with vectors the differences between the histogram of each variant and the ideal histogram. With an appropriate sampling of symmetries, these vectors cancel out and their sum converges to the ideal one as the number of variants increases. Componentwise averaging is unable to recover zero frequencies in ideal output distributions. Intuitively, averaging is related to the set-union operation, whereas set-intersection suggests different aggregation methods. Namely, methods based on voting and can filter out low-frequency outputs and recover zero frequencies.

Plurality voting. To specify aggregation by plurality voting we represent measurement results for each circuit variant by a set of bitstrings, one per shot. Since each variant has the same number of shots, each shot can be represented by the same number of variant bitstrings (see Fig. 1c). The winning bitstring is determined by the plurality vote that additionally exceeds a specified threshold. Since the order of bitstrings does not matter, voting per shot is repeated many times over the scrambled orderings of bitstrings in each variant. If no winning bitstring is found, the threshold is reduced by one. If no winner exists for the threshold value of two, a componentwise average of variant histograms is returned (this is common for spread-out distributions and/or also when available samples lack statistical significance). After accumulating counts from all winning bitstrings, the final histogram is normalized. The voting threshold is determined by training runs for a given QC architecture. Executed for a set of circuits with known outputs, the training runs also help to determine optimal numbers of variants, repetitions, gate decompositions etc. These hyperparameters are used for multiple circuits.

Due to the nonlinearity of voting, it is a stable aggregation strategy for ideal output probabilities with $l$ equally probable outputs and $r-l$ zero frequencies (see Supplemental Materials for proofs). Relevant circuits include QFT-based adders, phase and amplitude estimation algorithms, some Monte Carlo algorithms.

IV.3 Considerations of scalability

Sampling of symmetries. Uniformly random selection Patel and Tiwari (2020) offers a computationally scalable sampling method in that the memory of all previously selected symmetries is not necessary to select the next $\gamma$ . Since $k$ uniformly random symmetries produce a uniformly random set of $\delta u_{\gamma}$ , we have $\left<\delta u_{\gamma}\right>\propto 1/\sqrt{k}$ Takagi et al. (2022). Selecting dissimilar symmetries can reduce the expectation $\left<\delta u_{\gamma}\right>$ , just like low-discrepancy sequences Halton (1960); Sobol' (1967); Fisher and Yates (1974) improve upon random samples. To avoid specializing symmetry selection to each individual computation, we engineer it for entire classes of computation, possibly with moderate suboptimality. For example, similar VQE circuits (on the same number of qubits) can be viewed as one class.

Aggregation of measurement statistics. Run time and memory complexity depend on the number of observed output states rather than the number of all possible states. For componentwise averaging, the postprocessing comes down to the simple or weighted merger of output counts (zero frequencies are implicit). Plurality voting is performed in small groups of outputs and does not require significant memory.

IV.4 Experimental details

We use the IonQ Aria IonQ (2022a) trapped-ion quantum computer which utilizes trapped Ytterbium ions individually addressed by pulses of 355 nm light. These pulses can be engineered to generate a Mølmer-Sørensen entangling gate between ions as well as single qubit rotations/gates. The Aria system uses 22 such ions as qubits to perform quantum information processing. Here, we split our experiments into 25 different maps (variants) between physical and computational qubits, running 100 experimental shots on each variant resulting in 2500 total experimental repetitions. For circuits on more than six qubits, we genrated permutations on a set of physical qubits. Otherwise, two additional physical qubits were utilized to increase the number of diverse mappings. All variants were measured under similar conditions. To this end, for most of our experiments, we executed our jobs within one calibration cycle. Whenever this was not possible (e.g. due to ion-chain loss), the calibration parameters were carefully replicated.

IV.5 Simulation details

We show the effect of symmetrization on a 4-qubit random circuit (Fig. 5a) in eight implementations with varying qubit assignment onto eight ions (Fig. 5b). We model gate miscalibrations as random under-rotations of multiple two-qubit gates fixed per ion pair. We assume an average under-rotation across all qubit pairs causing a similar error for all variants (Fig. 2a). Symmetries $\Gamma$ are represented by eight random qubit assignments $\gamma$ . For each qubit assignment, we simulated corresponding inaccurate realizations to obtain vectors $\mathbf{h_{u}}+\delta h_{\Gamma\text{-inv}}+\delta h_{\gamma}$ .

In the first case (Fig. 2a), we use only vary qubit assignment between the implementations (Fig. 5b) while in the other case (Fig. 2b), we also replace every fourth XX-gate with a phase-flipped XX^-1 gates with pairs of X-gates thus implementing the same ideal unitary but reversing the effect of under-rotation (Fig. 2b).

References

Erhard et al. (2019) A. Erhard, J. J. Wallman, L. Postler, M. Meth, R. Stricker, E. A. Martinez, P. Schindler, T. Monz, J. Emerson, and R. Blatt, Nature Communications 10 (2019), URL https://doi.org/10.1038/s41467-019-13068-7.
Wright et al. (2019) K. Wright, K. M. Beck, S. Debnath, J. M. Amini, Y. Nam, N. Grzesiak, J.-S. Chen, N. C. Pisenti, M. Chmielewski, C. Collins, et al., Nature Communications 10, 1 (2019), ISSN 2041-1723, URL http://dx.doi.org/10.1038/s41467-019-13534-2.
Zhu et al. (2022a) D. Zhu, Z. P. Cian, C. Noel, A. Risinger, D. Biswas, L. Egan, Y. Zhu, A. M. Green, C. H. Alderete, N. H. Nguyen, et al., Nature Communications 13 (2022a), URL https://doi.org/10.1038/s41467-022-34279-5.
Tomesh et al. (2022) T. Tomesh, P. Gokhale, V. Omole, G. S. Ravi, K. N. Smith, J. Viszlai, X.-C. Wu, N. Hardavellas, M. R. Martonosi, and F. T. Chong, Supermarq: A scalable quantum benchmark suite (2022), URL https://arxiv.org/abs/2202.11045.
Maksymov et al. (2021a) A. O. Maksymov, J. Nguyen, V. Chaplin, Y. Nam, and I. L. Markov, in 28th IEEE International Symposium on High-Performance Computer Architecture (2021a), eprint 2108.03708.
Blumel et al. (2019) R. Blumel, N. Grzesiak, and Y. Nam, Power-optimal, stabilized entangling gate between trapped-ion qubits (2019), eprint 1905.09292.
Blümel et al. (2021) R. Blümel, N. Grzesiak, N. H. Nguyen, A. M. Green, M. Li, A. Maksymov, N. M. Linke, and Y. Nam, Phys. Rev. Lett. 126, 220503 (2021), URL https://link.aps.org/doi/10.1103/PhysRevLett.126.220503.
Ballance et al. (2016) C. J. Ballance, T. P. Harty, N. M. Linke, M. A. Sepiol, and D. M. Lucas, Phys. Rev. Lett. 117, 060504 (2016).
Elder et al. (2020) S. S. Elder, C. S. Wang, P. Reinhold, C. T. Hann, K. S. Chou, B. J. Lester, S. Rosenblum, L. Frunzio, L. Jiang, and R. J. Schoelkopf, Physical Review X 10 (2020), URL https://doi.org/10.1103/physrevx.10.011001.
Lao et al. (2021) L. Lao, P. Murali, M. Martonosi, and D. Browne, 2021 ACM/IEEE 48th Annual International Symposium on Computer Architecture (ISCA) (2021), URL http://dx.doi.org/10.1109/ISCA52012.2021.00071.
Nielsen and Chuang (2011) M. A. Nielsen and I. L. Chuang, Quantum Computation and Quantum Information: 10th Anniversary Edition (Cambridge University Press, USA, 2011), 10th ed., ISBN 1107002176.
Murali et al. (2019) P. Murali, J. M. Baker, A. J. Abhari, F. T. Chong, and M. Martonosi, Noise-adaptive compiler mappings for noisy intermediate-scale quantum computers (2019), eprint 1901.11054.
Tannu and Qureshi (2019a) S. S. Tannu and M. K. Qureshi, in Proceedings of the Twenty-Fourth International Conference on Architectural Support for Programming Languages and Operating Systems (Association for Computing Machinery, New York, NY, USA, 2019a), ASPLOS ’19, p. 987–999, ISBN 9781450362405, URL https://doi.org/10.1145/3297858.3304007.
Maksymov et al. (2021b) A. Maksymov, P. Niroula, and Y. Nam, Quantum Science and Technology 6, 034009 (2021b), URL https://doi.org/10.1088/2058-9565/abf718.
Nishio et al. (2020) S. Nishio, Y. Pan, T. Satoh, H. Amano, and R. V. Meter, J. Emerg. Technol. Comput. Syst. 16 (2020), ISSN 1550-4832, URL https://doi.org/10.1145/3386162.
Tannu and Qureshi (2019b) S. S. Tannu and M. Qureshi, in Proceedings of the 52nd Annual IEEE/ACM International Symposium on Microarchitecture (Association for Computing Machinery, New York, NY, USA, 2019b), MICRO ’52, p. 253–265, ISBN 9781450369381, URL https://doi.org/10.1145/3352460.3358257.
Sun et al. (2021) J. Sun, X. Yuan, T. Tsunoda, V. Vedral, S. C. Benjamin, and S. Endo, Physical Review Applied 15 (2021), ISSN 2331-7019, URL http://dx.doi.org/10.1103/PhysRevApplied.15.034026.
Strikis et al. (2021) A. Strikis, D. Qin, Y. Chen, S. C. Benjamin, and Y. Li, Learning-based quantum error mitigation (2021), eprint 2005.07601.
Suzuki et al. (2021) Y. Suzuki, S. Endo, K. Fujii, and Y. Tokunaga, Quantum error mitigation for fault-tolerant quantum computing (2021), eprint 2010.03887.
Temme et al. (2017) K. Temme, S. Bravyi, and J. M. Gambetta, Physical Review Letters 119 (2017), ISSN 1079-7114, URL http://dx.doi.org/10.1103/PhysRevLett.119.180509.
Piveteau et al. (2021) C. Piveteau, D. Sutter, S. Bravyi, J. M. Gambetta, and K. Temme, Error mitigation for universal gates on encoded qubits (2021), eprint 2103.04915.
Czarnik et al. (2021) P. Czarnik, A. Arrasmith, P. J. Coles, and L. Cincio, Error mitigation with clifford quantum-circuit data (2021), eprint 2005.10189.
Campbell (2017) E. Campbell, Physical Review A 95 (2017), ISSN 2469-9934, URL http://dx.doi.org/10.1103/PhysRevA.95.042306.
Kern et al. (2005) O. Kern, G. Alber, and D. L. Shepelyansky, The European Physical Journal D 32, 153–156 (2005), ISSN 1434-6079, URL http://dx.doi.org/10.1140/epjd/e2004-00196-9.
Wallman and Emerson (2016) J. J. Wallman and J. Emerson, Physical Review A 94 (2016), ISSN 2469-9934, URL http://dx.doi.org/10.1103/PhysRevA.94.052325.
Cai et al. (2020) Z. Cai, X. Xu, and S. C. Benjamin, npj Quantum Information 6 (2020), ISSN 2056-6387, URL http://dx.doi.org/10.1038/s41534-019-0233-0.
Hashim et al. (2020) A. Hashim, R. K. Naik, A. Morvan, J.-L. Ville, B. Mitchell, J. M. Kreikebaum, M. Davis, E. Smith, C. Iancu, K. P. O’Brien, et al., Randomized compiling for scalable quantum computing on a noisy superconducting quantum processor (2020), eprint 2010.00215.
Cai (2021) Z. Cai, Multi-exponential error extrapolation and combining error mitigation techniques for nisq applications (2021), eprint 2007.01265.
Kandala et al. (2019) A. Kandala, K. Temme, A. D. Córcoles, A. Mezzacapo, J. M. Chow, and J. M. Gambetta, Nature 567, 491–495 (2019), ISSN 1476-4687, URL http://dx.doi.org/10.1038/s41586-019-1040-7.
Endo et al. (2018) S. Endo, S. C. Benjamin, and Y. Li, Physical Review X 8 (2018), ISSN 2160-3308, URL http://dx.doi.org/10.1103/PhysRevX.8.031027.
Song et al. (2019) C. Song, J. Cui, H. Wang, J. Hao, H. Feng, and Y. Li, Science Advances 5 (2019), URL https://doi.org/10.1126/sciadv.aaw5686.
Wang et al. (2021) P. Wang, C.-Y. Luan, M. Qiao, M. Um, J. Zhang, Y. Wang, X. Yuan, M. Gu, J. Zhang, and K. Kim, Nature Communications 12 (2021), ISSN 2041-1723, URL http://dx.doi.org/10.1038/s41467-020-20330-w.
Lee et al. (2016) A. C. Lee, J. Smith, P. Richerme, B. Neyenhuis, P. W. Hess, J. Zhang, and C. Monroe, Phys. Rev. A 94, 042308 (2016), URL https://link.aps.org/doi/10.1103/PhysRevA.94.042308.
Kielpinski et al. (2002) D. Kielpinski, C. Monroe, and D. J. Wineland, Nature 417, 709 (2002), URL https://doi.org/10.1038/nature00784.
Lubinski et al. (2021) T. Lubinski, S. Johri, P. Varosy, J. Coleman, L. Zhao, J. Necaise, C. H. Baldwin, K. Mayer, and T. Proctor, Application-oriented performance benchmarks for quantum computing (2021), eprint 2110.03137.
Zhu et al. (2022b) D. Zhu, W. Shen, A. Giani, S. R. Majumder, B. Neculaes, and S. Johri, Copula-based risk aggregation with trapped ion quantum computers (2022b), URL https://arxiv.org/abs/2206.11937.
IonQ (2022a) IonQ, Ionq announces first quarter 2022 financial results (2022a), (Accessed March 31, 2022), URL https://www.businesswire.com/news/home/20220516005952/en/IonQ-Announces-First-Quarter-2022-Financial-Results.
IonQ (2022b) IonQ, Ionq aria achieves record 20 algorithmic qubits (2022b), (Accessed February 25, 2022), URL https://thequantuminsider.com/2022/02/25/ionq-aria-achieves-record-20-algorithmic-qubits/.
Takagi et al. (2022) R. Takagi, S. Endo, S. Minagawa, and M. Gu, npj Quantum Information 8 (2022), URL https://doi.org/10.1038/s41534-022-00618-z.
Bhattacharyya (1943) A. Bhattacharyya, Bull. Calcutta Math. Soc. 35, 99 (1943).
Patel and Tiwari (2020) T. Patel and D. Tiwari, in SC20: International Conference for High Performance Computing, Networking, Storage and Analysis (IEEE, 2020), URL https://doi.org/10.1109/sc41405.2020.00019.
Halton (1960) J. H. Halton, Numerische Mathematik 2, 84 (1960), URL https://doi.org/10.1007/bf01386213.
Sobol' (1967) I. Sobol', USSR Computational Mathematics and Mathematical Physics 7, 86 (1967), URL https://doi.org/10.1016/0041-5553(67)90144-9.
Fisher and Yates (1974) R. A. Fisher and F. Yates, Statistical tables for biological, agricultural and medical research (Longman, London, England, 1974), 6th ed.

V Acknowledgments

We thank John Gamble for insightful discussions and valuable suggestions.

VI Contributions

I.M. and Y.N. conceived and coordinated the project. I.M. proposed the idea of the strategy and designed the methods with A.M. J.N. conducted the experiment, A.M. wrote and performed the simulations and data processing. All authors contributed to writing the manuscript.

Appendix A Supplemental Information - validity and efficacy of plurality voting

As detailed in the main text, plurality voting is a powerful aggregation strategy because it is nonlinear and can strongly suppress errors for some circuits. However, it can also degrade performance if used for other circuits. Here, we formally analyze the properties of the plurality vote procedure, detailing the conditions that should be satisfied for its use to be beneficial.

Let us first consider the simple case with no finite-sample effects, no errors, and $r$ possible valid output states. We consider $m$ variants with the probability $h_{i}$ to measure state $i$ . The probability to measure each output state a specified number of times $\{x_{1},\dots,x_{r}\}=\mathbf{x}^{r}$ such that $\sum_{k=1}^{r}x_{k}=m$ can be written in terms of multinomial coefficients as

\displaystyle\gamma(m,\mathbf{x}^{r})=\binom{m}{x_{1},\dots,x_{r}}\prod_{j=1}^{r}h_{j}^{x_{j}},

(3)

We can then write down the probability to find a state $i$ exactly $x_{i}$ times out of $m$ variants by summing $\gamma(m,\mathbf{x}^{r})$ over every variable in $\mathbf{x}^{r}$ except the prefixed $x_{i}$ denoting the constraint $\sum_{k=1}^{r}x_{k}=m$ with a primed sum as

\displaystyle\gamma_{i}(m,x_{i})=\sideset{}{{}^{\prime}}{\sum}_{\mathbf{x}^{r}\setminus x_{i}=0}\binom{m}{x_{1},\dots,x_{r}}\prod_{j=1}^{r}h_{j}^{x_{j}},

(4)

The probability that the measured state $i$ is the most frequently measured state and is found at least $t$ times out of $m$ variants can be expressed as a sum over $\gamma(m,x_{i})$ with an additional constraint that requires any $x_{k}\in\mathbf{x}^{r}\setminus x_{i}$ to be less than $x_{i}$ :

\displaystyle G_{i}(m,t)=\sum_{x_{i}=t}^{m}\phantom{x}\sideset{}{{}^{\prime}}{\sum}_{\mathbf{x}^{r}\setminus x_{i}=0}^{x_{i}-1}\binom{m}{x_{1},\dots,x_{r}}\prod_{j=1}^{r}h_{j}^{x_{j}}

(5)

The output probability of state $i$ in the aggregated results can be expressed through the normalized $G_{i}(m,t)$

\displaystyle g_{i}(m,t)=\frac{G_{i}(m,t)}{\sum_{j}G_{j}(m,t)}

(6)

Theorem A.1.

For any ideal output probability distribution $\{h_{1},\dots,h_{r}\}$ and any two states $1\leq i\neq j\leq r$ , the corresponding aggregated output probabilities $g_{i}$ , $g_{j}$ satisfy $g_{i}/g_{j}<h_{i}/h_{j}$ if $h_{i}<h_{j}\notin\{0,1\}$ .

Proof.

Let us consider an output probability distribution with $l$ nonzero output states. $G_{i}(m,t)$ can be written as

\displaystyle G_{i}(m,t)=\sum_{x_{i}=t}^{m}\sum_{x_{j}=0}^{\begin{subarray}{c}\min(x_{i}-1,\\ m-x_{i})\end{subarray}}h_{i}^{x_{i}}h_{j}^{x_{j}}f_{ij}^{m}(x_{i},x_{j}),

(7)

where

\displaystyle f_{ij}^{m}(x_{i},x_{j})=\sideset{}{{}^{\prime}}{\sum}_{\mathbf{x}^{r}\setminus x_{i}\setminus x_{j}=0}^{x_{i}-1}\phantom{|}\prod_{k=1}^{l}h_{k}^{x_{k}}\binom{m}{x_{1},\dots,x_{l}}

(8)

Let us change the summation over $t\leq x_{i}\leq m$ and $0\leq x_{j}\leq\min(x_{i}-1,m-x_{i})$ to $q=x_{i}+x_{j}$ and $u=x_{i}-x_{j}$ where $1\leq u\leq m$ and $\max(u,2t-u)\leq q\leq m$ , which can be confirmed geometrically, so that

G_{i}(m,t)=\sum_{u=1}^{m}h_{i}^{u}\sum_{\begin{subarray}{c}q=\max(u,\\ 2t-u)\end{subarray}}^{m}(h_{i}h_{j})^{\frac{q-u}{2}}f_{ij}^{m}(\tfrac{q+u}{2},\tfrac{q-u}{2})=\\ =\sum_{u=1}^{m}h_{i}^{u}\phi_{ij}(m,t,u),

(9)

where $\phi_{ij}^{m}(t,u)=\sum_{q=\max(t,u)}^{m}(h_{i}h_{j})^{\frac{q-u}{2}}f_{ij}^{m}(\tfrac{q+u}{2},\tfrac{q-u}{2})$ . The ratio between the aggregated output probabilities can be expressed as

\frac{g_{i}(m,t)}{g_{j}(m,t)}=\frac{G_{i}(m,t)}{G_{j}(m,t)}=\frac{h_{i}}{h_{j}}\frac{\sum_{u=1}^{m}h_{i}^{u-1}\phi_{ij}^{m}(t,u)}{\sum_{u=1}^{m}h_{j}^{u-1}\phi_{ij}^{m}(t,u)}

(10)

Comparing the sums term by term, since $h_{i}<h_{j}$ , for $u\geq 1$ , $h_{i}^{u-1}\leq h_{j}^{u-1}$ so that $\frac{g_{i}(m,t)}{g_{j}(m,t)}<\frac{h_{i}}{h_{j}}$ . ∎

Corollary A.1.1.

If $h_{i}=0$ , $g_{i}(m,t)=\alpha G_{i}(m,t)=\alpha\sum_{u=1}^{m}h_{i}^{u}\phi_{ij}(m,t,u)=0$ .

Corollary A.1.2.

If $h_{i}=1$ , $h_{k\neq i}=g_{k\neq i}=0$ , $g_{i}=1$ .

Corollary A.1.3.

For any output probability distribution $\{h_{1},\dots,h_{r}\}$ such that $h_{i}=1/l$ for $1\leq l\leq r$ states and $h_{i}=0$ for the rest, $g_{i}(m,t)=h_{i}$ .

\displaystyle g_{i}(m,t)=\begin{cases}0,&h_{i}=0\\ \frac{G_{i}(m,t)}{\sum^{l}G_{j}(m,t)}=\frac{G_{i}(m,t)}{lG_{i}(m,t)}=\frac{1}{l},&h_{i}=1/l\end{cases}

(11)

Corollary A.1.4.

If there is an imbalance $d$ between a state with probability $1/l$ and a state with probability $0$ so that $h_{i}=\frac{1}{l}-d$ and $h_{j}=d$ , $\frac{g_{i}(m,t)}{g_{j}(m,t)}>\frac{h_{i}}{h_{j}}$ given that $h_{i}>h_{j}$ or that $0<d<\frac{1}{2l}$ .

Corollary A.1.5.

If there is an imbalance $d$ between two states with probability $1/l$ so that $h_{i}=\frac{1}{l}-d$ and $h_{j}=\frac{1}{l}+d$ , $\frac{g_{i}(m,t)}{g_{j}(m,t)}>\frac{h_{i}}{h_{j}}$ given that $h_{i}>h_{j}$ or that $0<d<\frac{1}{l}$ .

It follows from Theorem A.1 and its corollaries that plurality voting is a stable aggregation strategy for ideal output probabilities with $1\leq l\leq r$ equally probable outputs and $r-l$ zero frequencies. If the non-zero output probabilities differ, the smaller ones get further reduced in the aggregated results, while the larger ones get amplified. This property helps to reduce the aggregated probabilities of zero-frequency outputs when they are erroneously measured.