Simulating and mitigating crosstalk
Abstract
We describe an efficient and scalable framework for modeling crosstalk effects on quantum information processors. By applying optimal control techniques, we show how to tuneup arbitrary high-fidelity parallel operations on systems with substantial local and nonlocal crosstalk. Simulations show drastically lower error rates for a 2D square array of 100 superconducting transmon qubits. These results suggest that rather than striving to engineer away undesirable interactions during fabrication, we can largely mitigate their effects through careful characterization and control optimization.
Introduction. — The foremost obstacle to realizing practical quantum computing is its innate sensitivity to errors and noise. One facet of the problem is the dichotomy between the implementation of high-fidelity simultaneous one-qubit and high-fidelity two-qubit gates. Fast two-qubit gates require spatially or spectrally nearby qubits, which reduces the constituent subsystems’ addressability because a resonant pulse intended for one qubit can induce rotations on the others Gershenfeld and Chuang (1997); Pioro-Ladrière et al. (2008). Proposed architectures have typically dealt with this crosstalk by attempting to maximize the gap between qubits or by executing local operations asynchronously Schutjens et al. (2013); Piltz et al. (2013); Boixo et al. (2018). The former solution requires the ability to tune couplings or extra engineering, but the added complexity can adversely impact coherence times and requires additional control wires. In the latter approach, depending on the extent to which the control fields affect neighboring subsystems, the time overhead can be significant.
Crosstalk describes a broad range of effects that violate one of two assumptions: spatial locality and independence of operations Rudinger et al. (2019); Sarovar et al. (2019); Abrams et al. (2019). Gates and other operations are supposed to act on disjoint subsets of qubits. However, unintended interactions can couple the qubits, producing nonlocal correlated noise. Even if an operation has a well-defined action on a particular subset of qubits, the effective noise might depend on its context – what operations affect other qubits.
In this Letter, we introduce a scalable framework for accurately modeling idle and operation crosstalk on experimental devices. Our technique exploits the tensor product structure of local (classical) crosstalk to efficiently express its impact on gates. Through a perturbative expansion, we extend our ideas to nonlocal (quantum) crosstalk and capture its effects to arbitrary order. Provided there is a sufficient degree of control, we can try to minimize the effect of these errors. We illustrate our framework’s novel applications through a series of simulations of parallel gates on a square array of 100 superconducting transmon qubits. In our first experiment, we apply gradient-based optimization to the experimentally significant problem of implementing arbitrary elements of on superconducting transmon qubits. Despite substantial local crosstalk, we show that error rates near the crosstalk-free limit are possible with modern control hardware. We further show how to tuneup simultaneous cross-resonance gates and, again, obtain dramatically higher error rates. Our results suggest that contrary to prevailing opinions Aude Craik et al. (2017); Neill et al. (2018); Reagor et al. (2018); Murali et al. (2020), crosstalk need not be a prohibitive limitation on noisy intermediate-scale quantum (NISQ) era devices Preskill (2018). Higher quality quantum information processors may be made possible by using our techniques to better balance the tradeoffs in device fabrication and pulse design.
Background. — Prior work has often approached the problem of implementing several operations on a collection of qubits by breaking it into a temporally disjoint sequence of gates. In contrast, Ref. Steffen et al. (2000) analyzed the problem of driving two spins with a homogenous field in the setting of NMR. However, it is unclear how to apply the method to multilevel systems such as transmons or trapped ions. Ref. Theis et al. (2016) studied how to drive two transmons coupled to the same cavity suffering from spectral crowding with simultaneous or gates with rotation angles and . In either case, these methods do not directly apply to many-qubit systems, nor do they handle nonlocal correlations. It is our objective to develop an efficient and systematic method for optimizing the implementation of nontrivial parallel operations under general crosstalk.
What crosstalk Sarovar et al. (2019) acts on physical qubits during idling or the implementation of gates (as opposed to preparation or measurement crosstalk), and how can we efficiently simulate and consequently, try to mitigate it? It is natural to classify crosstalk as either local or nonlocal. Local crosstalk can arise when a semiclassical drive field interacts with several qubits, causing unitary errors on supposedly idle qubits, but not entangling independent subsystems. Nonlocal crosstalk creates correlations that are nonfactorizable over system qubits and may originate from, for example, the residual static coupling between two qubits or miscalibration.
Quantifying and reducing crosstalk requires a figure of merit. Depending on the application, it makes sense to evaluate the average fidelity of one-qubit or two-qubit gates rather than the fidelity per clock cycle. Our ideas work in either case, but we focus on the former situation. Local error measures relate directly to fault-tolerance thresholds, are easier to estimate experimentally, and are more common in the literature. We show that the average local fidelity is especially simple to approximate.
Local crosstalk. — Although local crosstalk (typically) produces correlated noise, it can be factorized and simulated efficiently on a digital computer. The induced correlations are classical and do not entangle the individual subsystems. We model local crosstalk via the Hamiltonian
(1) |
Each term acts exclusively on subsystem , and denotes shared classical parameters that result in crosstalk. The vector may, for example, contain the phases and amplitudes that specify drive fields. The average process fidelity Nielsen (2002); Carignan-Dugas et al. (2019) between a target operation and the noisy implementation , where , can be expressed as
(2) |
The equation holds more generally when are CPTP maps, for example, when a dissipative process also affects the system or the control parameters fluctuate over time.
Nonlocal crosstalk. — Unlike local crosstalk, a digital computer cannot usually exactly simulate a large system affected by nonlocal crosstalk. Thus we develop a perturbative technique for simulating nonlocal crosstalk. Our approximation scheme characterizes a noise channel by estimating some of the associated Pauli error rates . The Pauli-twirled noise channel is
(3) | ||||
(4) |
where is the Pauli group on qubits. These error rates provide a partial description of the noise affecting a quantum system and can assist in experimental device calibration. On large experimental devices, we can scalably and estimate the parameters in a way that is robust to state preparation and measurement (SPAM) errors Flammia and Wallman (2019). We might also combine the quantities to calculate holistic measures of device performance, such as the average two-qubit fidelity or global fidelity.
It is helpful to sketch our approach using a graphical model of the noise (see, e.g., Ref. Diestel (2005) for basic graph theory definitions). We construct a graph where each node is a strongly interacting subsystem during an operation of interest, such as a qubit during a single-qubit gate or a two-qubit pair entangled by a cross-resonance interaction. The entire target operation is factorizable over the tensor product space partitioning defined by the nodes. Edges denote nonlocal crosstalk that couples subsystems, and we only allow two-body coupling. We impose the constraint that the graph has limited connectivity (in a spatial spectral sense) since our approach relies on simulating subsystems. The constraint is satisfied in contemporary architectures where a majority of nodes have a degree of at most four.
A pair of positive integers specifies the expansion order of the noise approximation; designates the ‘environment’ distance and the maximum component order. We consider the set of all components of all induced subgraphs of such that the order of every component is less than or equal to , and any component with an order less than has the same edges as in . I.e., we do not look at induced components with order less than . The idea of the simulation scheme is to calculate the Pauli errors that occur on each component.
We approximate the behavior of a component by evolving it along with all vertices of distance at most , generating a map . Next, we compute the diagonal of the Pauli-Liouville representation of the channel . A Walsh-Hadamard transformation relates to the Pauli probability vector , with Flammia and Wallman (2019). The vector is the error probability distribution for a Pauli twirled copy of . Marginalizing the error distribution over the environment produces an estimate of the local error distribution on the target component. After calculating the marginal distributions for all of the components in , we can use the theory of probabilistic graphical models Koller and Friedman (2009) to construct an estimate of the entire Pauli error distribution up to some specified error weight. By truncating the distribution at some error weight, the size of the distribution scales polynomially in the number of qubits.
In practice, including the nearest environmental nodes is sufficient to compute the local error distribution with high relative precision. We can intuitively understand the limited-depth requirement from the fact that intermediate systems must mediate the influence of one subsystem on a nonadjacent subsystem. One may formally bound these effects with Lieb-Robinson bounds Lieb and Robinson (1972).
Single-qubit gate engineering. — We review a typical implementation of single-qubit operations on transmons. A local oscillator acts as a single tone microwave source outputting a constant signal that is shaped by an arbitrary waveform generator via an IQ mixer. A good description of a transmon qubit is an anharmonic oscillator driven by microwave pulses. In the lab frame, the relevant Hamiltonian is
(5) |
where is the annihilation operator of the oscillator, , is the anharmonicity, is the drive phase, is the oscillator’s resonant frequency, specifies the drive envelope, and we set .
The lowest two energy levels form the qubit subspace. Including the third energy level models the leading-order effect of leakage provided the anharmonicity is sufficiently large. After making a rotating wave approximation (RWA) and moving into the rotating frame of the qubit, the Hamiltonian projected into the qubit subspace is
(6) |
To see how the control induces single-qubit gates, consider a resonant pulse (), and , which corresponds to an ideal sufficiently long pulse. The control generates and gates by modulating the coupling between the zero and one states, while the drive phase fixes the rotation axis in the -plane, and the pulse area sets the rotation angle. Rotations about the remaining -axis correspond to a change in the relative phase between the states. Rather than manipulating the transmon’s state, it is equivalent to rotate the control with respect to the state, realizing a virtual- gate Knill et al. (2000, 2008); McKay et al. (2017). We accomplish this physically by adding a phase offset to all subsequent gates. A pulse with an area and a relative phase offset generates the unitary with the notation . Combining two of these phase-offset pulses and a final virtual- realizes any element of McKay et al. (2017).
Consider the problem of implementing an arbitrary element of concurrently on an ensemble of qubits where their respective drive fields weakly interact with other qubits. The semiclassical Hamiltonian governing transmon with local drive crosstalk is
(7) |
The parameters and characterize the crosstalk affecting the system. We focus on the case where each transmon has a local drive. The crosstalk parameters are matrices, and we can set = 1 and = 0 without loss of generality by modifying and . These constraints lead us to interpret as the relative drive strength, and as the phase lag. Experimental data supports the model Xia et al. (2015); Magesan and Gambetta (2018); Xue et al. (2019), and one can efficiently estimate the parameters with standard Rabi and Ramsey experiments.
We simulate a system of transmons that evolve under (7) and include the first three energy levels. The qubits are on a square grid with nonzero only for neighboring qubits. Qubits have a frequency of either GHz or GHz, and no two adjacent qubits have the same frequency. All qubits have anharmonicity MHz. In each iteration of the experiment, the target gate is chosen randomly from . The crosstalk phase lag parameter are sampled randomly from the interval , and we draw from a normal distribution centered at zero. There are two discrete periods of successive evolution, each taking time . It is necessary to pick pulse shapes. On the one hand, we want pulses that yield error rates near the decoherence limit for short gate times. On the other hand, there are experimental realities, such as power-bandwidth constraints and the degree of calibration needed to implement complicated pulses accurately. Balancing these constraints, we pick Gaussian pulses with , and half-derivative DRAG corrections Motzoi et al. (2009); Gambetta et al. (2011); Chen et al. (2016).
Fig. 2 shows the average single-qubit process infidelity as a function of . Green diamonds denote the raw infidelity for a crosstalk-free system (, where is the Kronecker delta). The blue markers are infidelities obtained using the crosstalk-free control scheme but with various strengths of drive crosstalk. The red markers are infidelities obtained with optimized control and the same drive crosstalk as the blue markers. We optimize control pulses with the method of Ref. Machnes et al. (2018). Applying the protocol requires the selection of appropriate optimization parameters. Sticking to our simple control ansatz, we tune the overall magnitude of the resonant quadrature, off-resonant quadrature and the carrier signal phase , for a total of parameters. We observe approximately two orders of magnitude improvement in the infidelity with our crosstalk minimization technique.
In real experimental devices, decoherence significantly reduces the average error rates. Moreover, decoherence errors grow with time, whereas control errors typically decrease. These contrasting effects imply that there is an optimal gate time that minimizes their combined errors. We repeat the simulation implementing with decoherence added to the model. Table 1 presents data showing the potential benefit of our methods.
Crosstalk | Original | Opt., ns | Opt., ns |
---|---|---|---|
0.05 | 6.02e-4 | 1.00e-4 | 1.86e-4 |
0.1 | 7.13e-4 | 1.03e-4 | 1.91e-4 |
0.25 | 2.13e-3 | 1.15e-4 | 1.84e-4 |
0.5 | 1.77e-2 | 1.07e-4 | 1.81e-4 |
Two-qubit gate engineering. — We continue our simulations using the ideal system of fixed-frequency transmons and the parameter values specified above. Our aim is to implement parallel cross-resonance gates Rigetti et al. (2005); Chow et al. (2011); Sheldon et al. (2016); Magesan and Gambetta (2018); Allen et al. (2019), which are equivalent to CNOTs up to single-qubit operations. Constant capacitive coupling provides a mechanism for implementing entangling operations. Assuming equal coupling between all neighboring qubits in the system, the corresponding interaction Hamiltonian is
(8) |
where denotes a sum over all adjacent qubit pairs. The entire system evolves under .
The basic idea of the cross resonance effect is that if we define the qubits in a dressed basis, local microwave drive fields drive both single and two-qubit gates. For two ideal coupled qubits, in the dressed basis, a drive applied to qubit 1 at the frequency of qubit 2 yields the effective Hamiltonian
(9) |
where is the difference of qubit frequencies and we made an RWA. Although we can decouple the direct qubit coupling, higher-levels of the transmon lead to additional terms in the effective Hamiltonian Magesan and Gambetta (2018). We can use the term to generate a maximally entangling gate.
Again, we simulate a system of transmons on a grid and include the first three energy levels of each. We group adjacent qubits in pairs and try to implement 50 simultaneous maximally entangling gates using the CR effect. Our qubits have 8 distinct frequencies ( GHz) to ensure each CR pair is addressable. We set the frequencies so that no two neighbors of one qubit have the same frequency. The target CNOT equivalent is determined using Cartan’s KAK decomposition Tucci (2005) and is invariant to local operations. The qubit coupling strength is MHz. We realize qubit control with the same drives as above but with variable drive detuning and phase offset. We independently parameterize the resonant and off-resonant control envelopes with the first three Hanning window functions
(10) |
There are a total of parameters that determine the drive fields.
Fig. 3 shows the average two-qubit process infidelity of each entangling gate as a function of the gate duration . We compute all points with optimized pulse parameters Machnes et al. (2018) but under different system models. Red diamonds denote the infidelity obtained using a drive-crosstalk-free model and no undesirable coupling. The blue squares are infidelities calculated using the crosstalk-free optimal control but with added drive crosstalk () and nonlocal coupling. The green triangles are infidelities obtained with controls tuned up under the crosstalk model. We approximate the nonlocal crosstalk effects with . The deviation caused by including additional neighbors is unresolvable on the plot.
Conclusion. — We have described techniques to efficiently model and minimize crosstalk that occurs during qubit idling and gates. Compared to other quantum control methods such as dynamical decoupling Viola et al. (1999), which attempts to echo out undesirable interactions, we change parameters so the effects do not appear in the first place. Our results show how to mitigate such effects on transmons using a fast control tuneup procedure on a digital computer. We hope that these methods aid in understanding the role of crosstalk on NISQ devices and validate improved pulse shapes. We plan to extend our simulation capabilities to other platforms such as trapped ions and apply our methods to improve the performance of experimental platforms.
Acknowledgments. — We thank Ian Hincks and Dar Dahlen for helpful discussions and feedback on the manuscript. This research was supported by the U.S. Army Research Office through grant W911NF-14-1-0103 and Quantum Benchmark Inc.
References
- Gershenfeld and Chuang (1997) N. A. Gershenfeld and I. L. Chuang, Science 275, 350 (1997).
- Pioro-Ladrière et al. (2008) M. Pioro-Ladrière, T. Obata, Y. Tokura, Y.-S. S. Shin, T. Kubo, K. Yoshida, T. Taniyama, and S. Tarucha, Nature Physics 4, 776 (2008).
- Schutjens et al. (2013) R. Schutjens, F. A. Dagga, D. J. Egger, and F. K. Wilhelm, Physical Review A 88, 052330 (2013), arXiv:arXiv:1306.2279v1 .
- Piltz et al. (2013) C. Piltz, B. Scharfenberger, A. Khromova, A. F. Varón, and C. Wunderlich, Physical Review Letters 110, 200501 (2013), arXiv:arXiv:1208.2204v3 .
- Boixo et al. (2018) S. Boixo, S. V. Isakov, V. N. Smelyanskiy, R. Babbush, N. Ding, Z. Jiang, M. J. Bremner, J. M. Martinis, and H. Neven, Nature Physics 14, 595 (2018), arXiv:1608.00263 .
- Rudinger et al. (2019) K. Rudinger, T. Proctor, D. Langharst, M. Sarovar, K. Young, and R. Blume-Kohout, Physical Review X 9, 021045 (2019).
- Sarovar et al. (2019) M. Sarovar, T. Proctor, K. Rudinger, K. Young, E. Nielsen, and R. Blume-Kohout, (2019), arXiv:1908.09855 .
- Abrams et al. (2019) D. M. Abrams, N. Didier, S. A. Caldwell, B. R. Johnson, and C. A. Ryan, Physical Review Applied 12, 064022 (2019), arXiv:1908.11856 .
- Aude Craik et al. (2017) D. P. L. Aude Craik, N. M. Linke, M. A. Sepiol, T. P. Harty, J. F. Goodwin, C. J. Ballance, D. N. Stacey, A. M. Steane, D. M. Lucas, and D. T. C. Allcock, Physical Review A 95, 022337 (2017), arXiv:1601.02696 .
- Neill et al. (2018) C. Neill, P. Roushan, K. Kechedzhi, S. Boixo, S. V. Isakov, V. Smelyanskiy, A. Megrant, B. Chiaro, A. Dunsworth, K. Arya, R. Barends, B. Burkett, Y. Chen, Z. Chen, A. Fowler, B. Foxen, M. Giustina, R. Graff, E. Jeffrey, T. Huang, J. Kelly, P. Klimov, E. Lucero, J. Mutus, M. Neeley, C. Quintana, D. Sank, A. Vainsencher, J. Wenner, T. C. White, H. Neven, and J. M. Martinis, Science 360, 195 (2018), arXiv:1709.06678 .
- Reagor et al. (2018) M. Reagor, C. B. Osborn, N. Tezak, A. Staley, G. Prawiroatmodjo, M. Scheer, N. Alidoust, E. A. Sete, N. Didier, M. P. da Silva, E. Acala, J. Angeles, A. Bestwick, M. Block, B. Bloom, A. Bradley, C. Bui, S. Caldwell, L. Capelluto, R. Chilcott, J. Cordova, G. Crossman, M. Curtis, S. Deshpande, T. El Bouayadi, D. Girshovich, S. Hong, A. Hudson, P. Karalekas, K. Kuang, M. Lenihan, R. Manenti, T. Manning, J. Marshall, Y. Mohan, W. O’Brien, J. Otterbach, A. Papageorge, J.-P. Paquette, M. Pelstring, A. Polloreno, V. Rawat, C. A. Ryan, R. Renzas, N. Rubin, D. Russel, M. Rust, D. Scarabelli, M. Selvanayagam, R. Sinclair, R. Smith, M. Suska, T.-W. To, M. Vahidpour, N. Vodrahalli, T. Whyland, K. Yadav, W. Zeng, and C. T. Rigetti, Science Advances 4, eaao3603 (2018), arXiv:arXiv:1706.06570v3 .
- Murali et al. (2020) P. Murali, D. C. McKay, M. Martonosi, and A. Javadi-Abhari, (2020), 10.1145/3373376.3378477, arXiv:2001.02826 .
- Preskill (2018) J. Preskill, Quantum 2, 79 (2018), arXiv:1801.00862 .
- Steffen et al. (2000) M. Steffen, L. M. Vandersypen, and I. L. Chuang, Journal of Magnetic Resonance 146, 369 (2000).
- Theis et al. (2016) L. S. Theis, F. Motzoi, and F. K. Wilhelm, Physical Review A 93, 012324 (2016), arXiv:arXiv:1509.04152v2 .
- Nielsen (2002) M. A. Nielsen, Physics Letters A 303, 249 (2002), arXiv:0205035 [quant-ph] .
- Carignan-Dugas et al. (2019) A. Carignan-Dugas, J. J. Wallman, and J. Emerson, New Journal of Physics 21, 053016 (2019), arXiv:1610.05296 .
- Flammia and Wallman (2019) S. T. Flammia and J. J. Wallman, , 1 (2019), arXiv:1907.12976 .
- Diestel (2005) R. Diestel, Graduate Texts in Mathematics 101 (2005).
- Koller and Friedman (2009) D. Koller and N. Friedman, Probabilistic graphical models: principles and techniques (MIT press, Cambridge, MA, 2009).
- Lieb and Robinson (1972) E. H. Lieb and D. W. Robinson, Communications in Mathematical Physics 28, 251 (1972).
- Knill et al. (2000) E. Knill, R. Laflamme, R. Martinez, and C.-H. Tseng, Nature 404, 368 (2000).
- Knill et al. (2008) E. Knill, D. Leibfried, R. Reichle, J. Britton, R. B. Blakestad, J. D. Jost, C. Langer, R. Ozeri, S. Seidelin, and D. J. Wineland, Physical Review A 77, 012307 (2008).
- McKay et al. (2017) D. C. McKay, C. J. Wood, S. Sheldon, J. M. Chow, and J. M. Gambetta, Physical Review A 96 (2017), 10.1103/PhysRevA.96.022330, arXiv:arXiv:1612.00858v2 .
- Xia et al. (2015) T. Xia, M. Lichtman, K. Maller, A. W. Carr, M. J. Piotrowicz, L. Isenhower, and M. Saffman, Physical Review Letters 114, 100503 (2015), arXiv:1501.02041 .
- Magesan and Gambetta (2018) E. Magesan and J. M. Gambetta, arXiv preprint arXiv:1804.04073 (2018), arXiv:1804.04073 .
- Xue et al. (2019) X. Xue, T. F. Watson, J. Helsen, D. R. Ward, D. E. Savage, M. G. Lagally, S. N. Coppersmith, M. A. Eriksson, S. Wehner, and L. M. Vandersypen, Physical Review X 9, 21011 (2019).
- Motzoi et al. (2009) F. Motzoi, J. M. Gambetta, P. Rebentrost, and F. K. Wilhelm, Physical Review Letters 103, 110501 (2009), arXiv:0901.0534 .
- Gambetta et al. (2011) J. M. Gambetta, F. Motzoi, S. T. Merkel, and F. K. Wilhelm, Physical Review A 83, 012308 (2011), arXiv:arXiv:1011.1949v2 .
- Chen et al. (2016) Z. Chen, J. Kelly, C. Quintana, R. Barends, B. Campbell, Y. Chen, B. Chiaro, A. Dunsworth, A. G. Fowler, E. Lucero, E. Jeffrey, A. Megrant, J. Mutus, M. Neeley, C. Neill, P. J. O’Malley, P. Roushan, D. Sank, A. Vainsencher, J. Wenner, T. C. White, A. N. Korotkov, J. M. Martinis, P. J. J. O’Malley, P. Roushan, D. Sank, A. Vainsencher, J. Wenner, T. C. White, A. N. Korotkov, and J. M. Martinis, Physical Review Letters 116, 020501 (2016), arXiv:arXiv:1509.05470v2 .
- Machnes et al. (2018) S. Machnes, E. Assémat, D. Tannor, and F. K. Wilhelm, Physical Review Letters 120, 150401 (2018), arXiv:1507.04261 .
- Rigetti et al. (2005) C. Rigetti, A. Blais, and M. Devoret, Physical Review Letters 94, 240502 (2005), arXiv:0412009 [quant-ph] .
- Chow et al. (2011) J. M. Chow, A. D. Córcoles, J. M. Gambetta, C. Rigetti, B. R. Johnson, J. A. Smolin, J. R. Rozen, G. A. Keefe, M. B. Rothwell, M. B. Ketchen, and M. Steffen, Physical Review Letters 107, 080502 (2011), arXiv:1106.0553v1 .
- Sheldon et al. (2016) S. Sheldon, E. Magesan, J. M. Chow, and J. M. Gambetta, Physical Review A 93, 060302 (2016), arXiv:1603.04821 .
- Allen et al. (2019) J. L. Allen, R. Kosut, and E. Ginossar, , 1 (2019), arXiv:1902.08056 .
- Tucci (2005) R. R. Tucci, (2005), arXiv:0507171 [quant-ph] .
- Viola et al. (1999) L. Viola, E. Knill, and S. Lloyd, Physical Review Letters 82, 2417 (1999), arXiv:9809071 [quant-ph] .