This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Low-Stabilizer-Complexity Quantum States Are Not Pseudorandom

Sabee Grewal The University of Texas at Austin. {sabee, kretsch}@cs.utexas.edu, {vishnu.iyer, dliang}@utexas.edu.    Vishnu Iyer11footnotemark: 1    William Kretschmer11footnotemark: 1    Daniel Liang11footnotemark: 1
Abstract

We show that quantum states with “low stabilizer complexity” can be efficiently distinguished from Haar-random. Specifically, given an nn-qubit pure state |ψ\ket{\psi}, we give an efficient algorithm that distinguishes whether |ψ\ket{\psi} is (i) Haar-random or (ii) a state with stabilizer fidelity at least 1k\frac{1}{k} (i.e., has fidelity at least 1k\frac{1}{k} with some stabilizer state), promised that one of these is the case. With black-box access to |ψ\ket{\psi}, our algorithm uses O(k12log(1/δ))O\!\left(k^{12}\log(1/\delta)\right) copies of |ψ\ket{\psi} and O(nk12log(1/δ))O\!\left(nk^{12}\log(1/\delta)\right) time to succeed with probability at least 1δ1-\delta, and, with access to a state preparation unitary for |ψ\ket{\psi} (and its inverse), O(k3log(1/δ))O\!\left(k^{3}\log(1/\delta)\right) queries and O(nk3log(1/δ))O\!\left(nk^{3}\log(1/\delta)\right) time suffice.

As a corollary, we prove that ω(log(n))\omega(\log(n)) TT-gates are necessary for any Clifford+TT circuit to prepare computationally pseudorandom quantum states, a first-of-its-kind lower bound.

1 Introduction

The stabilizer formalism [gottesman1997stabilizer] plays a central role in quantum information. Stabilizer states are states that lie in the intersection of the positive eigenspaces of 2n2^{n} commuting Pauli operators. Stabilizer states can be generated by Clifford circuits, which are the group of unitary transformations that normalize the Pauli group. Stabilizer states and the Clifford group have widespread applications in quantum error correction [shor1995codes, calderbank1996codes], measurement-based quantum computation [raussendorf2000mbqc], randomized benchmarking [knill2008benchmarking], and quantum learning algorithms [huangkuengpresskill]. These applications are largely thanks to the rich algebraic structure afforded by the stabilizer formalism.

Stabilizer states are also one of the few classes of states that admit efficient learning algorithms. Montanaro [montanaro2017learning] gave an algorithm that takes O(n)O(n) copies of an nn-qubit stabilizer state and correctly identifies the state with high probability in time O(n3)O(n^{3}). Gross, Nezami, and Walter [gross2021schur] gave an algorithm for property testing stabilizer states, which is the task of distinguishing whether a state is a stabilizer state or is far from any stabilizer state. Remarkably, this algorithm requires only 66 copes of the state.

Despite finding numerous applications, Clifford circuits are not universal for quantum computation. Furthermore, in 1998, Gottesman and Knill showed that Clifford circuits acting on stabilizer states can be efficiently classically simulated [gottesman1998heisenberg, aaronson2004stabilizer]. However, with the additional ability to apply a TT-gate (the gate |00|+eiπ/4|11|\ket{0}\!\!\bra{0}+e^{i\pi/4}\ket{1}\!\!\bra{1}), the resulting gate set becomes universal. Therefore, efficient simulation of so-called Clifford+TT circuits would imply \BPP=\BQP\BPP=\BQP, and a large line of work has been devoted to developing better simulation algorithms [PashayanPhysRevLett.115.070501, BrayviPhysRevLett.116.250501, RallPhysRevA.99.062337, Bravyi2019simulationofquantum].

Currently, the best-performing simulation algorithms are based on modeling the output state of a quantum circuit as a decomposition of stabilizer states [Bravyi2019simulationofquantum]. These decompositions give rise to simulation algorithms whose runtimes scale polynomially in the complexity of the decomposition. One such complexity measure is the stabilizer extent. Consider the state |ψ=ici|ϕi\ket{\psi}=\sum_{i}c_{i}\ket{\phi_{i}} for cic_{i}\in\mathbb{C} and stabilizer states |ϕi\ket{\phi_{i}}. The stabilizer extent is the minimum (i|ci|)2\left(\sum_{i}{\lvert c_{i}\rvert}\right)^{2} over all such decompositions of |ψ\ket{\psi}, and scales exponentially in the number of TT-gates in the circuit producing the state. A closely-related complexity measure is the stabilizer fidelity, which is the maximum overlap between |ψ\ket{\psi} and any stabilizer state. Indeed, the inverse of stabilizer fidelity lower bounds stabilizer extent [Bravyi2019simulationofquantum]. Collectively, we informally refer to states with either low stabilizer extent or non-negligible stabilizer fidelity as states of low “stabilizer complexity”.

As a generalization of stabilizer states, it is natural to ask whether states of low stabilizer complexity are also efficiently learnable, and indeed a similar question has been raised before [arunachalam2022phase]. Nevertheless, this problem remains largely open except in some highly restricted settings [lai2022learning]. This could be in part because many of the useful properties of stabilizer states provably fail to generalize to states with low stabilizer complexity. For example, [hinsche2022learning] observed that one can efficiently learn the output distribution of any Clifford circuit, given samples from this distribution.111Indeed, every such distribution is simply an affine subspace of 𝔽2n\mathbb{F}_{2}^{n}. However, this task already becomes intractable for circuits with a single TT-gate (producing a state of constant stabilizer extent), where [hinsche2022learning] proved that learning the output distribution is as hard as the learning parities with noise problem.

Furthermore, it is known that stabilizer states form a tt-design for t=3t=3, meaning that random stabilizer states duplicate the first 3 moments of the Haar measure [kuenghttps://doi.org/10.48550/arxiv.1510.02767, webb2016clifford]. By contrast, [haferkamp2020homeopathy] showed that circuits with \poly(t)\poly(t) non-Clifford gates are sufficient to generate approximate tt-designs. Thus, for any constant tt, states of constant stabilizer extent can form approximate tt-designs. This suggests that states of low stabilizer complexity can give much stronger information-theoretic approximations to the Haar measure than ordinary stabilizer states, because stabilizer states fail to form a tt-design for any t>3t>3 [ZKGG16].

In this work, we investigate whether these properties that differentiate stabilizer states from low-stabilizer-complexity states can be pushed further, to prove hardness of learning low-stabilizer-complexity states. One natural approach towards proving that low-stabilizer-complexity states are hard to learn would be to show that they are pseudorandom. Ji, Liu, and Song [Ji10.1007/978-3-319-96878-0_5] define an ensemble of nn-qubit states to be (computationally) pseudorandom if every \poly(n)\poly(n)-time quantum adversary has at most a negligible advantage in distinguishing copies of a state drawn randomly from the ensemble from copies of a Haar-random nn-qubit state. Note that pseudorandom states are not efficiently learnable, as any algorithm for learning some set of quantum states gives an algorithm to distinguish those states from the Haar measure.

Our main result is an efficient algorithm for distinguishing states of non-negligible stabilizer fidelity from Haar-random states, showing that such states cannot be pseudorandom. This type of distinguishing is sometimes known as weak learning in learning theory.

Theorem 1.1 (Informal version of Theorem 4.1).

Let |ψ\ket{\psi} be an unknown nn-qubit pure state, and let k452n/12k\leq\frac{4}{5}2^{n/12}. There is an efficient algorithm that distinguishes whether |ψ\ket{\psi} is Haar-random or a state with stabilizer fidelity at least 1k\frac{1}{k}, promised that one of these is the case. In particular, the algorithm uses O(k12log(1/δ))O(k^{12}\log(1/\delta)) copies of |ψ\ket{\psi} and O(nk12log(1/δ))O(nk^{12}\log(1/\delta)) time to succeed with probability at least 1δ1-\delta.

Theorem 1.1 also generalizes to distinguishing states with low stabilizer extent from Haar-random. To the best of our knowledge, prior to our work, it was even unknown whether states of stabilizer extent at most a constant could be efficiently distinguished from Haar-random. We also emphasize that the contrast between our positive learning result and the hardness result of [hinsche2022learning] stems in part from the differing access models: we assume access to copies of the quantum state, whereas [hinsche2022learning] considers algorithms that only have outcomes of standard basis measurements of the state.

As a simple corollary, we prove a first-of-its-kind lower bound on the number of TT-gates required to prepare computationally pseudorandom quantum states.

Corollary 1.2 (Corollary 4.3).

Any family of Clifford+TT circuits that produces an ensemble of nn-qubit computationally pseudorandom quantum states must use at least ω(logn)\omega(\log n) TT-gates.

In some sense, Corollary 1.2 contrasts sharply with the result of [haferkamp2020homeopathy], where circuits containing just a few non-Clifford gates are sufficient to produce strong information-theoretic approximations to the Haar measure (i.e. tt-designs). Nevertheless, we emphasize that our result and [haferkamp2020homeopathy] are formally incomparable, because computationally pseudorandom states need not form approximate tt-designs for constant tt, nor vice-versa.

1.1 Main Ideas

Let x=(p,q)𝔽22nx=(p,q)\in\mathbb{F}_{2}^{2n}, where pp and qq are the first and last nn bits of xx, respectively. Define WxipqXpZqW_{x}\coloneqq i^{p\cdot q}X^{p}Z^{q} (a Pauli operator without phase), and let |Φ+2n/2x𝔽2n|x,x\ket{\Phi^{+}}\coloneqq 2^{-n/2}\sum_{x\in\mathbb{F}_{2}^{n}}\ket{x,x} be a maximimally entangled state. Then, the set {|Wx(WxI)|Φ+x𝔽22n}\{\ket{W_{x}}\coloneqq(W_{x}\otimes I)\ket{\Phi^{+}}\mid x\in\mathbb{F}_{2}^{2n}\} is the Bell basis, an orthonormal basis of 2n2n\mathbb{C}^{2^{n}}\otimes\mathbb{C}^{2^{n}}.

Our algorithm uses Bell difference sampling [montanaro2017learning, gross2021schur], which works as follows (see Section 2.3 for more detail): Given four copies of an nn-qubit pure state |ψ\ket{\psi}, perform a Bell-basis measurement on |ψ2\ket{\psi}^{\otimes 2} to get measurement outcome x𝔽22nx\in\mathbb{F}_{2}^{2n}, repeat this on the remaining two copies to get measurement outcome y𝔽22ny\in\mathbb{F}_{2}^{2n}, and return z=x+yz=x+y.

We refer to pψ(x)2n|ψ|Wx|ψ|2p_{\psi}(x)\coloneqq 2^{-n}\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2} as the characteristic distribution of |ψ\ket{\psi}. To see that pψp_{\psi} is a distribution, recall that since the Pauli operators form an orthonormal basis over Hermitian matrices, we can always decompose |ψψ|=12nx𝔽2nψ|Wx|ψWx\ket{\psi}\!\!\bra{\psi}=\frac{1}{2^{n}}\sum_{x\in\mathbb{F}_{2}^{n}}\braket{\psi}{W_{x}}{\psi}\cdot W_{x}. By assumption, |ψ|ψ|2=1\lvert\braket{\psi}{\psi}\rvert^{2}=1, so by Parseval’s identity,

12nx𝔽2n|ψ|Wx|ψ|2=1.\frac{1}{2^{n}}\sum_{x\in\mathbb{F}_{2}^{n}}\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}=1.

Gross, Nezami, and Walter [gross2021schur] showed that Bell difference sampling an arbitrary pure state |ψ\ket{\psi} corresponds to sampling a random operator WxW_{x} according to the following distribution:

qψ(x)=y𝔽22npψ(y)pψ(x+y).q_{\psi}(x)=\!\!\!\sum_{y\in\mathbb{F}_{2}^{2n}}p_{\psi}(y)p_{\psi}(x+y).

We call qψq_{\psi} the Weyl distribution of |ψ\ket{\psi}. Note that the Weyl distribution of |ψ\ket{\psi} is the scaled convolution of the characteristic distribution with itself (i.e., qψ=4n(pψpψ)q_{\psi}=4^{n}(p_{\psi}\ast p_{\psi}), where ‘\ast’ is the convolution operator).

Define the {±1}\{\pm 1\}-outcome measurement Mx{I±Wx2}M_{x}\coloneqq\left\{\frac{I\pm W_{x}}{2}\right\} (projections onto the ±1\pm 1-eigenspaces of WxW_{x}). Our algorithm begins by repeating the following process mm times: sample a random Weyl operator WxW_{x} (via Bell difference sampling) and perform the measurement Mx2M_{x}^{\otimes 2} on |ψ2\ket{\psi^{\otimes 2}}. Then, average all of the measurement outcomes. If the average is at least 1/\poly(k)1/\poly(k), we decide that |ψ\ket{\psi} has stabilizer fidelity at least 1k\frac{1}{k}. Otherwise, we decide that |ψ\ket{\psi} is Haar-random.

What statistic are we computing in our algorithm? Denote the measurement outcome on the iith iteration as Xi{±1}X_{i}\in\{\pm 1\}. Observe that for all XiX_{i},

𝐄[Xi]=x𝔽22nqψ(x)|ψ|Wx|ψ|2=2nx𝔽22nqψ(x)pψ(x)=2n𝐄xqψ[pψ(x)],\mathop{\bf E\/}_{{\begin{subarray}{c}\end{subarray}}}[X_{i}]=\!\!\!\sum_{x\in\mathbb{F}_{2}^{2n}}q_{\psi}(x)\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}=2^{n}\!\!\!\sum_{x\in\mathbb{F}_{2}^{2n}}q_{\psi}(x)p_{\psi}(x)=2^{n}\mathop{\bf E\/}_{{\begin{subarray}{c}x\sim q_{\psi}\end{subarray}}}[p_{\psi}(x)],

where the expectation 𝐄[Xi]\mathop{\bf E\/}[X_{i}] is taken over sampling xqψx\sim q_{\psi} and the randomness from performing the measurement Mx2M_{x}^{\otimes 2}. Hence, for our algorithm to work, 𝐄xqψ[pψ(x)]\mathop{\bf E\/}_{{\begin{subarray}{c}x\sim q_{\psi}\end{subarray}}}[p_{\psi}(x)] must be “different enough” when |ψ\ket{\psi} either is Haar-random or has low stabilizer complexity. Proving that this is the case is the main technical ingredient of our work:

Lemma 1.3 (Informal version of Lemma 3.1).

Let |ψ\ket{\psi} be an nn-qubit pure state. Suppose the stabilizer fidelity of |ψ\ket{\psi} is at least 1k\frac{1}{k}. Then,

2n𝐄xqψ[pψ(x)]1k6.2^{n}\mathop{\bf E\/}_{x\sim q_{\psi}}\left[p_{\psi}(x)\right]\geq\dfrac{1}{k^{6}}.

In contrast, suppose |ψ\ket{\psi} is a Haar-random quantum state. Then, with overwhelming probability over the Haar measure,

2n𝐄xqψ[pψ(x)]2n/2.2^{n}\mathop{\bf E\/}_{x\sim q_{\psi}}\left[p_{\psi}(x)\right]\leq 2^{-n/2}.

Our proof uses Fourier analysis of Boolean functions, and some parts of our proof are reminiscent of the celebrated Blum-Luby-Rubinfield linearity test [BLRtest]. Intuitively, qψq_{\psi} is significantly closer to linear when |ψ\ket{\psi} has non-negligible stabilizer fidelity, as opposed to when |ψ\ket{\psi} is a Haar-random state.

With the above lemma, all that remains is “merely” a sample complexity analysis, namely: what mm is sufficient to distinguish whether the average is close to 0 or Ω(1/k6)\Omega(1/k^{6})? In the simplest case, we show that O(k12log(1/δ))O(k^{12}\log(1/\delta)) samples are sufficient by Hoeffding’s inequality. However, this complexity can be improved if given access to a unitary that prepares |ψ\ket{\psi} (and its inverse). In this model, we are able to achieve a quartic speedup in both sample and time complexity, which we explain in Appendix A.

2 Preliminaries

First, we establish some notation used throughout this work. We denote [n]{1,,n}[n]\coloneqq\{1,\ldots,n\}. For vnv\in\mathbb{C}^{n}, vp(i[n]|vi|p)1/p\lVert v\rVert_{p}\coloneqq(\sum_{i\in[n]}\lvert v_{i}\rvert^{p})^{1/p} is the p\ell_{p}-norm. Logarithms are assumed to be in base 22. For a probability distribution PP on a set SS, we denote drawing a sample sSs\in S according to PP by sPs\sim P. We denote drawing a sample sSs\in S uniformly at random by sSs\sim S.

2.1 Stabilizer States and Stabilizer Complexity Measures

We define the 11-qubit Pauli group to be the collection of matrices {I,X,Y,Z}\{I,X,Y,Z\}, where

I=(1001),X=(0110),Y=(0ii0),Z=(1001).I=\begin{pmatrix}1&0\\ 0&1\end{pmatrix},\quad X=\begin{pmatrix}0&1\\ 1&0\end{pmatrix},\quad Y=\begin{pmatrix}0&-i\\ i&0\end{pmatrix},\quad Z=\begin{pmatrix}1&0\\ 0&-1\end{pmatrix}.

The nn-qubit Pauli group 𝒫n\mathcal{P}_{n} is the set {±1,±i}×{I,X,Y,Z}n\{\pm 1,\pm i\}\times\{I,X,Y,Z\}^{\otimes n}. The Clifford group 𝒞n\mathcal{C}_{n} is the group of unitary transformations generated by HH, SS, and CNOT\mathrm{CNOT} gates, where HH is the Hadamard gate, S|00|+i|11|S\coloneqq\ket{0}\!\!\bra{0}+i\ket{1}\!\!\bra{1} is the phase gate, and CNOT\mathrm{CNOT} is the controlled-not gate. We refer to unitary transformations in the Clifford group as Clifford circuits. Clifford circuits with the addition of the TT-gate are universal, where the TT-gate is defined by T|00|+eiπ/4|11|T\coloneqq\ket{0}\!\!\bra{0}+e^{i\pi/4}\ket{1}\!\!\bra{1}.

A unitary transformation UU stabilizes a state |ψ\ket{\psi} when U|ψ=|ψU\ket{\psi}=\ket{\psi}. It is folklore that if an nn-qubit state can be reached from the |0n\ket{0^{n}} state by applying a Clifford circuit, then the state is stabilized by a group of 2n2^{n} commuting members of the subset {±1}×{I,X,Y,Z}n(𝒫nIn)\{\pm 1\}\times\{I,X,Y,Z\}^{\otimes n}\subset\left(\mathcal{P}_{n}\setminus-I^{\otimes n}\right), called its stabilizer group. Such states are called stabilizer states, and we denote the set of stabilizer states by 𝒮n\mathcal{S}_{n}. For |ψ𝒮n\ket{\psi}\in\mathcal{S}_{n}, we denote its stabilizer group as Stab(|ψ)\textrm{Stab}(\ket{\psi}). For more background on stabilizer states, see, e.g., [nielsen2002quantum].

We now define some complexity measures that characterize more general states in terms of stabilizer state decompositions.

Definition 2.1 (stabilizer extent [Bravyi2019simulationofquantum]).

Suppose |ψ\ket{\psi} is a pure nn-qubit state. The stabilizer extent of |ψ\ket{\psi}, denoted (|ψ)(\ket{\psi}), is the minimum of c12\lVert c\rVert_{1}^{2} over all decompositions |ψ=ici|ϕi\ket{\psi}=\sum_{i}c_{i}\ket{\phi_{i}}, where |ϕi𝒮n\ket{\phi_{i}}\in\mathcal{S}_{n} and cc is some vector in |𝒮n|\mathbb{C}^{|\mathcal{S}_{n}|}.

Definition 2.2 (stabilizer fidelity [Bravyi2019simulationofquantum]).

Suppose |ψ\ket{\psi} is a pure nn-qubit state. The stabilizer fidelity of |ψ\ket{\psi}, denoted F𝒮F_{\mathcal{S}}, is

F𝒮(|ψ)max|ϕ𝒮n|ϕ|ψ|2.F_{\mathcal{S}}(\ket{\psi})\coloneqq\max_{\ket{\phi}\in\mathcal{S}_{n}}\left|\braket{\phi}{\psi}\right|^{2}.

Below we give a useful relation between the complexity measures defined above.

Claim 2.3.

Let |ψ\ket{\psi} be an nn-qubit pure state. Then,

ξ(|ψ)1F𝒮(|ψ).\xi(\ket{\psi})\geq\frac{1}{F_{\mathcal{S}}(\ket{\psi})}.
Proof.

Let |ψ=|ϕ𝒮ncϕ|ϕ\ket{\psi}=\sum_{\ket{\phi}\in\mathcal{S}_{n}}c_{\phi}\ket{\phi} be such that (ϕ|cϕ|)2=ξ(|ψ)\left(\sum_{\phi}\lvert c_{\phi}\rvert\right)^{2}=\xi(\ket{\psi}). Suppose towards a contradiction that F𝒮(|ψ)<1ξ(|ψ)F_{\mathcal{S}}(\ket{\psi})<\frac{1}{\xi(\ket{\psi})} and therefore |ϕ|ψ|<1ξ(|ψ)\lvert\braket{\phi}{\psi}\rvert<\frac{1}{\xi(\ket{\psi})} for all |ϕ𝒮n\ket{\phi}\in\mathcal{S}_{n}. Then,

1=|ψ|ψ|=||ϕS𝒮ncϕϕ|ψ|\displaystyle 1=\lvert\braket{\psi}{\psi}\rvert=\left|\sum_{\ket{\phi_{S}}\in\mathcal{S}_{n}}c_{\phi}^{\ast}\braket{\phi}{\psi}\right| |ϕS𝒮n|cϕ||ϕ|ψ|\displaystyle\leq\sum_{\ket{\phi_{S}}\in\mathcal{S}_{n}}\left|c_{\phi}\right|\left|\braket{\phi}{\psi}\right|
maxi|ϕi|ψ||ϕS𝒮n|cϕ|\displaystyle\leq\max_{i}\lvert\braket{\phi_{i}}{\psi}\rvert\sum_{\ket{\phi_{S}}\in\mathcal{S}_{n}}\left|c_{\phi}\right|
F𝒮(|ψ)ξ(|ψ)\displaystyle\leq F_{\mathcal{S}}(\ket{\psi})\sqrt{\xi(\ket{\psi})}
<F𝒮(|ψ)1\displaystyle<\sqrt{F_{\mathcal{S}}(\ket{\psi})}\leq 1

The last line follows from the fact that F𝒮(|ψ)1F_{\mathcal{S}}(\ket{\psi})\leq 1 due to Cauchy-Schwarz and the definition of stabilizer fidelity. We then have 1<11<1 as a clear contradiction. ∎

The claim above also follows as a special case of [Bravyi2019simulationofquantum, Theorem 4], though its proof is more complicated.

To prove lower bounds on the number of TT-gates necessary to prepare pseudorandom quantum states, we need to upper bound the stabilizer extent of a quantum state prepared by a Clifford+TT circuit comprised of tt TT-gates.

Claim 2.4.

For |ψ=α|v+β|w\ket{\psi}=\alpha\ket{v}+\beta\ket{w},

ξ(|ψ)(|α|ξ(|v)+|β|ξ(|w))2.\xi(\ket{\psi})\leq\left(\lvert\alpha\rvert\sqrt{\xi(\ket{v})}+\lvert\beta\rvert\sqrt{\xi(\ket{w})}\right)^{2}.
Proof.

Let |v=ici|ϕi\ket{v}=\sum_{i}c_{i}\ket{\phi_{i}} and |w=jdj|φj\ket{w}=\sum_{j}d_{j}\ket{\varphi_{j}} be the minimal decompositions in terms of stabilizer extent (i.e., (i|ci|)2=ξ(|v)\left(\sum_{i}\lvert c_{i}\rvert\right)^{2}=\xi(\ket{v})). Since |ψ=α|v+β|w=αic|ϕi+βjd|φj\ket{\psi}=\alpha\ket{v}+\beta\ket{w}=\alpha\sum_{i}c\ket{\phi_{i}}+\beta\sum_{j}d\ket{\varphi_{j}}, we have a stabilizer decomposition of |ψ\ket{\psi}. The stabilizer extent of this decomposition is at most

(i|αci+βdi|)2(|α|i|ci|+|β|i|di|)2(|α|ξ(v)+|β|ξ(w))2.\left(\sum_{i}\lvert\alpha c_{i}+\beta d_{i}\rvert\right)^{2}\leq\left(\lvert\alpha\rvert\sum_{i}\lvert c_{i}\rvert+\lvert\beta\rvert\sum_{i}\lvert d_{i}\rvert\right)^{2}\leq\left(\lvert\alpha\rvert\sqrt{\xi(v)}+\lvert\beta\rvert\sqrt{\xi(w)}\right)^{2}.\qed
Lemma 2.5.

Let CC be any Clifford+TT circuit comprised of tt TT-gates and |ψ=C|0n\ket{\psi}=C\ket{0^{n}}. Then,

ξ(|ψ)(1+12)t.\xi(\ket{\psi})\leq\left(1+\frac{1}{\sqrt{2}}\right)^{t}.
Proof.

We note that a Clifford+TT circuit can be broken into layers of Clifford circuits, followed by a single TT-gate, followed by more layers of Clifford circuits, and so on. Since Clifford circuits preserve stabilizer extent, we only need to show that the TT-gate increases the stabilizer extent of any state by at most a constant multiplicative factor. Since the SWAP gate is a Clifford operation, we assume without loss of generality that each TT-gate is applied to the first qubit.

We proceed by induction on the layers of the circuit. In the first layer, when no TT-gates have been applied, the bound is trivially true because the stabilizer extent of any stabilizer state is 11. Now, assume that, after applying some portion of the circuit CC^{\prime} to |0n\ket{0^{n}} with t1t-1 TT-gates, we get the state |φ\ket{\varphi}. Observe that the TT-gate can be expressed as cos(π/8)eiπ/8I+sin(π/8)ei13π/8Z\cos(\pi/8)e^{i\pi/8}I+\sin(\pi/8)e^{i13\pi/8}Z. Thus, (TIn1)|φ=cos(π/8)eiπ/8|φ+sin(π/8)ei13π/8(ZIn1)|φ(T\otimes I^{\otimes n-1})\ket{\varphi}=\cos(\pi/8)e^{i\pi/8}\ket{\varphi}+\sin(\pi/8)e^{i13\pi/8}\left(Z\otimes I^{\otimes n-1}\right)\ket{\varphi}. Since ZIn1Z\otimes I^{\otimes n-1} is a Clifford operation, (ZIn1)|φ\left(Z\otimes I^{\otimes n-1}\right)\ket{\varphi} has the same extent as |φ\ket{\varphi}. Therefore, applying 2.4,

ξ(|ψ)(cos(π/8)+sin(π/8))2ξ(|φ)(1+12)t.\xi(\ket{\psi})\leq\left(\cos(\pi/8)+\sin(\pi/8)\right)^{2}\xi(\ket{\varphi})\leq\left(1+\frac{1}{\sqrt{2}}\right)^{t}.\qed

2.2 Boolean Fourier Analysis

We review the basics of Fourier analysis over the Boolean hypercube.

Definition 2.6.

Let S[n]S\subseteq[n] be an index of bits. Then the parity function on SS is defined to be

χS(x)iS(1)xi.\chi_{S}(x)\coloneqq\prod_{i\in S}(-1)^{x_{i}}.

Alternatively, we can define χS(x)=(1)xs\chi_{S}(x)=(-1)^{x\cdot s} where si=1s_{i}=1 if and only if iSi\in S. This form will prove to be more natural for our purposes.

The parity functions are orthonormal under the inner product f,g=12nx𝔽2nf(x)g(x)\langle f,g\rangle=\frac{1}{2^{n}}\sum_{x\in\mathbb{F}_{2}^{n}}f(x)g(x). Since there are 2n2^{n} distinct parity functions, this gives a complete basis. Given a function f:𝔽2nf:\mathbb{F}_{2}^{n}\to\mathbb{R}, we can then write

f(x)=S[n]f^(S)χS(x).f(x)=\sum_{S\subseteq[n]}\widehat{f}(S)\chi_{S}(x).

The f^(S)\widehat{f}(S) are real numbers known as the Fourier coefficients (collectively known as the Fourier spectrum), and are equivalently given by the formula

f^(S)=f(x),χS(x).\widehat{f}(S)=\langle f(x),\chi_{S}(x)\rangle.

As a basis change, we can then rethink inner products to be over the Fourier coefficients as well.

Fact 2.7 (Plancherel’s theorem).
f,g=12nS[n]f^(S)g^(S).\langle f,g\rangle=\frac{1}{2^{n}}\sum_{S\subseteq[n]}\widehat{f}(S)\widehat{g}(S).

Finally, the convolution is an operation that appears frequently in Fourier analysis over the reals. We can similarly define it over Boolean inputs.

Definition 2.8.

For functions f,g:𝔽2nf,g:\mathbb{F}_{2}^{n}\to\mathbb{R}, we define the convolution fgf\ast g as

(fg)(x)12nt𝔽2nf(t)g(x+t).(f\ast g)(x)\coloneqq\frac{1}{2^{n}}\sum_{t\in\mathbb{F}_{2}^{n}}f(t)g(x+t).

Much like Fourier transforms over the reals, convolution maps to multiplication in the Fourier domain.

Fact 2.9 (Convolution theorem).

fg^(S)=f^(S)g^(S)\widehat{f\ast g}(S)=\widehat{f}(S)\widehat{g}(S)

For proofs of all of these facts, as well as for a comprehensive reference on analysis of Boolean functions, we recommend [o2014analysis].

2.3 Weyl Operators and Bell Difference Sampling

For x=(p,q)𝔽22nx=(p,q)\in\mathbb{F}_{2}^{2n}, define the Weyl operator as

Wxipq(Xp1Zq1)(XpnZqn)=ipqXpZq.W_{x}\coloneqq i^{p\cdot q}(X^{p_{1}}Z^{q_{1}})\otimes\ldots\otimes(X^{p_{n}}Z^{q_{n}})=i^{p\cdot q}X^{p}Z^{q}.

Each Weyl operator is a Pauli operator, and every Pauli operator is a Weyl operator (up to a phase). Note also that WxWy=Wx+yW_{x}W_{y}=W_{x+y}, up to a phase. We use Weyl operators (rather than Pauli operators) when it is convenient to identify members of the Pauli group with length-2n2n bit strings.

A critical subroutine in our work is Bell difference sampling, which was introduced in [montanaro2017learning, gross2021schur]. Let |Φ+2n/2x𝔽2n|x,x\ket{\Phi^{+}}\coloneqq 2^{-n/2}\sum_{x\in\mathbb{F}_{2}^{n}}\ket{x,x}. Then, the set of quantum states {|Wx(WxI)|Φ+x𝔽22n}\{\ket{W_{x}}\coloneqq(W_{x}\otimes I)\ket{\Phi^{+}}\mid x\in\mathbb{F}_{2}^{2n}\} forms an orthonormal basis of 2n2n\mathbb{C}^{2^{n}}\otimes\mathbb{C}^{2^{n}}, which we call the Bell basis. Bell sampling a state |ψ\ket{\psi} refers to measuring |ψ2\ket{\psi}^{\otimes 2} in the Bell basis, and the measurement outcome is a length-2n2n bit string xx that corresponds to a Weyl operator WxW_{x}. Bell difference sampling a state |ψ\ket{\psi} refers to Bell sampling twice to get measurement outcomes x,y𝔽22nx,y\in\mathbb{F}_{2}^{2n} and returning z=x+yz=x+y, which corresponds to a Weyl operator WzW_{z} and uses four copies of |ψ\ket{\psi}. Montanaro showed Bell difference sampling can be performed in O(n)O(n) time [montanaro2017learning].

Bell difference sampling returns a random Weyl operator, but according to what distribution? Gross, Nezami, and Walter [gross2021schur] showed that the underlying distribution depends on the so-called characteristic distribution of |ψ\ket{\psi}.

Definition 2.10 (characteristic distribution).

The characteristic distribution of |ψ\ket{\psi} is defined as

pψ(x)2n|ψ|Wx|ψ|2.p_{{\psi}}(x)\coloneqq 2^{-n}\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}.
Lemma 2.11 ([gross2021schur, Theorem 3.2]).

Let |ψ\ket{\psi} be an arbitrary nn-qubit pure state. Bell difference sampling corresponds to drawing a sample from the following distribution:

qψ(x)4n(pψpψ)(x)=y𝔽22npψ(y)pψ(x+y).q_{\psi}(x)\coloneqq 4^{n}(p_{\psi}\ast p_{\psi})(x)=\sum_{y\in\mathbb{F}_{2}^{2n}}p_{\psi}(y)p_{{\psi}}(x+y).

Additionally, if |ψ𝒮n\ket{\psi}\in\mathcal{S}_{n} is a stabilizer state, then

qψ(x)=pψ(x)=2n|ψ|Wx|ψ|2.q_{\psi}(x)=p_{{\psi}}(x)=2^{-n}\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}.

We refer to qψq_{\psi} as the Weyl distribution. Using this terminology, the characteristic distribution and Weyl distribution are equal only when |ψ\ket{\psi} is a stabilizer state (i.e., when 4n(pψpψ)=pψ4^{n}(p_{\psi}\ast p_{\psi})=p_{\psi}).

3 Certificate of Low Stabilizer Complexity

To efficiently distinguish a state with low stabilizer complexity (meaning, a state with low stabilizer extent or non-negligible stabilizer fidelity) from a Haar-random one, we require a property or statistic of the state that distinguishes it from Haar-random. As such, we present the following technical lemma, which forms the backbone of our algorithm.

Lemma 3.1.

Let |ψ\ket{\psi} be an nn-qubit pure state. If the stabilizer fidelity of |ψ\ket{\psi} is at least 1k\frac{1}{k}, then

𝐄xqψ[|ψ|Wx|ψ|2]1k6.\mathop{\bf E\/}_{x\sim q_{\psi}}\left[\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}\right]\geq\dfrac{1}{k^{6}}.

In contrast, if |ψ\ket{\psi} is Haar-random and n33n\geq 33, then, with probability at least 1exp(2n/215)1-\exp\left(-2^{n/2-15}\right) over the Haar measure,

𝐄xqψ[|ψ|Wx|ψ|2]2n/2.\mathop{\bf E\/}_{x\sim q_{\psi}}\left[\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}\right]\leq 2^{-n/2}.

Our algorithm then amounts to estimating the quantity 𝐄xqψ[|ψ|Wx|ψ|2]\mathop{\bf E\/}_{x\sim q_{\psi}}\left[\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}\right] via a procedure involving Bell difference sampling.

To prove Lemma 3.1, as a first step, we relate 𝐄xqψ[|ψ|Wx|ψ|2]\mathop{\bf E\/}_{x\sim q_{\psi}}\left[\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}\right] to the Fourier coefficients of pψp_{\psi}. Note that this analysis closely resembles the BLR linearity test [BLRtest] (see also [o2014analysis, Section 1.6]).

Fact 3.2.

Let |ψ\ket{\psi} be an nn-qubit pure state. Then,

𝐄xqψ[|ψ|Wx|ψ|2]=32nx𝔽22npψ^(x)3.\mathop{\bf E\/}_{x\sim q_{\psi}}\left[\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}\right]=32^{n}\!\!\!\sum_{x\in\mathbb{F}_{2}^{2n}}\widehat{p_{\psi}}(x)^{3}.
Proof.
𝐄xqψ[|ψ|Wx|ψ|2]\displaystyle\mathop{\bf E\/}_{x\sim q_{\psi}}\left[\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}\right] =2n𝐄xqψ[pψ(x)]\displaystyle=2^{n}\mathop{\bf E\/}_{x\sim q_{\psi}}\left[p_{\psi}(x)\right]
=2nx𝔽22npψ(x)qψ(x)\displaystyle=2^{n}\sum_{x\in\mathbb{F}_{2}^{2n}}p_{\psi}(x)q_{\psi}(x)
=8nx𝔽22npψ(x)(pψpψ)(x)\displaystyle=8^{n}\sum_{x\in\mathbb{F}_{2}^{2n}}p_{\psi}(x)(p_{\psi}\ast p_{\psi})(x)
=32n𝐄x𝔽22n[pψ(x)(pψpψ)(x)]\displaystyle=32^{n}\mathop{\bf E\/}_{x\sim\mathbb{F}_{2}^{2n}}[p_{\psi}(x)(p_{\psi}\ast p_{\psi})(x)]
=32nx𝔽22npψ^(x)pψpψ^(x))\displaystyle=32^{n}\sum_{x\in\mathbb{F}_{2}^{2n}}\widehat{p_{\psi}}(x)\widehat{p_{\psi}\ast p_{\psi}}(x)) (2.7)\displaystyle(\mathrm{\lx@cref{creftype~refnum}{fact:plancherel}})
=32nx𝔽22npψ^(x)3.\displaystyle=32^{n}\sum_{x\in\mathbb{F}_{2}^{2n}}\widehat{p_{\psi}}(x)^{3}. (2.9)\displaystyle(\mathrm{\lx@cref{creftype~refnum}{fact:convolution_theorem}})\qed

For the remainder of this section, we use the following convention: when x=(v,w)𝔽22nx=(v,w)\in\mathbb{F}_{2}^{2n}, vv and ww denote the first and last nn bits of xx, respectively, and, we will sometimes write pψ(v,w)p_{\psi}(v,w) and qψ(v,w)q_{\psi}(v,w), rather than pψ(x)p_{\psi}(x) and qψ(x)q_{\psi}(x).

3.1 The Fourier Spectrum of the Characteristic Distribution

By 3.2, it is clear that understanding the Fourier spectrum of pψp_{\psi} is one avenue to proving Lemma 3.1.

Proposition 3.3.

The Fourier coefficients of pψ(v,w)p_{\psi}(v,w) are pψ^(v,w)=12npψ(w,v)\widehat{p_{\psi}}(v,w)=\frac{1}{2^{n}}p_{\psi}(w,v).

Proof.

Define f:𝔽22n[1,1]f:\mathbb{F}_{2}^{2n}\xrightarrow[]{}[-1,1] as f(v,w)ψ|ivwXvZw|ψf(v,w)\coloneqq\braket{\psi}{i^{v\cdot w}X^{v}Z^{w}}{\psi}, where v,w𝔽2nv,w\in\mathbb{F}_{2}^{n}. We begin by computing the Fourier expansion of ff.

f(v,w)\displaystyle f(v,w) =ψ|ivwXvZw|ψ\displaystyle=\bra{\psi}i^{v\cdot w}X^{v}Z^{w}\ket{\psi}
=(x𝔽2ncxx|)ivwXvZw(x𝔽2ncx|x)\displaystyle=\left(\sum_{x\in\mathbb{F}_{2}^{n}}c^{*}_{x}\bra{x}\right)i^{v\cdot w}X^{v}Z^{w}\left(\sum_{x\in\mathbb{F}_{2}^{n}}c_{x}\ket{x}\right)
=ivw(x𝔽2ncxx+v|)(x𝔽2n(1)xwcx|x)\displaystyle=i^{v\cdot w}\left(\sum_{x\in\mathbb{F}_{2}^{n}}c^{*}_{x}\bra{x+v}\right)\left(\sum_{x\in\mathbb{F}_{2}^{n}}(-1)^{x\cdot w}c_{x}\ket{x}\right)
=ivwx𝔽2ncx+vcx(1)wx.\displaystyle=i^{v\cdot w}\sum_{x\in\mathbb{F}_{2}^{n}}c_{x+v}^{*}c_{x}(-1)^{w\cdot x}. (1)

In the second line we are simply writing |ψ\ket{\psi} in the computational basis.

Observe now that pψ(v,w)=12n|f(v,w)|2p_{\psi}(v,w)=\frac{1}{2^{n}}\lvert f(v,w)\rvert^{2}, which we can also treat as a function on Boolean variables. Hence,

pψ(v,w)\displaystyle p_{\psi}(v,w) =12n(ivwx𝔽2ncx+vcx(1)wx)((i)vwx𝔽2ncx+vcx(1)wx)\displaystyle=\frac{1}{2^{n}}\left(i^{v\cdot w}\sum_{x\in\mathbb{F}_{2}^{n}}c_{x+v}^{*}c_{x}(-1)^{w\cdot x}\right)\left((-i)^{v\cdot w}\sum_{x\in\mathbb{F}_{2}^{n}}c_{x+v}c^{*}_{x}(-1)^{w\cdot x}\right)
=12nx,y𝔽2ncv+ycycv+x+ycx+y(1)wx,\displaystyle=\frac{1}{2^{n}}\sum_{x,y\in\mathbb{F}_{2}^{n}}c^{*}_{v+y}c_{y}c_{v+x+y}c^{*}_{x+y}(-1)^{w\cdot x},

where the first equality follows by substituting in Section 3.1.

We can now compute the Fourier spectrum of pψp_{\psi} by taking the inner product between pψp_{\psi} and an arbitrary Fourier character (this is the simplest approach to computing Fourier coefficients).

pψ^(v,w)\displaystyle\widehat{p_{\psi}}(v,w) =14ns,t𝔽2npψ(s,t)(1)sv+tw\displaystyle=\frac{1}{4^{n}}\sum_{s,t\in\mathbb{F}_{2}^{n}}p_{\psi}(s,t)(-1)^{s\cdot v+t\cdot w}
=18ns,t,x,y𝔽2ncs+ycycs+x+ycx+y(1)tx+vs+wt\displaystyle=\frac{1}{8^{n}}\sum_{s,t,x,y\in\mathbb{F}_{2}^{n}}c^{*}_{s+y}c_{y}c_{s+x+y}c^{*}_{x+y}(-1)^{t\cdot x+v\cdot s+w\cdot t}
=18ns,x,y𝔽2ncs+ycycs+x+ycx+y(1)vst𝔽2n(1)t(x+w)\displaystyle=\frac{1}{8^{n}}\sum_{s,x,y\in\mathbb{F}_{2}^{n}}c^{*}_{s+y}c_{y}c_{s+x+y}c^{*}_{x+y}(-1)^{v\cdot s}\sum_{t\in\mathbb{F}_{2}^{n}}(-1)^{t\cdot(x+w)}
=14ns,y𝔽2ncs+ycycs+w+ycw+y(1)vs\displaystyle=\frac{1}{4^{n}}\sum_{s,y\in\mathbb{F}_{2}^{n}}c^{*}_{s+y}c_{y}c_{s+w+y}c^{*}_{w+y}(-1)^{v\cdot s}
=12npψ(w,v).\displaystyle=\frac{1}{2^{n}}p_{\psi}(w,v).\qed

3.2 Low-Stabilizer-Complexity States

We prove the first part of Lemma 3.1; namely, that

𝐄xqψ[|ψ|Wx|ψ|2]1k6\mathop{\bf E\/}_{x\sim q_{\psi}}\left[\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}\right]\geq\dfrac{1}{k^{6}}

when |ψ\ket{\psi} has low stabilizer complexity.

Claim 3.4.

For an nn-qubit pure state |ψ=x𝔽2ncx|x\ket{\psi}=\sum_{x\in\mathbb{F}_{2}^{n}}c_{x}\ket{x},

32nx𝔽22npψ^(x)3|c0|12.32^{n}\sum_{x\in\mathbb{F}_{2}^{2n}}\widehat{p_{\psi}}(x)^{3}\geq\lvert c_{0}\rvert^{12}.
Proof.
32nv,w𝔽22npψ^(v,w)3\displaystyle 32^{n}\!\!\sum_{v,w\in\mathbb{F}_{2}^{2n}}\widehat{p_{\psi}}(v,w)^{3} =4nv,w𝔽22npψ(w,v)3\displaystyle=4^{n}\sum_{v,w\in\mathbb{F}_{2}^{2n}}\!\!p_{\psi}(w,v)^{3} (Proposition 3.3.)\displaystyle(\mathrm{\lx@cref{creftype~refnum}{prop:fourier-coefficients}}.)
4nv𝔽2npψ(0,v)3\displaystyle\geq 4^{n}\sum_{v\in\mathbb{F}_{2}^{n}}p_{\psi}(0,v)^{3} (x,y,pψ(x,y)0.)\displaystyle(\forall x,y,p_{\psi}(x,y)\geq 0.)
=12nv𝔽2n|ψ|Zv|ψ|6\displaystyle=\frac{1}{2^{n}}\sum_{v\in\mathbb{F}_{2}^{n}}\lvert\braket{\psi}{Z^{v}}{\psi}\rvert^{6}
126n(v𝔽2nψ|Zv|ψ)6\displaystyle\geq\frac{1}{2^{6n}}\left(\sum_{v\in\mathbb{F}_{2}^{n}}\braket{\psi}{Z^{v}}{\psi}\right)^{6} (i=1m|ai|61m5(i=1m|ai|)6.)\displaystyle\left(\sum_{i=1}^{m}\lvert a_{i}\rvert^{6}\geq\frac{1}{m^{5}}\left(\sum_{i=1}^{m}\lvert a_{i}\rvert\right)^{6}.\right)
|c0|12.\displaystyle\geq\lvert c_{0}\rvert^{12}. (v𝔽2nZv=2n|0n0n|.)\displaystyle\left(\sum_{v\in\mathbb{F}_{2}^{n}}Z^{v}=2^{n}\ket{0^{n}}\!\!\bra{0^{n}}.\right)\qed
Proof of first part of Lemma 3.1.

Let |ψ\ket{\psi} be an nn-qubit pure state, and suppose that the stabilizer fidelity of |ψ\ket{\psi} is at least 1k\frac{1}{k}. Then there exists a Clifford circuit C𝒞nC\in\mathcal{C}_{n} such that C|ψ=x𝔽2ncx|xC\ket{\psi}=\sum_{x\in\mathbb{F}_{2}^{n}}c_{x}\ket{x} where |c0|21k\lvert c_{0}\rvert^{2}\geq\frac{1}{k}. Call |ϕC|ψ\ket{\phi}\coloneqq C\ket{\psi}. By 3.4,

32nv,w𝔽2npϕ^(v,w)3|c0|121k6.32^{n}\sum_{v,w\in\mathbb{F}_{2}^{n}}\widehat{p_{\phi}}(v,w)^{3}\geq\lvert c_{0}\rvert^{12}\geq\frac{1}{k^{6}}.

A Clifford circuit CC is a permutation of the Pauli group under conjugation (i.e., C𝒫nC=𝒫nC^{\dagger}\mathcal{P}_{n}C=\mathcal{P}_{n} for any C𝒞nC\in\mathcal{C}_{n}). Hence, for all C𝒞nC\in\mathcal{C}_{n} and g:𝒫ng:\mathcal{P}_{n}\rightarrow\mathbb{R},

x𝔽22ng(Wx)=x𝔽22ng(CWxC).\sum_{x\in\mathbb{F}_{2}^{2n}}g(W_{x})=\sum_{x\in\mathbb{F}_{2}^{2n}}g(C^{\dagger}W_{x}C).

Therefore, we conclude that

32nv,w𝔽2npψ^(v,w)31k632^{n}\sum_{v,w\in\mathbb{F}_{2}^{n}}\widehat{p_{\psi}}(v,w)^{3}\geq\frac{1}{k^{6}}

as well. Combining this bound with 3.2 completes the proof. ∎

3.3 Haar-Random States

We complete the proof of Lemma 3.1 by showing that 𝐄xqψ[|ψ|Wx|ψ|2]\mathop{\bf E\/}_{x\sim q_{\psi}}\left[\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}\right] is small when |ψ\ket{\psi} is a Haar-random state. We begin by showing that, for a Haar-random state, all of the Weyl measurements (except Wx=IW_{x}=I) are exponentially close to 0 with overwhelming probability.

Lemma 3.5 (Lévy’s Lemma, see e.g. [Gerken13measureconcentration]).

Let 𝕊N\mathbb{S}^{N} denote the set of all NN-dimensional pure quantum states, and let f:𝕊Nf:\mathbb{S}^{N}\to\mathbb{R} be LL-Lipschitz, meaning that |f(|ψ)f(|φ)|L|ψ|φ2\lvert f(\ket{\psi})-f(\ket{\varphi})\rvert\leq L\cdot\lVert\ket{\psi}-\ket{\varphi}\rVert_{2}. Then:

𝐏𝐫|ψμHaar[|f(|ψ)𝐄[f]|ε]2exp(Nε29π3L2).\mathop{\bf Pr\/}_{\ket{\psi}\sim\mu_{\rm{Haar}}}\left[\lvert f(\ket{\psi})-\mathop{\bf E\/}[f]\rvert\geq\varepsilon\right]\leq 2\exp\left(-\frac{N\varepsilon^{2}}{9\pi^{3}L^{2}}\right).
Lemma 3.6.

For any nn-qubit Weyl operator WxW_{x}, the function fx:𝕊2nf_{x}:\mathbb{S}^{2^{n}}\to\mathbb{R} defined by fx(|ψ)=ψ|Wx|ψf_{x}(\ket{\psi})=\bra{\psi}W_{x}\ket{\psi} is 22-Lipschitz.

Proof.

Write Wx=Π+ΠW_{x}=\Pi_{+}-\Pi_{-} where Π+\Pi_{+} and Π\Pi_{-} are the projectors onto the positive and negative eigenspaces of WxW_{x}, respectively. Then,

|fx(|ψ)fx(|φ)|\displaystyle\lvert f_{x}(\ket{\psi})-f_{x}(\ket{\varphi})\rvert =|ψ|Wx|ψφ|Wx|φ|\displaystyle=\lvert\bra{\psi}W_{x}\ket{\psi}-\bra{\varphi}W_{x}\ket{\varphi}\rvert
=|ψ|Π+|ψφ|Π+|φψ|Π|ψ+φ|Π|φ|\displaystyle=\lvert\bra{\psi}\Pi_{+}\ket{\psi}-\bra{\varphi}\Pi_{+}\ket{\varphi}-\bra{\psi}\Pi_{-}\ket{\psi}+\bra{\varphi}\Pi_{-}\ket{\varphi}\rvert
|ψ|Π+|ψφ|Π+|φ|+|ψ|Π|ψ+φ|Π|φ|\displaystyle\leq\lvert\bra{\psi}\Pi_{+}\ket{\psi}-\bra{\varphi}\Pi_{+}\ket{\varphi}\rvert+\lvert\bra{\psi}\Pi_{-}\ket{\psi}+\bra{\varphi}\Pi_{-}\ket{\varphi}\rvert
=|Π+|ψ2Π+|φ|2+|Π|ψ2Π|φ2|\displaystyle=\lvert\ \lVert\Pi_{+}\ket{\psi}\rVert_{2}-\lVert\Pi_{+}\ket{\varphi}\rVert\rvert_{2}+\lvert\lVert\Pi_{-}\ket{\psi}\rVert_{2}-\lVert\Pi_{-}\ket{\varphi}\rVert_{2}\rvert
Π+(|ψ|φ)2+Π(|ψ|φ)2\displaystyle\leq\lVert\Pi_{+}(\ket{\psi}-\ket{\varphi})\rVert_{2}+\lVert\Pi_{-}(\ket{\psi}-\ket{\varphi})\rVert_{2}
2|ψ|φ2,\displaystyle\leq 2\lVert\ket{\psi}-\ket{\varphi}\rVert_{2},

where the third and fifth lines apply the triangle inequality, and the fourth and sixth lines use the fact that Π+\Pi_{+} and Π\Pi_{-} are projectors. ∎

Corollary 3.7.

Let WxW_{x} be any nn-qubit Weyl operator in which x0x\neq 0 (i.e. WxIW_{x}\neq I). Then:

𝐏𝐫|ψμHaar[|ψ|Wx|ψ|ε]2exp(2nε236π3).\mathop{\bf Pr\/}_{\ket{\psi}\sim\mu_{\rm{Haar}}}\left[\lvert\braket{\psi}{W_{x}}{\psi}\rvert\geq\varepsilon\right]\leq 2\exp\left(-\frac{2^{n}\varepsilon^{2}}{36\pi^{3}}\right).
Proof.

Define fx(|ψ)=ψ|Wx|ψf_{x}(\ket{\psi})=\bra{\psi}W_{x}\ket{\psi} as in Lemma 3.6. By Lemma 3.6, we know that fxf_{x} is 22-Lipschitz. Additionally, observe that 𝐄[f]=0\mathop{\bf E\/}[f]=0 over the Haar measure because exactly half of the eigenvalues of WxW_{x} are 11 and the other half are 1-1. Then the corollary follows from Lemma 3.5. ∎

Corollary 3.8.
𝐏𝐫|ψμHaar[x0:|ψ|Wx|ψ|ε]22n+1exp(2nε236π3).\mathop{\bf Pr\/}_{\ket{\psi}\sim\mu_{\rm{Haar}}}\left[\exists x\neq 0:\lvert\braket{\psi}{W_{x}}{\psi}\rvert\geq\varepsilon\right]\leq 2^{2n+1}\exp\left(-\frac{2^{n}\varepsilon^{2}}{36\pi^{3}}\right).
Proof.

This follows from Corollary 3.7 and a union bound over all 22n2^{2n} possible Weyl operators. ∎

Note that if ε1poly(n)\varepsilon\geq\frac{1}{\mathrm{poly}(n)}, then the probability bound in Corollary 3.8 is doubly-exponentially small.

We have shown that, with high probability, all Weyl measurements (except Wx=IW_{x}=I) are close to 0. We use this to complete the proof of Lemma 3.1.

Proof of second part of Lemma 3.1.

Suppose |ψ\ket{\psi} is a Haar-random state. By Corollary 3.8, for all WxIW_{x}\neq I, |ψ|Wx|ψ|2=2np(x)ε2\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}=2^{n}p(x)\leq\varepsilon^{2} with probability 122n+1exp(2nε236π3)1-2^{2n+1}\exp\left(-\frac{2^{n}\varepsilon^{2}}{36\pi^{3}}\right). Therefore by 3.2 and Proposition 3.3,

𝐄xqψ[|ψ|Wx|ψ|2]\displaystyle\mathop{\bf E\/}_{x\sim q_{\psi}}\left[\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}\right] =32nx,y𝔽2np^(x,y)3\displaystyle=32^{n}\sum_{x,y\in\mathbb{F}_{2}^{n}}\widehat{p}(x,y)^{3}
=4nw,v𝔽2np(v,w)3\displaystyle=4^{n}\sum_{w,v\in\mathbb{F}_{2}^{n}}p(v,w)^{3}
=4n(18n+w,v𝔽2nw,v0p(v,w)3)\displaystyle=4^{n}\left(\frac{1}{8^{n}}+\sum_{\begin{subarray}{c}w,v\in\mathbb{F}_{2}^{n}\\ w,v\neq 0\end{subarray}}p(v,w)^{3}\right)
1+(4n1)ε62n,\displaystyle\leq\frac{1+(4^{n}-1)\varepsilon^{6}}{2^{n}},

with probability at least 122n+1exp(2nε236π3)1-2^{2n+1}\exp\left(-\frac{2^{n}\varepsilon^{2}}{36\pi^{3}}\right). By setting ϵ2=12n/6(2n2n/24n1)1/3\epsilon^{2}=\frac{1}{2^{n/6}}\left(\frac{2^{n}-2^{n/2}}{4^{n}-1}\right)^{1/3}, we get

𝐄xqψ[|ψ|Wx|ψ|2]12n/2\mathop{\bf E\/}_{x\sim q_{\psi}}\left[\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}\right]\leq\frac{1}{2^{n/2}}

with probability at least 122n+1exp(25n/636π3(2n2n/24n1)1/3)1-2^{2n+1}\exp\left(-\frac{2^{5n/6}}{36\pi^{3}}\left(\frac{2^{n}-2^{n/2}}{4^{n}-1}\right)^{1/3}\right), which is at least 1exp(2n/215)1-\exp\left(-2^{n/2-15}\right) for n33n\geq 33. ∎

4 Algorithm and Sample Complexity Analysis

We are now ready to state and analyze our algorithm that distinguishes between Haar-random states and states with low stabilizer complexity. Our algorithm uses the fact that we can efficiently sample from qψq_{\psi} (via Bell difference sampling) and efficiently estimate |ψ|Wx|ψ|2\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2} for any given x𝔽22nx\in\mathbb{F}_{2}^{2n}, using quantum measurements. By combining these subroutines, we construct an unbiased estimator for 𝐄xq[|ψ|Wx|ψ|2]\mathop{\bf E\/}_{x\sim q}\left[\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}\right]. Motivated by Lemma 3.1, if our estimator exceeds a certain threshold we determine that the input state has low stabilizer complexity; otherwise, we determine that the state is Haar-random. For the remainder of this section, η𝐄xq[|ψ|Wx|ψ|2]\eta\coloneqq\mathop{\bf E\/}_{x\sim q}\left[\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}\right].

Input: Black-box access to copies of |ψ\ket{\psi}
Promise : |ψ\ket{\psi} is Haar-random or has stabilizer fidelity at least 1k\frac{1}{k}
Output: 11 if |ψ\ket{\psi} has stabilizer fidelity at least 1k\frac{1}{k} and 0 if |ψ\ket{\psi} is Haar-random
1 Let m=60k12ln(1/δ)m=60k^{12}\ln(1/\delta).
2repeat mm times
3       Perform Bell difference sampling to obtain WxqψW_{x}\sim q_{\psi}.
4      Perform the measurement Wx2W_{x}^{\otimes 2} on |ψ2\ket{\psi}^{\otimes 2}. Let Xi{±1}X_{i}\in\{\pm 1\} denote the measurement outcome.
5
Set η^=1miXi\widehat{\eta}=\frac{1}{m}\sum_{i}X_{i}. Return 11 if η^23k6\widehat{\eta}\geq\frac{2}{3k^{6}}, and 0 otherwise.
Algorithm 1 Distinguishing Low-Stabilizer-Complexity States from Haar-Random
Theorem 4.1.

Let |ψ\ket{\psi} be an unknown nn-qubit pure state for some n33n\geq 33, and let k452n/12k\leq\frac{4}{5}2^{n/12}. Algorithm 1 distinguishes whether |ψ\ket{\psi} is Haar-random or a state with stabilizer fidelity at least 1k\frac{1}{k}, promised that one of these is the case. The algorithm uses O(k12log(1/δ))O\left(k^{12}\log(1/\delta)\right) copies of |ψ\ket{\psi} and O(nk12log(1/δ))O(nk^{12}\log(1/\delta)) time, and distinguishes the two cases with success probability at least 1δ1-\delta.

Proof.

Following the notation in Algorithm 1, XiX_{i} is the outcome of the measurement on the iith iteration of the algorithm loop. Observe that for any XiX_{i},

𝐄xqψ,meas. by Wx2[Xi]=𝐄xqψψ2|Wx2|ψ2=𝐄xqψ|ψ|Wx|ψ|2=η.\mathop{\bf E\/}_{{\begin{subarray}{c}x\sim q_{\psi},\\ \text{meas. by $W_{x}^{\otimes 2}$}\end{subarray}}}[X_{i}]=\mathop{\bf E\/}_{x\sim q_{\psi}}\bra{\psi^{\otimes 2}}W_{x}^{\otimes 2}\ket{\psi^{\otimes 2}}=\mathop{\bf E\/}_{x\sim q_{\psi}}\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}=\eta.

Therefore, η^=1miXi\widehat{\eta}=\frac{1}{m}\sum_{i}X_{i} is an unbiased estimator of η\eta (i.e., 𝐄[η^]=η\mathop{\bf E\/}[\widehat{\eta}]=\eta).

Suppose |ψ\ket{\psi} has stabilizer fidelity at least 1k\frac{1}{k}. Then, our algorithm fails when η^<23k6\widehat{\eta}<\frac{2}{3k^{6}}. Hence,

𝐏𝐫[Algorithm 1 fails]=𝐏𝐫[η^<23k6]=𝐏𝐫[η^η<23k6η]𝐏𝐫[η^η13k6],\mathop{\bf Pr\/}[\text{\lx@cref{creftype~refnum}{alg:distinguisher} fails}]=\mathop{\bf Pr\/}\left[\widehat{\eta}<\frac{2}{3k^{6}}\right]=\mathop{\bf Pr\/}\left[\widehat{\eta}-\eta<\frac{2}{3k^{6}}-\eta\right]\leq\mathop{\bf Pr\/}\left[\widehat{\eta}-\eta\leq-\frac{1}{3k^{6}}\right],

where the last inequality follows from Lemma 3.1. By Hoeffding’s inequality,

𝐏𝐫[η^η13k6]exp(m18k12).\mathop{\bf Pr\/}\left[\widehat{\eta}-\eta\leq-\frac{1}{3k^{6}}\right]\leq\exp\left(-\frac{m}{18k^{12}}\right).

Therefore, m18k12ln(15)=49k12m\geq 18k^{12}\ln(15)=49k^{12} samples suffice for the failure probability to be at most 115\frac{1}{15}.

Now suppose |ψ\ket{\psi} is Haar-random. Then, our algorithm fails when η^23k6\widehat{\eta}\geq\frac{2}{3k^{6}}. By Lemma 3.1, η2n/2\eta\leq 2^{-n/2} with probability at least 1exp(2n/215)>=1e221-\exp\left(-2^{n/2-15}\right)>=1-e^{-2\sqrt{2}} for n33n\geq 33. Assuming that η2n/2\eta\leq 2^{-n/2},

𝐏𝐫[Algorithm 1 fails]=𝐏𝐫[η^23k6]=𝐏𝐫[η^η23k6η]𝐏𝐫[η^η12k62n/2].\mathop{\bf Pr\/}[\text{\lx@cref{creftype~refnum}{alg:distinguisher} fails}]=\mathop{\bf Pr\/}\left[\widehat{\eta}\geq\frac{2}{3k^{6}}\right]=\mathop{\bf Pr\/}\left[\widehat{\eta}-\eta\geq\frac{2}{3k^{6}}-\eta\right]\leq\mathop{\bf Pr\/}\left[\widehat{\eta}-\eta\geq\frac{1}{2k^{6}}-2^{-n/2}\right].

Once again, by Hoeffding’s inequality,

𝐏𝐫[η^η12k62n/2]\displaystyle\mathop{\bf Pr\/}\left[\widehat{\eta}-\eta\geq\frac{1}{2k^{6}}-2^{-n/2}\right] exp(m2(23k62n/2)2)\displaystyle\leq\exp\left(-\frac{m}{2}\left(\frac{2}{3k^{6}}-2^{-n/2}\right)^{2}\right)
exp(m2(23k613k6)2)\displaystyle\leq\exp\left(-\frac{m}{2}\left(\frac{2}{3k^{6}}-\frac{1}{3k^{6}}\right)^{2}\right)
exp(m18k12).\displaystyle\leq\exp\left(-\frac{m}{18k^{12}}\right).

Therefore, m18k12ln(115e22)88k12m\geq-18k^{12}\ln\left(\frac{1}{15}-e^{-2\sqrt{2}}\right)\geq 88k^{12} samples suffice for the failure probability to be at most 115e22\frac{1}{15}-e^{-2\sqrt{2}}. By the union bound, the failure probability is at most 115\frac{1}{15}, where the randomness is over both the Haar measure and the quantum measurements.

We have shown that in either case we output the correct answer with probability at least 1415\frac{14}{15}. Using the Chernoff-Hoeffding theorem, the success probability can be boosted from 1415\frac{14}{15} to at least 1δ1-\delta by doing 23ln(1/δ)\frac{2}{3}\ln(1/\delta) repetitions of Algorithm 1 and taking the majority answer. Since each iteration of the algorithm loop uses 66 copies of |ψ\ket{\psi}, Algorithm 1 consumes O(k12log(1/δ))O\left(k^{12}\log(1/\delta)\right) copies in total. Finally, Bell difference sampling and performing the measurement Wx2W_{x}^{\otimes 2} takes O(n)O(n) time, so the total runtime is O(nk12log(1/δ))O\left(nk^{12}\log(1/\delta)\right). ∎

All of these results also apply to states with stabilizer extent at most kk, since by 2.3, such states have stabilizer fidelity at least 1k\frac{1}{k}.

Corollary 4.2.

Let |ψ\ket{\psi} be an unknown nn-qubit pure state for n33n\geq 33, and let k452n/12k\leq\frac{4}{5}2^{n/12}. Algorithm 1 distinguishes whether |ψ\ket{\psi} is Haar-random or a state with stabilizer extent at most kk, promised that one of these is the case. The algorithm uses O(k12log(1/δ))O\left(k^{12}\log(1/\delta)\right) copies of |ψ\ket{\psi} and distinguishes the two cases with success probability at least 1δ1-\delta.

The above result immediately implies that output states of Clifford+TT circuits with few TT-gates cannot be computationally pseudorandom.

Corollary 4.3.

Any family of Clifford+TT circuits that produces an ensemble of nn-qubit computationally pseudorandom quantum states must use at least ω(logn)\omega(\log n) TT-gates.

Proof.

Consider any ensemble of states wherein each state in the ensemble is the output of some Clifford+TT circuit with at most KlognK\log n TT-gates. By Lemma 2.5, the stabilizer extent of any such state |ψ\ket{\psi} is at most nαKn^{\alpha K} for α0.7716\alpha\leq 0.7716. By Corollary 4.2, on input copies of |ψ\ket{\psi}, Algorithm 1 takes O(n12αK+1)\poly(n)O(n^{12\alpha K+1})\leq\poly(n) time and outputs 11 with probability at least 2/32/3. On the other hand, if |ψ\ket{\psi} is a Haar-random state then the same algorithm outputs 11 with probability at most 13\frac{1}{3}. As such, the algorithm’s distinguishing advantage between the ensemble and the Haar measure is non-negligible. This is to say that the ensemble cannot be pseudorandom under the definition of [Ji10.1007/978-3-319-96878-0_5]. ∎

5 Open Problems

An immediate direction for future work is to improve the sample complexity of our algorithm, or to prove sample complexity lower bounds. One can also endeavour to improve other features of our algorithm: Is it possible to remove the need for entangled measurements?222 The optimal algorithms for learning and testing stabilizer states use entangled measurements. So, a first step would be to understand how many separable measurements are required to separate stabilizer states from Haar-random. Or, is it possible to show that entangled measurements are in any sense necessary? Are there quantum measurements that allow us to sample from pψp_{\psi} directly (rather than qψq_{\psi})?

Beyond that, can Bell difference sampling be used for learning and/or property testing stabilizer-extent-kk states? For stabilizer states (k=1k=1), a 66-query property testing algorithm is given by [gross2021schur] and a Θ(n)\Theta(n)-query learning algorithm is given by [montanaro2017learning]. Both algorithms rely on Bell difference sampling and are asymptotically optimal. We ask if there are generalizations of these algorithms for states with higher stabilizer complexity, similar to the question that was raised in [arunachalam2022phase].

Question 5.1.

Is there a \poly(k)\poly(k)-query algorithm for property testing stabilizer-extent-kk states? Likewise, is there a \poly(n,k)\poly(n,k)-time algorithm for learning stabilizer-extent-kk states?

Our results on stabilizer extent are due to the fact that extent and fidelity are inversely related. Is it possible to relate stabilizer rank (a closely-related complexity measure, denoted by χ\chi) and stabilizer fidelity? For instance, proving that, for all states |ψ\ket{\psi},

F𝒮(|ψ)1χ(|ψ)c,for any constant c,F_{\mathcal{S}}(\ket{\psi})^{-1}\leq\chi(\ket{\psi})^{c},\quad\text{for any constant $c$,}

would imply that our algorithm can distinguish low-stabilizer-rank states from Haar-random. However, proving such a relation for even c<αnlognc<\frac{\alpha n}{\log n} for α0.2284\alpha\leq 0.2284 would imply super-linear lower bounds on the stabilizer rank of Clifford magic states, a long-standing open problem.

One can also ask if the lower bound on the number of TT-gates necessary for computationally pseudorandom states can be improved.

Question 5.2.

How many TT-gates are necessary for a family of Clifford+TT circuits to produce an ensemble of nn-qubit computationally pseudorandom states?

We remark that any improvements to our logn\log n lower bound would require techniques beyond the ones used in our paper. Indeed, in Appendix B we show that one can hope for at most a quadratic improvement in the relationship between η\eta and stabilizer fidelity. Such an improvement would only yield constant-factor improvements on the number of TT-gates necessary to prepare computationally pseudorandom states.

On the other hand, we are not aware of any attempts to optimize the TT-gate count for plausible constructions of nn-qubit pseudorandom states. The best upper bound we know of is the essentially trivial bound of O(n)O(n), based on constructions of with O(n)O(n) general gates. This is because pseudorandom states can be constructed from pseudorandom functions (PRFs) with constant overhead [brakerski10.1007/978-3-030-36030-6_10], and PRFs are believed to be constructible in linear time [ishai_10.1145/1374376.1374438, fan_10.1145/3519935.3520010].333Technically, we are not sure whether the PRFs constructed in [ishai_10.1145/1374376.1374438, fan_10.1145/3519935.3520010] are secure against quantum adversaries, which is necessary for instantiating [brakerski10.1007/978-3-030-36030-6_10]’s construction, but we consider it reasonable to conjecture that linear-time quantum-secure PRFs exist.

Acknowledgments

We thank Scott Aaronson for helpful comments. SG, VI, DL are supported by Scott Aaronson’s Vannevar Bush Fellowship from the US Department of Defense, the Berkeley NSF-QLCI CIQC Center, a Simons Investigator Award, and the Simons “It from Qubit” collaboration. WK is supported by an NDSEG Fellowship.

References

  • [ABDY22] Srinivasan Arunachalam, Sergey Bravyi, Arkopal Dutt, and Theodore J. Yoder. Optimal algorithms for learning quantum phase states, 2022. doi:10.48550/arxiv.2208.07851.
  • [AG04] Scott Aaronson and Daniel Gottesman. Improved simulation of stabilizer circuits. Physical Review A, 70(5), nov 2004. doi:10.1103/physreva.70.052328.
  • [BBC+19] Sergey Bravyi, Dan Browne, Padraic Calpin, Earl Campbell, David Gosset, and Mark Howard. Simulation of quantum circuits by low-rank stabilizer decompositions. Quantum, 3:181, September 2019. doi:10.22331/q-2019-09-02-181.
  • [BG16] Sergey Bravyi and David Gosset. Improved Classical Simulation of Quantum Circuits Dominated by Clifford Gates. Phys. Rev. Lett., 116:250501, 2016. doi:10.1103/PhysRevLett.116.250501.
  • [BHMT02] Gilles Brassard, Peter Høyer, Michele Mosca, and Alain Tapp. Quantum Amplitude Amplification and Estimation, 2002. doi:10.1090/conm/305/05215.
  • [BLR93] Manuel Blum, Michael Luby, and Ronitt Rubinfeld. Self-Testing/Correcting with Applications to Numerical Problems. J. Comput. Syst. Sci., 47(3):549–595, 1993. doi:10.1016/0022-0000(93)90044-W.
  • [BS19] Zvika Brakerski and Omri Shmueli. (Pseudo) Random Quantum States with Binary Phase. In Theory of Cryptography, 2019. doi:10.1007/978-3-030-36030-6_10.
  • [CS96] A. R. Calderbank and Peter W. Shor. Good quantum error-correcting codes exist. Phys. Rev. A, 54:1098–1105, 1996. doi:10.1103/PhysRevA.54.1098.
  • [FLY22] Zhiyuan Fan, Jiatu Li, and Tianqi Yang. The Exact Complexity of Pseudorandom Functions and the Black-Box Natural Proof Barrier for Bootstrapping Results in Computational Complexity. In Proceedings of the 54th Annual ACM SIGACT Symposium on Theory of Computing, STOC 2022, page 962–975, 2022. doi:10.1145/3519935.3520010.
  • [Ger13] Manuel Gerken. Measure concentration: Levy’s Lemma, 2013.
  • [GNW21] David Gross, Sepehr Nezami, and Michael Walter. Schur–Weyl duality for the Clifford group with applications: Property testing, a robust Hudson theorem, and de Finetti representations. Communications in Mathematical Physics, 385(3):1325–1393, 2021. doi:10.1007/s00220-021-04118-7.
  • [Got97] Daniel Gottesman. Stabilizer Codes and Quantum Error Correction, 1997. doi:10.48550/arxiv.quant-ph/9705052.
  • [Got98] Daniel Gottesman. The Heisenberg Representation of Quantum Computers. 1998. doi:10.48550/arXiv.quant-ph/9807006.
  • [HIN+22] Marcel Hinsche, Marios Ioannou, Alexander Nietner, Jonas Haferkamp, Yihui Quek, Dominik Hangleiter, Jean-Pierre Seifert, Jens Eisert, and Ryan Sweke. A single tt-gate makes distribution learning hard, 2022. doi:10.48550/arxiv.2207.03140.
  • [HKP20] Hsin-Yuan Huang, Richard Kueng, and John Preskill. Predicting many properties of a quantum system from very few measurements. Nature Physics, 16(10):1050–1057, 2020. doi:10.1038/s41567-020-0932-7.
  • [HMMH+20] Jonas Haferkamp, Felipe Montealegre-Mora, Markus Heinrich, Jens Eisert, David Gross, and Ingo Roth. Quantum homeopathy works: Efficient unitary designs with a system-size independent number of non-Clifford gates, 2020. doi:10.48550/arxiv.2002.09524.
  • [IKOS08] Yuval Ishai, Eyal Kushilevitz, Rafail Ostrovsky, and Amit Sahai. Cryptography with Constant Computational Overhead. In Proceedings of the Fortieth Annual ACM Symposium on Theory of Computing, STOC ’08, page 433–442, 2008. doi:10.1145/1374376.1374438.
  • [JLS18] Zhengfeng Ji, Yi-Kai Liu, and Fang Song. Pseudorandom Quantum States. In Advances in Cryptology – CRYPTO 2018. Springer International Publishing, 2018. doi:10.1007/978-3-319-96878-0_5.
  • [KG15] Richard Kueng and David Gross. Qubit stabilizer states are complex projective 3-designs, 2015. doi:10.48550/arXiv.1510.02767.
  • [KLR+08] E. Knill, D. Leibfried, R. Reichle, J. Britton, R. B. Blakestad, J. D. Jost, C. Langer, R. Ozeri, S. Seidelin, and D. J. Wineland. Randomized benchmarking of quantum gates. Physical Review A, 77(1), 2008. doi:10.1103/physreva.77.012307.
  • [LC22] Ching-Yi Lai and Hao-Chung Cheng. Learning Quantum Circuits of Some TT Gates. IEEE Transactions on Information Theory, 68(6):3951–3964, 2022. doi:10.1109/TIT.2022.3151760.
  • [Mon17] Ashley Montanaro. Learning stabilizer states by Bell sampling. arXiv preprint arXiv:1707.04012, 2017. doi:10.48550/arXiv.1707.04012.
  • [NC02] Michael A. Nielsen and Isaac Chuang. Quantum Computation and Quantum Information, 2002. doi:10.1017/CBO9780511976667.
  • [O’D14] Ryan O’Donnell. Analysis of Boolean Functions. Cambridge University Press, 2014. doi:10.1017/CBO9781139814782.
  • [PWB15] Hakop Pashayan, Joel J. Wallman, and Stephen D. Bartlett. Estimating Outcome Probabilities of Quantum Circuits Using Quasiprobabilities. Phys. Rev. Lett., 115:070501, 2015. doi:10.1103/PhysRevLett.115.070501.
  • [RB00] Robert Raussendorf and Hans J. Briegel. Quantum computing via measurements only, 2000. doi:10.48550/arxiv.quant-ph/0010033.
  • [RLCK19] Patrick Rall, Daniel Liang, Jeremy Cook, and William Kretschmer. Simulation of qubit quantum circuits via Pauli propagation. Phys. Rev. A, 99:062337, 2019. doi:10.1103/PhysRevA.99.062337.
  • [Sho95] Peter W. Shor. Scheme for reducing decoherence in quantum computer memory. Phys. Rev. A, 52:R2493–R2496, 1995. doi:10.1103/PhysRevA.52.R2493.
  • [Web16] Zak Webb. The Clifford Group Forms a Unitary 3-Design. Quantum Info. Comput., 16(15–16):1379–1400, 2016.
  • [ZKGG16] Huangjun Zhu, Richard Kueng, Markus Grassl, and David Gross. The Clifford group fails gracefully to be a unitary 4-design, 2016. arXiv:1609.08172.

Appendix A Algorithm Improvements via State Preparation Unitary

When given access to a state preparation unitary for |ψ\ket{\psi} (and its inverse), denoted by UU and UU^{\dagger}, we can improve the sample and time complexities of our algorithm to O(k3log(1/δ))O\left(k^{3}\log(1/\delta)\right) and O(nk3log(1/δ))O\left(nk^{3}\log(1/\delta)\right), respectively, at the cost of O(k3log(1/δ))O\left(k^{3}\log(1/\delta)\right) queries to UU and UU^{\dagger}.

Access to UU and UU^{\dagger} allows us to run quantum amplitude estimation (QAE) as a subroutine in our algorithm. Recall the well-known result of Brassard, Høyer, Mosca, and Tapp:

Theorem A.1 (Quantum Amplitude Estimation (Theorem 12 in [Brassard_2002])).

Let Π\Pi be a projector and |ψ\ket{\psi} be an nn-qubit pure state such that ψ|Π|ψ=η\braket{\psi}{\Pi}{\psi}=\eta. Given access to the unitary transformations RΠ=2ΠIR_{\Pi}=2\Pi-I and Rψ=2|ψψ|IR_{\psi}=2\ket{\psi}\!\!\bra{\psi}-I, there exists a quantum algorithm that outputs η^\widehat{\eta} such that

|η^η|2πη(1η)m+π2m2\lvert\widehat{\eta}-\eta\rvert\leq\frac{2\pi\sqrt{\eta(1-\eta)}}{m}+\frac{\pi^{2}}{m^{2}}

with probability at least 8π2\frac{8}{\pi^{2}}. The algorithm makes mm calls to RΠR_{\Pi} and RψR_{\psi}.

Corollary A.2.

Let Π\Pi, |ψ\ket{\psi}, RΠR_{\Pi}, and RψR_{\psi} be the same as in Theorem A.1. There exists a quantum algorithm that outputs η^\widehat{\eta} such that

|η^η|ε\lvert\widehat{\eta}-\eta\rvert\leq\varepsilon

with probability at least 8π2\frac{8}{\pi^{2}}. The algorithm makes no more than

πη(1η)+εε\pi\frac{\sqrt{\eta(1-\eta)+\varepsilon}}{\varepsilon}

calls to RΠR_{\Pi} and RψR_{\psi}.

Proof.

By Theorem A.1, this will require mm queries, where mm is a solution to the following quadratic equation:

2πη(1η)m+π2m2εmπη(1η)+εεπη(1η)+η(1η)+ε2ε.\frac{2\pi\sqrt{\eta(1-\eta)}}{m}+\frac{\pi^{2}}{m^{2}}\leq\varepsilon\Rightarrow m\geq\pi\frac{\sqrt{\eta(1-\eta)+\varepsilon}}{\varepsilon}\geq\pi\frac{\sqrt{\eta(1-\eta)}+\sqrt{\eta(1-\eta)+\varepsilon}}{2\varepsilon}.

With that, we are ready to explain the modifications to Algorithm 1 that achieves a quartic speedup in the dependency on kk.

Theorem A.3.

Let |ψ\ket{\psi} be an unknown nn-qubit pure state prepared by a unitary UU for n33n\geq 33, and let k452n/12k\leq\frac{4}{5}2^{n/12}. There exists a quantum algorithm that distinguishes whether |ψ\ket{\psi} is Haar-random or a state with stabilizer fidelity at least 1k\frac{1}{k}, promised that one of these is The case. The algorithm uses O(k3log(1/δ))O\left(k^{3}\log(1/\delta)\right) applications of either UU or UU^{\dagger} and time O(nk3log(1/δ))O\left(nk^{3}\log(1/\delta)\right), and distinguishes the two cases with success probability at least 1δ1-\delta.

Proof.

We first define the Bell difference sampling projector on xx as

Πxy𝔽22n|WyWy||Wx+yWx+y|.\Pi_{x}\coloneqq\sum_{y\in\mathbb{F}_{2}^{2n}}\ket{W_{y}}\!\!\bra{W_{y}}\otimes\ket{W_{x+y}}\!\!\bra{W_{x+y}}.

Note that this is simply a compact way of writing the Bell difference sampling procedure: the probability of sampling xx is qψ(x)=Πx|ψ4q_{\psi}(x)=\lVert\Pi_{x}\ket{\psi^{\otimes 4}}\rVert.444Indeed, this is the way Gross, Nezami, and Walter [gross2021schur] introduce Bell difference sampling. We can also perform the projective measurement Pψ,xWx|ψψ|Wx=WxU|00|UWxP_{\psi,x}\coloneqq W_{x}\ket{\psi}\!\!\bra{\psi}W_{x}=W_{x}U\ket{0}\!\!\bra{0}U^{\dagger}W_{x}, where this measurement is performed by applying WxW_{x}, UU^{\dagger}, and then measuring in the computational basis. We can entangle Πx\Pi_{x} and Pψ,xP_{\psi,x} to form the following projector:

M=x𝔽22nΠxPψ,x.M=\sum_{x\in\mathbb{F}_{2}^{2n}}\Pi_{x}\otimes P_{\psi,x}.

Building MM involves controlled applications of WxW_{x} according to the Bell difference sampling outcome. Observe that

ψ5|M|ψ5=x𝔽22nψ4|Πx|ψ4ψ|Pψ,x|ψ=𝐄xqψ[|ψ|Wx|ψ|2].\braket{\psi^{\otimes 5}}{M}{\psi^{\otimes 5}}=\sum_{x\in\mathbb{F}_{2}^{2n}}\braket{\psi^{\otimes 4}}{\Pi_{x}}{\psi^{\otimes 4}}\cdot\braket{\psi}{P_{\psi,x}}{\psi}=\mathop{\bf E\/}_{x\sim q_{\psi}}\left[\lvert\braket{\psi}{W_{x}}{\psi}\rvert^{2}\right].

Hence, we can run QAE with the input projector MM and the input state |ψ5\ket{\psi^{\otimes 5}}, and the output will be an estimate of η\eta whose accuracy depends on mm, the number of total calls to RΠR_{\Pi} and RψR_{\psi}.

Proving the sample complexity bound will mimic Theorem 4.1. Suppose |ψ\ket{\psi} is a state with stabilizer fidelity at least 1k\frac{1}{k}. Define ηmin1k6\eta_{min}\coloneqq\frac{1}{k^{6}}, and note that for any state with stabilizer fidelity at least 1k\frac{1}{k}, ηηmin\eta\geq\eta_{min} due to Lemma 3.1. For our algorithm to succeed, recall from the proof of Theorem 4.1 that we need

|η^η||23k6η|.\lvert\widehat{\eta}-\eta\rvert\leq\lvert\frac{2}{3k^{6}}-\eta\rvert.

Therefore, we can run QAE with a fixed value of mm (to be specified later) for an estimate of η\eta whose accuracy is within ±(η23k6)\pm\left(\eta-\frac{2}{3k^{6}}\right). By Corollary A.2,

mπη(1η)+η23k6η23k6\displaystyle m\geq\pi\frac{\sqrt{\eta(1-\eta)+\eta-\frac{2}{3k^{6}}}}{\eta-\frac{2}{3k^{6}}} (2)

queries suffice. The chosen value of mm must work for all η[1k6,1]\eta\in[\frac{1}{k^{6}},1]. Note that Eq. 2 is monotonically decreasing for η[23k6,1)\eta\in[\frac{2}{3k^{6}},1), and is therefore maximized by ηmin\eta_{min} for η[1k6,1]\eta\in[\frac{1}{k^{6}},1]. To succeed with probability at least 8π2\frac{8}{\pi^{2}},

m4πk3π12k69=πηmin(1ηmin)+ηmin23k6ηmin23k6m\geq 4\pi k^{3}\geq\pi\sqrt{12k^{6}-9}=\pi\frac{\sqrt{\eta_{min}(1-\eta_{min})+\eta_{min}-\frac{2}{3k^{6}}}}{\eta_{min}-\frac{2}{3k^{6}}}

calls to RΠR_{\Pi} and RψR_{\psi} suffices.

Now suppose |ψ\ket{\psi} is a Haar-random state. Again, by Lemma 3.1, we know that η2n/2\eta\leq 2^{-n/2} with probability 1e221-e^{-2\sqrt{2}} for n33n\geq 33. Assuming η2n/2\eta\leq 2^{-n/2} and using Corollary A.2, as long as we have

m6πk3π2n/2(12n/2)+23k62n/223k62n/2πη(1η)+23k6η23k6ηm\geq\sqrt{6}\pi k^{3}\geq\pi\frac{\sqrt{2^{-n/2}(1-2^{-n/2})+\frac{2}{3k^{6}}-2^{-n/2}}}{\frac{2}{3k^{6}}-2^{-n/2}}\geq\pi\frac{\sqrt{\eta(1-\eta)+\frac{2}{3k^{6}}-\eta}}{\frac{2}{3k^{6}}-\eta}

queries to RΠR_{\Pi} and RψR_{\psi}, we obtain the correct answer with probability at least 8π2\frac{8}{\pi^{2}}. In the inequalities above we use similar reasoning to the stabilizer fidelity 1k\frac{1}{k} case, combined with the fact that 2n/213k62^{-n/2}\leq\frac{1}{3k^{6}}.

Finally, since RΠR_{\Pi} and RψR_{\psi} use a constant number of calls to UU and UU^{\dagger}, the total number of calls is O(k3)O(k^{3}). Chernoff-Hoeffding can be used to bring the success probability from 3/43/4 to 1δ1-\delta using 6ln(1/δ)6\ln(1/\delta) repetitions. The runtime includes an extra factor of O(n)O(n), due to the linear cost of both preparing WxW_{x} and the Bell difference sampling projector, giving a O(nk3log(1/δ))O\left(nk^{3}\log(1/\delta)\right) time complexity. ∎

Appendix B On the Tightness of Our Analysis

We argue that the first part of Lemma 3.1 is polynomially-close to optimal. We begin by computing the stabilizer extent and stabilizer fidelity of Clifford magic states. The two technical ingredients involved in the computation are due to Bravyi et al. [Bravyi2019simulationofquantum].

Fact B.1 ([Bravyi2019simulationofquantum, Proposition 2]).

Let |ψ\ket{\psi} be a Clifford magic state. Then, ξ(|ψ)=F𝒮(|ψ)1\xi(\ket{\psi})=F_{\mathcal{S}}(\ket{\psi})^{-1}.

Fact B.2 ([Bravyi2019simulationofquantum, Proposition 1]).

Let {|ψ1,|ψ2,,|ψL}\{\ket{\psi_{1}},\ket{\psi_{2}},\ldots,\ket{\psi_{L}}\} be any set of states such that each state |ψj\ket{\psi_{j}} describes a system of at most 33 qubits. Then,

ξ(|ψ1|ψ2|ψL)=iξ(|ψi).\xi(\ket{\psi_{1}}\otimes\ket{\psi_{2}}\otimes\ldots\otimes\ket{\psi_{L}})=\prod_{i}\xi(\ket{\psi_{i}}).

It is well known that the mm-fold tensor product of |T21/2(|0+eiπ/4|0)\ket{T}\coloneqq 2^{-1/2}(\ket{0}+e^{i\pi/4}\ket{0}) is a Clifford magic state. Using the facts above, we can compute the stabilizer extent and stabilizer fidelity of |Tm\ket{T^{\otimes m}}.

Fact B.3.
ξ(|Tm)=(cosπ/8)2mandF𝒮m(|Tm)=(cosπ/8)2m.\xi(\ket{T^{\otimes m}})=\left(\cos\pi/8\right)^{-2m}\quad\text{and}\quad F_{\mathcal{S}_{m}}(\ket{T^{\otimes m}})=\left(\cos\pi/8\right)^{2m}.
Proof.

By B.2, the stabilizer extent of |Tm\ket{T^{\otimes{m}}} is simply the stabilizer extent of |T\ket{T} raised to the power mm. By B.1, the stabilizer extent is the inverse of the stabilizer fidelity. Hence, the result follows simply by showing that the stabilizer fidelity of |T\ket{T} is cos(π/8)2\cos(\pi/8)^{2}, which can be verified by explicit calculation over the 66 different 11-qubit stabilizer states. ∎

Next, we compute η\eta for the state |Tm\ket{T^{\otimes m}}.

Claim B.4.

Let |ψ=|Tm\ket{\psi}=\ket{T^{\otimes m}} and define η𝐄xqψ[2npψ(x)]\eta\coloneqq\mathop{\bf E\/}_{x\sim q_{\psi}}[2^{n}p_{\psi}(x)]. Then, η=(5/8)m\eta=(5/8)^{m}.

Proof.

We begin by writing out |TT|\ket{T}\!\!\bra{T} as a sum of Pauli matrices. By definition,

|TT|=12(I+12X+12Y).\ket{T}\!\!\bra{T}=\dfrac{1}{2}\left(I+\dfrac{1}{\sqrt{2}}X+\dfrac{1}{\sqrt{2}}Y\right).

We wish to compute x𝔽22mp^ψ(x)3\sum_{x\in\mathbb{F}_{2}^{2m}}\widehat{p}_{\psi}(x)^{3}. We know that every such Pauli with nonzero p^ψ(x)\widehat{p}_{\psi}(x) is a tensor product combination of II, XX, and YY, so we enumerate over the number of indices where an XX or YY appear.

x𝔽22mp^ψ(x)3=126mk=0m(mk)123k2k=164mk=0m(mk)14k=(5256)m.\sum_{x\in\mathbb{F}_{2}^{2m}}\widehat{p}_{\psi}(x)^{3}=\dfrac{1}{2^{6m}}\sum_{k=0}^{m}\binom{m}{k}\dfrac{1}{2^{3k}}\cdot 2^{k}=\dfrac{1}{64^{m}}\sum_{k=0}^{m}\binom{m}{k}\dfrac{1}{4^{k}}=\left(\frac{5}{256}\right)^{m}.

Thus, by 3.2,

η=32mx𝔽22mp^ψ(x)3=(58)m.\eta=32^{m}\sum_{x\in\mathbb{F}_{2}^{2m}}\widehat{p}_{\psi}(x)^{3}=\left(\frac{5}{8}\right)^{m}.\qed

Combining B.4 with Lemma 3.1, we have

F𝒮(|ψ)η1/c=(58)m/cF_{\mathcal{S}}(\ket{\psi})\leq\eta^{1/c}=\left(\frac{5}{8}\right)^{m/c}

for c=6c=6 (Lemma 3.1). But, from B.3, we know that F𝒮(|Tm)=(cosπ/8)2mF_{\mathcal{S}}(\ket{T^{\otimes m}})=(\cos\pi/8)^{2m}. Combining the two statements gives

(cosπ/8)2m(5/8)m/c.(\cos\pi/8)^{2m}\leq(5/8)^{m/c}.

c2.97c\approx 2.97 is the minimum cc that does not violate this inequality. Hence, one cannot hope for much more than a quadratic improvement in our bound.