
Simulating Gaussian boson sampling quantum computers

Alexander S. Dellios, Margaret D. Reid and Peter D. Drummond
Centre for Quantum Science and Technology Theory, Swinburne University of Technology, Melbourne 3122, Australia
Abstract

A growing cohort of experimental linear photonic networks implementing Gaussian boson sampling (GBS) has now claimed quantum advantage. However, many open questions remain on how to effectively verify these experimental results, as scalable methods are needed that fully capture the rich array of quantum correlations generated by these photonic quantum computers. In this paper, we briefly review recent theoretical methods to simulate experimental GBS networks. We focus mostly on methods that use phase-space representations of quantum mechanics, as these methods are highly scalable and can be used to validate experimental outputs and claims of quantum advantage for a variety of input states, ranging from the ideal pure squeezed vacuum state to more realistic thermalized squeezed states. A brief overview of the theory of GBS, recent experiments and other types of methods is also presented. Although this is not an exhaustive review, we aim to provide a brief introduction to phase-space methods applied to linear photonic networks to encourage further theoretical investigations.

I Introduction

Since Feynman’s original proposal in the early 1980s [1], a growing amount of research has been conducted to develop large scale, programmable quantum computers that promise to solve classically intractable problems. Although a variety of systems have been proposed to achieve this [2, 3, 4, 5], practical issues such as noise and losses introduce errors that scale with system size, hampering physical implementations. Therefore, one approach of current experimental efforts has been the development of specialized devices aiming to unequivocally demonstrate quantum advantage, even when experimental imperfections are present.

To this end, recent developments use photonic networks implementing different computationally hard tasks [6, 7, 8, 9]. Such devices are made entirely from linear optics such as polarizing beamsplitters, mirrors and phase-shifters [4, 10], with optical parametric oscillators (OPO) as the quantum source [11, 12]. Unlike cryogenic devices based on superconducting quantum logic-gates [13, 14], these networks are operated at room temperature. Large size networks now claim quantum advantage [15, 16, 17, 18], especially for a type of quantum computer called a Gaussian boson sampler (GBS) that generates random bit outputs with a distribution that is exponentially hard to replicate at large network sizes.

This raises the question: how can one verify that large-scale experimental outputs are correct, when the device is designed to solve problems that no classical computer can solve in less than exponential time?

In this paper, we review theoretical methods that answer this for photonic network based quantum computers. We especially focus on positive-P phase-space representations that can simulate non-classical states [19]. Here, we mean simulate in the sense of a classical sampling algorithm that replicates probabilities and moments to an accuracy better than experimental sampling errors, for the same number of samples. Such methods account for loss and decoherence. They are classical algorithms for standard digital computers. In addition, they are highly scalable, with polynomial run-time for large networks.

Other types of simulation exist, where the classical algorithm replicates the detection events of the experiment [20, 21, 22]. These become intractably slow for larger networks. Methods like these offer a way to define quantum advantage, meaning a quantum device performing tasks that are classically infeasible. Hence, we review the definitions of simulation in the literature, including other approaches as well. There are also methods that simulate events quickly but approximately, up to a systematic error caused by the fact that the approximate method has a different probability distribution [23, 24, 25]. A comparison is made of the different approaches proposed or used for these problems, including classical “faking” methods that replicate stochastic detection events with some degree of inaccuracy.

Positive-P methods are exactly equivalent to quantum moments in the large sample limit, and are applicable to a variety of experimental networks. One example is the coherent Ising machine (CIM) [26, 27], which is used to solve NP-hard optimization problems by simulating an Ising model with a series of light pulses [28, 9, 29]. The largest experimental implementation of the CIM to date contains 100,000 modes [12]. This gives approximate solutions to hard max-cut problems three orders of magnitude faster than a digital computer. For such problems, typically no exact results are known. Therefore, these solutions can be hard to validate as well.

Gaussian boson sampling (GBS) [30, 31, 32] experiments have seen impressive advancements in recent years. These specialized linear network quantum computers are used as random number generators, where the detected output photon counts correspond to sampling from the #P-hard Hafnian or Torontonian distributions [7, 33, 8], which are either directly or indirectly related to the matrix permanent. While more specialized than quantum circuit devices, they are also somewhat more straightforward to implement and analyze theoretically.

Only a few years after the original theoretical proposals [7, 8, 33], multiple large scale experiments have implemented GBS with each claiming quantum advantage [15, 16, 17, 18]. This rapid increase in network size has far outpaced direct verification methods that aim to reproduce experiments using classical algorithms [20, 21, 22]. Although other validation methods exist [23, 24], these typically cannot reproduce the higher order correlations generated in linear photonic networks of this type.

In summary, we review progress on simulating binned correlations or moments of experimental GBS networks using the positive-P phase-space representation [30, 31, 32], as well as comparing this with other methods. We find that the experimental results disagree with the ideal, pure squeezed state model. However, agreement is greatly improved with the inclusion of decoherence in the inputs. When this decoherence is large enough, classical simulation is feasible [34, 35], and quantum advantage is lost. Despite this, recent experimental results demonstrate potential for quantum advantage in some of the reported datasets [32].

In order to present a complete picture, we also review the background theory of GBS, recent large scale experiments [15, 16, 17, 18] and other proposed verification algorithms [22, 23, 20, 21, 24, 25].

II Gaussian boson sampling: A brief overview

The original boson sampler introduced by Aaronson and Arkhipov [6] proposed a quantum computer that generates photon count patterns by randomly sampling an output state whose distribution corresponds to the matrix permanent. The computational complexity arises because computing the permanent is a #P-hard problem [36, 37].

However, practical applications of the original proposal have seen limited experimental implementations, since they require one to generate large numbers of indistinguishable single photon Fock states. Although small scale networks have been implemented [38, 39, 40], to reach a computationally interesting regime requires the detection of at least 50 photons [41, 42]. This is challenging to implement experimentally.

To solve this scalability issue, Hamilton et al [7] proposed replacing the Fock states with Gaussian states, which can be more readily generated at large sizes. These types of quantum computing networks are called Gaussian boson samplers. They still use a non-classical input state, but one that is far easier to generate experimentally than the original proposal of number states.

II.1 Squeezed states

In standard GBS, $N$ single-mode squeezed vacuum states are sent into an $M$-mode linear photonic network. If $N<M$, the remaining $M-N$ ports are vacuum states. If each input mode is independent, the entire input state is defined via the density matrix

$\hat{\rho}^{(\text{in})}=\prod_{j=1}^{M}\left|r_{j}\right\rangle\left\langle r_{j}\right|.$ (1)

Here, $\left|r_{j}\right\rangle=\hat{S}(r_{j})\left|0\right\rangle$ is the squeezed vacuum state and $\hat{S}(r_{j})=\exp(r_{j}(\hat{a}_{j}^{\dagger(\text{in})})^{2}/2-r_{j}(\hat{a}_{j}^{(\text{in})})^{2}/2)$ is the squeezing operator [43, 44, 45]. We assume the squeezing phase is zero, and $\hat{a}_{j}^{\dagger(\text{in})}$ ($\hat{a}_{j}^{(\text{in})}$) is the input creation (annihilation) operator for the $j$-th mode. The vacuum state $\left|0\right\rangle$ corresponds to setting $r_{j}=0$ in the squeezing operator.

The ideal GBS assumes the inputs are pure squeezed states [46]. These non-classical states are characterized by their quadrature variances, which for normal ordering are defined as [47, 48]

$\Delta_{x_{j}}^{2}=2\left(n_{j}+m_{j}\right)=e^{2r_{j}}-1$
$\Delta_{y_{j}}^{2}=2\left(n_{j}-m_{j}\right)=e^{-2r_{j}}-1.$ (2)

Here, $\Delta_{x_{j}}^{2}=\left\langle(\Delta\hat{x}_{j})^{2}\right\rangle=\left\langle\hat{x}_{j}^{2}\right\rangle-\left\langle\hat{x}_{j}\right\rangle^{2}$ is the simplified variance notation for the input quadrature operators $\hat{x}_{j}=\hat{a}_{j}^{(\text{in})}+\hat{a}_{j}^{\dagger(\text{in})}$ and $\hat{y}_{j}=-i(\hat{a}_{j}^{(\text{in})}-\hat{a}_{j}^{\dagger(\text{in})})$, which obey the commutation relation $[\hat{x}_{j},\hat{y}_{k}]=2i\delta_{jk}$.

Squeezed states, while still satisfying the Heisenberg uncertainty principle $\Delta_{x_{j}}^{2}\Delta_{y_{j}}^{2}=1$, have altered variances such that one quadrature has variance below the vacuum noise level, $\Delta_{x_{j}}^{2}<1$. By the Heisenberg uncertainty principle, this causes the variance in the conjugate quadrature to become amplified well above the vacuum limit, $\Delta_{y_{j}}^{2}>1$ [43, 49, 50]. Other squeezing orientations are possible and remain non-classical so long as one variance is below the vacuum limit.

For pure squeezed states, the number of photons in each mode is $n_{j}=\left\langle\hat{a}_{j}^{\dagger(\text{in})}\hat{a}_{j}^{(\text{in})}\right\rangle=\sinh^{2}(r_{j})$, whilst the coherence per mode is $m_{j}=\left\langle(\hat{a}_{j}^{(\text{in})})^{2}\right\rangle=\cosh(r_{j})\sinh(r_{j})=\sqrt{n_{j}\left(n_{j}+1\right)}$.
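These identities between the squeezing parameter, photon number and coherence can be checked numerically. A minimal sketch (the value of $r$ is arbitrary):

```python
import numpy as np

r = 0.8  # arbitrary squeezing parameter
n = np.sinh(r) ** 2            # mean photon number per mode
m = np.cosh(r) * np.sinh(r)    # coherence per mode

# For a pure squeezed vacuum state, m = sqrt(n(n+1))
assert np.isclose(m, np.sqrt(n * (n + 1)))

# Normally ordered quadrature variances of Eq. (2)
assert np.isclose(2 * (n + m), np.exp(2 * r) - 1)
assert np.isclose(2 * (n - m), np.exp(-2 * r) - 1)
```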

II.2 Input-output theory

Linear photonic networks made from a series of polarizing beamsplitters, phase shifters and mirrors act as large scale interferometers, and crucially conserve the Gaussian nature of the input state.

In the ideal lossless case, as seen in Fig. 1, photonic networks are defined by an $M\times M$ Haar random unitary matrix $\boldsymbol{U}$. The term Haar refers to the Haar measure, which is the uniform probability measure on the unitary group [51].

For lossless networks, input modes are converted to outputs following

$\hat{a}_{i}^{(\text{out})}=\sum_{j=1}^{M}U_{ij}\hat{a}_{j}^{(\text{in})},$ (3)

where $\hat{a}_{i}^{(\text{out})}$ is the output annihilation operator for the $i$-th mode. Therefore, each output mode is a linear combination of all input modes.
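As an illustration, a Haar random unitary can be drawn by QR-decomposing a complex Gaussian matrix (with the standard phase correction), and the linear map of Eq. (3) then conserves total mean photon number for any input amplitudes. A sketch, not tied to any particular experiment:

```python
import numpy as np

rng = np.random.default_rng(1)
M = 8  # illustrative number of modes

# Haar random unitary via QR decomposition of a complex Gaussian matrix
Z = rng.normal(size=(M, M)) + 1j * rng.normal(size=(M, M))
Q, R = np.linalg.qr(Z)
U = Q * (np.diag(R) / np.abs(np.diag(R)))  # fix column phases for Haar measure

assert np.allclose(U @ U.conj().T, np.eye(M))  # unitarity

# A lossless network maps coherent amplitudes a -> U a,
# conserving the total mean photon number sum_i |a_i|^2.
a_in = rng.normal(size=M) + 1j * rng.normal(size=M)
a_out = U @ a_in
assert np.isclose(np.sum(np.abs(a_out) ** 2), np.sum(np.abs(a_in) ** 2))
```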

Practical applications will always include some form of loss in the network, for example photon absorption. This causes the matrix to become non-unitary, in which case it is denoted by the transmission matrix $\boldsymbol{T}$, and the input-to-output mode conversion now contains additional loss terms [52, 53]:

$\hat{a}_{i}^{(\text{out})}=\sum_{j=1}^{N}T_{ij}\hat{a}_{j}^{(\text{in})}+\sum_{j=1}^{M}B_{ij}\hat{b}_{j}^{(\text{in})}.$ (4)

Here, $\hat{b}_{j}^{(\text{in})}$ is the annihilation operator for the $j$-th loss mode, whilst $\boldsymbol{B}$ is a random noise matrix. The loss matrix preserves the total unitarity of the system, as $\boldsymbol{T}\boldsymbol{T}^{\dagger}+\boldsymbol{B}\boldsymbol{B}^{\dagger}=\boldsymbol{I}$, where $\boldsymbol{I}$ is the identity matrix.
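For uniform loss with intensity transmission $\eta$, one simple choice satisfying this constraint is $\boldsymbol{T}=\sqrt{\eta}\,\boldsymbol{U}$ and $\boldsymbol{B}=\sqrt{1-\eta}\,\boldsymbol{V}$ for unitaries $\boldsymbol{U},\boldsymbol{V}$. A hedged numerical sketch (the value of $\eta$ is arbitrary):

```python
import numpy as np

rng = np.random.default_rng(2)
M = 6
eta = 0.7  # assumed uniform intensity transmission

def haar_unitary(M, rng):
    """Haar random unitary via phase-corrected QR decomposition."""
    Z = rng.normal(size=(M, M)) + 1j * rng.normal(size=(M, M))
    Q, R = np.linalg.qr(Z)
    return Q * (np.diag(R) / np.abs(np.diag(R)))

U, V = haar_unitary(M, rng), haar_unitary(M, rng)
T = np.sqrt(eta) * U        # transmission matrix (non-unitary)
B = np.sqrt(1 - eta) * V    # coupling to the unobserved loss modes

# T T^dag + B B^dag = I, so total unitarity is preserved
assert np.allclose(T @ T.conj().T + B @ B.conj().T, np.eye(M))
```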

Although linear networks are conceptually simple systems, they introduce computational complexity from the exponential number of possible interference pathways photons can take in the network. These are generated from the beamsplitters. As an example, the input ports of a 50/50 beamsplitter are defined as superpositions of the output modes:

$\hat{a}_{1}^{\dagger(\text{in})}=\frac{1}{\sqrt{2}}\left(\hat{a}_{3}^{\dagger(\text{out})}+\hat{a}_{4}^{\dagger(\text{out})}\right)$
$\hat{a}_{2}^{\dagger(\text{in})}=\frac{1}{\sqrt{2}}\left(\hat{a}_{3}^{\dagger(\text{out})}-\hat{a}_{4}^{\dagger(\text{out})}\right).$ (5)

Classical states such as coherent and thermal states input into large scale linear networks are readily simulated on a classical computer [34]. This leads to the question: why are non-classical states such as Fock and squeezed states computationally hard to simulate in large scale photonic networks?

In short, non-classical states input into a linear network generate large amounts of quantum correlations, which are non-trivial to simulate classically if combined with photo-detection. Continuing with the beamsplitter example above, if input ports 1 and 2 now contain indistinguishable Fock states, as is the case with standard boson sampling, the input state can be written as

$\left|1,1\right\rangle_{1,2}=\frac{1}{\sqrt{2}}\left(\left|2,0\right\rangle_{3,4}-\left|0,2\right\rangle_{3,4}\right).$ (6)

The absence of a $\left|1,1\right\rangle_{3,4}=\hat{a}_{3}^{\dagger(\text{out})}\hat{a}_{4}^{\dagger(\text{out})}\left|0,0\right\rangle_{3,4}$ term means the output photons are bunched, that is, they are entangled and always arrive in pairs [54]. This phenomenon is known as the Hong-Ou-Mandel (HOM) interference effect, named after the authors who first observed it [55].
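The HOM effect can be reproduced directly in a truncated two-mode Fock space. The sketch below builds the 50/50 beamsplitter unitary $U=\exp[\theta(\hat{a}_{1}^{\dagger}\hat{a}_{2}-\hat{a}_{1}\hat{a}_{2}^{\dagger})]$ with $\theta=\pi/4$ (one common convention; signs vary between texts) and checks that the $\left|1,1\right\rangle$ output amplitude vanishes:

```python
import numpy as np
from scipy.linalg import expm

d = 4  # Fock-space truncation per mode (enough for two photons)
a = np.diag(np.sqrt(np.arange(1, d)), k=1)   # annihilation operator
I = np.eye(d)
a1, a2 = np.kron(a, I), np.kron(I, a)        # two-mode operators

# 50/50 beamsplitter: U = exp[theta (a1^dag a2 - a1 a2^dag)], theta = pi/4
theta = np.pi / 4
U = expm(theta * (a1.conj().T @ a2 - a1 @ a2.conj().T))

def fock(n1, n2):
    """Two-mode Fock basis vector |n1, n2>."""
    v = np.zeros(d * d)
    v[n1 * d + n2] = 1.0
    return v

out = U @ fock(1, 1)
assert abs(out @ fock(1, 1)) < 1e-10                        # no coincidences
assert np.isclose(abs(out @ fock(2, 0)), 1 / np.sqrt(2))    # bunched pairs
assert np.isclose(abs(out @ fock(0, 2)), 1 / np.sqrt(2))
```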

If input ports 1 and 2 instead contain squeezed vacuum states, one generates two-mode Einstein-Podolsky-Rosen (EPR) entanglement in the quadrature phase amplitudes [56, 57]. More details, including a summary of the derivation, can be found in [58, 31].

Figure 1: Gaussian boson sampling schematic where $N\leq M$ single-mode squeezed states of different polarizations (blue ellipses) are sent into a lossless linear photonic network, and the remaining $M-N$ inputs are vacuum states (blue outlined circles). The linear network is created from a series of phase-shifters and beamsplitters, represented here in a circuit diagram format. This generates an exponential number of interference pathways. Mathematically, the network is defined by an $M\times M$ Haar random unitary $\boldsymbol{U}$. Output photon patterns are then obtained using either PNR or threshold detectors.

II.3 Photon counting

In GBS experiments, photon count patterns, denoted by the vector $\boldsymbol{c}$, are generated by measuring the output state $\hat{\rho}^{(\text{out})}$. The corresponding distribution changes depending on the type of detector used.

The original proposal for GBS [7] involved normally ordered photon-number-resolving (PNR) detectors that can distinguish between incoming photons and implement the projection operator [54, 59]

$\hat{p}_{j}(c_{j})=\frac{1}{c_{j}!}:(\hat{n}^{\prime}_{j})^{c_{j}}e^{-\hat{n}^{\prime}_{j}}:.$ (7)

Here, $:\dots:$ denotes normal ordering, $c_{j}=0,1,2,\dots$ is the number of counts in the $j$-th detector and $\hat{n}^{\prime}_{j}=\hat{a}_{j}^{\dagger(\text{out})}\hat{a}_{j}^{(\text{out})}$ is the output photon number. Practically, a PNR detector typically saturates at a maximum number of counts $c_{\text{max}}$, the value of which varies depending on the type of PNR detector implemented.
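For a coherent state, normal ordering simply replaces $\hat{n}^{\prime}_{j}$ by the mean photon number $|\alpha|^{2}$, so Eq. (7) reduces to a Poisson distribution over counts. A quick numerical check of normalization and mean (the value of $|\alpha|^{2}$ is arbitrary):

```python
import numpy as np
from math import factorial

nbar = 2.3  # mean photon number |alpha|^2 of an assumed coherent state

# <p_j(c)> = nbar^c exp(-nbar) / c!  -- Poissonian count probabilities
probs = np.array([nbar**c * np.exp(-nbar) / factorial(c) for c in range(60)])

assert np.isclose(probs.sum(), 1.0)                     # normalized
assert np.isclose((np.arange(60) * probs).sum(), nbar)  # mean count = nbar
```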

When the mean number of output photons per mode satisfies $\left\langle\hat{n}^{\prime}_{j}\right\rangle\ll 1$, PNR detectors are equivalent to threshold, or click, detectors, which saturate when more than one photon arrives at a detector. Threshold detectors are described by the projection operator [59]

$\hat{\pi}_{j}(c_{j})=:e^{-\hat{n}^{\prime}_{j}}\left(e^{\hat{n}^{\prime}_{j}}-1\right)^{c_{j}}:,$ (8)

where $c_{j}=0,1$ regardless of the actual number of photons arriving at the detector.

In situations where $\left\langle\hat{n}^{\prime}_{j}\right\rangle\gg 1$, threshold detectors become ineffective since they cannot accurately discriminate between the different numbers of incoming photons. However, by sending photons through a demultiplexer (demux) before they arrive at the detector, the output pulses of light become 'diluted' such that the mean photon number is small enough to be accurately counted using threshold detectors [60, 61]. In this approach the demux acts as a type of secondary optical network of size $1\times M_{S}$, where $M_{S}$ is the number of secondary modes. For output photons to become sufficiently dilute, one must satisfy $M_{S}\gg\left\langle\hat{n}^{\prime}_{j}\right\rangle$ [60, 61, 62]. When this is the case, the detectors can be thought of as pseudo-PNR (PPNR), because the threshold detectors are accurately measuring photon counts, thus becoming equivalent to PNR detectors [62].
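The requirement $M_{S}\gg\left\langle\hat{n}^{\prime}_{j}\right\rangle$ can be motivated with a toy birthday-problem estimate: if $n$ photons land independently and uniformly in $M_{S}$ bins, the chance that some bin receives two or more photons (so that a threshold detector under-counts) is small only when $M_{S}$ is large compared to $n$. A hedged sketch, not a model of any specific demux:

```python
def collision_probability(n, M_S):
    """Probability that at least two of n photons share one of M_S bins,
    assuming independent, uniform placement."""
    p_distinct = 1.0
    for k in range(n):
        p_distinct *= (M_S - k) / M_S
    return 1.0 - p_distinct

# Dilution: more secondary modes -> fewer multi-photon events per detector
assert collision_probability(3, 8) > collision_probability(3, 1024)
assert collision_probability(3, 10**6) < 1e-5
```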

The computational complexity of GBS arises in the probabilities of count patterns $\boldsymbol{c}$. These probabilities are determined using a straightforward extension of the projection operators for both PNR and click detectors. In the case of PNR detectors, the operator for a specific photon count pattern is

$\hat{P}(\boldsymbol{c})=\bigotimes_{j=1}^{M}\hat{p}_{j}(c_{j}),$ (9)

the expectation value of which corresponds to the #P-hard matrix Hafnian [7, 33]

$\left\langle\hat{P}(\boldsymbol{c})\right\rangle\approx|\text{Haf}(\boldsymbol{D}_{S})|^{2}.$ (10)

Here, $\boldsymbol{D}_{S}$ is the sub-matrix of $\boldsymbol{D}=\boldsymbol{T}\left(\bigoplus_{j=1}^{M}\tanh(r_{j})\right)\boldsymbol{T}^{T}$ corresponding to the channels with detected photons, where the superscript $T$ denotes the transpose of the lossy experimental transmission matrix.

The Hafnian is a more general function than the matrix permanent; the two are related via [7, 33]

$\text{perm}(\boldsymbol{D})=\text{Haf}\begin{pmatrix}0&\boldsymbol{D}\\\boldsymbol{D}^{T}&0\end{pmatrix}.$ (11)
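For small matrices, Eq. (11) can be verified by brute force, using the definitions of the permanent (a sum over permutations) and the Hafnian (a sum over perfect matchings). A sketch, practical only at tiny sizes:

```python
import numpy as np
from itertools import permutations

def perm(D):
    """Matrix permanent by direct sum over permutations."""
    n = D.shape[0]
    return sum(np.prod([D[i, s[i]] for i in range(n)])
               for s in permutations(range(n)))

def haf(A):
    """Hafnian by recursion over perfect matchings (A of even dimension)."""
    n = A.shape[0]
    if n == 0:
        return 1.0
    # match index 0 with each partner j, then recurse on the remaining rows
    rest = lambda j: [k for k in range(1, n) if k != j]
    return sum(A[0, j] * haf(A[np.ix_(rest(j), rest(j))]) for j in range(1, n))

rng = np.random.default_rng(3)
D = rng.normal(size=(3, 3))
block = np.block([[np.zeros((3, 3)), D], [D.T, np.zeros((3, 3))]])
assert np.isclose(perm(D), haf(block))  # Eq. (11)
```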

Therefore, one of the advantages of GBS with PNR detectors is that one can compute the permanent of large matrices from the Hafnian instead of performing Fock state boson sampling.

For threshold detectors, the operator for a specific binary count pattern $\boldsymbol{c}$ is

$\hat{\Pi}(\boldsymbol{c})=\bigotimes_{j=1}^{M}\hat{\pi}_{j}(c_{j}),$ (12)

whose expectation value corresponds to the Torontonian [8]

$\left\langle\hat{\Pi}(\boldsymbol{c})\right\rangle=\text{Tor}(\boldsymbol{O}_{S}),$ (13)

where $\boldsymbol{O}_{S}$ is a matrix constructed from the covariance matrix of the output state. The Torontonian has also been shown to be #P-hard to compute, and is directly related to the Hafnian [8].

II.4 Experimental progress

Only a few short years after the original proposal of GBS with threshold detectors [8], the first large scale experimental network, called Jiuzhang 1.0, was implemented by Zhong et al [15]. In this experiment, $N=50$ single-mode squeezed states were sent into a lossy $M=100$ mode linear network, which generated over 50 million binary count patterns in $\approx 200$ s.

The experimentalists estimated it would take Fugaku, then the largest supercomputer, 600 million years to generate the same number of samples using the fastest exact classical algorithm available at the time [20]. Exact samplers are algorithms designed to replicate an experiment by directly sampling the Torontonian or Hafnian distribution on a classical supercomputer. When experiments claim quantum advantage, they typically do so by comparing the run-time of the experiment with such exact samplers.

The algorithm implemented in [20] uses a chain rule sampler that samples each mode sequentially by computing the conditional probability of observing photons in the $k$-th mode given the photon counts of the previous $k-1$ modes. The computational complexity of this algorithm scales as $\mathcal{O}(N_{D}^{3}2^{N_{D}})$, where $N_{D}=\sum_{j=1}^{M}c_{j}$ is the total number of detected photon counts. Due to this scaling with detected counts and system size, more recent experiments aim to beat these exact samplers by increasing the number of observed counts. The mean number of counts per pattern in Jiuzhang 1.0 was $\approx 41$, and the largest number of counts in a single pattern was 76.
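The chain rule idea can be illustrated on any discrete distribution: each mode is sampled from its conditional distribution given the modes already fixed. The sketch below does this for a toy joint distribution over binary click patterns; in real GBS the conditionals require Torontonian or Hafnian evaluations, which is the exponentially costly step.

```python
import numpy as np
from itertools import product

rng = np.random.default_rng(4)
M = 3  # toy number of modes

# Toy joint distribution P(c) over all 2^M binary click patterns
patterns = list(product((0, 1), repeat=M))
P = rng.random(len(patterns))
P /= P.sum()
joint = dict(zip(patterns, P))

def chain_rule_sample(rng):
    """Sample one pattern mode-by-mode from exact conditionals."""
    c = ()
    for k in range(M):
        # weight of each choice = marginal over the remaining M-k-1 modes
        w = [sum(joint[c + (bit,) + tail]
                 for tail in product((0, 1), repeat=M - k - 1))
             for bit in (0, 1)]
        c += (int(rng.random() < w[1] / (w[0] + w[1])),)
    return c

n_samples = 20000
counts = {p: 0 for p in patterns}
for _ in range(n_samples):
    counts[chain_rule_sample(rng)] += 1
freq = np.array([counts[p] / n_samples for p in patterns])
assert np.allclose(freq, P, atol=0.02)  # matches the target joint distribution
```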

Shortly after this initial experiment, in order to reduce the probability of an exact sampler replicating the experiment, Zhong et al [16] implemented an even larger GBS network, named Jiuzhang 2.0, which increased the number of modes to $M=144$ and performed multiple experiments for different input laser powers and waists. As expected, this produced an increase in the number of observed clicks, with a mean click rate of $\approx 68$ and a maximum observed click number of $N_{D}=113$.

On the classical side, the increased probability of multiple photons arriving at the same detector was exploited by Bulmer et al [22] to improve the scaling of the chain rule sampler, achieving $\mathcal{O}(N_{D}^{3}2^{N_{D}/2})$ for GBS with both PNR and click detectors. Despite this apparently modest improvement, it was estimated that generating the same number of samples as Jiuzhang 1.0 on Fugaku would now only take $\sim 73$ days [22]. Due to the substantial speed-up over previous exact algorithms [20, 21], it was predicted that experiments using either PNR or click detectors needed $N_{D}\gtrsim 100$ to surpass exact samplers [22].

To reduce the probability of multiple photons arriving at a single detector, Deng et al [18] added a $1\times 8$ demux to the output modes of Jiuzhang 2.0, dubbing this upgraded network Jiuzhang 3.0. The demux is made of multiple fiber loop beamsplitters that separate photons into four temporal bins and two spatial path bins, and aims to ensure $\left\langle\hat{n}^{\prime}_{j}\right\rangle\lesssim 1$, increasing the likelihood that the threshold detectors accurately count the incoming photons. This simple addition generated patterns with almost double the largest number of clicks obtained in Jiuzhang 2.0, with one experiment observing a maximum of $N_{D}=255$ and a mean click number of more than 100.

The linear networks implemented in the three experiments above are all static, that is, once the networks have been fabricated, one cannot reprogram any of the internal optics. This has the advantage of circumventing the exponential accumulation of loss that arises from increased depth [17, 63], although one sacrifices programmability.

To this end, Madsen et al [17] implemented a programmable GBS called Borealis using three fiber loop variable beamsplitters, each containing a programmable phase-shifter. In this experiment, $N$ squeezed state pulses are generated at a rate of 6 MHz and are sent into each variable beamsplitter sequentially, allowing pulses in different time-bins to interfere. Photons output from the final fibre loop are then sent into a $1\times 16$ demultiplexer and counted by 16 PNR detectors, which are superconducting transition edge sensors (TES).

The demultiplexer is required for two reasons. The first is to partially 'dilute' the number of photons in each output pulse for the PNR detectors used [17]. The second is to ensure each TES has reached its baseline operating temperature before another pulse is detected, ensuring an accurate determination of photon numbers.

The two largest networks using this setup sent $N=216$ and $N=288$ input pulses into the network, detecting a mean photon number of $\approx 125$ and $\approx 143$, respectively. However, this network was more susceptible to losses than the previous systems [15, 16, 18].

III Simulation methods

The rapid growth of experimental networks has spurred a parallel increase in the number of algorithms proposed for validation and/or simulation of these networks. Due to this, there are some inconsistencies in the literature on the language used to define a simulator which runs on a classical computer. Therefore, we clarify the definitions of the various classical simulators proposed for validation used throughout the rest of the review.

We first emphasize that a classical algorithm or classical simulator is any program designed to run on a digital computer, be that a standard desktop or supercomputer. The current algorithms proposed to validate GBS can be defined as either strong or weak classical simulators [64]. There is a large literature on this topic, starting from early work in quantum optics.

Weak classical simulators parallel the quantum computation of experimental networks by sampling from the output probability distribution [65]. Algorithms that sample from the full Torontonian or Hafnian distributions are exact samplers [20, 21, 22], as outlined above, and all known exact algorithms are exponentially slow. There are faster but approximate algorithms producing discrete photon count patterns that are typically called “faked” or “spoofed” patterns. Some of these generate fake patterns by sampling from the marginal probabilities of these distributions [23, 24], in which case they are called low-order marginal samplers. Such low-order methods are only approximate, since they do not precisely simulate higher-order correlations [23, 24].

In contrast, strong classical simulators are a type of classical algorithm that evaluates the output probability distribution, be that the full or marginal probabilities, of a GBS with multiplicative accuracy, in time polynomial in the size of the quantum computation [65, 66].

Phase-space simulators have a similarity to strong classical simulators, because the samples, which are produced with a stochastic component representing quantum fluctuations, are used to compute probabilities or moments of the ideal distribution. Positive-P phase-space simulations have been widely used in quantum optics for this reason [19, 67, 68, 69, 70, 71]. While the sampling precision is not multiplicative, it is better than the experimental precision of GBS at the same sample number, which is sufficient for validation. By approximating the inputs as classical states, one can also use phase-space methods to produce fake patterns [32], in which case the method is called a classical P-function sampler, as explained below.
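For classical inputs the idea is straightforward to sketch. A thermal state has a positive Gaussian P-function, so one can draw coherent amplitudes $\alpha\sim\mathcal{CN}(0,\bar{n})$, propagate them linearly through the network matrix, and generate clicks as Bernoulli events with probability $1-e^{-|\beta_{i}|^{2}}$. A hedged illustration with thermal inputs only (this does not reproduce squeezed-state GBS, and the Haar unitary stands in for an experimental transmission matrix):

```python
import numpy as np

rng = np.random.default_rng(5)
M, nbar, shots = 6, 0.5, 4000  # modes, thermal occupation, samples

# Haar random unitary as a stand-in for the network transmission matrix T
Z = rng.normal(size=(M, M)) + 1j * rng.normal(size=(M, M))
Q, R = np.linalg.qr(Z)
T = Q * (np.diag(R) / np.abs(np.diag(R)))

patterns = np.zeros((shots, M), dtype=int)
for s in range(shots):
    # Sample the positive P-function of M thermal modes: alpha ~ CN(0, nbar)
    alpha = (rng.normal(size=M) + 1j * rng.normal(size=M)) * np.sqrt(nbar / 2)
    beta = T @ alpha  # propagate coherent amplitudes through the network
    # Threshold detection: a click occurs with probability 1 - exp(-|beta_i|^2)
    patterns[s] = rng.random(M) < 1 - np.exp(-np.abs(beta) ** 2)

# For thermal light the click probability is nbar / (1 + nbar)
assert np.isclose(patterns.mean(), nbar / (1 + nbar), atol=0.04)
```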

For non-classical inputs, one can generate probabilities but not count patterns using phase-space methods, since there are no known algorithms to generate counts efficiently from non-classical phase-space samples.

III.1 Exact classical simulation

There are a number of “exact” classical simulation methods that generate equivalent photon-counts, where we use quotes for “exact” because even these methods have round-off errors due to finite-precision arithmetic. These algorithms are all exponentially slow [20, 21, 22]. It is known that the non-classicality of the squeezed input states of GBS is essential to providing quantum advantage in the GBS experiments, since classical states, which have a positive Glauber-Sudarshan P-representation [72, 73], are known to have classically simulable photon counts. This is treated in Section III.2.2. However, the non-Gaussianity arising from the photon-detection measurement process is also important.

This is because a measurement set-up in which quadrature-phase-amplitude measurements are made on a system described by Gaussian states can be modeled by a positive Wigner function, and hence is classically simulable [34, 35]. Gaussian states are defined as those with a Gaussian Wigner function. We note that squeezed states, while non-classical, are Gaussian states. Examples of the classical simulation of entanglement for systems with a positive Wigner function are well known [74, 75, 76, 77, 78, 79].

The role of the measurement set-up in GBS is clarified by the work of Chabaud, Ferrini, Grosshans and Markham [65] and Chabaud and Walschaers [66]. These authors illustrate a connection between quantum advantage and the non-Gaussianity of both the input states and measurement set-up. In [65], conditions sufficient for the strong simulation of Gaussian quantum circuits with non-Gaussian input states are derived. Non-Gaussian states are those for which the Gaussian factorization of correlation functions is not applicable [80].

By demonstrating a mapping between bosonic quantum computation and continuous-variable sampling computation, where the measurement comprises a double quadrature detection, Chabaud and Walschaers [66] adapt classical algorithms derived in [66] to derive a general algorithm for the strong simulation of bosonic quantum computation, which includes Gaussian boson sampling. They prove that the complexity of this algorithm scales with a measure of non-Gaussianity, the stellar rank of both the input state and measurement set-up. This enables a quantification of non-Gaussianity, including from the nature of the measurement, as a resource for achieving quantum advantage in bosonic computation.

III.2 Approximate classical simulation

III.2.1 Low-order marginal samplers

Recent experiments have surpassed the threshold of applicability of exact samplers [16, 17, 18]. Even for experiments below this threshold [15], the computation time can still be very long and resource intensive. Therefore, approximate methods that are more readily scaled to larger sizes have been proposed. These cannot validate GBS, but they are useful in quantifying a computational advantage.

One approach, presented by Villalonga et al [23], exploits the computational efficiency of computing low-order marginal probabilities of the ideal GBS distribution to generate photon count patterns by sampling from a distribution that contains the correct low-order marginals. We note that the relevant marginal probabilities must first be evaluated before sampling.

Two methods are implemented to compute marginals, which take the form of multivariate cumulants, also called connected correlations or Ursell functions. The first method corresponds to a Boltzmann machine (BM) and computes pattern probabilities of the form [23]

$\left\langle\hat{\Pi}(\boldsymbol{c})\right\rangle=\frac{1}{Z}\exp\left(\sum_{i}\lambda_{i}\hat{\pi}_{i}(c_{i})+\sum_{i<j}\lambda_{i,j}\hat{\pi}_{i}(c_{i})\hat{\pi}_{j}(c_{j})+\sum_{i<j<k}\lambda_{i,j,k}\hat{\pi}_{i}(c_{i})\hat{\pi}_{j}(c_{j})\hat{\pi}_{k}(c_{k})+\dots\right)$ (14)

where $Z$ is the partition function that normalizes the distribution and $\lambda_{i},\lambda_{i,j},\dots$ are parameters computed using a mean-field approximation.

Each term in the exponent corresponds to a marginal probability of the ideal distribution; a BM spoofer using only the first two summations was implemented for scalability reasons. The BM method is only applicable to GBS with threshold detectors, where faked binary patterns are obtained via Gibbs sampling [23].
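Gibbs sampling from a second-order BM of this form is standard; the sketch below uses made-up parameters $\lambda$ purely for illustration (in a real spoofer these are fitted to the marginals of the target GBS distribution):

```python
import numpy as np

rng = np.random.default_rng(6)
M = 10  # number of modes / binary pattern length

# Hypothetical BM parameters; a real spoofer fits these to the GBS marginals
lam1 = rng.normal(scale=0.5, size=M)
pair = np.triu(rng.normal(scale=0.2, size=(M, M)), k=1)
lam2 = pair + pair.T  # symmetric pair couplings, zero diagonal

def gibbs_sweep(c):
    """One sweep of single-site Gibbs updates on a binary click pattern."""
    for i in range(M):
        # log-odds of c_i = 1 given the other bits (diagonal of lam2 is zero)
        field = lam1[i] + lam2[i] @ c
        c[i] = int(rng.random() < 1 / (1 + np.exp(-field)))
    return c

c = rng.integers(0, 2, size=M)
fake_patterns = []
for sweep in range(2000):
    c = gibbs_sweep(c)
    if sweep >= 500:  # discard burn-in sweeps
        fake_patterns.append(c.copy())

assert len(fake_patterns) == 1500
assert all(np.isin(p, (0, 1)).all() for p in fake_patterns)
```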

The second method uses a greedy heuristic to generate discrete binary patterns with approximately correct second and third-order marginals. This algorithm scales exponentially with the desired order of the marginal and the length of the patterns, which are typically equal to the number of modes. Although the greedy method was originally developed for GBS with threshold detectors, it has since been extended to PNR detectors [17].

Faked patterns generated from both methods were compared to experimental samples from Jiuzhang 1.0 and 2.0. The best comparison results came from computing the total variational distance

\Delta\delta=\delta_{m}-\delta_{e}, (15)

where \delta_{m} is the distance between pattern probabilities of the ideal distribution and the marginal spoofers, and \delta_{e} is the distance between experiment and ideal.
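For concreteness, the comparison of Eq. (15) can be sketched in a few lines. This is our own illustrative helper, not code from Ref. [23]; the pattern probabilities are assumed to be supplied as arrays over a common set of patterns:

```python
import numpy as np

def total_variation_distance(p, q):
    """Total variation distance between two discrete distributions."""
    p, q = np.asarray(p, float), np.asarray(q, float)
    return 0.5 * np.abs(p - q).sum()

def spoofer_advantage(p_ideal, p_spoof, p_exp):
    """Delta-delta of Eq. (15): a negative value indicates that the
    spoofer is closer to the ideal distribution than the experiment."""
    return (total_variation_distance(p_ideal, p_spoof)
            - total_variation_distance(p_ideal, p_exp))
```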

Since computing pattern probabilities is computationally challenging, comparisons were limited to only 14 modes. Although the first-order BM sampler was always beaten by the experiment, faked samples generated from the second-order BM and the second- and third-order greedy algorithms were closer to the ideal distribution than the experiment.

Comparisons of the cross-entropy measure, which is a widely used quantum advantage benchmark in random circuit sampling quantum computers [13, 81, 82], were also performed by Villalonga et al [23]. It remains an open question whether this is a useful measure for boson sampling networks [24].

In general, if experimental samples obtain a better cross entropy than the spoofer, there is evidence of computational advantage. As was the case for the total variational distance, experimental samples immediately beat the first-order BM, but for all other spoofers the results were mixed, with all spoofers obtaining a cross entropy similar to the experiment [23].

An algorithm introduced by Oh et al [24] was specifically designed to spoof the cross entropy measure of GBS networks using up to second-order marginal probabilities. This algorithm produced samples that successfully spoofed the small scale test networks in Ref.[17], but became too computationally demanding to spoof larger scale networks that claim quantum advantage [15, 16, 17].

III.2.2 Classical P-function samplers

Practical implementations of GBS will inevitably produce decoherence and losses. These may arise within the network, as discussed above, or in the input states as discussed in subsection IV.2. For large enough decoherence and loss, either the inputs, outputs or both are transformed to classical states [35, 52].

For such classical states, GBS loses its quantum advantage and an efficient classical simulation is possible. This was first shown by Rahimi-Keshari et al [34] for Fock state boson sampling and later extended to GBS by Qi et al [35], although the fundamental ideas are much older than this [83]. In both cases, the condition for classical simulation hinges on whether the resulting phase-space output distribution, simulated using a set of experimental inputs, is non-negative. This rests on the well known result that, for some states, the Glauber-Sudarshan and Wigner representations produce negative distributions on phase-space [84, 54], which is why they are typically referred to as quasi-probability distributions.

We define any classical algorithm that samples from the output distribution of a classical state GBS as a classical GBS sampler. Although the most commonly tested classical state is the fully decoherent thermal state, it is an extreme case and has been thoroughly ruled out for all current experimental networks [15, 16, 17, 18]. A more realistic scenario is a classical approximation to pure squeezed states.

Two such states have recently been proposed, called squashed and squished states [85, 18]. Unlike thermal states, which have equal quadrature variances \Delta_{x_{j}}^{2}=\Delta_{y_{j}}^{2}, squashed and squished states maintain the unequal variances of squeezed states (see Fig. 2 for a diagram of the variances of different states). However, decoherence has raised the once-squeezed quadrature variance to \Delta_{x_{j}}^{2}=1, whilst \Delta_{y_{j}}^{2}>1. Squashed and squished states are therefore classical: no true squeezing occurs, since neither variance is below the vacuum limit. The difference between the two states is that squished states must contain the same mean number of photons as the input squeezed state [18], whereas the squashed state photon number can vary.
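In the variance convention of Fig. 2 (vacuum variance equal to 1), the classicality criterion described here reduces to a one-line check. The helper below is our own illustration, not part of any published sampler:

```python
def is_classical(var_x, var_y):
    """True when neither quadrature variance is below the vacuum limit
    (variance 1 in the convention of Fig. 2), i.e. the state admits a
    non-negative P-function and hence an efficient classical sampler."""
    return min(var_x, var_y) >= 1.0
```

By this check, thermal (equal variances above 1) and squashed or squished states (unequal variances, neither below 1) are classical, while any squeezed state fails it.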

Comparisons of all experimental networks with simulated thermal states have, expectedly, failed [15, 16, 17, 18]. However, Jiuzhang 1.0 data was shown to have a large degree of input decoherence [30], and hence simulated squashed states were shown to model samples from Jiuzhang 1.0 as well as the theoretical ideal distribution does for some statistical tests [85]. An efficient phase-space sampler, which can generate binary patterns for any classical input state, later showed that squashed state count patterns were closer to the simulated ideal distribution than the experimental data set of Jiuzhang 2.0 that claims quantum advantage [32]. This method is reviewed in more detail in section IV. The most recent GBS experiments with PNR and click detectors, Borealis and Jiuzhang 3.0, produced outputs that were closer to the ideal than simulated squashed and squished states, although squished states have only been tested against samples from Jiuzhang 3.0 to date.

The classical GBS samplers implemented in these tests assume either all N\subset M inputs or all M outputs are classical states. A more physically realistic scenario is one where the inputs and/or outputs are mixtures of classical and quantum states. To model this, a classical sampler that scales with the loss rate \eta of the linear network was introduced by Oh et al [25].

The aim of this algorithm is to simulate the output covariance matrix \boldsymbol{V}_{p} of a pure squeezed state GBS using a tensor network method. This covariance matrix arises from a decomposition of the actual output covariance matrix \boldsymbol{V}=\boldsymbol{W}+\boldsymbol{V}_{p}, where \boldsymbol{W}\geq 0 is a random displacement matrix arising from classical photons in the outputs.

As outlined in the previous subsection, the computational complexity of simulating the ideal GBS, corresponding to \boldsymbol{V}_{p} in this case, scales with the number of system modes and hence detected photons. However, only non-classical output photons contribute to this computational complexity. Therefore, if more experimental output photons are classical than non-classical, the matrix \boldsymbol{W} dominates the computation of \boldsymbol{V} [25], which one should then be able to simulate efficiently.

This is the general principle of the algorithm implemented by Oh et al [25], where the number of non-classical output photons is computed from the actual squeezing parameter s=-\frac{1}{2}\ln(1-\eta). Clearly, as the loss rate increases, the number of non-classical photons decreases, in turn causing the number of classical photons to increase and allowing \boldsymbol{W} to dominate the output covariance matrix.

Samples obtained from this method were used to calculate the total variational distance and cross entropy for the small scale test networks of Ref. [17], as well as second and higher-order cumulants for larger scale networks [15, 16, 17, 18]. In all cases, samples produced from this classical sampler were closer to the ideal distributions than all the experiments, highlighting the extent to which the loss rate \eta plays a role in affecting claims of quantum advantage in the current generation of GBS experiments.

Figure 2: A diagram of the quadrature variances for different states. The vacuum state (blue dashed line) has variance \Delta_{x_{j}}^{2}=\Delta_{y_{j}}^{2}=1 and, via the Heisenberg uncertainty principle, is the minimum uncertainty state. When \Delta_{x_{j}}^{2}<1,\;\Delta_{y_{j}}^{2}>1 one obtains the non-classical pure squeezed state (green dashed line) or the thermalized squeezed state (purple dashed line), which has a larger variance in the squeezed quadrature than pure squeezing but remains below the vacuum limit. The classical thermal state (solid red line) has variance \Delta_{x_{j}}^{2}=\Delta_{y_{j}}^{2}>1, while the squashed state has unequal variances \Delta_{x_{j}}^{2}\neq\Delta_{y_{j}}^{2}, yet neither is below the vacuum limit since, in this case, \Delta_{x_{j}}^{2}=1,\;\Delta_{y_{j}}^{2}>1.

IV Validating Gaussian boson sampling in phase-space

One of the many open questions in GBS research is how to efficiently validate large scale experimental networks. The exact methods reviewed above either suffer from scalability issues [20, 21, 22], or else do not simulate higher-order correlations [23] due to computational complexity. As experimental networks rapidly grow, ever higher-order correlations will be produced, owing to the increase in interference pathways. The requirement of a simulation that validates a quantum computer is that it allows the computation of measurable probabilities with an accuracy at least equal to the experimental sampling error.

Such correlations and probabilities play an increasingly important role in characterizing the output distribution [86], even when losses are present [53, 87], despite their increased sensitivity to decoherence. Therefore, scalable methods are needed that simulate higher-order correlations without performing the impractical \#P-hard direct computation of the count samples. To this end, we review recent theoretical methods that simulate grouped count probabilities (GCPs) of GBS networks using the normally ordered positive-P phase-space representation.

These methods can be used for networks with threshold detectors, and have successfully been used to compare theory and experiment for samples from Jiuzhang 1.0 [30] and Jiuzhang 2.0 [32]. We emphasize that the positive-P representation does not produce discrete count patterns. However, the simulated output distributions have identical moments to the ideal GBS distributions.

Before we briefly review previous results presented in [30], we first outline the necessary background theory on phase-space representations, focusing on normally ordered methods, and GCPs. The interested reader is referred to Refs. [30, 31, 32] for more details, while the highly efficient phase-space code, xQSim, can be downloaded from public domain repositories [88].

IV.1 Phase-space methods

Originally developed by Wigner for symmetrically ordered operators [89], phase-space representations establish a mapping from quantum operators of different orderings to probability distributions defined on the classical phase-space variables of position and momentum [90, 54], which are more commonly rewritten in terms of the complex coherent state amplitude vectors \boldsymbol{\alpha}, \boldsymbol{\alpha}^{*}.

Moyal [91] later showed how these methods can be used to compute the dynamics of operators. Due to this, phase-space representations are frequently used to solve the operator master equation [92, 90], which for some representations corresponds to a second-order Fokker-Planck equation (FPE) of the kind commonly used in statistical mechanics.

The FPE, in turn, can be mapped to a stochastic differential equation (SDE) that introduces randomly fluctuating terms and, for some real or complex vector \boldsymbol{a}, takes the general form [93]

\frac{d}{dt}\boldsymbol{a}=\boldsymbol{A}(\boldsymbol{a})+\boldsymbol{B}(\boldsymbol{a})\xi(t), (16)

where \boldsymbol{A} is a vector function and \boldsymbol{B} is a matrix, both of which are typically known, while \xi(t) is a real Gaussian noise term with \left\langle\xi\right\rangle=0 and \left\langle\xi_{i}(t)\xi_{j}(t^{\prime})\right\rangle=\delta(t-t^{\prime})\delta_{ij}. These randomly fluctuating terms, defined as the derivative of a Wiener process [93], play a role analogous to quantum and thermal fluctuations in applications to quantum mechanics.

Each solution of an SDE is a stochastic trajectory in phase-space and physically corresponds to a single experiment. Therefore, an average over an ensemble of E_{S} trajectories corresponds to a mean value obtained from multiple experimental shots.
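As a concrete illustration of Eq. (16), the simplest numerical scheme is Euler-Maruyama integration, in which each trajectory receives independent Wiener increments of variance dt. This is a generic sketch with our own naming, not tied to any particular quantum optics model:

```python
import numpy as np

def euler_maruyama(A, B, a0, dt, n_steps, rng):
    """Integrate da/dt = A(a) + B(a) xi(t) (Eq. 16) with the
    Euler-Maruyama scheme. A(a) returns a drift vector, B(a) a noise
    matrix; dW are Wiener increments with variance dt per step."""
    a = np.array(a0, dtype=float)
    for _ in range(n_steps):
        dW = rng.normal(0.0, np.sqrt(dt), size=a.shape)
        a = a + A(a) * dt + B(a) @ dW
    return a
```

With the noise matrix set to zero this reduces to ordinary Euler integration, which provides a simple sanity check of a deterministic limit.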

Such dynamical methods have been successfully applied to a variety of quantum optics systems [94, 95, 96, 97] (see also Ref. [71] for a brief review), including the CIM [26, 27]. However, as is clear from subsection II.2, linear networks are not modeled dynamically and hence do not produce an SDE.

Instead, phase-space representations model linear networks as stochastic difference equations (SDFEs), where the M-th order SDFE takes the general form [98]

\boldsymbol{a}_{n+1}=\sum_{m=0}^{M}\boldsymbol{A}_{m}(n-m,\boldsymbol{a}_{n-m})+\boldsymbol{B}(n,\boldsymbol{a}_{n})\xi_{n}, (17)

where \boldsymbol{a}_{n}, \boldsymbol{A}_{m}, \boldsymbol{B} and \xi_{n} are discrete analogs of their continuous variable definitions in Eq. (16). This becomes clearer when comparing with Eq. (4).

Due to the randomly fluctuating term, SDEs do not have analytical solutions and must be computed numerically using a variety of methods [99]. As is also the case with numerical computations of non-stochastic differential equations, SDEs are approximated as difference equations for numerical computation. Although one usually has practical issues such as small step size limits for SDEs [99, 98], SDEs and SDFEs are two sides of the same coin.

Due to the use of PNR and threshold detectors, we focus on simulating linear networks using the normally ordered Glauber-Sudarshan diagonal P-representation and generalized P-representations. Non-normally ordered Wigner and Q-function methods are also possible [30, 31, 32], but these have too large a sampling error to be useful for simulations of photon counting.

IV.1.1 Generalized P-representation

The generalized P-representation developed by Drummond and Gardiner [100] is a family of normally ordered phase-space representations that produce exact and strictly positive distributions on phase-space for any quantum state. It was developed as a non-classical extension of the Glauber-Sudarshan diagonal P-representation, which is defined as

\hat{\rho}=\int P(\boldsymbol{\alpha})\left|\boldsymbol{\alpha}\right\rangle\left\langle\boldsymbol{\alpha}\right|\text{d}^{2}\boldsymbol{\alpha},

where \left|\boldsymbol{\alpha}\right\rangle is a coherent state vector.

Due to the absence of off-diagonal terms in the density matrix, which represent quantum superpositions, the diagonal P-representation produces a distribution P(\boldsymbol{\alpha}) that is negative and singular for non-classical states such as Fock and squeezed states.

To account for this, the generalized P-representation introduces the projection operator

\hat{\Lambda}(\boldsymbol{\alpha},\boldsymbol{\beta})=\frac{\left|\boldsymbol{\alpha}\right\rangle\left\langle\boldsymbol{\beta}^{*}\right|}{\left\langle\boldsymbol{\beta}^{*}|\boldsymbol{\alpha}\right\rangle}, (18)

which doubles the phase-space dimension. This increased dimension allows quantum superpositions to be represented, since the basis now generates independent coherent state amplitudes \boldsymbol{\alpha}, \boldsymbol{\beta} with off-diagonal amplitudes \boldsymbol{\beta}\neq\boldsymbol{\alpha}^{*}, which define a quantum phase-space of larger dimensionality.

The most useful generalized-P method for simulating linear networks with squeezed state inputs is the positive P-representation, which expands the density matrix as the 4M-dimensional volume integral

\hat{\rho}=\int\int P(\boldsymbol{\alpha},\boldsymbol{\beta})\hat{\Lambda}(\boldsymbol{\alpha},\boldsymbol{\beta})\text{d}^{2M}\boldsymbol{\alpha}\,\text{d}^{2M}\boldsymbol{\beta}. (19)

Here, \boldsymbol{\alpha}, \boldsymbol{\beta} can vary over the entire complex plane and, by taking the real part of Eq. (19), the density matrix remains Hermitian, thus allowing efficient sampling.

The other generalized-P method, called the complex P-representation, requires P(\boldsymbol{\alpha},\boldsymbol{\beta}) to be complex, as a result of its definition as a contour integral [100]. This makes the complex P-representation useful for simulating Fock state boson sampling, which requires a complex weight term \Omega to be applied to the sampled distribution [101, 102].

One of the key reasons the positive P-representation is useful for simulating photonic networks arises from the moment relationship

\left\langle\hat{a}_{j_{1}}^{\dagger}\dots\hat{a}_{j_{n}}\right\rangle=\left\langle\beta_{j_{1}}\dots\alpha_{j_{n}}\right\rangle_{P}=\int\int P(\boldsymbol{\alpha},\boldsymbol{\beta})\left(\beta_{j_{1}}\dots\alpha_{j_{n}}\right)\text{d}^{2M}\boldsymbol{\alpha}\,\text{d}^{2M}\boldsymbol{\beta}, (20)

where \left\langle\dots\right\rangle denotes a quantum expectation value and \left\langle\dots\right\rangle_{P} a positive-P ensemble average. Therefore, normally ordered operator moments are exactly equivalent to positive-P phase-space moments, a relation that is also valid for any generalized P-representation.

The reader familiar with phase-space methods may know that other representations, such as the Wigner and Husimi Q function, also output a positive, non-singular distribution for Gaussian, non-classical states.

While this is certainly true, for experiments using normally ordered detectors, one must re-order every non-normally ordered operator to obtain normal ordering. This introduces a term corresponding to vacuum noise in the initial phase-space samples, resulting in sampling errors that increase exponentially for higher-order correlations, thereby making such methods unsuitable for simulating photonic networks [32].

Using the coherent state expansion of pure squeezed states [103], one can define the input state Eq.(1) in terms of the positive P-representation as

\hat{\rho}^{(\text{in})}=\text{Re}\int\int P(\boldsymbol{\alpha},\boldsymbol{\beta})\hat{\Lambda}(\boldsymbol{\alpha},\boldsymbol{\beta})\text{d}\boldsymbol{\alpha}\,\text{d}\boldsymbol{\beta}. (21)

The resulting positive-P distribution for input pure squeezed states is [30]

P(\boldsymbol{\alpha},\boldsymbol{\beta})=\prod_{j}C_{j}e^{-(\alpha_{j}^{2}+\beta_{j}^{2})(\gamma_{j}^{-1}+1/2)+\alpha_{j}\beta_{j}}, (22)

where C_{j}=\sqrt{1+\gamma_{j}}/(\pi\gamma_{j}) is a normalization constant and \gamma_{j}=e^{2r_{j}}-1.

Output samples are then readily obtained by transforming the input coherent amplitudes as \boldsymbol{\alpha}^{\prime}=\boldsymbol{T}\boldsymbol{\alpha}, \boldsymbol{\beta}^{\prime}=\boldsymbol{T}^{*}\boldsymbol{\beta}, which corresponds to sampling from the output density matrix

\hat{\rho}^{(\text{out})}=\text{Re}\int\int P(\boldsymbol{\alpha},\boldsymbol{\beta})\hat{\Lambda}(\boldsymbol{\alpha}^{\prime},\boldsymbol{\beta}^{\prime})\text{d}\boldsymbol{\alpha}\,\text{d}\boldsymbol{\beta}. (23)

IV.2 Grouped count probabilities

For GBS with threshold detectors, the number of possible binary patterns obtained from an experiment is \approx 2^{M}, with each pattern having a probability of roughly \left\langle\hat{\Pi}(\boldsymbol{c})\right\rangle\approx 2^{-M}. Samples from large scale networks are thus exponentially sparse, requiring binning to obtain meaningful statistics. It is therefore necessary to define a grouped count observable that corresponds to an experimentally measurable quantity.

The most general observable of interest is a d-dimensional grouped count probability (GCP), defined as [30]

\mathcal{G}_{\boldsymbol{S}}^{(n)}(\boldsymbol{m})=\left\langle\prod_{j=1}^{d}\left[\sum_{\sum c_{i}=m_{j}}\hat{\Pi}_{S_{j}}(\boldsymbol{c})\right]\right\rangle, (24)

which is the probability of observing \boldsymbol{m}=(m_{1},\dots,m_{d}) grouped counts of output modes \boldsymbol{M}=(M_{1},M_{2},\dots) contained within disjoint subsets \boldsymbol{S}=(S_{1},S_{2},\dots). Here, similar to Glauber's original definition from intensity correlations [83], n=\sum_{j=1}^{d}M_{j}\leq M is the GCP correlation order.

Each grouped count m_{j} is obtained by summing over binary patterns \boldsymbol{c} for modes contained within a subset S_{j}. For example, a d=1 dimensional GCP, typically called total counts, generates grouped counts as m_{1}=\sum_{i}^{M}c_{i}, whilst d>1 gives \boldsymbol{m}=(m_{1}=\sum_{i=1}^{M/d}c_{i},\dots,m_{d}=\sum_{i=M/d+1}^{M}c_{i}). This definition also includes more traditional low-order marginal count probabilities and moments.
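The binning just described can be estimated directly from a set of binary click patterns. The following sketch is our own (assuming equal-size subsets) and returns the normalized histogram over grouped counts:

```python
import numpy as np

def grouped_count_probability(patterns, d):
    """Estimate a d-dimensional GCP (Eq. 24) from binary click patterns.
    Each row of `patterns` is one M-mode pattern; the modes are split
    into d equal subsets and the clicks in each subset are summed."""
    patterns = np.asarray(patterns)
    n_samples, M = patterns.shape
    subsets = np.array_split(np.arange(M), d)
    counts = np.zeros(tuple(len(s) + 1 for s in subsets))
    for pattern in patterns:
        m = tuple(int(pattern[s].sum()) for s in subsets)
        counts[m] += 1
    return counts / n_samples
```

For d=1 this reduces to the total count distribution, while larger d gives the multi-dimensional histograms used for fine-grained comparisons.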

Although a variety of observables can be generated using GCPs, such as the marginal probabilities commonly used by spoofing algorithms [23, 24], multi-dimensional GCPs are particularly useful for comparisons with experiment. The increased dimension causes the number of grouped counts to grow, e.g. \boldsymbol{m}=(m_{1},m_{2}) for d=2, which in turn increases the number of count bins (data points) available for comparison, providing a fine-grained test of the results. This also causes the effects of higher-order correlations to become more statistically significant in the data [32].

One of the most useful applications arises from randomly permuting the modes within each subset S_{j}, which results in a different grouped count for each permutation. The number of possible permutation tests scales as [32]

\frac{1}{d}\binom{M}{M/d}=\frac{M!}{d(M/d)!(M-M/d)!}, (25)

where different correlations are compared in each comparison test. This leads to a very large number of distinct tests, making good use of the available experimental data.

Despite these advantages, caution must be taken when comparing very high dimensional GCPs, since the increased number of bins means that there are fewer counts per bin. This causes the experimental sampling errors to increase, reducing the significance of statistical tests [32].

V Computational phase-space methods

To simulate GCPs in phase-space, input stochastic amplitudes \boldsymbol{\alpha}, \boldsymbol{\beta} corresponding to the phase-space distribution of the input state \hat{\rho}^{(\text{in})} must first be generated. However, although pure squeezed states are the theoretical 'gold standard' of GBS, in practice they are challenging to create in a laboratory, and decoherence inevitably arises from imperfect equipment.

V.1 Decoherent input states

A more realistic model is one that includes decoherence in the input state. Such a model was proposed in [30] and assumes the intensity of the input light is weakened by a factor of 1-\epsilon whilst adding n_{j}^{\text{th}}=\epsilon n_{j} thermal photons per mode. The number of input photons per mode remains unchanged from the pure squeezed state definition, but the photon coherence is now reduced to \tilde{m}_{j}=(1-\epsilon)m_{j}. To account for possible measurement errors, a correction factor t is also applied to the transmission matrix to improve the fit to the simulated distribution.

The advantage of this decoherence model, which is a type of thermal squeezed state [104, 47], is that it allows a simple method for generating phase-space samples for any Gaussian state following

\alpha_{j}=\frac{1}{2}\left(\Delta_{x_{j}}w_{j}+i\Delta_{y_{j}}w_{j+M}\right),\qquad\beta_{j}=\frac{1}{2}\left(\Delta_{x_{j}}w_{j}-i\Delta_{y_{j}}w_{j+M}\right), (26)

where \left\langle w_{j}w_{k}\right\rangle=\delta_{jk} are real Gaussian noises that model quantum fluctuations in each quadrature and

\Delta_{x_{j}}^{2}=2(n_{j}+\tilde{m}_{j}),\qquad\Delta_{y_{j}}^{2}=2(n_{j}-\tilde{m}_{j}), (27)

are the thermal squeezed state quadrature variances.

As \epsilon\rightarrow 1 the inputs become classical, since \Delta_{x_{j}}^{2},\,\Delta_{y_{j}}^{2}\geq 0, but small amounts of thermalization leave the inputs non-classical, which is signaled, for a suitable squeezing orientation, by a negative phase-space variance such as \Delta_{x_{j}}^{2}<0. So long as the state is Gaussian, one can model a variety of inputs, both classical and non-classical, by simply varying \epsilon, where \epsilon=0 corresponds to a pure squeezed state and \epsilon=1 to a thermal state.

The input stochastic samples are also straightforwardly extended to non-normally ordered representations, as outlined in detail in [30, 31, 32].
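A minimal sketch of the sampling procedure of Eqs. (26)-(27) follows; the function name is ours and the code is illustrative, not the xQSim implementation. Note that for non-classical inputs one phase-space variance is negative, so the complex square root automatically produces \boldsymbol{\beta}\neq\boldsymbol{\alpha}^{*}:

```python
import numpy as np

def thermal_squeezed_samples(r, eps, n_ens, rng):
    """Positive-P input samples for thermalized squeezed states,
    Eqs. (26)-(27). `r` are squeezing parameters, `eps` the decoherence
    fraction; eps=0 gives pure squeezed states, eps=1 thermal states."""
    r = np.atleast_1d(np.asarray(r, float))
    M = r.size
    n = np.sinh(r) ** 2                            # photons per mode
    m_tilde = (1 - eps) * np.sinh(r) * np.cosh(r)  # decohered coherence
    dx = np.sqrt(2 * (n + m_tilde) + 0j)           # Delta_x
    dy = np.sqrt(2 * (n - m_tilde) + 0j)           # Delta_y, imaginary
    w = rng.normal(size=(n_ens, 2 * M))            # if non-classical
    alpha = 0.5 * (dx * w[:, :M] + 1j * dy * w[:, M:])
    beta = 0.5 * (dx * w[:, :M] - 1j * dy * w[:, M:])
    return alpha, beta
```

A quick check of the sampler is the moment relation of Eq. (20): the ensemble average of \beta_{j}\alpha_{j} should reproduce the mean photon number n_{j}.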

The summation over binary patterns can now be performed efficiently in phase-space using the multi-dimensional inverse discrete Fourier transform [30]

\mathcal{G}_{\boldsymbol{S}}^{(n)}\left(\boldsymbol{m}\right)=\frac{1}{\prod_{j}(M_{j}+1)}\sum_{\boldsymbol{k}}\tilde{\mathcal{G}}_{\boldsymbol{S}}^{(n)}(\boldsymbol{k})e^{i\sum_{j}k_{j}\theta_{j}m_{j}}, (28)

where the Fourier observable simulates all possible correlations generated in an experimental network and is defined as

\mathcal{\tilde{G}}_{\boldsymbol{S}}^{(n)}\left(\boldsymbol{k}\right)=\left\langle\prod_{j=1}^{d}\bigotimes_{i\in S_{j}}\left(\pi_{i}(0)+\pi_{i}(1)e^{-ik_{j}\theta_{j}}\right)\right\rangle_{P}. (29)

Here, \theta_{j}=2\pi/(M_{j}+1), k_{j}=0,\dots,M_{j}, and \pi_{j}(c_{j})=e^{-n^{\prime}_{j}}(e^{n^{\prime}_{j}}-1)^{c_{j}} is the positive-P click observable, obtained from the equivalence Eq. (20), where n^{\prime}_{j}=\alpha^{\prime}_{j}\beta^{\prime}_{j} is the output photon number.

This simulation method is highly scalable, with most observables taking a few minutes to simulate on a desktop computer at current experimental scales. To highlight this scalability, much larger simulations with network sizes of up to M=16,000 modes have also been performed [30].
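To make the Fourier method concrete, the following is a minimal d=1 (total counts) sketch of Eqs. (28)-(29) for threshold detectors. It is our own illustrative code rather than the xQSim implementation, and it assumes the output amplitudes are supplied as arrays of shape (ensembles, modes):

```python
import numpy as np

def total_count_distribution(alpha_out, beta_out):
    """d=1 GCP (total counts) from positive-P output amplitudes via the
    Fourier method of Eqs. (28)-(29) with threshold detectors."""
    n_out = alpha_out * beta_out        # output photon numbers n'_j
    pi0 = np.exp(-n_out)                # no-click observable pi_j(0)
    pi1 = 1.0 - pi0                     # click observable pi_j(1)
    M = n_out.shape[1]
    theta = 2 * np.pi / (M + 1)
    k = np.arange(M + 1)
    # Fourier observable, Eq. (29), averaged over the ensemble
    G_tilde = np.array([
        np.mean(np.prod(pi0 + pi1 * np.exp(-1j * kk * theta), axis=1))
        for kk in k
    ])
    # Inverse DFT, Eq. (28)
    m = np.arange(M + 1)
    return np.real(np.exp(1j * theta * np.outer(m, k)) @ G_tilde) / (M + 1)
```

As a sanity check, a single classical sample with equal click probability p per mode must return the binomial distribution over total counts.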

V.2 Phase-space classical sampler

If classical states are input into a GBS network, the network can be efficiently simulated using the diagonal P-representation, which arises as a special case of the positive P-representation when P(\boldsymbol{\alpha},\boldsymbol{\beta})=P(\boldsymbol{\alpha})\delta(\boldsymbol{\alpha}^{*}-\boldsymbol{\beta}). Initial stochastic samples are therefore still generated using Eq. (26), except that one has now rotated to a classical phase-space with \boldsymbol{\beta}=\boldsymbol{\alpha}^{*}.

Similar to non-classical states, the input density matrix for any classical state is defined as

\hat{\rho}^{(\text{in})}=\int P(\boldsymbol{\alpha})\left|\boldsymbol{\alpha}\right\rangle\left\langle\boldsymbol{\alpha}\right|\text{d}^{2}\boldsymbol{\alpha}, (30)

where the form of the distribution P(\boldsymbol{\alpha}) changes depending on the state. Input amplitudes are again transformed to outputs following \boldsymbol{\alpha}^{\prime}=\boldsymbol{T}\boldsymbol{\alpha} and are used to define the output state

\hat{\rho}^{(\text{out})}=\int P(\boldsymbol{\alpha})\left|\boldsymbol{\alpha}^{\prime}\right\rangle\left\langle\boldsymbol{\alpha}^{\prime}\right|\text{d}^{2}\boldsymbol{\alpha}. (31)

Using the output coherent amplitudes, one can now efficiently generate binary patterns corresponding to any classical state input into a linear network. To conserve the simulated counts for each ensemble, corresponding to a single experimental shot, the j-th output mode of the k-th ensemble is independently and randomly sampled via the Bernoulli distribution [32]

P_{j}^{(k)}(c_{j}^{(\text{class})})=(p_{j}^{(k)})^{c_{j}^{(\text{class})}}(1-p_{j}^{(k)})^{1-c_{j}^{(\text{class})}}. (32)

Here, c_{j}^{(\text{class})}=0,1 is the classically generated bit of the j-th mode, where the probability of c_{j}^{(\text{class})}=1, the 'success' probability, is

p_{j}^{(k)}=(\pi_{j}(1))^{(k)}=\left(1-e^{-n^{\prime}_{j}}\right)^{(k)}. (33)

This is simply the click probability of the k-th stochastic ensemble with an output photon number of n^{\prime}_{j}=\left|\alpha^{\prime}_{j}\right|^{2}.
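The Bernoulli sampling of Eqs. (32)-(33) can be sketched as follows; the naming is ours, and each row of the amplitude array is one stochastic ensemble, i.e. one experimental shot:

```python
import numpy as np

def classical_fake_patterns(alpha_out, rng):
    """Binary click patterns from classical (diagonal-P) output
    amplitudes via Bernoulli sampling, Eqs. (32)-(33)."""
    n_out = np.abs(alpha_out) ** 2      # output photon numbers n'_j
    p_click = 1.0 - np.exp(-n_out)      # success probability, Eq. (33)
    return (rng.random(p_click.shape) < p_click).astype(int)
```

Vacuum outputs never click and very bright outputs click almost surely, which gives a simple consistency check of the sampler.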

For an M-mode network, each stochastic ensemble outputs a classical faked pattern of the form

(\boldsymbol{c}^{(\text{class})})^{(k)}=[c_{1}^{(k)},c_{2}^{(k)},\dots,c_{M}^{(k)}],\quad c_{j}^{(k)}\sim P_{j}^{(k)}, (34)

which are binned to obtain GCPs, denoted 𝒢(class)\mathcal{G}^{(\text{class})}, that can be compared to simulated distributions.

In the limit of a large number of patterns, binned classical fakes are approximately equal to the simulated output distribution corresponding to the density matrix of Eq. (31). An example of this can be seen in Fig. 3, where 4\times 10^{6} binary patterns are generated by sending N=20 thermal states into an M=N mode lossless linear network represented by a Haar random unitary. The binned classical patterns produce a total count distribution that agrees with simulations within sampling error, because for this classical input example there is no quantum advantage.

Figure 3: Comparisons of total counts, the probability of m counts regardless of pattern, for 4\times 10^{6} classical binary patterns generated from thermal states with \boldsymbol{r}=[1,\dots,1] input into a 20\times 20 Haar random unitary matrix \boldsymbol{U} with transmission coefficient t=0.5. a) The positive-P phase-space simulated output distribution, obtained using E_{S}=1.2\times 10^{6} ensembles, for thermal states sent into a linear network (solid blue line), compared against the distribution produced by the binned thermal fake patterns (orange dashed line). b) Comparison of the normalized difference \Delta\mathcal{G}^{(M)}(\boldsymbol{m})/\sigma_{m}=(\bar{\mathcal{G}}-\mathcal{G}^{(\text{class})})/\sigma_{m}, where \sigma_{m} is the sum over compared grouped count variances (following from Eq. (35)), and \mathcal{G}^{(\text{class})} represents a classically simulated, discrete count probability. Upper and lower lines correspond to one standard deviation of theoretical phase-space sampling errors, while the error bars correspond to the sampling errors of the simulated "fake" counts. The results are cut off for photon count bins containing fewer than 10 counts. The expected good agreement becomes clearer when Z-score statistical tests are performed, giving Z^{(\text{class})}\approx 1. In this example, since the input is classical there is no quantum advantage, and the classical sampler is exact, as expected.

V.3 Statistical tests

For comparisons to be meaningful, statistical tests are needed to quantify differences between experiment and theory. To this end, Refs. [30, 31] implement a slightly altered version of a standard chi-square test, which is frequently used in random number generator validation [105, 106], and is defined as

\chi_{ES}^{2}=\sum_{i=1}^{k}\frac{\left(\mathcal{G}_{i,E}-\bar{\mathcal{G}}_{i,S}\right)^{2}}{\sigma_{i}^{2}}. (35)

Here, k is the number of valid photon count bins, which we define as bins with more than 10 counts, and \mathcal{\bar{G}}_{i,S} is the phase-space simulated GCP ensemble mean for any input state S, which converges to the true theoretical GCP, \mathcal{G}_{i,S}, in the limit E_{S}\rightarrow\infty. The experimental GCP is denoted \mathcal{G}_{i,E}, and \sigma_{i}^{2}=\sigma_{T,i}^{2}+\sigma_{E,i}^{2} is the sum of the experimental, \sigma_{E,i}^{2}, and theoretical, \sigma_{T,i}^{2}, sampling errors of the i-th bin.
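The test of Eq. (35) is simple to implement once the binned GCPs and their sampling variances are available; this is an illustrative sketch with our own naming:

```python
import numpy as np

def chi_square_statistic(G_exp, G_sim, var_exp, var_sim):
    """Chi-square comparison of Eq. (35) between experimental and
    simulated GCPs; the per-bin variances from both sources are summed.
    Returns the statistic and the number of bins k."""
    G_exp, G_sim = np.asarray(G_exp, float), np.asarray(G_sim, float)
    sigma2 = np.asarray(var_exp, float) + np.asarray(var_sim, float)
    return np.sum((G_exp - G_sim) ** 2 / sigma2), G_exp.size
```

In practice the inputs would first be restricted to the valid bins, those with more than 10 counts, as defined above.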

The chi-square test result follows a chi-square distribution that converges to a normal distribution as $k\rightarrow\infty$, due to the central limit theorem [107, 108]. One can use this to introduce a second test that determines how many standard deviations the result lies from its mean, giving the probability of observing the output $\chi_{ES}^{2}$ value using standard probability theory. For example, an output of $6\sigma$, where $\sigma$ is the standard deviation of the normal distribution, indicates the data has a very small probability of being observed.

To do this, Ref. [32] implemented the Z-score, or Z-statistic, test, which is defined as

Z_{ES}=\frac{\left(\chi_{ES}^{2}/k\right)^{1/3}-\left(1-2/(9k)\right)}{\sqrt{2/(9k)}}, \qquad (36)

where $\left(\chi_{ES}^{2}/k\right)^{1/3}$ is the Wilson-Hilferty (WH) transformed chi-square statistic [107], which converges faster to a normal distribution for $k\geq 10$ [107, 108], and $\mu=1-\sigma^{2}$, $\sigma^{2}=2/(9k)$ are the corresponding mean and variance of the normal distribution.
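Eq. (36) can be sketched in a few lines; again, the function name is ours, introduced only for illustration:

```python
import math

def z_statistic(chi2, k):
    """Eq. (36): Z-score of a chi-square result via the Wilson-Hilferty transform."""
    var = 2.0 / (9.0 * k)       # variance of the WH-transformed statistic
    mean = 1.0 - var            # mu = 1 - sigma^2
    return ((chi2 / k) ** (1.0 / 3.0) - mean) / math.sqrt(var)
```

For $\chi^{2}/k = 1$ (perfect agreement within sampling errors), the numerator reduces to $\sigma^{2}$, so the Z-score is $\sqrt{2/(9k)} \ll 1$, consistent with the criterion $Z\approx 1$ for valid samples discussed below.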

The Z-statistic allows one to determine the probability of the observed count patterns. A result of $Z_{ES}>6$ would indicate that the experimental distributions are so far from the simulated results that the patterns may display non-random behavior. For this reason, and to present a unified comparison notation, we compute the Z-statistic of the chi-square results presented in [30] using Eq. (36).

Valid experimental results with correct distributions should have $\chi_{EI}^{2}/k\approx 1$, where the subscript $I$ denotes the ideal GBS distribution, and hence $Z_{EI}\approx 1$. For claims of quantum advantage, one must simultaneously prove that $Z_{CI}\gg 1$, where the subscript $C$ denotes binary patterns from the best classical fake, such as the diagonal-P method described above. This is the "gold standard": it would show that, within sampling errors, the experimental samples are valid and closer to the ideal distribution than any classical fake.

In the more realistic scenario of thermalized squeezed inputs, one may still have quantum advantage if $Z_{ET}\approx 1$ while $Z_{CT}\gg 1$. Therefore, these four observables are of the most interest for comparisons of theory and experiment, and are given below.

V.4 Comparisons with experiment

Throughout this section, we primarily review comparisons from Jiuzhang 1.0, for purposes of illustration [30]. A thorough comparison of all data sets obtained in Jiuzhang 2.0 is presented elsewhere [32].

We first review comparisons of total counts, the probability of observing $m$ clicks in any pattern, which is usually one of the first observables against which experimental samples are compared. This is because, in the limit of a large number of clicks, one can estimate the ideal distribution as a Gaussian via the central limit theorem.

To simulate the ideal total counts distribution in phase space using GCPs, we let $n=M$ and $\boldsymbol{S}=\{1,2,\dots,M\}$, giving $\mathcal{G}_{\{1,\dots,M\}}^{(M)}(m)$. For Jiuzhang 1.0, one obtains a Z-statistic of $Z_{EI}\approx 340$ for $k=63$ valid bins. Clearly, the experimental samples are far from the ideal, indicating that the photon count distributions are not what would be expected from an ideal GBS.
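On the experimental side, estimating the total-counts distribution from binary detection patterns is straightforward: the grouped count $m$ of a pattern is just its number of clicks. A minimal sketch, using random patterns as a stand-in for real detector data (the array names and the $0.3$ click probability are illustrative assumptions):

```python
import numpy as np

rng = np.random.default_rng(1)
M, N = 20, 100_000                      # modes, number of binary patterns
patterns = rng.random((N, M)) < 0.3     # stand-in for detector click patterns

m = patterns.sum(axis=1)                # total clicks per pattern
counts = np.bincount(m, minlength=M + 1)
total_counts_dist = counts / N          # estimate of G_{1..M}^{(M)}(m)
```

The resulting histogram is what would be compared, bin by bin, against the phase-space simulated GCP via Eqs. (35) and (36).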

Figure 4: a) Comparison of a one-dimensional planar slice through the maximum of a $d=2$ dimensional GCP $\mathcal{G}_{50,50}^{(100)}(m)$ for binary patterns from Jiuzhang 1.0 data (dashed orange line). Phase-space simulations (solid blue line) are performed for the ideal GBS, without decoherence, and use $E_{S}=1.2\times 10^{6}$ stochastic ensembles. b) Normalized difference between theory and experiment, $\Delta\mathcal{G}^{(M)}(\boldsymbol{m})/\sigma_{m}$, versus $m_{1}$, the grouped count for modes contained within the first subset $S_{1}$. Upper and lower solid black lines are $\pm 1\sigma_{T}$, with grouped count bins containing fewer than 10 counts cut off. Although the distributions are visually similar in a), the normalized difference shows that significant discrepancies are present (see Fig. (3) for definitions), and one can readily determine that the ideal GBS model is not validated for this set of experimental data, although the agreement is much better if a decoherent, thermalized model is used instead.

To determine whether these differences increase or decrease when higher-order correlations become more prevalent in the simulations, the dimension of the GCP is increased to $d=2$. In this case, Jiuzhang 1.0 sees an almost doubling of the Z value for comparisons with the simulated ideal distribution (see Fig. (4)), where $Z_{EI}\approx 647$ is obtained from $k=978$ valid bins [30]. The increase in the number of valid bins with GCP dimension causes the normal distribution of the WH transformed chi-square statistic to have a smaller variance. Compared to a single dimension, where $k=63$ gives $\sigma^{2}\approx 3.5\times 10^{-3}$, binning with $d>1$ causes the experimental samples to pass a more stringent test, as $k=978$ produces a normal distribution with variance $\sigma^{2}\approx 2.3\times 10^{-4}$.
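The quoted variances follow directly from $\sigma^{2}=2/(9k)$, as a quick numerical check confirms:

```python
# Variance of the WH-transformed statistic for the two bin counts quoted above.
for k in (63, 978):
    print(f"k = {k}: sigma^2 = 2/(9k) = {2 / (9 * k):.2e}")
```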

To simulate the more realistic scenario of thermalized squeezed states, a thermalized component of $\epsilon=0.0932\pm 0.0005$ and a transmission matrix correction of $t=1.0235\pm 0.0005$ are used to compare the simulated model to samples from Jiuzhang 1.0 [30]. In this case, an order of magnitude improvement in the resulting Z value is observed, with $Z_{ET}\approx 17\pm 2$.

Despite this significant improvement, the differences are still large enough that $Z_{ET}>1$. Similar results were also obtained for the data sets with the largest recorded photon counts in Jiuzhang 2.0, that is, the data sets claiming quantum advantage [32]. However, this is not the case for data sets with small numbers of recorded photons, which typically give $Z_{ET}\approx 1$ [32], although these experiments should be easily replicated by exact samplers. The large amount of apparent thermalization is the likely reason why simulated squashed states described Jiuzhang 1.0 samples just as well as the ideal GBS in Ref. [85].

When higher-order correlations are considered, samples from Jiuzhang 1.0 are far from the expected correlation moments of the ideal distribution. Although including the above fitting parameters improves this result to $Z_{ET}\approx 31$, the samples still deviate significantly from the theoretical thermalized distribution [30].

Unlike comparisons of total counts, samples from all data sets in Jiuzhang 2.0 satisfy $Z_{EI},Z_{ET}>1$ for simulations with $d>1$, meaning that higher-order correlations also display significant departures from the expected theoretical results, even for the simpler cases with low numbers of photon counts [32]. The reasons for this are not currently known.

VI Summary

In order to effectively validate GBS quantum computers, scalable methods are needed that capture the entire interference complexity of linear networks. The positive-P phase-space representation is the only currently known method that can simulate every measurable grouped output distribution with non-classical input states, allowing efficient comparisons of theory and experiment for data that is available on a reasonable time-scale.

One of the important issues here is the extraordinary relative sparseness of the experimental data, which makes it completely impossible to experimentally recover any reasonable estimate of the full distribution. Thus, while the full distribution is exponentially hard to compute, it is equally hard to measure. This means that comparisons of theory and experiment always involve some type of grouping or binning of the available data.

The next significant point is that one can both experimentally measure and theoretically compute the grouped distributions. This can be carried out theoretically with great precision using the positive-P phase-space simulation method, combined with a Fourier transform binning algorithm. These do not add greatly to the computational overhead, giving exact tests that are computable in just a few minutes, which is of great practical value.

The statistical tests employed in these comparisons are far more scalable than those implemented in many previous comparisons [15, 16, 17, 18, 23, 24, 25], as they do not require the computation of photon count pattern probabilities, which limits the comparisons that can be performed using such tests. Exact simulation using direct photon counts is impractical in the large-scale regime where quantum advantage is found.

Statistical testing shows that the GBS experiments tested are far from the ideal GBS output distributions, which are obtained by injecting pure squeezed vacuum state inputs into a linear network. The reason for the discrepancy is that some form of decoherence is inevitable in such large quantum systems, which makes the ideal standard a bar too high to ever be fully realized. A more reasonable goal is an output distribution obtained by including some decoherence in the inputs, although the amount of decoherence must be small enough that the input states remain non-classical.

In summary, the positive-P representation provides an effective, scalable method to validate quantum photonic network data. It is not limited to quantum computing applications such as GBS, as the theory presented here can be readily adapted to other optical networks and can include non-ideal features that occur in practical experiments. Having a target distribution that is non-ideal yet non-classical is the "Goldilocks" target calculation for these devices: a porridge which is neither too hot nor too cold.

Declarations

Availability of data and materials

Phase-space GBS simulation software xQSim can be downloaded from public domain repositories at [88].

Competing interests

The Authors declare no competing interests.

Funding

This research was funded through grants from NTT Phi Laboratories and the Australian Research Council Discovery Program.

Authors’ contributions

A.S.D wrote the original draft, created figures and performed numerical simulations. All authors contributed to reviewing and editing this manuscript.

References

  • Feynman [1982] R. P. Feynman, Int. J. Theor. Phys. 21, 467 (1982).
  • Cirac and Zoller [1995] J. I. Cirac and P. Zoller, Phys. Rev. Lett. 74, 4091 (1995).
  • Cirac and Zoller [2000] J. I. Cirac and P. Zoller, Nature 404, 579 (2000).
  • Knill et al. [2001] E. Knill, R. Laflamme, and G. J. Milburn, Nature 409, 46 (2001).
  • Devoret and Schoelkopf [2013] M. H. Devoret and R. J. Schoelkopf, Science 339, 1169 (2013).
  • Aaronson and Arkhipov [2011] S. Aaronson and A. Arkhipov, in Proceedings of the 43rd Annual ACM Symposium on Theory of Computing (ACM Press, 2011) pp. 333–342.
  • Hamilton et al. [2017] C. S. Hamilton, R. Kruse, L. Sansoni, S. Barkhofen, C. Silberhorn, and I. Jex, Phys. Rev. Lett. 119, 170501 (2017).
  • Quesada et al. [2018] N. Quesada, J. M. Arrazola, and N. Killoran, Physical Review A 98, 062322 (2018).
  • Yamamoto et al. [2017] Y. Yamamoto, K. Aihara, T. Leleu, K.-i. Kawarabayashi, S. Kako, M. Fejer, K. Inoue, and H. Takesue, npj Quantum Information 3, 1 (2017).
  • Kok et al. [2007] P. Kok et al., Rev. Mod. Phys. 79, 135 (2007).
  • McMahon et al. [2016] P. L. McMahon, A. Marandi, Y. Haribara, R. Hamerly, C. Langrock, S. Tamate, T. Inagaki, H. Takesue, S. Utsunomiya, K. Aihara, R. L. Byer, M. M. Fejer, H. Mabuchi, and Y. Yamamoto, Science 354, 614 (2016).
  • Honjo et al. [2021] T. Honjo, T. Sonobe, K. Inaba, T. Inagaki, T. Ikuta, Y. Yamada, T. Kazama, K. Enbutsu, T. Umeki, R. Kasahara, K.-i. Kawarabayashi, and H. Takesue, Science Advances 7, eabh0952 (2021).
  • Arute et al. [2019] F. Arute, K. Arya, R. Babbush, D. Bacon, J. C. Bardin, R. Barends, R. Biswas, S. Boixo, F. G. Brandao, D. A. Buell, et al., Nature 574, 505 (2019).
  • Wu et al. [2021] Y. Wu, W.-S. Bao, S. Cao, F. Chen, M.-C. Chen, X. Chen, T.-H. Chung, H. Deng, Y. Du, D. Fan, M. Gong, C. Guo, C. Guo, S. Guo, L. Han, L. Hong, H.-L. Huang, Y.-H. Huo, L. Li, N. Li, S. Li, Y. Li, F. Liang, C. Lin, J. Lin, H. Qian, D. Qiao, H. Rong, H. Su, L. Sun, L. Wang, S. Wang, D. Wu, Y. Xu, K. Yan, W. Yang, Y. Yang, Y. Ye, J. Yin, C. Ying, J. Yu, C. Zha, C. Zhang, H. Zhang, K. Zhang, Y. Zhang, H. Zhao, Y. Zhao, L. Zhou, Q. Zhu, C.-Y. Lu, C.-Z. Peng, X. Zhu, and J.-W. Pan, Phys. Rev. Lett. 127, 180501 (2021).
  • Zhong et al. [2020] H.-S. Zhong, H. Wang, Y.-H. Deng, M.-C. Chen, L.-C. Peng, Y.-H. Luo, J. Qin, D. Wu, X. Ding, Y. Hu, et al., Science 370, 1460 (2020).
  • Zhong et al. [2021] H.-S. Zhong, Y.-H. Deng, J. Qin, H. Wang, M.-C. Chen, L.-C. Peng, Y.-H. Luo, D. Wu, S.-Q. Gong, H. Su, Y. Hu, P. Hu, X.-Y. Yang, W.-J. Zhang, H. Li, Y. Li, X. Jiang, L. Gan, G. Yang, L. You, Z. Wang, L. Li, N.-L. Liu, J. J. Renema, C.-Y. Lu, and J.-W. Pan, Phys. Rev. Lett. 127, 180502 (2021).
  • Madsen et al. [2022] L. S. Madsen, F. Laudenbach, M. F. Askarani, F. Rortais, T. Vincent, J. F. F. Bulmer, F. M. Miatto, L. Neuhaus, L. G. Helt, M. J. Collins, A. E. Lita, T. Gerrits, S. W. Nam, V. D. Vaidya, M. Menotti, I. Dhand, Z. Vernon, N. Quesada, and J. Lavoie, Nature 606, 75 (2022).
  • Deng et al. [2023] Y.-H. Deng, Y.-C. Gu, H.-L. Liu, S.-Q. Gong, H. Su, Z.-J. Zhang, H.-Y. Tang, M.-H. Jia, J.-M. Xu, M.-C. Chen, H.-S. Zhong, J. Qin, H. Wang, L.-C. Peng, J. Yan, Y. Hu, J. Huang, H. Li, Y. Li, Y. Chen, X. Jiang, L. Gan, G. Yang, L. You, L. Li, N.-L. Liu, J. J. Renema, C.-Y. Lu, and J.-W. Pan, Gaussian Boson Sampling with Pseudo-Photon-Number Resolving Detectors and Quantum Computational Advantage (2023), arxiv:2304.12240 [quant-ph] .
  • Drummond and Gardiner [1980a] P. D. Drummond and C. W. Gardiner, Journal of Physics A: Mathematical and General 13, 2353 (1980a).
  • Quesada and Arrazola [2020] N. Quesada and J. M. Arrazola, Physical Review Research 2, 023005 (2020).
  • Quesada et al. [2022] N. Quesada, R. S. Chadwick, B. A. Bell, J. M. Arrazola, T. Vincent, H. Qi, and R. García-Patrón, PRX Quantum 3, 010306 (2022).
  • Bulmer et al. [2022] J. F. F. Bulmer, B. A. Bell, R. S. Chadwick, A. E. Jones, D. Moise, A. Rigazzi, J. Thorbecke, U.-U. Haus, T. Van Vaerenbergh, R. B. Patel, I. A. Walmsley, and A. Laing, Sci. Adv. 8, eabl9236 (2022).
  • Villalonga et al. [2021] B. Villalonga, M. Y. Niu, L. Li, H. Neven, J. C. Platt, V. N. Smelyanskiy, and S. Boixo, arXiv preprint arXiv:2109.11525  (2021).
  • Oh et al. [2022] C. Oh, L. Jiang, and B. Fefferman, Spoofing cross entropy measure in boson sampling (2022), arXiv:2210.15021 [quant-ph] .
  • Oh et al. [2023] C. Oh, M. Liu, Y. Alexeev, B. Fefferman, and L. Jiang, arXiv preprint arXiv:2306.03709  (2023).
  • Kiesewetter and Drummond [2022a] S. Kiesewetter and P. D. Drummond, Optics Letters 47, 649 (2022a).
  • Kiesewetter and Drummond [2022b] S. Kiesewetter and P. D. Drummond, Phys. Rev. A 106, 022409 (2022b).
  • Wang et al. [2013] Z. Wang, A. Marandi, K. Wen, R. L. Byer, and Y. Yamamoto, Physical Review A 88, 063853 (2013).
  • Yamamura et al. [2017] A. Yamamura, K. Aihara, and Y. Yamamoto, Physical Review A 96, 053834 (2017).
  • Drummond et al. [2022] P. D. Drummond, B. Opanchuk, A. Dellios, and M. D. Reid, Phys. Rev. A 105, 012427 (2022).
  • Dellios et al. [2022] A. Dellios, P. D. Drummond, B. Opanchuk, R. Y. Teh, and M. D. Reid, Physics Letters A 429, 127911 (2022).
  • Dellios et al. [2023] A. S. Dellios, B. Opanchuk, M. D. Reid, and P. D. Drummond, Validation tests for GBS quantum computers using grouped count probabilities (2023), arxiv:2211.03480 [math-ph, physics:quant-ph] .
  • Kruse et al. [2019] R. Kruse, C. S. Hamilton, L. Sansoni, S. Barkhofen, C. Silberhorn, and I. Jex, Physical Review A 100, 032326 (2019).
  • Rahimi-Keshari et al. [2016] S. Rahimi-Keshari, T. C. Ralph, and C. M. Caves, Phys. Rev. X 6, 021039 (2016).
  • Qi et al. [2020] H. Qi, D. J. Brod, N. Quesada, and R. García-Patrón, Physical review letters 124, 100502 (2020).
  • Aaronson [2011] S. Aaronson, Proceedings of the Royal Society of London A: Mathematical, Physical and Engineering Sciences 467, 3393 (2011).
  • Scheel [2004] S. Scheel, arXiv:quant-ph/0406127 (2004).
  • Broome et al. [2013] M. A. Broome, A. Fedrizzi, S. Rahimi-Keshari, J. Dove, S. Aaronson, T. C. Ralph, and A. G. White, Science 339, 794 (2013).
  • Spring et al. [2013] J. B. Spring, B. J. Metcalf, P. C. Humphreys, W. S. Kolthammer, X.-M. Jin, M. Barbieri, A. Datta, N. Thomas-Peter, N. K. Langford, D. Kundys, J. C. Gates, B. J. Smith, P. G. R. Smith, and I. A. Walmsley, Science 339, 798 (2013).
  • Wang et al. [2019] H. Wang, J. Qin, X. Ding, M.-C. Chen, S. Chen, X. You, Y.-M. He, X. Jiang, L. You, Z. Wang, C. Schneider, J. J. Renema, S. Höfling, C.-Y. Lu, and J.-W. Pan, Phys. Rev. Lett. 123, 250503 (2019).
  • Wu et al. [2018] J. Wu, Y. Liu, B. Zhang, X. Jin, Y. Wang, H. Wang, and X. Yang, National Science Review 5, 715 (2018).
  • Hangleiter and Eisert [2022] D. Hangleiter and J. Eisert, Computational advantage of quantum random sampling (2022), arXiv:2206.04079 [cond-mat, physics:quant-ph] .
  • Yuen [1976] H. P. Yuen, Physical Review A 13, 2226 (1976).
  • Stoler [1970] D. Stoler, Phys. Rev. D 1, 3217 (1970).
  • Stoler [1971] D. Stoler, Phys. Rev. D 4, 1925 (1971).
  • Walls [1983] D. F. Walls, nature 306, 141 (1983).
  • Kim et al. [1989] M. S. Kim, F. A. M. de Oliveira, and P. L. Knight, Physical Review A 40, 2494 (1989).
  • Drummond and Hillery [2014] P. D. Drummond and M. Hillery, The quantum theory of nonlinear optics (Cambridge University Press, 2014).
  • Drummond and Ficek [2004] P. D. Drummond and Z. Ficek, eds., Quantum Squeezing (Springer-Verlag, Berlin, Heidelberg, New York, 2004).
  • Vahlbruch et al. [2016] H. Vahlbruch, M. Mehmet, K. Danzmann, and R. Schnabel, Physical Review Letters 117, 110801 (2016).
  • Walschaers [2018] M. Walschaers, Statistical Benchmarks for Quantum Transport in Complex Systems: From Characterisation to Design, Springer Theses (Springer International Publishing, Cham, 2018).
  • García-Patrón et al. [2019] R. García-Patrón, J. J. Renema, and V. Shchesnovich, Quantum 3, 169 (2019), arXiv:1712.10037.
  • Shchesnovich [2021] V. Shchesnovich, Quantum 5, 423 (2021), arXiv:1905.11458 [quant-ph].
  • Walls and Milburn [2008] D. Walls and G. Milburn, Quantum Optics (Springer, 2008).
  • Hong et al. [1987] C. K. Hong, Z. Y. Ou, and L. Mandel, Phys. Rev. Lett. 59, 2044 (1987).
  • Reid [1989] M. D. Reid, Phys. Rev. A 40, 913 (1989).
  • Furusawa et al. [1998] A. Furusawa, J. L. Sørensen, S. L. Braunstein, C. A. Fuchs, H. J. Kimble, and E. S. Polzik, science 282, 706 (1998).
  • Teh et al. [2021] R. Y. Teh, M. Gessner, M. D. Reid, and M. Fadel, arXiv preprint arXiv:2108.06926  (2021).
  • Sperling et al. [2012] J. Sperling, W. Vogel, and G. S. Agarwal, Phys. Rev. A 85, 023820 (2012).
  • Achilles et al. [2003] D. Achilles, C. Silberhorn, C. Śliwa, K. Banaszek, and I. A. Walmsley, Opt. Lett. 28, 2387 (2003).
  • Fitch et al. [2003] M. J. Fitch, B. C. Jacobs, T. B. Pittman, and J. D. Franson, Phys. Rev. A 68, 043814 (2003).
  • Provazník et al. [2020] J. Provazník, L. Lachman, R. Filip, and P. Marek, Opt. Express 28, 14839 (2020).
  • Deshpande et al. [2022] A. Deshpande, A. Mehta, T. Vincent, N. Quesada, M. Hinsche, M. Ioannou, L. Madsen, J. Lavoie, H. Qi, J. Eisert, D. Hangleiter, B. Fefferman, and I. Dhand, Sci. Adv. 8, eabi7894 (2022).
  • Pashayan et al. [2020] H. Pashayan, S. D. Bartlett, and D. Gross, Quantum 4, 223 (2020).
  • Chabaud et al. [2021] U. Chabaud, G. Ferrini, F. Grosshans, and D. Markham, Phys. Rev. Research 3, 033018 (2021).
  • Chabaud and Walschaers [2023] U. Chabaud and M. Walschaers, Phys. Rev. Lett. 130, 090602 (2023).
  • Gardiner and Zoller [2000] C. W. Gardiner and P. Zoller, Quantum Noise, 2nd ed. (Springer, Berlin, 2000).
  • Drummond et al. [2002] P. D. Drummond, K. Dechoum, and S. Chaturvedi, Phys. Rev. A 65, 033806 (2002).
  • Deuar and Drummond [2007] P. Deuar and P. D. Drummond, Physical Review Letters 98, 120402 (2007).
  • Drummond et al. [2014] P. D. Drummond, B. Opanchuk, L. Rosales-Zárate, and M. D. Reid, Phys. Scr. 2014, 014009 (2014).
  • Drummond and Chaturvedi [2016] P. D. Drummond and S. Chaturvedi, Physica Scripta 91, 073007 (2016).
  • Titulaer and Glauber [1965] U. M. Titulaer and R. J. Glauber, Physical Review 140, B676 (1965).
  • Reid and Walls [1986] M. D. Reid and D. F. Walls, Phys. Rev. A 34, 1260 (1986).
  • Graham [1973] R. Graham, in Springer Tracts in Modern Physics (Springer Berlin Heidelberg, 1973) pp. 1–97.
  • Steel et al. [1998] M. J. Steel, M. K. Olsen, L. I. Plimak, P. D. Drummond, S. M. Tan, M. J. Collett, D. F. Walls, and R. Graham, Phys. Rev. A 58, 4824 (1998).
  • Opanchuk et al. [2014] B. Opanchuk, L. Arnaud, and M. D. Reid, Phys. Rev. A 89, 062101 (2014).
  • Ng et al. [2019a] K. L. Ng, B. Opanchuk, M. D. Reid, and P. D. Drummond, Physical review letters 122, 203604 (2019a).
  • Ng et al. [2019b] K. L. Ng, R. Polkinghorne, B. Opanchuk, and P. D. Drummond, Journal of Physics A Mathematical General 52 (2019b).
  • Opanchuk et al. [2019] B. Opanchuk, L. Rosales-Zárate, R. Y. Teh, B. J. Dalton, A. Sidorov, P. D. Drummond, and M. D. Reid, Phys. Rev. A 100, 060102 (2019).
  • Lachman and Filip [2022] L. Lachman and R. Filip, Progress in Quantum Electronics 83, 100395 (2022).
  • Boixo et al. [2018] S. Boixo, S. V. Isakov, V. N. Smelyanskiy, R. Babbush, N. Ding, Z. Jiang, M. J. Bremner, J. M. Martinis, and H. Neven, Nature Physics 14, 595 (2018).
  • Aaronson and Gunn [2020] S. Aaronson and S. Gunn, On the Classical Hardness of Spoofing Linear Cross-Entropy Benchmarking (2020), arxiv:1910.12085 [quant-ph] .
  • Glauber [1963] R. J. Glauber, Phys. Rev. 130, 2529 (1963).
  • Hillery et al. [1984] M. Hillery, R. F. O’Connell, M. O. Scully, and E. P. Wigner, Phys. Rep. 106, 121 (1984).
  • Martínez-Cifuentes et al. [2022] J. Martínez-Cifuentes, K. M. Fonseca-Romero, and N. Quesada, Classical models are a better explanation of the Jiuzhang Gaussian Boson Samplers than their targeted squeezed light models (2022), arXiv:2207.10058 [quant-ph] .
  • Moran [1968] P. A. P. Moran, An introduction to Probability Theory (Oxford, Clarendon Press, 1968).
  • Shchesnovich [2022] V. Shchesnovich, Boson sampling cannot be faithfully simulated by only the lower-order multi-boson interferences (2022), arxiv:2204.07792 [math-ph, physics:quant-ph] .
  • Drummond and Dellios [2023] P. D. Drummond and A. S. Dellios, GitHub - peterddrummond/xqsim: Quantum network simulations in phase space, https://github.com/peterddrummond/xqsim (2023).
  • Wigner [1932] E. Wigner, Phys. Rev. 40, 749 (1932).
  • Carmichael [2002] H. J. Carmichael, Statistical methods in quantum optics 1. Master Equations and Fokker-Planck Equations. (Springer, Berlin, 2002).
  • Moyal [1949] J. E. Moyal, Mathematical Proceedings of the Cambridge Philosophical Society 45, 99 (1949).
  • Gardiner and Zoller [2004] C. Gardiner and P. Zoller, Quantum Noise: A Handbook of Markovian and Non-Markovian Quantum Stochastic Methods with Applications to Quantum Optics, Springer Series in Synergetics (Springer, 2004).
  • Gardiner [2004] C. Gardiner, Handbook of Stochastic Methods for Physics, Chemistry, and the Natural Sciences, Springer complexity (Springer, 2004).
  • Drummond et al. [1981] P. D. Drummond, C. W. Gardiner, and D. F. Walls, Phys. Rev. A 24, 914 (1981).
  • Rosales-Zárate et al. [2014] L. Rosales-Zárate, B. Opanchuk, P. D. Drummond, and M. D. Reid, Phys. Rev. A 90, 022109 (2014).
  • Teh et al. [2020a] R. Y. Teh, P. D. Drummond, and M. D. Reid, Physical Review Research 2, 043387 (2020a).
  • Teh et al. [2020b] R. Y. Teh, F.-X. Sun, R. Polkinghorne, Q. Y. He, Q. Gong, P. D. Drummond, and M. D. Reid, Physical Review A 101, 043807 (2020b).
  • Rodkina and Kelly [2010] A. Rodkina and C. Kelly (2010) pp. 1517–1520.
  • Kloeden and Platen [1992] P. E. Kloeden and E. Platen, in Numerical Solution of Stochastic Differential Equations (Springer Berlin Heidelberg, Berlin, Heidelberg, 1992) pp. 103–160.
  • Drummond and Gardiner [1980b] P. D. Drummond and C. W. Gardiner, J. Phys. A 13, 2353 (1980b).
  • Opanchuk et al. [2018] B. Opanchuk, L. Rosales-Zárate, M. D. Reid, and P. D. Drummond, Physical Review A 97, 042304 (2018).
  • Drummond and Opanchuk [2020] P. D. Drummond and B. Opanchuk, Physical Review Research 2, 033304 (2020).
  • Adam et al. [1994] P. Adam, I. Földesi, and J. Janszky, Physical Review A 49, 1281 (1994).
  • Fearn and Collett [1988] H. Fearn and M. Collett, Journal of Modern Optics 35, 553 (1988).
  • Rukhin et al. [2010] A. L. Rukhin, J. Soto, J. R. Nechvatal, M. E. Smid, E. B. Barker, S. D. Leigh, M. Levenson, M. Vangel, D. L. Banks, et al., A statistical test suite for random and pseudorandom number generators for cryptographic applications (2010).
  • Knuth [2014] D. E. Knuth, Art of computer programming, volume 2: Seminumerical algorithms (Addison-Wesley Professional, 2014).
  • Wilson and Hilferty [1931] E. B. Wilson and M. M. Hilferty, Proc. Natl. Acad. Sci. U.S.A. 17, 684 (1931).
  • Johnson [1970] N. L. Johnson, Continuous Univariate Distributions, Houghton Mifflin Series in Statistics (Houghton Mifflin, Boston, 1970).