This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Quantum Generative Adversarial Networks in a Silicon Photonic Chip with Maximum Expressibility

Haoran Ma College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China Liao Ye College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China Fanjie Ruan College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China Zichao Zhao College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China Maohui Li College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China Yuehai Wang College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China Jianyi Yang [email protected] College of Information Science and Electronic Engineering, Zhejiang University, Hangzhou 310027, China
Abstract

Generative adversarial networks (GANs) have achieved remarkable success with realistic tasks such as creating realistic images, texts, and audio. Combining GANs and quantum computing, quantum GANs are thought to have an exponential advantage over their classical counterparts due to the stronger expressibility of quantum circuits. In this research, a two-qubit silicon quantum photonic chip is created, capable of executing arbitrary controlled-unitary (CU^C\hat{U}) operations and generating any 2-qubit pure state, thus making it an excellent platform for quantum GANs. To capture complex data patterns, a hybrid generator is proposed to inject nonlinearity into quantum GANs. As a demonstration, three generative tasks, covering both pure quantum versions of GANs (PQ-GAN) and hybrid quantum-classical GANs (HQC-GANs), are successfully carried out on the chip, including high-fidelity single-qubit state learning, classical distributions loading, and compressed image production. The experiment results prove that silicon quantum photonic chips have great potential in generative learning applications.

1 Introduction

Quantum computing is one of the most fascinating computing paradigms today because of its potential for exponential speedup compared to classical computers in certain tasks. For example, Shor’s algorithm can factor enormous numbers in polynomial time[1], whereas the HHL algorithm can considerably accelerate the solution of linear systems[2]. However, these algorithms require a quantum computer that is error resilient. Despite recent breakthroughs[3], quantum devices are not yet capable of running these algorithms for practical applications. To utilize restricted quantum resources, the hybrid quantum-classical (HQC) framework has been proposed[4, 5, 6]. The main idea of HQC is to split tasks into two parts: one executed efficiently on quantum devices and the other completed on classical computers[6]. Therefore, the computational burden on quantum computers is reduced. Quantum machine learning (QML) is one of the beneficiaries of HQC, in which quantum devices are outfitted with parameterized quantum gates with trainable parameters, known as the parametrized quantum circuit (PQC), and a classical computer uses the measured results to adapt these parameters for learning. Both theoretical and experimental research indicates that QML may outperform its traditional counterpart[7, 8, 9, 10].

Quantum GANs, a breakthrough method in QML, have gained extensive interest for their potential to exhibit quantum advantage on near-term quantum devices[11]. This method is made up of two models: the generator (G) and the discriminator (D), who are engaged in a minimax game in which D learns to discriminate between real and generated data while G strives to produce data that tricks D. Finally, G learns how to capture the distribution of the training dataset and create new data. Quantum GANs can be employed for generating both quantum states[12, 13, 14, 15, 16, 17] and classical data[18, 19, 20, 21, 22, 23, 24, 25, 26, 27]. In the case of generating quantum states, both G and D must be quantum, resulting in PQ-GAN. When generating classical data, either G or D is quantum in most cases, leading to HQC-GAN. In this paper, we show both forms of quantum GANs on a programmable silicon quantum photonic chip.

Silicon quantum photonic is a promising quantum computing platform that uses single photons as quantum information carriers, employs optical waveguides to guide and route photons, and utilizes Mach-Zehnder interferometers (MZIs) to manipulate the photonic qubit, offering phase-stable quantum circuits, high-precision quantum operations, and scalability to large-scale implementations[28, 29]. Recently, silicon quantum photonic chips have been employed to demonstrate quantum GANs. Ref. [17] demonstrates the fault-tolerance of PQ-GANs in photonics by producing high-quality quantum states despite chip noise and imperfections. However, their generator is only capable of creating two-photon maximum entangled states, limiting the expressive power required for generative models. Ref. [27] adopts HQC-GAN as a submodule for financial applications, where the quantum GAN aims to load the financial data into the quantum state of only one photon. Yet, loading data into multi-photon quantum states may be more meaningful, as entanglement is important for realizing quantum advantage[30]. Therefore, designing a chip capable of generating arbitrary multi-photon quantum state is crucial for quantum GANs.

In this work, we suggest a scheme to produce on-chip arbitrary two-qubit pure states by combining asymetrical MZIs (AMZIs), frequency post-selection, and CU^C\hat{U} operation. As a demonstration, a silicon photonic chip is fabricated and used to perform three generative tasks, including both PQ-GAN and HQC-GAN. Compared with Ref. [17], our quantum photonic generator is maximally expressive. In addition, a hybrid generator consisting of classical and quantum layers is proposed to introduce nonlinearity into quantum GANs, thereby capturing more complex data features than the pure quantum version. We start by demonstrating a PQ-GAN, training the device to produce single-qubit pure and mixed states with fidelities as high as 99.41% and 98.39%, respectively. Then, we employ an HQC-GAN to undertake approximately distribution loading, where the data is loaded into the basis occurance probabilities of a two-photon quantum state rather than single-photon in Ref. [27]. Finally, we show how to generate compressed MNIST images with a modified HQC-GAN, where the proposed hybrid generator is utilized. To the best of our knowledge, this is the first proof-of-principle demonstration of quantum GANs to learn mixed quantum states and produce images using a silicon photonic chip.

2 Silicon Photonic Chip for Quantum GANs

2.1 The AMZI with Frequency Post-selection

AMZI is an indispensable component of our chip, whose combination leads to the generation of amplitude-adjustable entangled states, which is crucial for our scheme. In this subsection, we describe the transformation that AMZI applies to the quantum state of photon pairs under post-filtering, which results in the measured coincidence rate (CR) varying with the phase changes of AMZI. The relationship between CR and AMZI’s phase angle φ\varphi is theoretically derived and experimentally verified.

We begin with theoretical derivations. Figure 1a displays a typical AMZI, where signal/idler photon pairs at different wavelengths are injected from port aa and split to port cc and dd. The transmission matrix of the AMZI is

UAMZI\displaystyle U_{AMZI} =[1ii1][ei(βs/il+φ)001][1ii1]\displaystyle=\begin{bmatrix}1&i\\ i&1\end{bmatrix}\cdot\begin{bmatrix}e^{i(\beta_{s/i}\triangle l+\varphi)}&0\\ 0&1\end{bmatrix}\cdot\begin{bmatrix}1&i\\ i&1\end{bmatrix} (1)
=eiβs/il+φ+π2[sin((βs/il+φ)/2)cos((βs/il+φ)/2)cos((βs/il+φ)/2)sin((βs/il+φ)/2)]\displaystyle=e^{i\frac{\beta_{s/i}\triangle l+\varphi+\pi}{2}}\begin{bmatrix}sin((\beta_{s/i}\triangle l+\varphi)/2)&cos((\beta_{s/i}\triangle l+\varphi)/2)\\ cos((\beta_{s/i}\triangle l+\varphi)/2)&-sin((\beta_{s/i}\triangle l+\varphi)/2)\end{bmatrix}

where βs/i\beta_{s/i} is the transimission constant of the signal/idler photon and l\triangle l is the length difference of two arms. Assuming the free spectral range (FSR) of AMZI is designed to be twice the wavelength difference of photon pairs, we have |βslβil|=π|\beta_{s}\triangle l-\beta_{i}\triangle l|=\pi. Therefore, the transmission matrices of the signal and idler photons can be expressed as

Us=eiφ+π2[sin(φ/2)cos(φ/2)cos(φ/2)sin(φ/2)],Ui=eiφ+2π2[cos(φ/2)sin(φ/2)sin(φ/2)cos(φ/2)]U_{s}=e^{i\frac{\varphi+\pi}{2}}\begin{bmatrix}sin(\varphi/2)&cos(\varphi/2)\\ cos(\varphi/2)&-sin(\varphi/2)\end{bmatrix},U_{i}=e^{i\frac{\varphi+2\pi}{2}}\begin{bmatrix}cos(\varphi/2)&sin(\varphi/2)\\ sin(\varphi/2)&-cos(\varphi/2)\end{bmatrix} (2)

The initial state of photon pairs can be written in terms of the creation operator on the vacuum state as asai|vaca^{\dagger}_{s}a^{\dagger}_{i}|vac\rangle. The AMZI performs UsU_{s} and UiU_{i} for signal and idler photons, respectively, where the process can be described as

asai|vac\displaystyle a^{\dagger}_{s}a^{\dagger}_{i}|vac\rangle UAMZIeiφ+π2(sin(φ/2)cs+cos(φ/2)ds)(cos(φ/2)ci+sin(φ/2)di)|vac\displaystyle\stackrel{{\scriptstyle U_{AMZI}}}{{\longrightarrow}}e^{i\varphi+\frac{\pi}{2}}(sin(\varphi/2)c^{\dagger}_{s}+cos(\varphi/2)d^{\dagger}_{s})\cdot(cos(\varphi/2)c^{\dagger}_{i}+sin(\varphi/2)d^{\dagger}_{i})|vac\rangle (3)
=ei(φ+π2){sin(φ/2)2csdi+other.}|vac\displaystyle=e^{i(\varphi+\frac{\pi}{2})}\{sin(\varphi/2)^{2}c^{\dagger}_{s}d^{\dagger}_{i}+other.\}|vac\rangle

By post-selecting the case csdic^{\dagger}_{s}d^{\dagger}_{i} with filters, the success probability of coincidence is sin4(φ/2)sin^{4}(\varphi/2). If the photons enter from port bb, the result becomes cos4(φ/2)=sin4((φ+π)/2)cos^{4}(\varphi/2)=sin^{4}((\varphi+\pi)/2). Thus, we always have

C(φ)Cmaxsin4(φ/2)C(\varphi)\propto C_{max}sin^{4}(\varphi/2) (4)

where CmaxC_{max} is the measured maximum CR, which occurs when signal and idler photons are deterministically separated. This result is also experimentally verified and shown in Figure 1b. The dots are measured CRs by tuning the two AMZIs’ phases φ1\varphi_{1} and φ2\varphi_{2} in Figure 2a, respectively, while the solid lines are fitted results using asin4(αI2+β)+basin^{4}(\alpha I^{2}+\beta)+b.

Refer to caption
Figure 1. a)The scheme of AMZI. (b)The measured CRs vary with the square of the current applied to the two AMZIs.

2.2 Chip Design and Experimental Setup

The quantum GANs are exhibited on a digital silicon quantum photonic chip, as shown schematically in Figure 2a. The chip is pumped with continuous-wave (CW) light at 1550.92 nm, which is amplified to 30 mW by an erbium-doped fiber amplifier (EDFA). After being adjusted by a polarization controller (PC) and filtered by a dense wavelength division multiplexer (DWDM), light is coupled into the chip via a grating coupler with a coupling loss of about 4.5 dB. Photon pairs are created in two 1-cm-long waveguide spiral sources using spontaneous four-wave mixing (SFWM) and routed through single-mode waveguides with 450nm×\times220nm. Interferemeters, which include multimode interferometers (MMIs), beam-splitters and thermo-optic phase shifters (PSs) driven by a current source, are used to manipulate photonic qubits. The output photons are filtered using DWDMs, which selectively isolate signal photons at 1555.75nm (red) for the control qubit and idler photons at 1546.12nm (blue) for the target qubit. After polarization adjustments, photons are detected by superconducting nanowire single-photon detectors (SNSPDs) (Photec, 50Hz dark counts, 90% efficiency). A TimeTagger (Swabian) is utilized to perform coincidence counting, with a coincidence window of 300 ps. The measured coincidence rate (CR) is approximately 3 kHz. A classical computer is used to automate the quantum GANs, encompassing quantum gate manipulation, data acquisition, instant analysis, and the implementation of classical neural networks (NNs).

When pumped, the two spiral sources have equal power, resulting in the production of two-photon NOON state (|2s,i,0+|0,2s,i)/2(|2_{s,i},0\rangle+|0,2_{s,i}\rangle)/\sqrt{2}. The photon pairs from two sources are filtered and partially split by two AMZIs, where the FSR of each AMZI is 19.20nm, equaling twice the wavelength difference of the photon pairs. The photons are then switched by a waveguide crossing, yielding the path-entangled state

|ψ0=eiθ8C1(φ1)|0s|0i+C2(φ2)|1s|1i)/A|\psi\rangle_{0}=e^{i\theta_{8}}\sqrt{C_{1}(\varphi_{1})}|0_{s}\rangle|0_{i}\rangle+\sqrt{C_{2}(\varphi_{2})}|1_{s}\rangle|1_{i}\rangle)/A (5)

where AA is the normalization coefficient, and C1(φ1)C_{1}(\varphi_{1}) and C2(φ2)C_{2}(\varphi_{2}) represent the measured CR of source1 and source2, respectively, which can be calculated using Equation (4). Due to the global phase introduced by AMZI in Equation (1), a relative phase occurs in the entangled state, which can be compensated for by adjusting θ8\theta_{8}. The state can be further described as

Refer to caption
Figure 2. a)Experimental setup and the programmable silicon quantum photonic chip. The chip is made up of three parts: i) creating two-photon entangled states with controllable amplitudes using SFWM and AMZIs, ii) performing controlled unitary operations to produce arbitrary two-qubit pure states, and iii) using single-qubit gates for computing or state tomography. b)The equivalent quantum circuit of the chip. Two waveguides with one equipped with phase shifter corresponds to an Rz^\hat{R_{z}} gate, while the MZI corresponds to an Ry^\hat{R_{y}} gate. The combination of both can carry out any SU(2) transformation.
|ψ0\displaystyle|\psi\rangle_{0} =eiθ8cos(φ)|0s|0i+sin(φ)|1s|1i\displaystyle=e^{i\theta_{8}}cos(\varphi)|0_{s}\rangle|0_{i}\rangle+\sin(\varphi)|1_{s}\rangle|1_{i}\rangle (6)
=α|0s|0i+β|1s|1i\displaystyle=\alpha|0_{s}\rangle|0_{i}\rangle+\beta|1_{s}\rangle|1_{i}\rangle

where the rotation angle φ\varphi is determined by both the two tunable split ratios of AMZIs, with the relationship tan(φ)=C2(φ2)/C1(φ1)tan(\varphi)=\sqrt{C_{2}(\varphi_{2})/C_{1}(\varphi_{1})}. As a result, entangled state with arbitrary amplitude is generated.

Next, a CU^C\hat{U} operation is carried out using the approach in Ref. [31]. The space of idler photon is expanded by adding two waveguide modes. The path of the photon introduces the third dimention indicated by |0p|0_{p}\rangle and |1p|1_{p}\rangle, choosing which operation the idler qubit experiences — V^\hat{V} or U^\hat{U}. The path is controlled by the state of the signal photon, resulting in a superposition of V^\hat{V} and U^\hat{U} operations

α|0sV^|0i|0p+β|1sU^|0i|1p\alpha|0_{s}\rangle\otimes\hat{V}|0_{i}\rangle\otimes|0_{p}\rangle+\beta|1_{s}\rangle\otimes\hat{U}|0_{i}\rangle\otimes|1_{p}\rangle (7)

A waveguide crossing and two MMIs are then used to erase the path information. By coincidence, we have the state after CU^C\hat{U}

|ψ1=α|0sV^|0i+β|1sTU^|0i|\psi\rangle_{1}=\alpha|0_{s}\rangle\otimes\hat{V}|0_{i}\rangle+\beta|1_{s}\rangle_{T}\otimes\hat{U}|0_{i}\rangle (8)

it can be rewritten in the computational basis, which is

|ψ2=p0|00+p1|01+p2|10+p3|11|\psi\rangle_{2}=\sqrt{p_{0}}|00\rangle+\sqrt{p_{1}}|01\rangle+\sqrt{p_{2}}|10\rangle+\sqrt{p_{3}}|11\rangle (9)

where p0+p1+p2+p3=1p_{0}+p_{1}+p_{2}+p_{3}=1 and the relative phases have been ignored. Obviously, the coefficient before each basis is arbitrary, thus the state can be any two-qubit pure state. This is a deviation from prior works[32, 33, 34, 35, 36]. Finally, additional PSs and MZIs are added as single-qubit gates to perform state tomography or other sophisticated operations.

The equivalent quantum circuit is depicted in Figure 2b, correlating one-to-one with the chip’s PS values, except for θ10\theta_{10} and θ11\theta_{11} because of broken PS θ11\theta_{11}. However, this does not affect our experiments because the issue can be addressed by altering θ8\theta_{8} and θ9\theta_{9} (See Appendix A). The parameterized gate R^y(φ)\hat{R}_{y}(\varphi) is constructed by the two AMZIs and PS θ8\theta_{8} combined. The remaining R^z\hat{R}_{z} and R^y\hat{R}_{y} gates are built respectively using PSs added to the waveguides and the MZIs. In the quantum circuit context, the ability of our technique to generate arbitrary states is further confirmed by Ref. [37], which states that the combination of R^y\hat{R}_{y} gate on the control qubit and two unitaries controlled by |0C|0\rangle_{C} and |1C|1\rangle_{C} can prepare any two-qubit state. For the sake of simplicity, we will illustrate our chip’s configuration in subsequent experiments using quantum circuit language.

3 Experiments and Results

3.1 Learn Single-qubit State

Generating quantum states is vital in quantum computing[30]. We first exhibited the PQ-GAN, whose architecture is illustrated in Figure 3a. The black box provides quantum real data defined by a density matrix σ\sigma, yet the internal physical structure and quantum process are not exposed. The generator G, which corresponds to the quantum circuit inside the red dashed box in Figure 3b, can generate any quantum state (single-qubit in our case), including pure and mixed states. The mixed state is created by partially tracing the entangled state. D performs quantum measurements (\mathcal{M}) on both the real and produced states, intending to differentiate between them based on the statistical features of the measurement results pρ=tr(ρ)p_{\rho}=tr(\mathcal{M}\rho) and pσ=tr(σ)p_{\sigma}=tr(\mathcal{M}\sigma). Figure 3b depicts D’s quantum circuit as the portion within the blue dashed box. The final two gates of the signal qubit are unused, with settings set to perform the identity operation II. The optimization mission of PQ-GAN is

min𝜽𝒈max𝜽𝒅tr[(𝜽𝒅)ρ(𝜽𝒈)]tr[(𝜽𝒅)σ]\mathop{min}\limits_{\bm{\theta_{g}}}\mathop{max}\limits_{\bm{\theta_{d}}}tr[\mathcal{M}(\bm{\theta_{d}})\rho(\bm{\theta_{g}})]-tr[\mathcal{M}(\bm{\theta_{d}})\sigma] (10)

where 𝜽𝒈\bm{\theta_{g}} and 𝜽𝒅\bm{\theta_{d}} represent the trainable phases, which are initialized with normal distribution 𝒩(μ=0,σ=0.2)\mathcal{N}(\mu=0,\sigma=0.2). During the training process, G and D are alternatively optimized until they converge. To eliminate the influence of random factors, for pure and mixed state learning, we trained with different and identical beginning parameters, respectively, for 5 rounds, with 200 epochs per round. In each training epoch, D was trained for three steps first, then G for one. Gradient descent (ascent) is used to update the parameters. The detailed experimental parameters can be found in Appendix B.

A pure single-qubit state is learned first. The real state is selected as σp=(|0+|1)(0|+1|)/2\sigma_{p}=(|0\rangle+|1\rangle)(\langle 0|+\langle 1|)/2, which is prepared by setting R^y(ϕ)=I\hat{R}_{y}(\phi)=I and V^=H\hat{V}=H. The density matrix, depicted in Figure 3d, is acquired by quantum state tomography (QST). Figure 3c plots the curve of negative D loss, with the solid line representing the mean value and the shaded area representing the standard deviation (STD) over 5 rounds. The negative D loss keeps decreasing and approaches zero, implying that D is unable to discern the source of the state and that G has successfully replicated the genuine state. To track the resemblance between the produced and real states, we retrieved ρp\rho_{p} by QST and estimated the fidelity with F(ρ,σ)=[Tr(ρσρ)]2F(\rho,\sigma)=[Tr(\sqrt{{\sqrt{\rho}\sigma\sqrt{\rho}}})]^{2} at each epoch. As illustrated in Figure3c, FF approaches 1 after approximately 100 epochs. The final generated state is depicted in Figure 3d, with a fidelity as high as 99.41±0.5499.41\pm 0.54%. Such a high fidelity implies that our chip can easily learn pure states.

Refer to caption
Figure 3. a)The structure of PQ-GAN, where real data is a quantum state σ\sigma, G generates arbitral states ρ\rho, and D does projective measurements \mathcal{M}. b)The components of G and D used in the chip, which are shown in the form of quantum circuits. c)The tracking of negative D loss and the fidelity of pure state learning over 200 epochs. d) The measured dense matrix of σp\sigma_{p} and ρp\rho_{p} with a mean state fidelity of 99.41%99.41\%. e)The mixed state learning process. f)The measured dense matrix of σm\sigma_{m} and ρm\rho_{m} with a mean state fidelity of 98.39%98.39\%.

Then, the learning of a mixed state σm=(7|00|+3|11|)/10\sigma_{m}=(7|0\rangle\langle 0|+3|1\rangle\langle 1|)/10 is investigated. The real state σm\sigma_{m} is created by randomly preparing {|0,|1}\{|0\rangle,|1\rangle\} with probabilities {0.7,0.3}\{0.7,0.3\}. Figure 3e plots the evolution of negative D loss and fidelities, which oscillate around 0 and 1 after 75 epochs, respectively. This disconvergence of mixed state learning is known as limit cycles, which result from the bilinear nature of the loss function when using gradient descent to generate mixed states in adversarial training[14]. The occurance of bilinear problem may be due to the mixed state being positioned inside the Bloch sphere, where the nonlinear restrictions from the Bloch boundary are not satisfied[14]. We displayed σm\sigma_{m} and the final generated ρm\rho_{m} in Figure 3f, with the fidelity 98.39±0.6098.39\pm 0.60%. The ability to demonstrate mixed-state learning comes from the fact that the chip can produce single-qubit states in any mixedness by tracing out two-qubit pure states.

3.2 Load Classical Distribution

Quantum GANs can encode classical data into quantum states using a low-depth quantum circuit by training the generator, overcoming the data input bottleneck in QML[19, 20]. Since the distribution is arbitrary, a generator with high expressivity is required. We demonstrate that our chip has the capabilities by performing three distribution loading tasks. To achieve this goal, an HQC-GAN architecture is adopted, and its diagram is depicted in Figure 4a. It is the quantum version of Wassertein GAN with gradient penalty (WGAN-GP)[38, 39], which employs Wassertein distance (or Earth mover’s distance) as the loss function, alleviating

Refer to caption
Figure 4. a) The architecture of HQC-GAN, the quantum variant of WGAN-GP, where the generator is replaced by a PQC U(𝜽𝒈)U(\bm{\theta_{g}}). b)G’s quantum circuit as configured on the chip, with just three trainable parameters and the rest fixed to perform operation II. c)-e)Plots of the three distributions, which are the normal, log-normal, and bimodal distributions from left to right. f)-h) Curves of KLD (orange) and critic loss (blue) of three distributions across 500 epochs.

the issue of unstable training, mode collapse, and vanishing gradients in standard GAN[40, 41].

The HQC-GAN has two components: a quantum G and a classical critic. G is a PQC parameterized with 𝜽𝒈\bm{\theta_{g}}, and the generated data is encoded into the occurance probabilities of basis states, which can be acquired by measuring along the computational basis. The measurement operation also incorporates the stochastic aspect required by G. In our experiment, the chip plays the role of G. In Figure 4b, the chip is set to realize a quantum circuit with only three trainable parameters, as the R^y\hat{R}_{y} gate is sufficient to change the probability on each basis. It can create any 2-qubit state with real amplitudes as illustrated in Equation (9), representing any distribution with four data points p=[p0,p1,p2,p3]\vec{p}=[p_{0},p_{1},p_{2},p_{3}]^{\top}. The critic, parameterized with 𝜽𝒄\bm{\theta_{c}}, is comparable to the discriminator in standard GAN[42], except that the final sigmoid layer has been removed, resulting in an unbounded output. We construct the critic as a fully-connected NN implemented with Pytorch, which consists of an input layer, two hidden layers and an output layer with a node arrangement of 4-5-3-1. Leaky ReLU functions are used in both the input and hidden layers[43]. The real data is obtained by sampling from a classical random distribution.

During the training, the critic is fed both real and generated data, and the output is used to alternately train G and the critic. The optimization mission of HQC-GAN is

min𝜽𝒈max𝜽𝒅D𝜽𝒅(G(𝜽𝒈))D𝜽𝒅(𝒙)+λ𝔼p𝒙^(𝒙^)[(𝒙^D𝜽𝒅(𝒙^)21)2]\mathop{min}\limits_{\bm{\theta_{g}}}\mathop{max}\limits_{\bm{\theta_{d}}}D_{\bm{\theta_{d}}}(G(\bm{\theta_{g}}))-D_{\bm{\theta_{d}}}(\bm{x})+\lambda\mathbb{E}_{p_{\bm{\hat{x}}}(\bm{\hat{x}})}[(||\nabla_{\bm{\hat{x}}}D_{\bm{\theta_{d}}}(\bm{\hat{x}})||_{2}-1)^{2}] (11)

where 𝒙\bm{x} is the real data distribution, p𝒙^(𝒙^)p_{\bm{\hat{x}}}(\bm{\hat{x}}) indicates the distribution of 𝒙^=𝒙+ζ(G(𝜽𝒈)𝒙)\bm{\hat{x}}=\bm{x}+\zeta(G(\bm{\theta_{g}})-\bm{x}) with ζU(0,1)\zeta\sim U(0,1) and λ\lambda is the gradient penalty coefficient. The introduction of the gradient penalty aims to meet the 1-Lipschitz continuity criterion required for WGAN[40, 41]. See Appendix B for more detailed experimental settings.

We used our chip to learn three distributions: the normal distribution with X𝒩(μ=1.5,σ=1)X\sim\mathcal{N}(\mu=1.5,\sigma=1), the log-normal distribution with X𝒩(μ=0.5,σ=1)X\sim\mathcal{LN}(\mu=0.5,\sigma=1) and the bimodel distribution being the superposition of X𝒩(μ=0,σ=0.5)X\sim\mathcal{N}(\mu=0,\sigma=0.5) and X𝒩(μ=2,σ=0.3)X\sim\mathcal{N}(\mu=2,\sigma=0.3). The real data, represented by the black solid lines in Figure 4c-e, are obtained by sampling 10,000 points and truncating them into the interval [0,3][0,3]. We trained the network in five rounds with varied initialization settings, each with 500 epoches. For each epoch, the critic is trained three times first, then G is trained once.

The blue histograms in Figure 4c-e exhibit the average value of learning results, which mimic the target distributions. Figure 4f-h show the evolution of the critic loss and the Kullback-Leibler divergence (KLD), with the solid line representing the mean value over 5 rounds and the shadow part indicating the STD. In traditional GANs, the KLD is commonly used to estimate the similarity of two distributions and is often employed as a convergence indicator. As noted, the critic loss and KLD have a comparable convergence tendency and finally approach 0. The final KLD value for all three distributions is less than 0.05, suggesting that the real distributions are successfully loaded.

3.3 Generate Compressed Images

As an important application, classical GANs have been utilized to generate realistic images that do not exist in the training dataset. In this application, nonlinearity is critical for capturing complicated patterns and nonlinear correlations in the data. Unfortunately, when a PQC is substituted for a classical G, the quantum G can only conduct linear transformations since quantum evolution is unitary. To introduce nonlinearity, one typical way is to trace the qubits out[23, 44, 45], i.e., to add auxiliary qubits to the generator, and discard them during the measurement stage. However, this method is not suitable for our experiment since only two qubits are available. To address this challenge, we introduce a new method by placing a classical NN in front of the original quantum G, giving rise to a new hybrid G that performs well in the job of learning compressed MNIST images.

Figure 5a depicts the training framework, which resembles the structure seen in Figure 4a. The significant distinction is the addition of a noise source ZZ and the use of the hybrid G. The noise vector 𝒛\bm{z} is obtained by sampling from a uniform distribution within the range [0, 1]. As previously stated, the hybrid G is made up of two parts: one classical, which is a fully-connected NN, and the other quantum, which is a PQC. The incorporation of the classical NN enables straightforward nonlinear data transformations using activation functions. In this experiment, a 2×22\times 2 NN activated by Leaky ReLU functions is used. Figure 5b illustrates the quantum part implemented by our chip, which consists of an encoding layer and a training layer UG(𝜽𝒒)U_{G}(\bm{\theta_{q}}). The encoding layer occurs because the equation R^y(ϕ1+ϕ2)=R^y(ϕ1)+R^y(ϕ2)\hat{R}_{y}(\phi_{1}+\phi_{2})=\hat{R}_{y}(\phi_{1})+\hat{R}_{y}(\phi_{2}) holds true. To acquire generated data, the noise vector 𝒛\bm{z} is first input into the classic part of G, and undergoes a nonlinear transformation to become G(𝜽𝒄;𝒛)G(\bm{\theta_{c}};\bm{z}). The results are then encoded into the angle of R^y(z)\hat{R}_{y}(z) gates and transformed into the output state |G(𝜽𝒄,𝜽𝒒;𝒛)=UG(𝜽𝒒)|G(𝜽𝒄;𝒛)|G(\bm{\theta_{c}},\bm{\theta_{q}};\bm{z})\rangle=U_{G}(\bm{\theta_{q}})|G(\bm{\theta_{c}};\bm{z})\rangle. Finally, measuring along the computational basis yields the generated data G(𝜽𝒄,𝜽𝒒;𝒛)G(\bm{\theta_{c}},\bm{\theta_{q}};\bm{z}).

The training process is similar to the data loading task, except that in each epoch, a batch of data is sampled separately from the generated and real dataset, rather than a single distribution, to optimize G and D. The hyperparameters used in this experiment can be found in Appendix B. The optimization objective is

min𝜽𝒈𝒄,𝜽𝒈𝒒max𝜽𝒅L(𝜽𝒈𝒄,𝜽𝒈𝒒,𝜽𝒅)=1Ni=0N1[D𝜽𝒅(G(𝜽𝒈𝒄,𝜽𝒈𝒒;𝒛𝒊))D𝜽𝒅(𝒙𝒊)+λ𝔼p𝒙^𝒊(𝒙^𝒊)(𝒙𝒊^D𝜽𝒅(𝒙^𝒊)21)2]\mathop{min}\limits_{\bm{\theta_{g_{c}}},\bm{\theta_{g_{q}}}}\mathop{max}\limits_{\bm{\theta_{d}}}L(\bm{\theta_{g_{c}}},\bm{\theta_{g_{q}}},\bm{\theta_{d}})=\frac{1}{N}\sum_{i=0}^{N-1}[D_{\bm{\theta_{d}}}(G(\bm{\theta_{g_{c}}},\bm{\theta_{g_{q}}};\bm{z_{i}}))-D_{\bm{\theta_{d}}}(\bm{x_{i}})+\lambda\mathbb{E}_{p_{\bm{\hat{x}_{i}}}(\bm{\hat{x}_{i}})}(||\nabla_{\bm{\hat{x_{i}}}}D_{\bm{\theta_{d}}}(\bm{\hat{x}_{i}})||_{2}-1)^{2}] (12)

where 𝒙𝒊\bm{x_{i}} is a sample from the training dataset, and N is the batch size. The backpropagation approach is ineffective for optimizing 𝜽𝒈𝒄\bm{\theta_{g_{c}}} due to the quantum component between the classical NN and the output. Thus, we utilize the finite difference approach to get the approximate gradient, which is

d𝜽𝒈𝒄=L(𝜽𝒈𝒄+ϵ)L(𝜽𝒈𝒄ϵ)2ϵd\bm{\theta_{g_{c}}}=\frac{L(\bm{\theta_{g_{c}}}+\epsilon)-L(\bm{\theta_{g_{c}}}-\epsilon)}{2\epsilon} (13)

where ϵ\epsilon is a small incremental value.

Refer to caption
Figure 5. a)The HQC-GAN framework for learning images, in which G is a combination of a NN and a PQC. The classical NN introduces nonlinearity for G by performing a nonlinear transformation on the input noise. b)The corresponding quantum circuit of U(𝜽𝒈𝒒)U(\bm{\theta_{g_{q}}}), which includes an encoding layer and a training layer. c)Visualization of binarized MNIST images after IPCA, with real data examples on the left and chip-generated examples on the right. d)The progress of critic loss (blue) and KLD (orange) for digit 0, with the solid line and the shaded zone representing the average and STD of five runs with different initialization. The embedded images illustrate the evolution of the generated digit 0 as epochs proceed.

As a demonstration, we use this framework to train the chip to produce MNIST images. The MNIST dataset contains 60,000 handwritten images ranging from 0 to 9 with a size of k=28×28k=28\times 28, exceeding our chip’s capabilities. Inspired by Ref. [46, 47], we preprocess the pictures by compressing them to dimension k=3k=3 with the principal component analysis (PCA) algorithm, resulting in the feature data x=[x0,x1,x2]\vec{x}=[x_{0},x_{1},x_{2}]^{\top} for each compressed image. To match the image data with the quantum state represented in Equation (9), we augment x\vec{x} to x=[x,0.5]{x}^{\prime}=[\vec{x}^{\top},0.5]^{\top}, and utilize the map pi=xi/i=03xip_{i}=x_{i}^{\prime}/\sum_{i=0}^{3}x_{i}^{\prime}, which assures a one-to-one mapping between x\vec{x} and p\vec{p}, retaining all information.

We then trained the chip to generate all nine digits (0–9) one at a time. The batch size is set to be N=5N=5, and 5 rounds with each 200 epochs are conducted. To visualize the quality of the created images, we utilize the inverse PCA (IPCA) method to retrieve the data to the size of 28×2828\times 28, and display the real and generated images with binarization in Figure 5c. The images are randomly selected from training and generated datasets. As can be seen, we produced images that are comparable in quality to the original. Figure 5d displays the curve of critic loss and KLD of digit 0, which converge to be approximately 0. The convergence and creation of high-quality images demonstrate the efficacy of our strategy.

4 Discussion and Conclusion

Our scheme for generating arbitrary 2-qubit pure states can be extended to arbitrary 2-qudit by adopting high-dimensional encoding. In our strategy, AMZI is used to modulate the splitting ratio of photon pairs, resulting in an entangled state with controllable amplitude. In Ref. [35], the authors employ high-dimensional encoding to realize 4-dimensional CU^C\hat{U} gates, but the AMZI’s phases are fixed to deterministic values for separating photon pairs, leaving only the maximum entangled state available. Our approach can be directly translated to the device by changing the AMZI step to obtain arbitrary 2-ququart pure states without any further changes. We predict that as the encoding dimension increases, our scheme’s advantages will become more apparent. Furthermore, because the essence of our approach is to post-select photons in specific wavelengths through filters in order to change the quantum state amplitudes, it has a wider range of applications. In most quantum photonic computing chips, the first step is to generate entangled states[17, 48, 49, 50]. Our approach can be considered an alternative solution for tuning the amplitude to perform more complex computations. In Ref. [51], the authors change the quantum state amplitude by tuning micro-ring filters, similar to our scheme, but applied in quantum networks.

Our hybrid generator can alternatively be viewed as first preparing latent data with a conventional NN before inputting it into the quantum generator. This is distinct from existing hybrid generator structures, in which the quantum circuit prepares latent data and the classical neural network does post-processing, resulting in the creation of exclusively classical data[24, 25, 26]. In contrast, our hybrid generator maintains information in the quantum state, allowing for easier integration with subsequent quantum circuits. In this sense, it acts as a distribution loader to be a subset of larger algorithms[52], but more advanced than the pure quantum version[19, 20]. However, as mentioned before, the drawback is that due to the quantum layer between the NN and the measurement outcomes, backpropagation cannot be used, leading to time-consuming gradient acquisition for each parameter. As a result, the number of parameters in the classical NN must be carefully selected to minimize the runtime.

It is acknowledged that due to the chip’s limiting size, only four dimensions are available—which allows just conceptual demonstrations of quantum GANs. The few available qubits are also a challenge for other quantum computing platforms at present, but combining more advanced algorithms may help alleviate this issue. For instance, a variant of HQC-GAN called patch GAN has been proposed in Ref. [23], which combines the generated results from small-scale quantum subgenerators to represent large-scale data, opening up new prospects for the recent applications of quantum GANs.

In conclusion, we have designed a silicon quantum photonic chip made up of two spiral waveguides, two AMZIs, and 14 phase shifters that can generate any two-qubit pure state. The single chip is utilized for learning both quantum and classical data. Initially, we presented a PQ-GAN on this chip, training it to produce a high-fidelity single-qubit pure and mixed state. Then, we then integrated our quantum device with classical discriminators to show an HQC-GAN, which loaded three classical distributions onto the basis-state probabilities of two photonic qubits. Finally, we presented a hybrid generator that combines conventional NNs with quantum circuits for obtaining nonlinearity, and we used it to produce compressed MNIST images that are comparable in quality to the training set. Our work paves the way for the application of quantum photonic chips in generative learning. In the future, we expect to combine more advanced algorithms and larger-scale chips to implement quantum GANs, expecting to address more practical problems and applications in finance[52], drug discovery[25, 26] and more.

Appendix A

Figure A1 shows a microscope photograph of the chip. The entire quantum chip has a dimension of 3mm ×\times 0.8mm. The white lines and square pads act as electric conduits and metal pads for delivering electric signals to the PSs to manipulate the qubits. However, the PS θ11\theta_{11} was broken. We demonstrate that this has no impact on our experiments.

As depicted in Figure A2, the transformations executed by the phase shifters θ8\theta_{8} through θ11\theta_{11} are Rz(θ8)R_{z}(\theta_{8}), Ry(θ9)R_{y}(\theta_{9}), Rz(θ10)R_{z}(\theta_{10}) and UbrokU_{brok}, respectively. Total transformation can be stated as

U(θ8,θ9,θ10)=UbrokRz(θ10)Ry(θ9)Rz(θ8)U(\theta_{8},\theta_{9},\theta_{10})=U_{brok}R_{z}(\theta_{10})R_{y}(\theta_{9})R_{z}(\theta_{8}) (A1)

Since the combination of RzR_{z} and RyR_{y} gates can represent any 2-dimensional unitary, we can always find {θ80,θ90,θ100}\{\theta_{8}^{0},\theta_{9}^{0},\theta_{10}^{0}\} such that Rz(θ100)Ry(θ90)Rz(θ80)=UbrokR_{z}(\theta_{10}^{0})R_{y}(\theta_{9}^{0})R_{z}(\theta_{8}^{0})=U_{brok}^{\dagger}. Thus, we have

I=UbrokRz(θ100)Ry(θ90)Rz(θ80)=UbrokUbrokI=U_{brok}R_{z}(\theta_{10}^{0})R_{y}(\theta_{9}^{0})R_{z}(\theta_{8}^{0})=U_{brok}U_{brok}^{\dagger} (A2)

Assuming additional phase shifts {dθ8,dθ9}\{d\theta_{8},d\theta_{9}\} is added, then

U(θ80+dθ8,θ90+dθ9,θ100)\displaystyle U(\theta_{8}^{0}+d\theta_{8},\theta_{9}^{0}+d\theta_{9},\theta_{10}^{0}) (A3)
=\displaystyle= UbrokRz(θ100)Ry(θ90+dθ9)Rz(θ80+dθ8)\displaystyle U_{brok}R_{z}(\theta_{10}^{0})R_{y}(\theta_{9}^{0}+d\theta_{9})R_{z}(\theta_{8}^{0}+d\theta_{8})
=\displaystyle= UbrokRz(θ100)Ry(θ90)Rz(θ80)IRz(θ80)Ry(dθ9)Rz(θ80+dθ8)\displaystyle\underbrace{U_{brok}R_{z}(\theta_{10}^{0})R_{y}(\theta_{9}^{0})R_{z}(\theta_{8}^{0})}_{I}R_{z}(-\theta_{8}^{0})R_{y}(d\theta_{9})R_{z}(\theta_{8}^{0}+d\theta_{8})
=\displaystyle= Rz(θ80)Ry(dθ9)Rz(θ80+dθ8)\displaystyle R_{z}(-\theta_{8}^{0})R_{y}(d\theta_{9})R_{z}(\theta_{8}^{0}+d\theta_{8})

where Rz(θ80)R_{z}(-\theta_{8}^{0}) has no physical meaning as it cannot be measured. By redefining θ9=dθ9\theta_{9}=d\theta_{9}, θ8=θ80+dθ8\theta_{8}=\theta_{8}^{0}+d\theta_{8} and ignoring Rz(θ80)R_{z}(-\theta_{8}^{0}), the final transformation is obtained as

U(θ8,θ9,θ100)=Ry(θ9)Rz(θ8)U(\theta_{8},\theta_{9},\theta_{10}^{0})=R_{y}(\theta_{9})R_{z}(\theta_{8}) (A4)

As a result, our experiment is unaffected by the broken phase shifter θ11\theta_{1}1. This is why, in our quantum circuit shown in Figure 2b, the final quantum gates for the signal photon qubit are RzR_{z} and RyR_{y}.

Refer to caption
Figure A1. The microscopic photograph of the chip.
Refer to caption
Figure A2. The connection between cascade phase-shifters and quantum gates, where θ11\theta_{11} is broken.

Appendix B

In the experiments, our device implements the quantum component, while PyTorch is used to build the traditional NN. The three tasks are set up under various experimental circumstances.

In the single-qubit state learning problem, the parameters G and D are optimized using gradient descent (ascent) at learning rates of 0.02 and 0.1, respectively. For the distribution loading task, the learning rates for G and the critic are 0.08 and 0.1, respectively. In the compressed MNIST learning problem, the learning rates for G’s classical portion, G’s quantum part, and the critic are 0.02, 0.08, and 0.02, respectively.

In the quantum state learning issue, a vanilla gradient is used and no optimizer is applied. The last two tasks employ the RMSProp optimizer[53] to update the parameters of both G and the critic, with optimizer hyperparameters set to β=0.9\beta=0.9. The objective function has a gradient penalty coefficient of λ=0.5\lambda=0.5.

The parameter-shift rule[54] is used to compute the gradients of the quantum portion in all three challenges, whereas PyTorch computes the gradients of the classical discriminator automatically. For the MNIST image learning issue, the gradients of G’s classical portion are computed using finite differences, with an incremental value of ϵ=0.02\epsilon=0.02.

Acknowledgements

This work was supported by the National Key Research and Development Program of China (Grants No. 2021YFB2800201), the National Natural Science Foundation of China (Grants No. U22A2082), the Ningbo Science and Technology Program (Grants No. 2023Z073), and the Provincial Natural Science Fundation of Zhejiang (Grants No. LD21F040003).

References

  • [1] P. W. Shor, SIAM review 1999, 41, 2 303.
  • [2] A. W. Harrow, A. Hassidim, S. Lloyd, Physical review letters 2009, 103, 15 150502.
  • [3] D. Bluvstein, S. J. Evered, A. A. Geim, S. H. Li, H. Zhou, T. Manovitz, S. Ebadi, M. Cain, M. Kalinowski, D. Hangleiter, et al., Nature 2023, 1–3.
  • [4] A. Peruzzo, J. McClean, P. Shadbolt, M.-H. Yung, X.-Q. Zhou, P. J. Love, A. Aspuru-Guzik, J. L. O’brien, Nature communications 2014, 5, 1 4213.
  • [5] E. Farhi, J. Goldstone, S. Gutmann, arXiv preprint arXiv:1411.4028 2014.
  • [6] K. Mitarai, M. Negoro, M. Kitagawa, K. Fujii, Physical Review A 2018, 98, 3 032309.
  • [7] J. Biamonte, P. Wittek, N. Pancotti, P. Rebentrost, N. Wiebe, S. Lloyd, Nature 2017, 549, 7671 195.
  • [8] V. Havlíček, A. D. Córcoles, K. Temme, A. W. Harrow, A. Kandala, J. M. Chow, J. M. Gambetta, Nature 2019, 567, 7747 209.
  • [9] I. Cong, S. Choi, M. D. Lukin, Nature Physics 2019, 15, 12 1273.
  • [10] J. Liu, K. H. Lim, K. L. Wood, W. Huang, C. Guo, H.-L. Huang, Science China Physics, Mechanics & Astronomy 2021, 64, 9 290311.
  • [11] S. Lloyd, C. Weedbrook, Physical review letters 2018, 121, 4 040502.
  • [12] P.-L. Dallaire-Demers, N. Killoran, Physical Review A 2018, 98, 1 012324.
  • [13] M. Benedetti, E. Grant, L. Wossnig, S. Severini, New Journal of Physics 2019, 21, 4 043023.
  • [14] P. Braccia, F. Caruso, L. Banchi, New Journal of Physics 2021, 23, 5 053024.
  • [15] M. Y. Niu, A. Zlokapa, M. Broughton, S. Boixo, M. Mohseni, V. Smelyanskyi, H. Neven, Physical Review Letters 2022, 128, 22 220505.
  • [16] L. Hu, S.-H. Wu, W. Cai, Y. Ma, X. Mu, Y. Xu, H. Wang, Y. Song, D.-L. Deng, C.-L. Zou, et al., Science advances 2019, 5, 1 eaav2761.
  • [17] Y. Wang, S. Xue, Y. Wang, Y. Liu, J. Ding, W. Shi, D. Wang, Y. Liu, X. Fu, G. Huang, et al., Optics Letters 2023, 48, 20 5197.
  • [18] J. Zeng, Y. Wu, J.-G. Liu, L. Wang, J. Hu, Physical Review A 2019, 99, 5 052306.
  • [19] C. Zoufal, A. Lucchi, S. Woerner, npj Quantum Information 2019, 5, 1 103.
  • [20] H. Situ, Z. He, Y. Wang, L. Li, S. Zheng, Information Sciences 2020, 538 193.
  • [21] J. Romero, A. Aspuru-Guzik, Advanced Quantum Technologies 2021, 4, 1 2000003.
  • [22] M. S. Rudolph, N. B. Toussaint, A. Katabarwa, S. Johri, B. Peropadre, A. Perdomo-Ortiz, Physical Review X 2022, 12, 3 031010.
  • [23] H.-L. Huang, Y. Du, M. Gong, Y. Zhao, Y. Wu, C. Wang, S. Li, F. Liang, J. Lin, Y. Xu, et al., Physical Review Applied 2021, 16, 2 024051.
  • [24] D. Herr, B. Obert, M. Rosenkranz, Quantum Science and Technology 2021, 6, 4 045004.
  • [25] J. Li, R. O. Topaloglu, S. Ghosh, IEEE transactions on quantum engineering 2021, 2 1.
  • [26] P.-Y. Kao, Y.-C. Yang, W.-Y. Chiang, J.-Y. Hsiao, Y. Cao, A. Aliper, F. Ren, A. Aspuru-Guzik, A. Zhavoronkov, M.-H. Hsieh, et al., Journal of Chemical Information and Modeling 2023, 63, 11 3307.
  • [27] H. Zhang, L. Wan, S. Ramos-Calderer, Y. Zhan, W.-K. Mok, H. Cai, F. Gao, X. Luo, G.-Q. Lo, L. C. Kwek, et al., Photonics Research 2023, 11, 10 1703.
  • [28] J. Wang, F. Sciarrino, A. Laing, M. G. Thompson, Nature Photonics 2020, 14, 5 273.
  • [29] L. Feng, M. Zhang, J. Wang, X. Zhou, X. Qiang, G. Guo, X. Ren, Photonics Research 2022, 10, 10 A135.
  • [30] M. A. Nielsen, I. L. Chuang, Quantum computation and quantum information, Cambridge university press, 2010.
  • [31] X.-Q. Zhou, T. C. Ralph, P. Kalasuwan, M. Zhang, A. Peruzzo, B. P. Lanyon, J. L. O’brien, Nature communications 2011, 2, 1 413.
  • [32] S. Paesani, A. A. Gentile, R. Santagati, J. Wang, N. Wiebe, D. P. Tew, J. L. O’Brien, M. G. Thompson, Physical review letters 2017, 118, 10 100503.
  • [33] J. Wang, S. Paesani, R. Santagati, S. Knauer, A. A. Gentile, N. Wiebe, M. Petruzzella, J. L. O’brien, J. G. Rarity, A. Laing, et al., Nature Physics 2017, 13, 6 551.
  • [34] R. Santagati, J. Wang, A. A. Gentile, S. Paesani, N. Wiebe, J. R. McClean, S. Morley-Short, P. J. Shadbolt, D. Bonneau, J. W. Silverstone, et al., Science advances 2018, 4, 1 eaap9646.
  • [35] Y. Chi, J. Huang, Z. Zhang, J. Mao, Z. Zhou, X. Chen, C. Zhai, J. Bao, T. Dai, H. Yuan, et al., Nature communications 2022, 13, 1 1166.
  • [36] J. Huang, X. Li, X. Chen, C. Zhai, Y. Zheng, Y. Chi, Y. Li, Q. He, Q. Gong, J. Wang, Nature Communications 2024, 15, 1 2601.
  • [37] G.-L. Long, Y. Sun, Physical Review A 2001, 64, 1 014303.
  • [38] M. Arjovsky, S. Chintala, L. Bottou, In International conference on machine learning. PMLR, 2017 214–223.
  • [39] I. Gulrajani, F. Ahmed, M. Arjovsky, V. Dumoulin, A. C. Courville, Advances in neural information processing systems 2017, 30.
  • [40] M. Arjovsky, L. Bottou, arXiv preprint arXiv:1701.04862 2017.
  • [41] T. A. Ngo, T. Nguyen, T. C. Thang, Electronics 2023, 12, 4 856.
  • [42] I. Goodfellow, J. Pouget-Abadie, M. Mirza, B. Xu, D. Warde-Farley, S. Ozair, A. Courville, Y. Bengio, Advances in neural information processing systems 2014, 27.
  • [43] D. Pedamonti, arXiv preprint arXiv:1804.02763 2018.
  • [44] E. Farhi, H. Neven, arXiv preprint arXiv:1802.06002 2018.
  • [45] K. H. Wan, O. Dahlsten, H. Kristjánsson, R. Gardner, M. Kim, npj Quantum information 2017, 3, 1 36.
  • [46] S. A. Stein, B. Baheri, D. Chen, Y. Mao, Q. Guan, A. Li, B. Fang, S. Xu, In 2021 IEEE International Conference on Quantum Computing and Engineering (QCE). IEEE, 2021 71–81.
  • [47] S. Nguemto, V. Leyton-Ortega, arXiv preprint arXiv:2208.02165 2022.
  • [48] Y. Wang, S. Xue, Y. Wang, J. Ding, W. Shi, D. Wang, Y. Liu, Y. Liu, X. Fu, G. Huang, et al., Optics Letters 2023, 48, 14 3745.
  • [49] X. Qiang, X. Zhou, J. Wang, C. M. Wilkes, T. Loke, S. O’Gara, L. Kling, G. D. Marshall, R. Santagati, T. C. Ralph, et al., Nature photonics 2018, 12, 9 534.
  • [50] X. Qiang, Y. Wang, S. Xue, R. Ge, L. Chen, Y. Liu, A. Huang, X. Fu, P. Xu, T. Yi, et al., Science Advances 2021, 7, 9 eabb8375.
  • [51] Y. Zheng, C. Zhai, D. Liu, J. Mao, X. Chen, T. Dai, J. Huang, J. Bao, Z. Fu, Y. Tong, et al., Science 2023, 381, 6654 221.
  • [52] N. Stamatopoulos, D. J. Egger, Y. Sun, C. Zoufal, R. Iten, N. Shen, S. Woerner, Quantum 2020, 4 291.
  • [53] T. Tieleman, COURSERA: Neural networks for machine learning 2012, 4, 2 26.
  • [54] G. E. Crooks, arXiv preprint arXiv:1905.13311 2019.