Universal adversarial perturbations for multiple classification tasks with quantum classifiers

Abstract

Quantum adversarial machine learning is an emerging field that studies the vulnerability of quantum learning systems against adversarial perturbations and develops possible defense strategies. Quantum universal adversarial perturbations are small perturbations, which can make different input samples into adversarial examples that may deceive a given quantum classifier. This is a field that was rarely looked into but worthwhile investigating because universal perturbations might simplify malicious attacks to a large extent, causing unexpected devastation to quantum machine learning models. In this paper, we take a step forward and explore the quantum universal perturbations in the context of heterogeneous classification tasks. In particular, we find that quantum classifiers that achieve almost state-of-the-art accuracy on two different classification tasks can be both conclusively deceived by one carefully-crafted universal perturbation. This result is explicitly demonstrated with well-designed quantum continual learning models with elastic weight consolidation method to avoid catastrophic forgetting, as well as real-life heterogeneous datasets from hand-written digits and medical MRI images. Our results provide a simple and efficient way to generate universal perturbations on heterogeneous classification tasks and thus would provide valuable guidance for future quantum learning technologies.

I Introduction

The interplay between quantum computing and machine learning gives rise to a research frontier of quantum machine learning [1, 2]. The attempting quantum computing characteristics hold the intriguing potential to trigger a revolution in traditional machine learning study [3, 4, 5, 6, 7, 8, 9, 10]. Along this direction, a series of careful investigations have been conducted and various kinds of quantum classifiers have been introduced in Refs. [11, 12, 13, 14]. These results imply potential unparalleled advantages for quantum classifiers compared with their classical counterparts [12]. The proposed quantum classifiers are typically composed of “parameterized layers” [12] and “entangled layers” [12]. Such classifiers can reach the state-of-art accuracy of over 95% on binary classification tasks by utilizing limited computational resources in acceptable time complexity. Given the advantages of this design, we implement our classifier with a similar approach.

With the emergence of quantum machine learning methods, a serious concern comes to the reliability of these proposed models. There is sufficient evidence proving that quantum classifiers are as fragile as classical classifiers when encountering adversarial perturbations [15]. In Refs. [16, 17, 18], the authors provide important evidence that quantum classifiers are almost as vulnerable as classical ones. By applying like-wise carefully crafted subtle adversarial perturbations generated by white-box or black-box attack algorithms, the quantum learning systems can be conclusively deceived regardless of the data types (either classical or quantum mechanical). Further studies on the robustness of quantum classifiers have also been carried out in Ref. [19, 20]. Moreover, in Refs. [21, 22] a series of specific quantum adversarial machine learning algorithms and physical experiment implementations have been introduced to attack quantum classifiers in different scenarios, including differential evolution algorithm [23] and zeroth-order optimization [24] for discrete attacks, and fast gradient sign method [25], projected gradient descent method [26], and momentum iterative method [27] for continuous attacks. Such algorithms are well-designed, well-tested, and proven to be robust so that they can generate adversarial samples with high equality. In practice, Ref. [28] performs quantum adversarial learning on programmable superconducting quantum devices, which depict an intriguing blueprint for future quantum devices and applications.

However, most existing works on quantum adversarial learning have focused on adversarial perturbations for single input samples. In other words, the perturbation for different legitimate images is different, which indicates extra calculation for every image to be perturbed. In classical adversarial machine learning, universal adversarial perturbations have been introduced and investigated in Ref. [29]. More recently, this has been carried over to the quantum domain in Ref. [30], where the authors studied universal adversarial examples and perturbations. This paper not only introduced an effective method for generating universal perturbations for a single classifier but also demonstrated that increasing the strength of perturbations by a certain extent can result in moderate universal adversarial risk. The conclusion in this paper indicates that universal perturbations may exist even for different classification tasks. However, a clear demonstration of such universal perturbations remains lacking and there is no known efficient algorithm to find these perturbations hitherto.

In this work, we fill up this gap by studying universal perturbations for multiple classification tasks, in the context of quantum continual learning [31]. Here, the expression “multiple classification tasks” can be more precisely stated as “heterogeneous classification tasks”. We use the word “heterogeneous” to indicate datasets that are generated from different patterns. Since we want to study universal perturbations for multiple classification tasks, the datasets of different tasks should be independently distributed. For example, we take two MNIST handwritten digits classification tasks as “homogeneous” because both datasets are generated from the same pattern. Additionally, we take the MNIST handwritten digit and the MedNIST MRI image classification tasks as “heterogeneous” because both datasets are generated from different patterns. For the sake of simplicity, we shall use the phrase “multiple classification tasks” to represent “heterogeneous classification tasks”.

Specifically, our goal is to find an efficient way to generate universal perturbations for quantum classifiers tackling multiple classification tasks. We simulate two classifiers by merging them using quantum continual learning with elastic weight consolidation (EWC) [32, 33, 34] techniques, then conduct a simple basic iteration method to attack the merged classifier. We find that universal perturbations that can make different input samples into adversarial examples for different classification tasks indeed exist and can be obtained efficiently by the iteration algorithm. In particular, we can obtain a robust multiple classification tasks classifier with a stable average accuracy of over $90\%$ and a loss of around $0.62$ . By adding the universal perturbation with strength $0.02$ , the average accuracy decreases from $93.3\%$ to $28.5\%$ with fidelity of $0.79$ and loss of around $0.8$ . Specifically, the previously trained task’s accuracy decreases from $94.5\%$ to $24.5\%$ with fidelity of $0.84$ and loss of $0.73$ . The later trained task’s accuracy decreases from $92.0\%$ to $32.5\%$ with fidelity of $0.76$ and loss of $0.70$ . Our results reveal the universal feature for adversarial perturbations from a new multi-task perspective, which would be useful for future quantum learning technologies.

II Concepts and Notations

II.1 The existing quantum adversarial attack methods

Adversarial machine learning has been extensively discussed in the literature, as evidenced by the work by Huang et al. [35]. This field can be approached from two distinct perspectives as a “fact” and an “optimization problem.” The “fact” perspective highlights the existence of subtle perturbations that, when applied to the input of machine learning models, can significantly modify their outputs. On the other hand, the “optimization problem” perspective involves transforming the task into a “maximizing-minimizing problem” and leveraging algorithms with the assistance of computational power to generate adversarial samples. Various logical methods can be employed to attack machine learning systems, resulting in different strategies for adversarial attacks. In this paper, we follow the classification and analysis of adversarial attacks presented in the work by Vorobeychik et al. [36] with a particular focus on supervised learning scenarios.

In the context of supervised learning scenarios, our analysis considers a dataset $D_{N}$ comprising $N$ elements, where each element is denoted by $x^{(i)}$ and $y^{(i)}$ , representing the input data to be classified and its corresponding label, respectively [37, 38, 39]. To ensure compatibility with the quantum classifier, the input data $x^{(i)}$ needs to be transformed into a valid quantum state $\left|x^{(i)}\right\rangle$ before the classification process. Assuming that we have trained a quantum classification model $F$ to achieve high accuracy on dataset $D_{N}$ , we can affirm that for every input data element $x^{(i)}$ , the discrepancy between the model’s output $F\left(\left|x^{(i)}\right\rangle\right)$ and ground truth label $y^{(i)}$ , which is calculated from the loss function $L$ , is constrained within an acceptable range $\epsilon$ :

\sum_{i=1}^{n}L\left(F\left(\left|x^{(i)}\right\rangle\right),y^{(i)}\right)\leq\epsilon.

(1)

From an alternative standpoint, the training process in machine learning predominantly focuses on minimizing the loss function. In contrast, the adversarial process aims to maximize the loss function by applying a perturbation $\delta^{(i)}$ within the constraints of a small region $\Delta$ . A commonly employed technique involves imposing an $l_{p}$ -norm bound, which restricts the magnitude of the perturbation. Consequently, the aforementioned description can be summarized as follows:

\begin{aligned} &\max_{\delta^{(i)}}\quad L\left(F\left(\left|x^{(i)}+\delta^{(i)}\right\rangle\right),y^{(i)}\right)\\ &\text{s.t.}\quad\left(\sum_{i=1}^{N}||\delta^{(i)}||^{p}\right)^{\frac{1}{p}}\leq\Delta\end{aligned}.

(2)

Intuitively, the objective of this optimization problem is to achieve an imperceptible perturbation that generates a significant change in the loss function. An intuitive approach is to apply the perturbation along the maximum gradient ascent direction, as the gradient of the loss function indicates the rate of change in a given direction. Suppose we define the function $J(x^{(i)})$ that calculates the absolute gradient value and ascending direction of the gradient for $x^{(i)}$ in a multi-dimensional Hilbert space, as well as the strength of the perturbation denoted by $\epsilon$ . In this context, the calculation of perturbation $\delta^{(i)}$ can be summarized as follows:

\delta^{(i)}=\epsilon\times\mathrm{sgn}(J(x^{(i)})).

(3)

Most white-box attacks, such as FGSM [25] (fast gradient sign method), PGD [26] (projected gradient descent), and MIM [27] (moment iteration method), draw inspiration from directly calculating gradients and selecting an appropriate perturbation. Meanwhile, for black-box attacks, direct access to the gradient is not available. Substitute optimization methods like differential evolution algorithms are employed to identify the pixels that can significantly alter the value of the loss function. In quantum adversarial machine learning, gradient plays a crucial role as it indicates the regions where the loss function can be easily manipulated. The utilization of gradient information enables us to create minor perturbations specifically in regions characterized by higher absolute gradient values. These perturbations cause a shift in the loss function, guiding the model toward a desired direction and leading to a significant alteration in the model’s performance, thereby producing an adversarial effect.

II.2 The universal adversarial perturbation

According to the aforementioned description, the perturbation $\delta^{(i)}$ is calculated individually for each input sample $x^{(i)}$ . However, this approach requires attackers to compute specific information for each input, which can be challenging in many attacking scenarios. To address this limitation, a new type of adversarial method called “universal perturbation” has been developed. The underlying principle is to exploit the uniformity within a single task, identify similarities in gradient information across selected samples, and apply the same universal perturbation to transform most original samples into adversarial samples. In other words, the objective is to find a universal perturbation $\delta$ that combines the perturbation strength $\epsilon$ and the gradient sign information $J$ from the entire selected dataset $X$ :

\delta=\epsilon\times\mathrm{sgn}(J(X)).

(4)

By adding this universal perturbation to the original image, we expect to achieve the adversarial transformation of multiple input samples simultaneously.

Extensive evidence has convincingly shown the effectiveness and potential risks associated with universal perturbations for quantum classifiers [30]. Taking this understanding a step further, we pose an intriguing question: Can we generate a universal perturbation capable of deceiving a set of classifiers performing different tasks? To simplify the scenario, consider two classifiers, $A$ and $B$ . Their respective datasets are denoted as $X_{A}$ and $X_{B}$ . Additionally, we define the function $J$ as the operator that captures gradient information. The formulation of the universal perturbation can be expressed as follows:

\delta=\epsilon\times\mathrm{sgn}(J(X_{A},X_{B})).

(5)

In scenarios where gradient information is not exclusively associated with a single input sample, it becomes unavoidable that the resulting perturbation encompasses distributed information from the entire selected dataset, which may not perfectly align with each individual input sample. To overcome this challenge, it is essential to employ larger perturbation strengths and enhance the functionality of gradient information-extraction methods. By doing so, we can effectively address the issue of incorporating dataset-wide information and improve the alignment between perturbations and individual input samples.

In order to differentiate between adversarial examples generated using gradient information from individual input samples and those created using universal perturbations, we propose the term “universal examples” to refer specifically to adversarial examples crafted using universal perturbations. Consequently, we define the adversarial attack that utilizes universal examples as a “universal attack”. This terminology serves the purpose of distinguishing between various approaches and strategies employed in the field of adversarial machine learning. By introducing these specific terms, we can more precisely categorize and discuss the different techniques and methodologies involved in adversarial machine learning.

II.3 Quantum continual learning and catastrophic forgetting

The design of algorithms to implement the concept of finding universal perturbation poses a significant challenge. Currently, performing calculations on multiple quantum classifiers with different structures is difficult due to limitations in qubit volume and incoherences arising from inconsistencies of quantum circuit parameters. To address this issue, we propose to introduce the quantum continual learning method to merge two target classifiers into one robust continual learning classifier that is capable of tackling multiple classification tasks. Provided that a universal perturbation can be found on this continual learning classifier, the argument of effective universal perturbation of multiple classification tasks shall still hold. Intuitively, quantum continual learning techniques offer a powerful approach to simplify the generating process of universal perturbation and demonstration of universal vulnerability between different classification tasks.

Quantum continual learning, as explored in the work by Jiang et al. [31], encompasses the study of training a classifier capable of performing multiple tasks using the same set of parameters. However, a significant issue arises when training a classifier on two datasets sequentially, as it often results in a substantial decrease in accuracy for the previously trained task. This phenomenon is known as “catastrophic forgetting” [40], which occurs due to the propensity of the learning process for the second task to modify essential parameters associated with the first task. Consequently, the acquisition of robust quantum continual learning models becomes crucial in the context of generating universal perturbations.

Recent research has indicated that animals encounter similar challenges of forgetting when acquiring new knowledge. Interestingly, a method that involves safeguarding specific excitatory synapses that play a crucial role in preserving past experiences has been discovered [41]. To address the issue of catastrophic forgetting, a straightforward approach is to protect the crucial parameters associated with the first task while training the second task. This concept has inspired the development of an effective method in both classical and quantum classifiers to mitigate catastrophic forgetting, known as the elastic weight consolidation (EWC) method [42]. Consider two independent classification tasks, denoted as $A$ and $B$ . The training process can be viewed as maximizing the likelihood function $P(\theta|X)$ , where $\theta$ represents the parameter being trained. By employing simple Bayes rules [43], a transformation can be applied as follows:

\mathrm{log}P(\theta|X)=\underset{\text{element 1}}{\mathrm{log}P(X_{B}|\theta)}+\underset{\text{element 2}}{\mathrm{log}P(\theta|X_{A})}-\underset{\text{element 3}}{\mathrm{log}P(X_{B})},

(6)

where the first element corresponds to the loss function $L_{B}(\theta)$ associated with task B, while the third element remains constant with respect to $\theta$ . Consequently, the problem can be reformulated as an optimization problem involving only the first two elements. From a technical standpoint, calculating the posterior probability $\log P(\theta|X_{A})$ directly can be challenging. To simplify the calculation, the prior probability can be simplified by Gaussian distribution, as suggested in previous studies in Ref. [44]. In addition, we utilize a second-order Taylor expansion, as described by Pourahmadi Ref. [45], centered around the optimal parameter $\theta_{A}$ obtained from the previously trained task $A$ . This expansion allows us to approximate the loss function of the current task using the Hessian matrix $H_{\theta_{A}}$ and neglect higher-order terms, as proposed in [31]:

\mathrm{log}P(\theta|X_{A})=\mathrm{log}P(\theta_{A}|X_{A})+\frac{1}{2}(\theta-\theta_{A})^{T}H_{\theta_{A}}(\theta-\theta_{A}).

(7)

By incorporating the Hessian matrix, we can effectively capture the curvature of the loss landscape and make more precise adjustments to the model’s parameters during the continual learning process. This approach helps mitigate the interference caused by the introduction of new tasks, allowing for smoother transitions and improved preservation of previously learned knowledge. This approximation allows us to simplify the calculations and make the optimization process more manageable in practice.

The Hessian matrix [46, 47] is a square matrix composed of second-order derivatives of a scalar-valued function. It provides valuable information about the local curvature of the function when it has multiple variables. Furthermore, it is important to highlight that the expectation value of the Hessian matrix is equivalent to the Fisher information matrix, as elucidated in the tutorial by Ly et al. [48]. The Fisher information matrix plays a fundamental role in statistical learning and is widely employed in various statistical estimation and inference tasks [48, 49, 50]. Recently, these methods have been extended to the quantum domain [51, 52]. Here we introduce the Fisher information matrix to this approximation for three major reasons mentioned in [42]. Firstly, it provides insights into the second derivative of the loss function around a minimum point, offering information about the local curvature. Secondly, it can be computed solely from first-order derivatives, making it computationally feasible for large models. Additionally, the Hessian matrix is guaranteed to be positive and semi-definite. Leveraging these characteristics, we can express the loss function of task $B$ by incorporating a diagonal precision matrix, which is constructed using the diagonal elements $F_{i}$ of the Fisher information matrix $F$ . To control the impact of the EWC constraint, we introduce the hyper-parameter $\lambda$ to represent its strength:

L(\theta)=L_{B}(\theta)+\frac{\lambda}{2}\sum_{i}F_{i}(\theta_{i}-\theta_{A,i})^{2}.

(8)

The primary objective of quantum continual learning is to achieve satisfactory performance on task $B$ while preserving relatively high performance on the original task $A$ . To accomplish this goal, the EWC method introduces regularization to the original loss function $L(B)$ to penalize deviations from the optimal solution obtained for task $A$ , taking into account the varying importance of different parameters. Specifically, the importance of the quantum classifier parameters is assessed using the Fisher information matrix. The diagonal elements of this matrix serve as weights for the penalty term. Intuitively, each Fisher information matrix corresponds to a Hessian matrix, with the diagonal elements representing the local curvature landscape along different variables and directions. These curvature landscapes are closely connected to the concept of gradients: larger curvatures indicate greater significance, suggesting that even slight shifts in a small range can result in significant differences in the value of the loss function.

In other words, parameters with larger curvatures should be less susceptible to modifications during the continual learning process. By incorporating the penalty based on the Fisher information matrix, the EWC method effectively protects these important parameters from catastrophic forgetting, allowing the model to retain valuable knowledge from task $A$ while adapting to new tasks. This explanation provides a high-level overview of the concepts behind quantum continual learning and how the EWC method mitigates the issue of catastrophic forgetting.

III Method for finding quantum universal perturbation

In our study, we aim to leverage the methods discussed in the previous section to generate universal perturbations that can deceive a set of classifiers performing different classification tasks. Typically, universal perturbations and attacks require specific sets of qubits and circuit parameters as contextual information. To simplify the analysis and strengthen the validation of the universal vulnerability of quantum classifiers, we focus on classifiers with the same number of qubits and identical circuit parameter structures, which we refer to as “homogeneous classifiers.”

By ensuring that the classifiers have the same number of qubits and circuit parameters, we ensure that they operate within the same Hilbert space and possess comparable learning capabilities. This allows us to suppress the interference caused by variations in qubit size and circuit structure. By working with “homogeneous classifiers”, we can confidently evaluate and demonstrate the universal vulnerability of quantum classifiers, as any observed universal perturbation or attack can be attributed to the shared characteristics of the classifiers rather than differences in their underlying architectures.

The methodology we employ is straightforward and allows us to approximate the effect of attacking two “homogeneous classifiers” as attacking a single classifier capable of performing two classification tasks. To demonstrate the universality of the vulnerability of quantum classifiers, we perform quantum adversarial machine learning algorithms on a quantum continual learning classifier that is capable of handling both tasks.

We start by training a robust quantum classifier using dataset $X_{A}$ for the first classification task, denoted as task $A$ . This initial training process aims to establish a solid foundation for the classifier’s performance on task $A$ . Subsequently, we apply the method of quantum continual training, along with the EWC method, to mitigate catastrophic forgetting. This approach involves training a new classifier while protecting the important parameters learned from task $A$ , denoted as $\theta_{A}$ . The process of continual learning can be intuitively understood as a merging process of the two classifiers, where the knowledge from task $A$ is retained while incorporating new knowledge from task $B$ . By the end of the continual learning process, we should obtain a mixed model with parameters $\theta_{AB}$ that can effectively classify both datasets $X_{A}$ and $X_{B}$ with high accuracy, typically around 90%.

Finally, to generate the universal perturbation, we utilize the “quantum-adapted basic iterative method (qBIM)” in conjunction with the average gradient computed from the entire dataset. The qBIM algorithm leverages the gradient information to iteratively generate perturbations that are universally effective across the trained classifiers, thereby showcasing the vulnerability of the quantum classifiers. By following this methodology, we can demonstrate the generation of universal perturbations and validate the vulnerability of quantum classifiers in the context of “homogeneous classifiers” performing different classification tasks.

III.1 Training a quantum continual learning model

Based on the theoretical analysis of continual learning, the classification tasks of the two classifiers must be independently distributed. To demonstrate the effectiveness of our approach, we have selected two distinct datasets: MNIST handwritten digits and MedNIST medical images.

The MNIST dataset, introduced by LeCun et al. [53], consists of grayscale images of handwritten digits ranging from $0$ to $9$ . On the other hand, the MedNIST dataset [54, 55] contains medical images of different body parts, such as the hand and breast. These two datasets are chosen due to their minimal correlations, making them suitable for approximating independent distributions. To ensure fairness in our experiments, we assume that the collected dataset samples from both MNIST and MedNIST are of the same size. We specifically select images of digits $1$ and $9$ from the MNIST dataset and corresponding images of the hand and breast from the MedNIST dataset. This enables us to create a binary classification task for our quantum classifier.

Before feeding the images into a quantum circuit, we preprocess them by rescaling their size from $28\times 28$ to $64\times 64$ . This ensures that we make full use of the available 12 active qubits and enables the images to be represented as logical wave functions, which can be input into the quantum circuit. Additionally, we normalize the images to ensure consistent and effective processing. Furthermore, to facilitate the calculation of cross-entropy loss, we convert the labels into a one-hot encoding format. In total, we create a dataset sample consisting of 1000 training samples and $200$ testing samples for constructing our classifier. By utilizing these datasets and following the aforementioned preprocessing steps, we can proceed with training and evaluating the performance of our quantum classifier on the binary classification task involving MNIST and MedNIST images.

To construct our quantum classifier, we utilize the Yao framework [56], which provides a platform for building quantum circuits. Our classifier consists of both parameterized layers and entangled layers. The parameterized layers in our quantum classifier are composed of the Rz-Rx-Rz circuit applied to each active qubit in the circuit. This circuit structure allows us to introduce rotations along the X-axis and Z-axis of the quantum state’s Bloch sphere. These rotations play a crucial role in manipulating the quantum state and extracting relevant information for classification. On the other hand, the entangled layers in our classifier involve the application of CNOT gates. These gates create entanglement between every qubit in the circuit, enabling the qubits to share and distribute quantum information across the system. This entanglement is instrumental in capturing complex relationships and correlations among the qubits, enhancing the expressive power of our quantum classifier [57]. For our specific implementation, we choose a quantum classifier with a depth of $20$ , utilizing $12$ qubits. Each depth corresponds to one parameterized layer followed by one entangled layer. By increasing the depth, we enable the classifier to capture more intricate patterns and dependencies within the input data. The chosen structure, proven to be robust and efficient in quantum machine learning, ensures that our quantum classifier is capable of effectively handling the classification tasks at hand.

The training process of our quantum classifier involves several steps, starting with training on the MNIST dataset and then applying the continual learning method to train on the MedNIST dataset. Throughout the process, we perform measurements on the last qubit of the quantum circuit, specifically qubit number $12$ , using single qubit state operators in the form of one-hot diagonal elements. Each operator represents a specific class in the classification task. During the training for a single classification task, we follow a specific set of operations. Firstly, we initialize the parameters of the quantum circuit with random values. Secondly, we record the data from both the MNIST and MedNIST databases as classical complex matrices in “.mat” files. Before applying the data to the quantum circuit in the Yao platform, we convert it into the form of a quantum state. Thirdly, for each iteration, the circuit calculates the current loss, accuracy, and fidelity value, and computes the gradients using the auto-differentiation functionality. Fourthly, the gradients are applied to update all parameters of the quantum circuit using the Adam optimizer, implemented in Flux.jl. Finally, when the predefined number of iterations is reached, the learning process concludes, returning the trained circuit parameters along with historical accuracy, loss, and fidelity.

In the first learning process on the MNIST dataset, we employ the “Flux.Adam” optimizer [58, 59] with a learning rate of $0.005$ . The training is performed with a batch size of 100 and for a total of 30 epochs. As a result, we obtain a model with an accuracy of 97.6% on the MNIST dataset.

In the subsequent classification task on the MedNIST dataset, we employ the EWC method to mitigate catastrophic forgetting. To implement this method, we first need to calculate the Fisher information matrix before the training process. The calculation of the Fisher information matrix can be performed through the following steps:

1.

Initialize the Fisher matrix as a square matrix of zeros with a size equal to the length of the circuit parameters.
2.

For each input sample in the batch, compute the expectation value, denoted as expect, and the gradient, denoted as grad. Then calculate the gradient for the Fisher matrix as $\texttt{grad\_fisher}=\texttt{grad}/\texttt{expect}$ .
3.

Add the value $\texttt{grad\_fisher}\times\texttt{grad\_fisher}^{\text{T}}$ to the Fisher matrix.
4.

Finally, take the mean value of the Fisher matrix over the batch size to obtain the resulting Fisher information matrix.

The training process for the MedNIST classification task follows a similar approach to the training on the MNIST dataset. However, there are a few differences. Firstly, the initial circuit parameters should be set as the circuit parameters obtained from the previously trained task. Secondly, during training, an additional term called the EWC punishment term, denoted as EWCpunish, is introduced. This term incorporates the EWC strength hyperparameter, $\lambda$ , the diagonal elements of the Fisher information matrix, fim, the current parameters of the continual training circuit, params, and the previously trained circuit parameters, pre_params. The expression for EWCpunish is as follows:

\texttt{EWCpunish}=\lambda\times\texttt{fim}\times(\texttt{params}-\texttt{pre\_params}).

(9)

In the second learning process, we keep the same learning rate and reduce the training epochs to $20$ to prevent overfitting. During this process, we also apply the EWC method, which introduces an additional parameter, $\lambda$ , to achieve the desired effect of continual learning. The choice of the optimal $\lambda$ value is heavily influenced by the characteristics of the dataset and the classifier. In our experiment, we set $\lambda$ to $750$ , which resulted in an average accuracy of $93.3\%$ for both classifications. Specifically, the accuracy for the first classification task is $94.5\%$ , while the accuracy for the second classification task is $92.0\%$ . Despite a slight decrease in the performance of the merged classifier on the first task, both classification accuracies remain at an acceptable level of around $90\%$ . The training process is presented in Fig. 1.

Refer to caption — (a) The training process for the first task

III.2 Applying quantum Universal Perturbation

During the perturbation process, we employ the “quantum basic iteration method (qBIM)” to generate adversarial examples and evaluate the effectiveness of the attack at different perturbation strengths. The qBIM method is a variation of the well-known Fast Gradient Sign Method (FGSM). Unlike FGSM, which generates adversarial examples in a single step, qBIM iteratively applies FGSM with a small step size. This iterative approach allows for adjustments in the perturbation direction during adversarial machine learning, resulting in more robustly generated samples. Let $\epsilon$ denote the step size, $x_{k}$ represent an arbitrary sample from the collective dataset ${X_{A},X_{B}}$ , $\pi_{C}$ denote the projection operator that normalizes the wave function, and $L$ denote the loss function. At each iteration step $k+1$ , the qBIM algorithm can be described as follows:

x_{k+1}=\pi_{C}[x_{k}+\epsilon\times\mathrm{sgn}(\nabla L(F(X_{A},X_{B})))].

(10)

To provide a more detailed explanation, here is a pseudo-code representation of the quantum-adapted basic iterative method (qBIM) for finding effective universal perturbation, as presented in Algorithm 1:

Algorithm 1 Quantum-adapted basic iterative method

1:The trained model

F

,loss function

L

, step size

\epsilon

, number of iteration

n

, legitimate datasets

X_{A}

and

X_{B}

, size of dataset

num

2:Adversarial sample

X_{adv}

3:put the two datasets together into new dataset

X_{all}=(X_{A},X_{B})

4:put

X_{all}

into quantum state

\left|X_{all}\right\rangle

5:copy variable

X_{all}

into newly built variable

X_{adv}

6:put

X_{adv}

into quantum state

\left|X_{adv}\right\rangle

7:for every sample in collective dataset

X_{all}

8: calculate the gradient

J_{i}=\nabla L(F(X_{A},X_{B}))

9:end for

10:gradient information

J=\textbf{mean}(\sum_{i=1}^{num}(J_{i}))

11:for

k=1,...,n

12: put

\left|X_{adv}\right\rangle

into classifier to get result

F(\left|X_{adv}\right\rangle)

13: calculate the accuracy of

F(\left|X_{adv}\right\rangle)

and record

14: calculate the loss of

F(\left|X_{adv}\right\rangle)

and record

15: calculate the fidelity between

X_{all}

and

X_{adv}

and record

16: for every sample

i

in collective dataset

X_{all}

17:

X_{adv}[i]=X_{adv}[i]+\epsilon\times\mathrm{sgn}(J)

18:

X_{adv}[i]/=\mathrm{norms}(X_{adv}[i])

19: end for

20:end for

21:return

X_{adv}

We conducted a perturbation experiment on the concatenated dataset of the two trained tasks using $30$ iterations and a total perturbation strength of $0.02$ . In each iteration, we calculated the gradient information of the initial unperturbed $1024$ circuit input pixels to determine the perturbation direction. Specifically, we computed the mean value of the pixel-wise gradient for all $400$ samples ( $200$ samples for each classification task) to indicate the direction of gradient ascent, which served as the perturbation direction. The perturbation itself was obtained by multiplying the perturbation-per-step value with the sign of the gradient value. During each attacking step, we first apply the calculated perturbation to every single sample from two classification tasks to generate universal adversarial examples. Then, we clip the pixel value of universal adversarial examples within the range of $[0,1)$ to avoid meaningless values. Furthermore, we normalize the adversarial example after clipping to ensure that the generated adversarial example is in a legal quantum state.

During the perturbation process, the average accuracy of the two datasets decreased from $93.3\%$ to $28.5\%$ with fidelity of $0.79$ . Here we present a curve illustration in Fig. 2. Specifically, the accuracy of the individual tasks dropped from $94.5\%$ and $92.0\%$ to $24.5\%$ (fidelity = $0.84$ ) and $32.5\%$ (fidelity = $0.76$ ), respectively. This significant reduction in accuracy demonstrates that the universal perturbation generated by the mean value of the gradient for all dataset samples is capable of deceiving the classifier on both classification tasks almost completely. Additionally, it is worth noting that the accuracy and fidelity of the tasks do not change synchronously. The accuracy of the previously trained task experiences a rapid decline at the beginning of the perturbation process and then stabilizes at a higher fidelity. On the other hand, the later-trained task remains less sensitive to perturbation, maintaining a higher accuracy and lower fidelity compared to the previously trained task. This observation suggests that the previously trained task is more vulnerable, as the classifier’s performance deteriorates significantly even with a smaller amount of perturbation.

Here, we plot some examples of universal adversarial samples after calculation in Fig. 3. For samples in MNIST handwritten digits, the distribution of non-zero value pixels is rather concentrated to the track of digit, thus the probability distribution after perturbing is almost unchanged except for the blurring at margin areas. However, the samples in the MedNIST dataset tend to distribute more evenly, resulting in a shallow color due to normalization steps. All of the adversarial samples listed above are wrongly classified into the other class by the quantum classifier with fidelity of around $51\%$ to $52\%$ which is just enough to deceive the classifier. We retain such a level of deceiving fidelity to minimize the perturbation and try to keep the fidelity value as high as possible.

IV Further Experiments

The quantum classifier mentioned in previous paragraphs is the fully connected quantum classifier in particular. Meanwhile, all datasets used in numerical experiments are classical datasets previously. Such limitations might weaken our conclusion. Thus, to verify the effectiveness of the adversarial machine learning method in the above chapters, we also conducted further experiments on the quantum convolutional neural network(QCNN) classifier and the quantum data respectively.

IV.1 The QCNN scenario

The QCNN classifier is a powerful classification model in the realm of quantum classifiers. According to the numerical experiment presented in Ref.[11], the QCNN classifier is applied to $28\times 28$ MNIST classification task. The QCNN classifier gets a similar performance to the classical CNN classifier and outperforms the fully connected classifier. This provides sufficient evidence that the QCNN classifier structure mentioned in Ref.[11] is a promising tool for classification tasks.

In our experiment, we follow Ref.[11] and construct a QCNN classifier with a similar structure on $12$ qubits. Specifically, our QCNN classifier contains $2$ convolutional layers, $2$ pooling layers, and $20$ fully connected layers. The convolutional layers find hidden states by applying rotational gates and controlled gates on adjacent qubits. The pooling layers use controlled rotational gates to reduce the qubits firstly from $12$ to $6$ , then from $6$ to $3$ . At last, the fully connected layers are applied with a depth of $20$ on the $3$ remaining qubits. During the decoding process, a measurement gate is applied to the last qubit of the circuit.

In the first training process, we set the learning rate to $0.005$ with $30$ training epochs. The dataset used for training is MNIST handwritten digits, the same as the previously mentioned experiment. A small fluctuation can be observed around epoch $10$ , but the QCNN classifier reached a state-of-the-art accuracy above $95\%$ around epoch $15$ . As mentioned in plenty of previous works, the QCNN classifier performs very well on a single classification task. Given the same datasets, hyper-parameter settings, and computing resources, the QCNN classifier trains much more quickly than the fully connected quantum classifier. Also, the QCNN classifier gets a lower loss than the fully connected quantum classifier after training, which leads to higher confidence in binary classification.

In the second training process, we keep the learning rate fixed at $0.005$ , while changing the training steps to only $10$ training epochs to avoid over-fitting. Also, we raise the additional parameter $\lambda$ in the EWC method to $2000$ to protect the previously trained task. The training dataset is still the MedNIST hand and breast MRI images. After training, the QCNN classifier reaches an average accuracy of $91\%$ on both training tasks, which is $92.1\%$ on the first classification task and $89.7\%$ on the second classification task. The training curve of both training processes of the QCNN classifier is presented in Fig.4.

Here, we want to point out the potential threat of the QCNN classifier when applying it to continuous learning scenarios. Unlike fully connected quantum classifiers, the QCNN classifier contains a relatively small number of parameters. Intuitively, fewer training parameters lead to lower generalization capability of the given classifier. This intuition is perfectly verified in our experiment as we find out that the QCNN classifier can perform well on one training task but cannot maintain similar performance on multiple training tasks. Specifically, continuous drastic fluctuations can be observed in the training curves. We attempt to solve this issue by increasing the depth of the fully connected layer. Such an approach indeed alleviates the problem of insufficient training parameters but still requires careful adjustment of hyper-parameters.

During the attacking process, we take similar hyper-parameter settings to the previous experiment, namely $0.02$ total perturbation strength, and $30$ perturbing iterations. The average accuracy of the two datasets decreased from $91\%$ to $34\%$ with fidelity of $81.5\%$ . The accuracy of individual tasks dropped from 92.1% and $89.7\%$ to $31.5\%$ (fidelity= $0.85$ ) and $36.5\%$ (fidelity= $0.78$ ). The attack process is drawn as a curve in Fig.5. Such an attacking process of the QCNN classifier is consistent with the situation of the fully connected quantum classifier. This result indicates that the vulnerability of the QCNN and the fully connected quantum classifier is similar.

IV.2 Quantum data scenario

The quantum classifiers are specially gifted to process quantum data. According to the evidence provided in Ref.[21, 12], quantum data can be efficiently learned by quantum classifiers and attacked by adversarial machine learning algorithms.

In our experiment, we choose a 12-qubit symmetry-protected topological (SPT) state dataset similar to Ref.[12] to verify the effectiveness of universal perturbation. Specifically, in the dataset containing SPT (Symmetry Protected Topological) states, we examine a one-dimensional cluster-Ising model featuring periodic boundary conditions. Suppose $\hat{\sigma}_{x}^{i}$ , $\hat{\sigma}_{y}^{i}$ , $\hat{\sigma}_{z}^{i}$ are Pauli matrices and $\lambda$ represents the relative strength of adjacent neighbour interaction. The Hamiltonian of this model can be put as the equation below:[12]

H(\lambda)=-\sum_{j=1}^{N}\hat{\sigma}_{x}^{(j-1)}\hat{\sigma}_{z}^{(j)}\hat{\sigma}_{x}^{(j+1)}+\lambda\sum_{j=1}^{N}\hat{\sigma}_{y}^{(j)}\hat{\sigma}_{y}^{(j+1)}

(11)

The model experiences a persistent quantum phase shift at $\lambda=1$ , effectively distinguishing between two distinct phases. For $\lambda$ values less than $1$ , it enters a cluster phase characterized by nonlocal hidden order. In contrast, for $\lambda$ values greater than $1$ , it transitions into an antiferromagnetic phase with well-defined long-range order and a significant staggered magnetization. To comprehensively explore this transition, we systematically sweep $\lambda$ across the range from $0$ to $2$ , utilizing intervals of $0.001$ . We collect the resulting ground states at each interval, establishing them as our dataset for both training and testing purposes. We introduce the quantum data SPT in the second training process.

In the first training process, we keep the learning rate at $0.005$ and train for $30$ epochs on the MNIST handwritten dataset. The training is conducted on a fully connected quantum classifier on $12$ qubits with a depth of $20$ . The training process generates state-of-the-art accuracy above $95\%$ on the MNIST handwritten dataset.

In the second training process, we keep the learning rate at $0.005$ but lower the training epochs to $20$ . The additional parameter $\lambda$ in the EWC method is adjusted to $500$ . The training dataset is the previously generated SPT dataset. After training, the fully connected quantum classifier reaches an average accuracy of $98\%$ on both training tasks, which is $98.6\%$ on the first classification task and $97.5\%$ on the second training task. The training curve of both training processes is presented in Fig.6.

During the attacking process, we lower the total perturbation strength to $0.015$ and the perturbing iteration to $15$ . The average accuracy of the two datasets decreased from $98\%$ to $20.3\%$ with fidelity of $0.63$ . The accuracy of individual tasks dropped from $98.6\%$ and $97.5\%$ to $4\%$ and $36.5\%$ . The attack process is drawn as a curve in Fig.7. After introducing quantum data, the continuous model becomes more fragile than the cases before introducing quantum data. It can be observed in Fig.7 that the second classification task on the SPT quantum dataset encounters a sharp decrease in the first attacking iterations.

V Conclusion and Discussion

This work presents a small glimpse into the emerging field of quantum adversarial machine learning, especially in finding universal perturbations. Typically, we first trained a classifier that can achieve almost state-of-the-art accuracy on two classification tasks with quantum continual learning techniques and elastic weight consolidation method to avoid catastrophic forgetting. The datasets of classification tasks are taken in binary form and encoded into quantum states. By applying the aforementioned quantum-adapted basic iterative method, we generated a universal perturbation that can deceive the classifier on both tasks conclusively with limited perturbation strength and comparably high fidelity. However, there are still numerous questions that are worth further exploration.

One of the challenges in this work is identifying which segment of the perturbation is useful. While the gradient consists of balanced information from the two classifiers trained on different tasks, it is difficult to precisely identify the boundary of a given classification task in practice. Mathematically characterizing this border remains an open question in the field. Based on previous numerical experiments in Ref. [30], gradient information of one particular classification task can be generated into universal perturbation through likewise approaches. In this work, we found that the universal perturbation generated by gradient information of two different and independently distributed classification tasks can also deceive the quantum classifier successfully. It is essential to develop robust tools and techniques that can effectively discern the useful part of the universal perturbation generated by different tasks’ gradients. Further research works addressing this issue might provide a clearer understanding of why universal perturbation can lower the accuracy of quantum classifiers.

Furthermore, it is important to note that the universal perturbation used in this work focuses specifically on supervised learning binary classification scenarios. This topic can be extended to multi-classification tasks and unsupervised or reinforcement learning scenarios. Developing effective methods for generating universal perturbations in these contexts is an ongoing research area. Currently, the existing study of quantum adversarial machine learning along this direction is relatively limited and requires further investigations.

As observed from the above results, there were notable differences between the previously trained task and the later trained task during the perturbation process. The success of a quantum continual learning process lies in the application of the EWC method, which aims to protect crucial parameters to maintain the robustness of quantum classifiers across multiple training tasks. This suggests that certain circuit parameters hold greater importance compared to others. The previously trained classification task, having experienced a loss of specific important parameters, remains more vulnerable compared to the later trained classification task. This phenomenon might hint at a potential correlation between circuit parameters and classifier performance. Investigating the characteristics of continual learning in this context would be an intriguing avenue for further exploration.

It is worth noting that our work primarily focuses on classifiers with identical structures and datasets of equal sizes. However, in future endeavors involving universal quantum attacks, it would be valuable to explore attacking techniques that can be applied to classifiers with diverse structures and datasets of varying sizes.

The topic of defense strategy is also an interesting direction to be discovered. As far as we are concerned, adversarial training, randomized encoding, and quantum noise are several defense strategies that have proven to be useful for quantum adversarial attacks. Adversarial training defends adversarial attacks by putting adversarial samples into the training set so that the quantum classifier will be immune to similar attacks after training. In Ref.[16], the authors provide nice explanations and numerical experiments about this approach. Randomized encoding was also carefully discussed in Ref.[20]. By randomly encoding the legitimate data samples through unitary or quantum error correction encoders, the authors provide an interesting approach to protect quantum classifiers from adversarial attacks. Moreover, in Ref.[60], authors also show that quantum noises can protect quantum classifiers from adversaries. However, adopting similar defense strategies in continual learning settings still needs substantial effort. Hence, we shall leave the discussion of defense strategy for future investigation.

VI Acknowledgements

We thank Dong-Ling Deng, Weikang Li, and Si Jiang for helpful inspiration and discussion. We acknowledge Weikang Li, Si Jiang, Wenjie Jiang, Weiyuan Gong, and Sirui Lu for discussing and sharing their numerical simulation codes.

Appendix A Specific Numerical Experiment Details

The quantum classifier used in this research was initially proposed by Ref. [12]. All numerical results are performed on Yao.jl framework [56, 61].

Yao is a highly efficient computing software that simulates quantum computing behavior with automatic differentiation and provides a vast range of powerful assistant interfaces for research. Now, such a tool is attracting more and more researchers around the globe.

Typically, for classical image data like MNIST and MedNIST, we need to encode them into logical quantum states before putting them into the quantum circuit for calculation. We are using amplitude encoding for the $64\times 64$ rescaled input data because this encoding technique can greatly reduce the quantum qubit space complexity from $\Theta(N)$ to $\Theta(\mathrm{log}(N))$ . Considering the lack of computational resources of qubit size, reducing the number of necessary qubits is a very important philosophy to enhance the potential computational power of a quantum circuit. To implement amplitude encoding, we simply create a “zero-state” quantum state of size 12 and assign the initial value in matrix form through the “.state” interface.

An illustration of a section of our trained quantum circuit is shown in Fig.8. Specifically, we put one “Parameterized Layer” and one “Entangled Layer” together as one depth of our quantum circuit. In the “Parameterized Layer”, we apply sequential quantum rotation gates on X-Z-X respectively. For arbitrary parameter $\theta$ , gate $R_{X}$ is equivalent to $e^{i\theta/2X}$ , and gate $R_{Z}$ is equivalent to $e^{i\theta/2Z}$ . For every gate in the quantum circuit, we take the angles $\theta$ as the parameters to be optimized in the training process. Such structures were reported and tested to be robust and hardware-efficient in Ref. [12]. The “Entangled Layer” on the other hand is a composite of single-layered controlled not gates to create entanglement between different qubits. As mentioned in Ref. [62, 63], entanglements are a vital key to the great potential of quantum computing.

Here, we take measurements on the last qubit of our quantum circuit. Without a doubt, similar measurements can also be made on other qubit indexes with similar philosophies. Speaking of measurement, we want to highlight that we are using a threshold of probability of $0.5$ . In other words, for binary implementation, we set two measurement operators, namely $op0$ and $op1$ . The classification decision is made by determining on which operator the measured probability exceeds the threshold level. More interestingly, we provide insight into performing multiple classifications in this context by setting multiple measuring operators and multiple measuring qubits. Technically, provided that we are doing a $N$ classification task, we can take measurements on $\mathrm{log}(N)$ number of qubits and discern them in a similar way.

In order to achieve a stable performance, we take 12 qubits and 20 composite layers to provide comparably sufficient parameters for initial training and quantum continual training. Based on our numerical experiments and duplicate verification, such a choice of parameters can provide a robust and efficient quantum classifier.

The QCNN classifier refers to a similar technical routine with fully connected quantum classifiers, the structure can be seen in Fig.9.

Appendix B Technical details for EWC method

In the above sections, we provide a brief explanation of the elastic weight method. To be specific, we provide some technical details for the EWC method here in the appendix. Suppose the Gaussian distribution equation is defined as $N(\mu,\sigma)$ and we use a function $f(\theta)$ to mark the logarithmic value of this probability, then we can do the calculation as follow:

P(X_{A}|\theta)=\frac{1}{\sqrt{2\pi}\sigma}e^{-\frac{(\theta-\mu)^{2}}{2\sigma^{2}}}

(12)

f(\theta)=\mathrm{log}P(X_{A}|\theta)

(13)

Let the optimal solution be $\theta_{A}$ , then the equation after Taylor expansion is as follows:

\begin{cases}f^{\prime}(\theta_{A})=0\\ f(\theta)=f(\theta_{A})+f^{\prime}(\theta_{A})(\theta-\theta_{A})+f^{\prime\prime}(\theta_{A})\frac{(\theta-\theta_{A})^{2}}{2}\end{cases}

(14)

Since the element $\mathrm{log}\frac{1}{\sqrt{2\pi}\sigma}$ and element $f(\theta_{A})$ are constant, we can simplify the above equation as follow:

-\frac{(\theta-\mu)^{2}}{2\sigma^{2}}=f^{\prime\prime}(\theta_{A})\frac{(\theta-\theta_{A})^{2}}{2}

(15)

Then after the above calculation we can get results: $\mu=\theta_{A}$ and $\sigma^{2}=-\frac{1}{f^{\prime\prime}(\theta_{A})}$ . Now we can simulate the posterior probability with an acceptable error range. According to Bayes’s rules, the distribution form of $p(\theta|X_{A})$ and $p(X_{A}|\theta)$ is the same, the proof is trivial. But this form is not efficient to calculate, more transformations need to be applied.

The Fisher information matrix [64, 65, 48] is the covariance of the probability distribution gradient. Officially, the Fisher information is a measurement of the amount of information that an observable random variable $X$ carries about an unknown parameter $\theta$ of a distribution that models $X$ . For batched input $X={X_{1},X_{2},\ ...\ ,X_{n}}$ , the Fisher information matrix can be taken down as:

F=\frac{1}{n}\sum_{i=1}^{n}\nabla\mathrm{log}P(X_{i}|\theta)\nabla\mathrm{log}P(X_{i}|\theta)^{T}

(16)

Unfortunately, such a form is still not friendly for calculation, we can further simplify the above equation using the Hessian matrix. The Hessian matrix [46, 47] $H$ is a square matrix that is a composite of second-order partial derivatives of a scalar-valued function, describing the local curvature of a function of many variables. For machine learning, the Hessian matrix is very easy to acquire, the question is how to link the Hessian matrix and the Fisher information matrix. The answer is that the Fisher information matrix is the negative expectation of the Hessian matrix. To prove this, we can take two steps: transform the Hessian matrix $H$ and calculate its expectation. The proof is as follows.

First, we rewrite the Hessian matrix in a novel form of partial derivatives:

\displaystyle\begin{aligned} &H_{\mathrm{log}P(X|\theta)}=J(\nabla\mathrm{log}P(X|\theta))=J(\frac{\nabla P(X|\theta)}{P(X|\theta)})\\ &=\frac{H_{P(X|\theta)}P(X|\theta)-\nabla P(X|\theta)\nabla P(X|\theta)^{T}}{P(X|\theta)^{2}}\\ &=\frac{H_{P(X|\theta)}}{P(X|\theta)}-(\frac{\nabla P(X|\theta)}{P(X|\theta)})(\frac{\nabla P(X|\theta)}{P(X|\theta)})^{T}\end{aligned}

(17)

Second, we calculate the expectation of the Hessian matrix in the partial derivative form:

\displaystyle\begin{aligned} &\underset{P(X|\theta)}{\textbf{E}}[\frac{H_{P(X|\theta)}}{P(X|\theta)}-(\frac{\nabla P(X|\theta)}{P(X|\theta)})(\frac{\nabla P(X|\theta)}{P(X|\theta)})^{T}]\\ &=\int\frac{H_{P(X|\theta)}}{P(X|\theta)}P(X|\theta)d\theta-\underset{P(X|\theta)}{\textbf{E}}[\nabla\mathrm{log}P(X|\theta)\nabla\mathrm{log}P(X|\theta)^{T}]\\ &=\int H_{P(X|\theta)}d\theta-F=-F\end{aligned}

(18)

By applying an intuitive mathematical trick, we successfully find a solution to link the Hessian matrix and the Fisher information matrix.

On the other hand, for 1-dimensional functions, the Hessian matrix has property as follows:

H_{\mathrm{log}P(X|\theta)}=(\mathrm{log}P(X|\theta))^{\prime\prime}

(19)

Thus, for the Fisher information matrix of parameter $\theta_{i}$ and dataset with $n$ samples, we can rewrite in second derivative form as follow:

\displaystyle F_{i}=-\underset{P(X|\theta)}{\textbf{E}}[H_{\mathrm{log}P(X|\theta)}]=-\frac{1}{n}\sum f_{i}^{\prime\prime}(\theta_{A,i})

(20)

For easier calculation, we import fisher information matrix $\mathbf{F}$ and hyper-parameter $\lambda$ to evaluate the importance of element2 to element1. We can put the calculation above as:

		$\displaystyle\underset{\theta}{\mathrm{min}}\ (\mathrm{log}p(\theta\|X_{A}))=\underset{\theta}{\mathrm{min}}\ (f^{\prime\prime}(\theta_{A})\frac{(\theta-\theta_{A})^{2}}{2})$		(21)
		$\displaystyle=\underset{\theta}{\mathrm{min}}\ (\frac{\lambda}{2}\sum_{i}\mathbf{(}F)_{i}(\frac{(\theta{i}-\theta_{Ai})^{2}}{2}))$		(21)

References

Das Sarma et al. [2019] S. Das Sarma, D.-L. Deng, and L.-M. Duan, Machine learning meets quantum physics, Phys. Today 72, 48 (2019).
Cerezo et al. [2022] M. Cerezo, G. Verdon, H.-Y. Huang, L. Cincio, and P. J. Coles, Challenges and opportunities in quantum machine learning, Nature Computational Science 2, 567 (2022).
Li and Deng [2022] W. Li and D.-L. Deng, Recent advances for quantum classifiers, Science China Physics, Mechanics & Astronomy 65, 220301 (2022).
Huang et al. [2022a] H.-Y. Huang, M. Broughton, J. Cotler, S. Chen, J. Li, M. Mohseni, H. Neven, R. Babbush, R. Kueng, J. Preskill, et al., Quantum advantage in learning from experiments, Science 376, 1182 (2022a).
Pan et al. [2023] X. Pan, Z. Lu, W. Wang, Z. Hua, Y. Xu, W. Li, W. Cai, X. Li, H. Wang, Y.-P. Song, et al., Deep quantum neural networks on a superconducting processor, Nat. Commun. 14, 4006 (2023).
Wu et al. [2023] Y.-D. Wu, G. Chiribella, and N. Liu, Quantum-enhanced learning of continuous-variable quantum states, arXiv preprint arXiv:2303.05097 (2023).
Biamonte et al. [2017] J. Biamonte, P. Wittek, N. Pancotti, P. Rebentrost, N. Wiebe, and S. Lloyd, Quantum machine learning, Nature 549, 195 (2017).
Xiao et al. [2022] T. Xiao, J. Fan, and G. Zeng, Parameter estimation in quantum sensing based on deep reinforcement learning, npj Quantum Information 8, 2 (2022).
Huang et al. [2022b] H.-Y. Huang, R. Kueng, G. Torlai, V. V. Albert, and J. Preskill, Provably efficient machine learning for quantum many-body problems, Science 377, eabk3333 (2022b).
Xiao et al. [2023] T. Xiao, X. Zhai, X. Wu, J. Fan, and G. Zeng, Practical advantage of quantum machine learning in ghost imaging, Communications Physics 6, 171 (2023).
Oh et al. [2020] S. Oh, J. Choi, and J. Kim, A tutorial on quantum convolutional neural networks (qcnn), in 2020 International Conference on Information and Communication Technology Convergence (ICTC) (IEEE, 2020) pp. 236–239.
Li et al. [2022] W. Li, Z. Lu, and D.-L. Deng, Quantum neural network classifiers: A tutorial, SciPost Physics Lecture Notes , 061 (2022).
Bausch [2020] J. Bausch, Recurrent quantum neural networks, Advances in neural information processing systems 33, 1368 (2020).
Liu et al. [2022] Z. Liu, P.-X. Shen, W. Li, L.-M. Duan, and D.-L. Deng, Quantum capsule networks, Quantum Sci. Technol. 8, 015016 (2022).
Shen et al. [2021] P. Shen, W. Jiang, W. Li, Z. Lu, and D. Deng, Adversarial learning in quantum artificial intelligence, Acta Phys. Sin. 70, 140302 (2021).
Lu et al. [2020] S. Lu, L.-M. Duan, and D.-L. Deng, Quantum adversarial machine learning, Physical Review Research 2, 033212 (2020).
Liao et al. [2021] H. Liao, I. Convy, W. J. Huggins, and K. B. Whaley, Robust in practice: Adversarial attacks on quantum machine learning, Physical Review A 103, 042427 (2021).
Edwards and Rawat [2020] D. Edwards and D. B. Rawat, Quantum adversarial machine learning: Status, challenges and perspectives, in 2020 Second IEEE International Conference on Trust, Privacy and Security in Intelligent Systems and Applications (TPS-ISA) (IEEE, 2020) pp. 128–133.
Guan et al. [2021] J. Guan, W. Fang, and M. Ying, Robustness verification of quantum classifiers, in Computer Aided Verification: 33rd International Conference, CAV 2021, Virtual Event, July 20–23, 2021, Proceedings, Part I 33 (Springer, 2021) pp. 151–174.
Gong et al. [2022] W. Gong, D. Yuan, W. Li, and D.-L. Deng, Enhancing quantum adversarial robustness by randomized encodings, arXiv preprint arXiv:2212.02531 (2022).
Jiang et al. [2019] S. Jiang, S. Lu, and D.-L. Deng, Adversarial machine learning phases of matter, arXiv preprint arXiv:1910.13453 (2019).
Zhang et al. [2022] H. Zhang, S. Jiang, X. Wang, W. Zhang, X. Huang, X. Ouyang, Y. Yu, Y. Liu, D.-L. Deng, and L.-M. Duan, Experimental demonstration of adversarial examples in learning topological phases, Nature communications 13, 4993 (2022).
Das and Suganthan [2010] S. Das and P. N. Suganthan, Differential evolution: A survey of the state-of-the-art, IEEE transactions on evolutionary computation 15, 4 (2010).
Chen et al. [2017] P.-Y. Chen, H. Zhang, Y. Sharma, J. Yi, and C.-J. Hsieh, Zoo: Zeroth order optimization based black-box attacks to deep neural networks without training substitute models, in Proceedings of the 10th ACM workshop on artificial intelligence and security (2017) pp. 15–26.
Goodfellow et al. [2014] I. J. Goodfellow, J. Shlens, and C. Szegedy, Explaining and harnessing adversarial examples, arXiv preprint arXiv:1412.6572 (2014).
Madry et al. [2017] A. Madry, A. Makelov, L. Schmidt, D. Tsipras, and A. Vladu, Towards deep learning models resistant to adversarial attacks, arXiv preprint arXiv:1706.06083 (2017).
Dong et al. [2018] Y. Dong, F. Liao, T. Pang, H. Su, J. Zhu, X. Hu, and J. Li, Boosting adversarial attacks with momentum, in Proceedings of the IEEE conference on computer vision and pattern recognition (2018) pp. 9185–9193.
Ren et al. [2022] W. Ren, W. Li, S. Xu, K. Wang, W. Jiang, F. Jin, X. Zhu, J. Chen, Z. Song, P. Zhang, et al., Experimental quantum adversarial learning with programmable superconducting qubits, Nature Computational Science 2, 711 (2022).
Moosavi-Dezfooli et al. [2017] S.-M. Moosavi-Dezfooli, A. Fawzi, O. Fawzi, and P. Frossard, Universal adversarial perturbations, in Proceedings of the IEEE conference on computer vision and pattern recognition (2017) pp. 1765–1773.
Gong and Deng [2022] W. Gong and D.-L. Deng, Universal adversarial examples and perturbations for quantum classifiers, National Science Review 9, nwab130 (2022).
Jiang et al. [2022] W. Jiang, Z. Lu, and D.-L. Deng, Quantum continual learning overcoming catastrophic forgetting, Chinese Physics Letters 39, 050303 (2022).
McCloskey and Cohen [1989] M. McCloskey and N. J. Cohen, Catastrophic interference in connectionist networks: The sequential learning problem, in Psychology of learning and motivation, Vol. 24 (Elsevier, 1989) pp. 109–165.
Robins [1995] A. Robins, Catastrophic forgetting, rehearsal and pseudorehearsal, Connection Science 7, 123 (1995).
French [1999] R. M. French, Catastrophic forgetting in connectionist networks, Trends in cognitive sciences 3, 128 (1999).
Huang et al. [2011] L. Huang, A. D. Joseph, B. Nelson, B. I. Rubinstein, and J. D. Tygar, Adversarial machine learning, in Proceedings of the 4th ACM workshop on Security and artificial intelligence (2011) pp. 43–58.
Vorobeychik and Kantarcioglu [2018] Y. Vorobeychik and M. Kantarcioglu, Adversarial machine learning, Synthesis Lectures on Artificial Intelligence and Machine Learning 12, 1 (2018).
LeCun et al. [2015] Y. LeCun, Y. Bengio, and G. Hinton, Deep learning, nature 521, 436 (2015).
Cunningham et al. [2008] P. Cunningham, M. Cord, and S. J. Delany, Supervised learning, Machine learning techniques for multimedia: case studies on organization and retrieval , 21 (2008).
Osisanwo et al. [2017] F. Osisanwo, J. Akinsola, O. Awodele, J. Hinmikaiye, O. Olakanmi, J. Akinjobi, et al., Supervised machine learning algorithms: classification and comparison, International Journal of Computer Trends and Technology (IJCTT) 48, 128 (2017).
Dunjko and Briegel [2018] V. Dunjko and H. J. Briegel, Machine learning & artificial intelligence in the quantum domain: a review of recent progress, Reports on Progress in Physics 81, 074001 (2018).
Yang et al. [2009] G. Yang, F. Pan, and W.-B. Gan, Stably maintained dendritic spines are associated with lifelong memories, Nature 462, 920 (2009).
Kirkpatrick et al. [2017] J. Kirkpatrick, R. Pascanu, N. Rabinowitz, J. Veness, G. Desjardins, A. A. Rusu, K. Milan, J. Quan, T. Ramalho, A. Grabska-Barwinska, et al., Overcoming catastrophic forgetting in neural networks, Proceedings of the national academy of sciences 114, 3521 (2017).
Stone [2013] J. V. Stone, Bayes’ rule: a tutorial introduction to bayesian analysis, (2013).
Chopin [2011] N. Chopin, Fast simulation of truncated gaussian distributions, Statistics and Computing 21, 275 (2011).
Pourahmadi [1984] M. Pourahmadi, Taylor expansion of and some applications, The American Mathematical Monthly 91, 303 (1984).
Thacker [1989] W. C. Thacker, The role of the hessian matrix in fitting models to measurements, Journal of Geophysical Research: Oceans 94, 6177 (1989).
Mizutani [2008] E. Mizutani, A tutorial on stagewise backpropagation for efficient gradient and hessian evaluations, SCIS & ISIS SCIS & ISIS 2008 , 68 (2008).
Ly et al. [2017] A. Ly, M. Marsman, J. Verhagen, R. P. Grasman, and E.-J. Wagenmakers, A tutorial on fisher information, Journal of Mathematical Psychology 80, 40 (2017).
Kunstner et al. [2019] F. Kunstner, P. Hennig, and L. Balles, Limitations of the empirical fisher approximation for natural gradient descent, Advances in neural information processing systems 32 (2019).
Frieden [2000] B. R. Frieden, Physics from fisher information: a unification (2000).
Petz and Ghinea [2011] D. Petz and C. Ghinea, Introduction to quantum fisher information, in Quantum probability and related topics (World Scientific, 2011) pp. 261–281.
Liu et al. [2020] J. Liu, H. Yuan, X.-M. Lu, and X. Wang, Quantum fisher information matrix and multiparameter estimation, Journal of Physics A: Mathematical and Theoretical 53, 023001 (2020).
LeCun et al. [2010] Y. LeCun, C. Cortes, C. Burges, et al., Mnist handwritten digit database (2010).
Yang et al. [2021] J. Yang, R. Shi, and B. Ni, Medmnist classification decathlon: A lightweight automl benchmark for medical image analysis, in IEEE 18th International Symposium on Biomedical Imaging (ISBI) (2021) pp. 191–195.
Yang et al. [2023] J. Yang, R. Shi, D. Wei, Z. Liu, L. Zhao, B. Ke, H. Pfister, and B. Ni, Medmnist v2-a large-scale lightweight benchmark for 2d and 3d biomedical image classification, Scientific Data 10, 41 (2023).
Luo et al. [2020] X.-Z. Luo, J.-G. Liu, P. Zhang, and L. Wang, Yao. jl: Extensible, efficient framework for quantum algorithm design, Quantum 4, 341 (2020).
Deng et al. [2017] D.-L. Deng, X. Li, and S. Das Sarma, Quantum entanglement in neural network states, Phys. Rev. X 7, 021021 (2017).
Innes [2018] M. Innes, Flux: Elegant machine learning with julia, Journal of Open Source Software 3, 602 (2018).
Kingma and Ba [2014] D. P. Kingma and J. Ba, Adam: A method for stochastic optimization, arXiv preprint arXiv:1412.6980 (2014).
Du et al. [2021] Y. Du, M.-H. Hsieh, T. Liu, D. Tao, and N. Liu, Quantum noise protects quantum classifiers against adversaries, Physical Review Research 3, 023153 (2021).
Bezanson et al. [2012] J. Bezanson, S. Karpinski, V. B. Shah, and A. Edelman, Julia: A fast dynamic language for technical computing, arXiv preprint arXiv:1209.5145 (2012).
Ugwuishiwu et al. [2020] C. Ugwuishiwu, U. Orji, C. Ugwu, and C. Asogwa, An overview of quantum cryptography and shor’s algorithm, Int. J. Adv. Trends Comput. Sci. Eng 9 (2020).
Bulger et al. [2003] D. Bulger, W. P. Baritompa, G. R. Wood, et al., Implementing pure adaptive search with grover’s quantum algorithm, Journal of optimization theory and applications 116, 517 (2003).
Šafránek [2018] D. Šafránek, Simple expression for the quantum fisher information matrix, Physical Review A 97, 042322 (2018).
Spall [2005] J. C. Spall, Monte carlo computation of the fisher information matrix in nonstandard settings, Journal of Computational and Graphical Statistics 14, 889 (2005).