
A proposal of adaptive parameter tuning for robust stabilizing control of $N$-level quantum angular momentum systems

Shoju Enami and Kentaro Ohki
This work was supported by JSPS KAKENHI Grant Numbers JP19K03619 and JP20H02168. S. Enami and K. Ohki are with the Department of Applied Mathematics and Physics, Graduate School of Informatics, Kyoto University, Yoshida-Honmachi, Sakyo-ku, Kyoto 606-8501, Japan. [email protected]
Abstract

Stabilizing control synthesis is one of the central subjects in control theory and engineering, and it always has to deal with unavoidable uncertainties in practice. In this study, we propose an adaptive parameter tuning algorithm for robust stabilizing quantum feedback control of $N$-level quantum angular momentum systems, combined with the robust stabilizing controller proposed by [Liang, Amini, and Mason, SIAM J. Control Optim., 59 (2021), pp. 669-692]. The proposed method ensures local convergence to the target state. Moreover, numerical experiments indicate its global convergence if the learning parameters are adequately determined.

I Introduction

Stabilizing controller synthesis is one of the central problems in control systems, even when the systems are described by quantum mechanics [1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 11]. Unfortunately, stabilizing control is vulnerable to failure due to the existence of uncertainties in practice. There are two conventional approaches to overcoming this problem: robust control [12, 13] and adaptive control [14, 15]. Robust control ensures the performance of the control system under worst-case scenarios against a given set of uncertainties. It has been actively studied for quantum systems [16, 17, 18, 19, 20, 21], as well as for classical systems. The most common problem of robust control is that it is difficult to specify the uncertainties in advance, and even if this is possible, the robust controller tends to yield conservative control performance. On the other hand, adaptive control operates on the system while learning model parameters. Adaptive approaches for quantum system identification and filtering have also been studied [22, 23, 24]. However, these studies do not consider a stochastic continuous measurement signal, which is known as a homodyne measurement signal in physics and is one of the commonly used detection models in quantum physics, and no real-time adaptive control framework has been proposed in the previous studies so far.

Recently, Liang et al. [21] derived certain conditions for robust stabilization of $N$-level quantum angular momentum systems with uncertain parameters and an uncertain initial state. They revealed that accurate estimation of the product of two parameters is essential for their robust stabilization. This fact is important because it ensures robust stabilization by accurate estimation of the product of the parameters only, rather than of the parameters individually. Motivated by [21], we propose an adaptive parameter tuning algorithm combined with stabilizing control. To the best of our knowledge, this is the first study on adaptive control for quantum systems with continuous-time measurement feedback. The proposed adaptive law aims to minimize the difference between the original and model outputs. The method is simple, but it works well in numerical experiments under certain assumptions and ensures local convergence.

I-A Contributions

The contributions of this study are summarized as follows:

  • An adaptive parameter tuning algorithm for robust quantum stabilizing control is proposed (Equation (6)).

  • An asymptotic property of the estimate under steady-state conditions is derived (Proposition 4).

  • Local convergence of the proposed method is evaluated under certain assumptions (Theorem 6).

I-B Organization

The rest of this paper is organized as follows. The problem is stated in Section II. In Sec. III, we propose an adaptive parameter tuning algorithm and the analytical results are shown. The proposed method is evaluated numerically and compared with the application of [21] in Sec. IV. We conclude the paper in Sec. V.

I-C Notation

$\mathbb{N}$, $\mathbb{R}$, and $\mathbb{C}$ are the natural, real, and complex numbers, respectively, and $\mathrm{i}:=\sqrt{-1}$. $\mathbb{R}^{n\times m}$ and $\mathbb{C}^{n\times m}$ are the real and complex $n\times m$ matrices, respectively. $X^{\ast}$ denotes the Hermitian conjugate of a matrix $X$. We use $I_{n}\in\mathbb{C}^{n\times n}$ as the identity matrix. For $X=X^{\ast}\in\mathbb{C}^{n\times n}$, $X>0$ ($X\geq 0$) indicates that $X$ is a positive-(semi)definite matrix. When two positive-semidefinite matrices $X$ and $Y$ satisfy $X=Y^{2}$, we denote $Y=\sqrt{X}$. The absolute value of a square matrix is defined as $|X|:=\sqrt{X^{\ast}X}$ and, for $X\in\mathbb{C}^{n\times n}$, the trace norm is defined as $\|X\|_{\rm Tr}:=\mathrm{Tr}[|X|]$. $\mathcal{S}(\mathbb{C}^{n}):=\{\rho\in\mathbb{C}^{n\times n}\ |\ \rho=\rho^{\ast}\geq 0,\ \mathrm{Tr}[\rho]=1\}$. Denote $[X,Y]_{-}:=XY-YX$ for all $X,Y\in\mathbb{C}^{n\times n}$. $\mathbb{E}_{w}$ indicates the expectation with respect to a random variable or a stochastic process $w$. $O(\varepsilon)$ is Landau's $O$ as $\varepsilon\to 0$.

II Problem Formulation

II-A Measurement-based Feedback Quantum Systems

Let $J\in\mathbb{N}$ and $N:=2J+1$, and let us consider the following quantum stochastic differential equation [2, 9, 21]:

d\rho(t)= \mathrm{i}[H_{\omega}(u(t)),\rho(t)]_{-}dt-\frac{M}{2}[J_{z},[J_{z},\rho(t)]_{-}]_{-}dt
+\sqrt{\eta M}\left(J_{z}\rho(t)+\rho(t)J_{z}-2\mathrm{Tr}[J_{z}\rho(t)]\rho(t)\right)
\quad\quad\times\left(dy(t)-2\sqrt{\eta M}\,\mathrm{Tr}[J_{z}\rho(t)]dt\right) (1)

with an initial state $\rho(0)\in\mathcal{S}(\mathbb{C}^{N})$, where $\rho(t)\in\mathcal{S}(\mathbb{C}^{N})$ is a conditional state of the system, $u(t)$ is the control input, $y(t)$ is the measurement output, $H_{\omega}(u):=\omega J_{z}+uJ_{y}$,

J_{z}:= \mathrm{diag}(J,\ J-1,\ \dots,\ -J+1,\ -J),
J_{y}:= \begin{bmatrix}0&-\mathrm{i}c_{1}&0&\cdots&0\\ \mathrm{i}c_{1}&\ddots&\ddots&&\vdots\\ 0&\ddots&\ddots&\ddots&0\\ \vdots&&\ddots&\ddots&-\mathrm{i}c_{N-1}\\ 0&\cdots&0&\mathrm{i}c_{N-1}&0\end{bmatrix},

$c_{m}=\frac{1}{2}\sqrt{(2J+1-m)m}$, $m=1,\dots,N-1$, $\omega>0$, $M>0$ is the coupling constant, and $\eta\in(0,1]$ denotes the measurement efficiency [25]. Throughout this paper, the control input $u$ is assumed to be bounded. $\rho(t)$ is called the state, which is a quantum counterpart of a (conditional) probability law. Equation (1) is called the stochastic master equation and has $N$ different equilibrium points if the control input $u(t)=0$. We denote each equilibrium point by $\rho_{n}\in\mathcal{S}(\mathbb{C}^{N})$, $n=0,\dots,2J$, and the target state is denoted by $\rho_{\bar{n}}$. Note that $\rho_{n}$ consists of an eigenvector of $J_{z}$, i.e.,

J_{z}\rho_{n}=\rho_{n}J_{z}=(J-n)\rho_{n}\quad\forall n\in\{0,\dots,2J\}.
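As a purely illustrative aid (not part of the original development), the following Python sketch constructs $J_{z}$, $J_{y}$, and the equilibrium states $\rho_{n}$ for a given $J$ and checks the eigen-relation above; the function and variable names are our own.

```python
import numpy as np

def angular_momentum_ops(J):
    """Return (Jz, Jy) for the N = 2J+1 level system of Section II-A."""
    N = 2 * J + 1
    Jz = np.diag(np.arange(J, -J - 1, -1).astype(float))
    # c_m = (1/2) sqrt((2J+1-m) m), m = 1, ..., N-1
    c = [0.5 * np.sqrt((2 * J + 1 - m) * m) for m in range(1, N)]
    Jy = np.zeros((N, N), dtype=complex)
    for k in range(N - 1):
        Jy[k, k + 1], Jy[k + 1, k] = -1j * c[k], 1j * c[k]
    return Jz, Jy

def equilibrium_state(J, n):
    """rho_n: projector onto the eigenvector of Jz with eigenvalue J - n."""
    N = 2 * J + 1
    rho = np.zeros((N, N), dtype=complex)
    rho[n, n] = 1.0
    return rho

if __name__ == "__main__":
    J = 2                                   # N = 5, as in Section IV
    Jz, Jy = angular_momentum_ops(J)
    for n in range(2 * J + 1):
        rho_n = equilibrium_state(J, n)
        assert np.allclose(Jz @ rho_n, (J - n) * rho_n)   # Jz rho_n = (J - n) rho_n
```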

A stabilization problem of (1) is to ensure that the state converges to a desirable equilibrium point. In practical situations, model uncertainty must be considered because various uncertainties exist in the model, initial state, and parameters of (1). In this paper, we consider parametric uncertainty and an unknown initial condition. The nominal model is then described as follows.

d\hat{\rho}(t)= \mathrm{i}[H_{\hat{\omega}}(u(t)),\hat{\rho}(t)]_{-}dt-\frac{\hat{M}}{2}[J_{z},[J_{z},\hat{\rho}(t)]_{-}]_{-}dt
+\sqrt{\hat{\eta}\hat{M}}\left(J_{z}\hat{\rho}(t)+\hat{\rho}(t)J_{z}-2\mathrm{Tr}[J_{z}\hat{\rho}(t)]\hat{\rho}(t)\right)
\quad\quad\times\left(dy(t)-2\sqrt{\hat{\eta}\hat{M}}\,\mathrm{Tr}[J_{z}\hat{\rho}(t)]dt\right) (2)

with its initial state $\hat{\rho}(0)\in\mathcal{S}(\mathbb{C}^{N})$. The differences from (1) are the initial state $\hat{\rho}(0)$ and the parameters $(\hat{\omega},\hat{M},\hat{\eta})$. Because the accessible state is $\hat{\rho}(t)$, the goal of the stabilization problem is to find a feedback controller $u(t)=u_{FB}(\hat{\rho}(t))$ that ensures $\lim_{t\to\infty}\rho(t)=\rho_{\bar{n}}$ in one of the senses of stochastic convergence.
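The following is a minimal Python sketch, under our own discretization choices, of a single Euler-Maruyama step for equations of the form (1) and (2): the same update, fed with the measured increment $dy$, propagates the true state with $(\omega,M,\eta)$ and the nominal state with $(\hat{\omega},\hat{M},\hat{\eta})$. The function name and its arguments are our own assumptions and are not taken from the paper.

```python
import numpy as np

def sme_step(rho, u, dy, dt, omega, M, eta, Jz, Jy):
    """One Euler-Maruyama step of a stochastic master equation of the form (1)/(2).

    rho : current (conditional) state, an N x N density matrix
    u   : control input; dy : increment of the measurement record over dt
    """
    theta = np.sqrt(eta * M)                         # sqrt(eta M) in (1), sqrt(eta^ M^) in (2)
    H = omega * Jz + u * Jy                          # H_omega(u) = omega Jz + u Jy
    comm = lambda A, B: A @ B - B @ A
    x = np.trace(Jz @ rho).real                      # Tr[Jz rho]
    drift = 1j * comm(H, rho) - 0.5 * M * comm(Jz, comm(Jz, rho))
    diffusion = theta * (Jz @ rho + rho @ Jz - 2.0 * x * rho)
    innovation = dy - 2.0 * theta * x * dt
    rho_new = rho + drift * dt + diffusion * innovation
    rho_new = 0.5 * (rho_new + rho_new.conj().T)     # re-Hermitize against roundoff
    return rho_new / np.trace(rho_new).real          # renormalize the trace
```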

II-B Previous Work

Liang et al. [21] found certain sufficient conditions under which stabilization of the nominal state implies stabilization of the true state. One of their main results is that if the ratio of $\hat{\eta}\hat{M}$ to $\eta M$ is close to $1$, then there exists a stabilizing controller. For convenience, we write $\hat{\theta}:=\sqrt{\hat{\eta}\hat{M}}$ and $\theta:=\sqrt{\eta M}$; then the following result holds [21].

Theorem 1 ([21, Propositions 4.16 and 4.18])

Suppose $\hat{\theta}$ satisfies

\alpha_{\bar{n}}<\frac{\hat{\theta}}{\theta}-1<\beta_{\bar{n}}, (3)

where

\alpha_{\bar{n}}:= \left\{\begin{array}{ll}\displaystyle-\frac{1}{2N-1},&\bar{n}\in\{0,2J\},\\ \displaystyle-\frac{1}{N-2},&\bar{n}=J,\\ \displaystyle-\frac{1}{L_{\bar{n}}+1},&\mbox{otherwise},\end{array}\right.
\beta_{\bar{n}}:= \left\{\begin{array}{ll}\displaystyle\frac{1}{2}\left(\sqrt{\frac{N+1}{N-1}}-1\right),&\bar{n}\in\{0,2J\},\\ \displaystyle\frac{1}{N-2},&\bar{n}=J,\\ \displaystyle\frac{1}{L_{\bar{n}}-1},&\mbox{otherwise},\end{array}\right.

and $L_{\bar{n}}:=4|J-\bar{n}|\max\{\bar{n},2J-\bar{n}\}$, $\hat{\rho}(0)$ is positive-definite, and $\rho(0)\in\mathcal{S}(\mathbb{C}^{N})$. Then, there exists an asymptotically stabilizing control law that ensures

(\rho(t),\hat{\rho}(t))\xrightarrow{t\to\infty}(\rho_{\bar{n}},\rho_{\bar{n}})\quad\mbox{a.s.}

Note that Theorem 1 is only part of their results. See [21] for the details.
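For quick reference, the bounds $\alpha_{\bar{n}}$ and $\beta_{\bar{n}}$ of Theorem 1 can be evaluated with the following small helper (our own, not from [21]); it also checks whether a given pair $(\hat{\theta},\theta)$ satisfies condition (3).

```python
import numpy as np

def robustness_bounds(J, n_bar):
    """(alpha_{n_bar}, beta_{n_bar}) from Theorem 1 for an N = 2J+1 level system."""
    N = 2 * J + 1
    if n_bar in (0, 2 * J):
        alpha = -1.0 / (2 * N - 1)
        beta = 0.5 * (np.sqrt((N + 1) / (N - 1)) - 1.0)
    elif n_bar == J:
        alpha = -1.0 / (N - 2)
        beta = 1.0 / (N - 2)
    else:
        L = 4 * abs(J - n_bar) * max(n_bar, 2 * J - n_bar)
        alpha, beta = -1.0 / (L + 1), 1.0 / (L - 1)
    return alpha, beta

def satisfies_condition_3(theta_hat, theta, J, n_bar):
    alpha, beta = robustness_bounds(J, n_bar)
    return alpha < theta_hat / theta - 1.0 < beta

if __name__ == "__main__":
    J, n_bar = 2, 0                                            # N = 5, target rho_0
    print(robustness_bounds(J, n_bar))                         # approx (-0.111, 0.112)
    # parameters of Section IV: eta^ M^(0) = 25 vs. eta M = 0.9 -> condition (3) fails
    print(satisfies_condition_3(np.sqrt(1.0 * 25.0), np.sqrt(0.9 * 1.0), J, n_bar))
```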

II-C Problem Statement

Before stating our problem, we present a minor modification of Theorem 1.

Corollary 2

Let $\hat{\theta}(t):=\sqrt{\hat{\eta}(t)\hat{M}(t)}$ be a time-varying parameter, and assume that there exists $t_{0}>0$ such that the following constraint holds:

\alpha_{\bar{n}}<\frac{\hat{\theta}(t)}{\theta}-1<\beta_{\bar{n}}\quad\forall t\geq t_{0}, (4)

where $\alpha_{\bar{n}}$ and $\beta_{\bar{n}}$ are the same as defined in Theorem 1, $\hat{\rho}(t_{0})>0$, and $\rho(0)\in\mathcal{S}(\mathbb{C}^{N})$. Then, there exists a stabilizing control law.

Proof:

The proof is the same as [21, Propositions 4.16 and 4.18], so we omit it here. ∎

Corollary 2 implies that if we can set the parameter $\hat{\theta}(t)$ appropriately, stabilization is achieved even if the initial parameter $\hat{\theta}(0)$ does not satisfy the condition (3). Therefore, the problem we deal with is how to estimate $\theta$ while stabilizing the state $\rho(t)$. Adaptive tuning of the parameter $\hat{\theta}(t)$ combined with stabilizing control is a simple and useful solution to this problem, as shown in the next section.

III Proposed Method and Theoretical Results

Owing to the work of Liang et al. [21], there is an acceptable level of uncertainty in the parameter $\theta$ that still ensures convergence of the state to the target state. Since only the product $\hat{\eta}\hat{M}$ matters for this condition, we fix $\hat{\eta}$ and focus on tuning the parameter $\hat{M}(t)$ only. The adaptive model is then described as follows (Fig. 1).

d\hat{\rho}(t)= \mathrm{i}[H_{\hat{\omega}}(u(t)),\hat{\rho}(t)]_{-}dt-\frac{\hat{M}(t)}{2}[J_{z},[J_{z},\hat{\rho}(t)]_{-}]_{-}dt
+\sqrt{\hat{\eta}\hat{M}(t)}\left(J_{z}\hat{\rho}(t)+\hat{\rho}(t)J_{z}-2\mathrm{Tr}[J_{z}\hat{\rho}(t)]\hat{\rho}(t)\right)
\quad\quad\times\left(dy(t)-2\sqrt{\hat{\eta}\hat{M}(t)}\,\mathrm{Tr}[J_{z}\hat{\rho}(t)]dt\right), (5)

where $(\hat{\omega},\hat{M}(0),\hat{\eta})$ are given and $\hat{M}(t)$ is calculated by our proposed parameter tuning algorithm below.

Figure 1: The proposed adaptive controller.

III-A Proposed Adaptive Parameter Tuning Method

For convenience, we use $\hat{\theta}(t):=\sqrt{\hat{\eta}\hat{M}(t)}$, $\hat{x}(t):=\mathrm{Tr}[J_{z}\hat{\rho}(t)]$, and $x(t):=\mathrm{Tr}[J_{z}\rho(t)]$. Then, we propose the following parameter tuning algorithm:

d\hat{\theta}(t)= f(t)\left\{-\hat{x}(t)^{2}\hat{\theta}(t)dt+\frac{1}{2}\hat{x}(t)dy(t)\right\}, (6)
f(t):= (Kt+1)^{-p},\quad t\geq 0, (7)

where $p\in(0,1]$ and $K>0$. Note that we update $\hat{M}(t)$ as $\hat{M}(t)=\hat{\theta}(t)^{2}/\hat{\eta}$. From filtering theory [25, 26], $dy(t)$ can be replaced by $2\theta x(t)dt+dw(t)$, where $w(t)$ is a standard Wiener process, and (6) then gives

d\hat{\theta}(t)= f(t)\hat{x}(t)\left\{(\theta x(t)-\hat{\theta}(t)\hat{x}(t))dt+\frac{1}{2}dw(t)\right\}.

If the noise $w(t)$ were removed, updating $\hat{\theta}(t)$ by Eq. (6) would be identical to an instantaneous gradient method for the cost function $|\theta x(t)-\hat{\theta}(t)\hat{x}(t)|^{2}$ with the weight $f(t)$. This is a type of Robbins-Monro algorithm for continuous-time problems [27, 28, 29]. Clearly, if $x(t)=\hat{x}(t)\neq 0$ for all $t\geq 0$, then the parameter tuning law is the continuous-time Robbins-Monro algorithm, which is guaranteed to converge to the true parameter. Unfortunately, the assumption $x(t)=\hat{x}(t)\neq 0$ for all $t\geq 0$ may not hold; therefore, we need to seek conditions under which the parameter $\hat{\theta}(t)$ converges to the region described by (4). Note that, because the noise $w$ is unavoidable, the true parameter $\theta$ cannot be an equilibrium point of the system (6). We therefore examine how to choose the parameters $(K,p)$ so as to obtain an accurate estimate asymptotically.

Remark 3

Because each unknown parameter is a positive constant, the adaptive parameter $\hat{\theta}(t)$ must be positive. However, the solution of (6) is not guaranteed to remain positive, so when $\hat{\theta}(t)$ becomes negative, we replace it with $0$ or a small positive number in practical implementations; a minimal sketch of this update is given below.
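The following sketch combines (6), (7), and the safeguard of Remark 3 in a discrete-time update; the discretization and the names `theta_hat`, `x_hat`, `dy` are our own assumptions about how the surrounding simulation stores these quantities.

```python
def gain(t, K=20.0, p=0.6):
    """f(t) = (K t + 1)^{-p}, cf. (7)."""
    return (K * t + 1.0) ** (-p)

def update_theta_hat(theta_hat, x_hat, dy, t, dt, K=20.0, p=0.6, eps=1e-8):
    """One Euler-Maruyama step of the tuning law (6).

    x_hat = Tr[Jz rho_hat(t)] and dy is the measured increment over dt.
    The estimate is clipped at a small positive number when it would become
    negative (Remark 3); M_hat(t) = theta_hat**2 / eta_hat is then used in
    the nominal model (5).
    """
    f = gain(t, K, p)
    theta_hat = theta_hat + f * (-(x_hat ** 2) * theta_hat * dt + 0.5 * x_hat * dy)
    return max(theta_hat, eps)
```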

III-B Asymptotic Property of the Estimate

Here, we show that the choice of the parameters $p$ and $K$ in (7) is reasonable, based on the following proposition. For convenience, we write $\mathbb{E}_{w}[\bullet]\equiv\mathbb{E}_{w}[\bullet|\rho(0),\hat{\rho}(0)]$.

Proposition 4

Suppose that the pair of initial states $(\rho(0),\hat{\rho}(0))$ equals some $(\rho_{n},\rho_{m})$ with $m\neq J$, that $u(t)=0$, and consider (6) with $p\in\mathbb{R}$ and $K>0$. Then the following statements hold.

  1. For the mean of $\hat{\theta}(t)$,

    \lim_{t\to\infty}\mathbb{E}_{w}[\hat{\theta}(t)]= \left\{\begin{array}{ll}\mbox{(depends on $\hat{\theta}(0)$)},&p>1,\\ \theta\frac{J-n}{J-m},&p\in(-\infty,1].\end{array}\right.
  2. For the variance of $\hat{\theta}(t)$, $V(\hat{\theta}(t)):=\mathbb{E}_{w}[(\hat{\theta}(t)-\mathbb{E}_{w}[\hat{\theta}(t)])^{2}]$,

    \limsup_{t\to\infty}V(\hat{\theta}(t))\leq \frac{1}{8},\quad p>1,
    \lim_{t\to\infty}V(\hat{\theta}(t))= \left\{\begin{array}{ll}0,&p\in(0,1],\\ \frac{1}{8},&p=0,\\ \infty,&p<0.\end{array}\right.
Proof:

Denote the integral of $f(t)$ by

F(t):= \int_{0}^{t}f(\tau)d\tau = \left\{\begin{array}{ll}\frac{1}{K(1-p)}\{(Kt+1)^{1-p}-1\},&p\in\mathbb{R}\setminus\{1\},\\ \frac{1}{K}\ln(Kt+1),&p=1.\end{array}\right. (10)

We only prove the convergence of $V(\hat{\theta}(t))$. Note that $x=J-n$ and $\hat{x}=J-m$ from the assumption. Then, the explicit solution of $V(\hat{\theta}(t))$ is

V(\hat{\theta}(t))= e^{-2\hat{x}^{2}F(t)}V(\hat{\theta}(0))
+\frac{\hat{x}^{2}}{4}e^{-2\hat{x}^{2}F(t)}\int_{0}^{t}e^{2\hat{x}^{2}F(\tau)}f(\tau)^{2}d\tau.

As $\theta$ is a deterministic uncertain parameter and $\hat{M}(0)$ is given, $V(\hat{\theta}(0))=0$. If $p=0$, then $f(t)=1$ and the claim of the proposition trivially holds. The other cases are as follows.

  1. If $p\in(0,\infty)$, then $f(t)^{2}\leq f(t)$ because $f(t)\in[0,1]$, and $f(t)\leq f(\tau_{0})$ for all $t\geq\tau_{0}$, where $\tau_{0}>0$ is arbitrarily chosen. By a simple calculation using these properties,

    \int_{0}^{t}e^{2\hat{x}^{2}F(\tau)}f(\tau)^{2}d\tau
    = \int_{0}^{\tau_{0}}e^{2\hat{x}^{2}F(\tau)}f(\tau)^{2}d\tau+\int_{\tau_{0}}^{t}e^{2\hat{x}^{2}F(\tau)}f(\tau)^{2}d\tau
    \leq \int_{0}^{\tau_{0}}e^{2\hat{x}^{2}F(\tau)}f(\tau)d\tau+f(\tau_{0})\int_{\tau_{0}}^{t}e^{2\hat{x}^{2}F(\tau)}f(\tau)d\tau
    = \int_{F(0)}^{F(\tau_{0})}e^{2\hat{x}^{2}s}ds+f(\tau_{0})\int_{F(\tau_{0})}^{F(t)}e^{2\hat{x}^{2}s}ds
    = \frac{1}{2\hat{x}^{2}}\Big\{e^{2\hat{x}^{2}F(\tau_{0})}-e^{2\hat{x}^{2}F(0)}+f(\tau_{0})e^{2\hat{x}^{2}F(t)}-f(\tau_{0})e^{2\hat{x}^{2}F(\tau_{0})}\Big\},

    and therefore,

    V(\hat{\theta}(t))\leq \frac{f(\tau_{0})}{8}\left(1-e^{-2\hat{x}^{2}(F(t)-F(\tau_{0}))}\right)
    +\frac{1}{8}e^{-2\hat{x}^{2}F(t)}\Big\{e^{2\hat{x}^{2}F(\tau_{0})}-e^{2\hat{x}^{2}F(0)}\Big\}.

    As $\tau_{0}$ can be chosen arbitrarily large, the first term on the right-hand side of the above inequality can be made arbitrarily small. As $t\to\infty$, $e^{-2\hat{x}^{2}F(t)}\to 0$ for $p\in(0,1]$ and $e^{-2\hat{x}^{2}F(t)}\to e^{-2\hat{x}^{2}/(K(p-1))}>0$ for $p>1$. Then, the last term on the right-hand side remains finite and is less than $1/8$. Therefore, the claim of the proposition holds for $p\in(0,\infty)$.

  2. If $p\in(-\infty,0)$, then $f(t)^{2}\geq f(t)$ because $f(t)\geq 1$, and $f(t)\geq f(\tau_{0})$ for all $t\geq\tau_{0}$, where $\tau_{0}>0$ is arbitrarily chosen. By a similar calculation,

    \int_{0}^{t}e^{2\hat{x}^{2}F(\tau)}f(\tau)^{2}d\tau
    \geq \int_{0}^{\tau_{0}}e^{2\hat{x}^{2}F(\tau)}f(\tau)d\tau+f(\tau_{0})\int_{\tau_{0}}^{t}e^{2\hat{x}^{2}F(\tau)}f(\tau)d\tau
    \geq \frac{f(\tau_{0})}{2\hat{x}^{2}}\Big\{e^{2\hat{x}^{2}F(t)}-e^{2\hat{x}^{2}F(\tau_{0})}\Big\}

    and therefore, for $t>\tau_{0}$,

    V(\hat{\theta}(t))\geq \frac{f(\tau_{0})}{8}\left(1-e^{-2\hat{x}^{2}(F(t)-F(\tau_{0}))}\right)

    holds. The exponential term on the right-hand side of the above inequality vanishes as $t\to\infty$, and because $f(\tau_{0})$ can be made arbitrarily large by taking $\tau_{0}$ large, $V(\hat{\theta}(t))\to\infty$ as $t\to\infty$. ∎

From Proposition 4, if $\rho(t)$ and $\hat{\rho}(t)$ are in the same equilibrium state, the parameter $\hat{\theta}(t)$ updated by (6) converges to the true value with probability one. Unfortunately, since the true state $\rho(t)$ is not accessible, we cannot confirm in practice whether $\rho(t)$ and $\hat{\rho}(t)$ are in the same equilibrium state. To avoid being trapped in different equilibrium points before learning the parameter accurately, we employ a feedforward control in the following subsection. A numerical illustration of Proposition 4 in its idealized setting is sketched below.
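Proposition 4 can be checked numerically in the situation it assumes: both states frozen at equilibria $\rho_{n}$ and $\rho_{m}$ ($m\neq J$) with $u(t)=0$, so that $x=J-n$ and $\hat{x}=J-m$ are constants. The sketch below is our own illustration with arbitrarily chosen parameter values; the sample mean of $\hat{\theta}(t)$ should approach $\theta(J-n)/(J-m)$ and the sample variance should become small for $p\in(0,1]$.

```python
import numpy as np

def simulate_theta_hat(theta, x, x_hat, theta0, T, dt, K, p, n_traj, seed=0):
    """Euler-Maruyama simulation of (6) with x(t) and x_hat(t) held constant."""
    rng = np.random.default_rng(seed)
    theta_hat = np.full(n_traj, theta0)
    t = 0.0
    for _ in range(int(T / dt)):
        f = (K * t + 1.0) ** (-p)
        dw = rng.normal(0.0, np.sqrt(dt), size=n_traj)
        dy = 2.0 * theta * x * dt + dw                 # measurement from the true system
        theta_hat += f * (-(x_hat ** 2) * theta_hat * dt + 0.5 * x_hat * dy)
        theta_hat = np.maximum(theta_hat, 1e-8)        # positivity safeguard (Remark 3)
        t += dt
    return theta_hat

if __name__ == "__main__":
    J, n, m = 2, 0, 1                                  # x = J - n = 2, x_hat = J - m = 1
    theta = np.sqrt(0.9 * 1.0)
    est = simulate_theta_hat(theta, J - n, J - m, theta0=5.0,
                             T=1000.0, dt=0.01, K=20.0, p=0.6, n_traj=200)
    print(est.mean(), theta * (J - n) / (J - m))       # sample mean vs. theta (J-n)/(J-m)
    print(est.var())                                   # should be small for p in (0, 1]
```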

Remark 5

A key to proving Proposition 4 is that $f:[0,\infty)\to[0,1]$ is a non-increasing function with $\lim_{t\to\infty}f(t)=0$, while its integral $F(t)=\int_{0}^{t}f(\tau)d\tau$ is a non-decreasing function with $\lim_{t\to\infty}F(t)=\infty$. This is a minor difference from continuous-time Robbins-Monro algorithms, which require square integrability of the function $f(t)$ (e.g., [28, Theorem 1]); see the sketch below. Searching for a preferable function for learning $\theta$ is beyond the scope of this study, and we only use (7) without considering other functions. Some convergence rate results are established in [30], to which we refer the reader for details.
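To make the comparison concrete, the closed forms of $\int_{0}^{T}f(t)dt$ and $\int_{0}^{T}f(t)^{2}dt$ for the gain (7) can be evaluated as follows (a small check of our own): for $p\in(0.5,1]$ the first integral grows without bound while the second stays bounded, and for $p\in(0,0.5]$ neither is bounded even though $f$ still satisfies the conditions used in Proposition 4.

```python
def integral_power(K, q, T):
    """Closed form of int_0^T (K t + 1)^{-q} dt, valid for q != 1."""
    return ((K * T + 1.0) ** (1.0 - q) - 1.0) / (K * (1.0 - q))

K = 20.0
for p in (0.6, 0.3):
    for T in (1e2, 1e4, 1e6):
        int_f = integral_power(K, p, T)         # diverges as T -> infinity for p <= 1
        int_f2 = integral_power(K, 2 * p, T)    # bounded iff p > 1/2
        print(f"p={p}, T={T:.0e}: int f = {int_f:.3f}, int f^2 = {int_f2:.3f}")
```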

III-C Local Convergence Property

In this subsection, we evaluate our tuning algorithm (6) with the following control input.

u(t)= u_{FB}(\hat{\rho}(t))+u_{FF}(t),

where $u_{FF}(t)\in[0,\infty)$ is a strictly decreasing, bounded continuous function with $\lim_{t\to\infty}u_{FF}(t)=0$, and $u_{FB}$ is a stabilizing feedback control law provided that $\hat{\theta}(t)$ satisfies the condition of Corollary 2 (e.g., (4.22) or (4.23) in [21]). The role of $u_{FF}$ is to drive $\hat{\rho}(t)$ away from the target state $\rho_{\bar{n}}$ before the parameter is learned accurately. Then, our proposed method ensures local convergence under certain assumptions. For convenience, we write $\mathbb{E}_{w}^{\prime}[\bullet]:=\mathbb{E}_{w}[\bullet|\rho(t_{0}),\hat{\rho}(t_{0})]$.

Theorem 6

Let $t_{0}>0$ satisfy $f(t_{0})<\varepsilon$ for a given sufficiently small $\varepsilon>0$. Let $\rho(t)$ and $\hat{\rho}(t)$ be the solutions of (1) and (5) starting from $\rho(t_{0})$ and $\hat{\rho}(t_{0})\in\mathcal{S}(\mathbb{C}^{N})$, respectively. Choose the feedforward control $u_{FF}(t)$ such that $u_{FF}(t)\leq f(t)^{2}$ for all $t\geq t_{0}$, and the feedback control $u_{FB}$ such that $\mathbb{E}_{w}^{\prime}[|u_{FB}(\hat{\rho})|]=O(\varepsilon^{2})$ whenever $\mathbb{E}_{w}^{\prime}[\|\hat{\rho}-\rho_{\bar{n}}\|_{\mathrm{Tr}}]<\varepsilon$. Let $\hat{\theta}(t)$ be the solution of the parameter tuning algorithm (6) with $p\in(0.5,1]$ and $K>0$, and let its initial value $\hat{\theta}(t_{0})$ satisfy $|1-\hat{\theta}(t_{0})/\theta|<\varepsilon$. Suppose that $\max\{\mathbb{E}_{w}^{\prime}[\|\rho(t)-\rho_{\bar{n}}\|_{\mathrm{Tr}}],\mathbb{E}_{w}^{\prime}[\|\hat{\rho}(t)-\rho_{\bar{n}}\|_{\mathrm{Tr}}]\}<\varepsilon$ and $\max\{\mathbb{E}_{w}^{\prime}[\|\rho(t)-\rho_{\bar{n}}\|_{\mathrm{Tr}}^{2}],\mathbb{E}_{w}^{\prime}[\|\hat{\rho}(t)-\rho_{\bar{n}}\|_{\mathrm{Tr}}^{2}]\}<\varepsilon^{2}$ for all $t\geq t_{0}$. In addition, assume that the following inequality holds for almost all $t\geq t_{0}$:

\Delta(\rho(t),\hat{\rho}(t),\hat{\theta}(t))\geq 0\quad\mbox{a.s.,} (11)

and that the equality holds iff $V_{\rho(t)}(J_{z})=V_{\hat{\rho}(t)}(J_{z})=0$, where

\Delta(\rho,\hat{\rho},\hat{\theta}) := \left(3V_{\rho}(J_{z})^{2}+2V_{\rho}(J_{z})V_{\hat{\rho}}(J_{z})+3V_{\hat{\rho}}(J_{z})^{2}\right) -2V_{\hat{\rho}}(J_{z})\left(\mathrm{Tr}[J_{z}(\rho-\hat{\rho})]+\mathrm{Tr}[J_{z}\hat{\rho}]\left(1-\frac{\hat{\theta}}{\theta}\right)\right),

and $V_{\rho}(J_{z}):=\mathrm{Tr}[J_{z}^{2}\rho]-\mathrm{Tr}[J_{z}\rho]^{2}$. Then, for $\bar{n}\neq J$,

\lim_{t\to\infty}(\rho(t),\hat{\rho}(t))=(\rho_{\bar{n}},\rho_{\bar{n}})\ \mbox{ and }\ \lim_{t\to\infty}\hat{\theta}(t)=\theta\quad\mbox{a.s.}
Proof:

See Appendix.

Some readers may find the assumptions of Theorem 6 too strong to hold in practice; however, several numerical experiments suggest that they hold in many cases, one of which is demonstrated in the following section.

IV NUMERICAL EXPERIMENTS

In this section, we examine the proposed method numerically. The dimension of the quantum system is $N=5$, and the Euler-Maruyama method is used with a time step of $0.01$. The true parameters are $(\omega,M,\eta)=(0.5,1,0.9)$ and the initial parameters of the adaptive system are $(\hat{\omega},\hat{M}(0),\hat{\eta})=(1,25,1)$, for which the system cannot be stabilized by merely using the feedback control in [21]. The true initial state $\rho(0)$ is randomly generated for each realization and the initial adaptive state is fixed to $\hat{\rho}(0)=\frac{1}{N}I$. The target state $\rho_{\bar{n}}$ is set with $\bar{n}=0$. We set the control inputs as follows:

u_{FF}(t):=f(t)^{2},\quad u_{FB}(\hat{\rho}):=4(1-\mathrm{Tr}[\hat{\rho}\rho_{\bar{n}}])^{2}.

The parameters of (7) are chosen as $(K,p)=(20,0.6)$, and the simulation is run with 1000 realizations. The results are shown in Figs. 2 and 3. Fig. 2 shows the trajectories of the ratio $\hat{\theta}(t)/\theta$ and Fig. 3 shows the distance $d(t)=d_{B}((\rho(t),\hat{\rho}(t)),(\rho_{\bar{n}},\rho_{\bar{n}}))$ [21],

d_{B}((\rho,\hat{\rho}),(\rho_{n},\rho_{m})) = \sqrt{2-2\sqrt{\mathrm{Tr}[\rho\rho_{n}]}}+\sqrt{2-2\sqrt{\mathrm{Tr}[\hat{\rho}\rho_{m}]}}.

We also evaluate whether the inequality (11) holds in Fig. 4. From the figures, all sample trajectories of $\hat{\theta}(t)$ and $(\rho(t),\hat{\rho}(t))$ appear to converge to $\theta$ and the target state $\rho_{0}$, respectively. The inequality (11) sometimes fails at the beginning of the simulations, but all sample trajectories satisfy it after $t=450$, shown by the blue dashed line in Fig. 4, until the states converge to the target states. Moreover, even though our proposed method does not guarantee that the condition (3) is satisfied at all times after a certain point, we confirmed that all sample trajectories of the ratio $\hat{\theta}(t)/\theta$ satisfy the condition of Corollary 2, i.e., $\hat{\theta}(t)/\theta\in(1+\alpha_{0},1+\beta_{0})\simeq(0.889,1.11)$, after $t=666$. This result implies that the proposed method keeps all sample trajectories in a neighborhood of the true value with a significantly high probability. We also confirmed that $(K,p)=(20,0.3)$, which does not satisfy the condition of Theorem 6, works well, but the result is omitted due to the page limitation.
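For readers who wish to reproduce a single realization of this experiment, the following condensed, self-contained Python sketch (our own reimplementation under the stated assumptions; the authors' code is not reproduced here) integrates (1), (5), and (6) by the Euler-Maruyama method with $u_{FF}(t)=f(t)^{2}$, $u_{FB}(\hat{\rho})=4(1-\mathrm{Tr}[\hat{\rho}\rho_{\bar{n}}])^{2}$, and $(K,p)=(20,0.6)$. For simplicity the true initial state is fixed rather than drawn at random, and a smaller time step or an additional positivity projection may be needed for numerical robustness.

```python
import numpy as np

# Operators for J = 2 (N = 5)
J = 2; N = 2 * J + 1
Jz = np.diag(np.arange(J, -J - 1, -1).astype(float))
c = [0.5 * np.sqrt((2 * J + 1 - m) * m) for m in range(1, N)]
Jy = np.zeros((N, N), dtype=complex)
for k in range(N - 1):
    Jy[k, k + 1], Jy[k + 1, k] = -1j * c[k], 1j * c[k]
rho_target = np.zeros((N, N), dtype=complex); rho_target[0, 0] = 1.0   # n_bar = 0

# True and (initial) nominal parameters
omega, M, eta = 0.5, 1.0, 0.9
omega_h, M_h, eta_h = 1.0, 25.0, 1.0
theta, theta_h = np.sqrt(eta * M), np.sqrt(eta_h * M_h)
K, p, dt, T = 20.0, 0.6, 0.01, 1000.0
rng = np.random.default_rng(1)

def comm(A, B):
    return A @ B - B @ A

def sme_step(rho, u, dy, om, th, Mc):
    """Euler-Maruyama step of (1)/(5); th = sqrt(eta*M), Mc = M."""
    H = om * Jz + u * Jy
    x = np.trace(Jz @ rho).real
    drift = 1j * comm(H, rho) - 0.5 * Mc * comm(Jz, comm(Jz, rho))
    diff = th * (Jz @ rho + rho @ Jz - 2.0 * x * rho)
    rho = rho + drift * dt + diff * (dy - 2.0 * th * x * dt)
    rho = 0.5 * (rho + rho.conj().T)                  # re-Hermitize
    return rho / np.trace(rho).real                   # renormalize

rho = np.zeros((N, N), dtype=complex); rho[2, 2] = 1.0     # fixed true initial state
rho_h = np.eye(N, dtype=complex) / N                        # rho_hat(0) = I/N

t = 0.0
for _ in range(int(T / dt)):
    f = (K * t + 1.0) ** (-p)
    u = 4.0 * (1.0 - np.trace(rho_h @ rho_target).real) ** 2 + f ** 2   # u_FB + u_FF
    x, x_h = np.trace(Jz @ rho).real, np.trace(Jz @ rho_h).real
    dy = 2.0 * theta * x * dt + rng.normal(0.0, np.sqrt(dt))            # measurement record
    rho = sme_step(rho, u, dy, omega, theta, M)
    rho_h = sme_step(rho_h, u, dy, omega_h, theta_h, theta_h ** 2 / eta_h)
    theta_h = max(theta_h + f * (-(x_h ** 2) * theta_h * dt + 0.5 * x_h * dy), 1e-8)
    t += dt

print("theta_hat / theta      :", theta_h / theta)                      # cf. Fig. 2
print("Tr[rho rho_target]     :", np.trace(rho @ rho_target).real)      # cf. Fig. 3
print("Tr[rho_hat rho_target] :", np.trace(rho_h @ rho_target).real)
```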

Figure 2: The trajectories of the ratio $\hat{\theta}(t)/\theta$ with parameters $(K,p)=(20,0.6)$. The solid red line represents the average trajectory over 1000 samples and the light blue lines represent 1000 sample realizations.
Figure 3: The trajectories of $d(t)$ with parameters $(K,p)=(20,0.6)$. The solid red line represents the average trajectory over 1000 samples and the light blue lines represent 1000 sample realizations.
Figure 4: The trajectories of $\Delta(\rho(t),\hat{\rho}(t),\hat{\theta}(t))$ with parameters $(K,p)=(20,0.6)$. The solid red line represents the average trajectory over 1000 samples and the light blue lines represent 1000 sample realizations. The blue dashed line represents the time $t=450$.

V CONCLUSION AND FUTURE WORK

In this paper, we proposed an adaptive parameter tuning algorithm for robust stabilizing control of quantum angular momentum systems. The asymptotic property of the estimate and local convergence of the states were evaluated analytically, and numerical experiments show that the proposed method works well for systems with large parametric uncertainty.

Relaxing the assumptions of Theorem 6 and establishing the global convergence property are interesting topics for future work.

ACKNOWLEDGMENTS

The authors gratefully acknowledge the helpful comments and suggestions of the anonymous reviewers.

References

  • [1] R. van Handel, J. K. Stockton, and H. Mabuchi, “Feedback control of quantum state reduction,” IEEE Transactions on Automatic Control, vol. 50, no. 6, pp. 768–780, 2005.
  • [2] M. Mirrahimi and R. van Handel, “Stabilizing Feedback Controls for quantum systems,” SIAM Journal on Control and Optimization, vol. 46, no. 2, pp. 445–467, 2007.
  • [3] K. Tsumura, “Global stabilization of n-dimensional quantum spin systems via continuous feedback,” in Proceedings of 2007 American Control Conference.   IEEE, 2007, pp. 2129–2134.
  • [4] A. Sarlette, Z. Leghtas, M. Brune, J. M. Raimond, and P. Rouchon, “Stabilization of nonclassical states of one- and two-mode radiation fields by reservoir engineering,” Physical Review A, vol. 86, p. 012114, Jul 2012.
  • [5] F. Ticozzi, R. Lucchese, P. Cappellaro, and L. Viola, “Hamiltonian Control of Quantum Dynamical Semigroups: Stabilization and Convergence Speed,” IEEE Transactions on Automatic Control, vol. 57, no. 8, pp. 1931–1944, 2012.
  • [6] F. Ticozzi, K. Nishio, and C. Altafini, “Stabilization of Stochastic Quantum Dynamics via Open and Closed Loop Control,” IEEE Transactions on Automatic Control, vol. 58, pp. 74–85, 2013.
  • [7] P. Scaramuzza and F. Ticozzi, “Switching quantum dynamics for fast stabilization,” Physical Review A, vol. 91, p. 062314, Jun 2015.
  • [8] F. Ticozzi, L. Zuccato, P. D. Johnson, and L. Viola, “Alternating projections methods for discrete-time stabilization of quantum states,” IEEE Transactions on Automatic Control, vol. 63, no. 3, pp. 819–826, 2017.
  • [9] W. Liang, N. H. Amini, and P. Mason, “On exponential stabilization of $N$-level quantum angular momentum systems,” SIAM Journal on Control and Optimization, vol. 57, no. 6, pp. 3939–3960, 2019.
  • [10] G. Cardona, A. Sarlette, and P. Rouchon, “Exponential stabilization of quantum systems under continuous non-demolition measurements,” Automatica, vol. 112, p. 108719, 2020.
  • [11] J. Wen, Y. Shi, J. Jia, and J. Zeng, “Exponential stabilization of two-level quantum systems based on continuous noise-assisted feedback,” Results in Physics, vol. 22, p. 103929, 2021.
  • [12] K. Zhou, J. C. Doyle, and K. Glover, Robust and Optimal Control.   Prentice Hall Upper Saddle River, NJ, 1996.
  • [13] I. R. Petersen, V. A. Ugrinovskii, and A. V. Savkin, Robust Control Design using $H^{\infty}$ Methods.   Springer Verlag, 2000.
  • [14] K. S. Narendra and A. M. Annaswamy, Stable Adaptive Systems.   Prentice-Hall, Inc. Upper Saddle River, NJ, USA, 1989.
  • [15] M. Krstic, P. V. Kokotovic, and I. Kanellakopoulos, Nonlinear and Adaptive Control Design.   John Wiley & Sons, Inc., 1995.
  • [16] M. R. James, “Risk-sensitive optimal control of quantum systems,” Physical Review A, vol. 69, no. 3, p. 032108, 2004.
  • [17] ——, “A quantum Langevin formulation of risk-sensitive optimal control,” Journal of Optics B: Quantum and Semiclassical Optics, vol. 7, p. S198, 2005.
  • [18] D. Dong, C. Chen, B. Qi, I. R. Petersen, and F. Nori, “Robust manipulation of superconducting qubits in the presence of fluctuations,” Scientific Reports, vol. 5, p. 7873, 2015.
  • [19] I. G. Vladimirov, I. R. Petersen, and M. R. James, “Risk-sensitive performance criteria and robustness of quantum systems with a relative entropy description of state uncertainty,” in 23rd International Symposium on Mathematical Theory of Networks and Systems, Hong Kong, Jul. 2018, pp. 482–488.
  • [20] M. R. James, H. I. Nurdin, and I. R. Petersen, “$H^{\infty}$ Control of Linear Quantum Stochastic Systems,” IEEE Transactions on Automatic Control, vol. 53, no. 8, pp. 1787–1803, 2008.
  • [21] W. Liang, N. H. Amini, and P. Mason, “Robust Feedback Stabilization of $N$-Level Quantum Spin Systems,” SIAM Journal on Control and Optimization, vol. 59, no. 1, pp. 669–692, Jan. 2021.
  • [22] S. Bonnabel, M. Mirrahimi, and P. Rouchon, “Observer-based Hamiltonian identification for quantum systems,” Automatica, vol. 45, no. 5, pp. 1144–1155, 2009.
  • [23] Z. Leghtas, M. Mirrahimi, and P. Rouchon, “Back and forth nudging for quantum state estimation by continuous weak measurement,” in Proceedings of the 2011 American Control Conference.   IEEE, 2011.
  • [24] R. S. Gupta and M. J. Biercuk, “Adaptive filtering of projective quantum measurements using discrete stochastic methods,” Physical Review A, vol. 104, p. 012412, 2021.
  • [25] L. Bouten, R. van Handel, and M. R. James, “An Introduction to Quantum Filtering,” SIAM Journal on Control and Optimization, vol. 46, no. 6, pp. 2199–2241, 2007.
  • [26] A. Bain and D. Crişan, Fundamentals of Stochastic Filtering, ser. Stochastic Modelling and Applied Probability.   Springer Verlag, 2009, vol. 60.
  • [27] H. Robbins and S. Monro, “A Stochastic Approximation Method,” The Annals of Mathematical Statistics, vol. 22, no. 3, pp. 400–407, 1951.
  • [28] H.-F. Chen, “Continuous-time stochastic approximation: convergence and asymptotic efficiency,” Stochastics and Stochastic Reports, vol. 51, no. 3-4, pp. 217–239, 1994.
  • [29] H. Kushner, “Stochastic approximation: a survey,” Wiley Interdisciplinary Reviews: Computational Statistics, vol. 2, no. 1, pp. 87–96, 2010.
  • [30] S. Enami and K. Ohki, “Convergence analysis of adaptive tuning parameter for robust stabilizing control of $N$-level quantum systems,” in Proceedings of the 60th Annual Conference of the Society of Instrument and Control Engineers of Japan, 2021.
  • [31] X. Mao, “Stochastic versions of the LaSalle theorem,” Journal of Differential Equations, vol. 153, no. 1, pp. 175–195, 1999.
  • [32] H. J. Kushner, Stochastic Stability and Control, Academic Press, 1967.

Proof of Theorem 6

To prove Theorem 6, we evaluate $G(\rho(t),\hat{\rho}(t),\hat{\theta}(t)):=C_{x}(t)+C_{\theta}(t)+V_{\rho(t)}(J_{z})+V_{\hat{\rho}(t)}(J_{z})$, where $C_{\theta}(t):=\left|1-\frac{\hat{\theta}(t)}{\theta}\right|^{2}$ and $C_{x}(t):=|x(t)-\hat{x}(t)|^{2}$. Our proof mainly follows the argument of the proof of Theorem 2.1 in [31].

First, we evaluate $V_{\hat{\rho}(t)}(J_{z})$ and $V_{\rho(t)}(J_{z})$.

Lemma 7

If $\mathbb{E}_{w}^{\prime}[\|\hat{\rho}(t)-\rho_{\bar{n}}\|_{\mathrm{Tr}}]<\varepsilon$ holds for some small $\varepsilon>0$, then

V_{\hat{\rho}(t)}(J_{z})=\varepsilon\hat{\alpha}(t)+\varepsilon^{2}\mathrm{Tr}[(\hat{\rho}(t)-\rho_{\bar{n}})J_{z}]^{2},

where $\hat{\alpha}(t):=\mathrm{Tr}[(\hat{\rho}(t)-\rho_{\bar{n}})(J_{z}-(J-\bar{n})I_{N})^{2}]$ is a nonnegative number and $\hat{\alpha}(t)=0$ iff $\hat{\rho}(t)=\rho_{\bar{n}}$.

Proof:

If $\mathbb{E}_{w}^{\prime}[\|\hat{\rho}(t)-\rho\|_{\mathrm{Tr}}]<\varepsilon$ holds for some small $\varepsilon>0$, there exists $X(t)=X(t)^{\ast}\in\mathbb{C}^{N\times N}$ that satisfies $\hat{\rho}(t)=\rho+\varepsilon X(t)$ and $\mathbb{E}_{w}^{\prime}[\|X(t)\|_{\mathrm{Tr}}]<1$. This implies that if $\mathbb{E}_{w}^{\prime}[\|\hat{\rho}(t)-\rho_{\bar{n}}\|_{\mathrm{Tr}}]<\varepsilon$ holds for a small $\varepsilon>0$, then

V_{\hat{\rho}(t)}(J_{z})=\varepsilon\underbrace{\mathrm{Tr}[X(t)(J_{z}-(J-\bar{n})I_{N})^{2}]}_{=\hat{\alpha}(t)}+\varepsilon^{2}\mathrm{Tr}[X(t)J_{z}]^{2}

holds. Since $\rho_{\bar{n}}+\varepsilon X(t)\in\mathcal{S}(\mathbb{C}^{N})$, the $(\bar{n}+1)$-th diagonal element of $X(t)$ must be nonpositive, the other diagonal elements are nonnegative, and $\mathrm{Tr}[X(t)]=0$. The $(\bar{n}+1)$-th diagonal element of $(J_{z}-(J-\bar{n})I_{N})^{2}$ is $0$, so $\hat{\alpha}(t)$ is nonnegative and $\hat{\alpha}(t)=0$ iff $\hat{\rho}(t)=\rho_{\bar{n}}$. ∎

Therefore, $\mathbb{E}_{w}^{\prime}[V_{\hat{\rho}(t)}(J_{z})]=O(\varepsilon)$. A similar argument gives $\mathbb{E}_{w}^{\prime}[V_{\hat{\rho}(t)}(J_{z})^{2}]=O(\varepsilon^{2})$ and $\mathbb{E}_{w}^{\prime}[V_{\rho(t)}(J_{z})]=O(\varepsilon)$ from the assumptions. Note that $dy(t)$ in (5) can be replaced by $2\theta x(t)dt+dw(t)$. Let $\mathcal{L}$ be the infinitesimal generator [32]. Using classical Itô calculus, the infinitesimal generators of $V_{\rho(t)}(J_{z})$ and $V_{\hat{\rho}(t)}(J_{z})$ are as follows.

\mathcal{L}V_{\rho(t)}(J_{z})= -4\theta^{2}V_{\rho(t)}(J_{z})^{2}-\mathrm{i}u(t)\mathrm{Tr}[J_{y}[J_{z},\rho(t)]_{-}],
\mathcal{L}V_{\hat{\rho}(t)}(J_{z})\leq -4\theta^{2}V_{\hat{\rho}(t)}(J_{z})^{2}-\mathrm{i}u(t)\mathrm{Tr}[J_{y}[J_{z},\hat{\rho}(t)]_{-}]
+2\theta^{2}V_{\hat{\rho}(t)}(J_{z})(x(t)-\hat{x}(t))
+2\theta\hat{x}(t)V_{\hat{\rho}(t)}(J_{z})(\theta-\hat{\theta}(t))
+4(\theta^{2}-\hat{\theta}(t)^{2})V_{\hat{\rho}(t)}(J_{z})^{2}
+2(\hat{\theta}(t)-\theta)V_{\hat{\rho}(t)}(J_{z})(\theta x(t)-\hat{\theta}(t)\hat{x}(t)).

Since $\mathbb{E}_{w}^{\prime}[V_{\rho(t)}(J_{z})^{2}]=O(\varepsilon^{2})$, $\mathbb{E}_{w}^{\prime}[[\rho(t),J_{z}]_{-}]=O(\varepsilon)$, $\mathbb{E}_{w}^{\prime}[[\hat{\rho}(t),J_{z}]_{-}]=O(\varepsilon)$, $\mathbb{E}_{w}^{\prime}[u_{FB}(\hat{\rho}(t))]=O(\varepsilon^{2})$, and $u_{FF}(t)=O(f(t)^{2})$,

\mathbb{E}_{w}^{\prime}[\mathcal{L}(V_{\rho(t)}(J_{z})+V_{\hat{\rho}(t)}(J_{z}))]
= \mathbb{E}_{w}^{\prime}\Bigg[-4\theta^{2}(V_{\rho(t)}(J_{z})^{2}+V_{\hat{\rho}(t)}(J_{z})^{2})+\sigma_{1}u_{FF}(t)
+2\theta^{2}V_{\hat{\rho}(t)}(J_{z})\left((x(t)-\hat{x}(t))+\hat{x}(t)\left(1-\frac{\hat{\theta}(t)}{\theta}\right)\right)\Bigg]
+O(\varepsilon^{3})+O(\varepsilon C_{\theta}(t))+O\left(\varepsilon^{2}\sqrt{C_{\theta}(t)}\right), (12)

where $\sigma_{1}:=\max_{\rho\in\mathcal{S}(\mathbb{C}^{N})}|\mathrm{Tr}[J_{y}[J_{z},\rho]_{-}]|$.

Next, we calculate the infinitesimal generators of $C_{x}(t)$ and $C_{\theta}(t)$. By a simple calculation,

\mathcal{L}C_{x}(t)
\leq 2|u_{FB}(\hat{\rho}(t))|\left|\mathrm{Tr}[J_{y}[\rho(t)-\hat{\rho}(t),J_{z}]_{-}]\right|\sqrt{C_{x}(t)}
+8J\sigma_{1}u_{FF}(t)+4\hat{\theta}(t)\theta V_{\hat{\rho}(t)}(J_{z})\Big\{-C_{x}(t)+|\hat{x}(t)|\sqrt{C_{\theta}(t)}\sqrt{C_{x}(t)}\Big\}
+\left(\theta V_{\rho(t)}(J_{z})-\hat{\theta}(t)V_{\hat{\rho}(t)}(J_{z})\right)^{2}.

From the definition of $C_{\theta}(t)$,

\mathcal{L}C_{\theta}(t)\leq 2f(t)\Bigg\{-\hat{x}(t)^{2}C_{\theta}(t)+|\hat{x}(t)|\sqrt{C_{\theta}(t)}\sqrt{C_{x}(t)}+\frac{\hat{x}(t)^{2}f(t)}{8\theta^{2}}\Bigg\}.

Since the expectation of the right-hand side of the above inequality is at most $O(\varepsilon^{2})$ for small $t-t_{0}>0$, $\mathbb{E}_{w}^{\prime}[C_{\theta}(t)]-C_{\theta}(t_{0})=\int_{t_{0}}^{t}\mathbb{E}_{w}^{\prime}[\mathcal{L}C_{\theta}(\tau)]d\tau\leq(t-t_{0})\times O(\varepsilon^{2})$, where Dynkin's formula [32] is used. Let $a(t):=4\hat{\theta}(t)\theta V_{\hat{\rho}(t)}(J_{z})$ and $b(t):=2f(t)$. Note that $a(t)=0$ iff $V_{\hat{\rho}(t)}(J_{z})=0$. Then,

\mathbb{E}_{w}^{\prime}[\mathcal{L}(C_{x}(t)+C_{\theta}(t))]
\leq \mathbb{E}_{w}^{\prime}\Bigg[2|u_{FB}(\hat{\rho}(t))|\left|\mathrm{Tr}[J_{y}[\rho(t)-\hat{\rho}(t),J_{z}]_{-}]\right|\sqrt{C_{x}(t)}
+8J\sigma_{1}u_{FF}(t)
-a(t)C_{x}(t)-b(t)\hat{x}(t)^{2}C_{\theta}(t)
+(a(t)+b(t))|\hat{x}(t)|\sqrt{C_{x}(t)C_{\theta}(t)}
+\left(\theta V_{\rho(t)}(J_{z})-\hat{\theta}(t)V_{\hat{\rho}(t)}(J_{z})\right)^{2}+\frac{J^{2}b(t)^{2}}{16\theta^{2}}\Bigg]
= \mathbb{E}_{w}^{\prime}\Bigg[\underbrace{\theta^{2}\left(V_{\rho(t)}(J_{z})-V_{\hat{\rho}(t)}(J_{z})\right)^{2}}_{O(\varepsilon^{2})}
+\underbrace{\frac{J^{2}b(t)^{2}}{16\theta^{2}}+8J\sigma_{1}u_{FF}(t)}_{O(f(t)^{2})}
-\big(\underbrace{a(t)C_{x}(t)}_{=O(\varepsilon^{3})}+\underbrace{b(t)|J-\bar{n}|^{2}C_{\theta}(t)}_{=O(f(t)C_{\theta}(t))}\big)+O(\varepsilon^{4})\Bigg]. (13)

Note that

\mathbb{E}_{w}^{\prime}\left[|u_{FB}(\hat{\rho}(t))|\left|\mathrm{Tr}[J_{y}[\rho(t)-\hat{\rho}(t),J_{z}]_{-}]\right|\sqrt{C_{x}(t)}\right]=O(\varepsilon^{4}).

From (12) and (13),

\mathbb{E}_{w}^{\prime}\left[\mathcal{L}G(\rho(t),\hat{\rho}(t),\hat{\theta}(t))\right]
\leq \mathbb{E}_{w}^{\prime}\Bigg[\theta^{2}\left(V_{\rho(t)}(J_{z})-V_{\hat{\rho}(t)}(J_{z})\right)^{2}+\frac{J^{2}b(t)^{2}}{16\theta^{2}}
+(1+8J)\sigma_{1}u_{FF}(t)
-4\theta^{2}\left(V_{\rho(t)}(J_{z})^{2}+V_{\hat{\rho}(t)}(J_{z})^{2}\right)
+2\theta^{2}V_{\hat{\rho}(t)}(J_{z})\left((x(t)-\hat{x}(t))+\hat{x}(t)\left(1-\frac{\hat{\theta}(t)}{\theta}\right)\right)
-b(t)|J-\bar{n}|^{2}C_{\theta}(t)+O(\varepsilon^{3})\Bigg]
= -\mathbb{E}_{w}^{\prime}\left[\Delta(\rho(t),\hat{\rho}(t),\hat{\theta}(t))+b(t)|J-\bar{n}|^{2}C_{\theta}(t)\right]
+\gamma(t)+O(\varepsilon^{3}),

where $\gamma(t):=\frac{J^{2}b(t)^{2}}{16\theta^{2}}+(1+8J)\sigma_{1}u_{FF}(t)$. As $p\in(0.5,1]$, $b(t)^{2}=4f(t)^{2}$ and $u_{FF}(t)$ are integrable, i.e., $\gamma(t)$ is integrable.

Together with the assumption (11), $\mathbb{E}_{w}^{\prime}[C_{\theta}(t)]\leq O(\varepsilon^{2})$ for all $t\geq t_{0}$, and using Dynkin's formula [32],

\mathbb{E}_{w}^{\prime}\left[G(\rho(\infty),\hat{\rho}(\infty),\hat{\theta}(\infty))\right]-G(\rho(t_{0}),\hat{\rho}(t_{0}),\hat{\theta}(t_{0}))
+\int_{t_{0}}^{\infty}\mathbb{E}_{w}^{\prime}[\Delta(\rho(\tau),\hat{\rho}(\tau),\hat{\theta}(\tau))]d\tau
+|J-\bar{n}|^{2}\int_{t_{0}}^{\infty}b(\tau)\mathbb{E}_{w}^{\prime}[C_{\theta}(\tau)]d\tau+\int_{t_{0}}^{\infty}\mathbb{E}_{w}^{\prime}[O(\varepsilon^{3})]d\tau
\leq \int_{t_{0}}^{\infty}\gamma(\tau)d\tau<\infty

holds. Note that the integrand $\mathbb{E}_{w}^{\prime}[O(\varepsilon^{3})]$ of the last term on the left-hand side converges to zero faster than the other terms. The other terms on the left-hand side are positive and must be finite. Hence, $\lim_{t\to\infty}\Delta(\rho(t),\hat{\rho}(t),\hat{\theta}(t))=0$ a.s. Since $x(t)$ and $\hat{x}(t)$ fluctuate randomly whenever $V_{\rho(t)}(J_{z})\neq 0$ or $V_{\hat{\rho}(t)}(J_{z})\neq 0$, $\lim_{t\to\infty}\Delta(\rho(t),\hat{\rho}(t),\hat{\theta}(t))=0$ implies that $V_{\rho(t)}(J_{z})$ and $V_{\hat{\rho}(t)}(J_{z})$ converge to zero. From the assumption that $\rho(t)$ and $\hat{\rho}(t)$ stay in the neighborhood of $\rho_{\bar{n}}$, the convergence of $V_{\rho(t)}(J_{z})$ and $V_{\hat{\rho}(t)}(J_{z})$ implies that $(\rho(t),\hat{\rho}(t))$ converges to $(\rho_{\bar{n}},\rho_{\bar{n}})$. Furthermore, since $b(t)$ is not integrable and, although we skip the proof, $\hat{\theta}(t)$ is continuous in $t$, we have $\lim_{t\to\infty}C_{\theta}(t)=0$ a.s. Therefore, $\lim_{t\to\infty}\hat{\theta}(t)=\theta$ a.s. ∎