This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Relative Error-based Time-limited 2\mathcal{H}_{2} Model Order Reduction via Oblique Projection

Umair Zulfiqar Xin Du [email protected] Qiuyan Song Zhi-Hua Xiao Victor Sreeram School of Mechatronic Engineering and Automation, Shanghai University, Shanghai, 200444, China Shanghai Key Laboratory of Power Station Automation Technology, Shanghai University, Shanghai, 200444, China School of Information and Mathematics, Yangtze University, Jingzhou, Hubei, 434023, China Department of Electrical, Electronic, and Computer Engineering, The University of Western Australia, Perth, 6009, Australia
Abstract

In time-limited model order reduction, a reduced-order approximation of the original high-order model is obtained that accurately approximates the original model within the desired limited time interval. Accuracy outside that time interval is not that important. The error incurred when a reduced-order model is used as a surrogate for the original model can be quantified in absolute or relative terms to access the performance of the model reduction algorithm. The relative error is generally more meaningful than an absolute error because if the original and reduced systems’ responses are of small magnitude, the absolute error is small in magnitude as well. However, this does not necessarily mean that the reduced model is accurate. The relative error in such scenarios is useful and meaningful as it quantifies percentage error irrespective of the magnitude of the system’s response. In this paper, the necessary conditions for a local optimum of the time-limited 2\mathcal{H}_{2} norm of the relative error system are derived. Inspired by these conditions, an oblique projection algorithm is proposed that ensures small 2\mathcal{H}_{2}-norm relative error within the desired time interval. Unlike the existing relative error-based model reduction algorithms, the proposed algorithm does not require solutions of large-scale Lyapunov and Riccati equations. The proposed algorithm is compared with time-limited balanced truncation, time-limited balanced stochastic truncation, and time-limited iterative Rational Krylov algorithm. Numerical results confirm the superiority of the proposed algorithm over these existing algorithms.

keywords:
2\mathcal{H}_{2} norm, model order reduction, oblique projection, optimal, relative error, time-limited
journal: Journal of  Templates

1 Introduction

Computer-aided simulation and analysis have been a major driver in scientific innovation and progress for the last several decades. The dynamical systems and processes, which can be natural or artificial, are mathematically modeled to be used in a computer program that can simulate and analyze that system for endless possible inputs, settings, and scenarios. During the mathematical modeling process, the behavior of the system or process is generally described by partial differential equations (PDEs). To utilize the mathematical tools available in dynamical system theory, the PDEs are converted into ordinary differential equations (ODEs), which are then expressed as a state-space model [1, 2].

Let u(t)m×1u(t)\in\mathbb{R}^{m\times 1} be the inputs, y(t)p×1y(t)\in\mathbb{R}^{p\times 1} be the outputs, and x(t)n×1x(t)\in\mathbb{R}^{n\times 1} be the states of the state-space model of the dynamical system HH. Then the state-space equations of HH can be written as

x˙(t)\displaystyle\dot{x}(t) =Ax(t)+Bu(t),\displaystyle=Ax(t)+Bu(t),
y(t)\displaystyle y(t) =Cx(t)+Du(t),\displaystyle=Cx(t)+Du(t),

wherein An×nA\in\mathbb{R}^{n\times n}, Bn×mB\in\mathbb{R}^{n\times m}, Cp×nC\in\mathbb{R}^{p\times n}, and Dp×mD\in\mathbb{R}^{p\times m}. The state-space realization (A,B,C,D)(A,B,C,D) has the following equivalence with the transfer function of HH

H(s)=C(sIA)1B+D.\displaystyle H(s)=C(sI-A)^{-1}B+D.

Note that the simulation of HH requires the solution of nn differential equations. The computing power of modern-day computers has been exponentially increasing following Moore’s law, and memory resources have been increasing tremendously [2]. However, the complexity of modern-day systems and processes has also been increasing tremendously following the scientific and technological advancements enabled by high-speed chip manufacturing. Therefore, the value of nn in most modern-day systems and processes is normally several thousand. The simulation and analysis of such a high-order model remain a computational challenge, which has been a consistent motivation for developing efficient model order reduction (MOR) algorithms [3, 4, 5]. MOR is a process of obtaining a reduced-order model (ROM) that has similar properties and behavior but can be simulated cheaply without exhausting the available computational resources. The ROM can then be used as a surrogate for the original model with tolerable numerical error.

Let us denote the ROM as H^\hat{H}. Further, let us denote the states and outputs of H^\hat{H} as x^(t)\hat{x}(t) and y^(t)\hat{y}(t), respectively. Then the state-space equations of H^\hat{H} can be written as

x^˙(t)\displaystyle\dot{\hat{x}}(t) =A^x^(t)+B^u(t),\displaystyle=\hat{A}\hat{x}(t)+\hat{B}u(t),
y^(t)\displaystyle\hat{y}(t) =C^x^(t)+D^u(t),\displaystyle=\hat{C}\hat{x}(t)+\hat{D}u(t),

wherein A^r×r\hat{A}\in\mathbb{R}^{r\times r}, B^r×m\hat{B}\in\mathbb{R}^{r\times m}, C^p×r\hat{C}\in\mathbb{R}^{p\times r}, D^p×m\hat{D}\in\mathbb{R}^{p\times m}, and rnr\ll n. The state-space realization (A^,B^,C^,D^)(\hat{A},\hat{B},\hat{C},\hat{D}) has the following equivalence with the transfer function of H^\hat{H}

H^(s)=C^(sIA^)1B^+D^.\displaystyle\hat{H}(s)=\hat{C}(sI-\hat{A})^{-1}\hat{B}+\hat{D}.

Ideally, the MOR algorithm should have three main properties:

  1. 1.

    H^\hat{H} should closely mimic HH in the frequency and/or time domains.

  2. 2.

    H^\hat{H} should preserve important system properties like stability, passivity, contractivity, etc.

  3. 3.

    The computational cost required to obtain H^\hat{H} should be way less than that required to simulate HH.

No MOR algorithm has all these properties; nevertheless, the significance of a MOR algorithm is generally accessed against these three standards. Mathematically, two possible ways to express the MOR problem are the following:

  1. 1.

    H=H^+ΔaddH=\hat{H}+\Delta_{add}.

  2. 2.

    H=H^(I+Δrel)H=\hat{H}(I+\Delta_{rel}).

Most of the MOR algorithms use additive error Δadd=HH^\Delta_{add}=H-\hat{H} as their performance criterion, i.e., they minimize Δadd\Delta_{add}. The quality of approximation in these algorithms is accessed by computing system norms (like 2\mathcal{H}_{2} norm and \mathcal{H}_{\infty} norm) of Δadd\Delta_{add}. If the output responses of HH and H^\hat{H} are inherently small, a small output response of Δadd\Delta_{add} can be a misleading notion of accuracy. The relative error Δrel\Delta_{rel} is a better approximation criterion as it reveals percentage error. However, Δrel\Delta_{rel} is a less used approximation criterion because most of the algorithms that use this criterion are computationally expensive due to the inherent nature of the problem, which will be discussed later. Various performance specifications and specific properties to be preserved lead to various MOR families. Within the family, several algorithms with different discourses are available, each having some distinct features. In the sequel, a brief literature survey of the important MOR algorithms is presented.

Balanced truncation (BT) is one of the most famous MOR techniques due to its useful theoretical properties, like stability preservation and supreme accuracy [6]. BT has led to several important MOR algorithms constituting a rich family of algorithms known as balancing- or gramian-based MOR techniques in the literature. The ROM produced by BT is often suboptimal in the \mathcal{H}_{\infty} norm, i.e., a suboptimum of Δadd(s)||\Delta_{add}(s)||_{\mathcal{H}_{\infty}}. An apriori upper bound on Δadd(s)||\Delta_{add}(s)||_{\mathcal{H}_{\infty}} for the ROM generated by BT was proposed in [7]. Most of the algorithms in the BT family offer a similar apriori upper bound, which is a distinct and important feature of balancing-based MOR algorithms. BT is a computationally expensive algorithm as it requires the solution of two large-scale Lyapunov equations, which limits its applicability to medium-scale systems. To overcome this problem, several numerical algorithms are reported that compute low-rank solutions of these Lyapunov equations, which are cheaper to compute [8, 9, 10, 11]. By using these low-rank solutions, the applicability of BT is extended to large-scale systems [12]. Some other numerical methods to avoid ill-conditioning in the BT algorithm are also reported, like [13, 14]. BT has been extended to preserve other system properties like passivity and contractivity in [15, 16, 17] and [18], respectively. Further, its applicability has also been extended to more general classes of linear and nonlinear systems, cf.[19, 20, 21, 22]. A closely related yet different MOR method known as optimal Hankel norm approximation (OHNA) is proposed in [23]. The ROM produced by OHNA is optimal in the Hankel norm, and an apriori upper bound on Δadd(s)||\Delta_{add}(s)||_{\mathcal{H}_{\infty}}, which is half of that in the case of BT, also holds [24]. An important extension to BT, which is relevant to the problem considered in this paper, is the balanced stochastic truncation (BST) [25]. It minimizes Δrel(s)||\Delta_{rel}(s)||_{\mathcal{H}_{\infty}}, and an apriori upper bound on Δrel(s)||\Delta_{rel}(s)||_{\mathcal{H}_{\infty}} also exists [26]. If the original model is a stable minimum phase system, BST guarantees that the ROM preserves this property.

In a large-scale setting, the computation of the \mathcal{H}_{\infty} norm is expensive. Moreover, the MOR algorithms that tend to reduce the \mathcal{H}_{\infty} norm of error are generally also expensive [27, 28, 29]. The 2\mathcal{H}_{2} norm, on the other hand, can be computed cheaply due to its relation with the system gramians, for which several efficient low-rank algorithms are available. Interestingly, the MOR algorithms that seek to ensure less error in the 2\mathcal{H}_{2} norm happen to be the most computationally efficient ones in the available literature [30]. The selection of the 2\mathcal{H}_{2} norm as a performance specification in this paper is motivated by this factor. A locally optimal solution in the 2\mathcal{H}_{2} norm is also possible, and several efficient algorithms are available that achieve the local optimum upon convergence. The necessary conditions for a local optimum of Δadd(s)22||\Delta_{add}(s)||_{\mathcal{H}_{2}}^{2} (known as Wilson’s conditions) are related to interpolation conditions, i.e., H^(s)\hat{H}(s) interpolates H(s)H(s) at selected points in the ss-plane, cf. [31, 32, 33, 34]. The iterative rational Krylov algorithm (IRKA) is the pioneer and most famous interpolation algorithm for single-input single-output (SISO) systems that can achieve the local optimum of Δadd(s)22||\Delta_{add}(s)||_{\mathcal{H}_{2}}^{2} upon convergence [32]. IRKA is generalized for multi-input multi-output (MIMO) systems in [33]. A more general algorithm that does not assume that H(s)H(s) and H^(s)\hat{H}(s) have simple poles is presented in [34] that can capture the local optimum of Δadd(s)22||\Delta_{add}(s)||_{\mathcal{H}_{2}}^{2} upon convergence. This algorithm is based on the Sylvester equations and uses the connection between the Sylvester equations and projection established in [35]. Its numerical properties are further improved in [36], and a more robust algorithm is presented. To speed up convergence in IRKA, some trust region-based techniques are proposed in [37]. Some other 2\mathcal{H}_{2}-optimal algorithms based on manifold theory are also reported, like [38, 39, 40, 41]. To guarantee stability, some suboptimal solutions to Δadd(s)22||\Delta_{add}(s)||_{\mathcal{H}_{2}}^{2} are reported, like [42, 43, 44].

The simulation of a model does not encompass the entire time horizon since no practical system is run over the entire time range. Thus it is reasonable to ensure good accuracy in the actual time range of operation, which is often limited to a small interval. For instance, low-frequency oscillations in interconnected power systems only last for 15 seconds, after which these are successfully damped by the power system stabilizers and damping controllers. The first 15 seconds thus become important in small-signal stability analysis of interconnected power systems [45, 46, 47]. Similarly, in the time-limited optimal control problem, the behavior of the plant in the desired time interval is important [48]. This motivates time-limited MOR, wherein it is focused to ensure supreme accuracy in the desired time interval rather than trying to achieve good accuracy over the entire time range. BT was generalized for the time-limited MOR problem in [49] to obtain a time-limited BT (TLBT) algorithm. TLBT does not retain the stability preservation and apriori error bound properties of BT. However, an 2\mathcal{H}_{2} norm-based upper bound on Δadd\Delta_{add} is proposed in [50, 51] for TLBT. Some ad hoc approaches to guarantee stability in TLBT are also reported, like [52]; however, these are quite inaccurate and computationally expensive. TLBT, like BT, is computationally expensive in a large-scale setting and requires solutions of two large-scale Lyapunov equations. In [53], these Lyapunov equations are replaced with their low-rank solutions to extend the applicability of TLBT to large-scale systems. TLBT is further generalized to preserve more properties like passivity and to extend its applicability to a more general class of linear and nonlinear systems in the literature, cf. [54, 55, 56, 57, 58]. BST is extended to the time-limited MOR scenario in [59]; however, an upper bound on Δrel\Delta_{rel} does not exist for time-limited BST (TLBST).

In [60], a definition for the time-limited 2\mathcal{H}_{2} norm is presented, and the necessary conditions for a local optimum of Δadd(s)2,τ2||\Delta_{add}(s)||_{\mathcal{H}_{2,\tau}}^{2} (wherein 2,τ\mathcal{H}_{2,\tau} represents the time-limited 2\mathcal{H}_{2} norm) are derived. Then, IRKA is heuristically extended to time-limited IRKA (TLIRKA) for achieving a local optimum of Δadd(s)2,τ2||\Delta_{add}(s)||_{\mathcal{H}_{2,\tau}}^{2}. However, TLIRKA does not satisfy the necessary conditions for a local optimum. In [61], interpolation-based necessary conditions for a local optimum of Δadd(s)2,τ2||\Delta_{add}(s)||_{\mathcal{H}_{2,\tau}}^{2} are derived, and a nonlinear optimization-based algorithm is proposed to construct the local optimum. This algorithm, however, is computationally expensive, and its applicability is limited to the order of a few hundred. Another nonlinear optimization-based algorithm for this problem is reported in [62], which is very similar to that given in [61]. The equivalence between gramian-based conditions and interpolation conditions is established in [63, 64]. A stability-preserving interpolation algorithm that achieves a subset of the necessary conditions is proposed in [65]. To the best of the authors’ knowledge, there is no algorithm in the literature that seeks to reduce Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}, which has motivated the results obtained in this paper.

In this paper, necessary conditions for the local optimum of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2} are derived. It is shown that it is inherently not possible to achieve a local optimum of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2} in an oblique projection framework. Nevertheless, inspired by these necessary conditions, a computationally efficient oblique projection-based algorithm is proposed that seeks to obtain the local optimum of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2} and offers high fidelity. Unlike TLBST, the proposed algorithm does not require the solutions of large-scale Lyapunov and Ricatti equations. The proposed algorithm is numerically compared to TLBT, TLBST, and TLIRKA by considering benchmark examples. The numerical results confirm that the proposed algorithm offers a small relative error within the desired time interval.

2 Preliminaries

Let the original high-order system be an nn-th order m×mm\times m square linear time-invariant system HH and (A,B,C,D)(A,B,C,D) be its state-space realization. Further, let us denote the impulse response of HH as h(t)=CeAtBh(t)=Ce^{At}B. The time-limited controllability gramian P=0tdeAτBBTeATτ𝑑τP=\int_{0}^{t_{d}}e^{A\tau}BB^{T}e^{A^{T}\tau}d\tau and time-limited observability gramian Q=0tdeATτCTCeAτ𝑑τQ=\int_{0}^{t_{d}}e^{A^{T}\tau}C^{T}Ce^{A\tau}d\tau of the state-space realization (A,B,C)(A,B,C) within the desired time interval [0,td][0,t_{d}] sec solve the following Lyapunov equations

AP+PAT+BBTeAtdBBTeATtd\displaystyle AP+PA^{T}+BB^{T}-e^{At_{d}}BB^{T}e^{A^{T}t_{d}} =0,\displaystyle=0,
ATQ+QA+CTCeATtdCTCeAtd\displaystyle A^{T}Q+QA+C^{T}C-e^{A^{T}t_{d}}C^{T}Ce^{At_{d}} =0,\displaystyle=0,

cf. [49].

Definition 2.1.

The time-limited 2\mathcal{H}_{2} norm of H(s)H(s) is the root mean square of its impulse response within the desired time interval [0,td][0,t_{d}] sec [60, 61], i.e.,

H(s)2,τ\displaystyle||H(s)||_{\mathcal{H}_{2,\tau}} =0tdtrace(h(τ)hT(τ))𝑑τ,\displaystyle=\sqrt{\int_{0}^{t_{d}}trace\Big{(}h(\tau)h^{T}(\tau)\Big{)}d\tau},
=trace(CPCT)\displaystyle=\sqrt{trace(CPC^{T})}
=0tdtrace(hT(τ)h(τ))𝑑τ,\displaystyle=\sqrt{\int_{0}^{t_{d}}trace\Big{(}h^{T}(\tau)h(\tau)\Big{)}d\tau},
=trace(BTQB).\displaystyle=\sqrt{trace(B^{T}QB)}.

2.1 Problem Formulation

In an oblique projection framework, the state-space matrices of the ROM are obtained as

A^\displaystyle\hat{A} =W^TAV^,\displaystyle=\hat{W}^{T}A\hat{V}, B^\displaystyle\hat{B} =W^TB,\displaystyle=\hat{W}^{T}B, C^\displaystyle\hat{C} =CV^,\displaystyle=C\hat{V}, D^\displaystyle\hat{D} =D,\displaystyle=D,

wherein V^n×r\hat{V}\in\mathbb{R}^{n\times r}, W^n×r\hat{W}\in\mathbb{R}^{n\times r}, W^TV^=I\hat{W}^{T}\hat{V}=I, the columns of V^\hat{V} span an rr-dimensional subspace along with the kernels of W^T\hat{W}^{T}, and Π=V^W^T\Pi=\hat{V}\hat{W}^{T} is an oblique projector [66].

In time-limited MOR, it is aimed to ensure that the difference y(t)y^(t)y(t)-\hat{y}(t) is small within the desired time interval [0,td][0,t_{d}] sec. The 2,τ\mathcal{H}_{2,\tau}-norm is an effective measure to quantify the error in the desired time interval [60, 61]. The problem of relative error-based time-limited 2\mathcal{H}_{2} MOR via oblique projection is to construct the reduction matrices V^n×r\hat{V}\in\mathbb{R}^{n\times r} and W^n×r\hat{W}\in\mathbb{R}^{n\times r} such that the ROM H^(s)\hat{H}(s) obtained with these matrices ensures a small 2,τ\mathcal{H}_{2,\tau}-norm of the relative error Δrel(s)\Delta_{rel}(s), i.e.,

minH^(s)order=rΔrel(s)2,τ.\displaystyle\underset{\begin{subarray}{c}\hat{H}(s)\\ \textnormal{order}=r\end{subarray}}{\text{min}}||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}.

2.2 Existing Oblique Projection Techniques

In this subsection, three important oblique projection algorithms for time-limited MOR are briefly reviewed. These algorithms are the most relevant to the new results obtained in this paper.

2.2.1 Time-limited Balanced Truncation (TLBT)

In TLBT [49], the reduction matrices are computed from the contragradient transformation of PP and QQ as W^TPW^=V^TQV^=diag(σ1,,σr)\hat{W}^{T}P\hat{W}=\hat{V}^{T}Q\hat{V}=diag(\sigma_{1},\cdots,\sigma_{r}) where σi+1σi\sigma_{i+1}\geq\sigma_{i}. The resultant ROM offers a small additive error Δadd\Delta_{add} in the desired time interval [0,td][0,t_{d}] sec.

2.2.2 Time-limited Balanced Stochastic Truncation (TLBST)

The controllability gramian WcW_{c} of the pair (A,B)(A,B) solves the following Lyapunov equation

AWc+WcAT+BBT=0.\displaystyle AW_{c}+W_{c}A^{T}+BB^{T}=0.

Define BsB_{s} and AsA_{s} as the following

Bs\displaystyle B_{s} =WcCT+BDT,\displaystyle=W_{c}C^{T}+BD^{T}, As\displaystyle A_{s} =ABs(DDT)1C.\displaystyle=A-B_{s}(DD^{T})^{-1}C.

Now, let XsX_{s} solves the following Riccati equation

AsTXs+XsAs+XsBs(DDT)1BsTXs+CT(DDT)1C=0.\displaystyle A_{s}^{T}X_{s}+X_{s}A_{s}+X_{s}B_{s}(DD^{T})^{-1}B_{s}^{T}X_{s}+C^{T}(DD^{T})^{-1}C=0.

The time-limited observability gramian XτX_{\tau} of the pair (A,D1(CBsTXs))\big{(}A,D^{-1}(C-B_{s}^{T}X_{s})\big{)} solves the following Lyapunov equation

ATXτ+XτA\displaystyle A^{T}X_{\tau}+X_{\tau}A +(D1(CBsTXs))T(D1(CBsTXs))\displaystyle+\big{(}D^{-1}(C-Bs^{T}X_{s})\big{)}^{T}\big{(}D^{-1}(C-Bs^{T}X_{s})\big{)}
eATtd(D1(CBsTXs))T(D1(CBsTXs))eAtd=0.\displaystyle-e^{A^{T}t_{d}}\big{(}D^{-1}(C-Bs^{T}X_{s})\big{)}^{T}\big{(}D^{-1}(C-Bs^{T}X_{s})\big{)}e^{At_{d}}=0.

In TLBST [59], the reduction matrices are computed from the contragradient transformation of PP and XτX_{\tau} as W^TPW^=V^TXτV^=diag(σ1,,σr)\hat{W}^{T}P\hat{W}=\hat{V}^{T}X_{\tau}\hat{V}=diag(\sigma_{1},\cdots,\sigma_{r}) where σi+1σi\sigma_{i+1}\geq\sigma_{i}. The resultant ROM offers a small relative error Δrel\Delta_{rel} in the desired time interval [0,td][0,t_{d}] sec.

2.2.3 Time-limited Iterative Rational Krylov Algorithm (TLIRKA)

TLIRKA starts with an arbitrary guess of the ROM’s state-space matrices (A^,B^,C^)(\hat{A},\hat{B},\hat{C}), wherein A^\hat{A} has simple eigenvalues [60]. Let A^=S^ΛS^1\hat{A}=\hat{S}\Lambda\hat{S}^{-1} be the eigenvalue decomposition of A^\hat{A}. Further, let P12P_{12} and YτY_{\tau} satisfy the following Sylvester equations

AP12+P12A^T+BB^TeAtdBB^TeA^Ttd\displaystyle AP_{12}+P_{12}\hat{A}^{T}+B\hat{B}^{T}-e^{At_{d}}B\hat{B}^{T}e^{\hat{A}^{T}t_{d}} =0,\displaystyle=0, (1)
ATYτ+YτA^CTC^+eATtdCTC^eA^td\displaystyle A^{T}Y_{\tau}+Y_{\tau}\hat{A}-C^{T}\hat{C}+e^{A^{T}t_{d}}C^{T}\hat{C}e^{\hat{A}t_{d}} =0.\displaystyle=0. (2)

The reduction matrices are computed as V^=orth(P12S^)\hat{V}=orth(P_{12}\hat{S}^{-*}) and W^=orth(YτS^\hat{W}=orth\big{(}Y_{\tau}\hat{S} (V^YτS^)1)(\hat{V}^{*}Y_{\tau}\hat{S})^{-1}\big{)}, wherein orth()orth(\cdot) and ()(\cdot)^{*} represent orthogonal basis and Hermitian, respectively. The ROM is updated using these reduction matrices, and the process is repeated until the relative change in the eigenvalues of A^\hat{A} stagnates. Upon convergence, a ROM that offers a small Δadd(s)2,τ||\Delta_{add}(s)||_{\mathcal{H}_{2,\tau}} is achieved.

3 Main Results

In this section, first, a state-space realization and an expression for the 2,τ\mathcal{H}_{2,\tau} norm of Δrel(s)\Delta_{rel}(s) are obtained. Second, necessary conditions for the local optimum of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2} are derived. Third, an oblique projection algorithm is presented, which seeks to satisfy these conditions. Fourth, the algorithmic and computational aspects of the proposed algorithm are discussed.

3.1 State-space Realization and 2,τ\mathcal{H}_{2,\tau} norm of Δrel(s)\Delta_{rel}(s)

Let us assume that H^(s)\hat{H}(s) is invertible and stable, i.e., H^(s)\hat{H}(s) is a stable minimum phase transfer function. Then, it can readily be noted that Δrel(s)\Delta_{rel}(s) can be expressed as Δrel(s)=H^1(s)Δadd(s)\Delta_{rel}(s)=\hat{H}^{-1}(s)\Delta_{add}(s). Further, if the matrix DD is full rank, the following state-space realization exists for H^1(s)\hat{H}^{-1}(s)

Ai\displaystyle A_{i} =A^B^D1C^,\displaystyle=\hat{A}-\hat{B}D^{-1}\hat{C}, Bi\displaystyle B_{i} =B^D1,\displaystyle=-\hat{B}D^{-1}, Ci\displaystyle C_{i} =D1C^,\displaystyle=D^{-1}\hat{C}, Di\displaystyle D_{i} =D1,\displaystyle=D^{-1},

cf. [67, 68]. Then, a possible state-space realization of H^1(s)Δadd(s)\hat{H}^{-1}(s)\Delta_{add}(s) is given as

Arel\displaystyle A_{rel} =[A000A^0BiCBiC^Ai],\displaystyle=\begin{bmatrix}A&0&0\\ 0&\hat{A}&0\\ B_{i}C&-B_{i}\hat{C}&A_{i}\end{bmatrix}, Brel\displaystyle B_{rel} =[BB^0],\displaystyle=\begin{bmatrix}B\\ \hat{B}\\ 0\end{bmatrix},
Crel\displaystyle C_{rel} =[DiCDiC^Ci].\displaystyle=\begin{bmatrix}D_{i}C&-D_{i}\hat{C}&C_{i}\end{bmatrix}.

Due to the triangular structure of ArelA_{rel}, the matrix exponential eAreltde^{A_{rel}t_{d}} can be expressed as

eAreltd=[eAtd000eA^td0E1E2eAitd],\displaystyle e^{A_{rel}t_{d}}=\begin{bmatrix}e^{At_{d}}&0&0\\ 0&e^{\hat{A}t_{d}}&0\\ E_{1}&E_{2}&e^{A_{i}t_{d}}\end{bmatrix},

cf. [69]. Since AreleAreltd=eAreltdArelA_{rel}e^{A_{rel}t_{d}}=e^{A_{rel}t_{d}}A_{rel}, cf. [70], the following holds

[AiE1E1A+BiCeAtdeAitdBiCAiE2E2A^+BiC^eA^tdeAitdBiC^]=0.\displaystyle\begin{bmatrix}\star&&&&\star&&\star\\ \star&&&&\star&&\star\\ A_{i}E_{1}-E_{1}A+B_{i}Ce^{At_{d}}-e^{A_{i}t_{d}}B_{i}C&&&&A_{i}E_{2}-E_{2}\hat{A}+B_{i}\hat{C}e^{\hat{A}t_{d}}-e^{A_{i}t_{d}}B_{i}\hat{C}&&\star\end{bmatrix}=0.

Thus E1E_{1} and E2E_{2} solve the following Sylvester equations

AiE1E1A+BiCeAtdeAitdBiC\displaystyle A_{i}E_{1}-E_{1}A+B_{i}Ce^{At_{d}}-e^{A_{i}t_{d}}B_{i}C =0,\displaystyle=0, (3)
AiE2E2A^+BiC^eA^tdeAitdBiC^\displaystyle A_{i}E_{2}-E_{2}\hat{A}+B_{i}\hat{C}e^{\hat{A}t_{d}}-e^{A_{i}t_{d}}B_{i}\hat{C} =0.\displaystyle=0. (4)
Proposition 3.1.

E2=eA^tdeAitdE_{2}=e^{\hat{A}t_{d}}-e^{A_{i}t_{d}}.

Proof.
AiE2E2A^\displaystyle A_{i}E_{2}-E_{2}\hat{A} =BiC^eA^tdeAitdBiC^\displaystyle=B_{i}\hat{C}e^{\hat{A}t_{d}}-e^{A_{i}t_{d}}B_{i}\hat{C}
AiE2E2A^\displaystyle A_{i}E_{2}-E_{2}\hat{A} =BiC^eA^tdeAitdBiC^+BiC^eAitdBiC^eAitd\displaystyle=B_{i}\hat{C}e^{\hat{A}t_{d}}-e^{A_{i}t_{d}}B_{i}\hat{C}+B_{i}\hat{C}e^{A_{i}t_{d}}-B_{i}\hat{C}e^{A_{i}t_{d}}

Since AieAitdeAitdAi=0A_{i}e^{A_{i}t_{d}}-e^{A_{i}t_{d}}A_{i}=0, it can readily be noted that

BiC^eAitdeAitdBiC^=eAitdA^A^eAitd.\displaystyle B_{i}\hat{C}e^{A_{i}t_{d}}-e^{A_{i}t_{d}}B_{i}\hat{C}=e^{A_{i}t_{d}}\hat{A}-\hat{A}e^{A_{i}t_{d}}.

Thus

AiE2E2A^\displaystyle A_{i}E_{2}-E_{2}\hat{A} =BiC^eA^tdeAitdBiC^+eAitdA^A^eAitd\displaystyle=B_{i}\hat{C}e^{\hat{A}t_{d}}-e^{A_{i}t_{d}}B_{i}\hat{C}+e^{A_{i}t_{d}}\hat{A}-\hat{A}e^{A_{i}t_{d}}
=BiC^eA^tdeAitdBiC^+eAitdA^A^eAitd+A^eA^tdeA^tdA^\displaystyle=B_{i}\hat{C}e^{\hat{A}t_{d}}-e^{A_{i}t_{d}}B_{i}\hat{C}+e^{A_{i}t_{d}}\hat{A}-\hat{A}e^{A_{i}t_{d}}+\hat{A}e^{\hat{A}t_{d}}-e^{\hat{A}t_{d}}\hat{A}
=Ai(eA^tdeAitd)(eA^tdeAitd)A^.\displaystyle=A_{i}(e^{\hat{A}t_{d}}-e^{A_{i}t_{d}})-(e^{\hat{A}t_{d}}-e^{A_{i}t_{d}})\hat{A}.

Due to the uniqueness of the Sylvester equation (4), E2=eA^tdeAitdE_{2}=e^{\hat{A}t_{d}}-e^{A_{i}t_{d}}. ∎

Let Prel=[PP12P13P12TP22P23P13TP23TP33]P_{rel}=\begin{bmatrix}P&P_{12}&P_{13}\\ P_{12}^{T}&P_{22}&P_{23}\\ P_{13}^{T}&P_{23}^{T}&P_{33}\end{bmatrix} be the time-limited controllability gramian of the pair (Arel,Brel)(A_{rel},B_{rel}), which solves the following Lyapunov equation

ArelPrel+PrelArelT+BrelBrelTeAreltdBrelBrelTeArelTtd\displaystyle A_{rel}P_{rel}+P_{rel}A_{rel}^{T}+B_{rel}B_{rel}^{T}-e^{A_{rel}t_{d}}B_{rel}B_{rel}^{T}e^{A_{rel}^{T}t_{d}} =0.\displaystyle=0. (5)

By expanding the equation (5), it can be noted that P13P_{13}, P22P_{22}, P23P_{23}, and P33P_{33} solve the following linear matrix equations

AP13+P13AiT+K13\displaystyle AP_{13}+P_{13}A_{i}^{T}+K_{13} =0,\displaystyle=0, (6)
A^P22+P22A^T+K22\displaystyle\hat{A}P_{22}+P_{22}\hat{A}^{T}+K_{22} =0,\displaystyle=0, (7)
A^P23+P23AiT+K23\displaystyle\hat{A}P_{23}+P_{23}A_{i}^{T}+K_{23} =0,\displaystyle=0, (8)
AiP33+P33AiT+K33\displaystyle A_{i}P_{33}+P_{33}A_{i}^{T}+K_{33} =0,\displaystyle=0, (9)

wherein

K13\displaystyle K_{13} =P11CTBiTP12C^TBiTeAtdBBTE1TeAtdBB^TE2T,\displaystyle=P_{11}C^{T}B_{i}^{T}-P_{12}\hat{C}^{T}B_{i}^{T}-e^{At_{d}}BB^{T}E_{1}^{T}-e^{At_{d}}B\hat{B}^{T}E_{2}^{T},
K22\displaystyle K_{22} =B^B^TeA^tdB^B^TeA^Ttd,\displaystyle=\hat{B}\hat{B}^{T}-e^{\hat{A}t_{d}}\hat{B}\hat{B}^{T}e^{\hat{A}^{T}t_{d}},
K23\displaystyle K_{23} =P12TCTBiTP22C^TBiTeA^tdB^BTE1TeA^tdB^B^TE2T,\displaystyle=P_{12}^{T}C^{T}B_{i}^{T}-P_{22}\hat{C}^{T}B_{i}^{T}-e^{\hat{A}t_{d}}\hat{B}B^{T}E_{1}^{T}-e^{\hat{A}t_{d}}\hat{B}\hat{B}^{T}E_{2}^{T},
K33\displaystyle K_{33} =BiCP13BiC^P23+P13TCTBiTP23TC^TBiT\displaystyle=B_{i}CP_{13}-B_{i}\hat{C}P_{23}+P_{13}^{T}C^{T}B_{i}^{T}-P_{23}^{T}\hat{C}^{T}B_{i}^{T}
E1BBTE1TE2B^BTE1TE1BB^TE2TE2B^B^TE2T.\displaystyle\hskip 46.94687pt-E_{1}BB^{T}E_{1}^{T}-E_{2}\hat{B}B^{T}E_{1}^{T}-E_{1}B\hat{B}^{T}E_{2}^{T}-E_{2}\hat{B}\hat{B}^{T}E_{2}^{T}.

Let Qrel=[Q11Q12Q13Q12TQ22Q23Q13TQ23Q33]Q_{rel}=\begin{bmatrix}Q_{11}&Q_{12}&Q_{13}\\ Q_{12}^{T}&Q_{22}&Q_{23}\\ Q_{13}^{T}&Q_{23}&Q_{33}\end{bmatrix} be the time-limited observability gramian of the pair (Arel,Crel)(A_{rel},C_{rel}), which solves the following Lyapunov equation

ArelTQrel+QrelArel+CrelTCreleArelTtdCrelTCreleAreltd\displaystyle A_{rel}^{T}Q_{rel}+Q_{rel}A_{rel}+C_{rel}^{T}C_{rel}-e^{A_{rel}^{T}t_{d}}C_{rel}^{T}C_{rel}e^{A_{rel}t_{d}} =0.\displaystyle=0. (10)

By expanding the equation (10), it can be noted that Q11Q_{11}, Q12Q_{12}, Q13Q_{13}, Q22Q_{22}, Q23Q_{23}, and Q33Q_{33} solve the following linear matrix equations

ATQ11+Q11A+L11\displaystyle A^{T}Q_{11}+Q_{11}A+L_{11} =0,\displaystyle=0, (11)
ATQ12+Q12A^+L12\displaystyle A^{T}Q_{12}+Q_{12}\hat{A}+L_{12} =0,\displaystyle=0, (12)
ATQ13+Q13Ai+L13\displaystyle A^{T}Q_{13}+Q_{13}A_{i}+L_{13} =0,\displaystyle=0, (13)
A^TQ22+Q22A^+L22\displaystyle\hat{A}^{T}Q_{22}+Q_{22}\hat{A}+L_{22} =0,\displaystyle=0, (14)
A^TQ23+Q23Ai+L23\displaystyle\hat{A}^{T}Q_{23}+Q_{23}A_{i}+L_{23} =0,\displaystyle=0, (15)
AiTQ33+Q33Ai+L33\displaystyle A_{i}^{T}Q_{33}+Q_{33}A_{i}+L_{33} =0,\displaystyle=0, (16)

wherein

L11\displaystyle L_{11} =CTBiTQ13T+Q13BiC+CTDiTDiCeATtdCTDiTDiCeAtd\displaystyle=C^{T}B_{i}^{T}Q_{13}^{T}+Q_{13}B_{i}C+C^{T}D_{i}^{T}D_{i}C-e^{A^{T}t_{d}}C^{T}D_{i}^{T}D_{i}Ce^{At_{d}}
E1TCiTDiCeAtdeATtdCTDiTCiE1E1TCiTCiE1,\displaystyle\hskip 64.01869pt-E_{1}^{T}C_{i}^{T}D_{i}Ce^{At_{d}}-e^{A^{T}t_{d}}C^{T}D_{i}^{T}C_{i}E_{1}-E_{1}^{T}C_{i}^{T}C_{i}E_{1},
L12\displaystyle L_{12} =CTBiTQ23Q13BiC^CTDiTDiC^+eATtdCTDiTDiC^eA^td\displaystyle=C^{T}B_{i}^{T}Q_{23}-Q_{13}B_{i}\hat{C}-C^{T}D_{i}^{T}D_{i}\hat{C}+e^{A^{T}t_{d}}C^{T}D_{i}^{T}D_{i}\hat{C}e^{\hat{A}t_{d}}
+E1TCiTDiC^eA^tdeATtdCTDiTCiE2E1TCiTCiE2,\displaystyle\hskip 64.01869pt+E_{1}^{T}C_{i}^{T}D_{i}\hat{C}e^{\hat{A}t_{d}}-e^{A^{T}t_{d}}C^{T}D_{i}^{T}C_{i}E_{2}-E_{1}^{T}C_{i}^{T}C_{i}E_{2},
L13\displaystyle L_{13} =CTBiTQ33+CTDiTCieATtdCTDiTCieAitdE1TCiTCieAitd,\displaystyle=C^{T}B_{i}^{T}Q_{33}+C^{T}D_{i}^{T}C_{i}-e^{A^{T}t_{d}}C^{T}D_{i}^{T}C_{i}e^{A_{i}t_{d}}-E_{1}^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}},
L22\displaystyle L_{22} =C^TBiTQ23Q23BiC^+C^TDiTDiC^eA^TtdC^TDiTDiC^eA^td\displaystyle=-\hat{C}^{T}B_{i}^{T}Q_{23}-Q_{23}B_{i}\hat{C}+\hat{C}^{T}D_{i}^{T}D_{i}\hat{C}-e^{\hat{A}^{T}t_{d}}\hat{C}^{T}D_{i}^{T}D_{i}\hat{C}e^{\hat{A}t_{d}}
+E2TCiTDiC^eA^td+eA^TtdC^TDiTCiE2E2TCiTCiE2,\displaystyle\hskip 64.01869pt+E_{2}^{T}C_{i}^{T}D_{i}\hat{C}e^{\hat{A}t_{d}}+e^{\hat{A}^{T}t_{d}}\hat{C}^{T}D_{i}^{T}C_{i}E_{2}-E_{2}^{T}C_{i}^{T}C_{i}E_{2},
L23\displaystyle L_{23} =C^TBiTQ33C^TDiTCi+eA^TtdC^TDiTCieAitdE2TCiTCieAitd,\displaystyle=-\hat{C}^{T}B_{i}^{T}Q_{33}-\hat{C}^{T}D_{i}^{T}C_{i}+e^{\hat{A}^{T}t_{d}}\hat{C}^{T}D_{i}^{T}C_{i}e^{A_{i}t_{d}}-E_{2}^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}},
L33\displaystyle L_{33} =CiTCieAiTtdCiTCieAitd.\displaystyle=C_{i}^{T}C_{i}-e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}.
Proposition 3.2.

Q12=Q13Q_{12}=-Q_{13} and Q22=Q23=Q33Q_{22}=-Q_{23}=Q_{33}.

Proof.

By adding the equations (15) and (16), we get

AiT(Q23+Q33)+(Q23+Q33)Ai\displaystyle A_{i}^{T}(Q_{23}+Q_{33})+(Q_{23}+Q_{33})A_{i} =0.\displaystyle=0.

Thus Q23+Q33=0Q_{23}+Q_{33}=0 and Q33=Q23Q_{33}=-Q_{23}. Similarly, by subtracting the equation (14) from (16), we get

AiT(Q33Q22)+(Q33Q22)Ai\displaystyle A_{i}^{T}(Q_{33}-Q_{22})+(Q_{33}-Q_{22})A_{i} =0.\displaystyle=0.

Thus Q33Q22=0Q_{33}-Q_{22}=0 and Q33=Q22Q_{33}=Q_{22}. Finally, by adding the equations (11) and (12), we get

AT(Q12+Q13)+(Q12+Q13)A^\displaystyle A^{T}(Q_{12}+Q_{13})+(Q_{12}+Q_{13})\hat{A} =0.\displaystyle=0.

Thus Q12+Q13=0Q_{12}+Q_{13}=0 and Q12=Q13Q_{12}=-Q_{13}. ∎

From the definition of the 2,τ\mathcal{H}_{2,\tau} norm, Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}} can be written as

Δrel(s)2,τ\displaystyle||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}} =trace(CrelPrelCrelT)\displaystyle=\sqrt{trace(C_{rel}P_{rel}C_{rel}^{T})}
=(trace(DiCPCTDiT2DiCP12C^TDiT+2DiCP13CiT\displaystyle=\big{(}trace(D_{i}CPC^{T}D_{i}^{T}-2D_{i}CP_{12}\hat{C}^{T}D_{i}^{T}+2D_{i}CP_{13}C_{i}^{T}
+DiC^P22C^TDiT2DiC^P23CiT+CiP33CiT))12\displaystyle\hskip 85.35826pt+D_{i}\hat{C}P_{22}\hat{C}^{T}D_{i}^{T}-2D_{i}\hat{C}P_{23}C_{i}^{T}+C_{i}P_{33}C_{i}^{T})\big{)}^{\frac{1}{2}}
=trace(BrelTQrelBrel)\displaystyle=\sqrt{trace(B_{rel}^{T}Q_{rel}B_{rel})}
=(trace(BTQ11B+2BTQ12B^+B^TQ22B^))12.\displaystyle=\big{(}trace(B^{T}Q_{11}B+2B^{T}Q_{12}\hat{B}+\hat{B}^{T}Q_{22}\hat{B})\big{)}^{\frac{1}{2}}.

3.2 Necessary Conditions for the Local optimum of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2}

The derivation of the necessary conditions for a local optimum of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2} requires several new variables to be defined. For clarity, we define new variables before presenting the optimality conditions. Let us define ξ1\xi_{1}, which solves the following Sylvester equation

AiTξ1ξ1AT+O1\displaystyle A_{i}^{T}\xi_{1}-\xi_{1}A^{T}+O_{1} =0\displaystyle=0 (17)

where

O1=CiTCiE1X11CiTCiE2X12TCiTDiCeAtdX11+CiTDiC^eA^tdX12T.\displaystyle O_{1}=-C_{i}^{T}C_{i}E_{1}X_{11}-C_{i}^{T}C_{i}E_{2}X_{12}^{T}-C_{i}^{T}D_{i}Ce^{At_{d}}X_{11}+C_{i}^{T}D_{i}\hat{C}e^{\hat{A}t_{d}}X_{12}^{T}.

ξ1\xi_{1} will be used in deriving the partial derivative of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2} with respect to A^\hat{A}. Next, let us define X11X_{11}, X12X_{12}, X13X_{13}, X22X_{22}, and X33X_{33}, which solve the following linear matrix equations

AX11+X11AT+BBT\displaystyle AX_{11}+X_{11}A^{T}+BB^{T} =0,\displaystyle=0,\hskip 14.22636pt (18)
AX12+X12A^T+BB^T\displaystyle AX_{12}+X_{12}\hat{A}^{T}+B\hat{B}^{T} =0,\displaystyle=0, (19)
AX13+X13AiT+X11CTBiTX12C^TBiT\displaystyle AX_{13}+X_{13}A_{i}^{T}+X_{11}C^{T}B_{i}^{T}-X_{12}\hat{C}^{T}B_{i}^{T} =0,\displaystyle=0, (20)
A^X22+X22A^T+B^B^T\displaystyle\hat{A}X_{22}+X_{22}\hat{A}^{T}+\hat{B}\hat{B}^{T} =0,\displaystyle=0, (21)
A^X23+X23AiT+X12TCTBiTX22C^TBiT\displaystyle\hat{A}X_{23}+X_{23}A_{i}^{T}+X_{12}^{T}C^{T}B_{i}^{T}-X_{22}\hat{C}^{T}B_{i}^{T} =0,\displaystyle=0, (22)
AiX33+X33AiT+BiCX13BiC^X23+X13TCTBiTX23TC^TBiT\displaystyle A_{i}X_{33}+X_{33}A_{i}^{T}+B_{i}CX_{13}-B_{i}\hat{C}X_{23}+X_{13}^{T}C^{T}B_{i}^{T}-X_{23}^{T}\hat{C}^{T}B_{i}^{T} =0.\displaystyle=0. (23)

These variables will be used in deriving the partial derivative of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2} with respect to A^\hat{A} and B^\hat{B}. Next, let us define ξ2\xi_{2}, which solves the following Sylvester equation

AiTξ2ξ2AT+O2\displaystyle A_{i}^{T}\xi_{2}-\xi_{2}A^{T}+O_{2} =0\displaystyle=0 (24)

where

O2\displaystyle O_{2} =CiTCieAitdX13TCiTDiCeAtdX11CiTCiE1X11+CiTDiC^eA^tdX12T\displaystyle=-C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{13}^{T}-C_{i}^{T}D_{i}Ce^{At_{d}}X_{11}-C_{i}^{T}C_{i}E_{1}X_{11}+C_{i}^{T}D_{i}\hat{C}e^{\hat{A}t_{d}}X_{12}^{T}
2CiTCiE2X12T.\displaystyle\hskip 85.35826pt-2C_{i}^{T}C_{i}E_{2}X_{12}^{T}.

ξ2\xi_{2} will be used in deriving the partial derivative of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2} with respect to B^\hat{B}. Next, let us define Y13Y_{13}, Y23Y_{23}, Y33Y_{33}, and ξ3\xi_{3}, which solve the following linear matrix equations

ATY13+Y13Ai+CTBiTY33+CTDiTCi\displaystyle A^{T}Y_{13}+Y_{13}A_{i}+C^{T}B_{i}^{T}Y_{33}+C^{T}D_{i}^{T}C_{i} =0,\displaystyle=0, (25)
A^TY23+Y23AiC^TBiTY33C^TDiTCi\displaystyle\hat{A}^{T}Y_{23}+Y_{23}A_{i}-\hat{C}^{T}B_{i}^{T}Y_{33}-\hat{C}^{T}D_{i}^{T}C_{i} =0,\displaystyle=0, (26)
AiTY33+Y33Ai+CiTCi\displaystyle A_{i}^{T}Y_{33}+Y_{33}A_{i}+C_{i}^{T}C_{i} =0,\displaystyle=0, (27)
AiTξ3ξ3ATO3\displaystyle A_{i}^{T}\xi_{3}-\xi_{3}A^{T}-O_{3} =0,\displaystyle=0, (28)

wherein

O3=Y13TeAtdBBTY23TeA^tdB^BTY33E1BBTY33E2B^BT.\displaystyle O_{3}=Y_{13}^{T}e^{At_{d}}BB^{T}-Y_{23}^{T}e^{\hat{A}t_{d}}\hat{B}B^{T}-Y_{33}E_{1}BB^{T}-Y_{33}E_{2}\hat{B}B^{T}.

These variables will be used in deriving the partial derivative of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2} with respect to C^\hat{C}.

The following properties of trace will also be used repeatedly in the derivation of the optimality conditions:

  1. 1.

    Addition: trace(A1+A2+A3)=trace(A1)+trace(A2)+trace(A3)trace(A_{1}+A_{2}+A_{3})=trace(A_{1})+trace(A_{2})+trace(A_{3}).

  2. 2.

    Transpose: trace(A1T)=trace(A1)trace(A_{1}^{T})=trace(A_{1}).

  3. 3.

    Cyclic permutation: trace(A1A2A3)=trace(A3A1A2)=trace(A2A3A1)trace(A_{1}A_{2}A_{3})=trace(A_{3}A_{1}A_{2})=trace(A_{2}A_{3}A_{1}).

  4. 4.

    Partial derivative: Δf(A)A=trace(A(f(A))(ΔA)T)\Delta_{f(A)}^{A}=trace\Big{(}\frac{\partial}{\partial A}\big{(}f(A)\big{)}(\Delta_{A})^{T}\Big{)} where Δf(A)A\Delta_{f(A)}^{A} is the first-order derivative of f(A)f(A) with respect to AA, and ΔA\Delta_{A} is the differential of AA,

cf. [71].

Lastly, the following lemma will also be used repeatedly in the derivation.

Lemma 3.3.

Let UU and VV satisfy the following Sylvester equations

XU+UY+W\displaystyle XU+UY+W =0,\displaystyle=0,
YV+VX+Z\displaystyle YV+VX+Z =0.\displaystyle=0.

Then trace(WV)=trace(ZU)trace(WV)=trace(ZU).

Proof.

See the proof of Lemma 4.1 in [72]. ∎

We are now in a position to state the necessary conditions, which a local optimum ROM (A^,B^,C^,D)(\hat{A},\hat{B},\hat{C},D) must satisfy, in the following theorem.

Theorem 3.4.

Suppose H^(s)\hat{H}(s) is a stable minimum phase system, and the matrix DD is full rank. Then, the local optimum (A^,B^,C^,D)(\hat{A},\hat{B},\hat{C},D) of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2} must satisfy the following optimality conditions

Q12TX12+Q22X22+ζ1\displaystyle Q_{12}^{T}X_{12}+Q_{22}X_{22}+\zeta_{1} =0,\displaystyle=0, (29)
Q12TB+Q22B^+ζ2\displaystyle Q_{12}^{T}B+Q_{22}\hat{B}+\zeta_{2} =0,\displaystyle=0, (30)
DiTDiCP12+DiTDiC^P22+ζ3\displaystyle-D_{i}^{T}D_{i}CP_{12}+D_{i}^{T}D_{i}\hat{C}P_{22}+\zeta_{3} =0,\displaystyle=0, (31)

wherein

ζ1\displaystyle\zeta_{1} =Q13TX13+2Q23X23+Q23X23T+Q33X33+tdeAiTtdCiTDiCeAtdX12\displaystyle=Q_{13}^{T}X_{13}+2Q_{23}X_{23}+Q_{23}X_{23}^{T}+Q_{33}X_{33}+t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}
+tdeAiTtdCiTCiE1X12tdeAiTtdCiTDiCeAtdX13tdeAiTtdCiTCiE1X13\displaystyle+t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{12}-t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{13}-t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{13}
tdeAiTtdCiTCieAitdX22+tdeAiTtdCiTCieAitdX23T+tdeAiTtdCiTCieAitdX23\displaystyle-t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{22}+t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}^{T}+t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}
tdeAiTtdCiTCieAitdX33+ξ1E1T2tdeAiTtdξ1CTBiT,\displaystyle-t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{33}+\xi_{1}E_{1}^{T}-2t_{d}e^{A_{i}^{T}t_{d}}\xi_{1}C^{T}B_{i}^{T},
ζ2\displaystyle\zeta_{2} =2Q13TX11CTDiT+Q23X22CiTQ23X12TCTDiT+Q13TX12CiT\displaystyle=-2Q_{13}^{T}X_{11}C^{T}D_{i}^{T}+Q_{23}X_{22}C_{i}^{T}-Q_{23}X_{12}^{T}C^{T}D_{i}^{T}+Q_{13}^{T}X_{12}C_{i}^{T}
Q33X13TCTDiTQ13TX13CiTQ23X23CiT+Q33X23TCiTQ33X33CiT\displaystyle-Q_{33}X_{13}^{T}C^{T}D_{i}^{T}-Q_{13}^{T}X_{13}C_{i}^{T}-Q_{23}X_{23}C_{i}^{T}+Q_{33}X_{23}^{T}C_{i}^{T}-Q_{33}X_{33}C_{i}^{T}
tdeAiTtdCiTDiCeAtdX12CiTtdeAiTtdCiTCiE1X12CiT\displaystyle-t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}C_{i}^{T}-t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{12}C_{i}^{T}
+tdeAiTtdCiTDiCeAtdX13CiT+tdeAiTtdCiTCiE1X13CiT\displaystyle+t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{13}C_{i}^{T}+t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{13}C_{i}^{T}
+tdeAiTtdCiTCieAitdX22CiTtdeAiTtdCiTCieAitdX23CiT\displaystyle+t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{22}C_{i}^{T}-t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}C_{i}^{T}
tdeAiTtdCiTCieAitdX23TCiT+tdeAiTtdCiTCieAitdX33CiT\displaystyle-t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}^{T}C_{i}^{T}+t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{33}C_{i}^{T}
ξ2E1TCiTξ2eATtdCTDiT+eAiTtdξ2CTDiT+tdeAiTtdξ2CTBiTCiT,\displaystyle-\xi_{2}E_{1}^{T}C_{i}^{T}-\xi_{2}e^{A^{T}t_{d}}C^{T}D_{i}^{T}+e^{A_{i}^{T}t_{d}}\xi_{2}C^{T}D_{i}^{T}+t_{d}e^{A_{i}^{T}t_{d}}\xi_{2}C^{T}B_{i}^{T}C_{i}^{T},
ζ3\displaystyle\zeta_{3} =DiTDiCP13DiTDiC^P23DiTCiP23T+DiTCiP33+BiTY13TP13\displaystyle=D_{i}^{T}D_{i}CP_{13}-D_{i}^{T}D_{i}\hat{C}P_{23}-D_{i}^{T}C_{i}P_{23}^{T}+D_{i}^{T}C_{i}P_{33}+B_{i}^{T}Y_{13}^{T}P_{13}
+2BiTY13TP12+BiTY23P23BiTY23P22BiTY33P23T\displaystyle+2B_{i}^{T}Y_{13}^{T}P_{12}+B_{i}^{T}Y_{23}P_{23}-B_{i}^{T}Y_{23}P_{22}-B_{i}^{T}Y_{33}P_{23}^{T}
+tdBiTeAiTtdY13TeAtdBB^T+tdBiTeAiTtdY23eA^tdB^B^T+tdBiTeAiTtdY33E2B^B^T\displaystyle+t_{d}B_{i}^{T}e^{A_{i}^{T}t_{d}}Y_{13}^{T}e^{At_{d}}B\hat{B}^{T}+t_{d}B_{i}^{T}e^{A_{i}^{T}t_{d}}Y_{23}e^{\hat{A}t_{d}}\hat{B}\hat{B}^{T}+t_{d}B_{i}^{T}e^{A_{i}^{T}t_{d}}Y_{33}E_{2}\hat{B}\hat{B}^{T}
+tdBiTeAiTtdY33E1BB^TtdBiTeAiTtdξ3C^TBiT+BiTξ3E1T.\displaystyle+t_{d}B_{i}^{T}e^{A_{i}^{T}t_{d}}Y_{33}E_{1}B\hat{B}^{T}-t_{d}B_{i}^{T}e^{A_{i}^{T}t_{d}}\xi_{3}\hat{C}^{T}B_{i}^{T}+B_{i}^{T}\xi_{3}E_{1}^{T}.
Proof.

The proof is long and tedious and can be found in the appendix given at the end. ∎

3.3 An Oblique Projection Algorithm

In this subsection, we sketch an oblique projection algorithm from the optimality conditions derived in the last subsection. Some numerical and computational issues related to the applicability of the algorithm are also discussed.

3.3.1 Limitation in the Oblique Projection Framework

Let us assume that P22P_{22} and Q22Q_{22} are invertible. Then, the optimal choices of B^\hat{B} and C^\hat{C} are given as

B^\displaystyle\hat{B} =Q221Q12TBQ221ξ2,\displaystyle=-Q_{22}^{-1}Q_{12}^{T}B-Q_{22}^{-1}\xi_{2}, C^\displaystyle\hat{C} =CP12P221(DiTDi)1ξ3P221.\displaystyle=CP_{12}P_{22}^{-1}-(D_{i}^{T}D_{i})^{-1}\xi_{3}P_{22}^{-1}.

If the ROM is obtained via oblique projection, this suggests selecting V^\hat{V} and W^\hat{W} as V^=P12P221\hat{V}=P_{12}P_{22}^{-1} and W^=Q12Q221\hat{W}=-Q_{12}Q_{22}^{-1}, respectively. Due to the oblique projection condition W^TV^=I\hat{W}^{T}\hat{V}=I, we get

Q12TP12+Q22P22=0.\displaystyle Q_{12}^{T}P_{12}+Q_{22}P_{22}=0.

To compute the deviation in the satisfaction of optimality conditions with this choice of reduction subspaces, let us express the optimality condition (29) a bit differently. Note that

X12\displaystyle X_{12} =P12+eAtdX12eA^Ttd\displaystyle=P_{12}+e^{At_{d}}X_{12}e^{\hat{A}^{T}t_{d}} and X22\displaystyle X_{22} =P22+eA^tdX22eA^Ttd,\displaystyle=P_{22}+e^{\hat{A}t_{d}}X_{22}e^{\hat{A}^{T}t_{d}},

cf. [65, 63]. Thus

Q12TX12+Q22X22\displaystyle Q_{12}^{T}X_{12}+Q_{22}X_{22} =Q12TP12+Q22P22+Q12TeAtdX12eA^Ttd+Q22eA^tdX22eA^Ttd.\displaystyle=Q_{12}^{T}P_{12}+Q_{22}P_{22}+Q_{12}^{T}e^{At_{d}}X_{12}e^{\hat{A}^{T}t_{d}}+Q_{22}e^{\hat{A}t_{d}}X_{22}e^{\hat{A}^{T}t_{d}}.

Then, the optimality condition (29) can be rewritten as

Q12TP12+Q22P22+ξ1¯\displaystyle Q_{12}^{T}P_{12}+Q_{22}P_{22}+\bar{\xi_{1}} =0\displaystyle=0

where

ξ1¯=ξ1+Q12TeAtdX12eA^Ttd+Q22eA^tdX22eA^Ttd.\displaystyle\bar{\xi_{1}}=\xi_{1}+Q_{12}^{T}e^{At_{d}}X_{12}e^{\hat{A}^{T}t_{d}}+Q_{22}e^{\hat{A}t_{d}}X_{22}e^{\hat{A}^{T}t_{d}}.

It is now clear that by selecting the reduction matrices as V^=P12P221\hat{V}=P_{12}P^{-1}_{22} and W^=Q12Q221\hat{W}=-Q_{12}Q^{-1}_{22}, deviations in the satisfaction of the optimality conditions (29), (30), and (31) are given by ξ1¯\bar{\xi_{1}}, ξ2\xi_{2}, and ξ3\xi_{3}, respectively. In general, ξ1¯0\bar{\xi_{1}}\neq 0, ξ20\xi_{2}\neq 0, and ξ30\xi_{3}\neq 0. Thus it is inherently not possible to obtain a local optimum of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2} in an oblique projection framework. Nevertheless, the reduction matrices V^=P12P221\hat{V}=P_{12}P^{-1}_{22} and W^=Q12Q221\hat{W}=-Q_{12}Q^{-1}_{22} target the optimality conditions and seek to construct a local optimum.

3.3.2 Algorithm

The appropriate reduction matrices for the problem under consideration have so far been identified from the optimality conditions. We now proceed in sketching an algorithm with this choice of reduction matrices. The matrices P12P_{12}, P22P_{22}, Q12Q_{12}, and Q22Q_{22} depend on the ROM, which is unknown. Thus an iterative solution seems inevitable. Note that V^=P12\hat{V}=P_{12} and W^=Q12\hat{W}=Q_{12} span the same subspace, as P221P^{-1}_{22} and Q221-Q^{-1}_{22} only change the basis, cf. [35]. The condition on invertibility of P22P_{22} and Q22Q_{22} in every iteration can be restrictive and cause numerical ill-conditioning or even failure. Therefore, V^=P12\hat{V}=P_{12} and W^=Q12\hat{W}=Q_{12} seem a better choice for the reduction matrices. Further, note that (A^,B^,C^)(\hat{A},\hat{B},\hat{C}) and (V^,W^)(\hat{V},\hat{W}) can be seen as two coupled systems, i.e.,

f(V^,W^)\displaystyle f(\hat{V},\hat{W}) =(A^,B^,C^)\displaystyle=(\hat{A},\hat{B},\hat{C}) and g(A^,B^,C^)\displaystyle g(\hat{A},\hat{B},\hat{C}) =(V^,W^).\displaystyle=(\hat{V},\hat{W}).

The problem of computing the ROM (A^,B^,C^)(\hat{A},\hat{B},\hat{C}) can be seen as a stationary point problem, i.e.,

(A^,B^,C^)=f(g(A^,B^,C^))\displaystyle(\hat{A},\hat{B},\hat{C})=f\big{(}g(\hat{A},\hat{B},\hat{C})\big{)}

with an additional constraint that W^TV^=I\hat{W}^{T}\hat{V}=I. Starting with an arbitrary guess of (A^,B^,C^)(\hat{A},\hat{B},\hat{C}), the process can be continued until convergence to obtain the stationary points. There are still two issues to be addressed before sketching a stationary point solution algorithm. Note that it is so far assumed that the matrix DD is full rank, which essentially ensures D1D^{-1} exists. If the matrix DD is not full rank, we can use a remedy employed in BST, i.e., the so-called ϵ\epsilon-regularization. The rank deficient DD can be replaced with D=ϵID=\epsilon I, wherein ϵ\epsilon is a small positive number enough to ensure the invertibility of DD [73]. In the final ROM, the original DD is plugged back in. This is an effective approach and is used in MATLAB’s built-in function ``bst()"``bst()" for BST. The second assumption we used is that a stable realization of H^1(s)\hat{H}^{-1}(s) exists because H^(s)\hat{H}(s) is a stable minimum phase system. Ensuring this condition in every iteration can be a serious limitation in the applicability of the proposed stationary point iteration algorithm. A solution to this problem is discussed in the sequel.

The classical H2H_{2} norm of Δrel(s)\Delta_{rel}(s) can be expressed in the frequency domain as the following

Δrel(s)2\displaystyle||\Delta_{rel}(s)||_{\mathcal{H}_{2}} =12πtrace(Δrel(jω)Δrel(jω))𝑑ω\displaystyle=\sqrt{\frac{1}{2\pi}\int_{-\infty}^{\infty}trace\Big{(}\Delta_{rel}^{*}(j\omega)\Delta_{rel}(j\omega)\Big{)}d\omega}
=12πtrace(Δadd(jω)H^(jω)H^1(jω)Δadd(jω))𝑑ω.\displaystyle=\sqrt{\frac{1}{2\pi}\int_{-\infty}^{\infty}trace\Big{(}\Delta_{add}^{*}(j\omega)\hat{H}^{-*}(j\omega)\hat{H}^{-1}(j\omega)\Delta_{add}(j\omega)\Big{)}d\omega}.

Let G^(s)\hat{G}^{*}(s) be a minimum phase right spectral factor of H^(s)H^(s)\hat{H}^{*}(s)\hat{H}(s), such that G^(s)G^(s)=H^(s)H^(s)\hat{G}(s)\hat{G}^{*}(s)=\hat{H}^{*}(s)\hat{H}(s). Then Δrel(s)2||\Delta_{rel}(s)||_{\mathcal{H}_{2}} can be rewritten as

Δrel(s)2\displaystyle||\Delta_{rel}(s)||_{\mathcal{H}_{2}} =12πtrace(Δadd(jω)G^1(jω)G^(jω)Δadd(jω))𝑑ω\displaystyle=\sqrt{\frac{1}{2\pi}\int_{-\infty}^{\infty}trace\Big{(}\Delta_{add}^{*}(j\omega)\hat{G}^{-1}(j\omega)\hat{G}^{-*}(j\omega)\Delta_{add}(j\omega)\Big{)}d\omega}
=12πtrace(Δ~rel(jω)Δ~rel(jω))𝑑ω.\displaystyle=\sqrt{\frac{1}{2\pi}\int_{-\infty}^{\infty}trace\Big{(}\tilde{\Delta}_{rel}^{*}(j\omega)\tilde{\Delta}_{rel}(j\omega)\Big{)}d\omega}.

Let us denote the impulse response of Δ~rel(s)\tilde{\Delta}_{rel}(s) as h~rel(t)\tilde{h}_{rel}(t). Then Δrel(s)2||\Delta_{rel}(s)||_{\mathcal{H}_{2}} can be represented in the time domain as

Δrel(s)2\displaystyle||\Delta_{rel}(s)||_{\mathcal{H}_{2}} =0trace(h~rel(τ)h~relT(τ))𝑑τ.\displaystyle=\sqrt{\int_{0}^{\infty}trace\Big{(}\tilde{h}_{rel}(\tau)\tilde{h}_{rel}^{T}(\tau)\Big{)}d\tau}.

Finally, Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}} can be written as

Δrel(s)2,τ\displaystyle||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}} =0tdtrace(h~rel(τ)h~relT(τ))𝑑τ.\displaystyle=\sqrt{\int_{0}^{t_{d}}trace\Big{(}\tilde{h}_{rel}(\tau)\tilde{h}_{rel}^{T}(\tau)\Big{)}d\tau}.

It can readily be noted that H^1(s)\hat{H}^{-1}(s) can be replaced with G^(s)\hat{G}^{-*}(s) without affecting Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}. The advantage, however, is that Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}} can be defined even when H^(s)\hat{H}(s) is not a stable minimum phase system, i.e., by replacing H^1(s)\hat{H}^{-1}(s) with a stable G^(s)\hat{G}^{-*}(s). Next, we construct a state-space realization of G^(s)\hat{G}^{-*}(s) on similar lines with BST, wherein a similar state-space realization is constructed.

The observability gramian Q^\hat{Q} of the pair (A^,C^)(\hat{A},\hat{C}) solves the following Lyapunov equation

A^TQ^+Q^A^+C^TC^=0.\displaystyle\hat{A}^{T}\hat{Q}+\hat{Q}\hat{A}+\hat{C}^{T}\hat{C}=0.

Let us define B^s\hat{B}_{s} and A^s\hat{A}_{s} as the following

B^s\displaystyle\hat{B}_{s} =Q^B^C^TD,\displaystyle=-\hat{Q}\hat{B}-\hat{C}^{T}D, A^s\displaystyle\hat{A}_{s} =A^B^(DTD)1B^sT.\displaystyle=-\hat{A}-\hat{B}(D^{T}D)^{-1}\hat{B}_{s}^{T}.

Let X^s\hat{X}_{s} solves the following Riccati equation

A^sX^s+X^sA^sT+X^sB^s(DTD)1B^sTX^s+B^(DTD)1B^T=0.\displaystyle\hat{A}_{s}\hat{X}_{s}+\hat{X}_{s}\hat{A}_{s}^{T}+\hat{X}_{s}\hat{B}_{s}(D^{T}D)^{-1}\hat{B}_{s}^{T}\hat{X}_{s}+\hat{B}(D^{T}D)^{-1}\hat{B}^{T}=0. (32)

Then, a minimum phase realization of G^(s)\hat{G}^{*}(s) is given as

A^x\displaystyle\hat{A}_{x} =A^T,\displaystyle=-\hat{A}^{T}, B^x\displaystyle\hat{B}_{x} =B^s,\displaystyle=\hat{B}_{s}, C^x\displaystyle\hat{C}_{x} =DT(B^TB^sTX^s),\displaystyle=D^{-T}(\hat{B}^{T}-\hat{B}_{s}^{T}\hat{X}_{s}), D^x=D,\displaystyle\hat{D}_{x}=D, (33)

cf. [68]. Further, a stable realization of G^(s)\hat{G}^{-*}(s) is given as

A^xi\displaystyle\hat{A}_{xi} =A^xB^xD^x1C^x,\displaystyle=\hat{A}_{x}-\hat{B}_{x}\hat{D}_{x}^{-1}\hat{C}_{x}, B^xi\displaystyle\hat{B}_{xi} =B^xD^x1,\displaystyle=-\hat{B}_{x}\hat{D}_{x}^{-1}, C^xi\displaystyle\hat{C}_{xi} =D^x1C^x,\displaystyle=\hat{D}_{x}^{-1}\hat{C}_{x}, D^xi\displaystyle\hat{D}_{xi} =D^x1,\displaystyle=\hat{D}_{x}^{-1}, (34)

cf. [67, 68].

We are now in a position to state the algorithm. The proposed algorithm is named “Time-limited Relative-error 2\mathcal{H}_{2} MOR Algorithm (TLRHMORA)”. The pseudo-code of TLRHMORA is given in Algorithm 1. Step 1 replaces DD with a full rank matrix if DD is rank deficient. Steps 3-4 compute a stable realization of G^(s)\hat{G}^{-*}(s). Steps 5-7 compute the reduction matrices. Steps 8-12 ensure the oblique projection condition W^TV^=I\hat{W}^{T}\hat{V}=I using the bi-orthogonal Gram-Schmidt method, cf. [36]. Step 13 updates the ROM.

Input: Original system: (A,B,C,D)(A,B,C,D); Desired time: tdt_{d} sec; Initial guess: (A^,B^,C^)(\hat{A},\hat{B},\hat{C}); Allowable iteration: kmaxk_{max}.
Output: ROM (A^,B^,C^)(\hat{A},\hat{B},\hat{C}).

1:  if (rank[D]<mrank[D]<m) D=ϵID=\epsilon I end if
2:  k=0k=0, while (not converged) do k=k+1k=k+1.
3:  Solve the equation (32) to compute X^s\hat{X}_{s}.
4:  Compute (A^xi,B^xi,C^xi,D^xi)(\hat{A}_{xi},\hat{B}_{xi},\hat{C}_{xi},\hat{D}_{xi}) from (33) and (34).
5:  Solve the equation (1) to compute P12P_{12}.
6:  Solve the following equations to compute E1E_{1} and E2E_{2}A^xiE1E1A+B^xiCeAtdeA^xitdB^xiC=0\hat{A}_{xi}E_{1}-E_{1}A+\hat{B}_{xi}Ce^{At_{d}}-e^{\hat{A}_{xi}t_{d}}\hat{B}_{xi}C=0,A^xiE2E2A^+B^xiC^eA^tdeA^xitdB^xiC^=0\hat{A}_{xi}E_{2}-E_{2}\hat{A}+\hat{B}_{xi}\hat{C}e^{\hat{A}t_{d}}-e^{\hat{A}_{xi}t_{d}}\hat{B}_{xi}\hat{C}=0.
7:  Compute Q~33\tilde{Q}_{33}, Q~13\tilde{Q}_{13}, Q~23\tilde{Q}_{23}, and Q~12\tilde{Q}_{12} by solvingA^xiTQ~33+Q~33A^xi+C^xiTC^xieAxiTtdC^xiTC^xieAxitd=0\hat{A}_{xi}^{T}\tilde{Q}_{33}+\tilde{Q}_{33}\hat{A}_{xi}+\hat{C}_{xi}^{T}\hat{C}_{xi}-e^{A_{xi}^{T}t_{d}}\hat{C}_{xi}^{T}\hat{C}_{xi}e^{A_{xi}t_{d}}=0,ATQ~13+Q~13A^xi+CTB^xiTQ~33+CTD^xiTC^xieATtdCTDxiTCxieAxitdE1TCxiTCxieAxitd=0A^{T}\tilde{Q}_{13}+\tilde{Q}_{13}\hat{A}_{xi}+C^{T}\hat{B}_{xi}^{T}\tilde{Q}_{33}+C^{T}\hat{D}_{xi}^{T}\hat{C}_{xi}-e^{A^{T}t_{d}}C^{T}D_{xi}^{T}C_{xi}e^{A_{xi}t_{d}}-E_{1}^{T}C_{xi}^{T}C_{xi}e^{A_{xi}t_{d}}=0,A^TQ~23+Q~23A^xiC^TB^xiTQ~33+C^TD^xiTC^xieA^TtdC^TDxiTCxieAxitdE2TCxiTCxieAxitd=0\hat{A}^{T}\tilde{Q}_{23}+\tilde{Q}_{23}\hat{A}_{xi}-\hat{C}^{T}\hat{B}_{xi}^{T}\tilde{Q}_{33}+\hat{C}^{T}\hat{D}_{xi}^{T}\hat{C}_{xi}-e^{\hat{A}^{T}t_{d}}\hat{C}^{T}D_{xi}^{T}C_{xi}e^{A_{xi}t_{d}}-E_{2}^{T}C_{xi}^{T}C_{xi}e^{A_{xi}t_{d}}=0,ATQ~12+Q~12A^+CTB^xiTQ~23TQ~13B^xiC^CTD^xiTD^xiC^+eATtdCTDxiTDxiC^eA^td+E1TCxiTDxiC^eA^tdeATtdCTDxiTCxiE2E1TCxiTCxiE2=0A^{T}\tilde{Q}_{12}+\tilde{Q}_{12}\hat{A}+C^{T}\hat{B}_{xi}^{T}\tilde{Q}_{23}^{T}-\tilde{Q}_{13}\hat{B}_{xi}\hat{C}-C^{T}\hat{D}_{xi}^{T}\hat{D}_{xi}\hat{C}+e^{A^{T}t_{d}}C^{T}D_{xi}^{T}D_{xi}\hat{C}e^{\hat{A}t_{d}}+E_{1}^{T}C_{xi}^{T}D_{xi}\hat{C}e^{\hat{A}t_{d}}-e^{A^{T}t_{d}}C^{T}D_{xi}^{T}C_{xi}E_{2}-E_{1}^{T}C_{xi}^{T}C_{xi}E_{2}=0.
8:  for i=1,,ri=1,\ldots,r do
9:  v=P12(:,i)v=P_{12}(:,i), v=k=1i(IP12(:,k)Q~12(:,k)T)vv=\prod_{k=1}^{i}\big{(}I-P_{12}(:,k)\tilde{Q}_{12}(:,k)^{T}\big{)}v.
10:  w=Q~12(:,i)w=\tilde{Q}_{12}(:,i), w=k=1i(IQ~12(:,k)P12(:,k)T)ww=\prod_{k=1}^{i}\big{(}I-\tilde{Q}_{12}(:,k)P_{12}(:,k)^{T}\big{)}w.
11:  v=vv2v=\frac{v}{||v||_{2}}, w=ww2w=\frac{w}{||w||_{2}}, v=vwTvv=\frac{v}{w^{T}v}, V^(:,i)=v\hat{V}(:,i)=v, W^(:,i)=w\hat{W}(:,i)=w.
12:  end for
13:  A^=W^TAV^\hat{A}=\hat{W}^{T}A\hat{V}, B^=W^TB\hat{B}=\hat{W}^{T}B, C^=CV^\hat{C}=C\hat{V}.
14:  if (k=kmaxk=k_{max}) Break loop end if
15:  end while
Algorithm 1 TLRHMORA

3.3.3 General Time Interval

So far, we restricted ourselves for simplicity to the case when the desired time interval starts from 0 sec. The problem can be generalized to any time interval [t1,t2][t_{1},t_{2}] sec by making a few changes. Note that for the general time interval case, PrelP_{rel} and QrelQ_{rel} solve the following Lyapunov equations

ArelPrel+PrelArelT+eArelt1BrelBrelTeArelTt1eArelt2BrelBrelTeArelTt2\displaystyle A_{rel}P_{rel}+P_{rel}A_{rel}^{T}+e^{A_{rel}t_{1}}B_{rel}B_{rel}^{T}e^{A_{rel}^{T}t_{1}}-e^{A_{rel}t_{2}}B_{rel}B_{rel}^{T}e^{A_{rel}^{T}t_{2}} =0,\displaystyle=0,
ArelTQrel+QrelArel+eArelTt1CrelTCreleArelt1eArelTt2CrelTCreleArelt2\displaystyle A_{rel}^{T}Q_{rel}+Q_{rel}A_{rel}+e^{A_{rel}^{T}t_{1}}C_{rel}^{T}C_{rel}e^{A_{rel}t_{1}}-e^{A_{rel}^{T}t_{2}}C_{rel}^{T}C_{rel}e^{A_{rel}t_{2}} =0,\displaystyle=0,

cf. [49]. Accordingly, the reduction matrices in TLRHMORA for the general time interval case can be computed by solving the following equations in each iteration

AP12+P12A^T+eAt1BB^TeA^Tt1eAt2BB^TeA^Tt2\displaystyle AP_{12}+P_{12}\hat{A}^{T}+e^{At_{1}}B\hat{B}^{T}e^{\hat{A}^{T}t_{1}}-e^{At_{2}}B\hat{B}^{T}e^{\hat{A}^{T}t_{2}} =0,\displaystyle=0,
A^xiTQ~33+Q~33A^xi+eAxiTt1C^xiTC^xieAxit1eAxiTt2C^xiTC^xieAxit2\displaystyle\hat{A}_{xi}^{T}\tilde{Q}_{33}+\tilde{Q}_{33}\hat{A}_{xi}+e^{A_{xi}^{T}t_{1}}\hat{C}_{xi}^{T}\hat{C}_{xi}e^{A_{xi}t_{1}}-e^{A_{xi}^{T}t_{2}}\hat{C}_{xi}^{T}\hat{C}_{xi}e^{A_{xi}t_{2}} =0,\displaystyle=0,
ATQ~13+Q~13A^xi+CTB^xiTQ~33+eATt1CTDxiTCxieAxit1+E1,t1TCxiTCxieAxit1\displaystyle A^{T}\tilde{Q}_{13}+\tilde{Q}_{13}\hat{A}_{xi}+C^{T}\hat{B}_{xi}^{T}\tilde{Q}_{33}+e^{A^{T}t_{1}}C^{T}D_{xi}^{T}C_{xi}e^{A_{xi}t_{1}}+E_{1,t_{1}}^{T}C_{xi}^{T}C_{xi}e^{A_{xi}t_{1}}
eATt2CTDxiTCxieAxit2E1,t2TCxiTCxieAxit2\displaystyle-e^{A^{T}t_{2}}C^{T}D_{xi}^{T}C_{xi}e^{A_{xi}t_{2}}-E_{1,t_{2}}^{T}C_{xi}^{T}C_{xi}e^{A_{xi}t_{2}} =0,\displaystyle=0,
A^TQ~23+Q~23A^xiC^TB^xiTQ~33+eA^Tt1C^TDxiTCxieAxit1+E2,t1TCxiTCxieAxit1\displaystyle\hat{A}^{T}\tilde{Q}_{23}+\tilde{Q}_{23}\hat{A}_{xi}-\hat{C}^{T}\hat{B}_{xi}^{T}\tilde{Q}_{33}+e^{\hat{A}^{T}t_{1}}\hat{C}^{T}D_{xi}^{T}C_{xi}e^{A_{xi}t_{1}}+E_{2,t_{1}}^{T}C_{xi}^{T}C_{xi}e^{A_{xi}t_{1}}
eA^Tt2C^TDxiTCxieAxit2E2,t2TCxiTCxieAxit2\displaystyle-e^{\hat{A}^{T}t_{2}}\hat{C}^{T}D_{xi}^{T}C_{xi}e^{A_{xi}t_{2}}-E_{2,t_{2}}^{T}C_{xi}^{T}C_{xi}e^{A_{xi}t_{2}} =0,\displaystyle=0,
ATQ~12+Q~12A^+CTB^xiTQ~23TQ~13B^xiC^eATt1CTDxiTDxiC^eA^t1\displaystyle A^{T}\tilde{Q}_{12}+\tilde{Q}_{12}\hat{A}+C^{T}\hat{B}_{xi}^{T}\tilde{Q}_{23}^{T}-\tilde{Q}_{13}\hat{B}_{xi}\hat{C}-e^{A^{T}t_{1}}C^{T}D_{xi}^{T}D_{xi}\hat{C}e^{\hat{A}t_{1}}
E1,t1TCxiTDxiC^eA^t1+eATt1CTDxiTCxiE2,t1\displaystyle-E_{1,t_{1}}^{T}C_{xi}^{T}D_{xi}\hat{C}e^{\hat{A}t_{1}}+e^{A^{T}t_{1}}C^{T}D_{xi}^{T}C_{xi}E_{2,t_{1}}
+E1,t1TCxiTCxiE2,t1+eATt2CTDxiTDxiC^eA^t2\displaystyle+E_{1,t_{1}}^{T}C_{xi}^{T}C_{xi}E_{2,t_{1}}+e^{A^{T}t_{2}}C^{T}D_{xi}^{T}D_{xi}\hat{C}e^{\hat{A}t_{2}}
+E1,t2TCxiTDxiC^eA^t2eATt2CTDxiTCxiE2,t2\displaystyle+E_{1,t_{2}}^{T}C_{xi}^{T}D_{xi}\hat{C}e^{\hat{A}t_{2}}-e^{A^{T}t_{2}}C^{T}D_{xi}^{T}C_{xi}E_{2,t_{2}}
E1,t2TCxiTCxiE2,t2\displaystyle-E_{1,t_{2}}^{T}C_{xi}^{T}C_{xi}E_{2,t_{2}} =0,\displaystyle=0,

wherein E1,tE_{1,t} and E2,tE_{2,t} can be computed by solving the following Sylvester equations

AxiE1,tE1,tA+BxiCeAteAxitBxiC\displaystyle A_{xi}E_{1,t}-E_{1,t}A+B_{xi}Ce^{At}-e^{A_{xi}t}B_{xi}C =0,\displaystyle=0,
AxiE2,tE2,tA^+BxiC^eA^teAxitBxiC^\displaystyle A_{xi}E_{2,t}-E_{2,t}\hat{A}+B_{xi}\hat{C}e^{\hat{A}t}-e^{A_{xi}t}B_{xi}\hat{C} =0.\displaystyle=0.

3.3.4 Stopping Criterion

Most of the 2\mathcal{H}_{2} MOR algorithms are iterative methods with no guarantee of convergence. The same is the case with TLRHMORA. Further, it has been already discussed that even if TLRHMORA converges, it generally cannot achieve a local optimum due to the inherent construction of the oblique projection framework. Therefore, it is reasonable to stop the algorithm prematurely if it does not converge within an allowable number of iterations. This stopping criterion can help in keeping the computational cost in check, especially in a large-scale setting.

3.3.5 Selection of the Initial Guess

Iterative methods generally improve as the number of iterations progresses. However, a good initial guess can significantly improve the final outcome in terms of accuracy and convergence. It is established in [74, 75, 76] that to ensure a small weighted additive error in the 2\mathcal{H}_{2} norm, the initial guess should have the dominant poles (with large residues) of the original model and the frequency weight. By following this guideline, the initial guess for TLRHMORA should have dominant poles and/or zeros of the original model. Such an initial guess can be constructed by using computationally efficient algorithms proposed in [77] and [78]. Although there is no guarantee that the final ROM will preserve these poles and zeros, it ensures that Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}} is small to start with.

3.3.6 Computational Aspects

The computation of X^s\hat{X}_{s} and (A^xi,B^xi,C^xi,D^xi)(\hat{A}_{xi},\hat{B}_{xi},\hat{C}_{xi},\hat{D}_{xi}) can be done cheaply, as it only involves small matrices. Similarly, the matrix exponentials eA^te^{\hat{A}t}, eA^xite^{\hat{A}_{xi}t}, and E2E_{2} can also be computed cheaply, as it involves small matrices. Further, the computation of Q~33\tilde{Q}_{33} and Q~23\tilde{Q}_{23} is again cheap owing to the small-sized matrices involved. However, the matrix exponential eAte^{At} can be a computational challenge if AA is a large-scale matrix. Note that we need this matrix exponential’s products eAtBe^{At}B and CeAtCe^{At} for the computation of P12P_{12}, E1E_{1}, Q~13\tilde{Q}_{13}, and Q~12\tilde{Q}_{12}. In [53], computationally efficient Krylov subspace-based projection methods are proposed that can accurately approximate eAtBe^{At}B and CeAtCe^{At} in a large-scale setting as V^eA^tB^\hat{V}e^{\hat{A}t}\hat{B} and C^eA^tW^T\hat{C}e^{\hat{A}t}\hat{W}^{T}, respectively. When AA is a large-scale matrix, these methods can be used to keep TLRHMORA computationally viable.

Once the matrix exponential’s products eAtBe^{At}B and CeAtCe^{At} are computed, the most computationally demanding step in TLRHOMRA is to compute P12P_{12}, E1E_{1}, Q~13\tilde{Q}_{13}, and Q~12\tilde{Q}_{12} by solving their respective Sylvester equations. These Sylvester equations have the following special structure

KJ+JL+MN=0,\displaystyle KJ+JL+MN=0,

wherein Kn×nK\in\mathbb{R}^{n\times n}, Lr×rL\in\mathbb{R}^{r\times r}, Mn×qM\in\mathbb{R}^{n\times q}, Nq×rN\in\mathbb{R}^{q\times r}, and qnq\ll n. The big matrices KK and MM in this type of Sylvester equation are sparse because the mathematically modeling of most modern-day systems ends up in a large-scale but sparse state-space model. The small matrices LL and NN are dense; hence, giving this equation the name “sparse-dense Sylvester equation”. It frequently arises in 2\mathcal{H}_{2} MOR algorithms, and an efficient solution for this type of Sylvester equation is proposed in [36]. It is highlighted in [36] that the computational time required to solve this equation can be reduced further by using sparse linear matrix solvers, like [79] and [80]. Further, it is noted in [30] that the computational cost of solving a “sparse-dense Sylvester equation” is almost the same as that of constructing a rational Krylov subspace, which is well known to be computationally efficient in a large-scale setting; also see [81]. In short, the solution of a “sparse-dense Sylvester equation” is viable even in a large-scale setting due to the availability of efficient solvers in the literature. The remaining steps in TLRHMORA only involve simple operations that can be executed cheaply. Moreover, by using a maximum number of allowable iterations as a stopping criterion, the number of times these Sylvester equations need to be solved can be limited in case TLRHMORA does not converge quickly.

To conclude, by approximating the matrix exponential’s products eAtBe^{{A}t}B and CeAtCe^{{A}t} using the Krylov subspace methods proposed in [53] and solving the “sparse-dense Sylvester equation” using the efficient solver proposed in [36], TLRHMORA remains a computationally tractable algorithm in a large-scale setting.

4 Numerical Results

In this section, TLRHMORA is applied to three models taken from the benchmark collection for testing MOR algorithms, cf. [82], and its performance is compared with TLBT, TLBST, and TLIRKA. The first two models are SISO systems, while the third model is a MIMO system. All the tests are performed using MATLAB R2016a on a laptop with 16GB memory and a 2GHZ Intel i7 processor. All the Lyapunov and Sylvester equations are solved using MATLAB’s “lyap” command. The Riccati equations are solved using MATLAB’s “care” command. The matrix exponentials are computed using MATLAB’s “expm()” command. The maximum number of iterations in TLRHMORA is set to 50. Since all the models considered in this section have zero DD-matrix, it is replaced with D=0.0001ID=0.0001I in TLBST and TLRHMORA. For fairness, TLIRKA and TLRHMORA are initialized with the same arbitrary initial guess. The desired time interval is chosen arbitrarily for demonstration purposes.

4.1 Clamped Beam

The clamped beam model is a 348th348^{th} order SISO system taken from the benchmark collection of [82]. The desired time interval in this example is selected as [0,0.5][0,0.5] sec for demonstration purposes. ROMs of orders 5105-10 are generated using TLBT, TLBST, TLIRKA, and TLRHMORA. The relative errors Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}} are tabulated in Table 1, and it can be seen that TLRHMORA offers supreme accuracy.

Table 1: Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}
Order TLBT TLBST TLIRKA TLRHMORA
5 56.6676 43.4164 45.0637 22.7069
6 56.3121 75.4351 38.5722 9.4143
7 15.6814 10.4421 14.6589 5.5747
8 5.2759 26.6056 4.5883 2.0863
9 6.3876 17.7543 7.1781 1.9531
10 0.5775 1.2804 0.5691 0.5256

To visually analyze what these numbers mean in terms of accuracy, the impulse responses of Δadd(s)\Delta_{add}(s) for 6th6^{th} order ROMs within the desired time interval [0,0.5][0,0.5] sec are plotted in Figure 1.

Refer to caption
Figure 1: Impulse response of Δadd(s)\Delta_{add}(s) within [0,0.5][0,0.5] sec

It can be seen that TLRHMORA offers the best performance. The highest error in this case is incurred by TLBST, and the impulse response plot corresponds accordingly to the numerical error given in Table 1. TLBT, TLIRKA, and TLRHMORA, in this case, have offered comparable accuracy for the most part of the desired time interval. However, TLBT and TLIRKA have incurred large numerical errors near 0 sec. This is the reason why Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}} for TLBT and TIRKA is significantly larger than that for TLRHMORA. Note, TLIRKA and TLRHMORA are both 2,τ\mathcal{H}_{2,\tau} MOR algorithms but with different cost functions, i.e., TLIRKA minimizes Δadd(s)2,τ||\Delta_{add}(s)||_{\mathcal{H}_{2,\tau}}, while TLRHMORA minimizes Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}. Although both algorithms are initialized with the same initial guess, it can be seen in Table 1 and Figure 1 that TLRHMORA ensures superior accuracy due to its cost function.

4.2 Artificial Dynamical System

The artificial dynamical model is a 1006th1006^{th} order SISO system taken from the benchmark collection of [82]. The desired time interval in this example is selected as [0,1][0,1] sec for demonstration purposes. ROMs of orders 111511-15 are generated using TLBT, TLBST, TLIRKA, and TLRHMORA. The relative errors Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}} are tabulated in Table 2, and it can be seen that TLRHMORA offers supreme accuracy.

Table 2: Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}
Order TLBT TLBST TLIRKA TLRHMORA
11 0.8117 0.1098 0.5251 0.0488
12 0.2270 0.2858 0.1885 0.0511
13 0.2250 4.9610 4.5675 0.0317
14 0.1760 0.2748 2.6495 0.0218
15 0.1392 0.1342 0.8801 0.0188

To visually analyze what these numbers mean in terms of accuracy, the impulse responses of Δadd(s)\Delta_{add}(s) for 11th11^{th} order ROMs within the desired time interval [0,1][0,1] sec are plotted in Figure 2.

Refer to caption
Figure 2: Impulse response of Δadd(s)\Delta_{add}(s) within [0,1][0,1] sec

It can be seen that TLRHMORA offers the best performance. TLBT, TLBST, TLIRKA, and TLRHMORA, in this case, have offered comparable accuracy for the most part of the desired time interval. However, TLBT, TLBST, and TLIRKA have incurred large numerical errors near 0 sec, which can be seen in Figure 3 (a magnified version of Figure 2).

Refer to caption
Figure 3: Impulse response of Δadd(s)\Delta_{add}(s) within [0,0.01][0,0.01] sec

This is the reason why Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}} for TLBT, TLBST, and TIRKA is significantly larger than that for TLRHMORA.

4.3 International Space Station

The international space station model is a 270th270^{th} order MIMO system taken from the benchmark collection of [82]. It has three inputs and three outputs. The desired time interval in this example is selected as [0,2][0,2] sec for demonstration purposes. ROMs of orders 595-9 are generated using TLBT, TLBST, TLIRKA, and TLRHMORA. The relative errors Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}} are tabulated in Table 3, and it can be seen that TLRHMORA offers supreme accuracy.

Table 3: Δrel(s)2,τ||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}
Order TLBT TLBST TLIRKA TLRHMORA
5 2.0787 2.1599 2.4186 1.9615
6 2.0865 1.9597 1.9485 1.9367
7 2.0040 1.9317 2.5756 1.5787
8 1.6106 1.8876 1.8760 1.5636
9 1.6166 1.8859 1.9060 1.0615

There are nine impulse response plots of Δadd(s)\Delta_{add}(s) for this example. Therefore, we refrain from showing these for the economy of space.

5 Conclusion

The problem of time-limited 2\mathcal{H}_{2} MOR based on relative error expression is considered. The necessary conditions for a local optimum are derived for the case when the ROM is a stable minimum phase system. Based on these conditions, the reduction matrices for an oblique projection algorithm are identified. Then, a generalized iterative algorithm, which does not require the ROM to be a stable minimum phase system in every iteration, is proposed. Unlike TLBST, the proposed algorithm does not require the solutions of large-scale Lyapunov and Riccati equations. Instead, solutions of small-scale Lyapunov and Riccati equations are required that can be computed cheaply. The numerical results confirm that the proposed algorithm is accurate and offers high fidelity.

Appendix

Let us define the cost function JJ as J=Δrel2,τ2J=||\Delta_{rel}||_{\mathcal{H}_{2,\tau}}^{2} and denote the first-order derivatives of Q11Q_{11}, Q12Q_{12}, Q13Q_{13}, Q22Q_{22}, Q23Q_{23}, Q33Q_{33}, E1E_{1}, and JJ with respect to A^\hat{A} as ΔQ11A^\Delta_{Q_{11}}^{\hat{A}}, ΔQ12A^\Delta_{Q_{12}}^{\hat{A}}, ΔQ13A^\Delta_{Q_{13}}^{\hat{A}}, ΔQ22A^\Delta_{Q_{22}}^{\hat{A}}, ΔQ23A^\Delta_{Q_{23}}^{\hat{A}}, ΔQ33A^\Delta_{Q_{33}}^{\hat{A}}, ΔE1A^\Delta_{E_{1}}^{\hat{A}}, and ΔJA^\Delta_{J}^{\hat{A}}, respectively. Further, let us denote the differential of A^\hat{A} as ΔA^\Delta_{\hat{A}}. By differentiating JJ with respect to A^\hat{A}, we get

ΔJA^\displaystyle\Delta_{J}^{\hat{A}} =trace(BTΔQ11A^B+2BTΔQ12A^B^+B^TΔQ22A^B^)\displaystyle=trace(B^{T}\Delta_{Q_{11}}^{\hat{A}}B+2B^{T}\Delta_{Q_{12}}^{\hat{A}}\hat{B}+\hat{B}^{T}\Delta_{Q_{22}}^{\hat{A}}\hat{B})
=trace(BBTΔQ11A^+2BB^T(ΔQ12A^)T+B^B^TΔQ22A^).\displaystyle=trace(BB^{T}\Delta_{Q_{11}}^{\hat{A}}+2B\hat{B}^{T}(\Delta_{Q_{12}}^{\hat{A}})^{T}+\hat{B}\hat{B}^{T}\Delta_{Q_{22}}^{\hat{A}}).

By differentiating the equations (11), (12), and (14) with respect to A^\hat{A}, we get

ATΔQ11A^+ΔQ11A^A+R1\displaystyle A^{T}\Delta_{Q_{11}}^{\hat{A}}+\Delta_{Q_{11}}^{\hat{A}}A+R_{1} =0,\displaystyle=0,\hskip 56.9055pt (35)
ATΔQ12A^+ΔQ12A^A^+R2\displaystyle A^{T}\Delta_{Q_{12}}^{\hat{A}}+\Delta_{Q_{12}}^{\hat{A}}\hat{A}+R_{2} =0,\displaystyle=0, (36)
A^TΔQ22A^+ΔQ22A^A^+R4\displaystyle\hat{A}^{T}\Delta_{Q_{22}}^{\hat{A}}+\Delta_{Q_{22}}^{\hat{A}}\hat{A}+R_{4} =0,\displaystyle=0, (37)

wherein

R1\displaystyle R_{1} =CTBiT(ΔQ13A^)T+ΔQ13A^BiC(ΔE1A^)TCiTDiCeAtdeATtdCTDiTCiΔE1A^\displaystyle=C^{T}B_{i}^{T}(\Delta_{Q_{13}}^{\hat{A}})^{T}+\Delta_{Q_{13}}^{\hat{A}}B_{i}C-(\Delta_{E_{1}}^{\hat{A}})^{T}C_{i}^{T}D_{i}Ce^{At_{d}}-e^{A^{T}t_{d}}C^{T}D_{i}^{T}C_{i}\Delta_{E_{1}}^{\hat{A}}
(ΔE1A^)TCiTCiE1E1TCiTCiΔE1A^\displaystyle\hskip 28.45274pt-(\Delta_{E_{1}}^{\hat{A}})^{T}C_{i}^{T}C_{i}E_{1}-E_{1}^{T}C_{i}^{T}C_{i}\Delta_{E_{1}}^{\hat{A}}
R2\displaystyle R_{2} =Q12ΔA^+CTBiTΔQ23A^ΔQ13A^BiC^+tdeATtdCTDiTCieA^tdΔA^\displaystyle=Q_{12}\Delta_{\hat{A}}+C^{T}B_{i}^{T}\Delta_{Q_{23}}^{\hat{A}}-\Delta_{Q_{13}}^{\hat{A}}B_{i}\hat{C}+t_{d}e^{A^{T}t_{d}}C^{T}D_{i}^{T}C_{i}e^{\hat{A}t_{d}}\Delta_{\hat{A}}
+(ΔE1A^)TCiTCieA^td+tdE1TCiTCieA^tdΔA^tdeATtdCTDiTCiE2ΔA^\displaystyle\hskip 28.45274pt+(\Delta_{E_{1}}^{\hat{A}})^{T}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}+t_{d}E_{1}^{T}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}\Delta_{\hat{A}}-t_{d}e^{A^{T}t_{d}}C^{T}D_{i}^{T}C_{i}E_{2}\Delta_{\hat{A}}
(ΔE1A^)TCiTCiE2tdE1TCiTCiE2ΔA^\displaystyle\hskip 28.45274pt-(\Delta_{E_{1}}^{\hat{A}})^{T}C_{i}^{T}C_{i}E_{2}-t_{d}E_{1}^{T}C_{i}^{T}C_{i}E_{2}\Delta_{\hat{A}}
R4\displaystyle R_{4} =(ΔA^)TQ22+Q22ΔA^C^TBiTΔQ23A^ΔQ23A^BiC^td(ΔA^)TeA^TtdCiTCieA^td\displaystyle=(\Delta_{\hat{A}})^{T}Q_{22}+Q_{22}\Delta_{\hat{A}}-\hat{C}^{T}B_{i}^{T}\Delta_{Q_{23}}^{\hat{A}}-\Delta_{Q_{23}}^{\hat{A}}B_{i}\hat{C}-t_{d}(\Delta_{\hat{A}})^{T}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}
tdeA^TtdCiTCieA^tdΔA^+td(ΔA^)TE2TCiTCieA^td+tdE2TCiTCieA^tdΔA^\displaystyle\hskip 28.45274pt-t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}\Delta_{\hat{A}}+t_{d}(\Delta_{\hat{A}})^{T}E_{2}^{T}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}+t_{d}E_{2}^{T}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}\Delta_{\hat{A}}
+td(ΔA^)TeA^TtdCiTCiE2+tdeA^TtdCiTCiE2ΔA^td(ΔA^)TE2TCiTCiE2\displaystyle\hskip 28.45274pt+t_{d}(\Delta_{\hat{A}})^{T}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}+t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}\Delta_{\hat{A}}-t_{d}(\Delta_{\hat{A}})^{T}E_{2}^{T}C_{i}^{T}C_{i}E_{2}
tdE2TCiTCiE2ΔA^.\displaystyle\hskip 28.45274pt-t_{d}E_{2}^{T}C_{i}^{T}C_{i}E_{2}\Delta_{\hat{A}}.

By applying Lemma 3.3 on the equations (35) and (18), (36) and (19), and (37) and (21), we get

trace(BBTΔQ11A^)\displaystyle trace(BB^{T}\Delta_{Q_{11}}^{\hat{A}}) =trace(R1X11),\displaystyle=trace(R_{1}X_{11}),
trace(BB^T(ΔQ12A^)T)\displaystyle trace\big{(}B\hat{B}^{T}(\Delta_{Q_{12}}^{\hat{A}})^{T}\big{)} =trace(R2TX12),\displaystyle=trace(R_{2}^{T}X_{12}),
trace(B^B^TΔQ22A^)\displaystyle trace(\hat{B}\hat{B}^{T}\Delta_{Q_{22}}^{\hat{A}}) =trace(R4X22).\displaystyle=trace(R_{4}X_{22}).

Thus

ΔJA^\displaystyle\Delta_{J}^{\hat{A}} =trace(2Q12TX12(ΔA^)T+2Q22X22(ΔA^)T+2tdeA^TtdCiTDiCeAtdX12(ΔA^)T\displaystyle=trace\Big{(}2Q_{12}^{T}X_{12}(\Delta_{\hat{A}})^{T}+2Q_{22}X_{22}(\Delta_{\hat{A}})^{T}+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}(\Delta_{\hat{A}})^{T}
2tdeA^TtdCiTCieA^tdX22(ΔA^)T+2tdeA^TtdCiTCiE1X12(ΔA^)T\displaystyle\hskip 14.22636pt-2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{22}(\Delta_{\hat{A}})^{T}+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{12}(\Delta_{\hat{A}})^{T}
+2tdeA^TtdCiTCiE2X22(ΔA^)T2tdE2TCiTDiCeAtdX12(ΔA^)T\displaystyle\hskip 14.22636pt+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}X_{22}(\Delta_{\hat{A}})^{T}-2t_{d}E_{2}^{T}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}(\Delta_{\hat{A}})^{T}
+2tdE2TCiTCieA^tdX22(ΔA^)T2tdE2TCiTCiE1X12(ΔA^)T\displaystyle\hskip 14.22636pt+2t_{d}E_{2}^{T}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{22}(\Delta_{\hat{A}})^{T}-2t_{d}E_{2}^{T}C_{i}^{T}C_{i}E_{1}X_{12}(\Delta_{\hat{A}})^{T}
2tdE2TCiTCiE2X22(ΔA^)T+2X11CTBiT(ΔQ13A^)T2X12C^TBiT(ΔQ13A^)T\displaystyle\hskip 14.22636pt-2t_{d}E_{2}^{T}C_{i}^{T}C_{i}E_{2}X_{22}(\Delta_{\hat{A}})^{T}+2X_{11}C^{T}B_{i}^{T}(\Delta_{Q_{13}}^{\hat{A}})^{T}-2X_{12}\hat{C}^{T}B_{i}^{T}(\Delta_{Q_{13}}^{\hat{A}})^{T}
+2X12TCTBiTΔQ23A^2X22C^TBiTΔQ23A^2CiTCiE1X11(ΔE1A^)T\displaystyle\hskip 14.22636pt+2X_{12}^{T}C^{T}B_{i}^{T}\Delta_{Q_{23}}^{\hat{A}}-2X_{22}\hat{C}^{T}B_{i}^{T}\Delta_{Q_{23}}^{\hat{A}}-2C_{i}^{T}C_{i}E_{1}X_{11}(\Delta_{E_{1}}^{\hat{A}})^{T}
2CiTCiE2X12T(ΔE1A^)T2CiTDiCeAtdX11(ΔE1A^)T\displaystyle\hskip 14.22636pt-2C_{i}^{T}C_{i}E_{2}X_{12}^{T}(\Delta_{E_{1}}^{\hat{A}})^{T}-2C_{i}^{T}D_{i}Ce^{At_{d}}X_{11}(\Delta_{E_{1}}^{\hat{A}})^{T}
+2CiTCieA^tdX12T(ΔE1A^)T).\displaystyle\hskip 14.22636pt+2C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{12}^{T}(\Delta_{E_{1}}^{\hat{A}})^{T}\Big{)}.

By differentiating the equations (13), (15), (16), and (3) with respect to A^\hat{A}, we get

ATΔQ13A^+ΔQ13A^Ai+R3\displaystyle A^{T}\Delta_{Q_{13}}^{\hat{A}}+\Delta_{Q_{13}}^{\hat{A}}A_{i}+R_{3} =0,\displaystyle=0, (38)
A^TΔQ23A^+ΔQ23A^Ai+R5\displaystyle\hat{A}^{T}\Delta_{Q_{23}}^{\hat{A}}+\Delta_{Q_{23}}^{\hat{A}}A_{i}+R_{5} =0,\displaystyle=0, (39)
AiTΔQ33A^+ΔQ33A^Ai+R6\displaystyle A_{i}^{T}\Delta_{Q_{33}}^{\hat{A}}+\Delta_{Q_{33}}^{\hat{A}}A_{i}+R_{6} =0,\displaystyle=0, (40)
AiΔE1A^ΔE1A^A+R7\displaystyle A_{i}\Delta_{E_{1}}^{\hat{A}}-\Delta_{E_{1}}^{\hat{A}}A+R_{7} =0,\displaystyle=0, (41)

wherein

R3\displaystyle R_{3} =Q13ΔA^+CTBiTΔQ33A^tdeATtdCTDiTCieAitdΔA^tdE1TCiTCieAitdΔA^\displaystyle=Q_{13}\Delta_{\hat{A}}+C^{T}B_{i}^{T}\Delta_{Q_{33}}^{\hat{A}}-t_{d}e^{A^{T}t_{d}}C^{T}D_{i}^{T}C_{i}e^{A_{i}t_{d}}\Delta_{\hat{A}}-t_{d}E_{1}^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}}\Delta_{\hat{A}}
(ΔE1A^)TCiTCieAitd,\displaystyle\hskip 14.22636pt-(\Delta_{E_{1}}^{\hat{A}})^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}},
R5\displaystyle R_{5} =(ΔA^)TQ23+Q23ΔA^C^TBiTΔQ33A^+td(ΔA^)TeA^TtdCiTCieAitd\displaystyle=(\Delta_{\hat{A}})^{T}Q_{23}+Q_{23}\Delta_{\hat{A}}-\hat{C}^{T}B_{i}^{T}\Delta_{Q_{33}}^{\hat{A}}+t_{d}(\Delta_{\hat{A}})^{T}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}
+tdeA^TtdCiTCieAitdΔA^tdE2TCiTCieAitdΔA^td(ΔA^)TE2TCiTCieAitd,\displaystyle\hskip 14.22636pt+t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}\Delta_{\hat{A}}-t_{d}E_{2}^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}}\Delta_{\hat{A}}-t_{d}(\Delta_{\hat{A}})^{T}E_{2}^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}},
R6\displaystyle R_{6} =(ΔA^)TQ33+Q33ΔA^tdeAiTtdCiTCieAitdΔA^td(ΔA^)TeAiTtdCiTCieAitd,\displaystyle=(\Delta_{\hat{A}})^{T}Q_{33}+Q_{33}\Delta_{\hat{A}}-t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}\Delta_{\hat{A}}-t_{d}(\Delta_{\hat{A}})^{T}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}},
R7\displaystyle R_{7} =ΔA^E1tdeAitdΔA^BiC,\displaystyle=\Delta_{\hat{A}}E_{1}-t_{d}e^{A_{i}t_{d}}\Delta_{\hat{A}}B_{i}C,

By applying Lemma 3.3 on the equations (38) and (20), and (39) and (22), we get

trace(R3TX13)\displaystyle trace(R_{3}^{T}X_{13}) =trace(X11CTBiT(ΔQ13A^)TX12C^TBiT(ΔQ13A^)T),\displaystyle=trace\big{(}X_{11}C^{T}B_{i}^{T}(\Delta_{Q_{13}}^{\hat{A}})^{T}-X_{12}\hat{C}^{T}B_{i}^{T}(\Delta_{Q_{13}}^{\hat{A}})^{T}\big{)},
trace(R5TX23)\displaystyle trace(R_{5}^{T}X_{23}) =trace(X12TCTBiTΔQ23A^X22C^TBiΔQ23A^).\displaystyle=trace\big{(}X_{12}^{T}C^{T}B_{i}^{T}\Delta_{Q_{23}}^{\hat{A}}-X_{22}\hat{C}^{T}B_{i}\Delta_{Q_{23}}^{\hat{A}}\big{)}.

Thus

ΔJA^\displaystyle\Delta_{J}^{\hat{A}} =trace(2Q12TX12(ΔA^)T+2Q22X22(ΔA^)T+2Q13TX13(ΔA^)T\displaystyle=trace\Big{(}2Q_{12}^{T}X_{12}(\Delta_{\hat{A}})^{T}+2Q_{22}X_{22}(\Delta_{\hat{A}})^{T}+2Q_{13}^{T}X_{13}(\Delta_{\hat{A}})^{T}
+2Q23X23(ΔA^)T+2Q23X23T(ΔA^)T+2tdeA^TtdCiTDiCeAtdX12(ΔA^)T\displaystyle\hskip 14.22636pt+2Q_{23}X_{23}(\Delta_{\hat{A}})^{T}+2Q_{23}X_{23}^{T}(\Delta_{\hat{A}})^{T}+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}(\Delta_{\hat{A}})^{T}
2tdeA^TtdCiTCieA^tdX22(ΔA^)T+2tdeA^TtdCiTCiE1X12(ΔA^)T\displaystyle\hskip 14.22636pt-2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{22}(\Delta_{\hat{A}})^{T}+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{12}(\Delta_{\hat{A}})^{T}
+2tdeA^TtdCiTCiE2X22(ΔA^)T2tdE2TCiTDiCeAtdX12(ΔA^)T\displaystyle\hskip 14.22636pt+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}X_{22}(\Delta_{\hat{A}})^{T}-2t_{d}E_{2}^{T}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}(\Delta_{\hat{A}})^{T}
+2tdE2TCiTCieA^tdX22(ΔA^)T2tdE2TCiTCiE1X12(ΔA^)T\displaystyle\hskip 14.22636pt+2t_{d}E_{2}^{T}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{22}(\Delta_{\hat{A}})^{T}-2t_{d}E_{2}^{T}C_{i}^{T}C_{i}E_{1}X_{12}(\Delta_{\hat{A}})^{T}
2tdE2TCiTCiE2X22(ΔA^)T2tdeAiTtdCiTDiCeAtdX13(ΔA^)T\displaystyle\hskip 14.22636pt-2t_{d}E_{2}^{T}C_{i}^{T}C_{i}E_{2}X_{22}(\Delta_{\hat{A}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{13}(\Delta_{\hat{A}})^{T}
2tdeAiTtdCiTCiE1X13(ΔA^)T+2tdeA^TtdCiTCieAitdX23T(ΔA^)T\displaystyle\hskip 14.22636pt-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{13}(\Delta_{\hat{A}})^{T}+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}^{T}(\Delta_{\hat{A}})^{T}
+2tdeAiTtdCiTCieA^tdX23(ΔA^)T2tdeAiTtdCiTCiE2X23(ΔA^)T\displaystyle\hskip 14.22636pt+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{23}(\Delta_{\hat{A}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}X_{23}(\Delta_{\hat{A}})^{T}
2tdE2TCiTCieAitdX23T(ΔA^)T+2BiCX13ΔQ33A^2BiC^X23ΔQ33A^\displaystyle\hskip 14.22636pt-2t_{d}E_{2}^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}^{T}(\Delta_{\hat{A}})^{T}+2B_{i}CX_{13}\Delta_{Q_{33}}^{\hat{A}}-2B_{i}\hat{C}X_{23}\Delta_{Q_{33}}^{\hat{A}}
2CiTCiE1X11(ΔE1A^)T2CiTCiE2X12T(ΔE1A^)T\displaystyle\hskip 14.22636pt-2C_{i}^{T}C_{i}E_{1}X_{11}(\Delta_{E_{1}}^{\hat{A}})^{T}-2C_{i}^{T}C_{i}E_{2}X_{12}^{T}(\Delta_{E_{1}}^{\hat{A}})^{T}
2CiTDiCeAtdX11(ΔE1A^)T+2CiTCieA^tdX12T(ΔE1A^)T).\displaystyle\hskip 14.22636pt-2C_{i}^{T}D_{i}Ce^{At_{d}}X_{11}(\Delta_{E_{1}}^{\hat{A}})^{T}+2C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{12}^{T}(\Delta_{E_{1}}^{\hat{A}})^{T}\Big{)}.

By applying Lemma 3.3 on the equations (40) and (23), we get

trace(R6X33)=2trace(BiCX13ΔQ33A^BiC^X23ΔQ33A^).\displaystyle trace(R_{6}X_{33})=2trace(B_{i}CX_{13}\Delta_{Q_{33}}^{\hat{A}}-B_{i}\hat{C}X_{23}\Delta_{Q_{33}}^{\hat{A}}).

Thus

ΔJA^\displaystyle\Delta_{J}^{\hat{A}} =trace(2Q12TX12(ΔA^)T+2Q22X22(ΔA^)T+2Q13TX13(ΔA^)T\displaystyle=trace\Big{(}2Q_{12}^{T}X_{12}(\Delta_{\hat{A}})^{T}+2Q_{22}X_{22}(\Delta_{\hat{A}})^{T}+2Q_{13}^{T}X_{13}(\Delta_{\hat{A}})^{T}
+2Q23X23(ΔA^)T+2Q23X23T(ΔA^)T+2Q33X33(ΔA^)T\displaystyle+2Q_{23}X_{23}(\Delta_{\hat{A}})^{T}+2Q_{23}X_{23}^{T}(\Delta_{\hat{A}})^{T}+2Q_{33}X_{33}(\Delta_{\hat{A}})^{T}
+2tdeA^TtdCiTDiCeAtdX12(ΔA^)T2tdeA^TtdCiTCieA^tdX22(ΔA^)T\displaystyle+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}(\Delta_{\hat{A}})^{T}-2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{22}(\Delta_{\hat{A}})^{T}
+2tdeA^TtdCiTCiE1X12(ΔA^)T+2tdeA^TtdCiTCiE2X22(ΔA^)T\displaystyle+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{12}(\Delta_{\hat{A}})^{T}+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}X_{22}(\Delta_{\hat{A}})^{T}
2tdE2TCiTDiCeAtdX12(ΔA^)T+2tdE2TCiTCieA^tdX22(ΔA^)T\displaystyle-2t_{d}E_{2}^{T}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}(\Delta_{\hat{A}})^{T}+2t_{d}E_{2}^{T}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{22}(\Delta_{\hat{A}})^{T}
2tdE2TCiTCiE1X12(ΔA^)T2tdE2TCiTCiE2X22(ΔA^)T\displaystyle-2t_{d}E_{2}^{T}C_{i}^{T}C_{i}E_{1}X_{12}(\Delta_{\hat{A}})^{T}-2t_{d}E_{2}^{T}C_{i}^{T}C_{i}E_{2}X_{22}(\Delta_{\hat{A}})^{T}
2tdeAiTtdCiTDiCeAtdX13(ΔA^)T2tdeAiTtdCiTCiE1X13(ΔA^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{13}(\Delta_{\hat{A}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{13}(\Delta_{\hat{A}})^{T}
+2tdeA^TtdCiTCieAitdX23T(ΔA^)T+2tdeAiTtdCiTCieA^tdX23(ΔA^)T\displaystyle+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}^{T}(\Delta_{\hat{A}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{23}(\Delta_{\hat{A}})^{T}
2tdeAiTtdCiTCiE2X23(ΔA^)T2tdE2TCiTCieAitdX23T(ΔA^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}X_{23}(\Delta_{\hat{A}})^{T}-2t_{d}E_{2}^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}^{T}(\Delta_{\hat{A}})^{T}
2tdeAiTtdCiTCieAitdX33(ΔA^)T2CiTCiE1X11(ΔE1A^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{33}(\Delta_{\hat{A}})^{T}-2C_{i}^{T}C_{i}E_{1}X_{11}(\Delta_{E_{1}}^{\hat{A}})^{T}
2CiTCiE2X12T(ΔE1A^)T2CiTDiCeAtdX11(ΔE1A^)T\displaystyle-2C_{i}^{T}C_{i}E_{2}X_{12}^{T}(\Delta_{E_{1}}^{\hat{A}})^{T}-2C_{i}^{T}D_{i}Ce^{At_{d}}X_{11}(\Delta_{E_{1}}^{\hat{A}})^{T}
+2CiTCieA^tdX12T(ΔE1A^)T).\displaystyle+2C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{12}^{T}(\Delta_{E_{1}}^{\hat{A}})^{T}\Big{)}.

By applying Lemma 3.3 on the equations (41) and (17), we get

trace(R7Tξ1)\displaystyle trace(R_{7}^{T}\xi_{1}) =trace(CiTCiE1X11(ΔE1A^)TCiTCiE2X12T(ΔE1A^)T\displaystyle=trace(C_{i}^{T}C_{i}E_{1}X_{11}(\Delta_{E_{1}}^{\hat{A}})^{T}-C_{i}^{T}C_{i}E_{2}X_{12}^{T}(\Delta_{E_{1}}^{\hat{A}})^{T}
CiTDiCeAtdX11(ΔE1A^)T+CiTCieA^tdX12T(ΔE1A^)T).\displaystyle\hskip 42.67912pt-C_{i}^{T}D_{i}Ce^{At_{d}}X_{11}(\Delta_{E_{1}}^{\hat{A}})^{T}+C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{12}^{T}(\Delta_{E_{1}}^{\hat{A}})^{T}).

Thus

ΔJA^\displaystyle\Delta_{J}^{\hat{A}} =trace(2Q12TX12(ΔA^)T+2Q22X22(ΔA^)T+2Q13TX13(ΔA^)T\displaystyle=trace\Big{(}2Q_{12}^{T}X_{12}(\Delta_{\hat{A}})^{T}+2Q_{22}X_{22}(\Delta_{\hat{A}})^{T}+2Q_{13}^{T}X_{13}(\Delta_{\hat{A}})^{T}
+2Q23X23(ΔA^)T+2Q23X23T(ΔA^)T+2Q33X33(ΔA^)T\displaystyle+2Q_{23}X_{23}(\Delta_{\hat{A}})^{T}+2Q_{23}X_{23}^{T}(\Delta_{\hat{A}})^{T}+2Q_{33}X_{33}(\Delta_{\hat{A}})^{T}
+2tdeA^TtdCiTDiCeAtdX12(ΔA^)T2tdeA^TtdCiTCieA^tdX22(ΔA^)T\displaystyle+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}(\Delta_{\hat{A}})^{T}-2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{22}(\Delta_{\hat{A}})^{T}
+2tdeA^TtdCiTCiE1X12(ΔA^)T+2tdeA^TtdCiTCiE2X22(ΔA^)T\displaystyle+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{12}(\Delta_{\hat{A}})^{T}+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}X_{22}(\Delta_{\hat{A}})^{T}
2tdE2TCiTDiCeAtdX12(ΔA^)T+2tdE2TCiTCieA^tdX22(ΔA^)T\displaystyle-2t_{d}E_{2}^{T}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}(\Delta_{\hat{A}})^{T}+2t_{d}E_{2}^{T}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{22}(\Delta_{\hat{A}})^{T}
2tdE2TCiTCiE1X12(ΔA^)T2tdE2TCiTCiE2X22(ΔA^)T\displaystyle-2t_{d}E_{2}^{T}C_{i}^{T}C_{i}E_{1}X_{12}(\Delta_{\hat{A}})^{T}-2t_{d}E_{2}^{T}C_{i}^{T}C_{i}E_{2}X_{22}(\Delta_{\hat{A}})^{T}
2tdeAiTtdCiTDiCeAtdX13(ΔA^)T2tdeAiTtdCiTCiE1X13(ΔA^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{13}(\Delta_{\hat{A}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{13}(\Delta_{\hat{A}})^{T}
+2tdeA^TtdCiTCieAitdX23T(ΔA^)T+2tdeAiTtdCiTCieA^tdX23(ΔA^)T\displaystyle+2t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}^{T}(\Delta_{\hat{A}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{23}(\Delta_{\hat{A}})^{T}
2tdeAiTtdCiTCiE2X23(ΔA^)T2tdE2TCiTCieAitdX23T(ΔA^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}X_{23}(\Delta_{\hat{A}})^{T}-2t_{d}E_{2}^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}^{T}(\Delta_{\hat{A}})^{T}
2tdeAiTtdCiTCieAitdX33(ΔA^)T+2ξ1E1T(ΔA^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{33}(\Delta_{\hat{A}})^{T}+2\xi_{1}E_{1}^{T}(\Delta_{\hat{A}})^{T}
2tdeAiTtdξ1CTBiT(ΔA^)T).\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}\xi_{1}C^{T}B_{i}^{T}(\Delta_{\hat{A}})^{T}\Big{)}.

Since eAitd=eA^tdE2e^{A_{i}t_{d}}=e^{\hat{A}t_{d}}-E_{2}, we get

ΔJA^\displaystyle\Delta_{J}^{\hat{A}} =trace(2Q12TX12(ΔA^)T+2Q22X22(ΔA^)T+2Q13TX13(ΔA^)T\displaystyle=trace\Big{(}2Q_{12}^{T}X_{12}(\Delta_{\hat{A}})^{T}+2Q_{22}X_{22}(\Delta_{\hat{A}})^{T}+2Q_{13}^{T}X_{13}(\Delta_{\hat{A}})^{T}
+2Q23X23(ΔA^)T+2Q23X23T(ΔA^)T+2Q33X33(ΔA^)T\displaystyle+2Q_{23}X_{23}(\Delta_{\hat{A}})^{T}+2Q_{23}X_{23}^{T}(\Delta_{\hat{A}})^{T}+2Q_{33}X_{33}(\Delta_{\hat{A}})^{T}
+2tdeAiTtdCiTDiCeAtdX12(ΔA^)T+2tdeAiTtdCiTCiE1X12(ΔA^)T\displaystyle+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}(\Delta_{\hat{A}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{12}(\Delta_{\hat{A}})^{T}
2tdeAiTtdCiTDiCeAtdX13(ΔA^)T2tdeAiTtdCiTCiE1X13(ΔA^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{13}(\Delta_{\hat{A}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{13}(\Delta_{\hat{A}})^{T}
2tdeAiTtdCiTCieAitdX22(ΔA^)T+2tdeAiTtdCiTCieAitdX23T(ΔA^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{22}(\Delta_{\hat{A}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}^{T}(\Delta_{\hat{A}})^{T}
+2tdeAiTtdCiTCieAitdX23(ΔA^)T2tdeAiTtdCiTCieAitdX33(ΔA^)T\displaystyle+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}(\Delta_{\hat{A}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{33}(\Delta_{\hat{A}})^{T}
+2ξ1E1T(ΔA^)T2tdeAiTtdξ1CTBiT(ΔA^)T).\displaystyle+2\xi_{1}E_{1}^{T}(\Delta_{\hat{A}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}\xi_{1}C^{T}B_{i}^{T}(\Delta_{\hat{A}})^{T}\Big{)}.

Hence,

A^Δrel(s)2,τ2=2(Q12TX12+Q22X22+ζ1),\displaystyle\frac{\partial}{\partial\hat{A}}||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2}=2(Q_{12}^{T}X_{12}+Q_{22}X_{22}+\zeta_{1}),

and

Q12TX12+Q22X22+ζ1=0\displaystyle Q_{12}^{T}X_{12}+Q_{22}X_{22}+\zeta_{1}=0

is a necessary condition for the local optimum of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2}.

Let us denote the first-order derivatives of Q11Q_{11}, Q12Q_{12}, Q13Q_{13}, Q22Q_{22}, Q23Q_{23}, Q33Q_{33}, E1E_{1}, and JJ with respect to B^\hat{B} as ΔQ11B^\Delta_{Q_{11}}^{\hat{B}}, ΔQ12B^\Delta_{Q_{12}}^{\hat{B}}, ΔQ13B^\Delta_{Q_{13}}^{\hat{B}}, ΔQ22B^\Delta_{Q_{22}}^{\hat{B}}, ΔQ23B^\Delta_{Q_{23}}^{\hat{B}}, ΔQ33B^\Delta_{Q_{33}}^{\hat{B}}, ΔE1B^\Delta_{E_{1}}^{\hat{B}}, and ΔJB^\Delta_{J}^{\hat{B}}, respectively. Further, let us denote the differential of B^\hat{B} as ΔB^\Delta_{\hat{B}}. By differentiating JJ with respect to B^\hat{B}, we get

ΔJB^\displaystyle\Delta_{J}^{\hat{B}} =trace(BTΔQ11B^B+2BTQ12ΔB^+2BTΔQ12B^B^+2B^TQ22ΔB^+B^TΔQ22B^B^)\displaystyle=trace(B^{T}\Delta_{Q_{11}}^{\hat{B}}B+2B^{T}Q_{12}\Delta_{\hat{B}}+2B^{T}\Delta_{Q_{12}}^{\hat{B}}\hat{B}+2\hat{B}^{T}Q_{22}\Delta_{\hat{B}}+\hat{B}^{T}\Delta_{Q_{22}}^{\hat{B}}\hat{B})
=trace(2Q12TB(ΔB^)T+2Q22B^(ΔB^)T+BBTΔQ11B^+2BB^T(ΔQ12B^)T\displaystyle=trace(2Q_{12}^{T}B(\Delta_{\hat{B}})^{T}+2Q_{22}\hat{B}(\Delta_{\hat{B}})^{T}+BB^{T}\Delta_{Q_{11}}^{\hat{B}}+2B\hat{B}^{T}(\Delta_{Q_{12}}^{\hat{B}})^{T}
+B^B^TΔQ22B^).\displaystyle\hskip 42.67912pt+\hat{B}\hat{B}^{T}\Delta_{Q_{22}}^{\hat{B}}).

By differentiating the equations (11), (12), and (14) with respect to B^\hat{B}, we get

ATΔQ11B^+ΔQ11B^A+S1\displaystyle A^{T}\Delta_{Q_{11}}^{\hat{B}}+\Delta_{Q_{11}}^{\hat{B}}A+S_{1} =0,\displaystyle=0, (42)
ATΔQ12B^+ΔQ12B^A^+S2\displaystyle A^{T}\Delta_{Q_{12}}^{\hat{B}}+\Delta_{Q_{12}}^{\hat{B}}\hat{A}+S_{2} =0,\displaystyle=0, (43)
A^TΔQ22B^+ΔQ22B^A^+S4\displaystyle\hat{A}^{T}\Delta_{Q_{22}}^{\hat{B}}+\Delta_{Q_{22}}^{\hat{B}}\hat{A}+S_{4} =0,\displaystyle=0, (44)

wherein

S1\displaystyle S_{1} =CTDiT(ΔB^)TQ13TQ13ΔB^DiC+CTBiT(ΔQ13B^)T+ΔQ13B^BiC\displaystyle=-C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}Q_{13}^{T}-Q_{13}\Delta_{\hat{B}}D_{i}C+C^{T}B_{i}^{T}(\Delta_{Q_{13}}^{\hat{B}})^{T}+\Delta_{Q_{13}}^{\hat{B}}B_{i}C
eATtdCTDiTCiΔE1B^(ΔE1B^)TCiTDiCeAtdE1TCiTCiΔE1B^\displaystyle\hskip 28.45274pt-e^{A^{T}t_{d}}C^{T}D_{i}^{T}C_{i}\Delta_{E_{1}}^{\hat{B}}-(\Delta_{E_{1}}^{\hat{B}})^{T}C_{i}^{T}D_{i}Ce^{At_{d}}-E_{1}^{T}C_{i}^{T}C_{i}\Delta_{E_{1}}^{\hat{B}}
(ΔE1B^)TCiTCiE1,\displaystyle\hskip 28.45274pt-(\Delta_{E_{1}}^{\hat{B}})^{T}C_{i}^{T}C_{i}E_{1},
S2\displaystyle S_{2} =CTDiT(ΔB^)TQ23+Q13ΔB^Ci+CTBiTΔQ23B^ΔQ13B^BiC^\displaystyle=-C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}Q_{23}+Q_{13}\Delta_{\hat{B}}C_{i}+C^{T}B_{i}^{T}\Delta_{Q_{23}}^{\hat{B}}-\Delta_{Q_{13}}^{\hat{B}}B_{i}\hat{C}
+(ΔE1B^)TCiTCieA^tdtdeATtdCTDiTCieAitdΔB^Ci\displaystyle\hskip 28.45274pt+(\Delta_{E_{1}}^{\hat{B}})^{T}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}-t_{d}e^{A^{T}t_{d}}C^{T}D_{i}^{T}C_{i}e^{A_{i}t_{d}}\Delta_{\hat{B}}C_{i}
tdE1TCiTCieAitdΔB^Ci(ΔE1B^)TCiTCiE2,\displaystyle\hskip 28.45274pt-t_{d}E_{1}^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}}\Delta_{\hat{B}}C_{i}-(\Delta_{E_{1}}^{\hat{B}})^{T}C_{i}^{T}C_{i}E_{2},
S4\displaystyle S_{4} =CiT(ΔB^)TQ23+Q23ΔB^CiC^TBiTΔQ23B^ΔQ23B^BiC^\displaystyle=C_{i}^{T}(\Delta_{\hat{B}})^{T}Q_{23}+Q_{23}\Delta_{\hat{B}}C_{i}-\hat{C}^{T}B_{i}^{T}\Delta_{Q_{23}}^{\hat{B}}-\Delta_{Q_{23}}^{\hat{B}}B_{i}\hat{C}
+tdCiT(ΔB^)TeAiTtdCiTCieA^td+tdeA^TtdCiTCieAitdΔB^Ci\displaystyle\hskip 28.45274pt+t_{d}C_{i}^{T}(\Delta_{\hat{B}})^{T}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}+t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}\Delta_{\hat{B}}C_{i}
tdCiT(ΔB^)TeAiTtdCiTCiE2tdE2TCiTCieAitdΔB^Ci.\displaystyle\hskip 28.45274pt-t_{d}C_{i}^{T}(\Delta_{\hat{B}})^{T}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}-t_{d}E_{2}^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}}\Delta_{\hat{B}}C_{i}.

By applying Lemma 3.3 on the equations (42) and (18), (43) and (19), and (44) and (21), we get

trace(BBTΔQ11B^)\displaystyle trace(BB^{T}\Delta_{Q_{11}}^{\hat{B}}) =trace(S1X11),\displaystyle=trace(S_{1}X_{11}),
trace(BB^T(ΔQ12B^)T)\displaystyle trace(B\hat{B}^{T}(\Delta_{Q_{12}}^{\hat{B}})^{T}) =trace(S2TX12),\displaystyle=trace(S_{2}^{T}X_{12}),
trace(B^B^TΔQ22B^)\displaystyle trace(\hat{B}\hat{B}^{T}\Delta_{Q_{22}}^{\hat{B}}) =trace(S4X22).\displaystyle=trace(S_{4}X_{22}).

Thus

ΔJB^\displaystyle\Delta_{J}^{\hat{B}} =trace(2Q12TB(ΔB^)T+2Q22B^(ΔB^)T2Q13TX11CTDiT(ΔB^)T\displaystyle=trace\Big{(}2Q_{12}^{T}B(\Delta_{\hat{B}})^{T}+2Q_{22}\hat{B}(\Delta_{\hat{B}})^{T}-2Q_{13}^{T}X_{11}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}
+2Q23X22CiT(ΔB^)T2Q23X12TCTDiT(ΔB^)T\displaystyle\hskip 28.45274pt+2Q_{23}X_{22}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2Q_{23}X_{12}^{T}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}
+2Q13TX12CiT(ΔB^)T2tdeAiTtdCiTDiCeAtdX12CiT(ΔB^)T\displaystyle\hskip 28.45274pt+2Q_{13}^{T}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2tdeAiTtdCiTCiE1X12CiT(ΔB^)T+2tdeAiTtdCiTCieA^tdX22CiT(ΔB^)T\displaystyle\hskip 28.45274pt-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{22}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2tdeAiTtdCiTCiE2X22CiT(ΔB^)T+2X11CTBiT(ΔQ13B^)T\displaystyle\hskip 28.45274pt-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}X_{22}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2X_{11}C^{T}B_{i}^{T}(\Delta_{Q_{13}}^{\hat{B}})^{T}
2X12C^TBiT(ΔQ13B^)T+2X12TCTBiTΔQ23B^2X22C^TBiTΔQ23B^\displaystyle\hskip 28.45274pt-2X_{12}\hat{C}^{T}B_{i}^{T}(\Delta_{Q_{13}}^{\hat{B}})^{T}+2X_{12}^{T}C^{T}B_{i}^{T}\Delta_{Q_{23}}^{\hat{B}}-2X_{22}\hat{C}^{T}B_{i}^{T}\Delta_{Q_{23}}^{\hat{B}}
2CiTDiCeAtdX11(ΔE1B^)T2CiTCiE1X11(ΔE1B^)T\displaystyle\hskip 28.45274pt-2C_{i}^{T}D_{i}Ce^{At_{d}}X_{11}(\Delta_{E_{1}}^{\hat{B}})^{T}-2C_{i}^{T}C_{i}E_{1}X_{11}(\Delta_{E_{1}}^{\hat{B}})^{T}
+2CiTCieA^tdX12T(ΔE1B^)T2CiTCiE2X12T(ΔE1B^)T).\displaystyle\hskip 28.45274pt+2C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{12}^{T}(\Delta_{E_{1}}^{\hat{B}})^{T}-2C_{i}^{T}C_{i}E_{2}X_{12}^{T}(\Delta_{E_{1}}^{\hat{B}})^{T}\Big{)}.

By differentiating the equations (13), (15), and (16) with respect to B^\hat{B}, we get

ATΔQ13B^+ΔQ13B^Ai+S3\displaystyle A^{T}\Delta_{Q_{13}}^{\hat{B}}+\Delta_{Q_{13}}^{\hat{B}}A_{i}+S_{3} =0,\displaystyle=0, (45)
A^TΔQ23B^+ΔQ23B^Ai+S5\displaystyle\hat{A}^{T}\Delta_{Q_{23}}^{\hat{B}}+\Delta_{Q_{23}}^{\hat{B}}A_{i}+S_{5} =0,\displaystyle=0, (46)
AiTΔQ33B^+ΔQ33B^Ai+S6\displaystyle A_{i}^{T}\Delta_{Q_{33}}^{\hat{B}}+\Delta_{Q_{33}}^{\hat{B}}A_{i}+S_{6} =0,\displaystyle=0, (47)

wherein

S3\displaystyle S_{3} =Q13ΔB^CiCTDiT(ΔB^)TQ33CTDiTB^TΔQ33B^\displaystyle=-Q_{13}\Delta_{\hat{B}}C_{i}-C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}Q_{33}-C^{T}D_{i}^{T}\hat{B}^{T}\Delta_{Q_{33}}^{\hat{B}}
+tdeATtdCTDiTCieAitdΔB^Ci(ΔE1B^)TCiTCieAitd+tdE1TCiTCieAitdΔB^Ci,\displaystyle+t_{d}e^{A^{T}t_{d}}C^{T}D_{i}^{T}C_{i}e^{A_{i}t_{d}}\Delta_{\hat{B}}C_{i}-(\Delta_{E_{1}}^{\hat{B}})^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}}+t_{d}E_{1}^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}}\Delta_{\hat{B}}C_{i},
S5\displaystyle S_{5} =Q23ΔB^Ci+CiT(ΔB^)TQ33C^TBiTΔQ33B^\displaystyle=-Q_{23}\Delta_{\hat{B}}C_{i}+C_{i}^{T}(\Delta_{\hat{B}})^{T}Q_{33}-\hat{C}^{T}B_{i}^{T}\Delta_{Q_{33}}^{\hat{B}}
tdeA^TtdCiTCieAitdΔB^Ci+tdE2TCiTCieAitdΔB^Ci(ΔE2B^)TCiTCieAitd,\displaystyle-t_{d}e^{\hat{A}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}\Delta_{\hat{B}}C_{i}+t_{d}E_{2}^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}}\Delta_{\hat{B}}C_{i}-(\Delta_{E_{2}}^{\hat{B}})^{T}C_{i}^{T}C_{i}e^{A_{i}t_{d}},
S6\displaystyle S_{6} =CiT(ΔB^)TQ33Q33ΔB^Ci+tdCiT(ΔB^)TeAiTtdCiTCieAitd\displaystyle=-C_{i}^{T}(\Delta_{\hat{B}})^{T}Q_{33}-Q_{33}\Delta_{\hat{B}}C_{i}+t_{d}C_{i}^{T}(\Delta_{\hat{B}})^{T}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}
+tdeAiTtdCiTCieAitdΔB^Ci.\displaystyle+t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}\Delta_{\hat{B}}C_{i}.

By applying Lemma 3.3 on the equations (45) and (20), and (46) and (22), we get

trace(X11CTBiT(ΔQ13B^)TX12C^TBiT(ΔQ13B^)T)=trace(S3TX13),\displaystyle trace(X_{11}C^{T}B_{i}^{T}(\Delta_{Q_{13}}^{\hat{B}})^{T}-X_{12}\hat{C}^{T}B_{i}^{T}(\Delta_{Q_{13}}^{\hat{B}})^{T})=trace(S_{3}^{T}X_{13}),
trace(X12TCTBiTΔQ23B^X22C^TBiTΔQ23B^)=trace(S5TX23).\displaystyle trace(X_{12}^{T}C^{T}B_{i}^{T}\Delta_{Q_{23}}^{\hat{B}}-X_{22}\hat{C}^{T}B_{i}^{T}\Delta_{Q_{23}}^{\hat{B}})=trace(S_{5}^{T}X_{23}).

Thus

ΔJB^\displaystyle\Delta_{J}^{\hat{B}} =trace(2Q12TB(ΔB^)T+2Q22B^(ΔB^)T2Q13TX11CTDiT(ΔB^)T\displaystyle=trace\Big{(}2Q_{12}^{T}B(\Delta_{\hat{B}})^{T}+2Q_{22}\hat{B}(\Delta_{\hat{B}})^{T}-2Q_{13}^{T}X_{11}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}
+2Q23X22CiT(ΔB^)T2Q23X12TCTDiT(ΔB^)T\displaystyle\hskip 7.11317pt+2Q_{23}X_{22}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2Q_{23}X_{12}^{T}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}
+2Q13TX12CiT(ΔB^)T2Q13TX13CiT(ΔB^)T\displaystyle\hskip 7.11317pt+2Q_{13}^{T}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2Q_{13}^{T}X_{13}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2Q23X23CiT(ΔB^)T+2Q33X23TCiT(ΔB^)T\displaystyle\hskip 7.11317pt-2Q_{23}X_{23}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2Q_{33}X_{23}^{T}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2Q33X13TCTDiT(ΔB^)T2tdeAiTtdCiTDiCeAtdX12CiT(ΔB^)T\displaystyle\hskip 7.11317pt-2Q_{33}X_{13}^{T}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2tdeAiTtdCiTCiE1X12CiT(ΔB^)T+2tdeAiTtdCiTCieA^tdX22CiT(ΔB^)T\displaystyle\hskip 7.11317pt-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{22}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2tdeAiTtdCiTCiE2X22CiT(ΔB^)T+2tdeAiTtdCiTDiCeAtdX13CiT(ΔB^)T\displaystyle\hskip 7.11317pt-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}X_{22}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{13}C_{i}^{T}(\Delta_{\hat{B}})^{T}
+2tdeAiTtdCiTCiE1X13CiT(ΔB^)T2tdeAiTtdCiTCieA^tdX23CiT(ΔB^)T\displaystyle\hskip 7.11317pt+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{13}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{23}C_{i}^{T}(\Delta_{\hat{B}})^{T}
+2tdeAiTtdCiTCiE2X23CiT(ΔB^)T2tdeAiTtdCiTCieAitdX23TCiT(ΔB^)T\displaystyle\hskip 7.11317pt+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}X_{23}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}^{T}C_{i}^{T}(\Delta_{\hat{B}})^{T}
+2BiCX13ΔQ33B^2BiC^X23ΔQ33B^2CiTCieAitdX13T(ΔE1B^)T\displaystyle\hskip 7.11317pt+2B_{i}CX_{13}\Delta_{Q_{33}}^{\hat{B}}-2B_{i}\hat{C}X_{23}\Delta_{Q_{33}}^{\hat{B}}-2C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{13}^{T}(\Delta_{E_{1}}^{\hat{B}})^{T}
2CiTDiCeAtdX11(ΔE1B^)T2CiTCiE1X11(ΔE1B^)T\displaystyle\hskip 7.11317pt-2C_{i}^{T}D_{i}Ce^{At_{d}}X_{11}(\Delta_{E_{1}}^{\hat{B}})^{T}-2C_{i}^{T}C_{i}E_{1}X_{11}(\Delta_{E_{1}}^{\hat{B}})^{T}
+2CiTCieA^tdX12T(ΔE1B^)T2CiTCiE2X12T(ΔE1B^)T).\displaystyle\hskip 7.11317pt+2C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{12}^{T}(\Delta_{E_{1}}^{\hat{B}})^{T}-2C_{i}^{T}C_{i}E_{2}X_{12}^{T}(\Delta_{E_{1}}^{\hat{B}})^{T}\Big{)}.

By applying Lemma 3.3 on the equations (47) and (23), we get

trace(S6X33)=trace((2BiCX132BiC^X23)ΔQ33B^).\displaystyle trace(S_{6}X_{33})=trace\big{(}(2B_{i}CX_{13}-2B_{i}\hat{C}X_{23})\Delta_{Q_{33}}^{\hat{B}}\big{)}.

Thus

ΔJB^\displaystyle\Delta_{J}^{\hat{B}} =trace(2Q12TB(ΔB^)T+2Q22B^(ΔB^)T2Q13TX11CTDiT(ΔB^)T\displaystyle=trace\Big{(}2Q_{12}^{T}B(\Delta_{\hat{B}})^{T}+2Q_{22}\hat{B}(\Delta_{\hat{B}})^{T}-2Q_{13}^{T}X_{11}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}
+2Q23X22CiT(ΔB^)T2Q23X12TCTDiT(ΔB^)T\displaystyle+2Q_{23}X_{22}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2Q_{23}X_{12}^{T}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}
+2Q13TX12CiT(ΔB^)T2Q13TX13CiT(ΔB^)T\displaystyle+2Q_{13}^{T}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2Q_{13}^{T}X_{13}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2Q23X23CiT(ΔB^)T+2Q33X23TCiT(ΔB^)T\displaystyle-2Q_{23}X_{23}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2Q_{33}X_{23}^{T}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2Q33X13TCTDiT(ΔB^)T2Q33X33CiT(ΔB^)T\displaystyle-2Q_{33}X_{13}^{T}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}-2Q_{33}X_{33}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2tdeAiTtdCiTDiCeAtdX12CiT(ΔB^)T2tdeAiTtdCiTCiE1X12CiT(ΔB^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}
+2tdeAiTtdCiTCieA^tdX22CiT(ΔB^)T2tdeAiTtdCiTCiE2X22CiT(ΔB^)T\displaystyle+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{22}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}X_{22}C_{i}^{T}(\Delta_{\hat{B}})^{T}
+2tdeAiTtdCiTDiCeAtdX13CiT(ΔB^)T+2tdeAiTtdCiTCiE1X13CiT(ΔB^)T\displaystyle+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{13}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{13}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2tdeAiTtdCiTCieA^tdX23CiT(ΔB^)T+2tdeAiTtdCiTCiE2X23CiT(ΔB^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{23}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}X_{23}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2tdeAiTtdCiTCieAitdX23TCiT(ΔB^)T+2tdeAiTtdCiTCieAitdX33CiT(ΔB^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}^{T}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{33}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2CiTCieAitdX13T(ΔE1B^)T2CiTDiCeAtdX11(ΔE1B^)T\displaystyle-2C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{13}^{T}(\Delta_{E_{1}}^{\hat{B}})^{T}-2C_{i}^{T}D_{i}Ce^{At_{d}}X_{11}(\Delta_{E_{1}}^{\hat{B}})^{T}
2CiTCiE1X11(ΔE1B^)T+2CiTCieA^tdX12T(ΔE1B^)T2CiTCiE2X12T(ΔE1B^)T).\displaystyle-2C_{i}^{T}C_{i}E_{1}X_{11}(\Delta_{E_{1}}^{\hat{B}})^{T}+2C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{12}^{T}(\Delta_{E_{1}}^{\hat{B}})^{T}-2C_{i}^{T}C_{i}E_{2}X_{12}^{T}(\Delta_{E_{1}}^{\hat{B}})^{T}\Big{)}.

By differentiating the equation (3) with respect to B^\hat{B}, we get

AiΔE1B^ΔE1B^A+S7=0\displaystyle A_{i}\Delta_{E_{1}}^{\hat{B}}-\Delta_{E_{1}}^{\hat{B}}A+S_{7}=0 (48)

where

S7=ΔB^CiE1ΔB^DiCeAtd+eAitdΔB^DiC+tdeAitdΔB^CiBiC.\displaystyle S_{7}=-\Delta_{\hat{B}}C_{i}E_{1}-\Delta_{\hat{B}}D_{i}Ce^{At_{d}}+e^{A_{i}t_{d}}\Delta_{\hat{B}}D_{i}C+t_{d}e^{A_{i}t_{d}}\Delta_{\hat{B}}C_{i}B_{i}C.

By applying Lemma 3.3 on the equations (48) and (24), we get

trace(S7Tξ2)\displaystyle trace(S_{7}^{T}\xi_{2}) =trace((CiTCieAitdX13TCiTDiCeAtdX11CiTCiE1X11\displaystyle=trace\Big{(}\big{(}-C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{13}^{T}-C_{i}^{T}D_{i}Ce^{At_{d}}X_{11}-C_{i}^{T}C_{i}E_{1}X_{11}
+CiTCieA^tdX12T2CiTCiE2X12T)(ΔE1B^)T).\displaystyle\hskip 56.9055pt+C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{12}^{T}-2C_{i}^{T}C_{i}E_{2}X_{12}^{T}\big{)}(\Delta_{E_{1}}^{\hat{B}})^{T}\big{)}.

Thus

ΔJB^\displaystyle\Delta_{J}^{\hat{B}} =trace(2Q12TB(ΔB^)T+2Q22B^(ΔB^)T2Q13TX11CTDiT(ΔB^)T\displaystyle=trace\Big{(}2Q_{12}^{T}B(\Delta_{\hat{B}})^{T}+2Q_{22}\hat{B}(\Delta_{\hat{B}})^{T}-2Q_{13}^{T}X_{11}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}
+2Q23X22CiT(ΔB^)T2Q23X12TCTDiT(ΔB^)T\displaystyle+2Q_{23}X_{22}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2Q_{23}X_{12}^{T}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}
+2Q13TX12CiT(ΔB^)T2Q13TX13CiT(ΔB^)T\displaystyle+2Q_{13}^{T}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2Q_{13}^{T}X_{13}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2Q23X23CiT(ΔB^)T+2Q33X23TCiT(ΔB^)T\displaystyle-2Q_{23}X_{23}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2Q_{33}X_{23}^{T}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2Q33X13TCTDiT(ΔB^)T2Q33X33CiT(ΔB^)T\displaystyle-2Q_{33}X_{13}^{T}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}-2Q_{33}X_{33}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2tdeAiTtdCiTDiCeAtdX12CiT(ΔB^)T2tdeAiTtdCiTCiE1X12CiT(ΔB^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}
+2tdeAiTtdCiTCieA^tdX22CiT(ΔB^)T2tdeAiTtdCiTCiE2X22CiT(ΔB^)T\displaystyle+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{22}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}X_{22}C_{i}^{T}(\Delta_{\hat{B}})^{T}
+2tdeAiTtdCiTDiCeAtdX13CiT(ΔB^)T+2tdeAiTtdCiTCiE1X13CiT(ΔB^)T\displaystyle+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{13}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{13}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2tdeAiTtdCiTCieA^tdX23CiT(ΔB^)T+2tdeAiTtdCiTCiE2X23CiT(ΔB^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{\hat{A}t_{d}}X_{23}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{2}X_{23}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2tdeAiTtdCiTCieAitdX23TCiT(ΔB^)T+2tdeAiTtdCiTCieAitdX33CiT(ΔB^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}^{T}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{33}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2ξ2E1TCiT(ΔB^)T2ξ2eATtdCTDi(ΔB^)T+2eAiTtdξ2CTDi(ΔB^)T\displaystyle-2\xi_{2}E_{1}^{T}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2\xi_{2}e^{A^{T}t_{d}}C^{T}D_{i}(\Delta_{\hat{B}})^{T}+2e^{A_{i}^{T}t_{d}}\xi_{2}C^{T}D_{i}(\Delta_{\hat{B}})^{T}
+2tdeAiTtdξ2CTBiTCiT(ΔB^)T).\displaystyle+2t_{d}e^{A_{i}^{T}t_{d}}\xi_{2}C^{T}B_{i}^{T}C_{i}^{T}(\Delta_{\hat{B}})^{T}\Big{)}.

Since eAitd=eA^tdE2e^{A_{i}t_{d}}=e^{\hat{A}t_{d}}-E_{2}, we get

ΔJB^\displaystyle\Delta_{J}^{\hat{B}} =trace(2Q12TB(ΔB^)T+2Q22B^(ΔB^)T2Q13TX11CTDiT(ΔB^)T\displaystyle=trace\Big{(}2Q_{12}^{T}B(\Delta_{\hat{B}})^{T}+2Q_{22}\hat{B}(\Delta_{\hat{B}})^{T}-2Q_{13}^{T}X_{11}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}
+2Q23X22CiT(ΔB^)T2Q23X12TCTDiT(ΔB^)T\displaystyle+2Q_{23}X_{22}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2Q_{23}X_{12}^{T}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}
+2Q13TX12CiT(ΔB^)T2Q33X13TCTDiT(ΔB^)T\displaystyle+2Q_{13}^{T}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2Q_{33}X_{13}^{T}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}
2Q13TX13CiT(ΔB^)T2Q23X23CiT(ΔB^)T\displaystyle-2Q_{13}^{T}X_{13}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2Q_{23}X_{23}C_{i}^{T}(\Delta_{\hat{B}})^{T}
+2Q33X23TCiT(ΔB^)T2Q33X33CiT(ΔB^)T\displaystyle+2Q_{33}X_{23}^{T}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2Q_{33}X_{33}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2tdeAiTtdCiTDiCeAtdX12CiT(ΔB^)T2tdeAiTtdCiTCiE1X12CiT(ΔB^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{12}C_{i}^{T}(\Delta_{\hat{B}})^{T}
+2tdeAiTtdCiTDiCeAtdX13CiT(ΔB^)T+2tdeAiTtdCiTCiE1X13CiT(ΔB^)T\displaystyle+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}D_{i}Ce^{At_{d}}X_{13}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}E_{1}X_{13}C_{i}^{T}(\Delta_{\hat{B}})^{T}
+2tdeAiTtdCiTCieAitdX22CiT(ΔB^)T2tdeAiTtdCiTCieAitdX23CiT(ΔB^)T\displaystyle+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{22}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2tdeAiTtdCiTCieAitdX23TCiT(ΔB^)T+2tdeAiTtdCiTCieAitdX33CiT(ΔB^)T\displaystyle-2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{23}^{T}C_{i}^{T}(\Delta_{\hat{B}})^{T}+2t_{d}e^{A_{i}^{T}t_{d}}C_{i}^{T}C_{i}e^{A_{i}t_{d}}X_{33}C_{i}^{T}(\Delta_{\hat{B}})^{T}
2ξ2E1TCiT(ΔB^)T2ξ2eATtdCTDiT(ΔB^)T+2eAiTtdξ2CTDiT(ΔB^)T\displaystyle-2\xi_{2}E_{1}^{T}C_{i}^{T}(\Delta_{\hat{B}})^{T}-2\xi_{2}e^{A^{T}t_{d}}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}+2e^{A_{i}^{T}t_{d}}\xi_{2}C^{T}D_{i}^{T}(\Delta_{\hat{B}})^{T}
+2tdeAiTtdξ2CTBiTCiT(ΔB^)T).\displaystyle+2t_{d}e^{A_{i}^{T}t_{d}}\xi_{2}C^{T}B_{i}^{T}C_{i}^{T}(\Delta_{\hat{B}})^{T}\Big{)}.

Hence,

B^Δrel(s)2,τ2=2(Q12TB+Q22B^+ζ2),\displaystyle\frac{\partial}{\partial\hat{B}}||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2}=2(Q_{12}^{T}B+Q_{22}\hat{B}+\zeta_{2}),

and

Q12TB+Q22B^+ζ2=0\displaystyle Q_{12}^{T}B+Q_{22}\hat{B}+\zeta_{2}=0

is a necessary condition for the local optimum of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2}.

Let us denote the first-order derivatives of P13P_{13}, P23P_{23}, P33P_{33}, E1E_{1}, and JJ with respect to C^\hat{C} as ΔP13C^\Delta_{P_{13}}^{\hat{C}}, ΔP23C^\Delta_{P_{23}}^{\hat{C}}, ΔP33C^\Delta_{P_{33}}^{\hat{C}}, ΔE1C^\Delta_{E_{1}}^{\hat{C}}, and ΔJC^\Delta_{J}^{\hat{C}}, respectively. Further, let us denote the differential of C^\hat{C} as ΔC^\Delta_{\hat{C}}. By taking differentiation of JJ with respect to C^\hat{C}, we get

ΔJC^\displaystyle\Delta_{J}^{\hat{C}} =trace(2DiCP12(ΔC^)TDiT+2DiCP13(ΔC^)TDiT+2DiC^P22(ΔC^)TDiT\displaystyle=trace\Big{(}-2D_{i}CP_{12}(\Delta_{\hat{C}})^{T}D_{i}^{T}+2D_{i}CP_{13}(\Delta_{\hat{C}})^{T}D_{i}^{T}+2D_{i}\hat{C}P_{22}(\Delta_{\hat{C}})^{T}D_{i}^{T}
2DiC^P23(ΔC^)TDiT2CiP23T(ΔC^)TDiT+2CiP33(ΔC^)TDiT\displaystyle\hskip 56.9055pt-2D_{i}\hat{C}P_{23}(\Delta_{\hat{C}})^{T}D_{i}^{T}-2C_{i}P_{23}^{T}(\Delta_{\hat{C}})^{T}D_{i}^{T}+2C_{i}P_{33}(\Delta_{\hat{C}})^{T}D_{i}^{T}
+2DiCΔP13C^CiT2DiC^ΔP23C^CiT+CiΔP33C^CiT)\displaystyle\hskip 56.9055pt+2D_{i}C\Delta_{P_{13}}^{\hat{C}}C_{i}^{T}-2D_{i}\hat{C}\Delta_{P_{23}}^{\hat{C}}C_{i}^{T}+C_{i}\Delta_{P_{33}}^{\hat{C}}C_{i}^{T}\Big{)}
=trace(2DiTDiCP12(ΔC^)T+2DiTDiCP13(ΔC^)T+2DiTDiC^P22(ΔC^)T\displaystyle=trace\Big{(}-2D_{i}^{T}D_{i}CP_{12}(\Delta_{\hat{C}})^{T}+2D_{i}^{T}D_{i}CP_{13}(\Delta_{\hat{C}})^{T}+2D_{i}^{T}D_{i}\hat{C}P_{22}(\Delta_{\hat{C}})^{T}
2DiTDiC^P23(ΔC^)T2DiTCiP23T(ΔC^)T+2DiTCiP33(ΔC^)T\displaystyle\hskip 56.9055pt-2D_{i}^{T}D_{i}\hat{C}P_{23}(\Delta_{\hat{C}})^{T}-2D_{i}^{T}C_{i}P_{23}^{T}(\Delta_{\hat{C}})^{T}+2D_{i}^{T}C_{i}P_{33}(\Delta_{\hat{C}})^{T}
+2CTDiTCi(ΔP13C^)T2C^TDiTCi(ΔP23C^)T+CiTCiΔP33C^).\displaystyle\hskip 56.9055pt+2C^{T}D_{i}^{T}C_{i}(\Delta_{P_{13}}^{\hat{C}})^{T}-2\hat{C}^{T}D_{i}^{T}C_{i}(\Delta_{P_{23}}^{\hat{C}})^{T}+C_{i}^{T}C_{i}\Delta_{P_{33}}^{\hat{C}}\Big{)}.

By taking differentiation of the equations (6), (8), (9), and (3) with respect to C^\hat{C}, we get

AΔP13C^+ΔP13C^AiT+T1\displaystyle A\Delta_{P_{13}}^{\hat{C}}+\Delta_{P_{13}}^{\hat{C}}A_{i}^{T}+T_{1} =0,\displaystyle=0, (49)
A^ΔP23C^+ΔP23C^AiT+T2\displaystyle\hat{A}\Delta_{P_{23}}^{\hat{C}}+\Delta_{P_{23}}^{\hat{C}}A_{i}^{T}+T_{2} =0,\displaystyle=0, (50)
AiΔP33C^+ΔP33C^AiT+T3\displaystyle A_{i}\Delta_{P_{33}}^{\hat{C}}+\Delta_{P_{33}}^{\hat{C}}A_{i}^{T}+T_{3} =0,\displaystyle=0, (51)
AiΔE1C^ΔE1C^A+T4\displaystyle A_{i}\Delta_{E_{1}}^{\hat{C}}-\Delta_{E_{1}}^{\hat{C}}A+T_{4} =0,\displaystyle=0, (52)

wherein

T1\displaystyle T_{1} =P13(ΔC^)TBiTP12(ΔC^)TBiTeAtdBBT(ΔE1C^)T\displaystyle=P_{13}(\Delta_{\hat{C}})^{T}B_{i}^{T}-P_{12}(\Delta_{\hat{C}})^{T}B_{i}^{T}-e^{At_{d}}BB^{T}(\Delta_{E_{1}}^{\hat{C}})^{T}
+tdeAtdBB^T(ΔC^)TBiTeAiTtd,\displaystyle+t_{d}e^{At_{d}}B\hat{B}^{T}(\Delta_{\hat{C}})^{T}B_{i}^{T}e^{A_{i}^{T}t_{d}},
T2\displaystyle T_{2} =P23(ΔC^)TBiTP22(ΔC^)TBiTeA^tdB^BT(ΔE1C^)T\displaystyle=P_{23}(\Delta_{\hat{C}})^{T}B_{i}^{T}-P_{22}(\Delta_{\hat{C}})^{T}B_{i}^{T}-e^{\hat{A}t_{d}}\hat{B}B^{T}(\Delta_{E_{1}}^{\hat{C}})^{T}
+tdeA^tdB^B^T(ΔC^)TBiTeAiTtd,\displaystyle+t_{d}e^{\hat{A}t_{d}}\hat{B}\hat{B}^{T}(\Delta_{\hat{C}})^{T}B_{i}^{T}e^{A_{i}^{T}t_{d}},
T3\displaystyle T_{3} =BiΔC^P33+P33(ΔC^)TBiT+BiCΔP13C^BiC^ΔP23C^BiΔC^P23\displaystyle=B_{i}\Delta_{\hat{C}}P_{33}+P_{33}(\Delta_{\hat{C}})^{T}B_{i}^{T}+B_{i}C\Delta_{P_{13}}^{\hat{C}}-B_{i}\hat{C}\Delta_{P_{23}}^{\hat{C}}-B_{i}\Delta_{\hat{C}}P_{23}
+(ΔP13C^)TCTBiT(ΔP23C^)TC^TBiTP23T(ΔC^)TBiTE1BBT(ΔE1C^)T\displaystyle+(\Delta_{P_{13}}^{\hat{C}})^{T}C^{T}B_{i}^{T}-(\Delta_{P_{23}}^{\hat{C}})^{T}\hat{C}^{T}B_{i}^{T}-P_{23}^{T}(\Delta_{\hat{C}})^{T}B_{i}^{T}-E_{1}BB^{T}(\Delta_{E_{1}}^{\hat{C}})^{T}
ΔE1C^BBTE1T+tdeAitdBiΔC^B^BTE1TE2B^BT(ΔE1C^)TΔE1C^BB^TE2T\displaystyle-\Delta_{E_{1}}^{\hat{C}}BB^{T}E_{1}^{T}+t_{d}e^{A_{i}t_{d}}B_{i}\Delta_{\hat{C}}\hat{B}B^{T}E_{1}^{T}-E_{2}\hat{B}B^{T}(\Delta_{E_{1}}^{\hat{C}})^{T}-\Delta_{E_{1}}^{\hat{C}}B\hat{B}^{T}E_{2}^{T}
+tdE1BB^T(ΔC^)TBiTeAiTtd+tdE2B^B^T(ΔC^)TBiTeAiTtd\displaystyle+t_{d}E_{1}B\hat{B}^{T}(\Delta_{\hat{C}})^{T}B_{i}^{T}e^{A_{i}^{T}t_{d}}+t_{d}E_{2}\hat{B}\hat{B}^{T}(\Delta_{\hat{C}})^{T}B_{i}^{T}e^{A_{i}^{T}t_{d}}
+tdeAitdBiΔC^B^B^TE2T,\displaystyle+t_{d}e^{A_{i}t_{d}}B_{i}\Delta_{\hat{C}}\hat{B}\hat{B}^{T}E_{2}^{T},
T4\displaystyle T_{4} =BiΔC^E1tdeAitdBiΔC^BiC^.\displaystyle=B_{i}\Delta_{\hat{C}}E_{1}-t_{d}e^{A_{i}t_{d}}B_{i}\Delta_{\hat{C}}B_{i}\hat{C}.

By applying Lemma 3.3 on the equations (49) and (25), (50) and (26), and (51) and (27), we get

trace(T1TY13)\displaystyle trace(T_{1}^{T}Y_{13}) =trace(CTBiTY33(ΔP13C^)T+CTDiTCi(ΔP13C^)T)\displaystyle=trace(C^{T}B_{i}^{T}Y_{33}(\Delta_{P_{13}}^{\hat{C}})^{T}+C^{T}D_{i}^{T}C_{i}(\Delta_{P_{13}}^{\hat{C}})^{T})
trace(T2TY23)\displaystyle trace(T_{2}^{T}Y_{23}) =trace(C^TBiTY33(ΔP23C^)TC^TDiTCi(ΔP23C^)T)\displaystyle=trace(-\hat{C}^{T}B_{i}^{T}Y_{33}(\Delta_{P_{23}}^{\hat{C}})^{T}-\hat{C}^{T}D_{i}^{T}C_{i}(\Delta_{P_{23}}^{\hat{C}})^{T})
trace(T3Y33)\displaystyle trace(T_{3}Y_{33}) =trace(CiTCiΔP33C^).\displaystyle=trace(C_{i}^{T}C_{i}\Delta_{P_{33}}^{\hat{C}}).

Thus

ΔJC^\displaystyle\Delta_{J}^{\hat{C}} =trace(2DiTDiCP12(ΔC^)T+2DiTDiCP13(ΔC^)T+2DiTDiC^P22(ΔC^)T\displaystyle=trace\Big{(}-2D_{i}^{T}D_{i}CP_{12}(\Delta_{\hat{C}})^{T}+2D_{i}^{T}D_{i}CP_{13}(\Delta_{\hat{C}})^{T}+2D_{i}^{T}D_{i}\hat{C}P_{22}(\Delta_{\hat{C}})^{T}
2DiTDiC^P23(ΔC^)T2DiTCiP23T(ΔC^)T+2DiTCiP33(ΔC^)T\displaystyle-2D_{i}^{T}D_{i}\hat{C}P_{23}(\Delta_{\hat{C}})^{T}-2D_{i}^{T}C_{i}P_{23}^{T}(\Delta_{\hat{C}})^{T}+2D_{i}^{T}C_{i}P_{33}(\Delta_{\hat{C}})^{T}
+2BiTY13TP13(ΔC^)T+2BiTY13TP12(ΔC^)T+2BiTY23P23(ΔC^)T\displaystyle+2B_{i}^{T}Y_{13}^{T}P_{13}(\Delta_{\hat{C}})^{T}+2B_{i}^{T}Y_{13}^{T}P_{12}(\Delta_{\hat{C}})^{T}+2B_{i}^{T}Y_{23}P_{23}(\Delta_{\hat{C}})^{T}
2BiTY23P22(ΔC^)T2BiTY33P23T(ΔC^)T+2tdBiTeAiTtdY13TeAtdBB^T(ΔC^)T\displaystyle-2B_{i}^{T}Y_{23}P_{22}(\Delta_{\hat{C}})^{T}-2B_{i}^{T}Y_{33}P_{23}^{T}(\Delta_{\hat{C}})^{T}+2t_{d}B_{i}^{T}e^{A_{i}^{T}t_{d}}Y_{13}^{T}e^{At_{d}}B\hat{B}^{T}(\Delta_{\hat{C}})^{T}
+2tdBiTeAiTtdY23eA^tdB^B^T(ΔC^)T+2tdBiTeAiTtdY33E2B^B^T(ΔC^)T\displaystyle+2t_{d}B_{i}^{T}e^{A_{i}^{T}t_{d}}Y_{23}e^{\hat{A}t_{d}}\hat{B}\hat{B}^{T}(\Delta_{\hat{C}})^{T}+2t_{d}B_{i}^{T}e^{A_{i}^{T}t_{d}}Y_{33}E_{2}\hat{B}\hat{B}^{T}(\Delta_{\hat{C}})^{T}
+2tdBiTeAiTtdY33E1BB^T(ΔC^)T2Y13TeAtdBBT(ΔE1C^)T\displaystyle+2t_{d}B_{i}^{T}e^{A_{i}^{T}t_{d}}Y_{33}E_{1}B\hat{B}^{T}(\Delta_{\hat{C}})^{T}-2Y_{13}^{T}e^{At_{d}}BB^{T}(\Delta_{E_{1}}^{\hat{C}})^{T}
2Y23eA^tdB^BT(ΔE1C^)T2Y33E1BBT(ΔE1C^)T2Y33E2B^BT(ΔE1C^)T).\displaystyle-2Y_{23}e^{\hat{A}t_{d}}\hat{B}B^{T}(\Delta_{E_{1}}^{\hat{C}})^{T}-2Y_{33}E_{1}BB^{T}(\Delta_{E_{1}}^{\hat{C}})^{T}-2Y_{33}E_{2}\hat{B}B^{T}(\Delta_{E_{1}}^{\hat{C}})^{T}\Big{)}.

By applying Lemma 3.3 on the equations (52) and (28), we get

trace(T4Tξ3)\displaystyle trace(T_{4}^{T}\xi_{3}) =trace((Y13TeAtdBBTY23eA^tdB^BTY33E1BBT\displaystyle=trace\big{(}(Y_{13}^{T}e^{At_{d}}BB^{T}-Y_{23}e^{\hat{A}t_{d}}\hat{B}B^{T}-Y_{33}E_{1}BB^{T}
Y33E2B^BT)(ΔE1C^)T).\displaystyle\hskip 142.26378pt-Y_{33}E_{2}\hat{B}B^{T})(\Delta_{E_{1}}^{\hat{C}})^{T}\big{)}.

Thus

ΔJC^\displaystyle\Delta_{J}^{\hat{C}} =trace(2DiTDiCP12(ΔC^)T+2DiTDiCP13(ΔC^)T+2DiTDiC^P22(ΔC^)T\displaystyle=trace\Big{(}-2D_{i}^{T}D_{i}CP_{12}(\Delta_{\hat{C}})^{T}+2D_{i}^{T}D_{i}CP_{13}(\Delta_{\hat{C}})^{T}+2D_{i}^{T}D_{i}\hat{C}P_{22}(\Delta_{\hat{C}})^{T}
2DiTDiC^P23(ΔC^)T2DiTCiP23T(ΔC^)T+2DiTCiP33(ΔC^)T\displaystyle-2D_{i}^{T}D_{i}\hat{C}P_{23}(\Delta_{\hat{C}})^{T}-2D_{i}^{T}C_{i}P_{23}^{T}(\Delta_{\hat{C}})^{T}+2D_{i}^{T}C_{i}P_{33}(\Delta_{\hat{C}})^{T}
+2BiTY13TP13(ΔC^)T+2BiTY13TP12(ΔC^)T+2BiTY23P23(ΔC^)T\displaystyle+2B_{i}^{T}Y_{13}^{T}P_{13}(\Delta_{\hat{C}})^{T}+2B_{i}^{T}Y_{13}^{T}P_{12}(\Delta_{\hat{C}})^{T}+2B_{i}^{T}Y_{23}P_{23}(\Delta_{\hat{C}})^{T}
2BiTY23P22(ΔC^)T2BiTY33P23T(ΔC^)T+2tdBiTeAiTtdY13TeAtdBB^T(ΔC^)T\displaystyle-2B_{i}^{T}Y_{23}P_{22}(\Delta_{\hat{C}})^{T}-2B_{i}^{T}Y_{33}P_{23}^{T}(\Delta_{\hat{C}})^{T}+2t_{d}B_{i}^{T}e^{A_{i}^{T}t_{d}}Y_{13}^{T}e^{At_{d}}B\hat{B}^{T}(\Delta_{\hat{C}})^{T}
+2tdBiTeAiTtdY23eA^tdB^B^T(ΔC^)T+2tdBiTeAiTtdY33E2B^B^T(ΔC^)T\displaystyle+2t_{d}B_{i}^{T}e^{A_{i}^{T}t_{d}}Y_{23}e^{\hat{A}t_{d}}\hat{B}\hat{B}^{T}(\Delta_{\hat{C}})^{T}+2t_{d}B_{i}^{T}e^{A_{i}^{T}t_{d}}Y_{33}E_{2}\hat{B}\hat{B}^{T}(\Delta_{\hat{C}})^{T}
+2tdBiTeAiTtdY33E1BB^T(ΔC^)T2tdBiTeAiTtdξ3C^TBiT(ΔC^)T\displaystyle+2t_{d}B_{i}^{T}e^{A_{i}^{T}t_{d}}Y_{33}E_{1}B\hat{B}^{T}(\Delta_{\hat{C}})^{T}-2t_{d}B_{i}^{T}e^{A_{i}^{T}t_{d}}\xi_{3}\hat{C}^{T}B_{i}^{T}(\Delta_{\hat{C}})^{T}
+2BiTξ3E1T(ΔC^)T).\displaystyle+2B_{i}^{T}\xi_{3}E_{1}^{T}(\Delta_{\hat{C}})^{T}\Big{)}.

Hence,

C^Δrel(s)2,τ2=2(DiTDiCP12+DiTDiC^P22+ζ3),\displaystyle\frac{\partial}{\partial\hat{C}}||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2}=2(-D_{i}^{T}D_{i}CP_{12}+D_{i}^{T}D_{i}\hat{C}P_{22}+\zeta_{3}),

and

DiTDiCP12+DiTDiC^P22+ζ3=0\displaystyle-D_{i}^{T}D_{i}CP_{12}+D_{i}^{T}D_{i}\hat{C}P_{22}+\zeta_{3}=0

is a necessary condition for the local optimum of Δrel(s)2,τ2||\Delta_{rel}(s)||_{\mathcal{H}_{2,\tau}}^{2}. This completes the proof.

Acknowledgment

This work is supported by the National Natural Science Foundation of China under Grants No. 61873336 and 61803046, the International Corporation Project of Shanghai Science and Technology Commission under Grant 21190780300, in part by The National Key Research and Development Program No. 2020YFB1708200 , and in part by the High-end foreign expert program No. G2021013008L granted by the State Administration of Foreign Experts Affairs (SAFEA).

Competing Interest Statement

The authors declare no conflict of interest.

References

  • [1] W. H. Schilders, H. A. Van der Vorst, J. Rommes, Model order reduction: theory, research aspects and applications, Vol. 13, Springer, 2008.
  • [2] P. Benner, M. Hinze, E. J. W. Ter Maten, Model reduction for circuit simulation, Vol. 74, Springer, 2011.
  • [3] P. Benner, V. Mehrmann, D. C. Sorensen, Dimension reduction of large-scale systems, Vol. 45, Springer, 2005.
  • [4] G. Obinata, B. D. Anderson, Model reduction for control system design, Springer Science & Business Media, 2012.
  • [5] P. Benner, W. Schilders, S. Grivet-Talocia, A. Quarteroni, G. Rozza, L. Miguel Silveira, Model order reduction: Volume 3 applications, De Gruyter, 2020.
  • [6] B. Moore, Principal component analysis in linear systems: Controllability, observability, and model reduction, IEEE transactions on automatic control 26 (1) (1981) 17–32.
  • [7] D. F. Enns, Model reduction with balanced realizations: An error bound and a frequency weighted generalization, in: The 23rd IEEE conference on decision and control, IEEE, 1984, pp. 127–132.
  • [8] S. Gugercin, J.-R. Li, Smith-type methods for balanced truncation of large sparse systems, in: Dimension reduction of large-scale systems, Springer, 2005, pp. 49–82.
  • [9] J.-R. Li, J. White, Low rank solution of Lyapunov equations, SIAM Journal on Matrix Analysis and Applications 24 (1) (2002) 260–280.
  • [10] T. Penzl, A cyclic low-rank Smith method for large sparse Lyapunov equations, SIAM Journal on Scientific Computing 21 (4) (1999) 1401–1418.
  • [11] P. Kürschner, M. A. Freitag, Inexact methods for the low rank solution to large scale Lyapunov equations, BIT Numerical Mathematics 60 (4) (2020) 1221–1259.
  • [12] V. Mehrmann, T. Stykel, Balanced truncation model reduction for large-scale systems in descriptor form, in: Dimension Reduction of Large-Scale Systems, Springer, 2005, pp. 83–115.
  • [13] M. S. Tombs, I. Postlethwaite, Truncated balanced realization of a stable non-minimal state-space system, International Journal of Control 46 (4) (1987) 1319–1330.
  • [14] M. G. Safonov, R. Chiang, A schur method for balanced-truncation model reduction, IEEE Transactions on Automatic Control 34 (7) (1989) 729–733.
  • [15] J. R. Phillips, L. Daniel, L. M. Silveira, Guaranteed passive balancing transformations for model order reduction, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 22 (8) (2003) 1027–1041.
  • [16] B. Yan, S. X.-D. Tan, B. McGaughy, Second-order balanced truncation for passive-order reduction of RLCK circuits, IEEE Transactions on Circuits and Systems II: Express Briefs 55 (9) (2008) 942–946.
  • [17] J. R. Phillips, L. M. Silveira, Poor man’s TBR: A simple model reduction scheme, IEEE transactions on computer-aided design of integrated circuits and systems 24 (1) (2004) 43–55.
  • [18] T. Reis, T. Stykel, Positive real and bounded real balancing for model reduction of descriptor systems, International Journal of Control 83 (1) (2010) 74–88.
  • [19] T. Reis, T. Stykel, Balanced truncation model reduction of second-order systems, Mathematical and Computer Modelling of Dynamical Systems 14 (5) (2008) 391–406.
  • [20] X. Wang, Q. Wang, Z. Zhang, Q. Chen, N. Wong, Balanced truncation for time-delay systems via approximate gramians, in: 16th Asia and South Pacific Design Automation Conference (ASP-DAC 2011), IEEE, 2011, pp. 55–60.
  • [21] L. Zhang, J. Lam, B. Huang, G.-H. Yang, On gramians and balanced truncation of discrete-time bilinear systems, International Journal of Control 76 (4) (2003) 414–427.
  • [22] S. Lall, J. E. Marsden, S. Glavaški, A subspace approach to balanced truncation for model reduction of nonlinear control systems, International Journal of Robust and Nonlinear Control: IFAC-Affiliated Journal 12 (6) (2002) 519–535.
  • [23] K. Glover, All optimal Hankel-norm approximations of linear multivariable systems and their LL_{\infty}-error bounds, International journal of control 39 (6) (1984) 1115–1193.
  • [24] K. Glover, A tutorial on Hankel-norm approximation, From data to model (1989) 26–48.
  • [25] M. Green, Balanced stochastic realizations, Linear Algebra and its Applications 98 (1988) 211–247.
  • [26] M. Green, A relative error bound for balanced stochastic truncation, IEEE Transactions on Automatic Control 33 (10) (1988) 961–965.
  • [27] G. M. Flagg, S. Gugercin, C. A. Beattie, An interpolation-based approach to \mathcal{H}_{\infty} model reduction of dynamical systems, in: 49th IEEE Conference on Decision and Control (CDC), IEEE, 2010, pp. 6791–6796.
  • [28] G. Flagg, C. A. Beattie, S. Gugercin, Interpolatory \mathcal{H}_{\infty} model reduction, Systems & Control Letters 62 (7) (2013) 567–574.
  • [29] A. Castagnotto, C. Beattie, S. Gugercin, Interpolatory methods for \mathcal{H}_{\infty} model reduction of multi-input/multi-output systems, in: Model Reduction of Parametrized Systems, Springer, 2017, pp. 349–365.
  • [30] T. Wolf, 2\mathcal{H}_{2} pseudo-optimal model order reduction, Ph.D. thesis, Technische Universität München (2014).
  • [31] D. Wilson, Optimum solution of model-reduction problem, in: Proceedings of the Institution of Electrical Engineers, Vol. 117, IET, 1970, pp. 1161–1165.
  • [32] S. Gugercin, A. C. Antoulas, C. Beattie, 2\mathcal{H}_{2} model reduction for large-scale linear dynamical systems, SIAM journal on matrix analysis and applications 30 (2) (2008) 609–638.
  • [33] P. Van Dooren, K. A. Gallivan, P.-A. Absil, 2\mathcal{H}_{2}-optimal model reduction of MIMO systems, Applied Mathematics Letters 21 (12) (2008) 1267–1273.
  • [34] Y. Xu, T. Zeng, Optimal 2\mathcal{H}_{2} model reduction for large scale MIMO systems via tangential interpolation., International Journal of Numerical Analysis & Modeling 8 (1) (2011).
  • [35] K. Gallivan, A. Vandendorpe, P. Van Dooren, Sylvester equations and projection-based model reduction, Journal of Computational and Applied Mathematics 162 (1) (2004) 213–229.
  • [36] P. Benner, M. Køhler, J. Saak, Sparse-dense Sylvester equations in 2\mathcal{H}_{2}-model order reduction., MPI. Magdeburg preprints MPIMD/11-11 (2011).
  • [37] C. A. Beattie, S. Gugercin, A trust region method for optimal 2\mathcal{H}_{2} model reduction, in: Proceedings of the 48h IEEE Conference on Decision and Control (CDC) held jointly with 2009 28th Chinese Control Conference, IEEE, 2009, pp. 5370–5375.
  • [38] Y. Jiang, K. Xu, 2\mathcal{H}_{2} optimal reduced models of general MIMO LTI systems via the cross gramian on the Stiefel manifold, Journal of the Franklin Institute 354 (8) (2017) 3210–3224.
  • [39] K. Sato, H. Sato, Structure-preserving 2\mathcal{H}_{2} optimal model reduction based on the Riemannian trust-region method, IEEE Transactions on Automatic Control 63 (2) (2017) 505–512.
  • [40] H. Sato, K. Sato, Riemannian trust-region methods for 2\mathcal{H}_{2} optimal model reduction, in: 2015 54th IEEE Conference on Decision and Control (CDC), IEEE, 2015, pp. 4648–4655.
  • [41] P. Yang, Y.-L. Jiang, K.-L. Xu, A trust-region method for 2\mathcal{H}_{2} model reduction of bilinear systems on the Stiefel manifold, Journal of the Franklin Institute 356 (4) (2019) 2258–2273.
  • [42] H. K. Panzer, S. Jaensch, T. Wolf, B. Lohmann, A greedy rational Krylov method for 2\mathcal{H}_{2}-pseudooptimal model order reduction with preservation of stability, in: 2013 American Control Conference, IEEE, 2013, pp. 5512–5517.
  • [43] T. Wolf, H. K. Panzer, B. Lohmann, 2\mathcal{H}_{2} pseudo-optimality in model order reduction by Krylov subspace methods, in: 2013 European control conference (ECC), IEEE, 2013, pp. 3427–3432.
  • [44] S. Ibrir, A projection-based algorithm for model-order reduction with 2\mathcal{H}_{2} performance: A convex-optimization setting, Automatica 93 (2018) 510–519.
  • [45] P. Kundur, N. J. Balu, M. G. Lauby, Power system stability and control, Vol. 7, McGraw-hill New York, 1994.
  • [46] B. Pal, B. Chaudhuri, Robust control in power systems, Springer Science & Business Media, 2006.
  • [47] G. Rogers, Power system oscillations, Springer Science & Business Media, 2012.
  • [48] M. Grimble, Solution of finite-time optimal control problems with mixed end constraints in the s-domain, IEEE Transactions on Automatic Control 24 (1) (1979) 100–108.
  • [49] W. Gawronski, J.-N. Juang, Model reduction in limited time and frequency intervals, International Journal of Systems Science 21 (2) (1990) 349–376.
  • [50] M. Redmann, P. Kürschner, An output error bound for time-limited balanced truncation, Systems & Control Letters 121 (2018) 1–6.
  • [51] M. Redmann, An LT2L_{T}^{2}-error bound for time-limited balanced truncation, Systems & Control Letters 136 (2020) 104620.
  • [52] S. Gugercin, A. C. Antoulas, A survey of model reduction by balanced truncation and some new results, International Journal of Control 77 (8) (2004) 748–766.
  • [53] P. Kürschner, Balanced truncation model order reduction in limited time intervals for large systems, Advances in Computational Mathematics 44 (6) (2018) 1821–1844.
  • [54] U. Zulfiqar, M. Imran, A. Ghafoor, M. Liaqat, Time/frequency-limited positive-real truncated balanced realizations, IMA Journal of Mathematical Control and Information 37 (1) (2020) 64–81.
  • [55] P. Benner, S. W. Werner, Frequency-and time-limited balanced truncation for large-scale second-order systems, Linear Algebra and its Applications 623 (2021) 68–103.
  • [56] D. Kumar, A. Jazlan, V. Sreeram, Generalized time limited gramian based model reduction, in: 2017 Australian and New Zealand Control Conference (ANZCC), IEEE, 2017, pp. 47–49.
  • [57] M. Imran, M. Imran, Development of time-limited model reduction technique for one-dimensional and two-dimensional systems with error bound via balanced structure, Journal of the Franklin Institute (2022).
  • [58] H. R. Shaker, M. Tahavori, Time-interval model reduction of bilinear systems, International Journal of Control 87 (8) (2014) 1487–1495.
  • [59] M. Tahavori, H. R. Shaker, Model reduction via time-interval balanced stochastic truncation for linear time invariant systems, International Journal of Systems Science 44 (3) (2013) 493–501.
  • [60] P. Goyal, M. Redmann, Time-limited 2\mathcal{H}_{2}-optimal model order reduction, Applied Mathematics and Computation 355 (2019) 184–197.
  • [61] K. Sinani, S. Gugercin, 2(tf)\mathcal{H}_{2}(t_{f}) optimality conditions for a finite-time horizon, Automatica 110 (2019) 108604.
  • [62] K. Das, S. Krishnaswamy, S. Majhi, 2\mathcal{H}_{2} optimal model order reduction over a finite time interval, IEEE Control Systems Letters 6 (2022) 2467–2472.
  • [63] U. Zulfiqar, V. Sreeram, X. Du, On frequency-and time-limited 2\mathcal{H}_{2}-optimal model order reduction, arXiv preprint arXiv:2102.03603.
  • [64] U. Zulfiqar, 2\mathcal{H}_{2} near-optimal projection methods for finite-horizon model order reduction, Ph.D. thesis, The University of Western Australia, Perth, Australia (2021).
  • [65] U. Zulfiqar, V. Sreeram, X. Du, Time-limited pseudo-optimal 2\mathcal{H}_{2}-model order reduction, IET Control Theory & Applications 14 (14) (2020) 1995–2007.
  • [66] A. C. Antoulas, Approximation of large-scale dynamical systems, SIAM, 2005.
  • [67] K. Zhou, Frequency-weighted \mathcal{H}_{\infty} norm and optimal Hankel norm model reduction, IEEE Transactions on Automatic Control 40 (10) (1995) 1687–1699.
  • [68] K. Zhou, J. Doyle, K. Glover, Robust and optimal control, Control Engineering Practice 4 (8) (1996) 1189–1190.
  • [69] D. A. Bini, S. Dendievel, G. Latouche, B. Meini, Computing the exponential of large block-triangular block-toeplitz matrices encountered in fluid queues, Linear Algebra and its Applications 502 (2016) 387–419.
  • [70] N. N. Smalls, The exponential function of matrices, Master thesis, Georgia State University (2007).
  • [71] K. B. Petersen, M. S. Pedersen, et al., The matrix cookbook, Technical University of Denmark 7 (15) (2008) 510.
  • [72] D. Petersson, A nonlinear optimization approach to 2\mathcal{H}_{2}-optimal modeling and control, Ph.D. thesis, Linköping University Electronic Press (2013).
  • [73] P. Benner, E. S. Quintana-Ortí, G. Quintana-Ortí, Efficient numerical algorithms for balanced stochastic truncation, International Journal of Applied Mathematics and Computer Science 11 (5) (2001) 1123-1150.
  • [74] B. Anić, C. Beattie, S. Gugercin, A. C. Antoulas, Interpolatory weighted-2\mathcal{H}_{2} model reduction, Automatica 49 (5) (2013) 1275–1280.
  • [75] T. Breiten, C. Beattie, S. Gugercin, Near-optimal frequency-weighted interpolatory model reduction, Systems & Control Letters 78 (2015) 8–18.
  • [76] U. Zulfiqar, V. Sreeram, M. I. Ahmad, X. Du, Frequency-weighted 2\mathcal{H}_{2}-optimal model order reduction via oblique projection, International Journal of Systems Science 53 (1) (2022) 182–198.
  • [77] J. Rommes, N. Martins, Efficient computation of multivariable transfer function dominant poles using subspace acceleration, IEEE transactions on power systems 21 (4) (2006) 1471–1483.
  • [78] N. Martins, P. C. Pellanda, J. Rommes, Computation of transfer function dominant zeros with applications to oscillation damping control of large power systems, IEEE transactions on power systems 22 (4) (2007) 1657–1664.
  • [79] T. A. Davis, Algorithm 832: UMFPACK v4. 3—an unsymmetric-pattern multifrontal method, ACM Transactions on Mathematical Software (TOMS) 30 (2) (2004) 196–199.
  • [80] J. W. Demmel, S. C. Eisenstat, J. R. Gilbert, X. S. Li, J. W. Liu, A supernodal approach to sparse partial pivoting, SIAM Journal on Matrix Analysis and Applications 20 (3) (1999) 720–755.
  • [81] H. K. Panzer, Model order reduction by Krylov subspace methods with global error bounds and automatic choice of parameters, Ph.D. thesis, Technische Universität München (2014).
  • [82] Y. Chahlaoui, P. V. Dooren, Benchmark examples for model reduction of linear time-invariant dynamical systems, in: Dimension reduction of large-scale systems, Springer, 2005, pp. 379–392.