$\mathcal{H}_{2}$ -optimal Model Reduction of Linear Quadratic Output Systems in Finite Frequency Range

Umair Zulfiqar, Zhi-Hua Xiao, Qiu-Yan Song, Mohammad Monir Uddin, Victor Sreeram
School of Electronic Information and Electrical Engineering, Yangtze University, Jingzhou, Hubei, 434023, China
School of Information and Mathematics, Yangtze University, Jingzhou, Hubei, 434023, China
School of Mechatronic Engineering and Automation, Shanghai University, Shanghai, 200444, China
Department of Mathematics and Physics, North South University, Dhaka, 1229, Bangladesh
Department of Electrical, Electronic, and Computer Engineering, The University of Western Australia, Perth, 6009, Australia

Abstract

Linear quadratic output systems constitute an important class of dynamical systems with numerous practical applications. When the order of these models is exceptionally high, simulating and analyzing these systems becomes computationally prohibitive. In such instances, model order reduction offers an effective solution by approximating the original high-order system with a reduced-order model while preserving the system’s essential characteristics.

In frequency-limited model order reduction, the objective is to maintain the frequency response of the original system within a specified frequency range in the reduced-order model. In this paper, a mathematical expression for the frequency-limited $\mathcal{H}_{2}$ norm is derived, which quantifies the error within the desired frequency interval. Subsequently, the necessary conditions for a local optimum of the frequency-limited $\mathcal{H}_{2}$ norm of the error are derived. The inherent difficulty in satisfying these conditions within a Petrov-Galerkin projection framework is also discussed. Based on the optimality conditions and Petrov-Galerkin projection, a stationary point iteration algorithm is proposed that enforces three of the four optimality conditions upon convergence. A numerical example is provided to illustrate the algorithm’s effectiveness in accurately approximating the original high-order model within the specified frequency interval.

keywords: $\mathcal{H}_{2}$-optimal, frequency-limited, model order reduction, projection, reduced-order model, quadratic output

journal: ArXiv.org

1 Introduction

This study focuses on a specific class of nonlinear dynamical systems with weak nonlinearity. These systems have linear time-invariant (LTI) state equations but feature quadratic nonlinear terms in the output equation, referred to as linear quadratic output (LQO) systems [1]. They naturally arise in scenarios where it is necessary to observe quantities involving products of state components in either the time or frequency domain. For example, they are used in applications that quantify energy or power, such as the internal energy functional of a system [2] or the objective cost function in optimal quadratic control problems [3]. Additionally, they are used to measure the deviation of a state’s coordinates from a reference point, such as the root mean squared displacement of spatial coordinates around an excitation point, or in stochastic modeling for calculating the variance of a random variable [4].

Consider an LQO system described by the following state and output equations:

\displaystyle G:=\begin{cases}\dot{x}(t)=Ax(t)+Bu(t),\\ y(t)=Cx(t)+\begin{bmatrix}x(t)^{T}M_{1}x(t)\\ \vdots\\ x(t)^{T}M_{p}x(t)\end{bmatrix},\end{cases}

(1)

wherein $x(t)\in\mathbb{R}^{n\times 1}$ represents the state vector, $u(t)\in\mathbb{R}^{m\times 1}$ represents inputs, and $y(t)\in\mathbb{R}^{p\times 1}$ represents outputs. $(A,B,C,M_{1},\cdots,M_{p})$ is the state-space realization of $G$ with $A\in\mathbb{R}^{n\times n}$ , $B\in\mathbb{R}^{n\times m}$ , $C\in\mathbb{R}^{p\times n}$ , and $M_{i}\in\mathbb{R}^{n\times n}$ . The state equation in (1) is identical to that of standard LTI systems. However, the output equation in (1) introduces a nonlinearity in the form of the quadratic function of states $x(t)^{T}M_{i}x(t)$ .

Let us denote $y_{1}(t)=Cx(t)$ and $y_{2,i}(t)=x(t)^{T}M_{i}x(t)$ . The input-output mapping between $u(t)$ and $y_{1}(t)$ is represented by the following transfer function:

\displaystyle G_{1}(s)=C(sI-A)^{-1}B.

Additionally, the input-output mapping between $u(t)$ and $y_{2,i}(t)$ is represented by the following multivariate transfer function:

\displaystyle G_{2,i}(s_{1},s_{2})=B^{T}(s_{1}I-A)^{-*}M_{i}(s_{2}I-A)^{-1}B,

cf. [5].
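The two transfer functions can be evaluated pointwise with a few lines of NumPy. The following is a minimal sketch, not code from the paper: the function names `g1` and `g2` are hypothetical, $B$ is assumed real, and dense linear solves are used, which is only sensible for small $n$.

```python
import numpy as np

def g1(A, B, C, s):
    """Linear part: G_1(s) = C (sI - A)^{-1} B."""
    n = A.shape[0]
    return C @ np.linalg.solve(s * np.eye(n) - A, B)

def g2(A, B, M, s1, s2):
    """Quadratic part: G_2(s1, s2) = B^T (s1 I - A)^{-*} M (s2 I - A)^{-1} B."""
    n = A.shape[0]
    X1 = np.linalg.solve(s1 * np.eye(n) - A, B)  # (s1 I - A)^{-1} B
    X2 = np.linalg.solve(s2 * np.eye(n) - A, B)  # (s2 I - A)^{-1} B
    # For real B, B^T (s1 I - A)^{-*} is the conjugate transpose of X1.
    return X1.conj().T @ M @ X2
```

For the scalar system $A=-1$, $B=1$, $C=2$, $M_{1}=3$, these give $G_{1}(j)=1-j$ and $G_{2,1}(j,2j)=0.9-0.3j$.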

To ensure high fidelity in the mathematical modeling of complex physical phenomena, it is often necessary to use dynamical systems with very high orders, sometimes exceeding several thousand. This high order $n$ makes it computationally difficult or even prohibitive to simulate and analyze the model (1). Therefore, it is important to approximate (1) with a reduced-order model (ROM) of much lower order $k$ (where $k\ll n$ ). This approach simplifies the simulation and analysis processes. The process of creating such a ROM while preserving the key features of the original model is known as model order reduction (MOR); refer to [6, 7, 8, 9, 10] for a deeper understanding of the topic.

Let us denote the $k^{th}$ -order ROM of $G$ as $G_{k}$ , characterized by the following state and output equations:

\displaystyle G_{k}:=\begin{cases}\dot{x_{k}}(t)=A_{k}x_{k}(t)+B_{k}u(t),\\ y_{k}(t)=C_{k}x_{k}(t)+\begin{bmatrix}x_{k}(t)^{T}M_{k,1}x_{k}(t)\\ \vdots\\ x_{k}(t)^{T}M_{k,p}x_{k}(t)\end{bmatrix},\end{cases}

(2)

wherein $A_{k}=W_{k}^{T}AV_{k}\in\mathbb{R}^{k\times k}$ , $B_{k}=W_{k}^{T}B\in\mathbb{R}^{k\times m}$ , $C_{k}=CV_{k}\in\mathbb{R}^{p\times k}$ , and $M_{k,i}=V_{k}^{T}M_{i}V_{k}\in\mathbb{R}^{k\times k}$ , satisfying the Petrov-Galerkin projection condition $W_{k}^{T}V_{k}=I$ . The projection matrices $V_{k}\in\mathbb{R}^{n\times k}$ and $W_{k}\in\mathbb{R}^{n\times k}$ project $G$ onto a reduced subspace to obtain the ROM $G_{k}$ . Various MOR methods differ in how they construct $V_{k}$ and $W_{k}$ . The choice of $V_{k}$ and $W_{k}$ depends on the specific characteristics of $G$ that need to be preserved in $G_{k}$ .
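As an illustration of the projection step, the sketch below forms the reduced matrices from a given pair of basis matrices. The helper name `project_lqo` is hypothetical, and rescaling $W_{k}$ by $(V_{k}^{T}W_{k})^{-1}$ is one common way (assumed here, not prescribed by the text) of enforcing $W_{k}^{T}V_{k}=I$ when $V_{k}^{T}W_{k}$ is invertible.

```python
import numpy as np

def project_lqo(A, B, C, Ms, V, W):
    """Petrov-Galerkin projection of an LQO realization (A, B, C, M_1, ..., M_p)."""
    # Rescale W so that the biorthogonality condition W^T V = I holds.
    W = W @ np.linalg.inv(V.T @ W)
    Ak = W.T @ A @ V
    Bk = W.T @ B
    Ck = C @ V
    Mks = [V.T @ M @ V for M in Ms]  # the quadratic terms use V on both sides
    return Ak, Bk, Ck, Mks, W
```

Because $M_{k,i}=V_{k}^{T}M_{i}V_{k}$ is a congruence transformation, a symmetric $M_{i}$ yields a symmetric $M_{k,i}$.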

Let $y_{k,1}(t)=C_{k}x_{k}(t)$ and $y_{k,2,i}(t)=x_{k}(t)^{T}M_{k,i}x_{k}(t)$ . The input-output relationships between $u(t)$ and $y_{k,1}(t)$ , as well as $u(t)$ and $y_{k,2,i}(t)$ , are described by the following transfer functions:

	$\displaystyle G_{k,1}(s)$	$\displaystyle=C_{k}(sI-A_{k})^{-1}B_{k},$
	$\displaystyle G_{k,2,i}(s_{1},s_{2})$	$\displaystyle=B_{k}^{T}(s_{1}I-A_{k})^{-*}M_{k,i}(s_{2}I-A_{k})^{-1}B_{k}.$

Throughout this paper, it is assumed that both $A$ and $A_{k}$ are Hurwitz.

The Balanced Truncation (BT) method, introduced in 1981, is a prominent technique for MOR [11]. This approach retains states that significantly affect energy transfer between inputs and outputs, discarding those with minimal impact as indicated by their Hankel singular values. A notable advantage of BT is its ability to estimate errors in advance of creating the ROM [12]. Moreover, BT preserves the stability of the original system. Originally proposed for LTI systems, BT’s application has broadened to include descriptor systems [13], second-order systems [14], linear time-varying systems [15], parametric systems [16], nonlinear systems [17], and bilinear systems [18]. Additionally, BT has been tailored to maintain specific system properties such as positive realness [19], bounded realness [20], passivity [21], and special structural characteristics [22]. For a detailed overview of the various BT algorithms, refer to the survey [23]. BT has been adapted for LQO systems in [24, 25, 26]. Of these algorithms, only the one proposed in [26] preserves the LQO structure in the ROM.

The $\mathcal{H}_{2}$ -optimal MOR problem for standard LTI systems has been thoroughly explored in the literature. Wilson’s conditions, which are necessary conditions for achieving a local optimum of the $\mathcal{H}_{2}$ norm of error, are outlined in [27]. The iterative rational Krylov algorithm (IRKA) was the first to apply interpolation theory to meet these conditions [28]. Other algorithms using Sylvester equations and projection have been developed and enhanced for improved robustness in [29] and [30]. Recently, the $\mathcal{H}_{2}$ -optimal MOR problem for LQO systems has been addressed in [31], where a Sylvester equation-based algorithm has been proposed to achieve a local optimum upon convergence.

Many MOR problems are inherently frequency-limited, with certain frequency ranges being more important. For example, when creating a ROM for a notch filter, it is crucial to minimize the approximation error near the notch frequency [32]. Similarly, for closed-loop stability, the ROM of a plant must accurately capture the system’s behavior in the crossover frequency region [33, 34]. In interconnected power systems, low-frequency oscillations are essential for small-signal stability studies, so the ROM should accurately represent the behavior within the frequency range of inter-area and inter-plant oscillations [35, 36]. This need has led to frequency-limited MOR, which focuses on achieving high accuracy within specific frequency intervals rather than the entire spectrum [37].

The frequency-limited MOR problem aims to create a ROM that ensures that

\displaystyle||G_{1}(j\nu)-G_{k,1}(j\nu)||\hskip 14.22636pt\textnormal{and}\hskip 14.22636pt||G_{2,i}(j\nu_{1},j\nu_{2})-G_{k,2,i}(j\nu_{1},j\nu_{2})||

are small when $\nu$ , $\nu_{1}$ , and $\nu_{2}$ are within the specified frequency range of $[0,\omega]$ rad/sec.

BT typically offers an accurate approximation of the original model across the entire frequency spectrum. In [37], BT was adapted to address the frequency-limited MOR problem, resulting in the frequency-limited BT (FLBT) algorithm. However, FLBT does not preserve the stability and a priori error bounds that BT does. The computational aspects of FLBT and efficient methods for handling large-scale systems are discussed in [38]. Additionally, FLBT has been expanded to cover a wider range of systems, including descriptor systems [39], second-order systems [40], and bilinear systems [41]. FLBT has been recently generalized for LQO systems in [42].

The frequency-limited $\mathcal{H}_{2}$ norm for LTI systems is defined in [43], with Gramian-based conditions outlined for achieving a local optimum. Generalizations of the iterative rational Krylov algorithm (IRKA) have resulted in the development of the frequency-limited IRKA (FLIRKA) in [44, 45, 46], which approximately fulfills these conditions. This paper examines the frequency-limited $\mathcal{H}_{2}$ -optimal MOR problem for LQO systems.

This research work makes several important contributions. First, it introduces the frequency-limited $\mathcal{H}_{2}$ norm ( $\mathcal{H}_{2,\omega}$ norm) for LQO systems and shows how to compute it using frequency-limited system Gramians defined in [42]. Second, it derives the necessary conditions for achieving a local optimum of $||G-G_{k}||_{\mathcal{H}_{2,\omega}}^{2}$ . Third, it compares these conditions with those for standard $\mathcal{H}_{2}$ -optimal model order reduction [31], highlighting that Petrov-Galerkin projection generally cannot achieve a local optimum in the frequency-limited scenario. Fourth, it proposes a stationary point algorithm based on Petrov-Galerkin projection, which meets three of the four necessary conditions for optimality upon convergence. The paper includes a numerical example demonstrating the algorithm’s accuracy within the specified frequency range and showing it outperforms existing methods.

2 Literature Review

In this section, we will briefly explore two key MOR algorithms relevant to LQO systems within the context of the problem at hand. The first is the FLBT [42], and the second is the $\mathcal{H}_{2}$ -optimal MOR method [31].

2.1 Frequency-limited Balanced Truncation (FLBT) [42]

FLBT creates the ROM by truncating states that contribute minimally to the input-output energy transfer within the desired frequency range of $[0,\omega]$ rad/sec. This is achieved by constructing a frequency-limited balanced realization using frequency-limited Gramians and then truncating the states corresponding to the smallest frequency-limited Hankel singular values.

The frequency-limited controllability Gramian $P_{\omega}$ within the desired frequency interval $[0,\omega]$ rad/sec is given by

\displaystyle P_{\omega}=\frac{1}{2\pi}\int_{-\omega}^{\omega}(j\nu I-A)^{-1}BB^{T}(j\nu I-A)^{-*}d\nu.

Next, we define $F_{\omega}$ as

\displaystyle F_{\omega}=\frac{1}{2\pi}\int_{-\omega}^{\omega}(j\nu I-A)^{-1}d\nu=\mathrm{Re}\Big{(}\frac{j}{\pi}\ln(-j\omega I-A)\Big{)}.

With this, $P_{\omega}$ can be computed by solving the following Lyapunov equation:

\displaystyle AP_{\omega}+P_{\omega}A^{T}+F_{\omega}BB^{T}+BB^{T}F_{\omega}^{*}=0.	(3)
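The computation of $F_{\omega}$ and $P_{\omega}$ can be sketched with SciPy's dense solvers. This is an illustrative sketch under stated assumptions: $A$ is real and Hurwitz, the helper names `f_omega` and `fl_ctrb_gramian` are hypothetical, and only the real part of the logarithm expression is kept because the integral defining $F_{\omega}$ is real for real $A$. Dense `logm` and Lyapunov solves are not suitable for large-scale systems.

```python
import numpy as np
from scipy.linalg import logm, solve_continuous_lyapunov

def f_omega(A, omega):
    """F_omega = (1/(2 pi)) * int_{-omega}^{omega} (j nu I - A)^{-1} d nu."""
    n = A.shape[0]
    # -j omega I - A has eigenvalues in the open right half-plane for
    # Hurwitz A, so the principal matrix logarithm is well defined.
    return np.real(1j / np.pi * logm(-1j * omega * np.eye(n) - A))

def fl_ctrb_gramian(A, B, omega):
    """Solve A P + P A^T + F B B^T + B B^T F^T = 0, i.e. equation (3)."""
    F = f_omega(A, omega)
    rhs = F @ B @ B.T + B @ B.T @ F.T
    # SciPy solves A X + X A^H = Q, so the constant term is negated.
    return solve_continuous_lyapunov(A, -rhs)
```

As $\omega$ grows, $F_{\omega}$ tends to $\frac{1}{2}I$ and $P_{\omega}$ tends to the standard controllability Gramian, which gives a quick sanity check.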

The frequency-limited observability Gramian $Q_{\omega}=Y_{\omega}+Z_{\omega}$ within the frequency range $[0,\omega]$ rad/sec is defined as

	$\displaystyle Y_{\omega}$	$\displaystyle=\frac{1}{2\pi}\int_{-\omega}^{\omega}(j\nu I-A)^{-*}C^{T}C(j\nu I-A)^{-1}d\nu,$
	$\displaystyle Z_{\omega}$	$\displaystyle=\frac{1}{2\pi}\int_{-\omega}^{\omega}(j\nu_{1}I-A)^{-*}\Bigg{(}\sum_{i=1}^{p}M_{i}\Big{(}\frac{1}{2\pi}\int_{-\omega}^{\omega}(j\nu_{2}I-A)^{-1}BB^{T}$
		$\displaystyle\hskip 108.12054pt(j\nu_{2}I-A)^{-*}d\nu_{2}\Big{)}M_{i}\Bigg{)}(j\nu_{1}I-A)^{-1}d\nu_{1}$
		$\displaystyle=\frac{1}{2\pi}\int_{-\omega}^{\omega}(j\nu_{1}I-A)^{-*}\big{(}\sum_{i=1}^{p}M_{i}P_{\omega}M_{i}\big{)}(j\nu_{1}I-A)^{-1}d\nu_{1}.$

$Y_{\omega}$ , $Z_{\omega}$ , and $Q_{\omega}$ can be determined by solving the following Lyapunov equations:

	$\displaystyle A^{T}Y_{\omega}+Y_{\omega}A+F_{\omega}^{*}C^{T}C+C^{T}CF_{\omega}$	$\displaystyle=0,$
	$\displaystyle A^{T}Z_{\omega}+Z_{\omega}A+\sum_{i=1}^{p}\big{(}F_{\omega}^{*}M_{i}P_{\omega}M_{i}+M_{i}P_{\omega}M_{i}F_{\omega}\big{)}$	$\displaystyle=0,$
	$\displaystyle A^{T}Q_{\omega}+Q_{\omega}A+F_{\omega}^{*}C^{T}C+C^{T}CF_{\omega}+\sum_{i=1}^{p}\big{(}F_{\omega}^{*}M_{i}P_{\omega}M_{i}+M_{i}P_{\omega}M_{i}F_{\omega}\big{)}$	$\displaystyle=0.$

The frequency-limited Hankel singular values $\sigma_{i}$ are defined as

\displaystyle\sigma_{i}=\sqrt{\lambda_{i}(P_{\omega}Q_{\omega})}\hskip 14.22636pt\textnormal{for}\hskip 14.22636pti=1,\cdots,n,

where $\lambda_{i}(\cdot)$ denotes the eigenvalues. The projection matrices in FLBT are then computed such that $W_{k}^{T}P_{\omega}W_{k}=V_{k}^{T}Q_{\omega}V_{k}=\text{diag}(\sigma_{1},\cdots,\sigma_{k})$ , where $\sigma_{1},\cdots,\sigma_{k}$ are the $k$ largest frequency-limited Hankel singular values of $G$ .

2.2 $\mathcal{H}_{2}$ -optimal MOR Algorithm (HOMORA) [31]

Let us define the matrices $P_{12}$ , $P_{k}$ , $Y_{12}$ , $Y_{k}$ , $Z_{12}$ , $Z_{k}$ , $Q_{12}$ , and $Q_{k}$ , which satisfy the following set of linear matrix equations:

	$\displaystyle AP_{12}+P_{12}A_{k}^{T}+BB_{k}^{T}$	$\displaystyle=0,$
	$\displaystyle A_{k}P_{k}+P_{k}A_{k}^{T}+B_{k}B_{k}^{T}$	$\displaystyle=0,$
	$\displaystyle A^{T}Y_{12}+Y_{12}A_{k}+C^{T}C_{k}$	$\displaystyle=0,$
	$\displaystyle A_{k}^{T}Y_{k}+Y_{k}A_{k}+C_{k}^{T}C_{k}$	$\displaystyle=0,$
	$\displaystyle A^{T}Z_{12}+Z_{12}A_{k}+\sum_{i=1}^{p}M_{i}P_{12}M_{k,i}$	$\displaystyle=0,$
	$\displaystyle A_{k}^{T}Z_{k}+Z_{k}A_{k}+\sum_{i=1}^{p}M_{k,i}P_{k}M_{k,i}$	$\displaystyle=0,$
	$\displaystyle A^{T}Q_{12}+Q_{12}A_{k}+C^{T}C_{k}+\sum_{i=1}^{p}M_{i}P_{12}M_{k,i}$	$\displaystyle=0,$
	$\displaystyle A_{k}^{T}Q_{k}+Q_{k}A_{k}+C_{k}^{T}C_{k}+\sum_{i=1}^{p}M_{k,i}P_{k}M_{k,i}$	$\displaystyle=0.$

According to [31], the necessary conditions for achieving a local optimum of the (squared) $\mathcal{H}_{2}$ -norm of the error, denoted as $||G-G_{k}||_{\mathcal{H}_{2}}^{2}$ , are described by the following set of equations:

$\displaystyle-(Y_{12}+2Z_{12})^{T}P_{12}+(Y_{k}+2Z_{k})P_{k}$	$\displaystyle=0,$	(4)
$\displaystyle-P_{12}^{T}M_{i}P_{12}+P_{k}M_{k,i}P_{k}$	$\displaystyle=0,$	(5)
$\displaystyle-(Y_{12}+2Z_{12})^{T}B+(Y_{k}+2Z_{k})B_{k}$	$\displaystyle=0,$	(6)
$\displaystyle-CP_{12}+C_{k}P_{k}$	$\displaystyle=0.$	(7)

Furthermore, it is shown that these optimality conditions can be met by setting the projection matrices as $V_{k}=P_{12}$ and $W_{k}=(Y_{12}+2Z_{12})\big{(}P_{12}^{T}(Y_{12}+2Z_{12})\big{)}^{-1}$ . Starting with an initial guess for the ROM, the projection matrices are iteratively updated until convergence, at which point the optimality conditions (4)-(7) are satisfied.
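The fixed-point iteration described above can be sketched as follows. This is a hedged illustration rather than the reference implementation of [31]: the function names, the stopping test on the eigenvalues of $A_{k}$, and the defaults `tol` and `max_iter` are assumptions, and no safeguard is included for iterates with an unstable $A_{k}$ or an ill-conditioned $P_{12}$.

```python
import numpy as np
from scipy.linalg import solve_sylvester

def homora_step(A, B, C, Ms, Ak, Bk, Ck, Mks):
    """One projection update built from the current ROM (Ak, Bk, Ck, Mks)."""
    # solve_sylvester(a, b, q) solves a X + X b = q.
    P12 = solve_sylvester(A, Ak.T, -B @ Bk.T)
    Y12 = solve_sylvester(A.T, Ak, -C.T @ Ck)
    quad = sum(M @ P12 @ Mk for M, Mk in zip(Ms, Mks))
    Z12 = solve_sylvester(A.T, Ak, -quad)
    S = Y12 + 2.0 * Z12
    V = P12
    W = S @ np.linalg.inv(P12.T @ S)  # guarantees W^T V = I
    return W.T @ A @ V, W.T @ B, C @ V, [V.T @ M @ V for M in Ms], V, W

def homora(A, B, C, Ms, Ak, Bk, Ck, Mks, tol=1e-8, max_iter=100):
    """Repeat the update until the eigenvalues of Ak stop changing (assumed test)."""
    for _ in range(max_iter):
        old = np.sort_complex(np.linalg.eigvals(Ak))
        Ak, Bk, Ck, Mks, _, _ = homora_step(A, B, C, Ms, Ak, Bk, Ck, Mks)
        new = np.sort_complex(np.linalg.eigvals(Ak))
        if np.linalg.norm(new - old) <= tol * np.linalg.norm(old):
            break
    return Ak, Bk, Ck, Mks
```

In practice, the columns of $V_{k}$ and $W_{k}$ are often re-orthogonalized between iterations for numerical robustness; that refinement is omitted here to keep the sketch close to the formulas.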

3 Main Work

In this section, we define the frequency-limited $\mathcal{H}_{2}$ norm and establish its connection to the frequency-limited observability Gramian. We then derive the necessary conditions for achieving a local optimum of the (squared) frequency-limited $\mathcal{H}_{2}$ norm of the error. Building on these optimality conditions, we present a projection-based iterative algorithm that meets three of the four optimality conditions. The challenge of satisfying the fourth optimality condition within the projection framework is also discussed. Finally, the computational aspects of the proposed algorithm are briefly discussed.

3.1 $\mathcal{H}_{2,\omega}$ norm Definition

The classical $\mathcal{H}_{2}$ norm for LQO systems is defined in the frequency domain as follows:

	$\displaystyle||G||_{\mathcal{H}_{2}}$	$\displaystyle=\Bigg{[}trace\Big{(}\frac{1}{2\pi}\int_{-\infty}^{\infty}G_{1}^{*}(j\nu)G_{1}(j\nu)d\nu$
		$\displaystyle\hskip 28.45274pt+\frac{1}{(2\pi)^{2}}\int_{-\infty}^{\infty}\int_{-\infty}^{\infty}\sum_{i=1}^{p}G_{2,i}^{*}(j\nu_{1},j\nu_{2})G_{2,i}(j\nu_{1},j\nu_{2})d\nu_{1}d\nu_{2}\Big{)}\Bigg{]}^{\frac{1}{2}},$

cf. [5, 31]. The $\mathcal{H}_{2}$ norm quantifies the output response’s power to unit white noise across the entire frequency spectrum. However, for the problem at hand, we are only interested in the output response’s power within a specific, limited frequency range. This leads to the definition of the frequency-limited $\mathcal{H}_{2}$ norm.

Definition 3.1.

The frequency-limited $\mathcal{H}_{2}$ norm of the LQO system within the frequency interval $[0,\omega]$ rad/sec is defined as

	$\displaystyle||G||_{\mathcal{H}_{2,\omega}}$	$\displaystyle=\Big{[}trace\Big{(}\frac{1}{2\pi}\int_{-\omega}^{\omega}G_{1}^{*}(j\nu)G_{1}(j\nu)d\nu$
		$\displaystyle\hskip 28.45274pt+\frac{1}{(2\pi)^{2}}\int_{-\omega}^{\omega}\int_{-\omega}^{\omega}\sum_{i=1}^{p}G_{2,i}^{*}(j\nu_{1},j\nu_{2})G_{2,i}(j\nu_{1},j\nu_{2})d\nu_{1}d\nu_{2}\Big{)}\Big{]}^{\frac{1}{2}}.$

Proposition 3.2.

The $\mathcal{H}_{2,\omega}$ norm is related to the frequency-limited observability Gramian $Q_{\omega}$ as follows:

\displaystyle||G||_{\mathcal{H}_{2,\omega}}=\sqrt{trace(B^{T}Q_{\omega}B)}.

Proof.

Observe that

	$\displaystyle trace\Big{(}\frac{1}{2\pi}\int_{-\omega}^{\omega}G_{1}^{*}(j\nu)G_{1}(j\nu)d\nu\Big{)}$
	$\displaystyle=trace\Big{(}B^{T}\Big{[}\frac{1}{2\pi}\int_{-\omega}^{\omega}(j\nu I-A)^{-*}C^{T}C(j\nu I-A)^{-1}d\nu\Big{]}B\Big{)}$
	$\displaystyle=trace(B^{T}Y_{\omega}B).$

Additionally, note that

	$\displaystyle trace\Big{(}\frac{1}{(2\pi)^{2}}\int_{-\omega}^{\omega}\int_{-\omega}^{\omega}\sum_{i=1}^{p}G_{2,i}^{*}(j\nu_{1},j\nu_{2})G_{2,i}(j\nu_{1},j\nu_{2})d\nu_{1}d\nu_{2}\Big{)}$
	$\displaystyle=trace\Bigg{(}B^{T}\Big{[}\frac{1}{2\pi}\int_{-\omega}^{\omega}(j\nu_{2}I-A)^{-*}\Big{(}\sum_{i=1}^{p}M_{i}\Big{(}\frac{1}{2\pi}\int_{-\omega}^{\omega}(j\nu_{1}I-A)^{-1}BB^{T}$
	$\displaystyle\hskip 113.81102pt(j\nu_{1}I-A)^{-*}d\nu_{1}\Big{)}M_{i}\Big{)}(j\nu_{2}I-A)^{-1}d\nu_{2}\Big{]}B\Bigg{)}$
	$\displaystyle=trace(B^{T}Z_{\omega}B).$

Therefore, we have $||G||_{\mathcal{H}_{2,\omega}}=\sqrt{trace\big{(}B^{T}(Y_{\omega}+Z_{\omega})B\big{)}}=\sqrt{trace\big{(}B^{T}(Q_{\omega})B\big{)}}$ . ∎
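Proposition 3.2 translates directly into a short numerical routine. The sketch below is illustrative only: `h2w_norm` and `f_omega` are hypothetical names, $A$ is assumed real and Hurwitz, and the real part of the logarithm expression is used for $F_{\omega}$ because the defining integral is real for real $A$.

```python
import numpy as np
from scipy.linalg import logm, solve_continuous_lyapunov

def f_omega(A, omega):
    """F_omega = (1/(2 pi)) * int_{-omega}^{omega} (j nu I - A)^{-1} d nu."""
    n = A.shape[0]
    return np.real(1j / np.pi * logm(-1j * omega * np.eye(n) - A))

def h2w_norm(A, B, C, Ms, omega):
    """Frequency-limited H2 norm via sqrt(trace(B^T Q_omega B))."""
    F = f_omega(A, omega)
    # Frequency-limited controllability Gramian, equation (3).
    P = solve_continuous_lyapunov(A, -(F @ B @ B.T + B @ B.T @ F.T))
    # Constant term of the Lyapunov equation for Q_omega = Y_omega + Z_omega.
    rhs = F.T @ C.T @ C + C.T @ C @ F
    for M in Ms:
        rhs = rhs + F.T @ M @ P @ M + M @ P @ M @ F
    Q = solve_continuous_lyapunov(A.T, -rhs)
    return np.sqrt(np.trace(B.T @ Q @ B))
```

For the scalar system $A=-1$, $B=1$, $C=2$, $M_{1}=3$ and $\omega=1$ rad/sec, the linear part contributes $4\arctan(1)/\pi=1$ and the quadratic part contributes $9(\arctan(1)/\pi)^{2}=0.5625$, so the routine returns $\sqrt{1.5625}=1.25$.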

3.2 $\mathcal{H}_{2,\omega}$ Norm of the Error

Let us define $E=G-G_{k}$ with the following state-space equations

\displaystyle E:=\begin{cases}\dot{x_{e}}(t)=\begin{bmatrix}\dot{x}(t)\\ \dot{x_{k}}(t)\end{bmatrix}=A_{e}x_{e}(t)+B_{e}u(t),\\ y_{e}(t)=y(t)-y_{k}(t)=C_{e}x_{e}(t)+\begin{bmatrix}x_{e}(t)^{T}M_{e,1}x_{e}(t)\\ \vdots\\ x_{e}(t)^{T}M_{e,p}x_{e}(t)\end{bmatrix},\end{cases}

wherein

	$\displaystyle A_{e}$	$\displaystyle=\begin{bmatrix}A&0\\ 0&A_{k}\end{bmatrix},$	$\displaystyle B_{e}$	$\displaystyle=\begin{bmatrix}B\\ B_{k}\end{bmatrix},$
	$\displaystyle M_{e,i}$	$\displaystyle=\begin{bmatrix}M_{i}&0\\ 0&-M_{k,i}\end{bmatrix},$	$\displaystyle C_{e}$	$\displaystyle=\begin{bmatrix}C&-C_{k}\end{bmatrix}.$		(8)

Let us define $F_{e,\omega}$ as follows

\displaystyle F_{e,\omega}=\frac{1}{2\pi}\int_{-\omega}^{\omega}(j\nu I-A_{e})^{-1}d\nu=\mathrm{Re}\Big{(}\frac{j}{\pi}\ln(-j\omega I-A_{e})\Big{)}.

Then the frequency-limited controllability Gramian $P_{e,\omega}$ and the frequency-limited observability Gramian $Q_{e,\omega}=Y_{e,\omega}+Z_{e,\omega}$ of realization $(A_{e},B_{e},C_{e},M_{e,1},\cdots,M_{e,p})$ can be determined by solving the following Lyapunov equations:

	$\displaystyle\hskip 85.35826ptA_{e}P_{e,\omega}+P_{e,\omega}A_{e}^{T}+F_{e,\omega}B_{e}B_{e}^{T}+B_{e}B_{e}^{T}F_{e,\omega}^{*}=0,$
	$\displaystyle\hskip 88.2037ptA_{e}^{T}Y_{e,\omega}+Y_{e,\omega}A_{e}+F_{e,\omega}^{*}C_{e}^{T}C_{e}+C_{e}^{T}C_{e}F_{e,\omega}=0,$
	$\displaystyle A_{e}^{T}Z_{e,\omega}+Z_{e,\omega}A_{e}+\sum_{i=1}^{p}\big{(}F_{e,\omega}^{*}M_{e,i}P_{e,\omega}M_{e,i}+M_{e,i}P_{e,\omega}M_{e,i}F_{e,\omega}\big{)}=0,$
	$\displaystyle A_{e}^{T}Q_{e,\omega}+Q_{e,\omega}A_{e}+F_{e,\omega}^{*}C_{e}^{T}C_{e}+C_{e}^{T}C_{e}F_{e,\omega}$
	$\displaystyle\hskip 78.24507pt+\sum_{i=1}^{p}\big{(}F_{e,\omega}^{*}M_{e,i}P_{e,\omega}M_{e,i}+M_{e,i}P_{e,\omega}M_{e,i}F_{e,\omega}\big{)}=0.$

Let us partition $P_{e,\omega}$ , $Y_{e,\omega}$ , $Z_{e,\omega}$ , and $Q_{e,\omega}$ according to (8) as follows:

	$\displaystyle P_{e,\omega}$	$\displaystyle=\begin{bmatrix}P_{\omega}&P_{12,\omega}\\ P_{12,\omega}^{*}&P_{k,\omega}\end{bmatrix},$	$\displaystyle Y_{e,\omega}$	$\displaystyle=\begin{bmatrix}Y_{\omega}&-Y_{12,\omega}\\ -Y_{12,\omega}^{*}&Y_{k,\omega}\end{bmatrix},$
	$\displaystyle Z_{e,\omega}$	$\displaystyle=\begin{bmatrix}Z_{\omega}&-Z_{12,\omega}\\ -Z_{12,\omega}^{*}&Z_{k,\omega}\end{bmatrix},$	$\displaystyle Q_{e,\omega}$	$\displaystyle=\begin{bmatrix}Q_{\omega}&-Q_{12,\omega}\\ -Q_{12,\omega}^{*}&Q_{k,\omega}\end{bmatrix}.$

Additionally, define $F_{k,\omega}$ as

\displaystyle F_{k,\omega}=\frac{1}{2\pi}\int_{-\omega}^{\omega}(j\nu I-A_{k})^{-1}d\nu=\mathrm{Re}\Big{(}\frac{j}{\pi}\ln(-j\omega I-A_{k})\Big{)}.

The following linear matrix equations then hold:

	$\displaystyle\hskip 110.96556ptAP_{12,\omega}+P_{12,\omega}A_{k}^{T}+F_{\omega}BB_{k}^{T}+BB_{k}^{T}F_{k,\omega}^{*}=0,$		(9)
	$\displaystyle\hskip 98.16191ptA_{k}P_{k,\omega}+P_{k,\omega}A_{k}^{T}+F_{k,\omega}B_{k}B_{k}^{T}+B_{k}B_{k}^{T}F_{k,\omega}^{*}=0,$		(10)
	$\displaystyle\hskip 100.72256ptA^{T}Y_{12,\omega}+Y_{12,\omega}A_{k}+F_{\omega}^{*}C^{T}C_{k}+C^{T}C_{k}F_{k,\omega}=0,$		(11)
	$\displaystyle\hskip 101.00737ptA_{k}^{T}Y_{k,\omega}+Y_{k,\omega}A_{k}+F_{k,\omega}^{*}C_{k}^{T}C_{k}+C_{k}^{T}C_{k}F_{k,\omega}=0,$		(12)
	$\displaystyle\hskip 14.79555ptA^{T}Z_{12,\omega}+Z_{12,\omega}A_{k}+\sum_{i=1}^{p}\big{(}F_{\omega}^{*}M_{i}P_{12,\omega}M_{k,i}+M_{i}P_{12,\omega}M_{k,i}F_{k,\omega}\big{)}=0,$		(13)
	$\displaystyle\hskip 8.5359ptA_{k}^{T}Z_{k,\omega}+Z_{k,\omega}A_{k}+\sum_{i=1}^{p}\big{(}F_{k,\omega}^{*}M_{k,i}P_{k,\omega}M_{k,i}+M_{k,i}P_{k,\omega}M_{k,i}F_{k,\omega}\big{)}=0,$		(14)
	$\displaystyle A^{T}Q_{12,\omega}+Q_{12,\omega}A_{k}+F_{\omega}^{*}C^{T}C_{k}+C^{T}C_{k}F_{k,\omega}$
	$\displaystyle\hskip 102.43008pt+\sum_{i=1}^{p}\big{(}F_{\omega}^{*}M_{i}P_{12,\omega}M_{k,i}+M_{i}P_{12,\omega}M_{k,i}F_{k,\omega}\big{)}=0,$		(15)
	$\displaystyle A_{k}^{T}Q_{k,\omega}+Q_{k,\omega}A_{k}+F_{k,\omega}^{*}C_{k}^{T}C_{k}+C_{k}^{T}C_{k}F_{k,\omega}$
	$\displaystyle\hskip 88.2037pt+\sum_{i=1}^{p}\big{(}F_{k,\omega}^{*}M_{k,i}P_{k,\omega}M_{k,i}+M_{k,i}P_{k,\omega}M_{k,i}F_{k,\omega}\big{)}=0.$		(16)

Finally, the $\mathcal{H}_{2,\omega}$ norm of $E$ can be expressed as:

	$\displaystyle||E||_{\mathcal{H}_{2,\omega}}$	$\displaystyle=\sqrt{trace(B_{e}^{T}Q_{e,\omega}B_{e})}$
		$\displaystyle=\sqrt{trace(B^{T}Q_{\omega}B-2B^{T}Q_{12,\omega}B_{k}+B_{k}^{T}Q_{k,\omega}B_{k})}.$
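The three-term expression for the error norm can be evaluated from the Sylvester and Lyapunov equations (9), (10), (15), and (16) together with the equations for $P_{\omega}$ and $Q_{\omega}$. The following is a minimal dense-matrix sketch under assumptions: the names `h2w_error` and `f_omega` are hypothetical, $A$ and $A_{k}$ are real and Hurwitz, and the squared error is clipped at zero to guard against round-off.

```python
import numpy as np
from scipy.linalg import logm, solve_continuous_lyapunov, solve_sylvester

def f_omega(A, omega):
    n = A.shape[0]
    return np.real(1j / np.pi * logm(-1j * omega * np.eye(n) - A))

def h2w_error(A, B, C, Ms, Ak, Bk, Ck, Mks, omega):
    """||G - G_k||_{H2,omega} from the block formula for trace(B_e^T Q_e B_e)."""
    F, Fk = f_omega(A, omega), f_omega(Ak, omega)
    # Controllability-type Gramians: P_omega (3), P_12 (9), P_k (10).
    P = solve_continuous_lyapunov(A, -(F @ B @ B.T + B @ B.T @ F.T))
    P12 = solve_sylvester(A, Ak.T, -(F @ B @ Bk.T + B @ Bk.T @ Fk.T))
    Pk = solve_continuous_lyapunov(Ak, -(Fk @ Bk @ Bk.T + Bk @ Bk.T @ Fk.T))
    # Constant terms of the equations for Q_omega, Q_12 (15), and Q_k (16).
    rQ = F.T @ C.T @ C + C.T @ C @ F
    rQ12 = F.T @ C.T @ Ck + C.T @ Ck @ Fk
    rQk = Fk.T @ Ck.T @ Ck + Ck.T @ Ck @ Fk
    for M, Mk in zip(Ms, Mks):
        rQ = rQ + F.T @ M @ P @ M + M @ P @ M @ F
        rQ12 = rQ12 + F.T @ M @ P12 @ Mk + M @ P12 @ Mk @ Fk
        rQk = rQk + Fk.T @ Mk @ Pk @ Mk + Mk @ Pk @ Mk @ Fk
    Q = solve_continuous_lyapunov(A.T, -rQ)
    Q12 = solve_sylvester(A.T, Ak, -rQ12)
    Qk = solve_continuous_lyapunov(Ak.T, -rQk)
    err2 = (np.trace(B.T @ Q @ B) - 2.0 * np.trace(B.T @ Q12 @ Bk)
            + np.trace(Bk.T @ Qk @ Bk))
    return np.sqrt(max(err2, 0.0))  # clip tiny negative round-off
```

Two sanity checks follow from the formula: the error vanishes when the ROM coincides with the original realization, and it reduces to $||G||_{\mathcal{H}_{2,\omega}}$ when $B_{k}=0$.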

Corollary 3.3.

The expression $||E||_{\mathcal{H}_{2,\omega}}^{2}=||G||_{\mathcal{H}_{2,\omega}}^{2}-2\langle G,G_{k}\rangle_{\mathcal{H}_{2,\omega}}+||G_{k}||_{\mathcal{H}_{2,\omega}}^{2}$ holds, where $\langle G,G_{k}\rangle_{\mathcal{H}_{2,\omega}}$ denotes the $\mathcal{H}_{2,\omega}$ inner product of $G$ and $G_{k}$ .

Proof.

The first and last terms in the expression for $||E||_{\mathcal{H}_{2,\omega}}^{2}$ are straightforward. The main objective is to demonstrate that the middle term corresponds to the $\mathcal{H}_{2,\omega}$ inner product of $G$ and $G_{k}$ . By expanding the definition of the inner product, we can express it as:

	$\displaystyle\langle G,G_{k}\rangle_{\mathcal{H}_{2,\omega}}$	$\displaystyle=\frac{1}{2\pi}trace\Big{(}\int_{-\omega}^{\omega}G_{1}^{*}(j\nu)G_{k,1}(j\nu)d\nu\Big{)}$
		$\displaystyle+\frac{1}{4\pi^{2}}trace\Big{(}\int_{-\omega}^{\omega}\int_{-\omega}^{\omega}\sum_{i=1}^{p}\big{(}G_{2,i}^{*}(j\nu_{1},j\nu_{2})G_{k,2,i}(j\nu_{1},j\nu_{2})\big{)}d\nu_{1}d\nu_{2}\Big{)}.$

Furthermore,

	$\displaystyle trace\Big{(}\frac{1}{2\pi}\int_{-\omega}^{\omega}G_{1}^{*}(j\nu)G_{k,1}(j\nu)d\nu\Big{)}$
	$\displaystyle=trace\Big{(}B^{T}\big{(}\frac{1}{2\pi}\int_{-\omega}^{\omega}(j\nu I-A)^{-*}C^{T}C_{k}(j\nu I-A_{k})^{-1}d\nu\big{)}B_{k}\Big{)},$
	$\displaystyle trace\Big{(}\frac{1}{4\pi^{2}}\int_{-\omega}^{\omega}\int_{-\omega}^{\omega}\sum_{i=1}^{p}G_{2,i}^{*}(j\nu_{1},j\nu_{2})G_{k,2,i}(j\nu_{1},j\nu_{2})d\nu_{1}d\nu_{2}\Big{)}$
	$\displaystyle=trace\Bigg{(}B^{T}\Big{[}\frac{1}{4\pi^{2}}\int_{-\omega}^{\omega}(j\nu_{1}I-A)^{-*}\Big{(}\sum_{i=1}^{p}M_{i}\big{(}\int_{-\omega}^{\omega}(j\nu_{2}I-A)^{-1}B$
	$\displaystyle\hskip 113.81102ptB_{k}^{T}(j\nu_{2}I-A_{k})^{-*}d\nu_{2}\big{)}M_{k,i}\Big{)}(j\nu_{1}I-A_{k})^{-1}d\nu_{1}\Big{]}B_{k}\Bigg{)}.$

The Sylvester equations (11) and (13) can be solved by evaluating the following integrals:

	$\displaystyle Y_{12,\omega}$	$\displaystyle=\frac{1}{2\pi}\int_{-\omega}^{\omega}(j\nu I-A)^{-*}C^{T}C_{k}(j\nu I-A_{k})^{-1}d\nu,$
	$\displaystyle Z_{12,\omega}$	$\displaystyle=\frac{1}{4\pi^{2}}\int_{-\omega}^{\omega}(j\nu_{1}I-A)^{-*}\Bigg{(}\sum_{i=1}^{p}M_{i}\Big{(}\int_{-\omega}^{\omega}(j\nu_{2}I-A)^{-1}B$
		$\displaystyle\hskip 113.81102ptB_{k}^{T}(j\nu_{2}I-A_{k})^{-*}d\nu_{2}\Big{)}M_{k,i}\Bigg{)}(j\nu_{1}I-A_{k})^{-1}d\nu_{1};$

cf. [26, 42]. Consequently, the $\mathcal{H}_{2,\omega}$ inner product between $G$ and $G_{k}$ can be written as

\displaystyle\langle G,G_{k}\rangle_{\mathcal{H}_{2,\omega}}=trace\big{(}B^{T}(Y_{12,\omega}+Z_{12,\omega})B_{k}\big{)}=trace(B^{T}Q_{12,\omega}B_{k}).

∎

3.3 Optimality Conditions

In this subsection, we present the necessary conditions for achieving a local optimum of $||E||_{\mathcal{H}_{2,\omega}}^{2}$ . These optimality conditions require the introduction of several new variables. We begin by defining $\bar{Z}_{12}$ and $\bar{Z}_{k}$ as the solutions to the following equations:

	$\displaystyle A^{T}\bar{Z}_{12}+\bar{Z}_{12}A_{k}+\sum_{i=1}^{p}M_{i}P_{12,\omega}M_{k,i}=0,$
	$\displaystyle A_{k}^{T}\bar{Z}_{k}+\bar{Z}_{k}A_{k}+\sum_{i=1}^{p}M_{k,i}P_{k,\omega}M_{k,i}=0.$

It is important to note that $P_{12,\omega}$ , $P_{k,\omega}$ , $Z_{12,\omega}$ and $Z_{k,\omega}$ can be derived from $P_{12}$ , $P_{k}$ , $\bar{Z}_{12}$ and $\bar{Z}_{k}$ , respectively, by restricting the integration limits to $[0,\omega]$ rad/sec in their integral definitions. Next, we define $\tilde{P}_{12}$ , $\tilde{P}_{k}$ , $\tilde{Z}_{12}$ , and $\tilde{Z}_{k}$ as follows:

$\displaystyle\tilde{P}_{12}$	$\displaystyle=P_{12}\Big{|}_{-\infty}^{\infty}-P_{12}\Big{|}_{0}^{\omega}=P_{12}-P_{12,\omega},$	(17)
$\displaystyle\tilde{P}_{k}$	$\displaystyle=P_{k}\Big{|}_{-\infty}^{\infty}-P_{k}\Big{|}_{0}^{\omega}=P_{k}-P_{k,\omega},$	(18)
$\displaystyle\tilde{Z}_{12}$	$\displaystyle=\bar{Z}_{12}\Big{|}_{-\infty}^{\infty}-\bar{Z}_{12}\Big{|}_{0}^{\omega}=\bar{Z}_{12}-Z_{12,\omega},$	(19)
$\displaystyle\tilde{Z}_{k}$	$\displaystyle=\bar{Z}_{k}\Big{|}_{-\infty}^{\infty}-\bar{Z}_{k}\Big{|}_{0}^{\omega}=\bar{Z}_{k}-Z_{k,\omega}.$	(20)

Additionally, we define $V$ , $W$ and $L_{\omega}$ as follows:

	$\displaystyle V$	$\displaystyle=B_{k}B^{T}\bar{Z}_{12}-B_{k}B_{k}^{T}\bar{Z}_{k}+P_{12}^{T}C^{T}C_{k}-P_{k}C_{k}^{T}C_{k}$
		$\displaystyle\hskip 56.9055pt+P_{12}^{T}\sum_{i=1}^{p}M_{i}P_{12,\omega}M_{k,i}-P_{k}\sum_{i=1}^{p}M_{k,i}P_{k,\omega}M_{k,i},$
	$\displaystyle W$	$\displaystyle=\frac{j}{2\pi}\mathcal{L}(-j\omega I-A_{k},V),$
	$\displaystyle L_{\omega}$	$\displaystyle=-Q_{12,\omega}^{*}\tilde{P}_{12}-\tilde{Z}_{12}^{*}P_{12,\omega}+Q_{k,\omega}\tilde{P}_{k}+\tilde{Z}_{k}P_{k,\omega}+W^{*},$

where $\mathcal{L}(-j\omega I-A_{k},V)$ denotes the Fréchet derivative of the matrix logarithm $\ln(-j\omega I-A_{k})$ in the direction of the matrix $V$ , specifically:

	$\displaystyle\mathcal{L}(-j\omega I-A_{k},V)$	$\displaystyle=\int_{0}^{1}\big{(}\tau(-j\omega I-A_{k}-I)+I\big{)}^{-1}V$
		$\displaystyle\hskip 113.81102pt\big{(}\tau(-j\omega I-A_{k}-I)+I\big{)}^{-1}d\tau;$

cf. [47].
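The integral representation of the Fréchet derivative of the matrix logarithm can be approximated by numerical quadrature. The sketch below is an illustration, not the method of [47]: the name `logm_frechet`, the choice of Gauss-Legendre quadrature, and the default of 64 nodes are assumptions, and the integration variable is called `t` in the code to keep it distinct from the frequency.

```python
import numpy as np

def logm_frechet(X, E, nodes=64):
    """L(X, E) = int_0^1 (t (X - I) + I)^{-1} E (t (X - I) + I)^{-1} dt."""
    n = X.shape[0]
    I = np.eye(n)
    # Gauss-Legendre nodes and weights on [-1, 1], mapped to [0, 1].
    x, w = np.polynomial.legendre.leggauss(nodes)
    t, w = 0.5 * (x + 1.0), 0.5 * w
    L = np.zeros((n, n), dtype=complex)
    for ti, wi in zip(t, w):
        R = np.linalg.inv(ti * (X - I) + I)
        L += wi * (R @ E @ R)
    return L
```

A call such as `logm_frechet(-1j * omega * np.eye(k) - Ak, V)` then approximates $\mathcal{L}(-j\omega I-A_{k},V)$. For a diagonal argument the result can be checked entrywise against divided differences of the scalar logarithm.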

We are now ready to state the necessary conditions for a local optimum of $||E||_{\mathcal{H}_{2,\omega}}^{2}$ .

Theorem 3.4.

A local optimum of $||E||_{\mathcal{H}_{2,\omega}}^{2}$ must satisfy the following necessary conditions:

$\displaystyle-(Y_{12,\omega}+2Z_{12,\omega})^{*}P_{12,\omega}+(Y_{k,\omega}+2Z_{k,\omega})P_{k,\omega}+L_{\omega}$	$\displaystyle=0,$	(21)
$\displaystyle-P_{12,\omega}^{*}M_{i}P_{12,\omega}+P_{k,\omega}M_{k,i}P_{k,\omega}$	$\displaystyle=0,$	(22)
$\displaystyle-(Y_{12,\omega}+2Z_{12,\omega})^{*}B+(Y_{k,\omega}+2Z_{k,\omega})B_{k}$	$\displaystyle=0,$	(23)
$\displaystyle-CP_{12,\omega}+C_{k}P_{k,\omega}$	$\displaystyle=0.$	(24)

Proof.

The proof of this theorem is lengthy, so it is provided in Appendix A. ∎
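For a given candidate ROM, the residuals of conditions (22)-(24) can be checked numerically from equations (9)-(14). The sketch below is a diagnostic aid under assumptions, not part of the paper's algorithm: the function names are hypothetical, $A$ and $A_{k}$ are real and Hurwitz, $p\geq 1$, and condition (21) is left out because it additionally requires $L_{\omega}$.

```python
import numpy as np
from scipy.linalg import logm, solve_continuous_lyapunov, solve_sylvester

def f_omega(A, omega):
    n = A.shape[0]
    return np.real(1j / np.pi * logm(-1j * omega * np.eye(n) - A))

def fl_optimality_residuals(A, B, C, Ms, Ak, Bk, Ck, Mks, omega):
    """Left-hand sides of conditions (22), (23), and (24) for a given ROM."""
    F, Fk = f_omega(A, omega), f_omega(Ak, omega)
    P12 = solve_sylvester(A, Ak.T, -(F @ B @ Bk.T + B @ Bk.T @ Fk.T))           # (9)
    Pk = solve_continuous_lyapunov(Ak, -(Fk @ Bk @ Bk.T + Bk @ Bk.T @ Fk.T))    # (10)
    Y12 = solve_sylvester(A.T, Ak, -(F.T @ C.T @ Ck + C.T @ Ck @ Fk))           # (11)
    Yk = solve_continuous_lyapunov(Ak.T, -(Fk.T @ Ck.T @ Ck + Ck.T @ Ck @ Fk))  # (12)
    r13 = sum(F.T @ M @ P12 @ Mk + M @ P12 @ Mk @ Fk for M, Mk in zip(Ms, Mks))
    r14 = sum(Fk.T @ Mk @ Pk @ Mk + Mk @ Pk @ Mk @ Fk for Mk in Mks)
    Z12 = solve_sylvester(A.T, Ak, -r13)                                        # (13)
    Zk = solve_continuous_lyapunov(Ak.T, -r14)                                  # (14)
    S12, Sk = Y12 + 2.0 * Z12, Yk + 2.0 * Zk
    res22 = [-P12.T @ M @ P12 + Pk @ Mk @ Pk for M, Mk in zip(Ms, Mks)]
    res23 = -S12.T @ B + Sk @ Bk
    res24 = -C @ P12 + Ck @ Pk
    return res22, res23, res24
```

All three residuals vanish trivially when the ROM equals the full-order realization, which is a convenient way to test an implementation before applying it to a genuinely reduced model.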

3.4 Comparison with Local Optimum of $||E||_{\mathcal{H}_{2}}^{2}$

In this subsection, we compare the necessary conditions for the local optima of $||E||_{\mathcal{H}_{2}}^{2}$ and $||E||_{\mathcal{H}_{2,\omega}}^{2}$ . To begin, we provide the expression for $||E||_{\mathcal{H}_{2}}$ as presented in [26]. The controllability Gramian $P_{e}$ and the observability Gramian $Q_{e}=Y_{e}+Z_{e}$ of realization $(A_{e},B_{e},C_{e},M_{e,1},\cdots,M_{e,p})$ can be computed by solving the following Lyapunov equations:

	$\displaystyle A_{e}P_{e}+P_{e}A_{e}^{T}+B_{e}B_{e}^{T}$	$\displaystyle=0,$
	$\displaystyle A_{e}^{T}Y_{e}+Y_{e}A_{e}+C_{e}^{T}C_{e}$	$\displaystyle=0,$
	$\displaystyle A_{e}^{T}Z_{e}+Z_{e}A_{e}+\sum_{i=1}^{p}M_{e,i}P_{e}M_{e,i}$	$\displaystyle=0,$
	$\displaystyle A_{e}^{T}Q_{e}+Q_{e}A_{e}+C_{e}^{T}C_{e}+\sum_{i=1}^{p}M_{e,i}P_{e}M_{e,i}$	$\displaystyle=0.$

We then partition $P_{e}$ , $Y_{e}$ , $Z_{e}$ , and $Q_{e}$ according to (8) as follows:

	$\displaystyle P_{e}$	$\displaystyle=\begin{bmatrix}P&P_{12}\\ P_{12}^{T}&P_{k}\end{bmatrix},$	$\displaystyle Y_{e}$	$\displaystyle=\begin{bmatrix}Y&-Y_{12}\\ -Y_{12}^{T}&Y_{k}\end{bmatrix},$
	$\displaystyle Z_{e}$	$\displaystyle=\begin{bmatrix}Z&-Z_{12}\\ -Z_{12}^{T}&Z_{k}\end{bmatrix},$	$\displaystyle Q_{e}$	$\displaystyle=\begin{bmatrix}Q&-Q_{12}\\ -Q_{12}^{T}&Q_{k}\end{bmatrix}.$

The $\mathcal{H}_{2}$ norm of $E$ can be expressed as:

	$\displaystyle||E||_{\mathcal{H}_{2}}$	$\displaystyle=\sqrt{trace(B_{e}^{T}Q_{e}B_{e})}$
		$\displaystyle=\sqrt{trace(B^{T}QB-2B^{T}Q_{12}B_{k}+B_{k}^{T}Q_{k}B_{k})}.$

The optimality conditions (21)-(24) and (4)-(7) are similar, but there are some important differences. By restricting the integration limit of $P_{e}$ and $Q_{e}$ to $[0,\omega]$ rad/sec, the optimality conditions (22)-(24) can be derived from (5)-(7), respectively. However, the optimality condition (4) does not simplify to (21) by merely limiting the integration range.

Furthermore, from the optimality conditions (5)-(7), we can deduce the optimal selections for $M_{k,i}$ , $B_{k}$ , and $C_{k}$ as:

$\displaystyle M_{k,i}$	$\displaystyle=P_{k}^{-1}P_{12}^{T}M_{i}P_{12}P_{k}^{-1},$	(25)
$\displaystyle B_{k}$	$\displaystyle=(Y_{k}+2Z_{k})^{-1}(Y_{12}+2Z_{12})^{T}B,$	(26)
$\displaystyle C_{k}$	$\displaystyle=CP_{12}P_{k}^{-1}.$	(27)

By restricting the integration limits of $P_{e}$ and $Q_{e}$ to $[0,\omega]$ rad/sec, we can derive the frequency-limited optimal choices for $M_{k,i}$ , $B_{k}$ , and $C_{k}$ from (25)-(27) as follows:

$\displaystyle M_{k,i}$	$\displaystyle=P_{k,\omega}^{-1}P_{12,\omega}^{*}M_{i}P_{12,\omega}P_{k,\omega}^{-1},$	(28)
$\displaystyle B_{k}$	$\displaystyle=(Y_{k,\omega}+2Z_{k,\omega})^{-1}(Y_{12,\omega}+2Z_{12,\omega})^{*}B,$	(29)
$\displaystyle C_{k}$	$\displaystyle=CP_{12,\omega}P_{k,\omega}^{-1}.$	(30)

The optimal projection matrices $V_{k}$ and $W_{k}$ for computing a local optimum of $||E||_{\mathcal{H}_{2}}^{2}$ are given by:

	$\displaystyle V_{k}$	$\displaystyle=P_{12}P_{k}^{-1},$
	$\displaystyle W_{k}$	$\displaystyle=(Y_{12}+2Z_{12})(Y_{k}+2Z_{k})^{-1}.$
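To make the right projection matrix concrete, the sketch below computes $P_{12}$, $P_{k}$, and $V_{k}=P_{12}P_{k}^{-1}$ for a given ROM guess in the classical (not frequency-limited) setting. It assumes the block equations $AP_{12}+P_{12}A_{k}^{T}+BB_{k}^{T}=0$ and $A_{k}P_{k}+P_{k}A_{k}^{T}+B_{k}B_{k}^{T}=0$, which follow from partitioning the Lyapunov equation for $P_{e}$. The function name and the small test matrices are hypothetical.

```python
import numpy as np
from scipy.linalg import solve_sylvester, solve_continuous_lyapunov

def h2_right_projection(A, B, A_k, B_k):
    """Return (P12, P_k, V_k) with V_k = P12 P_k^{-1}."""
    # solve_sylvester(a, b, q) solves a X + X b = q.
    P12 = solve_sylvester(A, A_k.T, -B @ B_k.T)          # A P12 + P12 A_k^T + B B_k^T = 0
    P_k = solve_continuous_lyapunov(A_k, -B_k @ B_k.T)   # A_k P_k + P_k A_k^T + B_k B_k^T = 0
    # Right division P12 / P_k via a linear solve instead of an explicit inverse.
    V_k = np.linalg.solve(P_k.T, P12.T).T
    return P12, P_k, V_k

# Hypothetical fourth-order system and second-order ROM guess.
A = np.diag([-1.0, -2.0, -3.0, -4.0])
A[0, 1] = 0.5
B = np.ones((4, 1))
A_k = np.diag([-1.5, -2.5])
B_k = np.array([[1.0], [2.0]])
P12, P_k, V_k = h2_right_projection(A, B, A_k, B_k)
print(V_k.shape)
```

The left matrix $W_{k}$ would be obtained in the same way from $Y_{12}$, $Z_{12}$, $Y_{k}$, and $Z_{k}$; it is omitted here because the corresponding block equations depend on the sign conventions of (8).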

In the frequency-limited scenario, by setting:

	$\displaystyle V_{k}$	$\displaystyle=P_{12,\omega}P_{k,\omega}^{-1},$
	$\displaystyle W_{k}$	$\displaystyle=(Y_{12,\omega}+2Z_{12,\omega})(Y_{k,\omega}+2Z_{k,\omega})^{-1},$

we comply with the optimal choices for $M_{k,i}$, $B_{k}$, and $C_{k}$ as indicated by the optimality conditions (22)-(24). However, with this choice of $V_{k}$ and $W_{k}$, determining an optimal $A_{k}$ remains elusive. By enforcing the Petrov–Galerkin projection condition $W_{k}^{*}V_{k}=I$, we ensure

$\displaystyle-(Y_{12,\omega}+2Z_{12,\omega})^{*}P_{12,\omega}+(Y_{k,\omega}+2Z_{k,\omega})P_{k,\omega}=0.$

It is important to note that, generally, $L_{\omega}$ does not simplify to zero with this choice of projection matrices. Consequently, this selection introduces a deviation in the optimality condition (21) quantified by $L_{\omega}$. In contrast, in the classical $\mathcal{H}_{2}$-optimal MOR framework, enforcing the Petrov–Galerkin projection condition $W_{k}^{T}V_{k}=I$ satisfies the optimality condition (4) and achieves a local optimum of $||E||_{\mathcal{H}_{2}}^{2}$. In summary, it is generally impossible to attain a local optimum of $||E||_{\mathcal{H}_{2,\omega}}^{2}$ within the projection framework. While the optimality conditions (22)-(24) can be met exactly, the optimality condition (21) can only be approximately satisfied.

Up to this point, we have determined the appropriate projection matrices $V_{k}=P_{12,\omega}P_{k,\omega}^{-1}$ and $W_{k}=(Y_{12,\omega}+2Z_{12,\omega})(Y_{k,\omega}+2Z_{k,\omega})^{-1}$ for the problem at hand. However, these matrices depend on the ROM $(A_{k},B_{k},C_{k},M_{k,1},\cdots,M_{k,p})$ , which is unknown. Therefore, equations (2) and (9)-(14) form a coupled system of equations, expressed as:

	$\displaystyle(A_{k},B_{k},C_{k},M_{k,1},\cdots,M_{k,p})$	$\displaystyle=f(P_{12,\omega},P_{k,\omega},Y_{12,\omega},Y_{k,\omega},Z_{12,\omega},Z_{k,\omega}),$
	$\displaystyle(P_{12,\omega},P_{k,\omega},Y_{12,\omega},Y_{k,\omega},Z_{12,\omega},Z_{k,\omega})$	$\displaystyle=g(A_{k},B_{k},C_{k},M_{k,1},\cdots,M_{k,p}).$

The stationary points of the function

$\displaystyle(A_{k},B_{k},C_{k},M_{k,1},\cdots,M_{k,p})=f\big{(}g(A_{k},B_{k},C_{k},M_{k,1},\cdots,M_{k,p})\big{)}$

satisfy the optimality conditions (22)-(24). Additionally, by enforcing the Petrov-Galerkin projection condition $W_{k}^{*}V_{k}=I$ , the optimality condition (21) is nearly satisfied, with the deviation quantified by $L_{\omega}$ . In the classical $\mathcal{H}_{2}$ -optimal MOR scenario, the situation is similar; however, enforcing the Petrov–Galerkin projection condition $W_{k}^{T}V_{k}=I$ at the stationary points ensures that all the optimality conditions (4)-(7) are fully satisfied.

In the classical $\mathcal{H}_{2}$ -optimal MOR case, it is demonstrated that if the reduction matrices are chosen as $V_{k}=P_{12}$ and $W_{k}=Y_{12}+2Z_{12}$ instead of $V_{k}=P_{12}P_{k}^{-1}$ and $W_{k}=(Y_{12}+2Z_{12})(Y_{k}+2Z_{k})^{-1}$ , the stationary points satisfy:

	$\displaystyle P_{k}$	$\displaystyle=I,\hskip 14.22636pt\textnormal{and}\hskip 14.22636ptY_{k}+2Z_{k}=I.$

Thus, the projection matrices $V_{k}=P_{12}$ and $W_{k}=Y_{12}+2Z_{12}$ , along with the Petrov–Galerkin projection condition $W_{k}^{T}V_{k}=I$ , satisfy all the optimality conditions (4)-(7). However, using $V_{k}=P_{12,\omega}$ and $W_{k}=Y_{12,\omega}+2Z_{12,\omega}$ along with the Petrov–Galerkin projection condition $W_{k}^{*}V_{k}=I$ does not satisfy any of the optimality conditions (21)-(24).

Theorem 3.5.

If $W_{k}^{*}F_{\omega}B=F_{k,\omega}B_{k}$ , $CF_{\omega}V_{k}=C_{k}F_{k,\omega}$ , and $V_{k}^{*}M_{i}F_{\omega}V_{k}=M_{k,i}F_{k,\omega}$ , then selecting $V_{k}=P_{12,\omega}$ and $W_{k}=Y_{12,\omega}+2Z_{12,\omega}$ , together with the Petrov–Galerkin projection condition $W_{k}^{*}V_{k}=I$ ensures that:

$\displaystyle P_{k,\omega}$	$\displaystyle=I,\hskip 14.22636pt\textnormal{and}\hskip 14.22636ptY_{k,\omega}+2Z_{k,\omega}=I,$	(31)

which in turn satisfies the optimality conditions (22)-(24).

Proof.

The proof is provided in Appendix B. ∎

In general, choosing $V_{k}=P_{12,\omega}$ and $W_{k}=Y_{12,\omega}+2Z_{12,\omega}$ while enforcing the Petrov–Galerkin projection condition $W_{k}^{*}V_{k}=I$ does not meet the conditions $W_{k}^{*}F_{\omega}B=F_{k,\omega}B_{k}$, $CF_{\omega}V_{k}=C_{k}F_{k,\omega}$, and $V_{k}^{*}M_{i}F_{\omega}V_{k}=M_{k,i}F_{k,\omega}$ required by Theorem 3.5. Consequently, the condition (31) is not satisfied, and therefore the optimality conditions (22)-(24) are not fulfilled.

3.5 Algorithm

So far, for simplicity, we have considered $V_{k}$ and $W_{k}$ as complex matrices in the problem under consideration. However, in practice, using complex projection matrices results in a complex ROM, which is undesirable since most practical dynamical systems are represented by real mathematical models. This issue can be addressed by extending the desired frequency interval to include negative frequencies, i.e., $[-\omega,0]$ rad/sec. Additionally, we have assumed that the desired frequency interval starts from $0$ rad/sec for simplicity. For any general frequency interval $[-\omega_{2},-\omega_{1}]\cup[\omega_{1},\omega_{2}]$ rad/sec, the only modification needed is in the computation of $F_{\omega}$ and $F_{k,\omega}$ given by:

	$\displaystyle F_{\omega}$	$\displaystyle=Re\Big{(}\frac{j}{\pi}ln\big{(}(j\omega_{1}I+A)^{-1}(j\omega_{2}I+A)\big{)}\Big{)},$		(32)
	$\displaystyle F_{k,\omega}$	$\displaystyle=Re\Big{(}\frac{j}{\pi}ln\big{(}(j\omega_{1}I+A_{k})^{-1}(j\omega_{2}I+A_{k})\big{)}\Big{)};$		(33)

see [48] for more details.
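A minimal sketch of how (32) and (33) might be evaluated with a dense matrix logarithm is given below. It assumes that $A$ is stable and small enough for `scipy.linalg.logm`, and that the principal branch of the logarithm is the intended one. The function name `freq_limited_F` is hypothetical.

```python
import numpy as np
from scipy.linalg import logm

def freq_limited_F(A, w1, w2):
    """Evaluate Re( (j/pi) ln( (j*w1*I + A)^{-1} (j*w2*I + A) ) ) as in (32)."""
    I = np.eye(A.shape[0])
    # (j*w1*I + A)^{-1} (j*w2*I + A), formed with a linear solve.
    ratio = np.linalg.solve(1j * w1 * I + A, 1j * w2 * I + A)
    return np.real(1j / np.pi * logm(ratio))

# Hypothetical diagonal test matrix with the interval [5, 6] rad/sec.
A = np.diag([-1.0, -3.0])
F = freq_limited_F(A, 5.0, 6.0)
print(F)
```

For a scalar $A=-a$ with $a>0$, the formula reduces to $\big(\arctan(\omega_{2}/a)-\arctan(\omega_{1}/a)\big)/\pi$, which gives a simple check of the diagonal entries printed above. The same function applied to $A_{k}$ yields $F_{k,\omega}$ in (33).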

We are now ready to present the algorithm, referred to in this paper as the “Frequency-limited $\mathcal{H}_{2}$ Near-optimal Iterative Algorithm (FLHNOIA)”. The algorithm begins with an arbitrary initial guess for the ROM and iteratively updates it until convergence is achieved. Convergence is quantified by the stagnation in the relative change of the state-space matrices of the ROM. In each iteration, Steps (4) and (5) calculate the projection matrices, while Steps (6)-(10) bi-orthogonalize these matrices using the bi-orthogonal Gram–Schmidt method to ensure that $W_{k}^{T}V_{k}=I$ .

Algorithm 1 FLHNOIA

Input: Full order system: $(A,B,C,M_{1},\cdots,M_{p})$ ; Desired frequency interval: $[\omega_{1},\omega_{2}]$ rad/sec; Initial guess of ROM: $(A_{k},B_{k},C_{k},M_{k,1},\cdots,M_{k,p})$ ; Tolerance: $tol$ . Output: ROM: $(A_{k},B_{k},C_{k},M_{k,1},\cdots,M_{k,p})$ .

1: Compute $F_{\omega}$ from (32).

2: while (relative change in $(A_{k},B_{k},C_{k},M_{k,1},\cdots,M_{k,p})$ $>tol$)

3: Compute $F_{k,\omega}$ from (33).

4: Solve equations (9)-(14) to compute $P_{12,\omega}$, $P_{k,\omega}$, $Y_{12,\omega}$, $Y_{k,\omega}$, $Z_{12,\omega}$, and $Z_{k,\omega}$.

5: Set $V_{k}=P_{12,\omega}P_{k,\omega}^{-1}$ and $W_{k}=(Y_{12,\omega}+2Z_{12,\omega})(Y_{k,\omega}+2Z_{k,\omega})^{-1}$.

6: for $l=1,\ldots,k$

7: $v=V_{k}(:,l)$, $v=\prod_{j=1}^{l}\big{(}I-V_{k}(:,j)W_{k}(:,j)^{T}\big{)}v$.

8: $w=W_{k}(:,l)$, $w=\prod_{j=1}^{l}\big{(}I-W_{k}(:,j)V_{k}(:,j)^{T}\big{)}w$.

9: $v=\frac{v}{||v||_{2}}$, $w=\frac{w}{||w||_{2}}$, $v=\frac{v}{w^{T}v}$, $V_{k}(:,l)=v$, $W_{k}(:,l)=w$.

10: end for

11: Update the ROM as $A_{k}=W_{k}^{T}AV_{k}$, $B_{k}=W_{k}^{T}B$, $C_{k}=CV_{k}$, $M_{k,i}=V_{k}^{T}M_{i}V_{k}$.

12: end while
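The bi-orthogonalization in Steps (6)-(10) and the projection in Step (11) can be sketched in Python as follows. The inner loop below runs over the columns that have already been processed, which is one reading of the oblique projectors in Steps (7) and (8); that reading, the function names, and the random test data are assumptions made for illustration. Real-valued $V_{k}$ and $W_{k}$ are assumed.

```python
import numpy as np

def biorthogonalize(V, W):
    """Two-sided Gram-Schmidt: return (V, W) with W^T V = I and unchanged column spans."""
    V = np.array(V, dtype=float, copy=True)
    W = np.array(W, dtype=float, copy=True)
    for l in range(V.shape[1]):
        v, w = V[:, l].copy(), W[:, l].copy()
        for j in range(l):
            # Oblique projections against the columns processed so far.
            v = v - V[:, j] * (W[:, j] @ v)
            w = w - W[:, j] * (V[:, j] @ w)
        v = v / np.linalg.norm(v)
        w = w / np.linalg.norm(w)
        v = v / (w @ v)          # rescale so that w^T v = 1 (breaks down if w^T v = 0)
        V[:, l], W[:, l] = v, w
    return V, W

def project(A, B, C, Ms, V, W):
    """Step (11): Petrov-Galerkin projection of (A, B, C, M_1..M_p)."""
    return W.T @ A @ V, W.T @ B, C @ V, [V.T @ M @ V for M in Ms]

# Hypothetical data: random bases for a sixth-order system reduced to order three.
rng = np.random.default_rng(0)
V0 = rng.standard_normal((6, 3))
W0 = V0 + 0.1 * rng.standard_normal((6, 3))
Vb, Wb = biorthogonalize(V0, W0)
print(np.round(Wb.T @ Vb, 12))
```

The printed matrix should be the identity up to rounding, which is the condition $W_{k}^{T}V_{k}=I$ required before the ROM is updated.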

Remark 1.

For evaluating convergence, observing the stagnation of the ROM poles provides a more dependable measure compared to analyzing state-space realizations. This is because $\mathcal{H}_{2}$ -optimal MOR techniques often produce ROMs with varied state-space representations but identical transfer functions. Therefore, the stagnation of ROM poles is commonly used as a convergence criterion in $\mathcal{H}_{2}$ -optimal MOR algorithms, due to its proven effectiveness [44].
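A pole-based stopping test along the lines of Remark 1 could look like the following sketch. The function names, the rounding used to make the eigenvalue ordering repeatable, and the test matrices are assumptions introduced here, not part of the algorithm in [44].

```python
import numpy as np

def sorted_poles(A_k, decimals=8):
    """Eigenvalues of A_k sorted by (rounded) real part, then imaginary part."""
    ev = np.linalg.eigvals(A_k).astype(complex)
    # np.lexsort uses the last key as the primary key.
    order = np.lexsort((np.round(ev.imag, decimals), np.round(ev.real, decimals)))
    return ev[order]

def relative_pole_change(A_old, A_new):
    """Relative change in the ROM poles between two iterates."""
    p_old, p_new = sorted_poles(A_old), sorted_poles(A_new)
    return np.linalg.norm(p_new - p_old) / np.linalg.norm(p_old)

# Two realizations related by a similarity transformation share the same poles.
A1 = np.array([[-1.0, 2.0, 0.0], [0.0, -2.0, 1.0], [0.0, 0.0, -3.0]])
T = np.array([[1.0, 2.0, 0.0], [0.0, 1.0, 1.0], [1.0, 0.0, 1.0]])
A2 = T @ A1 @ np.linalg.inv(T)
print(relative_pole_change(A1, A2), np.linalg.norm(A2 - A1) / np.linalg.norm(A1))
```

Here the state-space matrices differ noticeably while the pole change is at rounding level, which is the situation the remark describes.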

3.6 Computational Aspects

In this subsection, we briefly discuss the efficient implementation of FLHNOIA. Step (1) of FLHNOIA involves the computation of the matrix logarithm $F_{\omega}$ , which can become computationally expensive when the order $n$ of the original model is large. In such cases, Krylov subspace-based methods proposed in [38] can be utilized to approximate $F_{\omega}B$ , $CF_{\omega}$ , and $MF_{\omega}$ . The most computationally intensive task in each iteration is solving the Sylvester equations (9), (11), and (13). The state-space matrices of most high-order dynamical systems are sparse, making these equations “sparse-dense” Sylvester equations, which are commonly encountered in $\mathcal{H}_{2}$ -optimal MOR algorithms. A “sparse-dense” Sylvester equation typically has the structure:

$\displaystyle\mathcal{A}\mathcal{B}+\mathcal{B}\mathcal{C}+\mathcal{D}\mathcal{E}=0,$

where the large matrices $\mathcal{A}\in\mathbb{R}^{n\times n}$ and $\mathcal{D}\in\mathbb{R}^{n\times d}$ ( $d\ll n$ ) are sparse, while the smaller matrices $\mathcal{C}\in\mathbb{R}^{r\times r}$ and $\mathcal{E}\in\mathbb{R}^{d\times r}$ are dense. An efficient algorithm for solving this type of Sylvester equation is proposed in [30]. The remaining steps in FLHNOIA involve basic matrix computations and the solution of simple Lyapunov equations, which can be executed with minimal computational cost.
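One common way to exploit the sparse-dense structure is to diagonalize the small dense matrix so that the equation decouples into $r$ shifted sparse linear systems. The sketch below illustrates this idea; it is not claimed to be the algorithm of [30]. It assumes that the small matrix is diagonalizable and that all shifted systems are nonsingular. The function name and the tridiagonal test matrix are hypothetical.

```python
import numpy as np
import scipy.sparse as sp
from scipy.sparse.linalg import spsolve

def sparse_dense_sylvester(A, C, D, E):
    """Solve A X + X C + D E = 0 for X (n x r), with A, D sparse and C, E small and dense.

    Assumes C is diagonalizable and no eigenvalue of A equals the negative
    of an eigenvalue of C, so every shifted system below is nonsingular.
    """
    n = A.shape[0]
    lam, S = np.linalg.eig(C)                  # C = S diag(lam) S^{-1}
    rhs = -(D @ (E @ S))                       # right-hand side in the eigenbasis of C
    I = sp.identity(n, dtype=complex, format="csc")
    X_t = np.empty((n, lam.size), dtype=complex)
    for i in range(lam.size):
        # Column i decouples: (A + lam_i I) x_i = rhs_i, one sparse solve each.
        X_t[:, i] = spsolve((A + lam[i] * I).tocsc(), rhs[:, i])
    X = np.linalg.solve(S.T, X_t.T).T          # X = X_t S^{-1}
    all_real = all(M.dtype.kind != "c" for M in (A, C, D, E))
    return X.real if all_real else X

# Hypothetical test problem: tridiagonal A (n = 50), r = 2, d = 1.
n = 50
A = sp.diags([1.0, -2.0, 1.0], [-1, 0, 1], shape=(n, n), format="csc")
C = np.array([[-1.0, 0.5], [0.2, -2.0]])
D = sp.csc_matrix(([1.0], ([0], [0])), shape=(n, 1))
E = np.array([[1.0, 2.0]])
X = sparse_dense_sylvester(A, C, D, E)
print(np.linalg.norm(A @ X + X @ C + D @ E))
```

The cost is dominated by $r$ sparse solves with the large matrix, which is why the small dimension of the ROM keeps each iteration affordable.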

4 Illustrative Example

This section provides an illustrative example to validate the key properties of FLHNOIA. Consider a sixth-order LQO system defined by the following state-space representation:

	$\displaystyle A$	$\displaystyle=\begin{bmatrix}-9&-29&-100&-82&-19&-2\\ 1&0&0&0&0&0\\ 0&1&0&0&0&0\\ 0&0&1&0&0&0\\ 0&0&0&1&0&0\\ 0&0&0&0&1&0\end{bmatrix},$
	$\displaystyle B$	$\displaystyle=\begin{bmatrix}1&0&0&0&0&0\end{bmatrix}^{T},$
	$\displaystyle C$	$\displaystyle=\begin{bmatrix}0&0&0&0&-1&1\end{bmatrix},$
	$\displaystyle M_{1}$	$\displaystyle=diag(0.7,0.4,0.1,0.1,0.1,0.1).$

The desired frequency range for this example is $[5,6]$ rad/sec. To initialize FLHNOIA, the following initial guess is employed:

	$\displaystyle A_{k}$	$\displaystyle=\begin{bmatrix}-0.0340&-0.1400&0.0124\\ 0.1400&-0.1579&0.1424\\ 0.0484&-0.2275&-0.1438\end{bmatrix},$
	$\displaystyle B_{k}$	$\displaystyle=\begin{bmatrix}-0.1592&0.2076&0.1170\end{bmatrix}^{T},$
	$\displaystyle C_{k}$	$\displaystyle=\begin{bmatrix}-0.1592&-0.2076&0.0433\end{bmatrix},$
	$\displaystyle M_{k,1}$	$\displaystyle=\begin{bmatrix}0.0030&0.0029&-0.0004\\ 0.0029&0.0032&0.0044\\ -0.0004&0.0044&0.1763\end{bmatrix}.$

FLHNOIA was terminated once the eigenvalues of $A_{k}$ stagnated, because the ROM’s state-space realization continued to change between iterations (cf. Remark 1). The resulting final ROM is:

	$\displaystyle A_{k}$	$\displaystyle=\begin{bmatrix}-2.0431&-1.5758&1.6738\\ -7.9694&-5.4214&1.1098\\ -11.0229&-0.8018&-0.4171\end{bmatrix},$
	$\displaystyle B_{k}$	$\displaystyle=\begin{bmatrix}0.0000&0.0353&0.0366\end{bmatrix}^{T},$
	$\displaystyle C_{k}$	$\displaystyle=\begin{bmatrix}0.2714&0.0407&-0.0183\end{bmatrix},$
	$\displaystyle M_{k,1}$	$\displaystyle=\begin{bmatrix}68.5264&56.9059&-52.7212\\ 56.9059&3244.3314&-1886.9907\\ -52.7212&-1886.9907&1103.3966\end{bmatrix}.$

The numerical results below confirm that this ROM effectively satisfies the optimality conditions (22)-(24):

	$\displaystyle\|-P_{12,\omega}^{T}M_{i}P_{12,\omega}+P_{k,\omega}M_{k,i}P_{k,\omega}\|_{2}$	$\displaystyle=1.5960\times 10^{-11},$
	$\displaystyle\|-(Y_{12,\omega}+2Z_{12,\omega})^{T}B+(Y_{k,\omega}+2Z_{k,\omega})B_{k}\|_{2}$	$\displaystyle=2.3583\times 10^{-6},$
	$\displaystyle\|-CP_{12,\omega}+C_{k}P_{k,\omega}\|_{2}$	$\displaystyle=1.3523\times 10^{-9}.$

Next, a third-order ROM is generated using BT, FLBT, and HOMORA, with the same initial ROM used to initialize HOMORA. Figures 1 and 2 display the relative error on a logarithmic scale within the specified frequency range of $5$ to $6$ rad/sec. As illustrated, FLBT and FLHNOIA exhibit superior accuracy.

Figure 1: Relative Error $\frac{|G_{1}(j\nu)-G_{k,1}(j\nu)|}{|G_{1}(j\nu)|}$ within $[5,6]$ rad/sec

5 Conclusion

This research addresses the problem of $\mathcal{H}_{2}$ -optimal MOR within a specified finite frequency range. To measure the output strength within this range, we introduce the frequency-limited $\mathcal{H}_{2}$ norm for LQO systems. We derive the necessary conditions for achieving local optima of the squared frequency-limited $\mathcal{H}_{2}$ norm of the error and compare these conditions to those of the standard, unconstrained $\mathcal{H}_{2}$ -optimal MOR problem. The study highlights the limitations of the Petrov-Galerkin projection method in satisfying all optimality conditions in the frequency-limited context. As a result, we propose a Petrov-Galerkin projection algorithm that meets three out of the four optimality conditions. Numerical experiments are conducted to validate the theoretical results and demonstrate the algorithm’s effectiveness in achieving high accuracy within the specified frequency range.

Appendix A

In this appendix, we present the proof of Theorem 3.4. Throughout the proof, the following properties of the trace operation are utilized repeatedly:

1. Trace of Hermitian: $trace(XYZ)=trace(Z^{*}Y^{*}X^{*})$.
2. Circular permutation in trace: $trace(XYZ)=trace(ZXY)=trace(YZX)$.
3. Trace of addition: $trace(X+Y+Z)=trace(X)+trace(Y)+trace(Z)$;

cf. [49].

Let us define the cost function $J$ as the component of $||E||_{\mathcal{H}_{2,\omega}}^{2}$ that depends on the ROM, expressed as:

$\displaystyle J=trace(-2B^{T}Q_{12,\omega}B_{k}+B_{k}^{T}Q_{k,\omega}B_{k}).$

When a small first-order perturbation $\Delta_{A_{k}}$ is added to $A_{k}$, $J$ changes to $J+\Delta_{J}^{A_{k}}$. This causes $Q_{12,\omega}$ and $Q_{k,\omega}$ to perturb to $Q_{12,\omega}+\Delta_{Q_{12,\omega}}^{A_{k}}$ and $Q_{k,\omega}+\Delta_{Q_{k,\omega}}^{A_{k}}$, respectively. Consequently, the first-order terms of $\Delta_{J}^{A_{k}}$ are given by:

$\displaystyle\Delta_{J}^{A_{k}}=trace(-2B^{T}\Delta_{Q_{12,\omega}}^{A_{k}}B_{k}+B_{k}^{T}\Delta_{Q_{k,\omega}}^{A_{k}}B_{k}).$

Furthermore, it is evident from (15) and (16) that $\Delta_{Q_{12,\omega}}^{A_{k}}$ and $\Delta_{Q_{k,\omega}}^{A_{k}}$ satisfy the following Lyapunov equations:

	$\displaystyle A^{T}\Delta_{Q_{12,\omega}}^{A_{k}}+\Delta_{Q_{12,\omega}}^{A_{k}}A_{k}+Q_{12,\omega}\Delta_{A_{k}}+C^{T}C_{k}\Delta_{F_{k,\omega}}^{A_{k}}+\sum_{i=1}^{p}\big{(}M_{i}P_{12,\omega}M_{k,i}\Delta_{F_{k,\omega}}^{A_{k}}$
	$\displaystyle\hskip 128.0374pt+F_{\omega}^{*}M_{i}\Delta_{P_{12,\omega}}^{A_{k}}M_{k,i}+M_{i}\Delta_{P_{12,\omega}}^{A_{k}}M_{k,i}F_{k,\omega}\big{)}=0,$
	$\displaystyle\hskip 81.09052ptA\Delta_{P_{12,\omega}}^{A_{k}}+\Delta_{P_{12,\omega}}^{A_{k}}A_{k}^{T}+P_{12,\omega}(\Delta_{A_{k}})^{T}+BB_{k}^{T}(\Delta_{F_{k,\omega}}^{A_{k}})^{*}=0,$
	$\displaystyle A_{k}^{T}\Delta_{Q_{k,\omega}}^{A_{k}}+\Delta_{Q_{k,\omega}}^{A_{k}}A_{k}+(\Delta_{A_{k}})^{T}Q_{k,\omega}+Q_{k,\omega}\Delta_{A_{k}}$
	$\displaystyle\hskip 18.49411pt+(\Delta_{F_{k,\omega}}^{A_{k}})^{*}C_{k}^{T}C_{k}+C_{k}^{T}C_{k}\Delta_{F_{k,\omega}}^{A_{k}}+\sum_{i=1}^{p}\big{(}(\Delta_{F_{k,\omega}}^{A_{k}})^{*}M_{k,i}P_{k,\omega}M_{k,i}$
	$\displaystyle\hskip 18.49411pt+M_{k,i}P_{k,\omega}M_{k,i}\Delta_{F_{k,\omega}}^{A_{k}}+F_{k,\omega}^{*}M_{k,i}\Delta_{P_{k,\omega}}^{A_{k}}M_{k,i}+M_{k,i}\Delta_{P_{k,\omega}}^{A_{k}}M_{k,i}F_{k,\omega}\big{)}=0,$
	$\displaystyle A_{k}\Delta_{P_{k,\omega}}^{A_{k}}+\Delta_{P_{k,\omega}}^{A_{k}}A_{k}^{T}+\Delta_{A_{k}}P_{k,\omega}+P_{k,\omega}(\Delta_{A_{k}})^{T}$
	$\displaystyle\hskip 170.71652pt+\Delta_{F_{k,\omega}}^{A_{k}}B_{k}B_{k}^{T}+B_{k}B_{k}^{T}(\Delta_{F_{k,\omega}}^{A_{k}})^{*}=0,$

where

$\displaystyle\Delta_{F_{k,\omega}}^{A_{k}}=\frac{j}{\pi}\mathcal{L}(-j\nu I-A_{k},\Delta_{A_{k}})+o(||\Delta_{A_{k}}||);$

cf. [47]. Since we are only concerned with first-order perturbations, the term $o(||\Delta_{A_{k}}||)$ will be omitted for the rest of the proof. Now,

	$\displaystyle trace\Big{(}BB_{k}^{T}(\Delta_{Q_{12,\omega}}^{A_{k}})^{*}\Big{)}$
	$\displaystyle=trace\Big{(}(-AP_{12}-P_{12}A_{k}^{T})(\Delta_{Q_{12,\omega}}^{A_{k}})^{*}\Big{)}$
	$\displaystyle=trace\Big{(}-AP_{12}(\Delta_{Q_{12,\omega}}^{A_{k}})^{*}-P_{12}A_{k}^{T}(\Delta_{Q_{12,\omega}}^{A_{k}})^{*}\Big{)}$
	$\displaystyle=trace\Big{(}P_{12}^{T}(-A^{T}\Delta_{Q_{12,\omega}}^{A_{k}}-\Delta_{Q_{12,\omega}}^{A_{k}}A_{k})\Big{)}$
	$\displaystyle=trace\Big{(}P_{12}^{T}Q_{12,\omega}\Delta_{A_{k}}+P_{12}^{T}C^{T}C_{k}\Delta_{F_{k,\omega}}^{A_{k}}+\sum_{i=1}^{p}\big{(}P_{12}^{T}M_{i}P_{12,\omega}M_{k,i}\Delta_{F_{k,\omega}}^{A_{k}}$
	$\displaystyle\hskip 56.9055pt+P_{12}^{T}F_{\omega}^{*}M_{i}\Delta_{P_{12,\omega}}^{A_{k}}M_{k,i}+F_{k,\omega}P_{12}^{T}M_{i}\Delta_{P_{12,\omega}}^{A_{k}}M_{k,i}\big{)}\Big{)}$
	$\displaystyle=trace\Big{(}Q_{12,\omega}^{*}P_{12}\Delta_{A_{k}}^{T}+P_{12}^{T}C^{T}C_{k}\Delta_{F_{k,\omega}}^{A_{k}}+\sum_{i=1}^{p}\big{(}P_{12}^{T}M_{i}P_{12,\omega}M_{k,i}\Delta_{F_{k,\omega}}^{A_{k}}$
	$\displaystyle\hskip 56.9055pt+P_{12}^{T}F_{\omega}^{*}M_{i}\Delta_{P_{12,\omega}}^{A_{k}}M_{k,i}+F_{k,\omega}P_{12}^{T}M_{i}\Delta_{P_{12,\omega}}^{A_{k}}M_{k,i}\big{)}\Big{)}.$

Similarly, note that:

	$\displaystyle trace(B_{k}B_{k}^{T}\Delta_{Q_{k,\omega}}^{A_{k}})$
	$\displaystyle=trace\Big{(}\big{(}-A_{k}P_{k}-P_{k}A_{k}^{T}\big{)}\Delta_{Q_{k,\omega}}^{A_{k}}\Big{)}$
	$\displaystyle=trace\Big{(}P_{k}\big{(}-A_{k}^{T}\Delta_{Q_{k,\omega}}^{A_{k}}-\Delta_{Q_{k,\omega}}^{A_{k}}A_{k}\big{)}\Big{)}$
	$\displaystyle=trace\Bigg{(}P_{k}\Big{(}(\Delta_{A_{k}})^{T}Q_{k,\omega}+Q_{k,\omega}\Delta_{A_{k}}+(\Delta_{F_{k,\omega}}^{A_{k}})^{*}C_{k}^{T}C_{k}+C_{k}^{T}C_{k}\Delta_{F_{k,\omega}}^{A_{k}}$
	$\displaystyle\hskip 56.9055pt+\sum_{i=1}^{p}\big{(}(\Delta_{F_{k,\omega}}^{A_{k}})^{*}M_{k,i}P_{k,\omega}M_{k,i}+M_{k,i}P_{k,\omega}M_{k,i}\Delta_{F_{k,\omega}}^{A_{k}}$
	$\displaystyle\hskip 56.9055pt+F_{k,\omega}^{*}M_{k,i}\Delta_{P_{k,\omega}}^{A_{k}}M_{k,i}+M_{k,i}\Delta_{P_{k,\omega}}^{A_{k}}M_{k,i}F_{k,\omega}\big{)}\Big{)}\Bigg{)}$
	$\displaystyle=trace\Big{(}2Q_{k,\omega}P_{k}(\Delta_{A_{k}})^{T}+2(\Delta_{F_{k,\omega}}^{A_{k}})^{*}C_{k}^{T}C_{k}P_{k}$
	$\displaystyle\hskip 56.9055pt+\sum_{i=1}^{p}\big{(}2(\Delta_{F_{k,\omega}}^{A_{k}})^{*}M_{k,i}P_{k,\omega}M_{k,i}P_{k}+M_{k,i}F_{k,\omega}P_{k}M_{k,i}\Delta_{P_{k,\omega}}^{A_{k}}$
	$\displaystyle\hskip 56.9055pt+M_{k,i}P_{k}F_{k,\omega}^{*}M_{k,i}\Delta_{P_{k,\omega}}^{A_{k}}\big{)}\Big{)}.$

Therefore:

	$\displaystyle\Delta_{J}^{A_{k}}=trace\Big{(}-2(Q_{12,\omega})^{*}P_{12}(\Delta_{A_{k}})^{T}+2Q_{k,\omega}P_{k}(\Delta_{A_{k}})^{T}$
	$\displaystyle\hskip 56.9055pt-2(\Delta_{F_{k,\omega}}^{A_{k}})^{*}C_{k}^{T}CP_{12}+2(\Delta_{F_{k,\omega}}^{A_{k}})^{*}C_{k}^{T}C_{k}P_{k}$
	$\displaystyle\hskip 56.9055pt+\sum_{i=1}^{p}\big{(}-2M_{i}F_{\omega}P_{12}M_{k,i}(\Delta_{P_{12,\omega}}^{A_{k}})^{*}-2M_{i}P_{12}F_{k,\omega}^{*}M_{k,i}(\Delta_{P_{12,\omega}}^{A_{k}})^{*}$
	$\displaystyle\hskip 56.9055pt+M_{k,i}F_{k,\omega}P_{k}M_{k,i}\Delta_{P_{k,\omega}}^{A_{k}}+M_{k,i}P_{k}F_{k,\omega}^{*}M_{k,i}\Delta_{P_{k,\omega}}^{A_{k}}$
	$\displaystyle\hskip 56.9055pt-2(\Delta_{F_{k,\omega}}^{A_{k}})^{*}M_{k,i}(P_{12,\omega})^{*}M_{i}P_{12}+2(\Delta_{F_{k,\omega}}^{A_{k}})^{*}M_{k,i}P_{k,\omega}M_{k,i}P_{k}\big{)}\Big{)}.$

Since

	$\displaystyle P_{12,\omega}=F_{\omega}P_{12}+P_{12}F_{k,\omega}^{*},$
	$\displaystyle P_{k,\omega}=F_{k,\omega}P_{k}+P_{k}F_{k,\omega}^{*},$

we have:

	$\displaystyle\Delta_{J}^{A_{k}}=trace\Big{(}-2(Q_{12,\omega})^{*}P_{12}(\Delta_{A_{k}})^{T}+2Q_{k,\omega}P_{k}(\Delta_{A_{k}})^{T}$
	$\displaystyle\hskip 28.45274pt-2(\Delta_{F_{k,\omega}}^{A_{k}})^{*}C_{k}^{T}CP_{12}+2(\Delta_{F_{k,\omega}}^{A_{k}})^{*}C_{k}^{T}C_{k}P_{k}$
	$\displaystyle\hskip 28.45274pt+\sum_{i=1}^{p}\big{(}-2M_{i}P_{12,\omega}M_{k,i}(\Delta_{P_{12,\omega}}^{A_{k}})^{*}+M_{k,i}P_{k,\omega}M_{k,i}\Delta_{P_{k,\omega}}^{A_{k}}$
	$\displaystyle\hskip 28.45274pt-2(\Delta_{F_{k,\omega}}^{A_{k}})^{*}M_{k,i}(P_{12,\omega})^{*}M_{i}P_{12}+2(\Delta_{F_{k,\omega}}^{A_{k}})^{*}M_{k,i}P_{k,\omega}M_{k,i}P_{k}\big{)}\Big{)};$

cf. [50]. Note that

	$\displaystyle trace\Big{(}\sum_{i=1}^{p}M_{i}P_{12,\omega}M_{k,i}(\Delta_{P_{12,\omega}}^{A_{k}})^{*}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}-A^{T}\bar{Z}_{12}-\bar{Z}_{12}A_{k}\big{)}(\Delta_{P_{12,\omega}}^{A_{k}})^{*}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}-A\Delta_{P_{12,\omega}}^{A_{k}}-\Delta_{P_{12,\omega}}^{A_{k}}A_{k}^{T}\big{)}\bar{Z}_{12}^{*}\Big{)}$
	$\displaystyle=trace\big{(}\bar{Z}_{12}^{*}P_{12,\omega}(\Delta_{A_{k}})^{T}+\bar{Z}_{12}^{*}BB_{k}^{T}(\Delta_{F_{k,\omega}}^{A_{k}})^{*}\big{)}.$

Additionally, note that

	$\displaystyle trace\Big{(}\sum_{i=1}^{p}M_{k,i}P_{k,\omega}M_{k,i}\Delta_{P_{k,\omega}}^{A_{k}}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}-A_{k}^{T}\bar{Z}_{k}-\bar{Z}_{k}A_{k}\big{)}\Delta_{P_{k,\omega}}^{A_{k}}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}-A_{k}\Delta_{P_{k,\omega}}^{A_{k}}-\Delta_{P_{k,\omega}}^{A_{k}}A_{k}^{T}\big{)}\bar{Z}_{k}\Big{)}$
	$\displaystyle=2trace\big{(}\bar{Z}_{k}P_{k,\omega}(\Delta_{A_{k}})^{T}+\bar{Z}_{k}B_{k}B_{k}^{T}(\Delta_{F_{k,\omega}}^{A_{k}})^{*}\big{)}.$

Thus,

	$\displaystyle\Delta_{J}^{A_{k}}$	$\displaystyle=trace\Big{(}-2(Q_{12,\omega})^{*}P_{12}(\Delta_{A_{k}})^{T}+2Q_{k,\omega}P_{k}(\Delta_{A_{k}})^{T}$
		$\displaystyle\hskip 56.9055pt-2\bar{Z}_{12}^{*}P_{12,\omega}(\Delta_{A_{k}})^{T}+2\bar{Z}_{k}P_{k,\omega}(\Delta_{A_{k}})^{T}$
		$\displaystyle\hskip 56.9055pt-2B_{k}B^{T}\bar{Z}_{12}\Delta_{F_{k,\omega}}^{A_{k}}+2B_{k}B_{k}^{T}\bar{Z}_{k}\Delta_{F_{k,\omega}}^{A_{k}}$
		$\displaystyle\hskip 56.9055pt-2P_{12}^{T}C^{T}C_{k}\Delta_{F_{k,\omega}}^{A_{k}}+2P_{k}C_{k}^{T}C_{k}\Delta_{F_{k,\omega}}^{A_{k}}$
		$\displaystyle\hskip 56.9055pt-2\sum_{i=1}^{p}P_{12}^{T}M_{i}P_{12,\omega}M_{k,i}\Delta_{F_{k,\omega}}^{A_{k}}$
		$\displaystyle\hskip 56.9055pt+2\sum_{i=1}^{p}P_{k}M_{k,i}P_{k,\omega}M_{k,i}\Delta_{F_{k,\omega}}^{A_{k}}\Big{)}$
		$\displaystyle=trace\Big{(}-2(Q_{12,\omega})^{*}P_{12}(\Delta_{A_{k}})^{T}+2Q_{k,\omega}P_{k}(\Delta_{A_{k}})^{T}$
		$\displaystyle\hskip 56.9055pt-2\bar{Z}_{12}^{*}P_{12,\omega}(\Delta_{A_{k}})^{T}+2\bar{Z}_{k}P_{k,\omega}(\Delta_{A_{k}})^{T}-2V\Delta_{F_{k,\omega}}^{A_{k}}\Big{)}.$

Recall that

$\displaystyle\Delta_{F_{k,\omega}}^{A_{k}}=\frac{j}{2\pi}\mathcal{L}(-A_{k}-j\nu I,-\Delta_{A_{k}}).$

By exchanging the trace and integral operations, we obtain

$\displaystyle trace(V\Delta_{F_{k,\omega}}^{A_{k}})=-trace(W\Delta_{A_{k}});$

cf. [48]. Hence,

$\displaystyle\Delta_{J}^{A_{k}}=2trace\Big{(}\big{(}-Q_{12,\omega}^{*}P_{12}+Q_{k,\omega}P_{k}-\bar{Z}_{12}^{*}P_{12,\omega}+\bar{Z}_{k}P_{k,\omega}+W^{*}\big{)}(\Delta_{A_{k}})^{T}\Big{)}.$

Therefore, the gradient of $J$ with respect to $A_{k}$ is given by

$\displaystyle\nabla_{J}^{A_{k}}=2\big{(}-Q_{12,\omega}^{*}P_{12}+Q_{k,\omega}P_{k}-\bar{Z}_{12}^{*}P_{12,\omega}+\bar{Z}_{k}P_{k,\omega}+W^{*}\big{)};$

cf. [47]. Consequently,

$\displaystyle-Q_{12,\omega}^{*}P_{12}+Q_{k,\omega}P_{k}-\bar{Z}_{12}^{*}P_{12,\omega}+\bar{Z}_{k}P_{k,\omega}+W^{*}=0$	(34)

is a necessary condition for a local optimum of $||E||_{\mathcal{H}_{2,\omega}}^{2}$. Moreover, substituting (17)-(20) into (34) simplifies it to

$\displaystyle-Q_{12,\omega}^{*}P_{12,\omega}+Q_{k,\omega}P_{k,\omega}-Z_{12,\omega}^{*}P_{12,\omega}+Z_{k,\omega}P_{k,\omega}+L_{\omega}=0.$

Additionally, since $Q_{12,\omega}=Y_{12,\omega}+Z_{12,\omega}$ and $Q_{k,\omega}=Y_{k,\omega}+Z_{k,\omega}$, we arrive at

$\displaystyle-(Y_{12,\omega}+2Z_{12,\omega})^{*}P_{12,\omega}+(Y_{k,\omega}+2Z_{k,\omega})P_{k,\omega}+L_{\omega}=0.$

By introducing a small first-order perturbation $\Delta_{M_{k,i}}$ to $M_{k,i}$ , $J$ is perturbed to $J+\Delta_{J}^{M_{k,i}}$ . Consequently, $Q_{12,\omega}$ and $Q_{k,\omega}$ are perturbed to $Q_{12,\omega}+\Delta_{Q_{12,\omega}}^{M_{k,i}}$ and $Q_{k,\omega}+\Delta_{Q_{k,\omega}}^{M_{k,i}}$ , respectively. As a result, the first-order terms of $\Delta_{J}^{M_{k,i}}$ are given by

$\displaystyle\Delta_{J}^{M_{k,i}}=trace(-2B^{T}\Delta_{Q_{12,\omega}}^{M_{k,i}}B_{k}+B_{k}^{T}\Delta_{Q_{k,\omega}}^{M_{k,i}}B_{k}).$

Furthermore, it can be easily observed from (15) and (16) that $\Delta_{Q_{12,\omega}}^{M_{k,i}}$ and $\Delta_{Q_{k,\omega}}^{M_{k,i}}$ satisfy the following Lyapunov equations:

	$\displaystyle A^{T}\Delta_{Q_{12,\omega}}^{M_{k,i}}+\Delta_{Q_{12,\omega}}^{M_{k,i}}A_{k}+F_{\omega}^{*}M_{i}P_{12,\omega}\Delta_{M_{k,i}}+M_{i}P_{12,\omega}\Delta_{M_{k,i}}F_{k,\omega}=0,$
	$\displaystyle A_{k}^{T}\Delta_{Q_{k,\omega}}^{M_{k,i}}+\Delta_{Q_{k,\omega}}^{M_{k,i}}A_{k}+F_{k,\omega}^{*}\Delta_{M_{k,i}}P_{k,\omega}M_{k,i}+\Delta_{M_{k,i}}P_{k,\omega}M_{k,i}F_{k,\omega}$
	$\displaystyle\hskip 85.35826pt+F_{k,\omega}^{*}M_{k,i}P_{k,\omega}\Delta_{M_{k,i}}+M_{k,i}P_{k,\omega}\Delta_{M_{k,i}}F_{k,\omega}=0.$

Observe that

	$\displaystyle trace(B^{T}\Delta_{Q_{12,\omega}}^{M_{k,i}}B_{k})$
	$\displaystyle=trace\big{(}BB_{k}^{T}(\Delta_{Q_{12,\omega}}^{M_{k,i}})^{*}\big{)}$
	$\displaystyle=trace\Big{(}\big{(}-AP_{12}-P_{12}A_{k}^{T}\big{)}(\Delta_{Q_{12,\omega}}^{M_{k,i}})^{*}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}-A^{T}\Delta_{Q_{12,\omega}}^{M_{k,i}}-\Delta_{Q_{12,\omega}}^{M_{k,i}}A_{k}\big{)}P_{12}^{T}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}F_{\omega}^{*}M_{i}P_{12,\omega}\Delta_{M_{k,i}}+M_{i}P_{12,\omega}\Delta_{M_{k,i}}F_{k,\omega}\big{)}P_{12}^{T}\Big{)}$
	$\displaystyle=trace\big{(}P_{12}^{T}F_{\omega}^{*}M_{i}P_{12,\omega}\Delta_{M_{k,i}}+F_{k,\omega}P_{12}^{T}M_{i}P_{12,\omega}\Delta_{M_{k,i}}\big{)}$
	$\displaystyle=trace\big{(}(P_{12,\omega})^{*}M_{i}F_{\omega}P_{12}(\Delta_{M_{k,i}})^{T}+(P_{12,\omega})^{*}M_{i}P_{12}F_{k,\omega}^{*}(\Delta_{M_{k,i}})^{T}\big{)}$
	$\displaystyle=trace\big{(}(P_{12,\omega})^{*}M_{i}P_{12,\omega}(\Delta_{M_{k,i}})^{T}\big{)}.$

Additionally, note that

	$\displaystyle trace(B_{k}^{T}\Delta_{Q_{k,\omega}}^{M_{k,i}}B_{k})$	$\displaystyle=trace\big{(}B_{k}B_{k}^{T}\Delta_{Q_{k,\omega}}^{M_{k,i}}\big{)}$
		$\displaystyle=trace\Big{(}\big{(}-A_{k}P_{k}-P_{k}A_{k}^{T}\big{)}\Delta_{Q_{k,\omega}}^{M_{k,i}}\Big{)}$
		$\displaystyle=trace\Big{(}\big{(}-A_{k}^{T}\Delta_{Q_{k,\omega}}^{M_{k,i}}-\Delta_{Q_{k,\omega}}^{M_{k,i}}A_{k}\big{)}P_{k}\Big{)}$
		$\displaystyle=trace\Big{(}\big{(}F_{k,\omega}^{*}\Delta_{M_{k,i}}P_{k,\omega}M_{k,i}+\Delta_{M_{k,i}}P_{k,\omega}M_{k,i}F_{k,\omega}$
		$\displaystyle+F_{k,\omega}^{*}M_{k,i}P_{k,\omega}\Delta_{M_{k,i}}+M_{k,i}P_{k,\omega}\Delta_{M_{k,i}}F_{k,\omega}\big{)}P_{k}\Big{)}$
		$\displaystyle=trace\big{(}F_{k,\omega}^{*}\Delta_{M_{k,i}}P_{k,\omega}M_{k,i}P_{k}+\Delta_{M_{k,i}}P_{k,\omega}M_{k,i}F_{k,\omega}P_{k}$
		$\displaystyle+F_{k,\omega}^{*}M_{k,i}P_{k,\omega}\Delta_{M_{k,i}}P_{k}+M_{k,i}P_{k,\omega}\Delta_{M_{k,i}}F_{k,\omega}P_{k}\big{)}$
		$\displaystyle=trace\big{(}2P_{k,\omega}M_{k,i}P_{k,\omega}(\Delta_{M_{k,i}})^{T}\big{)}.$

Thus, $\Delta_{J}^{M_{k,i}}$ becomes

$\displaystyle\Delta_{J}^{M_{k,i}}=2trace\big{(}(-P_{12,\omega}^{*}M_{i}P_{12,\omega}+P_{k,\omega}M_{k,i}P_{k,\omega})(\Delta_{M_{k,i}})^{T}\big{)}.$

Hence, the gradient of $J$ with respect to $M_{k,i}$ is given by

$\displaystyle\nabla_{J}^{M_{k,i}}=2(-P_{12,\omega}^{*}M_{i}P_{12,\omega}+P_{k,\omega}M_{k,i}P_{k,\omega}).$

Therefore,

$\displaystyle-P_{12,\omega}^{*}M_{i}P_{12,\omega}+P_{k,\omega}M_{k,i}P_{k,\omega}=0$

is a necessary condition for the local optimum of $||E||_{\mathcal{H}_{2,\omega}}^{2}$ .

By introducing a small first-order perturbation $\Delta_{B_{k}}$ to $B_{k}$ , $J$ is perturbed to $J+\Delta_{J}^{B_{k}}$ . Consequently, $P_{12,\omega}$ , $P_{k,\omega}$ , $Q_{12,\omega}$ and $Q_{k,\omega}$ are perturbed to $P_{12,\omega}+\Delta_{P_{12,\omega}}^{B_{k}}$ , $P_{k,\omega}+\Delta_{P_{k,\omega}}^{B_{k}}$ , $Q_{12,\omega}+\Delta_{Q_{12,\omega}}^{B_{k}}$ , and $Q_{k,\omega}+\Delta_{Q_{k,\omega}}^{B_{k}}$ , respectively. As a result, the first-order terms of $\Delta_{J}^{B_{k}}$ are given by

	$\displaystyle\Delta_{J}^{B_{k}}$	$\displaystyle=trace\big{(}-2Q_{12,\omega}^{*}B(\Delta_{B_{k}})^{T}+2Q_{k,\omega}B_{k}(\Delta_{B_{k}})^{T}$
		$\displaystyle\hskip 85.35826pt-2BB_{k}^{T}(\Delta_{Q_{12,\omega}}^{B_{k}})^{*}+B_{k}B_{k}^{T}\Delta_{Q_{k,\omega}}^{B_{k}}\big{)}.$

It follows from (9)-(16) that $\Delta_{P_{12,\omega}}^{B_{k}}$ , $\Delta_{P_{k,\omega}}^{B_{k}}$ , $\Delta_{Q_{12,\omega}}^{B_{k}}$ , and $\Delta_{Q_{k,\omega}}^{B_{k}}$ satisfy the following equations:

	$\displaystyle A\Delta_{P_{12,\omega}}^{B_{k}}+\Delta_{P_{12,\omega}}^{B_{k}}A_{k}^{T}+F_{\omega}B(\Delta_{B_{k}})^{T}+B(\Delta_{B_{k}})^{T}F_{k,\omega}^{*}=0,$
	$\displaystyle A_{k}\Delta_{P_{k,\omega}}^{B_{k}}+\Delta_{P_{k,\omega}}^{B_{k}}A_{k}^{T}+F_{k,\omega}\Delta_{B_{k}}B_{k}^{T}+\Delta_{B_{k}}B_{k}^{T}F_{k,\omega}^{*}$
	$\displaystyle\hskip 85.35826pt+F_{k,\omega}B_{k}(\Delta_{B_{k}})^{T}+B_{k}(\Delta_{B_{k}})^{T}F_{k,\omega}^{*}=0,$
	$\displaystyle A^{T}\Delta_{Q_{12,\omega}}^{B_{k}}+\Delta_{Q_{12,\omega}}^{B_{k}}A_{k}+F_{\omega}^{*}\sum_{i=1}^{p}M_{i}\Delta_{P_{12,\omega}}^{B_{k}}M_{k,i}+\sum_{i=1}^{p}M_{i}\Delta_{P_{12,\omega}}^{B_{k}}M_{k,i}F_{k,\omega}=0,$
	$\displaystyle A_{k}^{T}\Delta_{Q_{k,\omega}}^{B_{k}}+\Delta_{Q_{k,\omega}}^{B_{k}}A_{k}+F_{k,\omega}^{*}\sum_{i=1}^{p}M_{k,i}\Delta_{P_{k,\omega}}^{B_{k}}M_{k,i}+\sum_{i=1}^{p}M_{k,i}\Delta_{P_{k,\omega}}^{B_{k}}M_{k,i}F_{k,\omega}=0.$

Note that

	$\displaystyle trace\big{(}BB_{k}^{T}(\Delta_{Q_{12,\omega}}^{B_{k}})^{*}\big{)}$
	$\displaystyle=trace\Big{(}\big{(}-AP_{12}-P_{12}A_{k}^{T}\big{)}(\Delta_{Q_{12,\omega}}^{B_{k}})^{*}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}-A^{T}\Delta_{Q_{12,\omega}}^{B_{k}}-\Delta_{Q_{12,\omega}}^{B_{k}}A_{k}\big{)}P_{12}^{T}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}F_{\omega}^{*}\sum_{i=1}^{p}M_{i}\Delta_{P_{12,\omega}}^{B_{k}}M_{k,i}+\sum_{i=1}^{p}M_{i}\Delta_{P_{12,\omega}}^{B_{k}}M_{k,i}F_{k,\omega}\big{)}P_{12}^{T}\Big{)}$
	$\displaystyle=trace\Big{(}\sum_{i=1}^{p}M_{k,i}P_{12}^{T}F_{\omega}^{*}M_{i}\Delta_{P_{12,\omega}}^{B_{k}}+\sum_{i=1}^{p}M_{k,i}F_{k,\omega}P_{12}^{T}M_{i}\Delta_{P_{12,\omega}}^{B_{k}}\Big{)}$
	$\displaystyle=trace\Big{(}\sum_{i=1}^{p}M_{i}P_{12,\omega}M_{k,i}(\Delta_{P_{12,\omega}}^{B_{k}})^{*}\Big{)}.$

Furthermore,

	$\displaystyle trace\big{(}B_{k}B_{k}^{T}\Delta_{Q_{k,\omega}}^{B_{k}}\big{)}$
	$\displaystyle=trace\Big{(}\big{(}-A_{k}P_{k}-P_{k}A_{k}^{T}\big{)}\Delta_{Q_{k,\omega}}^{B_{k}}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}-A_{k}^{T}\Delta_{Q_{k,\omega}}^{B_{k}}-\Delta_{Q_{k,\omega}}^{B_{k}}A_{k}\big{)}P_{k}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}\sum_{i=1}^{p}F_{k,\omega}^{*}M_{k,i}\Delta_{P_{k,\omega}}^{B_{k}}M_{k,i}+\sum_{i=1}^{p}M_{k,i}\Delta_{P_{k,\omega}}^{B_{k}}M_{k,i}F_{k,\omega}\big{)}P_{k}\Big{)}$
	$\displaystyle=trace\Big{(}\sum_{i=1}^{p}M_{k,i}P_{k}F_{k,\omega}^{*}M_{k,i}\Delta_{P_{k,\omega}}^{B_{k}}+\sum_{i=1}^{p}M_{k,i}F_{k,\omega}P_{k}M_{k,i}\Delta_{P_{k,\omega}}^{B_{k}}\Big{)}$
	$\displaystyle=trace\Big{(}\sum_{i=1}^{p}M_{k,i}P_{k,\omega}M_{k,i}\Delta_{P_{k,\omega}}^{B_{k}}\Big{)}.$

Thus,

	$\displaystyle\Delta_{J}^{B_{k}}$	$\displaystyle=trace\Big{(}-2Q_{12,\omega}^{*}B(\Delta_{B_{k}})^{T}+2Q_{k,\omega}B_{k}(\Delta_{B_{k}})^{T}$
		$\displaystyle\hskip 56.9055pt-2\sum_{i=1}^{p}M_{i}P_{12,\omega}M_{k,i}(\Delta_{P_{12,\omega}}^{B_{k}})^{T}+\sum_{i=1}^{p}M_{k,i}P_{k,\omega}M_{k,i}\Delta_{P_{k,\omega}}^{B_{k}}\Big{)}.$

Additionally,

	$\displaystyle trace\Big{(}\sum_{i=1}^{p}M_{i}P_{12,\omega}M_{k,i}(\Delta_{P_{12,\omega}}^{B_{k}})^{*}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}-A^{T}\bar{Z}_{12}-\bar{Z}_{12}A_{k}\big{)}(\Delta_{P_{12,\omega}}^{B_{k}})^{*}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}-A\Delta_{P_{12,\omega}}^{B_{k}}-\Delta_{P_{12,\omega}}^{B_{k}}A_{k}^{T}\big{)}\bar{Z}_{12}^{*}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}F_{\omega}B(\Delta_{B_{k}})^{T}+B(\Delta_{B_{k}})^{T}F_{k,\omega}^{*}\big{)}\bar{Z}_{12}^{*}\Big{)}$
	$\displaystyle=trace\big{(}\bar{Z}_{12}^{*}F_{\omega}B(\Delta_{B_{k}})^{T}+F_{k,\omega}^{*}\bar{Z}_{12}^{*}B(\Delta_{B_{k}})^{T}\big{)}$
	$\displaystyle=trace\big{(}Z_{12,\omega}^{*}B(\Delta_{B_{k}})^{T}\big{)}.$

Similarly,

	$\displaystyle trace\Big{(}\sum_{i=1}^{p}M_{k,i}P_{k,\omega}M_{k,i}\Delta_{P_{k,\omega}}^{B_{k}}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}-A_{k}^{T}\bar{Z}_{k}-\bar{Z}_{k}A_{k}\big{)}\Delta_{P_{k,\omega}}^{B_{k}}\Big{)}$
	$\displaystyle=trace\Big{(}\big{(}-A_{k}\Delta_{P_{k,\omega}}^{B_{k}}-\Delta_{P_{k,\omega}}^{B_{k}}A_{k}^{T}\big{)}\bar{Z}_{k}\Big{)}$
	$\displaystyle=trace\Big{(}2\bar{Z}_{k}F_{k,\omega}B_{k}(\Delta_{B_{k}})^{T}+2F_{k,\omega}^{*}\bar{Z}_{k}B_{k}(\Delta_{B_{k}})^{T}\Big{)}$
	$\displaystyle=2trace\big{(}Z_{k,\omega}^{*}B_{k}(\Delta_{B_{k}})^{T}\big{)}.$

Thus,

	$\displaystyle\Delta_{J}^{B_{k}}$	$\displaystyle=trace\big{(}-2Q_{12,\omega}^{*}B(\Delta_{B_{k}})^{T}+2Q_{k,\omega}B_{k}(\Delta_{B_{k}})^{T}$
		$\displaystyle\hskip 56.9055pt-2Z_{12,\omega}^{*}B(\Delta_{B_{k}})^{T}+2Z_{k,\omega}B_{k}(\Delta_{B_{k}})^{T}\big{)}.$

Therefore, the gradient of $J$ with respect to $B_{k}$ is

\displaystyle\nabla_{J}^{B_{k}}=2(-Q_{12,\omega}^{*}B+Q_{k,\omega}B_{k}-Z_{12,\omega}^{*}B+Z_{k,\omega}B_{k}).

Hence,

	$\displaystyle-Q_{12,\omega}^{*}B+Q_{k,\omega}B_{k}-Z_{12,\omega}^{*}B+Z_{k,\omega}B_{k}$
	$\displaystyle=-(Y_{12,\omega}+2Z_{12,\omega})^{*}B+(Y_{k,\omega}+2Z_{k,\omega})B_{k}$
	$\displaystyle=0$

is a necessary condition for the local optimum of $||E||_{\mathcal{H}_{2,\omega}}^{2}$ .

First, we rewrite the cost function $J$ as follows:

	$\displaystyle J$	$\displaystyle=trace(-2B^{T}(Y_{12,\omega}+Z_{12,\omega})B_{k}+B_{k}^{T}(Y_{k,\omega}+Z_{k,\omega})B_{k})$
		$\displaystyle=trace(-2B^{T}Y_{12,\omega}B_{k}-2B^{T}Z_{12,\omega}B_{k}+B_{k}^{T}Y_{k,\omega}B_{k}+B_{k}^{T}Z_{k,\omega}B_{k}).$

Since

\displaystyle trace(-2B^{T}Y_{12,\omega}B_{k}+B_{k}^{T}Y_{k,\omega}B_{k})=trace(-2CP_{12,\omega}C_{k}^{T}+C_{k}P_{k,\omega}C_{k}^{T}),

we can write

\displaystyle J=trace(-2CP_{12,\omega}C_{k}^{T}+C_{k}P_{k,\omega}C_{k}^{T}-2B^{T}Z_{12,\omega}B_{k}+B_{k}^{T}Z_{k,\omega}B_{k});

cf. [50]. By adding a small first-order perturbation $\Delta_{C_{k}}$ to $C_{k}$ , the cost function $J$ is perturbed to $J+\Delta_{J}^{C_{k}}$ . The first-order terms of $\Delta_{J}^{C_{k}}$ are given by:

\displaystyle\Delta_{J}^{C_{k}}=trace(-2CP_{12,\omega}(\Delta_{C_{k}})^{T}+2C_{k}P_{k,\omega}(\Delta_{C_{k}})^{T}).

Thus, the gradient of $J$ with respect to $C_{k}$ is:

\displaystyle\nabla_{J}^{C_{k}}=2(-CP_{12,\omega}+C_{k}P_{k,\omega}).

Therefore, the necessary condition for the local optimum of $||E||_{\mathcal{H}_{2,\omega}}^{2}$ is:

\displaystyle-CP_{12,\omega}+C_{k}P_{k,\omega}=0.

This concludes the proof.
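The proof above switches between a trace written with $B$, $B_{k}$ and an observability-type Gramian and a trace written with $C$, $C_{k}$ and a controllability-type Gramian. A minimal NumPy sketch of this duality is given below for the frequency-unlimited analogue, i.e., with the standard Sylvester equations $AP_{12}+P_{12}A_{k}^{T}+BB_{k}^{T}=0$ and $A^{T}Y_{12}+Y_{12}A_{k}+C^{T}C_{k}=0$. The matrices are random, and the helpers `solve_sylvester` and `stable` are assumptions introduced only for this illustration; the frequency-limited and quadratic-output terms of the paper are not modelled.

```python
import numpy as np

def solve_sylvester(A, B, Q):
    """Solve A X + X B = Q by Kronecker vectorisation (small dense case only)."""
    n, k = A.shape[0], B.shape[0]
    K = np.kron(np.eye(k), A) + np.kron(B.T, np.eye(n))
    x = np.linalg.solve(K, Q.flatten(order="F"))
    return x.reshape((n, k), order="F")

def stable(rng, n):
    """Random real matrix shifted so that all eigenvalues have negative real part."""
    M = rng.standard_normal((n, n))
    return M - (np.max(np.linalg.eigvals(M).real) + 1.0) * np.eye(n)

rng = np.random.default_rng(1)
n, k, m, p = 6, 3, 2, 2
A, A_k = stable(rng, n), stable(rng, k)
B, B_k = rng.standard_normal((n, m)), rng.standard_normal((k, m))
C, C_k = rng.standard_normal((p, n)), rng.standard_normal((p, k))

# A P12 + P12 A_k^T + B B_k^T = 0
P12 = solve_sylvester(A, A_k.T, -B @ B_k.T)
# A^T Y12 + Y12 A_k + C^T C_k = 0
Y12 = solve_sylvester(A.T, A_k, -C.T @ C_k)

lhs = np.trace(B.T @ Y12 @ B_k)   # B-form of the cross term
rhs = np.trace(C @ P12 @ C_k.T)   # C-form of the cross term
print("difference:", abs(lhs - rhs))
```

Both traces agree up to rounding, which is why the cross term of the cost can be differentiated with respect to $C_{k}$ without touching the Gramian $P_{12}$.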

Appendix B

By pre-multiplying (9) with $W_{k}^{*}$ , we obtain:

	$\displaystyle W_{k}^{*}\big{(}AP_{12,\omega}+P_{12,\omega}A_{k}^{T}+F_{\omega}BB_{k}^{T}+BB_{k}^{T}F_{k,\omega}^{*}\big{)}$	$\displaystyle=0$
	$\displaystyle A_{k}+A_{k}^{T}+F_{k,\omega}B_{k}B_{k}^{T}+B_{k}B_{k}^{T}F_{k,\omega}^{*}$	$\displaystyle=0.$

Given the uniqueness of the solution of equation (10), we have $P_{k,\omega}=I$ .

Note that $Y_{12,\omega}+2Z_{12,\omega}$ and $Y_{k,\omega}+2Z_{k,\omega}$ satisfy the following equations:

	$\displaystyle A^{T}(Y_{12,\omega}+2Z_{12,\omega})+(Y_{12,\omega}+2Z_{12,\omega})A_{k}+F_{\omega}^{*}C^{T}C_{k}+C^{T}C_{k}F_{k,\omega}$
	$\displaystyle\hskip 83.93553pt+2\sum_{i=1}^{p}\big{(}F_{\omega}^{*}M_{i}P_{12,\omega}M_{k,i}+M_{i}P_{12,\omega}M_{k,i}F_{k,\omega}\big{)}=0,$		(35)
	$\displaystyle A_{k}^{T}(Y_{k,\omega}+2Z_{k,\omega})+(Y_{k,\omega}+2Z_{k,\omega})A_{k}+F_{k,\omega}^{*}C_{k}^{T}C_{k}+C_{k}^{T}C_{k}F_{k,\omega}$
	$\displaystyle\hskip 83.93553pt+2\sum_{i=1}^{p}\big{(}F_{k,\omega}^{*}M_{k,i}P_{k,\omega}M_{k,i}+M_{k,i}P_{k,\omega}M_{k,i}F_{k,\omega}\big{)}=0.$		(36)

Taking the Hermitian of equation (35) and post-multiplying it by $V_{k}$ , we get:

	$\displaystyle\Big{(}A_{k}^{T}(Y_{12,\omega}+2Z_{12,\omega})^{*}+(Y_{12,\omega}+2Z_{12,\omega})^{*}A+C_{k}^{T}CF_{\omega}+F_{k,\omega}^{*}C_{k}^{T}C$
	$\displaystyle\hskip 93.89418pt+\sum_{i=1}^{p}2M_{k,i}P_{12,\omega}^{*}M_{i}F_{\omega}+\sum_{i=1}^{p}2F_{k,\omega}^{*}M_{k,i}P_{12,\omega}^{*}M_{i}\Big{)}V_{k}$
	$\displaystyle=A_{k}^{T}+A_{k}+C_{k}^{T}C_{k}F_{k,\omega}+F_{k,\omega}^{*}C_{k}^{T}C_{k}$
	$\displaystyle\hskip 93.89418pt+\sum_{i=1}^{p}2M_{k,i}M_{k,i}F_{k,\omega}+\sum_{i=1}^{p}2F_{k,\omega}^{*}M_{k,i}M_{k,i}=0.$

By the uniqueness of the solutions of equations (10) and (36), we have $P_{k,\omega}=I$ and $Y_{k,\omega}+2Z_{k,\omega}=I$ . As a result, the optimality conditions (22)-(24) are met with $V_{k}=P_{12,\omega}$ and $W_{k}=Y_{12,\omega}+2Z_{12,\omega}$ .
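The pre- and post-multiplication steps in this appendix use the biorthogonality $W_{k}^{*}V_{k}=I$ of the projection matrices. The sketch below illustrates the mechanism on the frequency-unlimited analogue only: for any real `V`, `W` with `W.T @ V = I` and a reduced model `A_k = W.T @ A @ V`, `B_k = W.T @ B`, pre-multiplying $AV+VA_{k}^{T}+BB_{k}^{T}$ by $W^{T}$ collapses it to $A_{k}+A_{k}^{T}+B_{k}B_{k}^{T}$. The matrices are random, and the rescaling `W @ inv(V.T @ W)` is one common way to enforce biorthogonality, assumed here for illustration rather than taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(2)
n, k, m = 6, 3, 2
A = rng.standard_normal((n, n))
B = rng.standard_normal((n, m))

# Orthonormal V and a nearby W, so that V^T W is well conditioned
V, _ = np.linalg.qr(rng.standard_normal((n, k)))
W = V + 0.1 * rng.standard_normal((n, k))

# Enforce biorthogonality W^T V = I (assumed normalisation)
W = W @ np.linalg.inv(V.T @ W)

# Petrov-Galerkin reduced matrices
A_k = W.T @ A @ V
B_k = W.T @ B

# Pre-multiplying the full-order expression by W^T ...
lhs = W.T @ (A @ V + V @ A_k.T + B @ B_k.T)
# ... gives the reduced-order expression
rhs = A_k + A_k.T + B_k @ B_k.T
print("max deviation:", np.max(np.abs(lhs - rhs)))
```

If `V` additionally solves the corresponding Sylvester equation, the left-hand side is zero, so the reduced Lyapunov-type equation is solved by the identity; this is the pattern followed in the argument above.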

References

[1] J. M. Montenbruck, S. Zeng, F. Allgöwer, Linear systems with quadratic outputs, in: 2017 American Control Conference (ACC), IEEE, 2017, pp. 1030–1034.
[2] A. Van Der Schaft, Port-Hamiltonian systems: An introductory survey, in: International Congress of Mathematicians, European Mathematical Society Publishing House (EMS Ph), 2006, pp. 1339–1365.
[3] B. Picinbono, P. Devaut, Optimal linear-quadratic systems for detection and estimation, IEEE Transactions on Information Theory 34 (2) (1988) 304–311.
[4] Q. Aumann, S. W. Werner, Structured model order reduction for vibro-acoustic problems using interpolation and balancing methods, Journal of Sound and Vibration 543 (2023) 117363.
[5] S. Reiter, S. W. Werner, Interpolatory model order reduction of large-scale dynamical systems with root mean squared error measures, arXiv preprint arXiv:2403.08894 (2024).
[6] R. Van Beeumen, K. Van Nimmen, G. Lombaert, K. Meerbergen, Model reduction for dynamical systems with quadratic output, International Journal for Numerical Methods in Engineering 91 (3) (2012) 229–248.
[7] P. Benner, V. Mehrmann, D. C. Sorensen, Dimension reduction of large-scale systems, Vol. 45, Springer, 2005.
[8] A. Antoulas, D. Sorensen, K. A. Gallivan, P. Van Dooren, A. Grama, C. Hoffmann, A. Sameh, Model reduction of large-scale dynamical systems, in: Computational Science-ICCS 2004: 4th International Conference, Kraków, Poland, June 6-9, 2004, Proceedings, Part III 4, Springer, 2004, pp. 740–747.
[9] A. C. Antoulas, R. Ionutiu, N. Martins, E. J. W. ter Maten, K. Mohaghegh, R. Pulch, J. Rommes, M. Saadvandi, M. Striebel, Model order reduction: Methods, concepts and properties, Coupled Multiscale Simulation and Optimization in Nanoelectronics (2015) 159–265.
[10] A. C. Antoulas, C. A. Beattie, S. Gugercin, Interpolatory methods for model reduction, SIAM, 2020.
[11] B. Moore, Principal component analysis in linear systems: Controllability, observability, and model reduction, IEEE Transactions on Automatic Control 26 (1) (1981) 17–32.
[12] D. F. Enns, Model reduction with balanced realizations: An error bound and a frequency weighted generalization, in: The 23rd IEEE Conference on Decision and Control, IEEE, 1984, pp. 127–132.
[13] V. Mehrmann, T. Stykel, Balanced truncation model reduction for large-scale systems in descriptor form, in: Dimension Reduction of Large-Scale Systems: Proceedings of a Workshop held in Oberwolfach, Germany, October 19–25, 2003, Springer, 2005, pp. 83–115.
[14] T. Reis, T. Stykel, Balanced truncation model reduction of second-order systems, Mathematical and Computer Modelling of Dynamical Systems 14 (5) (2008) 391–406.
[15] H. Sandberg, A. Rantzer, Balanced truncation of linear time-varying systems, IEEE Transactions on Automatic Control 49 (2) (2004) 217–229.
[16] N. T. Son, P.-Y. Gousenbourger, E. Massart, T. Stykel, Balanced truncation for parametric linear systems using interpolation of gramians: a comparison of algebraic and geometric approaches, Model Reduction of Complex Dynamical systems (2021) 31–51.
[17] B. Kramer, K. Willcox, Balanced truncation model reduction for lifted nonlinear systems, in: Realization and Model Reduction of Dynamical Systems: A Festschrift in Honor of the 70th Birthday of Thanos Antoulas, Springer, 2022, pp. 157–174.
[18] P. Benner, P. Goyal, M. Redmann, Truncated gramians for bilinear systems and their advantages in model order reduction, Model reduction of Parametrized Systems (2017) 285–300.
[19] N. Wong, V. Balakrishnan, Fast positive-real balanced truncation via quadratic alternating direction implicit iteration, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 26 (9) (2007) 1725–1731.
[20] C. Guiver, M. R. Opmeer, Bounded real and positive real balanced truncation for infinite-dimensional systems, Mathematical Control and Related Fields 3 (1) (2013) 83–119.
[21] N. Wong, V. Balakrishnan, C.-K. Koh, Passivity-preserving model reduction via a computationally efficient project-and-balance scheme, in: Proceedings of the 41st Annual Design Automation Conference, 2004, pp. 369–374.
[22] A. Sarkar, J. M. Scherpen, Structure-preserving generalized balanced truncation for nonlinear Port-Hamiltonian systems, Systems & Control Letters 174 (2023) 105501.
[23] S. Gugercin, A. C. Antoulas, A survey of model reduction by balanced truncation and some new results, International Journal of Control 77 (8) (2004) 748–766.
[24] R. Van Beeumen, K. Meerbergen, Model reduction by balanced truncation of linear systems with a quadratic output, in: AIP Conference Proceedings, Vol. 1281, American Institute of Physics, 2010, pp. 2033–2036.
[25] R. Pulch, A. Narayan, Balanced truncation for model order reduction of linear dynamical systems with quadratic outputs, SIAM Journal on Scientific Computing 41 (4) (2019) A2270–A2295.
[26] P. Benner, P. Goyal, I. P. Duff, Gramians, energy functionals, and balanced truncation for linear dynamical systems with quadratic outputs, IEEE Transactions on Automatic Control 67 (2) (2021) 886–893.
[27] D. Wilson, Optimum solution of model-reduction problem, in: Proceedings of the Institution of Electrical Engineers, Vol. 117, IET, 1970, pp. 1161–1165.
[28] S. Gugercin, A. C. Antoulas, C. Beattie, $\mathcal{H}_{2}$ model reduction for large-scale linear dynamical systems, SIAM journal on matrix analysis and applications 30 (2) (2008) 609–638.
[29] Y. Xu, T. Zeng, Optimal $\mathcal{H}_{2}$ model reduction for large scale MIMO systems via tangential interpolation, International Journal of Numerical Analysis & Modeling 8 (1) (2011).
[30] P. Benner, M. Köhler, J. Saak, Sparse-dense Sylvester equations in $\mathcal{H}_{2}$ -model order reduction, Preprint MPIMD/11-11, Max Planck Institute Magdeburg, available from http://www.mpi-magdeburg.mpg.de/preprints/ (Dec. 2011).
[31] S. Reiter, I. Pontes Duff, I. V. Gosea, S. Gugercin, $\mathcal{H}_{2}$ optimal model reduction of linear systems with multiple quadratic outputs, arXiv preprint arXiv:2405.05951 (2024).
[32] R. W. Aldhaheri, Frequency-domain model reduction approach to design IIR digital filters using orthonormal bases, AEU-International Journal of Electronics and Communications 60 (6) (2006) 413–420.
[33] G. Obinata, B. D. Anderson, Model reduction for control system design, Springer Science & Business Media, 2012.
[34] D. F. Enns, Model reduction for control system design, Tech. Rep. NASA-CR-170417, NASA, CA (1985).
[35] U. Zulfiqar, V. Sreeram, X. Du, Finite-frequency power system reduction, International Journal of Electrical Power & Energy Systems 113 (2019) 35–44.
[36] U. Zulfiqar, V. Sreeram, X. Du, Frequency-limited pseudo-optimal rational Krylov algorithm for power system reduction, International Journal of Electrical Power & Energy Systems 118 (2020) 105798.
[37] W. Gawronski, J.-N. Juang, Model reduction in limited time and frequency intervals, International Journal of Systems Science 21 (2) (1990) 349–376.
[38] P. Benner, P. Kürschner, J. Saak, Frequency-limited balanced truncation with low-rank approximations, SIAM Journal on Scientific Computing 38 (1) (2016) A471–A499.
[39] M. Imran, A. Ghafoor, Model reduction of descriptor systems using frequency limited Gramians, Journal of the Franklin Institute 352 (1) (2015) 33–51.
[40] P. Benner, S. W. Werner, Frequency-and time-limited balanced truncation for large-scale second-order systems, Linear Algebra and its Applications 623 (2021) 68–103.
[41] H. R. Shaker, M. Tahavori, Frequency-interval model reduction of bilinear systems, IEEE Transactions on Automatic Control 59 (7) (2013) 1948–1953.
[42] Q.-Y. Song, U. Zulfiqar, Z.-H. Xiao, M. M. Uddin, V. Sreeram, Balanced truncation of linear systems with quadratic outputs in limited time and frequency intervals, arXiv preprint arXiv:2402.11445 (2024).
[43] D. Petersson, J. Löfberg, Model reduction using a frequency-limited $\mathcal{H}_{2}$ -cost, Systems & Control Letters 67 (2014) 32–39.
[44] P. Vuillemin, C. Poussot-Vassal, D. Alazard, $\mathcal{H}_{2}$ optimal and frequency limited approximation methods for large-scale LTI dynamical systems, IFAC Proceedings Volumes 46 (2) (2013) 719–724.
[45] X. Du, K. I. B. Iqbal, M. M. Uddin, A. M. Fony, M. T. Hossain, M. I. Ahmad, M. S. Hossain, Computational techniques for $\mathcal{H}_{2}$ optimal frequency-limited model order reduction of large-scale sparse linear systems, Journal of Computational Science 55 (2021) 101473.
[46] U. Zulfiqar, X. Du, Q.-Y. Song, V. Sreeram, On frequency-and time-limited $\mathcal{H}_{2}$ -optimal model order reduction, Automatica 153 (2023) 111012.
[47] N. J. Higham, Functions of matrices: theory and computation, SIAM, 2008.
[48] D. Petersson, A nonlinear optimization approach to $\mathcal{H}_{2}$ -optimal modeling and control, Ph.D. thesis, Linköping University Electronic Press (2013).
[49] K. B. Petersen, M. S. Pedersen, The matrix cookbook, Technical University of Denmark 7 (15) (2008) 510.
[50] U. Zulfiqar, V. Sreeram, X. Du, Adaptive frequency-limited $\mathcal{H}_{2}$ -model order reduction, Asian Journal of Control 24 (6) (2022) 2807–2823.

	$\displaystyle\Delta_{J}^{A_{k}}=trace\Big{(}-2(Q_{12,\omega})^{*}P_{12}(\Delta_{A_{k}})^{T}+2Q_{k,\omega}P_{k}(\Delta_{A_{k}})^{T}$
	$\displaystyle\hskip 56.9055pt-2(\Delta_{F_{k,\omega}}^{A_{k}})^{*}C_{k}^{T}CP_{12}+2(\Delta_{F_{k,\omega}}^{A_{k}})^{*}C_{k}^{T}C_{k}P_{k}$
	$\displaystyle\hskip 56.9055pt+\sum_{i=1}^{p}\big{(}-2M_{i}F_{\omega}P_{12}M_{k,i}(\Delta_{P_{12,\omega}}^{A_{k}})^{*}-2M_{i}P_{12}F_{k,\omega}^{*}M_{k,i}(\Delta_{P_{12,\omega}}^{A_{k}})^{*}$
	$\displaystyle\hskip 56.9055pt+M_{k,i}F_{k,\omega}P_{k}M_{k,i}\Delta_{P_{k,\omega}}^{A_{k}}+M_{k,i}P_{k}F_{k,\omega}^{*}M_{k,i}\Delta_{P_{k,\omega}}^{A_{k}}$
	$\displaystyle\hskip 56.9055pt-2(\Delta_{F_{k,\omega}}^{A_{k}})^{*}M_{k,i}(P_{12,\omega})^{*}M_{i}P_{12}+2(\Delta_{F_{k,\omega}}^{A_{k}})^{*}M_{k,i}P_{k,\omega}M_{k,i}P_{k}\big{)}\Big{)}.$
