Koopman-type inverse operator for linear non-minimum phase systems with disturbances*

Yuhan Li, and Xiaoqiang Ji*, IEEE Member * This work was partially supported by Shenzhen Science and Technology Program (Grant No. RCBS20210706092219050), Guangdong Basic and Applied Basic Research Foundation (Grant No. 2022A1515110411, Grant No. 2023A1515012883), and Shenzhen Institute of Artiﬁcial Intel-ligence and Robotics for Society (AC01202201001).Y.Li, and X. Ji are with the Shenzhen Institute of Artificial Intelligence and Robotics for Society, Shenzhen 518172, China, and the School of Science and Engineering, The Chinese University of Hong Kong, Shenzhen, Shenzhen 518172, China.* Corresponding author, email: [email protected]

Abstract

In this paper, a novel Koopman-type inverse operator for linear time-invariant non-minimum phase systems with stochastic disturbances is proposed. This operator employs functions of the desired output to directly calculate the input. Furthermore, it can be applied as a data-driven approach for systems with unknown parameters yet a known relative degree, which is a departure from the majority of existing data-driven methods that are only applicable to minimum phase systems. Based on this foundation, we use the Monte Carlo approach to develop an improved Koopman-type method for addressing the issue of inaccurate parameter estimation in data-driven systems with large disturbances. The simulation results justify the tracking accuracy of Koopman-type operator.

\UseRawInputEncoding

I INTRODUCTION

Numerous studies have demonstrated that output tracking for non-minimum phase (NMP) systems poses more significant challenges than minimum phase (MP) systems. A system is considered to have non-minimum phase (or possess unstable zeros in the linear case) if there exists a (nonlinear) state feedback capable of maintaining the system output at an identical zero level, while simultaneously causing the internal dynamics to become unstable (Isidori and Alberto, 1985). For linear systems, the inverse of the system will make the unstable zeros of the original system transfer function become the poles, which cause the instability of the inverse (Butterworth et al., 2018).

Various approaches have been proposed to tackle the difficulties in NPM systems, including decomposing the system into external and internal dynamics and solving for the internal dynamics (Devasia et al., 1996; Devasia and PadenFrom, 1998; Estrada, 2021). Berget and Tomas (2020) proposed a method for constructing a new output that eliminates the unstable part of the zero dynamics, while Zundert et al. (2019) have decomposed NMP systems into a stable part and an unstable part for separate control. Ma et al. (2020) solved the problem of random perturbations in NMP systems by using two cost functions that possess the dual property. However, one of the main disadvantages of model-based control methods is their reliance on accurate models of the systems, which can be difficult and time-consuming to develop and validate for complex NMP systems.

In recent years, data-driven approaches have garnered attention for their ability to address the challenge of obtaining accurate models for model-based control strategies. Among the classic methods for non-minimum phase systems are virtual reference feedback tuning (VRFT) (Campi et al., 2002) and correlation-based tuning (CBT) (Van Heusden et al., 2011). However, CBT and VRFT require a comprehensive dataset to ensure model accuracy and may rely on the selection of certain empirical parameters, as noted by Rallo et al. (2016). To overcome these limitations, Suresh Kumar et al. (2022) developed a data-driven control method that integrates internal model control and VRFT and designed a generalizable methodology for data-driven identification of nonlinear dynamics based on the Koopman operator. Markovsky et al. (2022) proposed and solved the data-driven dynamic interpolation and approximation problem. However, most data-driven NMP control methods often require large amounts of data for training, and their applicability conditions and control accuracy have not been mathematically proven. Mamakoukas et al. (2021) designed a generalizable methodology for data-driven identification of nonlinear dynamics based on the Koopman operator.

In this paper, we propose a data-driven method that requires very few parameters and training data and provide a proof for the tracking accuracy. For systems without disturbances, only a small period of input-output data is needed to determine the parameters of our method.

The Koopman method, initially introduced by Koopman (1931), has proven to be a valuable mathematical tool in transforming complex nonlinear systems into higher-dimensional linear systems. This transformation enables the application of established linear system theory to control nonlinear systems. In recent years, the Koopman method has gained significant attention in various fields, including cybernetics and machine learning. Mauroy et al. (2021) introduced the use of Koopman operators in control systems. Mamakoukas et al. (2020) demonstrated the effectiveness of the Koopman operator for a robotic system at the experimental level. Additionally, Klus et al. (2020) presented an application of the Koopman operator in data-driven methods, and Leon and Devasia (2022) proposed a Koopman-type data-driven approach to control linear MP systems. Further research on the Koopman method holds the potential to revolutionize the field of nonlinear control systems.

The Koopman method has been widely studied in control systems, with research falling into two main categories: selecting eigenfunctions of the Koopman operator experimentally and obtaining desired coefficients using machine learning methods, or introducing a Koopman method under an MP system and proving its applicability and effectiveness. However, there is currently no proof of the practicality of the Koopman method for NMP systems. In this paper, we propose a Koopman-type control method for linear NMP systems with stochastic disturbances and prove its applicability conditions and tracking effects. The contributions of this paper are twofold: first, to the best of our knowledge, this paper is the first to use the Koopman method for the control of linear NMP systems and provide a complete proof; second, we demonstrate that our Koopman-type operator is still applicable for systems with random perturbations.

The remainder of the paper is organized as follows, the introduction of basic notation and the Koopman method is in Section II, the problem of NMP system control is in Section III, the Koopman-type method is proposed in Section IV with the theoretical analysis and the demonstration of tracking error. Section V includes simulation results justifies the tracking accuracy theorem presented in Chapter IV. A brief conclusion is in Section IV.

II PRELIMINARIES

A. Notation

Throughout this paper, let $\mathbb{R}$ denotes the set of real numbers, $\mathbb{R}^{m\times n}$ denotes the set of $m\times n$ matrix, and $\mathbb{Z}^{+}$ denotes the set of positive integers. For vector $\mathbf{a}$ , $\mathbf{b}$ , $||\mathbf{a}||$ is the $l_{2}$ norm, dim( $\mathbf{a}$ ) is the dimension of vector $\mathbf{a}$ , $<\mathbf{a},\mathbf{b}>$ is the inner product, . For matrix $\mathbf{A}$ , $||\mathbf{A}||$ is the matrix Frobenius norm, $\mathbf{A}^{T}$ is the transpose of matrix $\mathbf{A}$ , $\mathbf{A}^{\dagger}$ is the Moore-Penrose pseudoinverse of matrix $\mathbf{A}$ . $U[x]$ is a uniform distribution on interval $[-x,x]$ for $x\in\mathbb{R}^{+}$ . For any matrix $\mathbf{A}=(a_{ij})$ , where $a_{ij}$ are random variables, then $E[\mathbf{A}]=(E[a_{ij}])$ is the mathematical expectation of matrix $\mathbf{A}$ . $y^{(r)}(t)$ is the r-th derivative of $y$ respect to $t$ . B. Koopman method

Consider a (Banach) space $\mathscr{F}$ of observables $f:\boldmath{X}\rightarrow\mathbb{C}$ . The Koopman operator $U_{s}:\mathscr{F}\rightarrow\mathscr{F}$ associated with the map $\boldmath{S}:\boldmath{X}\rightarrow\boldmath{X}$ is defined through the composition (Mauroy et al., 2020)

U_{s}f=f\circ\boldmath{S}\ \quad f\in\mathscr{F}

(1)

$U_{s}$ is Koopman operator which convert nonlinear systems to finite or infinite dimensional linear systems by decomposition of eigenequations and eigenvalues of $U_{s}$ and treat the nonlinear system as a higher order linear system.

The conventional Koopman method necessitates the identification of specific observables, such that they constitute a linear system. Subsequently, the linear system composed of these observables is analyzed to derive the control strategy for the linear system. The final step involves solving the inverse of the observables to obtain the desired parameters of the original system, such as the input function $u(t)$ of the system. A noteworthy concept inspired by the Koopman approach is the potential to directly and linearly acquire the desired parameters through observables. A valuable idea inspired by Yan and Devasia (2022) is along the lines of the Koopman approach is the possibility of getting the parameters we want directly and linearly through observables. This modified Koopman method we call Koopman-type method.

III PROBLEM STATEMENT

Consider a non-minimum phase (NMP) stochastic linear time-invariant (LTI) system defined by

	$\displaystyle\dot{\mathbf{x}}(t)$	$\displaystyle=\mathbf{A}\mathbf{x}(t)+\mathbf{B}u(t)+\mathbf{G}w(t)$		(2)
	$\displaystyle y(t)$	$\displaystyle=\mathbf{C}\mathbf{x}(t)+h(t)$		(2)

Where states $\mathbf{x}(t)\in\mathbb{R}^{n}$ , outpute $y(t)\in\mathbb{R}$ , $h(t)$ and $w(t)$ are stochastic disturbance functions, $\mathbf{A}\in\mathbb{R}^{n\times n}$ , $\mathbf{B},\mathbf{G}\in\mathbb{R}^{n\times 1}$ , and $\mathbf{C}\in\mathbb{R}^{1\times n}$ . Assumption 1: $h(t)$ , $w(t)$ are bounded, i.e. $\sup\limits_{t\in\mathbb{R}}|h(t)|<\infty$ and $\sup\limits_{t\in\mathbb{R}}|w(t)|<\infty$

Perform Laplace transform on (1),

	$\displaystyle Y(s)=$	$\displaystyle[\mathbf{C}(s\mathbf{I}_{n}-\mathbf{A})^{-1}\mathbf{B}]U(s)+$		(3)
		$\displaystyle[\mathbf{C}(s\mathbf{I}_{n}-\mathbf{A})^{-1}\mathbf{G}]W(s)+H(s)$		(3)

Write the first part of (3) in the form

		$\displaystyle\mathbf{C}(s\mathbf{I}_{n}-\mathbf{A})^{-1}\mathbf{B}=$		(4)
		$\displaystyle k\cdot\frac{s^{n-r}+b_{n-r-1}\cdot s^{n-r-1}+...+b_{0}}{s^{n}+a_{n-1}s^{n-1}+...+a_{1}s+a_{0}}$		(4)

Assumption 2: The relative degree $r\leq n$ is known and the exact value of matrix $\mathbf{A},\mathbf{B},\mathbf{C},\mathbf{G}$ are unknown. Assumption 3: The polynomial in the numerator of (4) has roots with real parts greater than zero but no roots with real parts equal to zero, while the roots of the denominator polynomial have all real parts less than zero.

Take $y_{e}=y(t)-h(t)$ , $\mathbf{\xi}=(y_{e},\dot{y}_{e},\ddot{y}_{e},...,y_{e}^{(r-1)})^{T}$ , $\mathbf{\eta}=(x_{1},x_{2},...,x_{n-r})^{T}$ , we call $\eta$ the internal states, where $x_{i}$ is the i-th element in $\mathbf{x}$ . Using linear transformations the same as in Hendricks et al. (2008), the system can be written as

	$\displaystyle\dot{\mathbf{\xi}}(t)$	$\displaystyle=\mathbf{A}_{1}\mathbf{\xi}(t)+\mathbf{A}_{2}\mathbf{\eta}(t)+\mathbf{B}_{1}u(t)+\mathbf{G}_{1}w(t)$		(5)
	$\displaystyle\dot{\mathbf{\eta}}(t)$	$\displaystyle=\mathbf{A}_{3}y(t)+\mathbf{A}_{4}\mathbf{\eta}(t)+\mathbf{G}_{2}w(t)$		(5)

Where

	$\displaystyle\mathbf{A}_{3}=(0,0,...,1)^{T}\in\mathbb{R}^{n\times 1}$		(6)
	$\displaystyle\mathbf{B}_{1}=(0,0,...,k)^{T}\in\mathbb{R}^{n\times 1}$
	$\displaystyle\mathbf{G}_{1}=(0,0,...,g)^{T}\in\mathbb{R}^{n\times 1}$

and

	$\displaystyle\mathbf{A}_{1}$	$\displaystyle=\left(\begin{array}[]{ccc}0&\cdots&0\\ \vdots&\vdots&\vdots\\ -&\mathbf{r}^{T}&-\\ \end{array}\right)\mathbf{A}_{2}=\left(\begin{array}[]{ccc}0&\cdots&0\\ \vdots&\vdots&\vdots\\ -&\mathbf{s}^{T}&-\\ \end{array}\right)$		(7)
	$\displaystyle\mathbf{A}_{4}$	$\displaystyle=\left(\begin{array}[]{cccc}0&1&0&\cdots\\ \vdots&\ddots&\ddots&\vdots\\ 0&\cdots&0&1\\ -b_{0}&-b_{1}&\cdots&-b_{n-r-1}\end{array}\right)$		(7)

Denote $y_{d}(t)$ as the desired output, the task of this paper is to find a Koopman-type operator to calculate input $u(t)$ directly from $y_{d}$ such that under this input, the actual output $y(t)$ and $y_{d}(t)$ differ by an acceptable error as time grows.

Assumption 1

$y_{d}$ is r-order continuously differentiable and there is $M>0$ , $y_{d}(t)\leq M$ for any $t\in\mathbb{R}$ .

IV MAIN RESULTS

In this section, we describe the Koopman-type operator in detail. In part A, we present the specific formulation of the Koopman-type method and conduct a theoretical analysis of its tracking accuracy. In part B, we introduce a data-driven approach for determining the parameters required by the Koopman-type method. In part C, we designed an improved Koopman-type method by incorporating the Monte Carlo technique to improve the accuracy of parameters estimation in data-driven processes for systems with large disturbances. A. Koopman-type operator and its performance analysis

Definition 1 Koopman-type operator

	$\displaystyle\mathcal{A}:=$	$\displaystyle\{q^{-d\cdot\Delta t}y_{d}(t)\|d\in\{0,\pm 1\cdots\pm N-1,N\}\}$		(8)
	$\displaystyle\mathcal{D}:=$	$\displaystyle\{y_{d}^{(i)}\|i\in\{1,2\cdots r\}\}$		(8)

Where $q^{-d\cdot\Delta t}$ is $d\cdot\Delta t$ pure time delay. Take $\Phi$ be a column vector containing every function in $\mathcal{A}$ and $\mathcal{D}$ , we define the Koopman-type operator in the form

u(t)=<\mathbf{K},\Phi(t)>

(9)

Where $\mathbf{K}$ is a constant column vector with dimension $2N+r$ . Remark 1: $\mathbf{K}$ is an undetermined parameter of the Koopman-type operator, for different systems we need to determine different $\mathbf{K}$ to improve the accuracy of tracking. Theorem 1: There exists $\delta>0$ , for any $\epsilon>0$ , there exist $\hat{N}$ , $\Delta\hat{t}$ and $\mathbf{K}$ , if $N>\hat{N}$ , and $\Delta t<\Delta\hat{t}$ , then from the Koopman-type operator, the output of system (1) satisfies

|y(t)-y_{d}(t)|\leq\epsilon+\delta\sup\limits_{t\in\mathbb{R}}|w(t)|+\sup\limits_{t\in\mathbb{R}}|r(t)|

(10)

for large enough t.

Proof By the linear variable substitution of $\mathbf{x}$ , Zou and Devasia (1999) , the second equation of system (3) can be written as

\dot{\eta^{\prime}}(t)=\mathbf{A}_{3^{\prime}}y_{e}(t)+\mathbf{A}_{4^{\prime}}\eta^{\prime}(t)+\mathbf{G}_{2^{\prime}}w(t)

(11)

Where

\mathbf{A}_{4^{\prime}}=\begin{bmatrix}\mathbf{A}_{4^{\prime}-}&0\\ 0&\mathbf{A}_{4^{\prime}+}\end{bmatrix}

(12)

Where all eigenvalues of $\mathbf{A}_{4^{\prime}-}(\mathbf{A}_{4^{\prime}+})$ , have negative (positive) real part. Without of lose generality, assume $\mathbf{A}_{4}$ is directly this form. From Devasia (1996), the unique solution of the second equation in system (3) with the assumption $\eta(\pm\infty)=0$ for given $y_{d}$ is

\eta(t)=\int_{-\infty}^{+\infty}\phi(t-\tau)(\mathbf{A}_{3}y_{d}(\tau)+\mathbf{G}_{2}w(\tau))d\tau

(13)

Where

\phi(t)=\begin{bmatrix}1(t)e^{\mathbf{A}_{4-}}&0\\ 0&-1(-t)e^{\mathbf{A}_{4+}}\end{bmatrix}

(14)

Lemma 1: There exists positive scalars $\alpha>0,\beta>0,\gamma>0$ such that

||\eta(t)-\eta_{N}(t)||\leq\beta e^{-\alpha N\Delta t}+\gamma\sup\limits_{t\in\mathbb{R}}|w(t)|

(15)

where $\eta(t)$ is calcullated in (13), and

\eta_{N}(t)=\int_{t-N\Delta t}^{t+N\Delta t}\phi(t-\tau)(\mathbf{A}_{3}y_{d}(\tau)+\mathbf{G}_{2}w(\tau))d\tau

(16)

Proof The eigenvalues of $\mathbf{A}_{4-}$ and $-\mathbf{A}_{4+}$ have negative real parts so there exists postive scalars $\kappa_{1}>0$ , $\kappa_{2}>0$ , $\kappa_{3}>0$ , $\alpha_{1}>0$ , $\alpha_{2}>0$ , $\alpha_{3}>0$ such that, see Desoer and Vidyasagar (2009)

$\displaystyle\|\|\phi(t)\|\|$	$\displaystyle\leq\kappa_{1}e^{-\alpha_{1}t}$	(17)
$\displaystyle\|\|\phi(-t)\|\|$	$\displaystyle\leq\kappa_{2}e^{-\alpha_{2}t}$
$\displaystyle\|\|e^{\mathbf{A}t}\|\|$	$\displaystyle\leq\kappa_{3}e^{-\alpha_{3}t}$

Then

	$\displaystyle\|\|\eta(t)-\eta_{N}(t)\|\|=$	$\displaystyle\|\|\int_{-\infty}^{t-N\Delta t}\phi(t-\tau)\mathbf{A}_{3}y_{d}(\tau)d\tau+$
		$\displaystyle\int_{t+N\Delta t}^{\infty}\phi(t-\tau)\mathbf{A}_{3}y_{d}(\tau)d\tau+$
		$\displaystyle\int_{-\infty}^{+\infty}\phi(t-\tau)\mathbf{G}_{2}w(\tau)d\tau\|\|$
	$\displaystyle\leq$	$\displaystyle M\|\|\mathbf{A}_{3}\|\|(\|\|\int_{-\infty}^{t-N\Delta t}\kappa_{1}e^{-\alpha_{1}(t-\tau)}d\tau\|\|+$
		$\displaystyle\|\|\int_{t+N\Delta t}^{\infty}\kappa_{2}e^{-\alpha_{2}(t-\tau)}d\tau\|\|)+$
		$\displaystyle\sup\limits_{t\in\mathbb{R}}\|w(t)\|\cdot\|\|\mathbf{G}_{2}\|\|$
		$\displaystyle(\|\|\int_{-\infty}^{t}\kappa_{1}e^{-\alpha_{1}(t-\tau)}d\tau\|\|+$
		$\displaystyle\|\|\int_{t}^{\infty}\kappa_{2}e^{-\alpha_{2}(t-\tau)}d\tau\|\|)$
	$\displaystyle=$	$\displaystyle M\|\|\mathbf{A}_{3}\|\|(\frac{\kappa_{1}}{\alpha_{1}}e^{-\alpha_{1}N\Delta t}+\frac{\kappa_{2}}{\alpha_{2}}e^{-\alpha_{2}N\Delta t})+$
		$\displaystyle\sup\limits_{t\in\mathbb{R}}\|w(t)\|\cdot\|\|\mathbf{A}_{2}\|\|(\frac{\kappa_{1}}{\alpha_{1}}+\frac{\kappa_{2}}{\alpha_{2}})$

Take $\alpha=\min{\alpha_{1},\alpha_{2}}$ , $\beta=2M||\mathbf{A}_{3}||\max{\frac{\kappa_{1}}{\alpha_{1}},\frac{\kappa_{2}}{\alpha_{2}}}$ , $\gamma=||\mathbf{G}_{2}||(\frac{\kappa_{1}}{\alpha_{1}}+\frac{\kappa_{2}}{\alpha_{2}})$ $\blacksquare$

By the definition of integration, we can write the integral as partial sum

$\displaystyle\eta_{N}(t)=$	$\displaystyle\int_{t-N\Delta t}^{t+N\Delta t}\phi(t-\tau)\mathbf{A}_{3}y(\tau)d\tau$	(18)
$\displaystyle\approx$	$\displaystyle\Delta t\cdot\sum_{\tau=0}^{2N-1}\phi((-N+\tau)\Delta t)\mathbf{A}_{3}q^{(N-\tau-1)\Delta t}y_{d}(t)$
$\displaystyle\triangleq$	$\displaystyle\hat{\eta}_{N}(t)$

For any $\epsilon_{1}>0$ , we can find small enough $\Delta\hat{t}$ , such that for $\Delta t\leq\Delta\hat{t}$ $\sup\limits_{t\in\mathbb{R}}||\eta_{T}(t)-\hat{\eta}_{t}(t)||<\epsilon_{1}$ , then

	$\displaystyle\|\|\eta(t)-\hat{\eta}_{N}(t)\|\|$	$\displaystyle\leq\|\|\eta_{N}(t)-\hat{\eta}_{N}(t)\|\|+\|\|\eta(t)-\eta_{N}(t)\|\|$		(19)
		$\displaystyle\leq\beta e^{-\alpha N\Delta t}+\gamma\sup\limits_{t\in\mathbb{R}}\|w(t)\|+\epsilon_{1}$		(19)

Take

	$\displaystyle u(t)\triangleq$	$\displaystyle\frac{1}{k}[y_{d}^{(r)}(t)-\mathbf{r}^{T}\xi_{d}(t)-\mathbf{s}^{T}\eta(t)-g\cdot w(t)]$		(20)
	$\displaystyle\hat{u}(t)\triangleq$	$\displaystyle\frac{1}{k}[y_{d}^{(r)}(t)-\mathbf{r}^{T}\xi_{d}(t)-\mathbf{s}^{T}\hat{\eta}_{N}(t)]$		(20)

Then

	$\displaystyle\|u(t)-\hat{u}(t)\|<$	$\displaystyle\frac{1}{k}(\|\|\mathbf{s}\|\|(\beta e^{-\alpha N\Delta t}+\epsilon_{1})+$		(21)
		$\displaystyle(\|\|\mathbf{s}\|\|\gamma+g)\sup\limits_{t\in\mathbb{R}}\|w(t)\|)$		(21)

For some initial state $\mathbf{x}(0)$ , under inpute $u(t)$ ,we have, the outpute $y(t)$ can be strictly equal to $y_{d}(t)-h(t)$ , then

	$\displaystyle y_{d}(t)$	$\displaystyle=\mathbf{C}\mathbf{x}(t)$		(22)
		$\displaystyle=\mathbf{C}(e^{\mathbf{A}t}\mathbf{x}(0)+\int_{0}^{t}e^{\mathbf{A}(t-\tau)}(\mathbf{B}u(\tau)+\mathbf{G}w(\tau))d\tau)$		(22)

Assume the true system has initial states $\hat{\mathbf{x}}(0)$ , under the outpute $\hat{u}(t)$ , take

	$\displaystyle\hat{y}_{d}(t)$	$\displaystyle=C\mathbf{x}(t)$		(23)
		$\displaystyle=C(e^{At}\hat{\mathbf{x}}(0)+\int_{0}^{t}e^{A(t-\tau)}(B\hat{u}(\tau)+Gw(\tau))d\tau)$		(23)

Then

	$\displaystyle\|\hat{y}_{d}(t)-y_{d}(t)\|=$	$\displaystyle\|\mathbf{C}(e^{\mathbf{A}t}(\hat{\mathbf{x}}(0)-\mathbf{x}(0))+$
		$\displaystyle\int_{0}^{t}e^{\mathbf{A}(t-\tau)}(\hat{u}(\tau)-u(\tau))d\tau\|$
	$\displaystyle\leq$	$\displaystyle\|\|\mathbf{C}\|\|(\beta_{3}e^{-t\alpha_{3}}\cdot\|\|\mathbf{x}(0)-\hat{\mathbf{x}}(0)\|\|+$
		$\displaystyle\frac{1}{k}(\|\|\mathbf{s}\|\|(\beta e^{-\alpha N\Delta t}+\epsilon_{1})+$
		$\displaystyle(\|\|s\|\|\gamma+g)\sup\limits_{t\in\mathbb{R}}\|w(t)\|)\frac{\beta_{3}}{\alpha_{3}}(1-e^{-\alpha_{3}t}))$

The true output

\displaystyle y(t)=\hat{y}_{d}(t)+h(t)

(24)

		$\displaystyle\lim\limits_{t\to+\infty}\|y(t)-y_{d}(t)\|$
	$\displaystyle\leq$	$\displaystyle\lim\limits_{t\to+\infty}\|y(t)-\hat{y}_{d}(t)\|+\|\hat{y}_{d}(t)-y_{d}(t)\|$
	$\displaystyle\leq$	$\displaystyle\sup\limits_{t\in\mathbb{R}}\|h(t)\|+\|\|\mathbf{C}\|\|\frac{\|\|\mathbf{s}\|\|\beta_{3}}{k\alpha_{3}}(\epsilon_{1}+\beta e^{-\alpha N\Delta t})+$
		$\displaystyle\frac{\beta_{3}}{\alpha_{3}}(\|\|\mathbf{s}\|\|\gamma+g)\sup\limits_{t\in\mathbb{R}}\|w(t)\|$

Take $\epsilon_{1}$ small enough and $\hat{N}$ such that $||\mathbf{C}||\frac{||\mathbf{s}||\beta_{3}}{k\alpha_{3}}(\epsilon_{1}+\beta e^{-\alpha N\Delta t})<\epsilon$ and $\delta=\frac{\beta_{3}}{\alpha_{3}}(||\mathbf{s}||\gamma+g)$ $\blacksquare\\ \\$ B. Data-driven pseudoinverse approach for Koopman-
type operator parameter optimization design

Theorem 1 illustrates that for the system with disturbance, if a suitable $\mathbf{K}$ can be found, then our Koopman-type operator can control this system with error depending on the disturbances. In this part we will show how to find $\mathbf{K}$ by data-driven pseudoinverse method. Using every function $\phi_{i}$ in $\Phi$ as the input to the system and measuring the output $y_{i}$ of the system. Denote the output vector by $\mathbb{O}_{i}$

		$\displaystyle\mathbb{O}_{i}=[\phi_{i}(t_{1}),\phi_{i}(t_{2})\cdots\phi_{i}(t_{j})],$		(25)
		$\displaystyle j\in\mathbb{Z}^{+}and\ t_{1}<t_{2}\cdots<t_{j}$		(25)

Remark 2: For different input functions, $\phi_{i}$ , $t_{1},t_{2},...,t_{j}$ are same and $t_{1}$ should be large enough to reduce the effect of the initial dynamic. Take

\mathbb{O}=[\mathbb{O}_{1}^{T},\mathbb{O}_{2}^{T},...]^{T}

(26)

\mathbb{O}_{d}=[y_{d}(t_{1}),y_{d}(t_{2}),...,y_{d}(t_{j})]

(27)

We calculate K by minimize

||\mathbb{O}_{d}-\mathbf{K}^{T}\mathbb{O}||

(28)

and the solution to this optimal problem can be written in exact form

\mathbf{K}^{T}=\mathbb{O}_{d}\mathbb{O}^{\dagger}

(29)

C. Improved Koopman-type method to handle large
disturbances

Monte Carlo methods tend to perform well in dealing with stochastic problems. For systems with large perturbations, the parameters of the Koopman-type operator obtained by the data-driven method may be very inaccurate, this section we will prove the errors in the data-driven process can be reduced using Monte Carlo methods. Theorem 2: Assume $E[\mathbb{O}]$ is the expection of the output matrix, and $\mathbb{O}^{1},\mathbb{O}^{2},...$ are the output matrix obtained from system (5), take $\overline{\mathbb{O}}_{N}=\frac{1}{N}\sum^{N}_{n=1}\mathbb{O}^{n}$ . For given $\Phi$ , if $E[r(t)]=E[w(t)]=0$ , and $E[\int_{0}^{t}w^{2}(t)]<\infty$ for any $t\in\mathbb{R}$ then there exists $\sigma>0$ , with probability $1-p$ , $||\overline{\mathbb{O}}_{N}-E[\mathbb{O}]||_{\infty}<\frac{\sigma}{\sqrt{Np}}$ .

Proof Denote $y_{s}$ as the actual output of system (1)

		$\displaystyle Var(y_{s}(t)-E[y_{s}(t)])$		(30)
		$\displaystyle=Var(\int_{0}^{t}e^{\mathbf{A}(t-\tau)}\mathbf{G}w(\tau)d\tau)+Var(r(t))$
		$\displaystyle=E[(\int_{0}^{t}e^{\mathbf{A}(t-\tau)}\mathbf{G}w(\tau)d\tau)^{2}]+Var(r(t))$
		$\displaystyle\leq\|\|\mathbf{G}\|\|^{2}(\int_{0}^{t}(\kappa_{3}e^{-\alpha_{3}(t-\tau)})^{2}d\tau)E[\int_{0}^{t}w^{2}(\tau)d\tau]+$
		$\displaystyle Var(r(t))$
		$\displaystyle\leq\frac{\kappa_{3}^{2}\|\|\mathbf{G}\|\|^{2}}{2\alpha_{3}}E[\int_{0}^{t}w^{2}(\tau)d\tau]+Var(r(t))$

The second to the last inequality hold by Cauchy Inequality. Take

V^{2}=\frac{\kappa_{3}^{2}||\mathbf{G}||^{2}}{2\alpha_{3}}\sup\limits_{t\in[0,t_{j}]}E[\int_{0}^{t}w^{2}(\tau)d\tau]+\sup\limits_{t\in\mathbb{R}}Var(r(t))

(31)

Then $Var(y_{s}(t)-E[y(t)])\leq V^{2}$ for all $t\in[0,t_{j}]$ , by Chebyshev’s Inequality, with probablity $1-sp$ , where s is the number of elements in matrix $\mathbb{O}$

	$\displaystyle\|\|\overline{\mathbb{O}}_{N}-E[\mathbb{O}]\|\|_{\infty}$	$\displaystyle=\|\|\frac{1}{N}\sum^{N}_{n=1}(\mathbb{O}^{n}-E[\mathbb{O}])\|\|_{\infty}$		(32)
		$\displaystyle\leq\frac{V}{\sqrt{Np}}$		(32)

Take $\sigma=V\sqrt{s}$ $\blacksquare\\$ Remark 3 According to this theorem, for systems subjected to substantial disturbances, an effective strategy to mitigate the error and attain an accurate estimation of $\mathbf{K}$ is to employ averaging over multiple measurements.

V SIMULATION RESULTS

In this section, we present a linear NMP system and validate the results in Section IV through simulation. In Part A, the system’s parameters and the simulation methodology are designed. In Part B, data-driven approach in Section IV, Part B is applied to determine Koopman-type operator’s parameters. Then the function provided by the Koopman operator is applied to the system to confirm the tracking accuracy demonstrated in Section IV, Part A. Lastly, in Part C, we corroborate that the Improved Koopman-type method proposed in Section IV, Part C is more effective in systems with substantial perturbations. A. Simulation system and parameter design

The simulation system we choose is

	$\displaystyle\dot{\mathbf{x}}$	$\displaystyle=\begin{pmatrix}0&1&0&0\\ 0&0&1&0\\ 0&0&0&1\\ -1&-4&-5.5&-3.5\end{pmatrix}\mathbf{x}+\begin{pmatrix}0\\ 0\\ 0\\ 1\end{pmatrix}u(t)+\mathbf{G}w$		(33)
	$\displaystyle y$	$\displaystyle=\begin{pmatrix}-2&1&1&0\end{pmatrix}x+h$		(33)

Where $\mathbf{G}$ is a $1\times 4$ matrix. And $w,h$ are bounded disturbances, heir characteristics will be determined according to the specific simulation requirements. The system is non-minimum phase, and it has relative degree 2. The initial stste at $t=0$ is set to $\mathbf{x}(0)=\textbf{0}$ .

The function being tracked is set to $y_{d}=\sin(0.1t)$ . We take $T=10,\Delta t=0.5$ in Theorem 1. The output vector $\mathbb{O}_{i}$ are generated by $y_{d}(t+T-i\Delta t)=\sin(0.1(t+10-0.5i))$ as the input of the system for $i\in\{1,2\cdots 40\}$ . And $\mathbb{O}_{41}$ , $\mathbb{O}_{42}$ are generated by $\dot{y}_{d}=0.1\cos(0.1t)$ , $\dot{y}_{d}=-0.01\sin(0.1t)$ . All these output vector has $t_{j}=50+0.5j$ for $j\in{1,2\cdots 100}$ in (19). Parameters of the Koopman-type operator are calculated by (25).

For a given input function $u(t)$ , we calculate the value at y(t) by

y(t)=\mathbf{C}(\int_{0}^{t}e^{\mathbf{A}\tau}(\mathbf{B}u(\tau)+\mathbf{G}w(\tau)))+h(t)

(34)

For the calculation of the integral we use partition method, in order to ensure the accuracy and do not occupy too much memory, the length of partition length is 0.01, which is accurate compare to $\Delta=0.05$ . B. Performance of Koopman-type operator in
disturbanced system under data-driven pseudoinverse approach

In this part, random disturbances are added to both the input and output. We take $\mathbf{G}$ =(0 0 0 1)^T, $w(t)=U[0.05|u(t)|],h(t)=U[0.05|y(t)|]$ . Solving the Koopman operator parameters by Data-driven method and data simulation are performed in the perturbed system. Figures 1 and 2 show the simulation results and errors for this disturbed system, and the images vary slightly from simulation to simulation due to the presence of random functions in the system. From Figure 2, the tracking error is less than $0.05max(y_{d}(t))=0.05$ .

Refer to caption — Figure 1: Simulation result for disturbed system

C. Performance of improved Koopman-type operator

In this part, we introduce a simulated scenario in which large disturbances are added to both the input and output of the system during the data-driven approach for solving the Koopman-type operator coefficients. Specifically, we set $\mathbf{G}$ to be (0 0 0 1)^T, and adopt $w(t)=U[0.2|u(t)|]$ and $h(t)=U[0.2|y(t)|]$ to model the larger errors in measurement that are often encountered in practical applications. To obtain different values of $K$ , we select values of $N$ from 1, 5, and 10 in Theorem 2. After data-driven process, we get a parameter vector $\mathbf{K}$ , then we input this K into the Koopman-type operator, and the resulting input function is used as the input to the unperturbed system, the error with the tracked function is obtained. Figure 3 illustrates the impact of varying Monte Carlo sampling times on the resulting tracking performance.

VI CONCLUSIONS

In this paper, we design a data-driven Koopman-type operator for linear non-minimum phase (NMP) systems, and give theoretical proof of the control accuracy of NMP systems using the Koopman-type method, this is the first proof for Koopman-type method in NMP systems. We demonstrate that the tracking accuracy of our Koopman-type operator is directly proportional to the ground-bound of the NMP system perturbation, given sufficiently long causal and non-causal desired outputs and appropriately small sampling intervals. Additionally, applying the Monte Carlo method helps mitigate the impact of perturbations on measurement results during the data-driven process. Our ongoing research efforts focus on extending this approach to address linear time-varying non-minimum phase systems.

References

[1] Thomas Berger. Tracking with prescribed performance for linear non-minimum phase systems. Automatica, 115:108909, 2020.
[2] Jeffrey A Butterworth, Lucy Y Pao, and Daniel Y Abramovitch. The effect of nonminimum-phase zero locations on the performance of feedforward model-inverse control techniques in discrete-time systems. In 2008 American control conference, pages 2696–2702. IEEE, 2008.
[3] Charles A Desoer and Mathukumalli Vidyasagar. Feedback systems: input-output properties. SIAM, 2009.
[4] Santosh Devasia, Degang Chen, and Brad Paden. Nonlinear inversion-based output tracking. IEEE Transactions on Automatic Control, 41(7):930–942, 1996.
[5] Santosh Devasia and B Paden. Stable inversion for nonlinear nonminimum-phase time-varying systems. IEEE Transactions on Automatic Control, 43(2):283–288, 1998.
[6] Michael Estrada. Toward the Control of Non-Linear, Non-Minimum Phase Systems via Feedback Linearization and Reinforcement Learning. PhD thesis, University of California Berkeley, CA, USA, 2021.
[7] Elbert Hendricks, Ole Jannerup, and Paul Haase Sørensen. Linear systems control: deterministic and stochastic methods. Springer, 2008.
[8] Alberto Isidori. Nonlinear control systems: an introduction. Springer, 1985.
[9] Stefan Klus, Feliks Nüske, Sebastian Peitz, Jan-Hendrik Niemann, Cecilia Clementi, and Christof Schütte. Data-driven approximation of the koopman generator: Model reduction, system identification, and control. Physica D: Nonlinear Phenomena, 406:132416, 2020.
[10] Bernard O Koopman. Hamiltonian systems and transformation in hilbert space. Proceedings of the National Academy of Sciences, 17(5):315–318, 1931.
[11] Xuehui Ma, Fucai Qian, and Shiliang Zhang. Dual control for stochastic systems with multiple uncertainties. In 2020 39th Chinese Control Conference (CCC), pages 1001–1006. IEEE, 2020.
[12] Giorgos Mamakoukas, Maria L Castano, Xiaobo Tan, and Todd D Murphey. Derivative-based koopman operators for real-time control of robotic systems. IEEE Transactions on Robotics, 37(6):2173–2192, 2021.
[13] Ivan Markovsky and Florian Dörfler. Data-driven dynamic interpolation and approximation. Automatica, 135:110008, 2022.
[14] Alexandre Mauroy, Yoshihiko Susuki, and Igor Mezić. Introduction to the koopman operator in dynamical systems and control theory. The koopman operator in systems and control: concepts, methodologies, and applications, pages 3–33, 2020.
[15] Chanin Panjapornpon, Masoud Soroush, and Warren D Seider. A model-based control method applicable to unstable, non-minimum-phase, nonlinear processes. In Proceedings of the 2004 American Control Conference, volume 4, pages 2921–2924. IEEE, 2004.
[16] Gianmarco Rallo, Simone Formentin, and Sergio M Savaresi. On data-driven control design for non-minimum-phase plants: A comparative view. In 2016 IEEE 55th Conference on Decision and Control (CDC), pages 7159–7164. IEEE, 2016.
[17] Chiluka Suresh Kumar, Seshagiri Rao Ambati, Shirish H Sonawane, and Gara Uday Bhaskar Babu. Robust control of discrete minimum and non-minimum phase systems via data-driven virtual reference feedback tuning and imc. The Canadian Journal of Chemical Engineering.
[18] Klaske Van Heusden, Alireza Karimi, and Dominique Bonvin. Data-driven model reference control with asymptotically guaranteed stability. International Journal of Adaptive Control and Signal Processing, 25(4):331–351, 2011.
[19] Jurgen van Zundert and Tom Oomen. Stable inversion of lptv systems with application in position-dependent and non-equidistantly sampled systems. International Journal of Control, 92(5):1022–1032, 2019.
[20] Leon Liangwu Yan and Santosh Devasia. Precision data-enabled koopman-type inverse operators for linear systems. IFAC-PapersOnLine, 55(37):181–186, 2022.
[21] Chengpu Yu and Michel Verhaegen. Data-driven fault estimation of non-minimum phase lti systems. Automatica, 92:181–187, 2018.
[22] Qingze Zou and Santosh Devasia. Preview-based stable-inversion for output tracking of linear systems. 1999.

	$\displaystyle\|\|\eta(t)-\eta_{N}(t)\|\|=$	$\displaystyle\|\|\int_{-\infty}^{t-N\Delta t}\phi(t-\tau)\mathbf{A}_{3}y_{d}(\tau)d\tau+$
		$\displaystyle\int_{t+N\Delta t}^{\infty}\phi(t-\tau)\mathbf{A}_{3}y_{d}(\tau)d\tau+$
		$\displaystyle\int_{-\infty}^{+\infty}\phi(t-\tau)\mathbf{G}_{2}w(\tau)d\tau\|\|$
	$\displaystyle\leq$	$\displaystyle M\|\|\mathbf{A}_{3}\|\|(\|\|\int_{-\infty}^{t-N\Delta t}\kappa_{1}e^{-\alpha_{1}(t-\tau)}d\tau\|\|+$
		$\displaystyle\|\|\int_{t+N\Delta t}^{\infty}\kappa_{2}e^{-\alpha_{2}(t-\tau)}d\tau\|\|)+$
		$\displaystyle\sup\limits_{t\in\mathbb{R}}\|w(t)\|\cdot\|\|\mathbf{G}_{2}\|\|$
		$\displaystyle(\|\|\int_{-\infty}^{t}\kappa_{1}e^{-\alpha_{1}(t-\tau)}d\tau\|\|+$
		$\displaystyle\|\|\int_{t}^{\infty}\kappa_{2}e^{-\alpha_{2}(t-\tau)}d\tau\|\|)$
	$\displaystyle=$	$\displaystyle M\|\|\mathbf{A}_{3}\|\|(\frac{\kappa_{1}}{\alpha_{1}}e^{-\alpha_{1}N\Delta t}+\frac{\kappa_{2}}{\alpha_{2}}e^{-\alpha_{2}N\Delta t})+$
		$\displaystyle\sup\limits_{t\in\mathbb{R}}\|w(t)\|\cdot\|\|\mathbf{A}_{2}\|\|(\frac{\kappa_{1}}{\alpha_{1}}+\frac{\kappa_{2}}{\alpha_{2}})$

	$\displaystyle\|\|\eta(t)-\hat{\eta}_{N}(t)\|\|$	$\displaystyle\leq\|\|\eta_{N}(t)-\hat{\eta}_{N}(t)\|\|+\|\|\eta(t)-\eta_{N}(t)\|\|$		(19)
		$\displaystyle\leq\beta e^{-\alpha N\Delta t}+\gamma\sup\limits_{t\in\mathbb{R}}\|w(t)\|+\epsilon_{1}$		(19)

	$\displaystyle\|u(t)-\hat{u}(t)\|<$	$\displaystyle\frac{1}{k}(\|\|\mathbf{s}\|\|(\beta e^{-\alpha N\Delta t}+\epsilon_{1})+$		(21)
		$\displaystyle(\|\|\mathbf{s}\|\|\gamma+g)\sup\limits_{t\in\mathbb{R}}\|w(t)\|)$		(21)

	$\displaystyle\|\hat{y}_{d}(t)-y_{d}(t)\|=$	$\displaystyle\|\mathbf{C}(e^{\mathbf{A}t}(\hat{\mathbf{x}}(0)-\mathbf{x}(0))+$
		$\displaystyle\int_{0}^{t}e^{\mathbf{A}(t-\tau)}(\hat{u}(\tau)-u(\tau))d\tau\|$
	$\displaystyle\leq$	$\displaystyle\|\|\mathbf{C}\|\|(\beta_{3}e^{-t\alpha_{3}}\cdot\|\|\mathbf{x}(0)-\hat{\mathbf{x}}(0)\|\|+$
		$\displaystyle\frac{1}{k}(\|\|\mathbf{s}\|\|(\beta e^{-\alpha N\Delta t}+\epsilon_{1})+$
		$\displaystyle(\|\|s\|\|\gamma+g)\sup\limits_{t\in\mathbb{R}}\|w(t)\|)\frac{\beta_{3}}{\alpha_{3}}(1-e^{-\alpha_{3}t}))$

		$\displaystyle\lim\limits_{t\to+\infty}\|y(t)-y_{d}(t)\|$
	$\displaystyle\leq$	$\displaystyle\lim\limits_{t\to+\infty}\|y(t)-\hat{y}_{d}(t)\|+\|\hat{y}_{d}(t)-y_{d}(t)\|$
	$\displaystyle\leq$	$\displaystyle\sup\limits_{t\in\mathbb{R}}\|h(t)\|+\|\|\mathbf{C}\|\|\frac{\|\|\mathbf{s}\|\|\beta_{3}}{k\alpha_{3}}(\epsilon_{1}+\beta e^{-\alpha N\Delta t})+$
		$\displaystyle\frac{\beta_{3}}{\alpha_{3}}(\|\|\mathbf{s}\|\|\gamma+g)\sup\limits_{t\in\mathbb{R}}\|w(t)\|$