Integrated Adaptive Control and Reference Governors for Constrained Systems with State-Dependent Uncertainties

Pan Zhao^1,∗, Ilya Kolmanovsky², Naira Hovakimyan¹ This work is supported by AFOSR, NASA and NSF under the NRI grant #1830639, CPS grant #1932529, and AI Institute: Planning grant #2020289.¹P. Zhao and N. Hovakimyan are with the Department of Mechanical Science and Engineering, University of Illinois at Urbana-Champaign, Urbana, IL 61801, USA. Email: {panzhao2, nhovakim}@illinois.edu. Corresponding author: P. Zhao.²I. Kolmanovsky is with the Department of Aerospace Engineering, University of Michigan, Ann Arbor, MI 48109, USA. Email: [email protected].

Abstract

This paper presents an adaptive reference governor (RG) framework for a linear system with matched nonlinear uncertainties that can depend on both time and states, subject to both state and input constraints. The proposed framework leverages an ${\mathcal{L}_{1}}$ adaptive controller ( ${\mathcal{L}_{1}}$ AC) that estimates and compensates for the uncertainties, and provides guaranteed transient performance, in terms of uniform bounds on the error between actual states and inputs and those of a nominal (i.e., uncertainty-free) system. The uniform performance bounds provided by the ${\mathcal{L}_{1}}$ AC are used to tighten the pre-specified state and control constraints. A reference governor is then designed for the nominal system using the tightened constraints, and guarantees robust constraint satisfaction. Moreover, the conservatism introduced by the constraint tightening can be systematically reduced by tuning some parameters within the ${\mathcal{L}_{1}}$ AC. Compared with existing solutions, the proposed adaptive RG framework can potentially yield less conservative results for constraint enforcement due to the removal of uncertainty propagation along a prediction horizon, and improved tracking performance due to the inherent uncertainty compensation mechanism. Simulation results for a flight control example illustrate the efficacy of the proposed framework.

Index Terms:

Constrained Control; Robust Adaptive control; Uncertain Systems; Reference Governor

I Introduction

There has been a growing interest in developing control methods that can handle state and/or input constraints. Examples of such constraints include actuator magnitude and rate limits, bounds imposed on process variables to ensure safe and efficient system operation, and collision/obstacle avoidance requirements. There are several choices for a control practitioner when dealing with constraints. One choice is to adopt the model predictive control (MPC) framework [1, 2], in which the state and input constraints can be incorporated into the optimization problem for computing the control signals. Another route is to augment a well-designed nominal controller, that already achieves high performance for small signals, with constraint handling capability that protects the system against constraint violations in transients for large signals. The second route is attractive to practitioners who are interested in preserving an existing/legacy controller or are concerned with the computational cost, tuning complexity, stability, robustness, certification issues, and/or other requirements satisfactorily addressed by the existing controller. The reference governor (RG) is an example of the second approach. As its name suggests, RG is an add-on scheme for enforcing pointwise-in-time state and control constraints by modifying the reference command to a well-designed closed-loop system. The RG acts like a pre-filter that, based on the current value of the desired reference command $r(t)$ and of the states (measured or estimated) $x(t)$ , generates a modified reference command $v(t)$ which avoids constraint violations. Since its advent, variants of RGs have been proposed for both linear and nonlinear systems. See the survey paper [3] and references therein. While RG has been extensively studied for systems for which exact dynamic models are available, the design of RG for uncertain systems, i.e., systems with unknown parameters, state-dependent uncertainties, unmodelled dynamics and/or external disturbances, has been less addressed.

I-A Related Work

Robust Approaches: As mentioned in [3], the RG can be straightforwardly modified to handle unmeasured set-bounded disturbances by taking into account all possible realizations of the disturbances when determining the maximal output admissible set [4]. For uncertain systems, various robust or tube MPC schemes have also been proposed [5, 6, 7, 8, 9, 10, 11] and summarized in [12], most of which consider parametric uncertainties and bounded disturbances with only a few exceptions (e.g., [10, 11]) that consider state-dependent uncertainties. However, robust approaches often lead to conservative results when the disturbances are large.

Adaptive and uncertainty compensation based approaches could potentially achieve less conservative results than robust approaches. Along these lines, various adaptive MPC strategies with performance guarantees have been proposed for systems with unknown parameters [13, 14, 15] and state-dependent uncertainties [16, 17]. In particular, [15] uses an ${\mathcal{L}_{1}}$ adaptive controller [18] to compensate for matched parametric uncertainties so that the uncertain plant behaves close to a nominal model, and uses robust MPC to handle the error between the combined system, consisting of the uncertain plant and the adaptive controller, and the nominal model. To the best of our knowledge, all of the existing adaptive MPC solutions, including [15] involve propagation of uncertainties along a prediction horizon. Reference [19] merged a Lyapunov function based RG with a disturbance cancelling controller based on an input observer to achieve non-conservative treatment of uncertainties. Unfortunately, a bound on the rate of change of the disturbance is needed for the design, which is often difficult to obtain when the disturbance is dependent on states. Additionally, input constraints were not considered in that work.
State-dependent uncertainties (SDUs): If a system is affected by SDUs, and the states are limited to a compact set, it is always possible to bound the SDU with a worst-case value and to apply the robust approaches (e.g., robust or tube MPC [5, 6, 7]) developed for bounded disturbances. However, by accounting for the state dependence, one can improve performance and reduce conservatism, as demonstrated in robust MPC solutions in [20, 11]. Adaptive MPC solutions which account for SDUs have been proposed in [16, 17]. These solutions essentially rely on computing the uncertainty or state bounds along the prediction horizon using the Lipschitz proprieties of SDUs, and solving a robust MPC problem, using the computed bounds.

I-B Contributions

The contributions of this paper are as follows. Firstly, for constrained control under uncertainties, we develop an ${\mathcal{L}_{1}}$ -RG framework for linear systems with matched nonlinear uncertainties that could depend on both time and states, and with both input and state constraints. Our adaptive robust RG framework leverages an ${\mathcal{L}_{1}}$ adaptive controller ( ${\mathcal{L}_{1}}$ AC) to estimate and compensate for the uncertainties, and to guarantee uniform bounds on the error between actual states and inputs and those of a nominal (i.e., uncertainty-free) closed-loop system. These uniform bounds characterize tubes in which actual states and control inputs are guaranteed to stay despite the uncertainties. A reference governor designed for the nominal system with constraints tightened using these uniform bounds guarantees robust constraint satisfaction in the presence of uncertainties. Additionally, we show that these uniform bounds on state and input errors, and thus the conservatism induced by constraint tightening can be arbitrarily reduced in theory by tuning the filter bandwidth and estimation sample time parameters of the ${\mathcal{L}_{1}}$ AC. Secondly, as a separate contribution to ${\mathcal{L}_{1}}$ adaptive control, we propose a novel scaling technique that allows deriving separate tight uniform bounds on each state and adaptive control input, as opposed to a single bound for all states, or adaptive control inputs in existing ${\mathcal{L}_{1}}$ AC solutions [18]. The ability to provide such separate tight bounds makes an ${\mathcal{L}_{1}}$ AC particularly attractive to be integrated with an RG for simultaneous constraint enforcement and improved trajectory tracking. Thirdly, we validate the efficacy of the proposed ${\mathcal{L}_{1}}$ -RG framework on a flight control example and we compare it with both baseline and robust RG solutions in simulations.

Compared to existing literature, in particular, robust/adaptive MPC, ${\mathcal{L}_{1}}$ -RG has the following novel aspects:

•

Thanks to the uncertainty compensation and transient performance guarantees available for the ${\mathcal{L}_{1}}$ AC, ${\mathcal{L}_{1}}$ -RG, (under suitable assumptions,) does not require uncertainty propagation along the prediction horizon. This uncertainty propagation is generally required in all existing robust and adaptive MPC approaches, and incurs conservatism, which is avoided by ${\mathcal{L}_{1}}$ -RG.
•

${\mathcal{L}_{1}}$ -RG simultaneously improves tracking performance and enforces the constraints, while existing robust/disturbance-observer-based RG or robust/adaptive MPC solutions except a few such as [15, 19], focus on constraint satisfaction only.
•

Within ${\mathcal{L}_{1}}$ -RG, the uniform bounds on the state and input errors (used for constraint tightening) and thus the conservatism induced by constraint tightening can be made arbitrarily small, which cannot be achieved by existing methods.
•

${\mathcal{L}_{1}}$ -RG is able to handle uncertainties that can nonlinearly depend on both time and states. Such a case has not been considered by previous adaptive MPC solutions that are based on uncertainty compensation. For instance, the solution in [15], which also leverages an ${\mathcal{L}_{1}}$ AC, only treats parametric uncertainties and state constraints.

The paper is structured as follows. Section II formally states the problem. Section III provides an overview of the proposed solution and discusses preliminaries related to RG and ${\mathcal{L}_{1}}$ AC design. Section IV introduces a scaling technique to derive separate and tight performance bounds for an ${\mathcal{L}_{1}}$ AC, while Section V presents synthesis and performance analysis of the proposed ${\mathcal{L}_{1}}$ -RG framework. Section VI includes validation of the proposed ${\mathcal{L}_{1}}$ -RG framework on a flight control problem in simulations.

Notations: Let $\mathbb{R}$ , $\mathbb{R}_{+}$ and $\mathbb{Z}_{+}$ denote the set of real, non-negative real, and non-negative integer numbers, respectively. $\mathbb{R}^{n}$ and $\mathbb{R}^{m\times n}$ denote the $n$ -dimensional real vector space and the set of real $m$ by $n$ matrices, respectively. $\mathbb{Z}_{i}$ and $\mathbb{Z}_{1}^{n}$ denote the integer sets $\{i,i+1,\cdots\}$ and $\{1,2,\cdots,n\}$ , respectively. $I_{n}$ denotes an identity matrix of size $n$ , and $0$ is a zero matrix of a compatible dimension. $\left\lVert\cdot\right\rVert$ and $\left\lVert\cdot\right\rVert_{\infty}$ denote the $2$ -norm and $\infty$ -norm of a vector or a matrix, respectively. The $\mathcal{L}_{\infty}$ - and truncated $\mathcal{L}_{\infty}$ -norm of a function $x:\mathbb{R}_{+}\rightarrow\mathbb{R}^{n}$ are defined as $\left\lVert x\right\rVert_{\mathcal{L}_{\infty}}\triangleq\sup_{t\geq 0}\left\lVert x(t)\right\rVert_{\infty}$ and $\left\lVert x\right\rVert_{\mathcal{L}_{\infty}^{[0,T]}}\triangleq\sup_{0\leq t\leq T}\left\lVert x(t)\right\rVert_{\infty}$ , respectively. The Laplace transform of a function $x(t)$ is denoted by $x(s)\triangleq\mathfrak{L}[x(t)]$ . For a vector $x$ , $x_{i}$ denotes the $i$ th element of $x$ . Given a positive scalar $\rho$ , $\Omega(\rho)\triangleq\{z\in\mathbb{R}^{n}:\left\lVert z\right\rVert_{\infty}\leq\rho\}$ denotes a high dimensional ball set of radius $\rho$ and centered at the origin, while its dimension $n$ can be deduced from the context. For a high-dimensional set $\mathcal{X}$ , $\textup{int}(\mathcal{X})$ denotes the interior of $\mathcal{X}$ and $\mathcal{X}_{i}$ denotes the projection of $\mathcal{X}$ onto the $i$ th coordinate. For given sets $\mathcal{X},{\mathcal{Y}}\subset\mathbb{R}^{n}$ , $\mathcal{X}\oplus{\mathcal{Y}}\triangleq\{x+y:x\in\mathcal{X},y\in{\mathcal{Y}}\}$ is the Minkowski set sum and $\mathcal{X}\ominus{\mathcal{Y}}\triangleq\{z:z+y\in\mathcal{X},\forall y\in{\mathcal{Y}}\}$ is the Pontryagin set difference.

II Problem statement

Consider an uncertain linear system represented by

\left\{\begin{aligned} \dot{x}(t)&=Ax(t)+B(u(t)+f(t,x(t))),\hfill\\ y(t)&=Cx(t),\ x(0)=x_{0},\\ \end{aligned}\right.

(1)

where $x(t)\in\mathbb{R}^{n}$ , $u(t)\in\mathbb{R}^{m}$ and $y(t)\in\mathbb{R}^{m}$ are the state, input and output vectors, respectively, $x_{0}\in\mathbb{R}^{n}$ is the initial state vector, $f(t,x(t))\in\mathbb{R}^{m}$ denotes the uncertainty that can depend on both time and states, and $A,~{}B,$ and $C$ are matrices of compatible dimensions. We want to design a control law for $u(t)$ such that the output vector $y(t)$ tracks a reference signal $r(t)$ while satisfying the specified state and control constraints:

\begin{gathered}x(t)\in\mathcal{X},\quad u(t)\in\mathcal{U},\quad\forall t\geq 0,\hfill\end{gathered}

(2)

where $\mathcal{X}\subset\mathbb{R}^{n}$ and ${\mathcal{U}}\subset\mathbb{R}^{m}$ are pre-specified convex and compact sets with $0$ in the interior. Note that 2 can also represent constraints on some of the states and/or inputs.

Suppose a baseline controller is available and achieves desired performance for the nominal (i.e., uncertainty-free) system given a small desired reference command $r(t)$ to track. To enforce state and input constraints 2 for the nominal system with larger signals, one can simply leverage the conventional RG, which will generate a modified reference command $v(t)$ based on $r(t)$ . In such a case, the baseline controller can be selected as

u_{\textup{b}}(t)=K_{x}x(t)+K_{v}v(t),

(3)

where $K_{x}$ and $K_{v}$ are feedback and feedforward gains. For both improved tracking performance and constraint enforcement in the presence of the uncertainty $f(t,x)$ , we leverage an ${\mathcal{L}_{1}}$ AC. To this end, we adopt a compositional control law:

u(t)=u_{\textup{b}}(t)+u_{\textup{a}}(t),

(4)

where $u_{\textup{a}}(t)$ is the vector of the adaptive control inputs designed to cancel $f(t,x)$ . With 3, the uncertain system 1 can be rewritten as

\left\{\begin{aligned} \dot{x}(t)&={A_{m}}x(t)+{B_{v}}v(t)+B(u_{\textup{a}}(t)+f(t,x(t))),\\ y(t)&=Cx(t),\ x(0)=x_{0},\end{aligned}\right.

(5)

where ${A_{m}}\triangleq A+B{K_{x}}$ is a Hurwitz matrix and ${B_{v}}\triangleq B{K_{v}}$ .

The problem to be tackled can be stated as follows: Given an uncertain system 1, a baseline controller 3 and a desired reference signal $r(t)$ , design a RG (for determining $v(t)$ ) and the ${\mathcal{L}_{1}}$ AC for $u_{\textup{a}}(t)$ such that the output signal $y(t)$ tracks $r(t)$ whenever possible, while the state and input constraints 2 are satisfied. We make the following assumption on the uncertainty.

Assumption 1.

Given a compact set ${\mathcal{Z}}$ , there exist known positive constants $L_{f_{j},{\mathcal{Z}}}$ , $l_{f_{j},{\mathcal{Z}}}$ and $b_{f_{j},{\mathcal{Z}}}$ ( $j\in\mathbb{Z}_{1}^{m}$ ) such that for any $x,z\in{\mathcal{Z}}$ and $t,\tau\geq 0$ , the following inequalities hold for each $j\in\mathbb{Z}_{1}^{m}$ :


$\displaystyle\left\lvert f_{j}(t,x)-f_{j}(\tau,z)\right\rvert$	$\displaystyle\leq L_{f_{j},{\mathcal{Z}}}\left\lVert x-z\right\rVert_{\infty}+l_{f_{j},{\mathcal{Z}}}\left\lvert t-\tau\right\rvert,$	(6a)
$\displaystyle\left\lvert f_{j}(t,x)\right\rvert$	$\displaystyle\leq b_{f_{j},{\mathcal{Z}}},$	(6b)

where $f_{j}(t,x)$ denotes the $i$ th element of $f(t,x)$ .

Remark 1.

Assumption 1 indicates that in the compact set ${\mathcal{Z}}$ , $f_{j}(t,x)$ is Lipschitz continuous with respect to $x$ with a known Lipschitz constant $L_{f_{j},{\mathcal{Z}}}$ , has a bounded rate of variation $l_{f_{j},{\mathcal{Z}}}$ with respect to $t$ , and is uniformly bounded by a constant $b_{f_{j},{\mathcal{Z}}}$ .

In fact, given the local Lipschitz constant $L_{f_{j},{\mathcal{Z}}}$ and the bounded rate of variation $l_{f_{j},{\mathcal{Z}}}$ , a uniform bound for $f_{j}(t,x)$ in ${\mathcal{Z}}$ can always be derived if the bound on $f_{j}(t,x^{\ast})$ for an arbitrary $x^{\ast}$ in ${\mathcal{Z}}$ and any $t\geq 0$ is known. For instance, assuming we know $\left\lvert f_{j}(t,0)\right\rvert\leq b^{i}_{0}$ , from 6a, we have that $\left\lvert f_{j}(t,x)-f_{j}(t,0)\right\rvert\leq L_{f_{j},{\mathcal{Z}}}\left\lVert x\right\rVert_{\infty}$ , which immediately leads to $\left\lvert f_{j}(t,x)\right\rvert\leq b_{0}^{i}+L_{f_{j},{\mathcal{Z}}}\max_{x\in\mathcal{X}}\left\lVert x\right\rVert_{\infty}$ , for any $x\in{\mathcal{Z}}$ and $t\geq 0$ . In practice, some prior knowledge about the uncertainty (e.g., $f_{j}$ depends on only a few instead of all states) may be leveraged to obtain a tighter bound than the preceding one, derived using the Lipschitz continuity and triangular inequalities. This motivates the assumption on the uniform bound in 6b.

Under the conditions in Assumption 1, we immediately obtain that for any $x,z\in{\mathcal{Z}}$ and $t,\tau\geq 0$ ,


$\displaystyle\left\lVert f(t,x)-f(\tau,z)\right\rVert_{\infty}$	$\displaystyle\leq{L_{f,{\mathcal{Z}}}}\left\lVert x-z\right\rVert_{\infty}+l_{f,{\mathcal{Z}}}\left\lvert t-\tau\right\rvert,$	(7a)
$\displaystyle\left\lVert f(t,x)\right\rVert_{\infty}$	$\displaystyle\leq{b_{f,{\mathcal{Z}}}},$	(7b)

where

L_{f,{\mathcal{Z}}}=\max_{j\in\mathbb{Z}_{1}^{m}}L_{f_{j},{\mathcal{Z}}},\quad l_{f,{\mathcal{Z}}}=\max_{j\in\mathbb{Z}_{1}^{m}}l_{f_{j},{\mathcal{Z}}},\quad b_{f,{\mathcal{Z}}}=\max_{j\in\mathbb{Z}_{1}^{m}}b_{f_{j},{\mathcal{Z}}}.

(8)

Remark 2.

Our choice of making assumptions on $f_{j}(t,x)$ instead of on $f(t,x)$ as in 8 facilitates deriving an individual bound on each state and on each adaptive input (see Section IV for details).

Remark 3.

In principle, given the uniform bound on $f(t,x)$ in 7b obtained from Assumption 1, constraints can be enforced via robust RG or robust MPC approaches that handle bounded disturbances, as discussed in Section I-A. However, when this bound is large, robust approaches can yield overly conservative performance.

III Overview and Preliminaries

In this section, we first present an overview of the proposed ${\mathcal{L}_{1}}$ -RG framework and then introduce some preliminary results that provides a foundation for the ${\mathcal{L}_{1}}$ -RG framework.

III-A Overview of the ${\mathcal{L}_{1}}$ -RG Framework

Figure 1 depicts the proposed ${\mathcal{L}_{1}}$ -RG framework. As shown in Fig. 1, ${\mathcal{L}_{1}}$ -RG is comprised of two integrated components. The first one is an ${\mathcal{L}_{1}}$ AC designed to compensate for the uncertainty $f(t,x)$ and to guarantee uniform bounds on the errors between actual states and inputs, and those of the nominal closed-loop system:


$\displaystyle\dot{x}_{\textup{n}}(t)$	$\displaystyle={A_{m}}x_{\textup{n}}(t)+{B_{v}}v(t),\ x_{\textup{n}}(0)=x_{0},$	(9a)
$\displaystyle u_{\textup{n}}(t)$	$\displaystyle=K_{x}x_{\textup{n}}(t)+K_{v}v(t).$	(9b)

The second component is a RG designed for the nominal system 9a with tightened constraints computed using the uniform bounds guaranteed by the ${\mathcal{L}_{1}}$ AC.

Refer to caption — Figure 1: Diagram of the proposed ${\mathcal{L}_{1}}$ -RG framework

More formally, we will design the ${\mathcal{L}_{1}}$ AC to ensure

\displaystyle x(t)-x_{\textup{n}}(t)\in\tilde{\mathcal{X}},\quad u(t)-u_{\textup{n}}(t)\in\tilde{\mathcal{U}},\quad\forall t\geq 0,

(10)

where $x(t)$ and $u(t)$ are the vectors of states and of the total control inputs of the closed-loop system 5:

\displaystyle u(t)-u_{\textup{n}}(t)=K_{x}(x(t)-x_{\textup{n}}(t))+u_{\textup{a}}(t),

(11)

where $u(t)$ is given by 4 and $u_{\textup{n}}(t)$ by 9b, and $\tilde{\mathcal{X}}$ and $\tilde{\mathcal{U}}$ are some pre-computed hyperrectangular sets dependent on the properties of $f(t,x)$ and of the ${\mathcal{L}_{1}}$ AC. The details will be given in Theorem 3 in Section III-C. Define

\mathcal{X}_{\textup{n}}\triangleq\mathcal{X}\ominus\tilde{\mathcal{X}},\quad{\mathcal{U}}_{\textup{n}}\triangleq{\mathcal{U}}\ominus\tilde{\mathcal{U}}.

(12)

Then for robust constraint enforcement, one just needs to design a RG for the nominal system 9 with tightened constraints given by

x_{\textup{n}}(t)\in\mathcal{X}_{\textup{n}},\ u_{\textup{n}}(t)\in{\mathcal{U}}_{\textup{n}},\quad\forall t\geq 0.

(13)

III-B Reference Governor Design for a Nominal System

We now introduce the RG for the nominal system 9 to enforce the constraints 13. We use the discrete-time RG approach of [3] that uses a discrete-time model:

\left\{\begin{aligned} \mathrm{x_{\textup{n}}}(k+1)&=\hat{A}_{m}\mathrm{x_{\textup{n}}}(k)+\hat{B}_{v}\mathrm{v}(k),\ \mathrm{x}(0)=x_{0},\\ \mathrm{u}_{\textup{n}}(k)&=K_{x}\mathrm{x_{\textup{n}}}(k)+K_{v}\mathrm{v}(k),\end{aligned}\right.

(14)

where $\mathrm{x_{\textup{n}}}(k)$ , $\mathrm{v}(k)$ and $\mathrm{u_{\textup{n}}}(k)$ denotes the vectors of states, of reference command inputs, and of nominal control inputs, respectively, and $\hat{A}_{m}$ and $\hat{B}_{v}$ are computed from $A_{m}$ and $B_{v}$ in 9 assuming a sampling time, $T_{d}$ . When doing the discretization, we ensure that the discrete-time system 14 has the same states as the continuous-time system at all sampling instants. This can be achieved by using the zero-order hold discretization, since

v(t)=\mathrm{v}(kT_{d}),\quad\forall t\in[kT_{d},(k+1)T_{d}),

(15)

which indicates that $v(t)$ is piecewise constant. The constraints 13 are imposed in discrete-time as

\mathrm{x_{\textup{n}}}(k)\in\hat{\mathcal{X}}_{\textup{n}},\ \mathrm{u_{\textup{n}}}(k)\in\hat{\mathcal{U}}_{\textup{n}},\quad\forall k\in\mathbb{Z}_{+},

(16)

where $\hat{\mathcal{X}}_{\textup{n}}$ and $\hat{\mathcal{U}}_{\textup{n}}$ are tightened versions of $\mathcal{X}_{\textup{n}}$ and ${\mathcal{U}}_{\textup{n}}$ , respectively, introduced to avoid inter-sample constraint violations, and are defined by


$\displaystyle\hat{\mathcal{X}}_{\textup{n}}$	$\displaystyle\triangleq\mathcal{X}_{\textup{n}}\ominus\left\{z\in\mathbb{R}^{n}:\left\lVert z\right\rVert_{\infty}\leq\nu(T_{d})\right\},$	(17a)
$\displaystyle\hat{\mathcal{U}}_{\textup{n}}$	$\displaystyle\triangleq{\mathcal{U}}_{n}\ominus\left\{z\in\mathbb{R}^{m}:\left\lVert z\right\rVert_{\infty}\leq\left\lVert K_{x}\right\rVert_{\infty}\nu(T_{d})\right\},$	(17b)

while

\nu(T_{d})\!\triangleq\!\max_{\tau\in[0,T_{d}]}\!\left\lVert e^{A_{m}\tau}\!\!-\!I_{n}\right\rVert_{\infty}\!\max_{x\in\mathcal{X}_{n},v\in\mathcal{V}}\!\left\lVert x\!+\!A_{m}^{-1}B_{v}v\right\rVert_{\infty}\!,

(18)

with $\mathcal{V}$ denoting the set of all possible reference commands output by the RG.

The following lemma formally guarantees that no inter-sample constraint violations will happen for the continuous-time system 9 when the constraints for the discrete-time system 21 are satisfied at all sampling instants.

Lemma 1.

Consider the continuous-time system 9 and its discrete-time counterpart 14 that has the same states as 9 at all sampling instants. If for the discrete-time system 14,

\mathrm{x_{\textup{n}}}(k)\in\hat{\mathcal{X}}_{\textup{n}},\ \mathrm{u_{\textup{n}}}(k)\in\hat{\mathcal{U}}_{\textup{n}},\quad k\in\mathbb{Z}_{+},

(19)

with $\hat{\mathcal{X}}_{\textup{n}}$ and $\hat{\mathcal{U}}_{\textup{n}}$ defined in 17, then 13 holds for the continuous-time system 9.

Proof.

See Section A-A. ∎

Remark 4.

From 18 and 17, we can see that $\hat{\mathcal{X}}_{\textup{n}}$ and $\hat{\mathcal{U}}_{\textup{n}}$ are close to $\mathcal{X}_{\textup{n}}$ and ${\mathcal{U}}_{\textup{n}}$ , respectively, when $T_{d}$ is small. For practical implementation, inter-sample constraint violations may not be a big concern when $T_{d}$ is small. Under such case, we can simply set $\hat{\mathcal{X}}_{\textup{n}}=\mathcal{X}_{\textup{n}}$ and $\hat{\mathcal{U}}_{\textup{n}}={\mathcal{U}}_{\textup{n}}$ .

Define

\mathrm{y}_{\textup{n}}^{c}(k)\!\triangleq\!\begin{bmatrix}\mathrm{x_{\textup{n}}}(k)\\ K_{x}\mathrm{x_{\textup{n}}}(k)\!+\!K_{v}\mathrm{v}(k)\end{bmatrix}\!=\!\underbrace{\begin{bmatrix}I_{n}\\ K_{x}\end{bmatrix}}_{\triangleq\ C_{c}}\mathrm{x_{\textup{n}}}(k)\!+\!\underbrace{\begin{bmatrix}0\\ K_{v}\end{bmatrix}}_{\triangleq\ D_{c}}\mathrm{v}(k).

(20)

Then, the constraints 16 can be rewritten as

\mathrm{y}_{\textup{n}}^{c}(k)\in{\mathcal{Y}}_{\textup{n}}\triangleq\hat{\mathcal{X}}_{\textup{n}}\times\hat{\mathcal{U}}_{\textup{n}},\quad\forall k\in\mathbb{Z}_{+},

(21)

where $\times$ denotes the cross product.

Remark 5.

In case there are no constraints on certain states and/or inputs, one can remove the rows of $C_{c}$ and $D_{c}$ defined in 20 corresponding to these states and/or inputs, and adjust the sets $\hat{\mathcal{X}}_{\textup{n}}$ , $\hat{\mathcal{U}}_{\textup{n}}$ and ${\mathcal{Y}}_{\textup{n}}$ accordingly.

Similar to most RG schemes, the RG scheme we adopt here computes at each time instant a command $\mathrm{v}(k)$ such that, if it is constantly applied from the time instant $k$ onward, the ensuing output will always satisfy the constraints. More formally, we define the maximal output admissible set $O_{\infty}$ [21] as the set of all states $\mathrm{x_{\textup{n}}}$ and inputs $\mathrm{v}$ , such that the predicted response from the initial state $\mathrm{x_{\textup{n}}}$ and with a constant input $\mathrm{v}$ satisfies the constraints 21, i.e.,

O_{\infty}\triangleq\{(\mathrm{v},\mathrm{x_{\textup{n}}}):\hat{\mathrm{y}}_{\textup{n}}^{c}(k|\mathrm{v},\mathrm{x_{\textup{n}}})\in{\mathcal{Y}}_{n},\ \forall k\in\mathbb{Z}_{+}\},

(22)

where for system 14 the output prediction $\hat{\mathrm{y}}_{\textup{n}}^{c}(k|\mathrm{x_{\textup{n}}},\mathrm{v})$ is given by

\displaystyle\hat{\mathrm{y}}_{\textup{n}}^{c}(k|\mathrm{v},\mathrm{x_{\textup{n}}})\!=~{}C_{c}\hat{A}_{m}^{k}\mathrm{x_{\textup{n}}}\!+\!C_{c}\sum_{j=1}^{k}\hat{A}_{m}^{j-1}\hat{B}_{v}\mathrm{v}\!+\!\!D_{c}\mathrm{v}=C_{c}\hat{A}_{m}^{k}\mathrm{x_{\textup{n}}}\!+\!C_{c}(I_{n}\!-\!\!\hat{A}_{m})^{\!-1}(I_{n}\!-\!\!\hat{A}_{m}^{k})\hat{B}_{v}\mathrm{v}\!+\!\!D_{c}\mathrm{v}.

(23)

Define $\tilde{O}_{\infty}$ as a slightly tightened version of $O_{\infty}$ obtained by constraining the command $\mathrm{v}$ so that the associated steady-state output $\bar{\mathrm{y}}_{\textup{n}}^{c}=(D_{c}+C_{c}(I_{n}\!-\!\hat{A}_{m})^{-1}\hat{B}_{v})\mathrm{v}$ satisfies constraints with a nonzero (typically small) margin $\epsilon>0$ , i.e.,

{\tilde{O}}_{\infty}=O_{\infty}\cap O^{\epsilon},

(24)

where $O^{\epsilon}\triangleq\{(\mathrm{v},\mathrm{x_{\textup{n}}}):\bar{\mathrm{y}}_{\textup{n}}^{c}\in(1-\epsilon){\mathcal{Y}}_{\textup{n}}\}.$ Clearly, ${\tilde{O}}_{\infty}$ can be made arbitrarily close to $O_{\infty}$ by decreasing $\epsilon$ . Based on the currently available state $\mathrm{x_{\textup{n}}}(k)$ at an instant $k$ , the RG computes $\mathrm{v}(k)$ so that

(\mathrm{v}(k),\mathrm{x_{\textup{n}}}(k))\in\tilde{O}_{\infty}.

(25)

It is proven in [21] that if $\hat{A}_{m}$ is Schur, $(\hat{A}_{m},C_{c})$ is observable, ${\mathcal{Y}}_{\textup{n}}$ is compact with $0$ in the interior, and $\epsilon>0$ is sufficiently small, then the set ${\tilde{O}}_{\infty}$ is finitely determined, i.e., there exists a finite index $k^{\star}$ such that

{\tilde{O}}_{\infty}\!=\tilde{O}_{k^{\star}}=\!\{(\mathrm{v},\mathrm{x_{\textup{n}}})\!:\hat{\mathrm{y}}_{\textup{n}}^{c}(k|\mathrm{v},\mathrm{x_{\textup{n}}})\!\in\!{\mathcal{Y}}_{n},\ k=0,1,\dots,k^{\star}\}\cap O^{\epsilon}.

(26)

Moreover, ${\tilde{O}}_{\infty}$ is positively invariant, which means that if $(\mathrm{v}(k),\mathrm{x_{\textup{n}}}(k))\in{\tilde{O}}_{\infty}$ and $\mathrm{v}(k)$ is applied to the system at time $k$ , then $(\mathrm{v}(k),\mathrm{x_{\textup{n}}}(k+1))\in{\tilde{O}}_{\infty}$ . Furthermore, if ${\mathcal{Y}}_{\textup{n}}$ is convex, then ${\tilde{O}}_{\infty}$ is also convex.

Remark 6.

The process of computing $k^{\star}$ involves computing sets $\tilde{O}_{k}$ for $k=1,2,\dots$ , and checking the condition $\tilde{O}_{k}=\tilde{O}_{k+1}$ ; $k^{\star}$ is the minimum $k$ for which this condition holds.

The proposed ${\mathcal{L}_{1}}$ -RG framework can leverage most of existing RG schemes developed for uncertainty-free systems. As an illustration and demonstration in Section VI, we choose the scalar RG introduced in [22, 23]. The scalar RG computes at each time instant $k$ a command $\mathrm{v}(k)$ which is the best approximation of the desired set-point $\mathrm{r}(k)$ along the line segment connecting $\mathrm{v}(k-1)$ and $\mathrm{r}(k)$ that ensures $(\mathrm{v}(k),\mathrm{x_{\textup{n}}}(k))\in{\tilde{O}}_{\infty}$ . More specifically, the scalar RG solves at each discrete time $k$ , the following optimization problem:


$\displaystyle\kappa(k)=\max_{\kappa\in[0,1]}$	$\displaystyle\kappa$	(27a)
s.t.	$\displaystyle\mathrm{v}=\mathrm{v}(k-1)+\kappa(\mathrm{r}(k)-\mathrm{r}(k-1)),$	(27b)
	$\displaystyle(\mathrm{v},\mathrm{x_{\textup{n}}}(k))\in{\tilde{O}}_{\infty},$	(27c)

where $\kappa(k)$ is a scalar adjustable bandwidth parameter and $\mathrm{v}(k)=\mathrm{v}(k-1)+\kappa(k)(\mathrm{r}(k)-\mathrm{v}(k-1))$ is the modified reference command to be applied to the system. If there is no danger of constraint violation, $\kappa(k)=1$ and $\mathrm{v}(k)=\mathrm{r}(k)$ so that the RG does not interfere with the desired operation of the system. If $\mathrm{v}(k)=\mathrm{r}(k)$ would cause a constraint violation, the value of $\kappa(k)$ is decreased by the RG. In the extreme case, $\kappa(k)=0$ , $\mathrm{v}(k)=\mathrm{v}(k-1)$ , which means that the RG momentarily isolates the system from further variations of the reference command for constraint enforcement. Due to the positive invariance of ${\tilde{O}}_{\infty}$ , $\mathrm{v}(k)=\mathrm{v}(k-1)$ always satisfies the constraints, which ensures recursive feasibility under the condition that at $t=0$ a command $\mathrm{v}(0)$ is known such that $\left(\mathrm{v}(0),\mathrm{x_{\textup{n}}}(0)\right)\in{\tilde{O}}_{\infty}$ . Response properties of the scalar RG, including conditions for the finite-time convergence of $\mathrm{v}(k)$ to $\mathrm{r}(k)$ are detailed in [23].

III-C ${\mathcal{L}_{1}}$ Adaptive Control Design and Uniform Performance Bounds

We now present an ${\mathcal{L}_{1}}$ AC that guarantees the bounds in (10), without considering the state and control constraints in 2. We first recall some basic definitions and facts from control theory, and introduce some definitions and lemmas.

Definition 1.

[24, Section III.F] For a stable proper MIMO system $\mathcal{H}(s)$ with input $\mathrm{u}(t)\in\mathbb{R}^{m}$ and output $\mathrm{y}(t)\in\mathbb{R}^{p}$ , its ${\mathcal{L}_{1}}$ norm is defined as

\left\lVert\mathcal{H}(s)\right\rVert_{\mathcal{L}_{1}}\triangleq\sup_{\mathrm{x}(0)=0,\left\lVert\mathrm{u}\right\rVert_{\mathcal{L}_{\infty}}\leq 1}{\left\lVert\mathrm{y}\right\rVert_{\mathcal{L}_{\infty}}}.\vspace{-2mm}

(28)

The following lemma follows directly from Definition 1.

Lemma 2.

For a stable proper MIMO system $\mathcal{H}(s)$ with states $\mathrm{x}(t)\in\mathbb{R}^{n}$ , inputs $\mathrm{u}(t)\in\mathbb{R}^{m}$ and outputs $\mathrm{y}(t)\in\mathbb{R}^{p}$ , under zero initial states, i.e., $\mathrm{x}(0)=0$ , we have $\left\lVert\mathrm{y}\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}\leq\left\lVert\mathcal{H}(s)\right\rVert_{\mathcal{L}_{1}}\left\lVert\mathrm{u}\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}$ , for any $\tau\geq 0$ . Furthermore, for any matrix $\mathrm{T}\in\mathbb{R}^{q\times p}$ , we have $\left\lVert\mathrm{T}\mathcal{H}(s)\right\rVert_{\mathcal{L}_{\infty}}\leq\left\lVert\mathrm{T}\right\rVert_{\infty}\left\lVert\mathcal{H}(s)\right\rVert_{\mathcal{L}_{\infty}}$ .

A unique feature of an $\mathcal{L}_{1}$ AC is a low-pass filter ${\mathcal{C}}(s)$ (with DC gain ${\mathcal{C}}(0)=I_{m}$ ) that decouples the estimation loop from the control loop, thereby allowing for arbitrarily fast adaptation without sacrificing the robustness [18]. For simplicity, we can select ${\mathcal{C}}(s)$ to be a first-order transfer function matrix

{\mathcal{C}}(s)=\textup{diag}({\mathcal{C}}_{1}(s),\dots,{\mathcal{C}}_{m}(s)),\ {\mathcal{C}}_{j}(s)\triangleq\frac{k_{f}^{j}}{(s+k_{f}^{j})},\ j\in\mathbb{Z}_{1}^{m},

(29)

where $k^{j}_{f}$ ( $j\in\mathbb{Z}_{1}^{m}$ ) is the bandwidth of the filter for the $j$ th input channel. We now introduce a few notations that will be used later:


$\displaystyle\mathcal{H}_{xm}(s)$	$\displaystyle\!\triangleq\!(sI_{n}\!-\!A_{m})^{-1}\!B,\ \mathcal{H}_{xv}(s)\!\triangleq\!(sI_{n}\!-\!A_{m})^{-1}\!B_{v},$	(30a)
$\displaystyle\mathcal{G}_{xm}(s)$	$\displaystyle\!\triangleq\!\mathcal{H}_{xm}(s)(I_{m}-{\mathcal{C}}(s)),\$	(30b)

where $A_{m},B_{v}$ correspond to system 9 and $B$ to 1. Also, letting $x_{\textup{in}}(t)$ be the state of the system $\dot{x}_{\textup{in}}(t)=A_{m}x_{\textup{in}}(t),\ x_{\textup{in}}(0)=x_{0},$ we have $x_{\textup{in}}(s)\triangleq(sI_{n}-A_{m})^{-1}x_{0}$ . Defining $\rho_{\textup{in}}\triangleq\left\lVert s(sI_{n}-A_{m})^{-1}\right\rVert_{\mathcal{L}_{1}}\max_{x_{0}\in\mathcal{X}_{0}}\left\lVert x_{0}\right\rVert_{\infty}$ , and further considering that $A_{m}$ is Hurwitz and $\mathcal{X}_{0}$ is compact, we have $\left\lVert x_{\textup{in}}\right\rVert_{\mathcal{L}_{\infty}}\leq\rho_{\textup{in}}$ according to Lemma 2.

III-C1 ${\mathcal{L}_{1}}$ adaptive control architecture

For stability guarantees, the filter ${\mathcal{C}}(s)$ in (29) needs to ensure that there exists a positive constant $\rho_{r}$ and a (small) positive constant $\gamma_{1}$ such that


$\displaystyle\left\lVert\mathcal{G}_{xm}(s)\right\rVert_{\mathcal{L}_{1}}b_{f,\mathcal{X}_{r}}<\rho_{r}-\left\lVert\mathcal{H}_{xv}(s)\right\rVert_{\mathcal{L}_{1}}$	$\displaystyle\left\lVert v\right\rVert_{\mathcal{L}_{\infty}}-\rho_{\textup{in}},$	(31a)
$\displaystyle\left\lVert\mathcal{G}_{xm}(s)\right\rVert_{\mathcal{L}_{1}}L_{f,\mathcal{X}_{a}}$	$\displaystyle<1,$	(31b)

where

	$\displaystyle\rho$	$\displaystyle\triangleq\rho_{r}+\gamma_{1},$		(32)
	$\displaystyle\mathcal{X}_{r}$	$\displaystyle\triangleq\Omega(\rho_{r}),\ \mathcal{X}_{a}\triangleq\Omega(\rho).$		(33)

Remark 7.

We will show in Lemma 3 and Theorem 1 that $\rho_{r}$ and $\rho$ are actually uniform bounds on the states of a non-adaptive reference system (defined in (43)) and of the adaptive system, respectively.

Remark 8.

Note that $\left\lVert\mathcal{G}_{xm}(s)\right\rVert_{\mathcal{L}_{1}}\rightarrow 0$ , when the bandwidth of the filter ${\mathcal{C}}(s)$ goes to infinity, i.e., $k_{f}^{j}\rightarrow\infty$ for all $j\in\mathbb{Z}_{1}^{m}$ . Furthermore, $b_{f,\Omega(\rho_{r})}$ can be bounded using the Lipschitz property 7a of $f(t,x)$ in $\Omega(\rho_{r})$ , and $L_{f,\Omega(\rho)}$ is bounded given any $\rho>0$ . Therefore, 31 can always be satisfied under a sufficiently high bandwidth for ${\mathcal{C}}(s)$ .

A typical ${\mathcal{L}_{1}}$ AC is comprised of three elements, namely a state predictor, an adaptive law and a low-pass filtered control law. For the system 5, the state predictor is defined by

\dot{\hat{x}}(t)=A_{m}x(t)+B_{v}v(t)+B(u_{\textup{a}}(t)+\hat{\sigma}_{1}(t))+B^{\perp}\hat{\sigma}_{2}(t)+A_{e}\tilde{x}(t),\ \hat{x}(0)=x_{0},

(34)

where $\tilde{x}(t)=\hat{x}(t)-x(t)$ is the prediction error, $A_{e}$ is a Hurwitz matrix, $B^{\perp}\in\mathbb{R}^{n\times(n-m)}$ is an arbitrary matrix satisfying $B^{\perp}B=0$ and $\textup{rank}\left(\left[B\ B^{\perp}\right]\right)=n$ , and $\hat{\sigma}_{1}(t)$ and $\hat{\sigma}_{2}(t)$ are estimated matched and unmatched disturbances, respectively. The estimates $\hat{\sigma}_{1}(t)$ and $\hat{\sigma}_{2}(t)$ are updated by the following piecewise-constant adaptive law (similar to that in [18, Section 3.3]):

\left\{\begin{aligned} &\begin{bmatrix}\hat{\sigma}_{1}(t)\\ \hat{\sigma}_{2}(t)\end{bmatrix}&&=\begin{bmatrix}\hat{\sigma}_{1}(iT)\\ \hat{\sigma}_{2}(iT)\end{bmatrix},\quad t\in[iT,(i+1)T),\\ &\begin{bmatrix}\hat{\sigma}_{1}(iT)\\ \hat{\sigma}_{2}(iT)\end{bmatrix}&&=-\!\left[B\ B^{\perp}\right]^{-1}\Phi^{-1}(T)e^{A_{e}T}\tilde{x}(iT),\end{aligned}\right.

(35)

where $T$ is the estimation sampling time and $\Phi(T)\triangleq A_{e}^{-1}\left(e^{A_{e}T}\!-I_{n}\right)$ . Finally, the control law is given by

u_{\textup{a}}(s)=-{\mathcal{C}}(s)\mathfrak{L}\left[\hat{\sigma}_{1}(t)\right].

(36)

The control law 36 tries to cancel the estimated (matched) uncertainty within the bandwidth of the filter ${\mathcal{C}}(s)$ . Additionally, unmatched uncertainty estimate ( $\hat{\sigma}_{2}(t)$ ) appears in 34 and 35, although the system dynamics 5 contains only matched uncertainty. This is due to the adoption of the piecewise-constant adaptive law, which may produce nonzero value for $\hat{\sigma}_{2}(t)$ . However, a non-zero $\hat{\sigma}_{2}(t)$ will not cause an issue either for implementation or for performance guarantee. Additionally, it is possible to prove that $\lim_{T\rightarrow 0}\hat{\sigma}_{2}(t)=0$ for any $t\geq 0$ [25], i.e., the estimated unmatched uncertainty will be close to zero when $T$ is small.

III-C2 Uniform performance bounds

We first define some constants:


$\displaystyle\bar{\alpha}_{0}(T)$	$\displaystyle\triangleq\int_{0}^{T}\left\lVert e^{A_{e}(T-\tau)}B\right\rVert_{\infty}d\tau,$	(37a)
$\displaystyle\bar{\alpha}_{1}(T)$	$\displaystyle\triangleq\max_{t\in[0,T]}\left\lVert e^{A_{e}t}\right\rVert_{\infty},$	(37b)
$\displaystyle\bar{\alpha}_{2}(T)$	$\displaystyle\triangleq\max_{t\in[0,T]}\int_{0}^{t}\left\lVert e^{A_{e}(t-\tau)}\Phi^{-1}(T)e^{A_{e}T}\right\rVert_{\infty}d\tau,$	(37c)
$\displaystyle\gamma_{0}(T)$	$\displaystyle\triangleq b_{f,\mathcal{X}_{a}}\bar{\alpha}_{0}(T)\left(\bar{a}_{1}(T)+\bar{a}_{2}(T)+1\right),$	(37d)

where $\alpha_{0}(T),~{}\alpha_{1}(T)$ and $\alpha_{2}(T)$ are defined in 37a, 37b and 37c, respectively. Clearly, $b_{f,\mathcal{X}_{a}}$ for a compact set $\mathcal{X}_{a}$ and $\lim_{T\rightarrow 0}\bar{\alpha}_{1}(T)=0$ are bounded, and $\lim_{T\rightarrow 0}\bar{\alpha}_{0}(T)=0$ . By using Taylor series expansion of $e^{A_{e}T}$ , one can show that $\lim_{T\rightarrow 0}\int_{0}^{T}\left\lVert\Phi^{-1}(T)\right\rVert_{\infty}d\tau$ is bounded, which implies that $\lim_{T\rightarrow 0}\bar{\alpha}_{2}(T)$ is bounded. As a result, we have

\lim_{T\rightarrow 0}\gamma_{0}(T)=0.

(38)

Further define

$\displaystyle\rho_{ur}\triangleq$	$\displaystyle\left\lVert{\mathcal{C}}(s)\right\rVert_{\mathcal{L}_{1}}b_{f,\mathcal{X}_{r}},$	(39)
$\displaystyle\gamma_{2}\triangleq$	$\displaystyle\left\lVert{\mathcal{C}}(s)\right\rVert_{\mathcal{L}_{1}}\!L_{f,\mathcal{X}_{a}}\gamma_{1}\!+\!\left\lVert{\mathcal{C}}(s)B^{\dagger}(sI_{n}\!-\!A_{e})\right\rVert_{\mathcal{L}_{1}}\!\gamma_{0}(T),$	(40)
$\displaystyle\rho_{u_{\textup{a}}}\triangleq$	$\displaystyle\rho_{ur}+\gamma_{2},$	(41)

where $\gamma_{1}$ is introduced in 32. Due to 38 and 31b, we can always select a small enough $T>0$ such that

\frac{\left\lVert\mathcal{H}_{xm}(s){\mathcal{C}}(s)B^{\dagger}(sI_{n}-A_{e})\right\rVert_{\mathcal{L}_{1}}}{1-\left\lVert\mathcal{G}_{xm}(s)\right\rVert_{\mathcal{L}_{1}}L_{f,\mathcal{X}_{a}}}\gamma_{0}(T)<\gamma_{1},

(42)

where $\mathcal{X}_{a}$ is defined in 33 and $B^{\dagger}$ is the pseudo-inverse of $B$ .

Following the convention for performance analysis of an ${\mathcal{L}_{1}}$ AC[18], we introduce the following reference system:


$\displaystyle\dot{x}_{\textup{r}}(t)$	$\displaystyle\!=\!{A_{m}}x_{\textup{r}}(t)\!+\!{B_{v}}v(t)\!+\!B({u_{\textup{r}}}(t)\!+\!f(t,x_{\textup{r}}(t))),\hfill$	(43a)
$\displaystyle{u_{\textup{r}}}(s)$	$\displaystyle\!=\!-{\mathcal{C}}(s)\mathfrak{L}\left[f(t,x_{\textup{r}}(t))\right],\quad x_{\textup{r}}(0)\!=\!x_{0},$	(43b)

Clearly, the control law in the reference system 43 partially cancels the uncertainty $f(t,x_{\textup{r}}(t)))$ within the bandwidth of the filter ${\mathcal{C}}(s)$ . Moreover, the control law depends on the true uncertainties and is thus not implementable. The reference system is introduced to help characterize the performance of the adaptive closed-loop system, which will be done in four sequential steps: (i) establishing the bounds on the states and inputs of the reference system (Lemma 3); (ii) quantifying the difference between the states and inputs of the adaptive system and those of the reference system (Theorem 1); (iii) quantifying the difference between the states and inputs of the reference system and those of the nominal system (Lemma 5); (iv) based on the results from (ii) and (iii), quantifying the difference between the states and inputs of the adaptive system and those of the nominal system (Theorem 2).

The proofs of these lemmas and theorems mostly follow the typical ${\mathcal{L}_{1}}$ AC analysis procedure [18], and are included in appendices for completeness.

For notation brevity, we define:

\eta(t)\triangleq f(t,x(t)),\quad\eta_{\textup{r}}(t)\triangleq f(t,x_{\textup{r}}(t)).

(44)

To provide an overview, Table I summarizes the different (error) systems involved in this section and their related theorems/lemmas, the uniform bounds, the ${\mathcal{L}_{1}}$ AC parameters and conditions.

TABLE I: An overview of different (error) systems involved in Section III-C, and their related theorem/lemma, uniform bounds,

{\mathcal{L}_{1}}

AC parameters and conditions

	(Error) System	Theorem/Lemma	Uniform Bounds on States and Inputs	${\mathcal{L}_{1}}$ AC Parameters	Conditions
1	Nominal system 9	Lemma 2	$\left\lVert x_{\textup{n}}\right\rVert_{\mathcal{L}_{\infty}}\!\leq\!\rho_{\textup{in}}+\left\lVert\mathcal{H}_{xv}\right\rVert_{\mathcal{L}_{1}}\left\lVert v\right\rVert_{\infty}$	N/A	N/A
2	Reference system 43	Lemma 3	$\left\lVert x_{\textup{r}}\right\rVert_{\mathcal{L}_{\infty}}\!<\!\rho_{r}$ , $\left\lVert u_{\textup{r}}\right\rVert_{\mathcal{L}_{\infty}}\!<\!\rho_{ur}$	${\mathcal{C}}(s)$	31a
3	Diff. b/t reference and adaptive systems	Theorem 1	$\left\lVert x_{\textup{r}}\!-\!x\right\rVert_{\mathcal{L}_{\infty}}\!\leq\!\gamma_{1},\ \left\lVert u_{\textup{r}}\!-\!u_{\textup{a}}\right\rVert_{\mathcal{L}_{\infty}}\!\leq\!\gamma_{2}$	$A_{e},~{}T,~{}{\mathcal{C}}(s)$	42 and 31a
4	Diff. b/t reference and nominal systems	Lemma 5	$\left\\|x_{\textup{r}}\!-\!x_{\textup{n}}\right\\|_{{\mathcal{L}}{{}_{\infty}}}\!\leq\!\left\lVert\mathcal{G}_{xm}\right\rVert_{\mathcal{L}_{1}}b_{f,\mathcal{X}_{r}}$	${\mathcal{C}}(s)$	31a
5	Adaptive system: 5 and the ${\mathcal{L}_{1}}$ AC	Theorem 1	$\left\lVert x\right\rVert_{\mathcal{L}_{\infty}}<\rho$ , $\left\lVert u_{\textup{a}}\right\rVert_{\mathcal{L}_{\infty}}<\rho_{u}$	$A_{e},~{}T,~{}{\mathcal{C}}(s)$	42 and 31a
6	Diff. b/t adaptive and nominal systems	Theorem 2	$\left\lVert x\!-\!x_{\textup{n}}\right\rVert_{\mathcal{L}_{\infty}}\!\leq\!\tilde{\rho}$	$A_{e},~{}T,~{}{\mathcal{C}}(s)$	42 and 31a

The proofs for Lemmas 3, 4, 1 and 5 are given in Sections A-B, A-C, A-D and A-E.

Lemma 3.

For the closed-loop reference system in (43) subject to Assumption 1 and the stability condition in (31a), we have

	$\displaystyle\left\lVert x_{\textup{r}}\right\rVert_{\mathcal{L}_{\infty}}$	$\displaystyle<\rho_{r},$		(45)
	$\displaystyle\left\lVert u_{\textup{r}}\right\rVert_{\mathcal{L}_{\infty}}$	$\displaystyle<\rho_{ur},$		(46)

where $\rho_{r}$ is introduced in 31a, and $\rho_{ur}$ is defined in 39.

From 5 and 34, the prediction error dynamics are given by

\displaystyle\dot{\tilde{x}}(t)

\displaystyle=A_{e}\tilde{x}(t)+B\left(\hat{\sigma}_{1}(t)-f(t,x(t))\right)+B^{\perp}\hat{\sigma}_{2}(t).

(47)

The following lemma establishes a bound on the prediction error under the assumption that the actual states and adaptive inputs are bounded.

Lemma 4.

Given the uncertain system (5) subject to Assumption 1, the state predictor (34) and the adaptive law (35), if

\left\lVert x\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}\leq\rho,\quad\left\lVert u_{\textup{a}}\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}\leq\rho_{u_{\textup{a}}},

(48)

with $\rho$ and $\rho_{u_{\textup{a}}}$ defined in 41 and 32, respectively, then

\displaystyle\left\lVert\tilde{x}\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}\leq\gamma_{0}(T).

(49)

Theorem 1.

Given the uncertain system (5) subject to Assumption 1 and the reference system (43) subject to the conditions 31b and 31a with a constant $\gamma_{1}>0$ , with the ${\mathcal{L}_{1}}$ AC defined via 34, 35 and 36 subject to the sample time constraint (42), we have


$\displaystyle\left\lVert x\right\rVert_{\mathcal{L}_{\infty}}$	$\displaystyle\leq\rho,$	(50a)
$\displaystyle\left\lVert u_{\textup{a}}\right\rVert_{\mathcal{L}_{\infty}}$	$\displaystyle\leq\rho_{u_{\textup{a}}},$	(50b)
$\displaystyle\left\lVert x_{\textup{r}}-x\right\rVert_{\mathcal{L}_{\infty}}$	$\displaystyle\leq\gamma_{1},$	(50c)
$\displaystyle\left\lVert u_{\textup{r}}-u_{\textup{a}}\right\rVert_{\mathcal{L}_{\infty}}$	$\displaystyle\leq\gamma_{2},$	(50d)

where $\rho$ , $\gamma_{2}$ and $\rho_{u_{\textup{a}}}$ are defined in 32, 40 and 41, respectively.

Remark 9.

For an arbitrarily small $\gamma_{1}>0$ , one can always find a small enough $T$ such that the constraint 42 is satisfied. According to 40, $\gamma_{2}$ depends on $\gamma_{1}$ and $\gamma_{0}(T)$ , and can be made arbitrarily small by reducing $\gamma_{1}$ and $T$ . Thus, by reducing $T$ , both $\gamma_{1}$ and $\gamma_{2}$ can be made arbitrarily small, which indicates that the difference between the inputs and states of the adaptive system and those of the reference system can be made arbitrarily small from Theorem 1.

Lemma 5.

Given the reference system (43) and the nominal system (9a), subject to Assumption 1, and the condition 31a, we have

\displaystyle{\left\|{{x_{\textup{r}}}-x_{\textup{n}}}\right\|_{{\mathcal{L}}{{}_{\infty}}}}

\displaystyle\leq\left\lVert\mathcal{G}_{xm}\right\rVert_{\mathcal{L}_{1}}b_{f,\mathcal{X}_{r}}

(51)

Remark 10.

When the bandwidth of the filter ${\mathcal{C}}(s)$ goes to infinity, $\left\lVert\mathcal{G}_{xm}\right\rVert_{\mathcal{L}_{1}}$ and thus ${\left\|{{x_{\textup{r}}}-x_{\textup{n}}}\right\|_{{\mathcal{L}}{{}_{\infty}}}}$ go to 0. This indicates that the difference between the states of the reference system and those of the nominal system can be made arbitrarily small by increasing the filter bandwidth. However, a high-bandwidth filter allows for high-frequency control signals to enter the system under fast adaptation (corresponding to small $T$ ), compromising the robustness. Thus, the filter presents a trade-off between robustness and performance. More details about the role and design of the filter can be found in [18].

From Theorem 1, Lemma 5 and application of the triangle inequality, we can obtain uniform bounds on the error between the actual system 5 and the nominal system 9a, formally stated in the following theorem. The proof is straightforward and thus omitted.

Theorem 2.

Given the uncertain system (5) subject to Assumption 1, the nominal system (9a), and the ${\mathcal{L}_{1}}$ AC defined via 34, 35 and 36 subject to the conditions 31b and 31a with a constant $\gamma_{1}>0$ and the sample time constraint (42), we have

	$\displaystyle\left\lVert x-x_{\textup{n}}\right\rVert_{\mathcal{L}_{\infty}}$	$\displaystyle\leq\tilde{\rho},$		(52)
	$\displaystyle\left\lVert u_{\textup{a}}\right\rVert_{\mathcal{L}_{\infty}}$	$\displaystyle\leq\rho_{u_{\textup{a}}},$		(53)

where $\rho_{u_{\textup{a}}}$ is defined in 41, and

\displaystyle\tilde{\rho}

\displaystyle\triangleq\left\lVert\mathcal{G}_{xm}(s)\right\rVert_{\mathcal{L}_{1}}b_{f,\mathcal{X}_{r}}+\gamma_{1}.

(54)

Remark 11.

From Remarks 10 and 9, by decreasing $T$ and increasing the bandwidth of the filter ${\mathcal{C}}(s)$ , one can make (i) the states of the adaptive system arbitrarily close to those of the nominal system; and (ii) the adaptive inputs $u_{\textup{a}}(t)$ arbitrarily close to $f(t,x)$ , i.e., the true uncertainty, since $f(t,x_{\textup{r}})$ is arbitrarily close to $f(t,x)$ when the error between $x(t)$ and $x_{\textup{r}}(t)$ is arbitrarily small.

IV ${\mathcal{L}_{1}}$ AC with Separate Bounds for States and Inputs

In Section III-C, we presented an ${\mathcal{L}_{1}}$ AC that guarantees uniform bounds on the states and adaptive control inputs of the adaptive system with respect to the nominal system, without consideration of the constraints 2. However, as can be seen from Theorem 2, the uniform bound on $x(t)-x_{\textup{n}}(t)$ or $u_{\textup{a}}(t)$ is represented by the vector- $\infty$ norm, which always leads to the same bound for all the states, $x_{i}-x_{\textup{n},i}(t)$ ( $i\in\mathbb{Z}_{1}^{n}$ ), or all the adaptive inputs, $u_{\textup{a},j}$ ( $j\in\mathbb{Z}_{1}^{m}$ ). The use of vector- $\infty$ norms may lead to conservative bounds for some specific states or adaptive inputs, making it impossible to satisfy the constraints 2 or leading to significantly tightened constraints for the RG design. To reduce such conservatism, this section will present a scaling technique to derive an individual bound for each $x_{i}(t)-x_{\textup{n},i}(t)$ ( $i\in\mathbb{Z}_{1}^{n}$ ) and $u_{\textup{a},j}(t)$ ( $j\in\mathbb{Z}_{1}^{m}$ ).

From Theorem 2, one can see that the bound on $x(t)-x_{\textup{n}}(t)$ (or $u_{\textup{a}}(t)$ ) consists of two parts: the first part is $\gamma_{1}$ (or $\gamma_{2}$ ) that can be made arbitrarily small by reducing $T$ (see Remark 9), while the second part is a bound on $x_{\textup{r}}(t)-x_{\textup{n}}(t)$ (or $u_{\textup{r}}(t)$ ). Next, we will derive an individual bound for each $x_{i}(t)-x_{\textup{r},t}(t)$ (or $u_{\textup{r},j}(t)$ ).
Derive Separate Bounds for States via Scaling: For deriving an individual bound for each $x_{i}(t)-x_{\textup{r},t}(t)$ , we introduce the following coordinate transformations for the reference system 43 and the nominal system 9a for each $i\in\mathbb{Z}_{1}^{n}$ :

\left\{\begin{aligned} \check{x}_{\textup{r}}&=\mathrm{T}_{x}^{i}x_{\textup{r}},\quad\check{x}_{\textup{n}}=\mathrm{T}_{x}^{i}x_{\textup{n}},\\ \check{A}_{m}^{i}&=\mathrm{T}_{x}^{i}A_{m}(\mathrm{T}_{x}^{i})^{-1},\\ \check{B}^{i}&=\mathrm{T}_{x}^{i}B,\quad\check{B}^{i}_{v}=\mathrm{T}_{x}^{i}B_{v},\end{aligned}\right.

(55)

where $\mathrm{T}_{x}^{i}\!>\!0$ is a diagonal matrix that satisfies

\displaystyle\mathrm{T}_{x}^{i}[i]

\displaystyle=1,\ 0<\mathrm{T}_{x}^{i}[k]\leq 1,\ \forall k\neq i,

(56)

with $\mathrm{T}_{x}^{i}[k]$ denoting the $k$ th diagonal element. Under the transformation 55, the reference system 43 is converted to

\left\{\begin{aligned} \dot{\check{x}}_{\textup{r}}(t)&\!=\!\check{A}_{m}^{i}\check{x}_{\textup{r}}(t)\!+\!\check{B}_{v}^{i}v(t)\!+\!\check{B}^{i}(u_{\textup{r}}(t)\!+\!\!\check{f}(t,\check{x}_{\textup{r}}(t))),\\ {u_{\textup{r}}}(s)&\!=\!-{\mathcal{C}}(s)\mathfrak{L}\left[\check{f}(t,\check{x}_{\textup{r}}(t))\right],\ \check{x}(0)\!=\!\mathrm{T}_{x}^{i}x_{0},\end{aligned}\right.

(57)

where

\check{f}(t,\check{x}_{\textup{r}}(t))=f(t,x_{\textup{r}}(t)))=f(t,(\mathrm{T}_{x}^{i})^{-1}\check{x}_{\textup{r}}(t))).

(58)

Given a set ${\mathcal{Z}}$ , define

\check{\mathcal{Z}}\triangleq\{\check{z}\in\mathbb{R}^{n}:(\mathrm{T}_{x}^{i})^{-1}\check{z}\in{\mathcal{Z}}\}.

(59)

Similar to 30, for the transformed reference system 57, we have


$\displaystyle\mathcal{H}_{\check{x}m}^{i}(s)$	$\displaystyle\triangleq(sI_{n}\!-\!\check{A}_{m}^{i})^{-1}\!\check{B}^{i}=\mathrm{T}_{x}^{i}\mathcal{H}_{xm}(s),$	(60a)
$\displaystyle\mathcal{H}_{\check{x}v}^{i}(s)$	$\displaystyle\triangleq(sI_{n}\!-\!\check{A}_{m}^{i})^{-1}\!\check{B}^{i}_{v}=\mathrm{T}_{x}^{i}\mathcal{H}_{xv}(s),$	(60b)
$\displaystyle\mathcal{G}_{\check{x}m}^{i}(s)$	$\displaystyle\triangleq\mathcal{H}_{\check{x}m}^{i}(s)(I_{m}-{\mathcal{C}}(s))=\mathrm{T}_{x}^{i}\mathcal{G}_{xm}(s),$	(60c)

where $\mathcal{H}_{xm},~{}\mathcal{H}_{xv},~{}\mathcal{G}_{xm}$ are defined in 30. By applying the transformation 55 to the nominal system 9a, we obtain

\left\{\begin{aligned} \dot{\check{x}}_{\textup{n}}(t)&=\check{A}_{m}^{i}{\check{x}}_{\textup{n}}(t)+\check{B}^{i}_{v}v(t),\ {\check{x}}_{\textup{n}}(0)\!=\!\mathrm{T}_{x}^{i}x_{0},\\ y_{\textup{n}}(t)&=\check{C}{\check{x}}_{\textup{n}}(t).\end{aligned}\right.

(61)

Letting ${\check{x}}_{\textup{in}}(t)$ be the state of the system $\dot{\check{x}}_{\textup{in}}(t)=\check{A}_{m}^{i}{\check{x}}_{\textup{in}}(t)$ with ${\check{x}}_{\textup{in}}(0)=\check{x}_{\textup{n}}(0)=\mathrm{T}_{x}^{i}x_{0}$ , we have ${\check{x}}_{\textup{in}}(s)\triangleq(sI_{n}-\check{A}_{m}^{i})^{-1}{\check{x}}_{\textup{in}}(0)=\mathrm{T}_{x}^{i}(sI_{n}-A_{m})^{-1}x_{0}$ . Defining

\check{\rho}_{\textup{in}}^{i}\triangleq\left\lVert s\mathrm{T}_{x}^{i}(sI_{n}-A_{m})^{-1}\right\rVert_{\mathcal{L}_{1}}\max_{x_{0}\in\mathcal{X}_{0}}\left\lVert x_{0}\right\rVert_{\infty},

(62)

and further considering Lemma 2, we have $\left\lVert{\check{x}}_{\textup{in}}\right\rVert_{\mathcal{L}_{\infty}}\leq\check{\rho}_{\textup{in}}^{i}$ . Similar to 31a, for the transformed reference system 57, consider the following condition:

\displaystyle\left\lVert\mathcal{G}_{\check{x}m}^{i}(s)\right\rVert_{\mathcal{L}_{1}}b_{\check{f},\check{\mathcal{X}}_{r}}<\check{\rho}_{r}^{i}-\left\lVert\mathcal{H}_{\check{x}v}^{i}(s)\right\rVert_{\mathcal{L}_{1}}\left\lVert v\right\rVert_{\mathcal{L}_{\infty}}-\check{\rho}_{\textup{in}}^{i},

(63)

where $\mathcal{X}_{r}$ is defined in 33 and $\check{\mathcal{X}}_{r}$ is defined according to 59 and $\check{\rho}_{r}^{i}$ is a positive constant to be determined. Then we have the following result.

Lemma 6.

Consider the reference system (43) subject to Assumption 1, the nominal system 9a, the transformed reference system 57 and transformed nominal system 61 obtained by applying 55 with any $\mathrm{T}_{x}^{i}$ satisfying 56. Suppose that 31a holds with some constants $\rho_{r}$ and $\left\lVert v\right\rVert_{\mathcal{L}_{\infty}}$ . Then, there exists an constant $\check{\rho}_{r}^{i}\leq\rho_{r}$ such that 63 holds with the same $\left\lVert v\right\rVert_{\mathcal{L}_{\infty}}$ . Furthermore,

	$\displaystyle\left\lvert x_{\textup{r},i}(t)\right\rvert$	$\displaystyle\leq\check{\rho}_{r}^{i},\ \forall t\geq 0,$		(64)
	$\displaystyle\left\lvert x_{\textup{r,i}}(t)-x_{\textup{n},i}(t)\right\rvert$	$\displaystyle\leq\left\lVert\mathcal{G}_{\check{x}m}(s)\right\rVert_{\mathcal{L}_{1}}b_{f,\mathcal{X}_{r}},\ \forall t\geq 0,$		(65)

where we re-define

\mathcal{X}_{r}\triangleq\left\{z\in\mathbb{R}^{n}:\left\lvert z_{i}\right\rvert\leq\check{\rho}_{r}^{i},i\in\mathbb{Z}_{1}^{n}\right\}.

(66)

Proof.

For any $\mathrm{T}_{x}^{i}$ satisfying 56 with an arbitrary $i\in\mathbb{Z}_{1}^{n}$ , we have $\left\lVert\mathrm{T}_{x}^{i}\right\rVert_{\infty}=1$ . Therefore, under the transformation 55, considering 60 and 62 and Lemma 2, we have


$\displaystyle\left\lVert\mathcal{H}_{\check{x}m}^{i}(s)\right\rVert_{\mathcal{L}_{\infty}}$	$\displaystyle\!\leq\!\left\lVert\mathrm{T}_{x}^{i}\right\rVert_{\infty}\left\lVert\mathcal{H}_{xm}(s)\right\rVert_{\mathcal{L}_{\infty}}\!=\!\left\lVert\mathcal{H}_{xm}(s)\right\rVert_{\mathcal{L}_{\infty}}\!,$	(67a)
$\displaystyle\left\lVert\mathcal{H}_{\check{x}v}^{i}(s)\right\rVert_{\mathcal{L}_{\infty}}$	$\displaystyle\!\leq\!\left\lVert\mathrm{T}_{x}^{i}\right\rVert_{\infty}\left\lVert\mathcal{H}_{xv}(s)\right\rVert_{\mathcal{L}_{\infty}}\!=\!\left\lVert\mathcal{H}_{xv}(s)\right\rVert_{\mathcal{L}_{\infty}}\!,$	(67b)
$\displaystyle\left\lVert\mathcal{G}_{\check{x}m}^{i}(s)\right\rVert_{\mathcal{L}_{\infty}}$	$\displaystyle\!\leq\!\left\lVert\mathrm{T}_{x}^{i}\right\rVert_{\infty}\left\lVert\mathcal{G}_{xm}(s)\right\rVert_{\mathcal{L}_{\infty}}\!=\!\left\lVert\mathcal{G}_{xm}(s)\right\rVert_{\mathcal{L}_{\infty}}\!,$	(67c)
$\displaystyle\check{\rho}_{\textup{in}}^{i}$	$\displaystyle\!\leq\!\left\lVert\mathrm{T}_{x}^{i}\right\rVert_{\infty}\rho_{\textup{in}}\!=\!\rho_{\textup{in}}.$	(67d)

It follows from Lemma 3 that $x_{\textup{r}}(t)\in\mathcal{X}_{r}$ for any $t\geq 0$ , which, together with 55, implies $\check{x}_{\textup{r}}(t)\in\check{\mathcal{X}}_{r}$ for any $t\geq 0$ , where $\check{\mathcal{X}}_{r}$ is defined via 59. Considering 58 and 59, for any compact set $\mathcal{X}_{r}$ , we have

b_{\check{f},\check{\mathcal{X}}_{r}}=b_{f,\mathcal{X}_{r}}.

(68)

Now suppose that constants $\rho_{r}$ and $\left\lVert v\right\rVert_{\mathcal{L}_{\infty}}$ satisfy 31a. Then, due to 68 and 67, with $\check{\rho}_{r}^{i}=\rho_{r}$ and the same $\left\lVert v\right\rVert_{\mathcal{L}_{\infty}}$ , 63 is satisfied.

Additionally, if 63 holds, by applying Lemma 3 to the transformed reference system 57, we obtain that $\left\lVert\check{x}_{\textup{r}}\right\rVert_{\mathcal{L}_{\infty}}\leq\check{\rho}_{r}^{i}$ , implying that $\left\lvert\check{x}_{\textup{r},i}(t)\right\rvert\leq\check{\rho}_{r}^{i}$ for any $t\geq 0$ . Since $\check{x}_{\textup{r},i}(t)=x_{\textup{r},i}(t)$ due to the constraint 56 on $\mathrm{T}_{x}^{i}$ , we have 64. Equation 64 is equivalent to $x_{\textup{r}}(t)\in\mathcal{X}_{r}$ for any $t\geq 0$ , with the re-definition of $\mathcal{X}_{r}$ in 66. Following the proof of Lemma 5, one can obtain $\left\|{\check{x}_{\textup{r}}-\check{x}_{\textup{n}}}\right\|_{{\mathcal{L}}{{}_{\infty}}}\leq\left\lVert\mathcal{G}_{\check{x}m}\right\rVert_{\mathcal{L}_{1}}b_{\check{f},\check{\mathcal{X}}_{r}}=\left\lVert\mathcal{G}_{\check{x}m}(s)\right\rVert_{\mathcal{L}_{1}}b_{f,\check{\mathcal{X}}_{r}}$ , where the equality is due to 68. Further considering $\check{x}_{\textup{r},i}(t)=x_{\textup{r},i}(t)$ and $\check{x}_{\textup{n},i}(t)=x_{\textup{n},i}(t)$ due to the constraint 56 on $\mathrm{T}_{x}^{i}$ , we have 65. ∎

Remark 12.

Lemma 3 and Lemma 5 imply $\left\lvert x_{\textup{r},i}(t)\right\rvert\leq\rho_{r}$ and $\left\lvert x_{\textup{r},i}(t)-x_{\textup{n},i}(t)\right\rvert\leq\left\lVert\mathcal{G}_{xm}\right\rVert_{\mathcal{L}_{1}}b_{f,\Omega(\rho_{r})}$ , respectively, for all $i\in\mathbb{Z}_{1}^{n}$ and $t\geq 0$ . Lemma 6 indicates that by applying the coordinate transformation 55 and leveraging the condition 63 for the transformed system 57, one can obtain a tighter bound on $x_{\textup{r},i}(t)$ than $\rho_{r}$ and a tighter bound on $\left\lvert x_{\textup{r},i}(t)-x_{\textup{n},i}(t)\right\rvert$ than $\left\lVert\mathcal{G}_{xm}(s)\right\rVert_{\mathcal{L}_{1}}b_{f,\Omega(\rho_{r})}$ .

Derive Separate Bounds for Adaptive Inputs: From 43b and the structure with ${\mathcal{C}}(s)$ 29, we can obtain

u_{\textup{r},j}(s)=-{\mathcal{C}}_{j}(s)\mathfrak{L}\left[f_{j}(t,x_{\textup{r}}(t))\right],\quad\forall j\in\mathbb{Z}_{1}^{m}.

(69)

Therefore, given a set $\mathcal{X}_{r}$ such that $x_{\textup{r}}(t)\!\in\!\mathcal{X}_{r}$ for any $t\!\geq\!0$ , from Assumptions 1 and 2 we get

\left\lvert u_{\textup{r},j}(t)\right\rvert\leq\left\lVert{\mathcal{C}}_{j}(s)\right\rVert_{\mathcal{L}_{1}}b_{f_{j},\mathcal{X}_{r}},\quad\forall t\geq 0,\ \forall j\in\mathbb{Z}_{1}^{m}.

(70)

With the preceding preparations, we are ready to derive an individual bound for $x_{i}(t)-x_{\textup{n},i}(t)$ ( $i\in\mathbb{Z}_{1}^{n}$ ) and $u_{j}(t)-u_{\textup{n},j}(t)$ ( $j\in\mathbb{Z}_{1}^{m}$ ), as stated in the following theorem.

Theorem 3.

Consider the uncertain system (5) subject to Assumption 1, the nominal system (9a), and the ${\mathcal{L}_{1}}$ AC defined via 34, 35 and 36 subject to the conditions 31b and 31a with constants $\rho_{r}$ and $\gamma_{1}>0$ and the sample time constraint (42). Suppose that for each $i\in\mathbb{Z}_{1}^{n}$ , 63 holds with a constant $\check{\rho}_{r}^{i}$ for the transformed reference system 57 obtained by applying 55. Then, we have


$\displaystyle{x(t)-x_{\textup{n}}(t)}\!\in\!\tilde{\mathcal{X}}\!\triangleq\!\left\{z\!\in\!\mathbb{R}^{n}\!:\!\left\lvert z_{i}\right\rvert\!\leq\!\tilde{\rho}^{i},\ i\in\mathbb{Z}_{1}^{n}\right\}\!,\$	$\displaystyle\forall t\!\geq\!0,$	(71a)
$\displaystyle{u_{\textup{a}}(t)}\!\in\!{\mathcal{U}}_{\textup{a}}\!\triangleq\!\left\{z\!\in\!\mathbb{R}^{m}\!:\!\left\lvert z_{j}\right\rvert\!\leq\!\rho_{u_{\textup{a}}}^{j},\ \!j\!\in\!\mathbb{Z}_{1}^{m}\right\}\!,\$	$\displaystyle\forall t\!\geq\!0,$	(71b)
$\displaystyle u(t)-u_{\textup{n}}(t)\!\in\!\tilde{\mathcal{U}}\!\triangleq\!\left\{z\!\in\!\mathbb{R}^{m}\!:\!\left\lvert z_{j}\right\rvert\!\leq\!\tilde{\rho}_{u}^{j},\ j\!\in\!\mathbb{Z}_{1}^{n}\right\}\!,\$	$\displaystyle\forall t\!\geq\!0,$	(71c)

where


$\displaystyle\rho^{i}$	$\displaystyle\!\triangleq\!\check{\rho}_{r}^{i}\!+\!\gamma_{1},\ \tilde{\rho}^{i}\!\triangleq\!\left\lVert\mathcal{G}_{\check{x}m}^{i}(s)\right\rVert_{\mathcal{L}_{1}}\!\!b_{f,\mathcal{X}_{r}}\!+\!\gamma_{1},$	(72a)
$\displaystyle\rho_{u_{\textup{a}}}^{j}$	$\displaystyle\!\triangleq\!\left\lVert{\mathcal{C}}_{j}(s)\right\rVert_{\mathcal{L}_{1}}b_{f_{j},\mathcal{X}_{r}}\!+\!\gamma_{2},\ \tilde{\rho}_{u}^{j}\!\triangleq\!\rho_{u_{\textup{a}}}^{j}\!+\!\sum_{i=1}^{n}\left\lvert K_{x}[j,i]\right\rvert\tilde{\rho}^{i},$	(72b)

with $\mathcal{X}_{r}$ defined in 66, and $C[j,i]$ denoting the $(j,i)$ element of $C$ .

Proof.

For each $i\in\mathbb{Z}_{1}^{n}$ , Lemma 6 implies $\left\lvert x_{\textup{r},i}(t)\right\rvert\leq\check{\rho}_{r}^{i}$ and $\left\lvert x_{\textup{r,i}}(t)-x_{\textup{n},i}(t)\right\rvert\leq\left\lVert\mathcal{G}_{\check{x}m}^{i}(s)\right\rVert_{\mathcal{L}_{1}}b_{f,\mathcal{X}_{r}}$ for all $t\geq 0$ . On the other hand, Theorem 1 indicates that $\left\lvert x_{\textup{r},i}(t)-x_{i}(t)\right\rvert\leq\gamma_{1}$ for any $t\geq 0$ and any $i\in\mathbb{Z}_{1}^{n}$ . Therefore, 71a is true. On the other hand, Theorem 1 indicates that $\left\lvert u_{\textup{r},j}(t)-u_{\textup{a},j}(t)\right\rvert\leq\gamma_{2}$ for any $t\geq 0$ and any $j\in\mathbb{Z}_{1}^{m}$ , which, together with 70, leads to 71b. Finally, 71c follows from 11, 71b and 71a. The proof is complete. ∎

Remark 13.

Theorem 3 provides a way to derive an individual bound on $x_{i}(t)$ , and $x_{i}(t)-x_{\textup{n},i}(t)$ for each $i\in\mathbb{Z}_{1}^{n}$ and on $u_{j}(t)-u_{\textup{n},j}(t)$ for each $j\in\mathbb{Z}_{1}^{m}$ via coordinate transformations. Additionally, similar to the arguments in Remark 11, by decreasing $T$ and increasing the bandwidth of the filter ${\mathcal{C}}(s)$ , one can make $\tilde{\rho}^{i}$ ( $i\in\mathbb{Z}_{1}^{n}$ ) arbitrarily small, i.e., making the states of the adaptive system arbitrarily close to those of the nominal system, and make the bounds on $u_{\textup{a},j}(t)$ and $u_{j}(t)-u_{\textup{n},j}(t)$ arbitrarily close to the bound on the true uncertainty $f_{j}(t,x)$ for $x\in\mathcal{X}_{a}$ , for each $j\in\mathbb{Z}_{1}^{m}$ .

According to Theorems 2 and 3, the procedure for designing an ${\mathcal{L}_{1}}$ AC with separate bounds on states and adaptive inputs can be summarized in Algorithm 1.

Algorithm 1 Designing an

{\mathcal{L}_{1}}

AC with separate bounds

1:uncertain system 5 subject to Assumption 1, initial parameters

A_{e}

{\mathcal{C}}(s)

and

T

to define an

{\mathcal{L}_{1}}

AC,

\gamma_{1}

\mathcal{X}_{0}

\left\lVert v\right\rVert_{\mathcal{L}_{\infty}}

, tol

2:procedure DecideFilterUncertBnd(

{\mathcal{C}}(s)

\gamma_{1}

\mathcal{X}_{0}

\left\lVert v\right\rVert_{\mathcal{L}_{\infty}}

)

3: while condition 31a or 31b does not hold do

4: Increase the bandwidth of

{\mathcal{C}}(s)

\triangleright

See Remark 8.

5: end while

\triangleright

\rho_{r}

\mathcal{X}_{r}\!=\!\Omega(\rho_{r})

and

b_{f,\mathcal{X}_{r}}

will be computed.

6:end procedure

7:Set

b_{f,\mathcal{X}_{r}}^{old}=b_{f,\mathcal{X}_{r}}

8:procedure DeriveSepStateBnds(

b_{f,\mathcal{X}_{r}}

\gamma_{1}

{\mathcal{C}}(s)

\mathcal{X}_{0}

\left\lVert v\right\rVert_{\mathcal{L}_{\infty}}

)

9: for

i=1,\dots,n

10: Select

\mathrm{T}_{x}^{i}

satisfying 56 and apply the transformation 55

11: Evaluate 60 and compute

\check{\rho}_{\textup{in}}^{i}

according to 62

12: Compute

\check{\rho}_{r}^{i}

that satisfies 63

\triangleright

Such a

\check{\rho}_{r}^{i}\leq\rho_{r}

is guaranteed to exist from Lemma 6

13: Set

\rho^{i}=\check{\rho}_{r}^{i}+\gamma_{1}

\tilde{\rho}^{i}=\left\lVert\mathcal{G}^{i}_{\check{x}m}(s)\right\rVert_{\mathcal{L}_{1}}b_{f,\mathcal{X}_{r}}+\gamma_{1}

14: end for

15:end procedure

16:Set

\mathcal{X}_{r}\!=\!\left\{\!z\!\in\!\mathbb{R}^{n}\!:\!\left\lvert z_{i}\right\rvert\!\leq\!\check{\rho}_{r}^{i}\right\}

and update

b_{f,\mathcal{X}_{r}}

17:if

b_{f,\mathcal{X}_{r}}^{old}-b_{f,\mathcal{X}_{r}}>\textup{tol}

then

18: Set

b_{f,\mathcal{X}_{r}}^{old}=b_{f,\mathcal{X}_{r}}

and go to step 8

19:end if

20:Set

\rho=\max_{i\in\mathbb{Z}_{1}^{n}}\rho^{i}

and compute

\mathcal{X}_{a}

via 33

21:procedure DeriveSepInputBnds(

\mathcal{X}_{r},\{\tilde{\rho}^{i}\}_{i\in\mathbb{Z}_{1}^{n}},\gamma_{2},{\mathcal{C}}(s)

)

22: for

j=1,\dots,m

23: Compute

\rho_{u_{\textup{a}}}^{j}

and

\tilde{\rho}_{u}^{j}

according to 72b

24: end for

25:end procedure

26:procedure DecideSampleTime(

A_{e}

{\mathcal{C}}(s)

T

\mathcal{X}_{a}

)

27: while constraint 42 does not hold do

28: Decrease

T

\triangleright

Small

T

can enforce 42 due to 38.

29: end while

30:end procedure

31:An

{\mathcal{L}_{1}}

AC defined by 36, 35 and 34 with parameters

A_{e}

and

{\mathcal{C}}(s)

and

T

\rho^{i}

and

\tilde{\rho}^{i}

for

i\in\mathbb{Z}_{1}^{n}

\rho_{u_{\textup{a}}}^{j}

and

\tilde{\rho}_{u}^{j}

for

j\in\mathbb{Z}_{1}^{m}

Remark 14.

One can try different $\mathrm{T}_{x}^{i}$ in step 10 of Algorithm 1 and select the one that yields the tightest bound for the $i$ th state.

Remark 15.

The conditions 42 and 31 can be quite conservative for some systems, due to the frequent use of inequalities related to the ${\mathcal{L}_{1}}$ norm (stated in Lemma 2), Lipschitz continuity and matrix/vector norms. As a result, the bandwidth of the filter ${\mathcal{C}}(s)$ computed via 31 could be unnecessarily high, while the sample time $T$ computed via 42 under a given $\gamma_{1}$ could be unnecessarily small. Based on our experience, assuming that some bounds $\tilde{\rho}^{i}$ ( $i\in\mathbb{Z}_{1}^{n}$ ) and $\tilde{\rho}_{u}^{j}$ ( $j\in\mathbb{Z}_{1}^{m}$ ) satisfying 71 are derived under a specific filter ${\mathcal{C}}^{\star}(s)$ and $T^{\star}$ that satisfy 42 and 31, those bounds will most likely be respected in simulations even if we decrease the bandwidth of ${\mathcal{C}}^{\star}(s)$ by $1\sim 3$ times and/or increase $T^{\star}$ by $1\sim 10$ times.

V ${\mathcal{L}_{1}}$ -RG: Adaptive Reference Governor for Constrained Control Under Uncertainties

Leveraging the uniform bounds on state and input errors guaranteed by the ${\mathcal{L}_{1}}$ AC, we now integrate the ${\mathcal{L}_{1}}$ AC and the RG introduced in Section III-B to synthesize the ${\mathcal{L}_{1}}$ -RG framework for simultaneously enforcing the constraints 2 and improving the tracking performance.

V-A ${\mathcal{L}_{1}}$ -RG Design

We first make the following assumption.

Assumption 2.

$\hat{\mathcal{X}}_{\textup{n}}$ and $\hat{\mathcal{U}}_{\textup{n}}$ defined by 12, 17 and 71 are nonempty. Furthermore, there exists a known command $v(0)$ such that

(v(0),x_{0})\in{\tilde{O}}_{\infty},

(73)

where ${\tilde{O}}_{\infty}$ is defined in 26.

Remark 16.

Considering 26, 73 implies $x_{0}\in\hat{\mathcal{X}}_{\textup{n}}$ and $u_{\textup{n}}(0)=K_{x}x_{0}+K_{v}v(0)\in\hat{\mathcal{U}}_{\textup{n}}$ (since $x_{\textup{n}}(0)=x_{0}$ ) where $\hat{\mathcal{X}}_{\textup{n}}$ and $\hat{\mathcal{U}}_{\textup{n}}$ , according to 17, are tightened versions of $\mathcal{X}_{\textup{n}}$ and ${\mathcal{U}}_{\textup{n}}$ that are defined in 12. From Remark 13, with a sufficiently high bandwidth for ${\mathcal{C}}(s)$ and sufficiently small $T$ , one can make $\mathcal{X}_{\textup{n}}$ arbitrarily close to $\mathcal{X}$ , and make $\tilde{\mathcal{U}}$ arbitrarily close to the bound set for the true uncertainty in $\mathcal{X}$ . Additionally, as mentioned in Remark 4, $\hat{\mathcal{X}}_{\textup{n}}$ and $\hat{\mathcal{U}}_{\textup{n}}$ are close to $\mathcal{X}_{\textup{n}}$ and ${\mathcal{U}}_{\textup{n}}$ , respectively, when $T_{d}$ is small. As a result, with a sufficiently high bandwidth for ${\mathcal{C}}(s)$ , and sufficiently small $T$ and $T_{d}$ , Assumption 2 roughly states that the initial state stays in $\mathcal{X}$ , and the constraint set ${\mathcal{U}}$ is sufficiently large to ensure enough control authority for tracking an initial reference command $v(0)$ and additionally for compensating the uncertainty in $\mathcal{X}$ .

Under the preceding assumption, the design procedure for ${\mathcal{L}_{1}}$ -RG is summarized in Algorithm 2. Compared to step 3 of Algorithm 1, we additionally constrain $x_{\textup{r}}(t)$ and $x(t)$ to stay in $\mathcal{X}$ for all $t\geq 0$ in step 4 of Algorithm 2. Such constraints can potentially limit the size of uncertainties that need to be compensated and significantly reduce the conservatism of the proposed solution.

Algorithm 2

{\mathcal{L}_{1}}

-RG Design

1:An continuous-time uncertain system 5 subject to Assumption 1, constraint sets

\mathcal{X}

and

{\mathcal{U}}

as in 2,

\mathcal{X}_{0}

\mathcal{V}

(admissible set for

v(t)

), baseline control law in 3, initial parameters

A_{e}

{\mathcal{C}}(s)

and

T

to define an

{\mathcal{L}_{1}}

AC,

\gamma_{1}

T_{d}

and

\epsilon

for RG design, tol

2:procedure

{\mathcal{L}_{1}}

AC-DesignUnderConstraints

3: Compute

\left\lVert v\right\rVert_{\mathcal{L}_{\infty}}

given

\mathcal{V}

4: while 31a with

\mathcal{X}_{r}=\Omega(\rho_{r})\cap\mathcal{X}

or 31b with

\mathcal{X}_{a}=\Omega(\rho_{r}+\gamma_{1})\cap\mathcal{X}

does not hold with any

\rho_{r}

5: Increase the bandwidth of

{\mathcal{C}}(s)

\triangleright

See Remark 8.

6: end while

\triangleright

\mathcal{X}_{r}

and

b_{f,\mathcal{X}_{r}}

will be computed.

7: Set

b_{f,\mathcal{X}_{r}}^{old}=b_{f,\mathcal{X}_{r}}

8: Run DeriveSepStateBnds of Algorithm 1 with

b_{f,\mathcal{X}_{r}}

, and obtain

\check{\rho}_{r}^{i}

\rho^{i}

and

\tilde{\rho}^{i}

for

i\in\mathbb{Z}_{1}^{n}

9: Set

\mathcal{X}_{r}\!=\!\left\{\!z\!\in\!\mathbb{R}^{n}\!:\!\left\lvert z_{i}\right\rvert\!\leq\!\check{\rho}_{r}^{i}\right\}\cap\mathcal{X}

and update

b_{f,\mathcal{X}_{r}}

10: if

b_{f,\mathcal{X}_{r}}^{old}-b_{f,\mathcal{X}_{r}}>\textup{tol}

then

11: Set

b_{f,\mathcal{X}_{r}}^{old}=b_{f,\mathcal{X}_{r}}

and go to step 8

12: end if

13: Set

\mathcal{X}_{a}=\{z\in\mathbb{R}^{n}:\left\lvert z_{i}\right\rvert\leq\rho^{i},\ i\in\mathbb{Z}_{1}^{n}\}\cap\mathcal{X}

14: Run DeriveSepInputBnds of Algorithm 1 with

{\mathcal{C}}(s)

from step 6 and

\mathcal{X}_{r}

from step 9, and obtain

\rho_{u_{\textup{a}}}^{j}

and

\tilde{\rho}_{u}^{j}

for

j\in\mathbb{Z}_{1}^{m}

15: Run DecideSampleTime of Algorithm 1 with

{\mathcal{C}}(s)

from step 6 and

\mathcal{X}_{a}

from step 13, and obtain

T

16: Compute

\tilde{\mathcal{X}}

and

\tilde{\mathcal{U}}

with

\{\tilde{\rho}^{i}\}_{i\in\mathbb{Z}_{1}^{n}}

and

\{\tilde{\rho}_{u}^{j}\}_{j\in\mathbb{Z}_{1}^{m}}

via 71

17:end procedure

18:procedure RG-Design

19: Compute

\mathcal{X}_{\textup{n}}

and

{\mathcal{U}}_{\textup{n}}

with

\mathcal{X}

{\mathcal{U}}

\tilde{\mathcal{X}}

and

\tilde{\mathcal{U}}

via 12

20: Formulate the nominal discrete-time model 14 with the sample time

T_{d}

21: Compute

\hat{\mathcal{X}}_{\textup{n}}

and

\hat{\mathcal{U}}_{\textup{n}}

via 17

\triangleright

With a small

T_{d}

, one may set

\hat{\mathcal{X}}_{\textup{n}}\!=\!\mathcal{X}_{\textup{n}}

\hat{\mathcal{U}}_{\textup{n}}\!=\!{\mathcal{U}}_{\textup{n}}

for practical implementation.

22: Compute the set

{\tilde{O}}_{\infty}

dependent on

\epsilon

according to 26

23:end procedure

24:An

{\mathcal{L}_{1}}

-RG consisting of a RG designed for the nominal system 9a and an

{\mathcal{L}_{1}}

AC to compensate for uncertainties

We are ready to state the guarantees regarding tracking performance and constraint enforcement provided by ${\mathcal{L}_{1}}$ -RG.

Theorem 4.

Consider an uncertain system (5) subject to Assumption 1 and the state and control constraints in 2. Suppose that an ${\mathcal{L}_{1}}$ AC (defined by 36, 35 and 34) and a RG are designed by following Algorithm 2. If Assumption 2 hold, then, under the baseline control law 3 and the ${\mathcal{L}_{1}}$ -RG consisting of the compositional control law 4, the ${\mathcal{L}_{1}}$ AC and the RG for computing the reference command $v(t)$ according to 15 and 27, we have

$\displaystyle x(t)\in\textup{int}(\mathcal{X}),\ u(t)\in\textup{int}({\mathcal{U}}),\quad$	$\displaystyle\forall t\geq 0,$	(74)
$\displaystyle x(t)-x_{\textup{n}}(t)\in\textup{int}(\tilde{\mathcal{X}}),\quad$	$\displaystyle\forall t\geq 0,$	(75)
$\displaystyle y(t)-y_{\textup{n}}(t)\in\{z\in\mathbb{R}^{m}:\left\lvert z_{i}\right\rvert\leq\tilde{\rho}_{y}^{j}\},\quad$	$\displaystyle\forall t\geq 0,$	(76)

where $x_{\textup{n}}(t)$ and $y_{\textup{n}}(t)$ are the states and outputs of the nominal system 9 under the reference command input $v(t)$ , and

\tilde{\rho}_{y}^{j}\!\triangleq\!\sum_{i=1}^{n}\left\lvert C[j,i]\right\rvert\tilde{\rho}^{i},\quad\forall j\in\mathbb{Z}_{1}^{m}.

(77)

Proof.

Equation 73 in Assumption 2 implies $(\mathrm{v}(0),\mathrm{x}_{\textup{n}}(0))\in{\tilde{O}}_{\infty}$ (due to $\mathrm{x_{\textup{n}}}(0)=x_{0}$ ), and $\mathrm{u}_{\textup{n}}(0)=K_{x}\mathrm{x_{\textup{n}}}(0)+K_{v}\mathrm{v}(0)\in\hat{\mathcal{U}}_{\textup{n}}$ . Thus, the reference command $\mathrm{v}(k)$ produced by 27 ensures $\mathrm{x}_{\textup{n}}(k)\in\hat{\mathcal{X}}_{\textup{n}}$ and $\mathrm{u}_{\textup{n}}(k)\in\hat{\mathcal{U}}_{\textup{n}}$ for all $k\in\mathbb{Z}_{+}$ , which, due to Lemma 1, implies

x_{\textup{n}}(t)\in\mathcal{X}_{\textup{n}},\ u_{\textup{n}}(t)\in{\mathcal{U}}_{\textup{n}},\quad\forall t\geq 0.

(78)

Compared to 3, 16 and 20 of Algorithm 1, we restrain $\mathcal{X}_{r}$ and $\mathcal{X}_{a}$ to be subsets of $\mathcal{X}$ in 4, 9 and 13 of Algorithm 2. As a result, if 74 and

x_{\textup{r}}(t)\in\mathcal{X},\quad\forall t\geq 0,

(79)

jointly hold, condition 75 holds according to Theorem 3, while 65 holds according to Lemma 6.

We next prove 74 and 79 by contradiction. Assume 74 or 79 do not hold. The initial condition 73 implies that $x_{0}\in\mathcal{X}_{\textup{n}}\subset\textup{int}(\mathcal{X})$ and $u_{\textup{n}}(0)\in{\mathcal{U}}_{\textup{n}}\subset\textup{int}({\mathcal{U}})$ . As a result, we have $x(0)\in\mathcal{X}$ , $x_{\textup{r}}(0)\in\mathcal{X}$ and $u(0)\in{\mathcal{U}}$ . Since $x(t)$ , $x_{\textup{r}}(t)$ and $u(t)$ are continuous, there must exist a time instant $\tau$ , such that


$\displaystyle x(t)$	$\displaystyle\!\in\!\textup{int}(\mathcal{X}),\ x_{\textup{r}}(t)\!\in\!\textup{int}(\mathcal{X})\textup{ and }u(t)\!\in\!\textup{int}({\mathcal{U}}),\ \forall t\!\in\![0,\tau)$	(80a)
$\displaystyle x(\tau)$	$\displaystyle\!\in\!\textup{bnd}(\mathcal{X})\textup{ or }x_{\textup{r}}(\tau)\!\in\!\textup{bnd}(\mathcal{X})\textup{ or }u(\tau)\!\in\!\textup{bnd}({\mathcal{U}}).$	(80b)

Now consider the interval $[0,\tau]$ . According to Lemma 6, due to 80 and the definitions in 71a and 72a, we have $x_{\textup{r}}(t)-x_{\textup{n}}(t)\in\{z\in\mathbb{R}^{n}:\left\lvert z_{i}\right\rvert\leq\left\lVert\mathcal{G}_{\check{x}m}(s)\right\rVert_{\mathcal{L}_{1}}b_{f,\mathcal{X}_{r}}\}\subset\textup{int}(\tilde{\mathcal{X}})$ , which, together with 12 and 78, implies

x_{\textup{r}}(t)\in\textup{int}(\mathcal{X}),\quad\forall t\in[0,\tau].

(81)

Similarly, according to Theorem 3, due to 80 and the definition in 71, we have $x(t)-x_{\textup{n}}(t)\in\textup{int}(\tilde{\mathcal{X}})$ , and $u(t)-u_{\textup{n}}(t)\in\textup{int}(\tilde{\mathcal{U}})$ , for any $t\in[0,\tau]$ , which, together with 12 and 78, implies

x(t)\in\textup{int}(\mathcal{X}),\ u(t)\in\textup{int}({\mathcal{U}}),\quad\forall t\in[0,\tau].

(82)

Both 81 and 82 contradict 80b, which proves 74 and 79. By applying the inference right before 82 again for $t\geq 0$ , we obtain 75, which, together with $y_{j}(t)-y_{\textup{n},j}(t)=\sum_{i=1}^{n}C[j,i]\left(x_{i}(t)-x_{\textup{n},i}(t)\right)$ , leads to 76.∎

VI Simulation Results

We now apply ${\mathcal{L}_{1}}$ -RG to the longitudinal dynamics of an F-16 aircraft. The model was adapted from [26] with slight modifications to remove the actuator dynamics, in which the state vector $x(t)=[\gamma(t),q(t),\alpha(t)]^{\top}$ consists of the flight path angle, pitch rate and angle of attack, and the control input vector $u(t)=[\delta_{e}(t),\delta_{f}(t)]$ includes the elevator deflection and flaperon deflection. The output vector is $y(t)=[\theta(t),\gamma(t)]^{\top}$ , where $\theta(t)=\gamma(t)+\alpha(t)$ is the pitch angle; the reference input vector is $r(t)=[\theta_{c}(t),\gamma_{c}(t)]^{\top}$ , where $\theta_{c}$ and $\gamma_{c}$ are the commanded pitch angle and flight path angle, respectively. The system is subject to state and control constraints:

\left\lvert\alpha(t)\right\rvert\leq 4\!\textup{ deg},\ \left\lvert\delta_{e}(t)\right\rvert\leq 25\!\textup{ deg},\ \left\lvert\delta_{f}(t)\right\rvert\leq 22\!\textup{ deg},

(83)

where the state constraint can also be represented as $x(t)\in\mathcal{X}\triangleq[-10^{3},10^{3}]\times[-10^{3},10^{3}]\times[-4,4]$ following the convention in 2. Furthermore, we assume

\left\lVert r\right\rVert_{\mathcal{L}_{\infty}}\leq 10,\quad x(0)\in\mathcal{X}_{0}=\Omega(0.1).

(84)

The open-loop dynamics are given by

\displaystyle\dot{x}

\displaystyle\!=\!\begin{bmatrix}0&0.0067&1.34\\ 0&-0.869&43.2\\ 0&0.993&-1.34\end{bmatrix}\!x+\begin{bmatrix}0.169&0.252\\ -17.3&-1.58\\ -0.169&-0.252\end{bmatrix}\!(u\!+\!f(t,x)),

(85)

where $f(t,x)=[-0.8\sin(0.4\pi t)-0.1\alpha^{2},0.1-0.2\alpha]^{\top}$ is the uncertainty dependent on both time and $\alpha$ . The feedback and feedforward gains of the baseline controller 3 are selected to be $K_{x}=[3.25,0.891,7.12;-6.10,-0.898,-10.0]$ and $K_{v}=[-3.93,0.679;2.57,3.53]$ . Via simple calculations, we can see that $f(t,x)\in\mathcal{W}=[-2.4,2.4]\times[-0.9,0.9]$ when $x\in\mathcal{X}$ holds.

VI-A ${\mathcal{L}_{1}}$ -RG Design

It can be verified that given any set ${\mathcal{Z}}$ , $L_{f_{1},{\mathcal{Z}}}=0.2\max_{\alpha\in{\mathcal{Z}}_{3}}\left\lvert\alpha\right\rvert$ , $L_{f_{2},{\mathcal{Z}}}=0.2$ , $b_{f_{1},{\mathcal{Z}}}=0.8+0.1\max_{\alpha\in{\mathcal{Z}}_{3}}\alpha^{2}$ , $b_{f_{2},{\mathcal{Z}}}=0.1+0.2\max_{\alpha\in{\mathcal{Z}}_{3}}\left\lvert\alpha\right\rvert$ satisfy Assumption 1. For design of the ${\mathcal{L}_{1}}$ AC in 36, 35 and 34, we select $A_{e}=-10I_{3}$ and parameterize the filter as ${\mathcal{C}}(s)=\frac{k_{f}}{s+k_{f}}I_{2}$ , where $k_{f}>0$ denotes the bandwidth for both input channels. Table II lists the bounds on $x_{i}(t)-x_{\textup{n},i}(t)$ and $u_{j}(t)-u_{\textup{n},j}(t)$ theoretically computed by applying Algorithm 2 under different ${\mathcal{C}}(s)$ and $T$ with and without using the scaling technique in Section IV. When applying the scaling technique, we set $\mathrm{T}_{x}^{i}[k]=0.01$ for each $i,k\in\mathbb{Z}_{1}^{3}$ and $k\neq i$ , which satisfies 56. Several observations can be made from Table II. First, by increasing the filter bandwidth $k_{f}$ and decreasing $T$ , we are able to obtain a smaller $\gamma_{1}$ satisfying 42 and achieve tighter bounds for all states and inputs. In fact, if $k_{f}=10^{3}$ and $T=10^{-7}$ , then $\tilde{\rho}_{u}^{1}$ $\tilde{\rho}_{u}^{2}$ are fairly close to the bounds on $f_{1}(t,x)$ and $f_{2}(t,x)$ for $x\in\mathcal{X}$ , respectively, which is consistent with Remark 13. Additionally, with scaling, we could significantly reduce $\tilde{\rho}^{1}$ and $\tilde{\rho}^{3}$ , the bounds on $\gamma(t)-\gamma_{\textup{n}}(t)$ and $\alpha(t)-\alpha_{\textup{n}}(t)$ , and $\tilde{\rho}_{u}^{1}$ and $\tilde{\rho}_{u}^{2}$ , the bounds on $\delta_{e}(t)-\delta_{e,\textup{n}}(t)$ and $\delta_{f}(t)-\delta_{f,\textup{n}}(t)$ . Moreover, with $\mathrm{T}_{x}^{3}$ , we can verify that the condition 63 holds with $b_{f,\mathcal{X}_{r}}$ as long as $\left\lVert v\right\rVert_{\mathcal{L}_{\infty}}<1.868$ . As mentioned in Remark 15, the conditions 42 and 31 and the resulting bounds $\tilde{\rho}^{i}$ and $\tilde{\rho}_{u}^{j}$ could be conservative. As a result, a larger reference command can potentially be allowed in a practical implementation while keeping $x(t)$ to stay in $\mathcal{X}$ , as demonstrated in the following simulations.

TABLE II: Performance bounds obtained under different filter bandwidth and sample time

T

with and without (W/O) scaling

	$k_{f}=200,\ T=10^{-5}$		$k_{f}=10^{3},\ T=10^{-7}$
	W/O scaling	With scaling	W/O scaling	With scaling
$\gamma_{1}~{}/~{}b_{f,\mathcal{X}_{r}}$	$0.01~{}/~{}2.40$		$2\times 10^{-4}~{}/~{}2.40$
$[\tilde{\rho}^{1},\tilde{\rho}^{2},\tilde{\rho}^{3}]$	$.41[1,1,1]$	$[.015,.41,.038]$	$.043[1,1,1]$	$[.12,4.3,.35]10^{-2}$
$[\tilde{\rho}_{u}^{1},\tilde{\rho}_{u}^{2}]$	$[8.15,9.02]$	$[4.20,2.85]$	$[2.94,1.69]$	$[2.51,1.03]$

Following Algorithm 2, we used the bounds $\tilde{\rho}^{3}$ , $\tilde{\rho}_{u}^{1}$ and $\tilde{\rho}_{u}^{2}$ obtained for the case when $k_{f}=200$ and $T=10^{-5}$ , to tighten the original constraints 83 and then used the tightened constraints to design the RG, for which we chose $T_{d}=0.005$ . Considering that $T_{d}$ was small, we did not consider inter-sample constraint violations and simply set $\hat{\mathcal{X}}_{\textup{n}}=\mathcal{X}_{\textup{n}}$ and $\hat{\mathcal{U}}_{\textup{n}}={\mathcal{U}}_{\textup{n}}$ instead of 17. For comparisons, we also designed a robust RG (RRG) that treats the uncertainty $f(t,x)$ as a bounded disturbance $w(t)\in\mathcal{W}$ , where $\mathcal{W}$ is introduced below 85. RRG design also uses $O_{\infty}$ set (defined in 22 for RG design); however, the prediction of the output, which corresponds to $\hat{\mathrm{y}}_{c}(k|v,x)$ for RG design, becomes a set-valued one taking into account all possible realizations of the disturbance $w(t)$ (see [3] for details). We additionally designed a standard RG by simply ignoring the uncertainty $f(t,x)$ .

VI-B Simulation Results

As mentioned in Remark 15, the value of $T$ theoretically computed according to 42 is often unnecessarily small. For the subsequent simulations, we simply adopted an estimation sample time of 1 millisecond, i.e., $T=0.001$ s. As one can see in the subsequent simulation results, all the bounds derived in Section VI-A for $k_{f}=200$ and $T=10^{-5}$ still hold. The reference command $r(t)$ was set to be $[9,6.5]$ deg for $t\in[0,7.5]$ s, and $[0,0]$ deg for $t\in[7.5,15]$ s. The results are shown in Figs. 4, 3 and 2. In terms of constraint enforcement, Fig. 3 shows that both RRG and ${\mathcal{L}_{1}}$ -RG successfully enforced all the constraints, while violation of the constraints on the state $\alpha(t)$ and the input $\delta_{f}(t)$ happened under RG. However, from Fig. 2, one can see that the RRG was quite conservative, leading to a large difference between the modified reference and original reference commands and subsequently large tracking errors for both $\theta(t)$ and $\gamma(t)$ throughout the simulation. In comparison, the modified reference command under RG reached the original reference command, leading to better tracking performance. Finally, ${\mathcal{L}_{1}}$ -RG yielded the best tracking performance, driving both $\theta(t)$ and $\gamma(t)$ very close to their commanded values at steady state. While noticeable under RG and RRG, the uncertainty-induced swaying in the outputs at steady state was negligible under ${\mathcal{L}_{1}}$ -RG, thanks to the active compensation of the uncertainty by the ${\mathcal{L}_{1}}$ AC. From Fig. 4, one can see that the estimation of the uncertainty within the ${\mathcal{L}_{1}}$ -RG was quite accurate.

We next check whether the derived uniform bounds on the errors in states, $x(t)-x_{\textup{n}}(t)$ , and on the adaptive inputs, $u_{\textup{a}}(t)$ , hold in the simulation. It can be seen from Fig. 5 that the bounds on both $u_{\textup{a},1}(t)$ and $u_{\textup{a},2}(t)$ were respected in the simulation and moreover are fairly tight. Figure 6 reveals that all actual states under ${\mathcal{L}_{1}}$ -RG were fairly close to their nominal counterparts, and moreover, the bound on $x_{i}(t)-x_{\textup{n},i}(t)$ for each $i\in\mathbb{Z}_{1}^{3}$ was respected. Note that $x_{\textup{n}}(t)$ in Fig. 6 was produced by applying the same reference command $v(t)$ yielded by ${\mathcal{L}_{1}}$ -RG to the nominal system 9.

VII Conclusion

In this paper, we developed ${\mathcal{L}_{1}}$ -RG, an adaptive reference governor (RG) framework, for control of linear systems with time- and state-dependent uncertainties subject to both state and input constraints. At the core of ${\mathcal{L}_{1}}$ -RG is an ${\mathcal{L}_{1}}$ adaptive controller that provides guaranteed uniform bounds on the errors between states and inputs of the uncertain system and those of a nominal (i.e., uncertainty-free) system. With such uniform error bounds for constraint tightening, a RG designed for the nominal system with tightened constraints guarantees the satisfaction of the original constraints by the actual states and inputs. Simulation results validate the efficacy and advantages of the proposed approach.

In the future, we will address unmatched uncertainties following [25], and extend the proposed framework to the nonlinear setting leveraging the results in [27, 28]. Additionally, we would like to extend the proposed solution to adaptive MPC.

References

[1] E. F. Camacho and C. B. Alba, Model predictive control. Springer Science & Business Media, 2013.
[2] J. B. Rawlings, D. Q. Mayne, and M. M. Diehl, Model Predictive Control: Theory, Computation, and Design, 2nd Ed. Nob Hill Publishing, 2020.
[3] E. Garone, S. Di Cairano, and I. Kolmanovsky, “Reference and command governors for systems with constraints: A survey on theory and applications,” Automatica, vol. 75, pp. 306–328, 2017.
[4] I. Kolmanovsky and E. G. Gilbert, “Theory and computation of disturbance invariant sets for discrete-time linear systems,” Mathematical Problems in Engineering, vol. 4, no. 4, pp. 317–367, 1998.
[5] E. C. Kerrigan, Robust constraint satisfaction: Invariant sets and predictive control. PhD thesis, University of Cambridge, 2001.
[6] W. Langson, I. Chryssochoos, S. Raković, and D. Q. Mayne, “Robust model predictive control using tubes,” Automatica, vol. 40, no. 1, pp. 125–133, 2004.
[7] S. Rakovic, Robust control of constrained discrete time systems: Characterization and implementation. PhD thesis, University of London, 2005.
[8] D. Q. Mayne, S. V. Raković, R. Findeisen, and F. Allgöwer, “Robust output feedback model predictive control of constrained linear systems,” Automatica, vol. 42, no. 7, pp. 1217–1222, 2006.
[9] D. Q. Mayne, E. C. Kerrigan, E. Van Wyk, and P. Falugi, “Tube-based robust nonlinear model predictive control,” International Journal of Robust and Nonlinear Control, vol. 21, no. 11, pp. 1341–1353, 2011.
[10] J. Köhler, R. Soloperto, M. A. Müller, and F. Allgöwer, “A computationally efficient robust model predictive control framework for uncertain nonlinear systems,” IEEE Transactions on Automatic Control, vol. 66, no. 2, pp. 794–801, 2020.
[11] B. T. Lopez, J.-J. E. Slotine, and J. P. How, “Dynamic tube MPC for nonlinear systems,” in Proceedings of American Control Conference, pp. 1655–1662, 2019.
[12] B. Kouvaritakis and M. Cannon, Model Predictive Control: Classical, Robust and Stochastic. Advanced Textbooks in Control and Signal Processing, Springer, London, 2015.
[13] K. Zhang and Y. Shi, “Adaptive model predictive control for a class of constrained linear systems with parametric uncertainties,” Automatica, vol. 117, p. 108974, 2020.
[14] V. Adetola, D. DeHaan, and M. Guay, “Adaptive model predictive control for constrained nonlinear systems,” Systems & Control Letters, vol. 58, no. 5, pp. 320–326, 2009.
[15] K. Pereida, L. Brunke, and A. P. Schoellig, “Robust adaptive model predictive control for guaranteed fast and accurate stabilization in the presence of model errors,” International Journal of Robust and Nonlinear Control, vol. 31, no. 18, pp. 8750–8784, 2021.
[16] X. Wang, L. Yang, Y. Sun, and K. Deng, “Adaptive model predictive control of nonlinear systems with state-dependent uncertainties,” Int. J. Robust Nonlinear Control, vol. 27, no. 17, pp. 4138–4153, 2017.
[17] M. Bujarbaruah, S. H. Nair, and F. Borrelli, “A semi-definite programming approach to robust adaptive MPC under state dependent uncertainty,” in European Control Conference, pp. 960–965, IEEE, 2020.
[18] N. Hovakimyan and C. Cao, $\mathcal{L}_{1}$ Adaptive Control Theory: Guaranteed Robustness with Fast Adaptation. Philadelphia, PA: Society for Industrial and Applied Mathematics, 2010.
[19] T. Polóni, U. Kalabić, K. McDonough, and I. Kolmanovsky, “Disturbance canceling control based on simple input observers with constraint enforcement for aerospace applications,” in IEEE Conference on Control Applications, pp. 158–165, IEEE, 2014.
[20] G. Pin, D. M. Raimondo, L. Magni, and T. Parisini, “Robust model predictive control of nonlinear systems with bounded and state-dependent uncertainties,” IEEE Transactions on Automatic Control, vol. 54, no. 7, pp. 1681–1687, 2009.
[21] E. G. Gilbert and K. T. Tan, “Linear systems with state and control constraints: The theory and application of maximal output admissible sets,” IEEE Transactions on Automatic control, vol. 36, no. 9, pp. 1008–1020, 1991.
[22] A. Bemporad and E. Mosca, “Nonlinear predictive reference filtering for constrained tracking,” in Proceedings of European Control Conference, pp. 1720–1725, 1995.
[23] E. G. Gilbert, I. Kolmanovsky, and K. T. Tan, “Discrete-time reference governors and the nonlinear control of systems with state and control constraints,” International Journal of Robust and Nonlinear control, vol. 5, no. 5, pp. 487–504, 1995.
[24] C. Scherer, P. Gahinet, and M. Chilali, “Multiobjective output-feedback control via LMI optimization,” IEEE Transactions on Automatic Control, vol. 42, no. 7, pp. 896–911, 1997.
[25] P. Zhao, S. Snyder, N. Hovakimyana, and C. Cao, “Robust adaptive control of linear parameter-varying systems with unmatched uncertainties,” arXiv:2010.04600, 2021.
[26] K. M. Sobel and E. Y. Shapiro, “A design methodology for pitch pointing flight control systems,” Journal of Guidance, Control, and Dynamics, vol. 8, no. 2, pp. 181–187, 1985.
[27] A. Lakshmanan, A. Gahlawat, and N. Hovakimyan, “Safe feedback motion planning: A contraction theory and $\mathcal{L}_{1}$ -adaptive control based approach,” in Proceedings of 59th IEEE Conference on Decision and Control (CDC), pp. 1578–1583, 2020.
[28] P. Zhao, A. Lakshmanan, K. Ackerman, A. Gahlawat, M. Pavone, and N. Hovakimyan, “Tube-certified trajectory tracking for nonlinear systems with robust control contraction metrics,” IEEE Robotics and Automation Letters, pp. 1–1, 2022.

Appendix A Proofs

A-A Proof of Lemma 1

Proof.

Since the continuous-time system 9 has the same states as the discrete-time system 14 at all sampling instants, if 19 holds for 14, then we have

x_{\textup{n}}(kT_{d})\in\hat{\mathcal{X}}_{\textup{n}}\subset\mathcal{X}_{\textup{n}},\ u_{\textup{n}}(kT_{d})\in\hat{\mathcal{U}}_{\textup{n}}\subset{\mathcal{U}}_{\textup{n}},\quad\forall k\in\mathbb{Z}_{+},

(86)

for 9. Next we analyze the behavior of 9 between adjacent sampling instants. Towards this end, consider any $t=k^{\ast}T_{d}+\tau$ for some $k^{\ast}\in\mathbb{Z}_{+}$ and $\tau\in[0,T_{d})$ . From 9, we have $x_{\textup{n}}(t)=x_{\textup{n}}(k^{\ast}T_{d}+\tau)=~{}e^{A_{m}\tau}x_{\textup{n}}(k^{\ast}T_{d})+\int_{k^{\ast}T_{d}}^{k^{\ast}T_{d}+\tau}e^{A_{m}(k^{\ast}T_{d}+\tau-\xi)}B_{v}v(\tau)d\xi=e^{A_{m}\tau}x_{\textup{n}}(k^{\ast}T_{d})+\int_{k^{\ast}T_{d}}^{k^{\ast}T_{d}+\tau}e^{A_{m}(k^{\ast}T_{d}+\tau-\xi)}d\xi B_{v}v(k^{\ast}T_{d})=e^{A_{m}\tau}x_{\textup{n}}(k^{\ast}T_{d})+A_{m}^{-1}\left(e^{A_{m}\tau}-I_{n}\right)B_{v}v(k^{\ast}T_{d}),$ where the third equality is due to the fact that $v(k^{\ast}T_{d}+\tau)=v(k^{\ast}T_{d})$ for all $\tau\in[0,T_{d})$ . As a result, we have $x_{\textup{n}}(t)-x_{\textup{n}}(k^{\ast}T_{d})=x_{\textup{n}}(k^{\ast}T_{d}+\tau)-x_{\textup{n}}(k^{\ast}T_{d})=\left(e^{A_{m}\tau}-I_{n}\right)\left(x_{\textup{n}}(k^{\ast}T_{d})+A_{m}^{-1}B_{v}v(k^{\ast}T_{d})\right)$ . Thus, we have


$\displaystyle\left\lVert x_{\textup{n}}(t)-x_{\textup{n}}(k^{\ast}T_{d})\right\rVert_{\infty}$	$\displaystyle\leq\nu(T_{d}),$	(87a)
$\displaystyle\left\lVert u_{\textup{n}}(t)-u_{\textup{n}}(k^{\ast}T_{d})\right\rVert_{\infty}$	$\displaystyle\leq\left\lVert K_{x}\right\rVert_{\infty}\nu(T_{d}),$	(87b)

where $\nu(T_{d})$ defined in 18, while 87b is due to the fact that $u_{\textup{n}}(t)-u_{\textup{n}}(k^{\ast}T_{d})=K_{x}\left(x_{\textup{n}}(t)-x_{\textup{n}}(k^{\ast}T_{d})\right)$ . Considering 87, 86 and 17, we have $x_{\textup{n}}(t)\in\mathcal{X}_{\textup{n}}$ and $u_{\textup{n}}(t)\in\mathcal{X}_{\textup{n}}$ for all $t\geq 0$ . The proof is complete. ∎

A-B Proof of Lemma 3

Proof.

Rewriting the dynamics of the reference system in (43) in the Laplace domain yields

x_{\textup{r}}(s)=\mathcal{G}_{xm}(s)\mathfrak{L}\left[f(t,x_{\textup{r}}(t))\right]+\mathcal{H}_{xv}(s)v(s)+x_{\textup{in}}(s).

(88)

Therefore, from Lemma 2, for any $\xi>0$ , we have

\left\lVert x_{\textup{r}}\right\rVert_{\mathcal{L}_{\infty}^{[0,\xi]}}\leq\left\lVert\mathcal{G}_{xm}(s)\right\rVert_{\mathcal{L}_{1}}\left\lVert\eta_{\textup{r}}\right\rVert_{\mathcal{L}_{\infty}^{[0,\xi]}}+\left\lVert\mathcal{H}_{xv}(s)\right\rVert_{\mathcal{L}_{1}}\left\lVert v\right\rVert_{\mathcal{L}_{\infty}}+\left\lVert x_{\textup{in}}\right\rVert_{\mathcal{L}_{\infty}},

(89)

where $\eta_{\textup{r}}(t)$ is defined in 44. If 45 is not true, since $x_{\textup{r}}(t)$ is continuous and $\left\lVert x_{\textup{r}}(0)\right\rVert_{\infty}<\rho_{r}$ , there exists a $\tau\!>\!0$ such that

\left\lVert x_{\textup{r}}(t)\right\rVert_{\infty}<\rho_{r},\ \forall t\in[0,\tau),\ \textup{and}\ \left\lVert x_{\textup{r}}(\tau)\right\rVert_{\infty}=\rho_{r},

(90)

which implies $x_{\textup{r}}(t)\in\Omega(\rho_{r})$ for any $t$ in $[0,\tau]$ . Further considering 7b that results from Assumption 1, we have

\left\lVert\eta_{\textup{r}}\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}\leq b_{f,\Omega(\rho_{r})}.

(91)

Plugging the preceding inequality into 89 leads to

\rho_{r}\leq\left\lVert\mathcal{G}_{xm}(s)\right\rVert_{\mathcal{L}_{1}}b_{f,\Omega(\rho_{r})}+\left\lVert\mathcal{H}_{xv}(s)\right\rVert_{\mathcal{L}_{1}}\left\lVert v\right\rVert_{\mathcal{L}_{\infty}}+\rho_{\textup{in}},

(92)

which contradicts the condition (31a). Therefore, (45) is true. Equation (46) immediately follows from (45) and 43. ∎

A-C Proof of Lemma 4

Proof.

Due to 48, we have $x(t)\in\Omega(\rho)$ for any $t$ in $[0,\tau]$ . Further considering 7b that results from Assumption 1, we have

\left\lVert f(t,x(t))\right\rVert_{\infty}=\left\lVert\eta(t)\right\rVert_{\infty}\leq b_{f,\Omega(\rho)},\quad\forall t\in[0,\tau].

(93)

From (47), for any $0\leq t<T$ and $i\in\mathbb{Z}_{0}$ , we have

	$\displaystyle\tilde{x}(iT+t)=$	$\displaystyle~{}e^{A_{e}t}\tilde{x}(iT)+\int_{iT}^{iT+t}e^{A_{e}(iT+t-\xi)}[B\ B^{\perp}]\begin{bmatrix}\hat{\sigma}_{1}(iT)\\ \hat{\sigma}_{2}(iT)\end{bmatrix}d\xi-\int_{iT}^{iT+t}e^{A_{e}(iT+t-\xi)}B\eta(\xi)d\xi$
	$\displaystyle=$	$\displaystyle~{}e^{A_{e}t}\tilde{x}(iT)+\int_{0}^{t}e^{A_{e}(t-\xi)}[B\ B^{\perp}]\begin{bmatrix}\hat{\sigma}_{1}(iT)\\ \hat{\sigma}_{2}(iT)\end{bmatrix}d\xi-\int_{0}^{t}e^{A_{e}(t-\xi)}B\eta(iT+\xi)d\xi.$		(94)

Considering the adaptive law (35), the preceding equality implies

\displaystyle\tilde{x}((i+1)T)\!=\!-\!\int_{0}^{T}\!e^{A_{e}(T-\xi)}B\eta(iT\!+\!\xi)d\xi.

(95)

Therefore, for any $i\in\mathbb{Z}_{0}$ with $(i+1)T\leq\tau$ , we have

\displaystyle\left\lVert\tilde{x}((i+1)T)\right\rVert_{\infty}

\displaystyle\leq\!\int_{0}^{T}\!\left\lVert e^{A_{e}(T-\xi)}B\right\rVert_{\infty}\left\lVert\eta(iT\!+\!\xi)\right\rVert_{\infty}d\xi\leq\bar{\alpha}_{0}(T)b_{f,\Omega(\rho)},

(96)

where $\bar{\alpha}_{0}(T)$ is defined in 37a, and the last inequality is due to 93. Since $\tilde{x}(0)=0$ , we therefore have

\left\lVert\tilde{x}(iT)\right\rVert_{\infty}\leq\bar{\alpha}_{0}(T)b_{f,\Omega(\rho)}\leq\gamma_{0}(T),\;\forall iT\leq\tau,i\in\mathbb{Z}_{0}.

(97)

Now consider any $t\in(0,T]$ such that $iT+t\leq\tau$ with $i\in\mathbb{Z}_{0}$ . From (94) and the adaptive law 35, we have

$\displaystyle\left\lVert\tilde{x}(iT+t)\right\rVert_{\infty}\leq$	$\displaystyle\left\lVert e^{A_{e}t}\right\rVert_{\infty}\left\lVert\tilde{x}(iT)\right\rVert_{\infty}+\int_{0}^{t}\left\lVert e^{A_{e}(t-\xi)}\Phi^{-1}(T)e^{A_{e}T}\right\rVert_{\infty}\left\lVert\tilde{x}(iT)\right\rVert_{\infty}d\xi$
	$\displaystyle+\int_{0}^{t}\left\lVert e^{A_{e}(t-\xi)}B\right\rVert_{\infty}\left\lVert\eta(iT+\xi)\right\rVert_{\infty}d\xi$
$\displaystyle\leq$	$\displaystyle\left(\bar{\alpha}_{1}(T)+\bar{\alpha}_{2}(T)+1\right)\bar{\alpha}_{0}(T)b_{f,\Omega(\rho)}=\gamma_{0}(T),$	(98)

where $\bar{\alpha}_{i}(T)$ ( $i=0,1,2$ ) are defined in 37a, 37b and 37c, and the last inequality is partially due to the fact that $\int_{0}^{t}\left\lVert e^{A_{e}(t-\xi)}B\right\rVert_{\infty}d\xi\leq\int_{0}^{T}\left\lVert e^{A_{e}(T-\xi)}B\right\rVert_{\infty}\!d\xi\!=\!\bar{\alpha}_{0}(T)$ . Equations 97 and 98 imply (49). ∎

A-D Proof of Theorem 1

Proof.

We first prove 50c and 50d by contradiction. Assume (50c) or (50d) do not hold. Since $\left\lVert x_{\textup{r}}(0)-x(0)\right\rVert_{\infty}=0<\gamma_{1}$ and $\left\lVert u_{\textup{r}}(0)-u_{\textup{a}}(0)\right\rVert_{\infty}=0<\gamma_{2}$ , and $x(t)$ , $u_{\textup{a}}(t)$ , $x_{\textup{r}}(t)$ and $u_{\textup{r}}(t)$ are all continuous, there must exist an instant $\tau$ such that

\left\lVert x_{\textup{r}}(\tau)-x(\tau)\right\rVert_{\infty}=\gamma_{1}\textup{ or }\left\lVert u_{\textup{r}}(\tau)-u(\tau)\right\rVert_{\infty}=\gamma_{2},

(99)

while

\left\lVert x_{\textup{r}}(t)\!-\!x(t)\right\rVert_{\infty}\!<\!\gamma_{1},\ \left\lVert u_{\textup{r}}(t)\!-\!u(t)\right\rVert_{\infty}\!<\!\gamma_{2},\ \forall t\in[0,\tau).

(100)

This implies that at least one of the following equalities hold:

\left\lVert x_{\textup{r}}-x\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}=\gamma_{1},\quad\left\lVert u_{\textup{r}}-u_{\textup{a}}\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}=\gamma_{2}.

(101)

Note that $\left\lVert x_{\textup{r}}\right\rVert_{\mathcal{L}_{\infty}}\leq\rho_{r}<\rho$ according to Lemma 3 and $\left\lVert x\right\rVert_{\mathcal{L}_{\infty}}\leq\rho_{r}+\gamma_{1}=\rho$ from 101. Further considering 7a that results from Assumption 1, we have that

\left\lVert f(t,x_{\textup{r}}(t))\!-\!f(t,x(t))\right\rVert_{\infty}\!\leq\!L_{f,\Omega(\rho)}\!\left\lVert x_{\textup{r}}\!-\!x\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}\!,\ \forall t\!\in\![0,\tau].

(102)

The control laws in 36 and 43 indicate

\displaystyle u_{\textup{r}}(s)-u_{\textup{a}}(s)=-{\mathcal{C}}(s)\mathfrak{L}\left[f(t,x_{\textup{r}})-\hat{\sigma}_{1}(s)\right]={\mathcal{C}}(s)\mathfrak{L}\left[f(t,x)\!-\!f(t,x_{\textup{r}})\right]+{\mathcal{C}}(s)(\hat{\sigma}_{1}(s)\!-\!\mathfrak{L}\left[f(t,x)\right]).

(103)

Equation (47) indicates that

\hat{\sigma}_{1}(s)-\mathfrak{L}\left[f(t,x)\right])=B^{\dagger}(sI_{n}-A_{e})\tilde{x}(s).

(104)

Considering (5), 36 and 104, we have

x(s)=\mathcal{G}_{xm}(s)\mathfrak{L}\left[f(t,x)\right]+\mathcal{H}_{xv}(s)v(s)+x_{\textup{in}}(s)-\mathcal{H}_{xm}(s){\mathcal{C}}(s)B^{\dagger}(sI_{n}-A_{e})\tilde{x}(s),

(105)

which, together with 88, implies

x_{\textup{r}}(s)-x(s)=\mathcal{G}_{xm}(s)\mathfrak{L}\left[f(t,x_{\textup{r}})-f(t,x)\right]+\mathcal{H}_{xm}(s){\mathcal{C}}(s)B^{\dagger}(sI_{n}-A_{e})\tilde{x}(s).

(106)

Therefore, further considering (102) and Lemma 4, we have

\displaystyle\left\lVert x_{\textup{r}}-x\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}\leq

\displaystyle\left\lVert\mathcal{G}_{xm}\right\rVert_{\mathcal{L}_{1}}L_{f,\Omega(\rho)}\left\lVert x_{\textup{r}}-x\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}+\!\left\lVert\mathcal{H}_{xm}(s){\mathcal{C}}(s)B^{\dagger}(sI_{n}-A_{e})\right\rVert_{\mathcal{L}_{1}}\!\gamma_{0}(T).

The preceding equation, together with 31b, leads to

\displaystyle\left\lVert x_{\textup{r}}-x\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}

\displaystyle\!\leq\!\frac{\left\lVert\mathcal{H}_{xm}(s){\mathcal{C}}(s)B^{\dagger}(sI_{n}\!-\!A_{e})\right\rVert_{\mathcal{L}_{1}}}{1-\left\lVert\mathcal{G}_{xm}\right\rVert_{\mathcal{L}_{1}}L_{f,\Omega(\rho)}}\gamma_{0}(T),

(107)

which, together with the sample time constraint 42, indicates that

\left\lVert x_{\textup{r}}-x\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}<\gamma_{1}.

(108)

On the other hand, it follows from 102, 103, 104 and 108 that

	$\displaystyle\left\lVert u_{\textup{r}}-u_{\textup{a}}\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}$	$\displaystyle\leq\left\lVert{\mathcal{C}}(s)\right\rVert_{\mathcal{L}_{1}}L_{f,\Omega(\rho)}\left\lVert x_{\textup{r}}-x\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}+\left\lVert{\mathcal{C}}(s)B^{\dagger}(sI_{n}-A_{e})\right\rVert_{\mathcal{L}_{1}}\left\lVert\tilde{x}\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}$
		$\displaystyle<\left\lVert{\mathcal{C}}(s)\right\rVert_{\mathcal{L}_{1}}L_{f,\Omega(\rho)}\gamma_{1}+\left\lVert{\mathcal{C}}(s)B^{\dagger}(sI_{n}-A_{e})\right\rVert_{\mathcal{L}_{1}}\gamma_{0}(T).$

Further considering the definition in 40, we have

\left\lVert u_{\textup{r}}-u_{\textup{a}}\right\rVert_{\mathcal{L}_{\infty}^{[0,\tau]}}<\gamma_{2}.

(109)

Note that 108 and 109 contradict the equalities in 101, which proves 50c and 50d. The bounds in 50a and 50b follow directly from 50c, 50d, 45 and 46 and the definitions of $\rho$ and $\rho_{u_{\textup{a}}}$ in 32 and 41. The proof is complete. ∎

A-E Proof of Lemma 5

Proof.

From 9a and 43, we have

x_{\textup{r}}(s)-x_{\textup{n}}(s)=G_{xm}(s)\mathfrak{L}\left[f(t,x_{\textup{r}})\right]=G_{xm}(s)\mathfrak{L}\left[\eta_{\textup{r}}(t)\right].

(110)

According to Lemma 3, we have $x_{\textup{r}}(t)\in\Omega(\rho_{r})$ for any $t\geq 0$ . Further considering 7b that results from Assumption 1, we have $\left\lVert\eta_{\textup{r}}\right\rVert_{\mathcal{L}_{\infty}}\leq b_{f,\Omega(\rho_{r})}$ , which, together with 110, leads to 51. ∎

Integrated Adaptive Control and Reference Governors for Constrained Systems with State-Dependent Uncertainties

Abstract

Index Terms:

I Introduction

I-A Related Work

I-B Contributions

II Problem statement

Assumption 1.

Remark 1.

Remark 2.

Remark 3.

III Overview and Preliminaries

III-A Overview of the ℒ1{\mathcal{L}_{1}}-RG Framework

III-B Reference Governor Design for a Nominal System

Lemma 1.

Proof.

Remark 4.

Remark 5.

Remark 6.

III-C ℒ1{\mathcal{L}_{1}} Adaptive Control Design and Uniform Performance Bounds

Definition 1.

Lemma 2.

III-C1 ℒ1{\mathcal{L}_{1}} adaptive control architecture

Remark 7.

Remark 8.

III-C2 Uniform performance bounds

Lemma 3.

Lemma 4.

Theorem 1.

Remark 9.

Lemma 5.

Remark 10.

Theorem 2.

Remark 11.

IV ℒ1{\mathcal{L}_{1}}AC with Separate Bounds for States and Inputs

Lemma 6.

Proof.

Remark 12.

Theorem 3.

Proof.

Remark 13.

Remark 14.

Remark 15.

V ℒ1{\mathcal{L}_{1}}-RG: Adaptive Reference Governor for Constrained Control Under Uncertainties

V-A ℒ1{\mathcal{L}_{1}}-RG Design

Assumption 2.

Remark 16.

Theorem 4.

Proof.

VI Simulation Results

VI-A ℒ1{\mathcal{L}_{1}}-RG Design

VI-B Simulation Results

VII Conclusion

References

Appendix A Proofs

A-A Proof of Lemma 1

Proof.

A-B Proof of Lemma 3

Proof.

A-C Proof of Lemma 4

Proof.

A-D Proof of Theorem 1

Proof.

A-E Proof of Lemma 5

Proof.

III-A Overview of the ${\mathcal{L}_{1}}$ -RG Framework

III-C ${\mathcal{L}_{1}}$ Adaptive Control Design and Uniform Performance Bounds

III-C1 ${\mathcal{L}_{1}}$ adaptive control architecture

IV ${\mathcal{L}_{1}}$ AC with Separate Bounds for States and Inputs

V ${\mathcal{L}_{1}}$ -RG: Adaptive Reference Governor for Constrained Control Under Uncertainties

V-A ${\mathcal{L}_{1}}$ -RG Design

VI-A ${\mathcal{L}_{1}}$ -RG Design