Application of Terminal Region Enlargement Approach for Discrete Time Quasi Infinite Horizon NMPC

Chinmay Rajhans \IEEEmembershipMember, IEEE and Sowmya Gupta \IEEEmembershipMember, IEEE “This work is not supported by any funding agency.” C. Rajhans is with Department of Electrical Engineering, IIT Bombay, Mumbai, India (e-mail: [email protected]). S. Gupta is with Center for Research in Nano Technology and Science, IIT Bombay, Mumbai, India (e-mail: [email protected]).

Abstract

Ensuring nominal asymptotic stability of the Nonlinear Model Predictive Control (NMPC) controller is not trivial. Stabilizing ingredients such as terminal penalty term and terminal region are crucial in establishing the asymptotic stability. Approaches available in the literature provide limited degrees of freedom for the characterization of the terminal region for the discrete time Quasi Infinite Horizon NMPC (QIH-NMPC) formulation. Current work presents alternate approaches namely arbitrary controller based approach and LQR based approach, which provide large degrees of freedom for enlarging the terminal region. Both the approaches are scalable to system of any state and input dimension. Approach from the literature provides a scalar whereas proposed approaches provide a linear controller and two additive matrices as tuning parameters for shaping of the terminal region. Proposed approaches involve solving modified Lyapunov equations to compute terminal penalty term, followed by explicit characterization of the terminal region. Efficacy of the proposed approaches is demonstrated using benchmark two state system. Terminal region obtained using the arbitrary controller based approach and LQR based approach are approximately 10.4723 and 9.5055 times larger by area measure when compared to the largest terminal region obtained using the approach from the literature.

{IEEEkeywords}

Asymptotic stability, Control systems, Lyapunov methods, Optimal control, Predictive Control.

1 Introduction

\IEEEPARstart

Model Predictive Control (MPC) finds its applications in every field of engineering [1, 2]. Principle concept for ensuring nominal and robust stability involves inclusion of stabilizing constraints [3]. A significant process has taken place in the area of nominal and robust stability of linear MPC and Nonlinear MPC (NMPC) [4, 5, 6]. Terminal equality constraint is mathematically equivalent to equating the terminal state to the target set point or target steady state operating point [7]. Primary limitation of terminal equality constraint is that it is very conservative and often leads to infeasibility specifically for constrained formulations. Michalska and Mayne conceptualized dual mode MPC scheme wherein the novel idea of terminal region was introduced [8]. NMPC controller is expected to drive the plant trajectory into a region, termed as terminal region or terminal set, around the origin in the finite number of steps using the feasible inputs. Subsequently, local linear controller will take the plant trajectory to the origin. This idea was extended by Chen and Allgöwer with the concept of Quasi Infinite Horizon - Nonlinear Model Predictive Control (QIH-NMPC) formulation [9].

One seminar contribution of [9] was that it gave explicit approaches for the characterization of the terminal ingredients for the continuous time QIH-NMPC formulations. Several researchers have developed approaches for the characterization of the terminal constraints for the continuous time QIH-NMPC formulations [9, 10, 11, 12] and for the discrete time QIH-NMPC formulations [13, 14, 15]. It may be noted that discrete time formulations require separate considerations due to the concept of sampling time vastly affecting the terminal region shape and size [16, 17]. One of the major limitation of these approaches available in the literature is that all of them provide a single scalar tuning parameter giving only one degree of freedom for enlarging the terminal region. Recently, [18] presented one approach for terminal region characterization, which is discrete time equivalent of the continuous time approach by Chen and Allgöwer[9]. However, in addition to the scalar tuning parameter limitation, Yu et al.’s approach also was suitable when linearized discrete time system at the operating point is controllable. Due to such limitation, applicability of the approach was reduced and also resulted in conservative terminal regions.

Current work aims at extending the concepts presented by Rajhans et al. in [15] and providing larger degrees of freedom for the terminal region characterization of the discrete time QIH-NMPC formulations. For the proposed approaches in the current work, two tuning matrices are provided which further increase the degrees of freedom available to the designer. Proposed approaches provide three degrees of freedom namely a) linear stabilizing controller, b) additive state weighting matrix, and c) additive input weighting matrix.

Efficacy of the proposed arbitrary controller based approach and LQR based approach is demonstrated using the benchmark two state system. This system is used by various researchers for demonstrating their stability results [19, 9, 20, 18].

Second section presents the discrete time QIH-NMPC formulation in detail. In addition, approach by Yu et al. [18] is stated mathematically along with its limitation. Third section presents two alternate approaches for the computation of the terminal penalty and for the characterization of the terminal region. Forth section presents numerical characterization of the terminal region using the approaches presented in the third section. Fifth section presents the simulation case study results in detail. Sixth section presents the conclusions derived.

1.1 Notation

Two norm of a vector $\mathbf{y}$ is defined as $|\mathbf{y}|:=\sqrt{\mathbf{y}^{T}\mathbf{y}}$ . Weighted norm having weighting matrix ${\mathbf{A}}$ is defined as $\mathbf{y_{A}}:=\sqrt{\mathbf{y^{T}Ay}}$ . $norm(\mathbf{A})$ represents a two norm of matrix $\mathbf{A}$ . $\lambda_{min}(\mathbf{A})$ indicates the smallest eigenvalue of $\mathbf{A}$ . $\rho_{max}(\mathbf{A})$ indicates the maximum magnitude eigenvalue of $\mathbf{A}$ . $\mathbb{R}$ is a set of real numbers. $\mathbb{N}$ is a set of natural numbers.

Consider an autonomous discrete time system

\displaystyle\mathbf{x}(k+1)=\tilde{\mathbf{F}}({\mathbf{x}}(k))

(1)

with initial condition ${\bf{x}}(0)={\mathbf{x}}_{0}$ . Consider a set $\mathcal{X}\subset\mathbb{R}^{n_{x}}$ such that $\mathbf{0}\in\mathcal{X}$ .

2 Discrete Time QIH-NMPC Formulation

Consider a discrete time nonlinear system model given as

	$\displaystyle\mathbf{x}(k+1)$	$\displaystyle={\mathbf{F}}({\mathbf{x}}(k),{\mathbf{u}}(k))$		(2)
	$\displaystyle{\bf{x}}(0)$	$\displaystyle={\bf{x}}_{0}$		(3)

where $\mathbf{x}(k)\in\mathcal{X}\subset\mathbb{R}^{n_{x}}$ denotes the state vector and $\mathbf{u}(k)\in\mathcal{U}\subset\mathbb{R}^{n_{u}}$ denotes the input vector. Note that origin $(\mathbf{x}(k)=\mathbf{0},\mathbf{u}(k)=\mathbf{0})$ is the equilibrium point of the system (2) due to shift of the origin.

Following assumptions are made:

D1: System dynamics function $\mathbf{F}:\mathbb{R}^{n_{x}}\times\mathbb{R}^{n_{u}}\to\mathbb{R}^{n_{x}}$ is twice continuously differentiable.
D2: The origin $\mathbf{0}\in\mathbb{R}^{n_{x}}$ is an equilibrium point of the system (2) i.e. $\mathbf{F}\left(\mathbf{0},\mathbf{0}\right)=\mathbf{0}$ .
D3: The inputs $\mathbf{u}(k)$ are constrained inside a closed and convex set $\mathcal{U}\subset\mathbb{R}^{n_{u}}$ .
D4: The system (2) has a unique solution for any initial condition $\mathbf{x}_{0}\in\mathcal{X}$ and any input $\mathbf{u}(k)\in\mathcal{U}$ .
D5: The state $\mathbf{x}(k)$ is perfectly known at any sampling instant $k$ i.e. all the states are measured.
D6: External disturbances do not affect the system dynamics.

2.1 QIH-NMPC Formulation

For the discrete time system given by (2), QIH-NMPC formulation is stated as follows:

\begin{array}[]{c}\min\\ \overline{\mathbf{u}}_{(k,k+N-1)}\end{array}J\left(\mathbf{x}(k),\overline{\mathbf{u}}_{[k,k+N-1]}\right)

(4)

with

	$\displaystyle J\left(\mathbf{x}(k),\overline{\mathbf{u}}_{[k,k+N-1]}\right)$	$\displaystyle=$	$\displaystyle\sum_{i=k}^{k+N-1}\left\{\begin{array}[]{c}\mathbf{z}(i)^{T}\mathbf{W}_{x}\mathbf{z}(i)\\ +\overline{\mathbf{u}}(i)^{T}\mathbf{W}_{u}\overline{\mathbf{u}}(i)\end{array}\right\}$		(8)
			$\displaystyle+\mathbf{z}(k+N)^{T}\mathbf{P}\mathbf{z}(k+N)$		(8)

\overline{\mathbf{u}}_{[k,k+N-1]}=\left\{\bar{\mathbf{u}}(i)\in\mathcal{U}:i\in\mathbb{N},k\leq i\leq k+N-1\right\}

(9)

subject to

$\displaystyle\mathbf{z}(i+1)$	$\displaystyle=$	$\displaystyle\mathbf{F}\left(\mathbf{z}(i),\overline{\mathbf{u}}(i)\right)\text{ for }k\leq i\leq k+N-1$	(10)
$\displaystyle\mathbf{z}(k)$	$\displaystyle=$	$\displaystyle\mathbf{x}(k)$	(11)
$\displaystyle\mathbf{z}(i)$	$\displaystyle\in$	$\displaystyle\mathcal{X}\text{ for }k\leq i\leq k+N$	(12)
$\displaystyle\bar{\mathbf{u}}(i)$	$\displaystyle\in$	$\displaystyle\mathcal{U}\text{ for }k\leq i\leq k+N-1$	(13)
$\displaystyle\mathbf{z}\left(k+N\right)$	$\displaystyle\in$	$\displaystyle\Omega$	(14)

where $\mathbf{W}_{x}$ and $\mathbf{W}_{u}$ are state and input weighting matrices of dimension $\left(n_{x}\times n_{x}\right)$ , $\left(n_{u}\times n_{u}\right)$ respectively. $\mathbf{P}$ is the terminal penalty matrix of dimension $\left(n_{x}\times n_{x}\right)$ . $\mathbf{W}_{x},\mathbf{W}_{u},\mathbf{P}$ are symmetric positive definite matrices. $N$ is a finite prediction horizon length. $\mathbf{z}(i)$ denotes the predicted state in the QIH-NMPC formulation and $\overline{\mathbf{u}}(i)$ denotes the future input moves. The set $\Omega$ is termed as the terminal region in the neighborhood of the origin. The set $\mathcal{X}_{N}\subset\mathcal{X}\subset\mathbb{R}^{n_{x}}$ is termed as the region of attraction is the set of all feasible initial conditions i.e. it is a set of all initial conditions $\mathbf{x}_{0}$ such that terminal inequality constraint (14) is satisfied with inputs constrained given by equation (9) satisfied.

2.2 Design and Implementation of QIH-NMPC Formulation

The terminal region $\Omega$ is chosen as an invariant set for the nonlinear system (2) controlled by local linear controller with gain matrix $\mathbf{K}$ . The terminal penalty term is chosen such that for all trajectories starting from any point inside the terminal region $\Omega$ , with approximation that a single cost term having larger value that the sum of all the predicted stage cost terms from end of horizon to infinity and is given as follows:

\mathbf{z}(k+N)^{T}\mathbf{P}\mathbf{z}(k+N)\geq\sum_{i=k+N}^{\infty}\left\{\begin{array}[]{c}\mathbf{z}(i)^{T}\mathbf{W}_{x}\mathbf{z}(i)\\ +\overline{\mathbf{u}}(i)\mathbf{W}_{u}\overline{\mathbf{u}}(i)\end{array}\right\}

(15)

with $\overline{\mathbf{u}}(i)=-\mathbf{K}\mathbf{z}(i)\in\mathcal{U}$ for all $k\geq k+N$ and for all $\mathbf{z}(k+N)\in\Omega$ .

It is assumed that the solution to the optimal problem (4) with stage cost defined by (8) with input set given by (9) subject to the predicted state dynamics (10) with initial condition (11), predicted states constraint (12), future input moves constraint (13) and terminal state constraint (14) i.e. $\overline{\mathbf{u}}_{[k,k+N-1]}^{*}$ exists and can be computed numerically. Controller is implemented as a moving horizon framework. Accordingly, only the first control move $\mathbf{u}(k)=\overline{\mathbf{u}}^{*}(k)$ is implemented in the plant. Entire process is repeated at the next sampling instant $k+1$ . The term Quasi Infinite is because of the fact that the NMPC formulation depicts the stability properties of the infinite horizon formulation, however, the actual implementation contains finite horizon. Such implementation is achieved with the help of the equation (15). However, only ensuring the terminal penalty term satisfying the condition (15) is not sufficient to guarantee the nominal asymptotic stability of the NMPC controller, hence terminal constraint as given by (14) becomes inevitable. It may be noted that local linear controller with gain matrix $\mathbf{K}$ is not used for implementation of the NMPC controller and is only a mathematical construct to characterize the terminal region $\Omega$ .

2.3 Approach from the Literature

Before proceeding to application of the proposed arbitrary controller based approach and LQR based approach, a look at Yu et al’s approach from [18] is necessary. Consider, Jacobian linearization of the nonlinear system (2) in the neighborhood the origin as,

\mathbf{x}(k+1)=\mathbf{\Phi x}(k)+\mathbf{\Gamma u}(k)

(16)

where

\mathbf{\Phi}=\left[\frac{\partial\mathbf{F}}{\partial\mathbf{x}}\right]_{\left(\mathbf{0},\mathbf{0}\right)}\text{ and \ }\mathbf{\Gamma}=\left[\frac{\partial\mathbf{F}}{\partial\mathbf{u}}\right]_{\left(\mathbf{0},\mathbf{0}\right)}

One additional assumption is required at this stage.

D7: The linearized system (16) is stabilizable.

Yu et al. characterize the terminal region as,

\Omega\equiv\left\{\mathbf{x}(k)\in\mathbb{R}^{n_{x}}|\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)\leq\alpha,-\mathbf{Lx}(k)\in\mathcal{U}\right\}

(17)

where linear gain $\mathbf{L}$ and the terminal penalty matrix $\mathbf{P}$ are the steady state solutions of the modified Lyapunov equation given as follows:

\kappa^{2}\mathbf{\Phi}_{L}^{T}\mathbf{P}\mathbf{\Phi}_{L}-\mathbf{P}=-\mathbf{Q}^{*}

(18)

\mathbf{Q}^{*}=\mathbf{W}_{x}+\mathbf{L}^{T}\mathbf{W}_{u}\mathbf{L}

(19)

where $\mathbf{\Phi}_{L}=\mathbf{\Phi-\Gamma L}$ and parameter $\kappa$ is chosen such that $1<\kappa<1/\left[\rho_{max}\left(\mathbf{\Phi}_{L}\right)\right]$ with $\rho_{max}(\mathbf{\Phi}_{L})$ being the maximum amplitude eigenvalue of $\mathbf{\Phi}_{L}$ . Since the gain $\mathbf{L}$ is stabilizing, all the eigenvalues of $\mathbf{\Phi}_{L}$ lie inside the unit circle, hence $\rho_{max}(\mathbf{\Phi}_{L})<1$ . It can be noted that once stage cost weighting matrices $\mathbf{W}_{x},\mathbf{W}_{u}$ are chosen, there is barely any degree of freedom left to the designer for shaping of the terminal region. This results in very conservative terminal regions. In addition, for the systems where there is uncontrollable eigenvalue of $\mathbf{\Phi}$ is close to unit circle, value of $\kappa$ becomes nearly $1$ , resulting in a negligible margin for shaping of the terminal region. These limitations are overcome by using the arbitrary controller based approach wherein additive tuning matrices are introduced for providing large degrees of freedom for enlarging of the terminal region and is presented in the subsequent section.

3 Alternate Approaches for Terminal Region Characterization

Two approaches are proposed in this section. First is an arbitrary controller based approach as proposed by Rajhans et al. in [15], and second is an LQR based approach. In the arbitrary controller based approach, an arbitrary stabilizing linear is designed using any of the methods available in the literature such as pole placement [21], Linear Quadratic Gaussian (LQG) control [22] and so on. In the LQR based approach, an LQG controller is designed using the weighting matrices of the QIH-NMPC formulation.

Lemma 1

Suppose that assumptions D1 to D7 are satisfied and a stabilizing linear feedback control law is designed i.e. $\mathbf{\Phi}_{L}=(\mathbf{\Phi-\Gamma L})$ is stable indicating all the eigenvalues are inside the unit circle. Let $\Delta\mathbf{Q}$ is any positive definite matrix. Let matrix $\mathbf{P}$ denote the solution of the following modified Lyapunov equation:

\mathbf{\Phi}_{L}^{T}\mathbf{P}\mathbf{\Phi}_{L}-\mathbf{P}=-(\mathbf{Q}^{*}+\Delta\mathbf{Q)}

(20)

where $\mathbf{Q}^{*}$ is defined by equation (19). Then there exists a constant $\alpha>0$ which defines an ellipsoid of the form

\Omega\equiv\left\{\mathbf{x}(k)\in\mathbb{R}^{n_{x}}|\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)\leq\alpha,-\mathbf{Lx}(k)\in\mathcal{U}\right\}

(21)

such that $\Omega$ is an invariant set for the nonlinear system given by (2) with linear controller $\mathbf{u}(k)=-\mathbf{Lx}(k)$ . Additionally, for any $\mathbf{z}(k+N)\in\Omega$ the inequality given by (22) holds true.

\mathbf{z}(k+N)^{T}\mathbf{P}\mathbf{z}(k+N)\geq\sum_{i=k+N}^{\infty}\left\{\begin{array}[]{c}\mathbf{z}(i)^{T}\mathbf{W}_{x}\mathbf{z}(i)\\ +\overline{\mathbf{u}}(i)\mathbf{W}_{u}\overline{\mathbf{u}}(i)\end{array}\right\}

(22)

Proof 3.1.

Since $\mathbf{\Phi}_{L}=(\mathbf{\Phi-\Gamma L})$ is stable, hence, the eigenvalues of $\mathbf{\Phi}_{L}$ are inside the unit circle. Using the solvability condition of the modified Lyapunov equation, a unique $\mathbf{P}>0$ can be computed which solves the equation (20). According to Assumption D2, the origin $\mathbf{0}\in\mathbb{R}^{n_{u}}$ is in the interior of the input constraints set $\mathcal{U}$ . Accordingly, we can compute a constant $\gamma$ which defined a set $\Omega_{\gamma}$ such that

\Omega_{\gamma}\equiv\left\{\mathbf{x}(k)\in\mathbb{R}^{n_{x}}|\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)\leq\gamma,-\mathbf{Lx}(k)\in\mathcal{U}\right\}

(23)

Now, let $0<\alpha\leq\gamma$ specify a region of the form given by equation (24).

\Omega\equiv\left\{\mathbf{x}(k)\in\mathbb{R}^{n_{x}}|\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}\leq\alpha\right\}

(24)

As the input constraints are satisfied in $\Omega_{\gamma}$ and $\Omega\subseteq\Omega_{\gamma}$ (by virtue of $0<\alpha\leq\gamma$ ), the system dynamics can be equivalently viewed as an input unconstrained system in the set $\Omega$ . Consider a vector $\mathbf{\psi}_{L}(\mathbf{x})$ representing the nonlinearity in the system dynamics defined as

\mathbf{\Psi}_{L}(\mathbf{x}(k))=\mathbf{F}(\mathbf{x}(k),-\mathbf{Lx}(k))-\mathbf{\Phi}_{L}\mathbf{x}(k)

(25)

Note for a linear system $\mathbf{\Psi}_{L}(\mathbf{x}(k))=\mathbf{0}$ . Consider a Lyapunov candidate defined as

V(\mathbf{x}(k))=\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)

(26)

The difference of Lyapunov candidate $V(\mathbf{x}(k))$ can be expressed as follows:

	$\displaystyle\Delta V(\mathbf{x}(k))$	$\displaystyle=V(\mathbf{x}(k+1))-V(\mathbf{x}(k))$
		$\displaystyle=\mathbf{x}(k+1)^{T}\mathbf{P}\mathbf{x}(k+1)-\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)$		(27)

Substituting from (25) into (27),

$\displaystyle\Delta V(\mathbf{x}(k))$	$\displaystyle=-\mathbf{x}(k)^{T}\left(\mathbf{\Phi}_{L}^{T}\mathbf{P}\mathbf{\Phi}_{L}-\mathbf{P}\right)\mathbf{x}(k)$
	$\displaystyle+2\mathbf{\Psi}_{L}(\mathbf{x}(k))^{T}\mathbf{P}\mathbf{\Phi}_{L}\mathbf{x}(k)$
	$\displaystyle+\mathbf{\Psi}_{L}(\mathbf{x}(k))^{T}\mathbf{P}\mathbf{\Psi}_{L}(\mathbf{x}(k))$	(28)

Using equation (20) into (28),

$\displaystyle\Delta V(\mathbf{x}(k))$	$\displaystyle=-\mathbf{x}(k)^{T}\left(\mathbf{Q}^{*}+\Delta\mathbf{Q}\right)\mathbf{x}(k)$
	$\displaystyle+2\mathbf{\Psi}_{L}(\mathbf{x}(k))^{T}\mathbf{P}\mathbf{\Phi}_{L}\mathbf{x}(k)$
	$\displaystyle+\mathbf{\Psi}_{L}(\mathbf{x}(k))^{T}\mathbf{P}\mathbf{\Psi}_{L}(\mathbf{x}(k))$	(29)

Define $\chi(\mathbf{x}(k))$ as

	$\displaystyle\chi(\mathbf{x}(k)):=$	$\displaystyle\mathbf{x}(k)^{T}\left(\Delta\mathbf{Q}\right)\mathbf{x}(k)-2\mathbf{\Psi}_{L}(\mathbf{x}(k))^{T}\mathbf{P}\mathbf{\Phi}_{L}\mathbf{x}(k)$
		$\displaystyle-\mathbf{\Psi}_{L}(\mathbf{x}(k))^{T}\mathbf{P}\mathbf{\Psi}_{L}(\mathbf{x}(k))$		(30)

Using equation (30) into (29),

\displaystyle\Delta V(\mathbf{x}(k))

\displaystyle=-\mathbf{x}(k)^{T}\left(\mathbf{Q}^{*}\right)\mathbf{x}(k)-\chi(\mathbf{x}(k))

(31)

If $\Omega$ is chosen such that

\displaystyle\chi(\mathbf{x}(k))\geq 0

(32)

then

\displaystyle\Delta V(\mathbf{x}(k))\leq-\mathbf{x}(k)^{T}\left(\mathbf{Q}^{*}\right)\mathbf{x}(k)

(33)

Summing the inequality (33) over the interval, $[k+N,\infty),$ it follows that

V(\mathbf{x}(k+N))\geq\sum_{i=k+N}^{\infty}\mathbf{x}(i)^{T}\mathbf{Q}^{*}\mathbf{x}(i)

(34)

Using (19) and (26), inequality (34) becomes identical to (22) and holds true for any $\mathbf{x}(k+N))\in\Omega$ .

Lemma 3.2.

Suppose that assumptions D1 to D7 are satisfied. Let $\widetilde{\mathbf{W}}_{x}>\mathbf{W}_{x}$ and $\widetilde{\mathbf{W}}_{u}>\mathbf{W}_{u}$ be positive definite matrices. Let matrix $\mathbf{L}$ and $\mathbf{P}$ denote the steady state solution of the following modified Lyapunov equations:

	$\displaystyle\mathbf{\Phi}_{L}^{T}\mathbf{P}\mathbf{\Phi}_{L}-\mathbf{P}=-\left(\widetilde{\mathbf{W}}_{x}+\mathbf{L}^{T}\widetilde{\mathbf{W}}_{u}\mathbf{L}\right)$		(35)
	$\displaystyle\mathbf{L}=\left(\widetilde{\mathbf{W}}_{u}+\mathbf{\Gamma}^{T}\mathbf{P}\mathbf{\Gamma}\right)^{-1}\mathbf{\Gamma}^{T}\mathbf{P}\mathbf{\Phi}$		(36)

Then there exists a constant $\alpha>0$ which defines an ellipsoid of the form (21) such that $\Omega$ is an invariant set for the nonlinear system given by (2) with linear controller $\mathbf{u}(k)=-\mathbf{Lx}(k)$ . Additionally, for any $\mathbf{z}(k+N)\in\Omega$ the inequality given by (22) holds true.

Proof 3.3.

Define the following matrices,

\Delta\mathbf{Q}=\Delta\mathbf{W}_{x}+\mathbf{L}^{T}\Delta\mathbf{W}_{u}\mathbf{L}

(37)

with

	$\displaystyle\Delta\mathbf{W}_{x}:=\widetilde{\mathbf{W}}_{x}-\mathbf{W}_{x}$		(38)
	$\displaystyle\Delta\mathbf{W}_{u}:=\widetilde{\mathbf{W}}_{u}-\mathbf{W}_{u}$		(39)

Consider a Lyapunov candidate defined as

V(\mathbf{x}(k))=\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)

(40)

The difference of Lyapunov candidate $V(\mathbf{x}(k))$ can be expressed as follows:

	$\displaystyle\Delta V(\mathbf{x}(k))$	$\displaystyle=V(\mathbf{x}(k+1))-V(\mathbf{x}(k))$
		$\displaystyle=\mathbf{x}(k+1)^{T}\mathbf{P}\mathbf{x}(k+1)-\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)$		(41)

Substituting from (35), (37), (38), and (39) into (41) and upon simplification,

\displaystyle\Delta V(\mathbf{x}(k))\leq-\mathbf{x}(k)^{T}\left(\mathbf{Q}^{*}\right)\mathbf{x}(k)

(42)

Rest of the proof is similar to that of the proof of the lemma 1.

Lemma 3.4.

Let the assumptions D1-D7 hold true. For the nominal discrete time system, feasibility of QIH-NMPC formulation problem (4) at sampling instant $k=0$ implies its feasibility for all $k>0$ .

Proof 3.5.

Proof is identical to the proof of the lemma 2 from [15].

Theorem 3.6.

Let a) Assumptions D1-D7 hold true and b) the discrete time QIH-NMPC problem be feasible at $k=0$ . The nominal nonlinear system (2) controlled with QIH-NMPC controller is asymptotically stable at the origin.

Proof 3.7.

From equation (26) from the lemma (1), consider the Lyapunov candidate function

V(\mathbf{x}(k))=\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)

(43)

Consider the following three properties [23]:

•

$V(\mathbf{0})=(\mathbf{0}^{T})\mathbf{P}(\mathbf{0})=0$ .
•

Since $\mathbf{P}$ is a positive definite matrix, $V(\mathbf{x}(k))=\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)>0$ for all $\mathbf{x}(k)\neq\mathbf{0}$ .
•

Using (33) and $\mathbf{Q}^{*}>0$ implies

$\displaystyle\Delta V(\mathbf{x}(k))\leq-\mathbf{x}(k)^{T}\mathbf{Q}^{*}\mathbf{x}(k)<0$ (44)

Thus, the candidate function $V(\mathbf{x}(k))$ is a Lyapunov function for the nonlinear system for $\mathbf{x}\in\Omega$ under QIH-NMPC controller. Hence, the closed loop system is asymptotically stable at the origin.

4 CHARACTERIZATION OF THE TERMINAL REGION

Lemma 1 or lemma 3.2 gave the conditions for explicit characterization of the terminal region. It is possible to compute the terminal region numerically and subsequently implement the QIH-NMPC controller.

Steps for characterization of the terminal region using arbitrary controller based approach and LQR based approach are given below:

S1

Computation of Upper Bound Set:
Compute the largest value of $\gamma$ such that inputs constraints are satisfied in the set $\Omega_{\gamma}$ given by (45).

\Omega_{\gamma}\equiv\left\{\mathbf{x}(k)\in\mathbb{R}^{n_{x}}|\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)\leq\gamma,-\mathbf{Kx}(k)\in\mathcal{U}\right\}

(45)

This can be formulated as a Quadratic Programming (QP) problem if the constraints are defined by upper bound and lower bound on each of the input variables. Typically the set $\Omega_{\gamma}$ would be tangential to at least one of the input constraints.

S2

Computation of the Terminal Region using inequality based method:
Compute the largest $\alpha\in(0,\gamma]$ such that

\displaystyle\left[\begin{array}[]{c}\min\\ \mathbf{x}(k)\in\Omega\end{array}\chi(\mathbf{x}(k))\right]=0

(48)

The condition given by (48) ensures that $\mathbf{\Psi}(\mathbf{x})>0$ for all $\mathbf{x}\in\Omega$ , which is the necessary condition to further establish the nominal asymptotic stability.

The step S2 is implemented as follows:
Initially $\alpha=\gamma$ and condition (32) i.e. $(\chi(\mathbf{x}(k))\geq 0)$ is checked. If (32) is true, then $\alpha=\gamma$ . If (32) is false i.e. $\chi(\mathbf{x})(k)<0)$ for at least one $\mathbf{x}(k)\in\Omega$ , then the value of $\alpha$ is further reduced by a multiplicative factor $\beta<1$ and $\beta\approx 1$ . The process continues until condition (32) is satisfied.

Terminal region shape changes according to the computed $\mathbf{P}$ matrix and its size changes according to the value of $\alpha$ . Comparison of the terminal regions obtained using various approaches is carried out using measurement of the area for the system with a state dimension of 2. Area of the terminal region $\Omega$ defined by (21) is given by

\displaystyle A_{2}=\frac{\pi\alpha}{\sqrt{det(\mathbf{P})}}

(49)

5 Simulation Case Study

Effectiveness of the proposed approaches for the terminal region characterization and its applicability to QIH-NMPC discrete time simulations is demonstrated using the benchmark two state system which is used by several researchers [19, 9, 18].

5.1 Choice of Tuning Matrices

According to the design of the arbitrary controller based approach, the gain matrix $\mathbf{K}$ can be any arbitrary stabilizing linear controller. However, in order to simplify the computations, simulation results are presented with the following choice. Identical linear controller gain is also used in the case of LQR based approach. Controller gain $\mathbf{K}$ is the steady state solution of the simultaneous equations (50) and (51).

	$\displaystyle\mathbf{\Phi}_{L}^{T}\mathbf{P}\mathbf{\Phi}_{L}-\mathbf{P}=-\left(\mathbf{W}_{x}+\mathbf{L}^{T}\mathbf{W}_{u}\mathbf{L}\right)$		(50)
	$\displaystyle\mathbf{L}=\left(\mathbf{W}_{u}+\mathbf{\Gamma}^{T}\mathbf{P}\mathbf{\Gamma}\right)^{-1}\mathbf{\Gamma}^{T}\mathbf{P}\mathbf{\Phi}$		(51)

The tuning matrix $\Delta\mathbf{Q}$ for the arbitrary controller based approach can be any positive definite matrix. However, in order to simplify and structure the computations of the terminal region, following parameterization is carried out:

\Delta\mathbf{Q}=\widetilde{\mathbf{W}}_{x}+\mathbf{K}^{T}\widetilde{\mathbf{W}}_{u}\mathbf{K}

(52)

In order to further simplify the numerical computation of the terminal region using arbitrary controller based approach, additional parameterization is carried out and is given as follows:

\widetilde{\mathbf{W}}_{x}=\rho_{x}\mathbf{W}_{x}\text{ and }\widetilde{\mathbf{W}}_{u}=\rho_{u}\mathbf{W}_{u}

(53)

Note that it is sufficient to have $\widetilde{\mathbf{W}}_{x}>\mathbf{W}_{x}$ or $\widetilde{\mathbf{W}}_{u}>\mathbf{W}_{u}$ to satisfy $\Delta\mathbf{Q}>0$ , however, usually both $\widetilde{\mathbf{W}}_{x}>\mathbf{W}_{x}$ and $\widetilde{\mathbf{W}}_{u}>\mathbf{W}_{u}$ is preferred in practice. Using the matrices (53) into (52),

\Delta\mathbf{Q}=\rho_{x}\mathbf{W}_{x}+\rho_{u}\mathbf{K}^{T}\mathbf{W}_{u}\mathbf{K}

(54)

where $\rho_{x}$ and $\rho_{u}$ are the tuning scalars. For the arbitrary controller based approach, conditions $\rho_{x}>0$ , and $\rho_{u}>0$ and for the LQR based approach, conditions $\rho_{x}>1$ and $\rho_{u}>1$ are imposed.

Efficacy of having two tuning parameters is efficiently demonstrated using the case study in the next sub-section. In the case study, in the first iteration, parameter $\rho_{x}$ is increased from $0$ (for arbitrary controller based approach) or $1$ (for LQR based approach) to a large value, keeping $\rho_{u}$ constant at $0$ or $1$ respectively. A value of $\rho_{x}$ is chosen which leads to larger terminal region in the first iteration. Subsequently, in the second iteration, parameter $\rho_{u}$ is increased from $0$ (for arbitrary controller based approach) or $1$ (for LQR based approach) to larger value, keeping $\rho_{x}$ as constant to the values obtained in the first iteration.

5.2 Two State System

Discrete time system dynamics are given by (55)-(56). These equations are obtained by using the continuous time equations given in [19, 9] with a simple first order Euler integration to obtain the discrete time equations identical to the ones given in [18].

	$\displaystyle x_{1}(k+1)$	$\displaystyle=x_{1}(k)+T\left(x_{2}(k)+u(k)\left(\mu_{0}+(1-\mu_{0})x_{1}(k)\right)\right)$		(55)
	$\displaystyle x_{2}(k+1)$	$\displaystyle=x_{2}(k)+T\left(x_{1}(k)+u(k)\left(\mu_{0}-4(1-\mu_{0})x_{2}(k)\right)\right)$		(56)

where parameter $\mu_{0}=0.5$ , $\mathbf{x}(k)=\left[x_{1}(k)~{}~{}x_{2}(k)\right]^{T}$ is the state vector, and $u(k)$ is the input. The chosen equilibrium operating point is given as $\mathbf{x}(k)=\left[0~{}~{}0\right]^{T}$ and $\mathbf{u}(k)=\left[0\right]$ . It may be noted that sampling interval chosen for computations is $T=0.1$ s, which is identical to the one chosen in [18]. The eigenvalues of the linearized matrix at the origin are $(0.9048,1.1052)$ , indicating that the chosen operating point is unstable resulting in a challenging control problem.

The discrete time QIH-NMPC weighting matrices are chosen identical to the one given in [18] and are given as follows:

\displaystyle\mathbf{W}_{x}=\left[\begin{array}[]{ccc}1&0\\ 0&1\end{array}\right]\text{ and }\mathbf{W}_{u}=\left[\begin{array}[]{cc}0.5\end{array}\right]

(60)

The input constraint used in the QIH-NMPC formulation is identical to the one given in [9, 18] and is given as follows:

\mathcal{U}=\left\{\mathbf{u}(k)\in\mathbb{R}~{}|~{}-2\leq\mathbf{u}(k)\leq 2\right\}

(61)

Table 1 presents the terminal regions obtained using the arbitrary controller based approach when only one tuning parameter $\rho_{x}$ is varied for the two state system.

Table 1: Terminal Region for arbitrary controller based approach for Two State System for Varying

\rho_{x}

only

$\rho_{x}$	$\gamma$	$\alpha$	Area
0.1	9.8803	0.1537	0.0391
1	11.3196	0.1700	0.0300
5	17.7167	0.7247	0.0590
20	41.7057	4.1387	0.1174
50	89.6837	10.6782	0.1325
100	169.6470	22.2603	0.1428

From the table 1, it may be observed that the terminal region area has nearly saturated at $\rho_{x}=100$ and is used as the value for the second iteration. Table 2 presents the terminal regions obtained using the arbitrary controller based approach when the second tuning parameter $\rho_{u}$ is varied, keeping first tuning parameter $\rho_{x}$ as constant. It may be noted that in this particular system, value of $\alpha$ which defines the terminal region boundary are found identical to its upper bound $\gamma$ and it probably because of the mild nonlinearity of the system dynamics near the origin.

Table 2: Terminal Region for arbitrary controller based approach for Two State System for Varying

\rho_{u}

only

$\rho_{u}$	$\gamma$	$\alpha$	Volume
1	177.8	22.6	0.1416
5	210.3	28.2	0.1626
10	250.9	33.2	0.1750
100	981.8	106.5	0.2838
200	1793.9	131.2	0.2588

It can be noticed from the table 2 that the terminal region area has reached at $\rho_{u}=100$ .

Similar to the arbitrary controller based approach, LQR based approach is implemented. Table 3 presents the terminal regions obtained using the LQR based approach when only one tuning parameter $\rho_{x}$ is varied and using the inequality method for the two state system.

Table 3: Terminal Region for LQR based approach for Two State System for Varying

\rho_{x}

only

$\rho_{x}$	$\gamma$	$\alpha$	Area
5	7.9304	0.2574	0.0253
20	6.6979	1.2780	0.0467
50	6.6179	3.1971	0.0575
100	7.2567	4.7021	0.0479

From the table 3, terminal region with largest area is obtained at $\rho_{x}=50$ , which is the value used for the second iteration. Table 4 presents the terminal regions obtained using the LQR based approach when the second tuning parameter $\rho_{u}$ is varied, keeping first tuning parameter $\rho_{x}$ as constant.

Table 4: Terminal Region for LQR based approach for Two State System for Varying

\rho_{u}

only

$\rho_{u}$	$\gamma$	$\alpha$	Volume
1.1	7.832	5.159	0.0517
5	33.490	17.369	0.1269
10	72.006	27.836	0.1689
20	158.61	42.685	0.2093
50	450.21	71.597	0.2542
100	972.03	95.677	0.2576
200	2051.1	125.50	0.2501

It can be noticed from the table 4 that the terminal region area has reached maximum saturation around $\rho_{u}=100$ .

Table 5 presents a comparison of the terminal regions obtained using approach given in [18], Arbitrary Controller (AC) based approach, and LQR based approach for the two state system system. Table presents the tuning parameters, values of $\gamma$ (upper bound on $\alpha$ ) and $\alpha$ (which defines the terminal region boundary) and the area of each of the terminal regions.

Table 5: Comparison of Terminal Regions for Two State System using Various Approaches

Approach	Parameter	$\gamma$	$\alpha$	Area
Yu et al. [18]	$\kappa=1/0.91$	123.66	0.778	0.0271
AC	$\rho_{x}=100$ , $\rho_{u}=0$	169.65	22.26	0.1428
AC	$\rho_{x}=100$ , $\rho_{u}=100$	981.8	106.5	0.2838
LQR	$\rho_{x}=50$ , $\rho_{u}=1$	6.6179	3.1971	0.0575
LQR	$\rho_{x}=50$ , $\rho_{u}=100$	972.03	95.677	0.2576

From the table 5, it can be observed that the largest terminal regions obtained using the arbitrary controller based approach and LQR based approach are approximately 10.4723 and 9.5055 times larger by area respectively when compared to the largest terminal region obtained using the approach given by Yu et al. in [18]. It also illustrates the significant impact of the second degree of freedom in the form of tuning parameter $\rho_{u}$ (or tuning parameter matrix $\widetilde{\mathbf{W}}_{u}$ ) on the size of the terminal region.

The terminal region obtained using the arbitrary controller based approach (with $\rho_{x}=100$ and $\rho_{u}=100$ ) having largest area is $\Omega=\left\{\mathbf{x}^{T}\mathbf{P}\mathbf{x}\leq 106.5\right\}$ with an area of $0.2838$ . Corresponding values of linear gain matrix $\mathbf{K}$ and terminal penalty matrix $\mathbf{P}$ are given as follows:

\mathbf{K}=\left[\begin{array}[]{cc}2.2534&2.2534\end{array}\right]

(62)

\mathbf{P}=10^{3}\times\left[\begin{array}[]{cc}1.5249&0.9677\\ 0.9677&1.5249\end{array}\right]

(63)

The terminal region obtained using the LQR based approach (with $\rho_{x}=50$ and $\rho_{u}=100$ ) having largest area is $\Omega=\left\{\mathbf{x}^{T}\mathbf{P}\mathbf{x}\leq 95.677\right\}$ with an area of $0.2576$ . Corresponding values of linear gain matrix $\mathbf{K}$ and terminal penalty matrix $\mathbf{P}$ are given as follows:

\mathbf{K}=\left[\begin{array}[]{cc}2.2534&2.2534\end{array}\right]

(64)

\mathbf{P}=10^{3}\times\left[\begin{array}[]{cc}1.5098&0.9582\\ 0.9582&1.5098\end{array}\right]

(65)

Note that the gain matrix is identical in both the approaches by choice. Discrete time QIH-NMPC controller simulations are carried out using MATLAB software. Sampling interval used for computer simulations is $0.1$ seconds. Initial condition for the state is identical to the one given in [18] and is given as $\mathbf{x}_{0}=[-3~{}~{}2]^{T}$ .

Table 6 gives minimum prediction horizon length required to satisfy the terminal constraint. It can be observed that the minimum prediction horizon required in the case of arbitrary controller based approach and LQR based approach is significantly smaller as compared to the prediction horizon required in the case of terminal constraint obtained using Yu et al.’s approach. It is well established in the literature that the computation time required for NMPC iterations increases exponentially with the prediction horizon length [4, 9, 6]. Results obtained effectively validate the advantage of having larger terminal regions.

Table 6: Minimum Prediction Horizon Length Required for Various Initial Conditions in the Two State System

Approach	$N$
Yu et al.’s [18] approach	28
Arbitrary controller based approach	11
LQR based approach	12

Figure 1 depicts the plot of the boundaries of the terminal regions obtained using three approaches namely a) approach given by Yu et al. in [18], b) LQR based approach, and c) Arbitrary controller based approach for the two state system for an identical sampling interval of $0.1$ s. It may be noted that the terminal regions obtained using arbitrary controller based approach and LQR based approach are significantly larger than the approach by Yu et al. In addition there are several states which lie inside or near to the larger terminal regions obtained using the proposed approaches resulting in feasible initial conditions with a very small prediction horizon length, which will require a relatively larger prediction horizon lengths when the terminal region given in [18] is used as terminal inequality constraint.

Refer to caption — Figure 1: Two states system: Graphical Comparison of the Terminal Regions

5.3 Conclusions

Approaches presented in the literature for the terminal region characterization for the discrete time QIH-NMPC formulations provide limited degrees of freedom and often result in a conservative terminal region. Two approaches are presented in this work namely an arbitrary controller based approach and an LQR based approach. Proposed approaches provide two additive matrices as tuning parameters for enlargement of the terminal region. Both the approaches are scalable to system of any state and input dimension.

Efficacy of the approaches is demonstrated using benchmark two state system. It is observed that the terminal region obtained using the arbitrary controller based approach and LQR based approach are approximately 10.4723 and 9.5055 times larger by area measure when compared to the largest terminal region obtained using the approach given by Yu et al. [18]. Future research would involve choosing a completely arbitrary linear controller gain and parameterizing the approaches using different combinations of additive tuning matrices.

References

[1] S. J. Qin and T. A. Badgwell, “A survey of industrial model predictive control technology,” Control Engineering Practice, vol. 11, no. 7, pp. 733–764, Jul 2003.
[2] E. F. Camacho and C. C. Bordons, Model predictive control. Springer, 2007.
[3] J. Rawlings and K. Muske, “The stability of constrained receding horizon control,” IEEE Transactions on Automatic Control, vol. 38, no. 10, pp. 1512–1516, Oct 1993.
[4] D. Q. Mayne, J. B. Rawlings, C. V. Rao, and P. O. M. Scokaert, “Constrained model predictive control: Stability and optimality,” Automatica, vol. 36, no. 6, pp. 789–814, Jun 2000.
[5] D. Q. Mayne, “Model predictive control: Recent developments and future promise,” Automatica, vol. 50, no. 12, pp. 2967–2986, Dec 2014.
[6] J. B. Rawlings, D. Q. Mayne, and M. Diehl, Model predictive control: theory, computation, and design, 2nd ed. Nob Hill Publishing, LLC, 2017.
[7] S. S. Keerthi and E. G. Gilbert, “Optimal infinite-horizon feedback laws for a general class of constrained discrete-time systems: Stability and moving-horizon approximations,” Journal of Optimization Theory and Applications, vol. 57, no. 2, pp. 265–293, may 1988.
[8] H. Michalska and D. Mayne, “Robust receding horizon control of constrained nonlinear systems,” IEEE Transactions on Automatic Control, vol. 38, no. 11, pp. 1623–1633, Nov 1993.
[9] H. Chen and F. Allgöwer, “A quasi-infinite horizon nonlinear model predictive control scheme with guaranteed stability,” Automatica, vol. 34, no. 10, pp. 1205–1217, Oct 1998.
[10] W.-H. Chen, J. O’Reilly, and D. J. Ballance, “On the terminal region of model predictive control for non-linear systems with input/state constraints,” International Journal of Adaptive Control and Signal Processing, vol. 17, no. 3, pp. 195–207, apr 2003.
[11] S. Lucia, P. Rumschinski, A. J. Krener, and R. Findeisen, “Improved Design of Nonlinear Model Predictive Controllers,” in 5th IFAC Conference on Nonlinear Model Predictive Control NMPC 2015, vol. 48. Seville: IFAC-PapersOnLine, jan 2015, pp. 254–259.
[12] C. Rajhans, S. C. Patwardhan, and H. Pillai, “Two alternate approaches for characterization of the terminal region for continuous time quasi-infinite horizon NMPC,” in 2016 12th IEEE International Conference on Control and Automation (ICCA). IEEE, Jun 2016, pp. 98–103.
[13] D. Limón Marruedo, “Control predictivo de sistemas no lineales con restricciones: estabilidad y robustez,” Ph.D. dissertation, University of Seville, 2002.
[14] T. A. Johansen, “Approximate explicit receding horizon control of constrained nonlinear systems,” Automatica, vol. 40, no. 2, pp. 293–300, Feb 2004.
[15] C. Rajhans, S. C. Patwardhan, and H. Pillai, “Discrete Time Formulation of Quasi Infinite Horizon Nonlinear Model Predictive Control Scheme with Guaranteed Stability,” IFAC-PapersOnLine, vol. 50, no. 1, pp. 7181–7186, Jul 2017.
[16] K. J. Astrom and B. Wittenmark, Computer-controlled systems: theory and design. Prentice Hall, 1997.
[17] L. Grüne and J. Pannek, Nonlinear Model Predictive Control: Theory and Algorithms. Springer-Verlag London, 2011. [Online]. Available: https://www.springer.com/gp/book/9780857295002
[18] S. Yu, T. Qu, F. Xu, H. Chen, and Y. Hu, “Stability of finite horizon model predictive control with incremental input constraints,” Automatica, vol. 79, pp. 265–272, May 2017.
[19] D. Mayne and H. Michalska, “Receding horizon control of nonlinear systems,” IEEE Transactions on Automatic Control, vol. 35, no. 7, pp. 814–824, jul 1990.
[20] D. Limon, T. Alamo, and E. F. Camacho, “Enlarging the domain of attraction of MPC controllers,” Automatica, vol. 41, no. 4, pp. 629–635, apr 2005.
[21] T. Kailath and P. Hall, Linear Systems, ser. Information and System Sciences Series. Prentice-Hall, 1980. [Online]. Available: https://books.google.co.in/books?id=ggYqAQAAMAAJ
[22] D. E. Kirk, Optimal Control Theory An Introduction Englewood Cliffs New Jersey. New Jersey: Prentice-Hall Inc., 1970.
[23] H. K. Khalil, Nonlinear systems. Prentice Hall, 2002.