This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Application of Terminal Region Enlargement Approach for Discrete Time Quasi Infinite Horizon NMPC

Chinmay Rajhans    \IEEEmembershipMember, IEEE    and Sowmya Gupta    \IEEEmembershipMember, IEEE “This work is not supported by any funding agency.” C. Rajhans is with Department of Electrical Engineering, IIT Bombay, Mumbai, India (e-mail: [email protected]). S. Gupta is with Center for Research in Nano Technology and Science, IIT Bombay, Mumbai, India (e-mail: [email protected]).
Abstract

Ensuring nominal asymptotic stability of the Nonlinear Model Predictive Control (NMPC) controller is not trivial. Stabilizing ingredients such as terminal penalty term and terminal region are crucial in establishing the asymptotic stability. Approaches available in the literature provide limited degrees of freedom for the characterization of the terminal region for the discrete time Quasi Infinite Horizon NMPC (QIH-NMPC) formulation. Current work presents alternate approaches namely arbitrary controller based approach and LQR based approach, which provide large degrees of freedom for enlarging the terminal region. Both the approaches are scalable to system of any state and input dimension. Approach from the literature provides a scalar whereas proposed approaches provide a linear controller and two additive matrices as tuning parameters for shaping of the terminal region. Proposed approaches involve solving modified Lyapunov equations to compute terminal penalty term, followed by explicit characterization of the terminal region. Efficacy of the proposed approaches is demonstrated using benchmark two state system. Terminal region obtained using the arbitrary controller based approach and LQR based approach are approximately 10.4723 and 9.5055 times larger by area measure when compared to the largest terminal region obtained using the approach from the literature.

{IEEEkeywords}

Asymptotic stability, Control systems, Lyapunov methods, Optimal control, Predictive Control.

1 Introduction

\IEEEPARstart

Model Predictive Control (MPC) finds its applications in every field of engineering [1, 2]. Principle concept for ensuring nominal and robust stability involves inclusion of stabilizing constraints [3]. A significant process has taken place in the area of nominal and robust stability of linear MPC and Nonlinear MPC (NMPC) [4, 5, 6]. Terminal equality constraint is mathematically equivalent to equating the terminal state to the target set point or target steady state operating point [7]. Primary limitation of terminal equality constraint is that it is very conservative and often leads to infeasibility specifically for constrained formulations. Michalska and Mayne conceptualized dual mode MPC scheme wherein the novel idea of terminal region was introduced [8]. NMPC controller is expected to drive the plant trajectory into a region, termed as terminal region or terminal set, around the origin in the finite number of steps using the feasible inputs. Subsequently, local linear controller will take the plant trajectory to the origin. This idea was extended by Chen and Allgöwer with the concept of Quasi Infinite Horizon - Nonlinear Model Predictive Control (QIH-NMPC) formulation [9].

One seminar contribution of [9] was that it gave explicit approaches for the characterization of the terminal ingredients for the continuous time QIH-NMPC formulations. Several researchers have developed approaches for the characterization of the terminal constraints for the continuous time QIH-NMPC formulations [9, 10, 11, 12] and for the discrete time QIH-NMPC formulations [13, 14, 15]. It may be noted that discrete time formulations require separate considerations due to the concept of sampling time vastly affecting the terminal region shape and size [16, 17]. One of the major limitation of these approaches available in the literature is that all of them provide a single scalar tuning parameter giving only one degree of freedom for enlarging the terminal region. Recently, [18] presented one approach for terminal region characterization, which is discrete time equivalent of the continuous time approach by Chen and Allgöwer[9]. However, in addition to the scalar tuning parameter limitation, Yu et al.’s approach also was suitable when linearized discrete time system at the operating point is controllable. Due to such limitation, applicability of the approach was reduced and also resulted in conservative terminal regions.

Current work aims at extending the concepts presented by Rajhans et al. in [15] and providing larger degrees of freedom for the terminal region characterization of the discrete time QIH-NMPC formulations. For the proposed approaches in the current work, two tuning matrices are provided which further increase the degrees of freedom available to the designer. Proposed approaches provide three degrees of freedom namely a) linear stabilizing controller, b) additive state weighting matrix, and c) additive input weighting matrix.

Efficacy of the proposed arbitrary controller based approach and LQR based approach is demonstrated using the benchmark two state system. This system is used by various researchers for demonstrating their stability results [19, 9, 20, 18].

Second section presents the discrete time QIH-NMPC formulation in detail. In addition, approach by Yu et al. [18] is stated mathematically along with its limitation. Third section presents two alternate approaches for the computation of the terminal penalty and for the characterization of the terminal region. Forth section presents numerical characterization of the terminal region using the approaches presented in the third section. Fifth section presents the simulation case study results in detail. Sixth section presents the conclusions derived.

1.1 Notation

Two norm of a vector 𝐲\mathbf{y} is defined as |𝐲|:=𝐲T𝐲|\mathbf{y}|:=\sqrt{\mathbf{y}^{T}\mathbf{y}}. Weighted norm having weighting matrix 𝐀{\mathbf{A}} is defined as 𝐲𝐀:=𝐲𝐓𝐀𝐲\mathbf{y_{A}}:=\sqrt{\mathbf{y^{T}Ay}}. norm(𝐀)norm(\mathbf{A}) represents a two norm of matrix 𝐀\mathbf{A}. λmin(𝐀)\lambda_{min}(\mathbf{A}) indicates the smallest eigenvalue of 𝐀\mathbf{A}. ρmax(𝐀)\rho_{max}(\mathbf{A}) indicates the maximum magnitude eigenvalue of 𝐀\mathbf{A}. \mathbb{R} is a set of real numbers. \mathbb{N} is a set of natural numbers.

Consider an autonomous discrete time system

𝐱(k+1)=𝐅~(𝐱(k))\displaystyle\mathbf{x}(k+1)=\tilde{\mathbf{F}}({\mathbf{x}}(k)) (1)

with initial condition 𝐱(0)=𝐱0{\bf{x}}(0)={\mathbf{x}}_{0}. Consider a set 𝒳nx\mathcal{X}\subset\mathbb{R}^{n_{x}} such that 𝟎𝒳\mathbf{0}\in\mathcal{X}.

2 Discrete Time QIH-NMPC Formulation

Consider a discrete time nonlinear system model given as

𝐱(k+1)\displaystyle\mathbf{x}(k+1) =𝐅(𝐱(k),𝐮(k))\displaystyle={\mathbf{F}}({\mathbf{x}}(k),{\mathbf{u}}(k)) (2)
𝐱(0)\displaystyle{\bf{x}}(0) =𝐱0\displaystyle={\bf{x}}_{0} (3)

where 𝐱(k)𝒳nx\mathbf{x}(k)\in\mathcal{X}\subset\mathbb{R}^{n_{x}} denotes the state vector and 𝐮(k)𝒰nu\mathbf{u}(k)\in\mathcal{U}\subset\mathbb{R}^{n_{u}} denotes the input vector. Note that origin (𝐱(k)=𝟎,𝐮(k)=𝟎)(\mathbf{x}(k)=\mathbf{0},\mathbf{u}(k)=\mathbf{0}) is the equilibrium point of the system (2) due to shift of the origin.

Following assumptions are made:

D1

System dynamics function 𝐅:nx×nunx\mathbf{F}:\mathbb{R}^{n_{x}}\times\mathbb{R}^{n_{u}}\to\mathbb{R}^{n_{x}} is twice continuously differentiable.

D2

The origin 𝟎nx\mathbf{0}\in\mathbb{R}^{n_{x}} is an equilibrium point of the system (2) i.e. 𝐅(𝟎,𝟎)=𝟎\mathbf{F}\left(\mathbf{0},\mathbf{0}\right)=\mathbf{0}.

D3

The inputs 𝐮(k)\mathbf{u}(k) are constrained inside a closed and convex set 𝒰nu\mathcal{U}\subset\mathbb{R}^{n_{u}}.

D4

The system (2) has a unique solution for any initial condition 𝐱0𝒳\mathbf{x}_{0}\in\mathcal{X} and any input 𝐮(k)𝒰\mathbf{u}(k)\in\mathcal{U}.

D5

The state 𝐱(k)\mathbf{x}(k) is perfectly known at any sampling instant kk i.e. all the states are measured.

D6

External disturbances do not affect the system dynamics.

2.1 QIH-NMPC Formulation

For the discrete time system given by (2), QIH-NMPC formulation is stated as follows:

min𝐮¯(k,k+N1)J(𝐱(k),𝐮¯[k,k+N1])\begin{array}[]{c}\min\\ \overline{\mathbf{u}}_{(k,k+N-1)}\end{array}J\left(\mathbf{x}(k),\overline{\mathbf{u}}_{[k,k+N-1]}\right) (4)

with

J(𝐱(k),𝐮¯[k,k+N1])\displaystyle J\left(\mathbf{x}(k),\overline{\mathbf{u}}_{[k,k+N-1]}\right) =\displaystyle= i=kk+N1{𝐳(i)T𝐖x𝐳(i)+𝐮¯(i)T𝐖u𝐮¯(i)}\displaystyle\sum_{i=k}^{k+N-1}\left\{\begin{array}[]{c}\mathbf{z}(i)^{T}\mathbf{W}_{x}\mathbf{z}(i)\\ +\overline{\mathbf{u}}(i)^{T}\mathbf{W}_{u}\overline{\mathbf{u}}(i)\end{array}\right\} (8)
+𝐳(k+N)T𝐏𝐳(k+N)\displaystyle+\mathbf{z}(k+N)^{T}\mathbf{P}\mathbf{z}(k+N)
𝐮¯[k,k+N1]={𝐮¯(i)𝒰:i,kik+N1}\overline{\mathbf{u}}_{[k,k+N-1]}=\left\{\bar{\mathbf{u}}(i)\in\mathcal{U}:i\in\mathbb{N},k\leq i\leq k+N-1\right\} (9)

subject to

𝐳(i+1)\displaystyle\mathbf{z}(i+1) =\displaystyle= 𝐅(𝐳(i),𝐮¯(i)) for kik+N1\displaystyle\mathbf{F}\left(\mathbf{z}(i),\overline{\mathbf{u}}(i)\right)\text{ for }k\leq i\leq k+N-1 (10)
𝐳(k)\displaystyle\mathbf{z}(k) =\displaystyle= 𝐱(k)\displaystyle\mathbf{x}(k) (11)
𝐳(i)\displaystyle\mathbf{z}(i) \displaystyle\in 𝒳 for kik+N\displaystyle\mathcal{X}\text{ for }k\leq i\leq k+N (12)
𝐮¯(i)\displaystyle\bar{\mathbf{u}}(i) \displaystyle\in 𝒰 for kik+N1\displaystyle\mathcal{U}\text{ for }k\leq i\leq k+N-1 (13)
𝐳(k+N)\displaystyle\mathbf{z}\left(k+N\right) \displaystyle\in Ω\displaystyle\Omega (14)

where 𝐖x\mathbf{W}_{x} and 𝐖u\mathbf{W}_{u} are state and input weighting matrices of dimension (nx×nx)\left(n_{x}\times n_{x}\right), (nu×nu)\left(n_{u}\times n_{u}\right) respectively. 𝐏\mathbf{P} is the terminal penalty matrix of dimension (nx×nx)\left(n_{x}\times n_{x}\right). 𝐖x,𝐖u,𝐏\mathbf{W}_{x},\mathbf{W}_{u},\mathbf{P} are symmetric positive definite matrices. NN is a finite prediction horizon length. 𝐳(i)\mathbf{z}(i) denotes the predicted state in the QIH-NMPC formulation and 𝐮¯(i)\overline{\mathbf{u}}(i) denotes the future input moves. The set Ω\Omega is termed as the terminal region in the neighborhood of the origin. The set 𝒳N𝒳nx\mathcal{X}_{N}\subset\mathcal{X}\subset\mathbb{R}^{n_{x}} is termed as the region of attraction is the set of all feasible initial conditions i.e. it is a set of all initial conditions 𝐱0\mathbf{x}_{0} such that terminal inequality constraint (14) is satisfied with inputs constrained given by equation (9) satisfied.

2.2 Design and Implementation of QIH-NMPC Formulation

The terminal region Ω\Omega is chosen as an invariant set for the nonlinear system (2) controlled by local linear controller with gain matrix 𝐊\mathbf{K}. The terminal penalty term is chosen such that for all trajectories starting from any point inside the terminal region Ω\Omega, with approximation that a single cost term having larger value that the sum of all the predicted stage cost terms from end of horizon to infinity and is given as follows:

𝐳(k+N)T𝐏𝐳(k+N)i=k+N{𝐳(i)T𝐖x𝐳(i)+𝐮¯(i)𝐖u𝐮¯(i)}\mathbf{z}(k+N)^{T}\mathbf{P}\mathbf{z}(k+N)\geq\sum_{i=k+N}^{\infty}\left\{\begin{array}[]{c}\mathbf{z}(i)^{T}\mathbf{W}_{x}\mathbf{z}(i)\\ +\overline{\mathbf{u}}(i)\mathbf{W}_{u}\overline{\mathbf{u}}(i)\end{array}\right\} (15)

with 𝐮¯(i)=𝐊𝐳(i)𝒰\overline{\mathbf{u}}(i)=-\mathbf{K}\mathbf{z}(i)\in\mathcal{U} for all kk+Nk\geq k+N and for all 𝐳(k+N)Ω\mathbf{z}(k+N)\in\Omega.

It is assumed that the solution to the optimal problem (4) with stage cost defined by (8) with input set given by (9) subject to the predicted state dynamics (10) with initial condition (11), predicted states constraint (12), future input moves constraint (13) and terminal state constraint (14) i.e. 𝐮¯[k,k+N1]\overline{\mathbf{u}}_{[k,k+N-1]}^{*} exists and can be computed numerically. Controller is implemented as a moving horizon framework. Accordingly, only the first control move 𝐮(k)=𝐮¯(k)\mathbf{u}(k)=\overline{\mathbf{u}}^{*}(k) is implemented in the plant. Entire process is repeated at the next sampling instant k+1k+1. The term Quasi Infinite is because of the fact that the NMPC formulation depicts the stability properties of the infinite horizon formulation, however, the actual implementation contains finite horizon. Such implementation is achieved with the help of the equation (15). However, only ensuring the terminal penalty term satisfying the condition (15) is not sufficient to guarantee the nominal asymptotic stability of the NMPC controller, hence terminal constraint as given by (14) becomes inevitable. It may be noted that local linear controller with gain matrix 𝐊\mathbf{K} is not used for implementation of the NMPC controller and is only a mathematical construct to characterize the terminal region Ω\Omega.

2.3 Approach from the Literature

Before proceeding to application of the proposed arbitrary controller based approach and LQR based approach, a look at Yu et al’s approach from [18] is necessary. Consider, Jacobian linearization of the nonlinear system (2) in the neighborhood the origin as,

𝐱(k+1)=𝚽𝐱(k)+𝚪𝐮(k)\mathbf{x}(k+1)=\mathbf{\Phi x}(k)+\mathbf{\Gamma u}(k) (16)

where

𝚽=[𝐅𝐱](𝟎,𝟎) and 𝚪=[𝐅𝐮](𝟎,𝟎)\mathbf{\Phi}=\left[\frac{\partial\mathbf{F}}{\partial\mathbf{x}}\right]_{\left(\mathbf{0},\mathbf{0}\right)}\text{ and \ }\mathbf{\Gamma}=\left[\frac{\partial\mathbf{F}}{\partial\mathbf{u}}\right]_{\left(\mathbf{0},\mathbf{0}\right)}

One additional assumption is required at this stage.

D7

The linearized system (16) is stabilizable.

Yu et al. characterize the terminal region as,

Ω{𝐱(k)nx|𝐱(k)T𝐏𝐱(k)α,𝐋𝐱(k)𝒰}\Omega\equiv\left\{\mathbf{x}(k)\in\mathbb{R}^{n_{x}}|\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)\leq\alpha,-\mathbf{Lx}(k)\in\mathcal{U}\right\} (17)

where linear gain 𝐋\mathbf{L} and the terminal penalty matrix 𝐏\mathbf{P} are the steady state solutions of the modified Lyapunov equation given as follows:

κ2𝚽LT𝐏𝚽L𝐏=𝐐\kappa^{2}\mathbf{\Phi}_{L}^{T}\mathbf{P}\mathbf{\Phi}_{L}-\mathbf{P}=-\mathbf{Q}^{*} (18)
𝐐=𝐖x+𝐋T𝐖u𝐋\mathbf{Q}^{*}=\mathbf{W}_{x}+\mathbf{L}^{T}\mathbf{W}_{u}\mathbf{L} (19)

where 𝚽L=𝚽𝚪𝐋\mathbf{\Phi}_{L}=\mathbf{\Phi-\Gamma L} and parameter κ\kappa is chosen such that 1<κ<1/[ρmax(𝚽L)]1<\kappa<1/\left[\rho_{max}\left(\mathbf{\Phi}_{L}\right)\right] with ρmax(𝚽L)\rho_{max}(\mathbf{\Phi}_{L}) being the maximum amplitude eigenvalue of 𝚽L\mathbf{\Phi}_{L}. Since the gain 𝐋\mathbf{L} is stabilizing, all the eigenvalues of 𝚽L\mathbf{\Phi}_{L} lie inside the unit circle, hence ρmax(𝚽L)<1\rho_{max}(\mathbf{\Phi}_{L})<1. It can be noted that once stage cost weighting matrices 𝐖x,𝐖u\mathbf{W}_{x},\mathbf{W}_{u} are chosen, there is barely any degree of freedom left to the designer for shaping of the terminal region. This results in very conservative terminal regions. In addition, for the systems where there is uncontrollable eigenvalue of 𝚽\mathbf{\Phi} is close to unit circle, value of κ\kappa becomes nearly 11, resulting in a negligible margin for shaping of the terminal region. These limitations are overcome by using the arbitrary controller based approach wherein additive tuning matrices are introduced for providing large degrees of freedom for enlarging of the terminal region and is presented in the subsequent section.

3 Alternate Approaches for Terminal Region Characterization

Two approaches are proposed in this section. First is an arbitrary controller based approach as proposed by Rajhans et al. in [15], and second is an LQR based approach. In the arbitrary controller based approach, an arbitrary stabilizing linear is designed using any of the methods available in the literature such as pole placement [21], Linear Quadratic Gaussian (LQG) control [22] and so on. In the LQR based approach, an LQG controller is designed using the weighting matrices of the QIH-NMPC formulation.

Lemma 1

Suppose that assumptions D1 to D7 are satisfied and a stabilizing linear feedback control law is designed i.e. 𝚽L=(𝚽𝚪𝐋)\mathbf{\Phi}_{L}=(\mathbf{\Phi-\Gamma L}) is stable indicating all the eigenvalues are inside the unit circle. Let Δ𝐐\Delta\mathbf{Q} is any positive definite matrix. Let matrix 𝐏\mathbf{P} denote the solution of the following modified Lyapunov equation:

𝚽LT𝐏𝚽L𝐏=(𝐐+Δ𝐐)\mathbf{\Phi}_{L}^{T}\mathbf{P}\mathbf{\Phi}_{L}-\mathbf{P}=-(\mathbf{Q}^{*}+\Delta\mathbf{Q)} (20)

where 𝐐\mathbf{Q}^{*} is defined by equation (19). Then there exists a constant α>0\alpha>0 which defines an ellipsoid of the form

Ω{𝐱(k)nx|𝐱(k)T𝐏𝐱(k)α,𝐋𝐱(k)𝒰}\Omega\equiv\left\{\mathbf{x}(k)\in\mathbb{R}^{n_{x}}|\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)\leq\alpha,-\mathbf{Lx}(k)\in\mathcal{U}\right\} (21)

such that Ω\Omega is an invariant set for the nonlinear system given by (2) with linear controller 𝐮(k)=𝐋𝐱(k)\mathbf{u}(k)=-\mathbf{Lx}(k). Additionally, for any 𝐳(k+N)Ω\mathbf{z}(k+N)\in\Omega the inequality given by (22) holds true.

𝐳(k+N)T𝐏𝐳(k+N)i=k+N{𝐳(i)T𝐖x𝐳(i)+𝐮¯(i)𝐖u𝐮¯(i)}\mathbf{z}(k+N)^{T}\mathbf{P}\mathbf{z}(k+N)\geq\sum_{i=k+N}^{\infty}\left\{\begin{array}[]{c}\mathbf{z}(i)^{T}\mathbf{W}_{x}\mathbf{z}(i)\\ +\overline{\mathbf{u}}(i)\mathbf{W}_{u}\overline{\mathbf{u}}(i)\end{array}\right\} (22)
Proof 3.1.

Since 𝚽L=(𝚽𝚪𝐋)\mathbf{\Phi}_{L}=(\mathbf{\Phi-\Gamma L}) is stable, hence, the eigenvalues of 𝚽L\mathbf{\Phi}_{L} are inside the unit circle. Using the solvability condition of the modified Lyapunov equation, a unique 𝐏>0\mathbf{P}>0 can be computed which solves the equation (20). According to Assumption D2, the origin 𝟎nu\mathbf{0}\in\mathbb{R}^{n_{u}} is in the interior of the input constraints set 𝒰\mathcal{U}. Accordingly, we can compute a constant γ\gamma which defined a set Ωγ\Omega_{\gamma} such that

Ωγ{𝐱(k)nx|𝐱(k)T𝐏𝐱(k)γ,𝐋𝐱(k)𝒰}\Omega_{\gamma}\equiv\left\{\mathbf{x}(k)\in\mathbb{R}^{n_{x}}|\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)\leq\gamma,-\mathbf{Lx}(k)\in\mathcal{U}\right\} (23)

Now, let 0<αγ0<\alpha\leq\gamma specify a region of the form given by equation (24).

Ω{𝐱(k)nx|𝐱(k)T𝐏𝐱α}\Omega\equiv\left\{\mathbf{x}(k)\in\mathbb{R}^{n_{x}}|\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}\leq\alpha\right\} (24)

As the input constraints are satisfied in Ωγ\Omega_{\gamma} and ΩΩγ\Omega\subseteq\Omega_{\gamma} (by virtue of 0<αγ0<\alpha\leq\gamma), the system dynamics can be equivalently viewed as an input unconstrained system in the set Ω\Omega. Consider a vector ψL(𝐱)\mathbf{\psi}_{L}(\mathbf{x}) representing the nonlinearity in the system dynamics defined as

𝚿L(𝐱(k))=𝐅(𝐱(k),𝐋𝐱(k))𝚽L𝐱(k)\mathbf{\Psi}_{L}(\mathbf{x}(k))=\mathbf{F}(\mathbf{x}(k),-\mathbf{Lx}(k))-\mathbf{\Phi}_{L}\mathbf{x}(k) (25)

Note for a linear system 𝚿L(𝐱(k))=𝟎\mathbf{\Psi}_{L}(\mathbf{x}(k))=\mathbf{0}. Consider a Lyapunov candidate defined as

V(𝐱(k))=𝐱(k)T𝐏𝐱(k)V(\mathbf{x}(k))=\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k) (26)

The difference of Lyapunov candidate V(𝐱(k))V(\mathbf{x}(k)) can be expressed as follows:

ΔV(𝐱(k))\displaystyle\Delta V(\mathbf{x}(k)) =V(𝐱(k+1))V(𝐱(k))\displaystyle=V(\mathbf{x}(k+1))-V(\mathbf{x}(k))
=𝐱(k+1)T𝐏𝐱(k+1)𝐱(k)T𝐏𝐱(k)\displaystyle=\mathbf{x}(k+1)^{T}\mathbf{P}\mathbf{x}(k+1)-\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k) (27)

Substituting from (25) into (27),

ΔV(𝐱(k))\displaystyle\Delta V(\mathbf{x}(k)) =𝐱(k)T(𝚽LT𝐏𝚽L𝐏)𝐱(k)\displaystyle=-\mathbf{x}(k)^{T}\left(\mathbf{\Phi}_{L}^{T}\mathbf{P}\mathbf{\Phi}_{L}-\mathbf{P}\right)\mathbf{x}(k)
+2𝚿L(𝐱(k))T𝐏𝚽L𝐱(k)\displaystyle+2\mathbf{\Psi}_{L}(\mathbf{x}(k))^{T}\mathbf{P}\mathbf{\Phi}_{L}\mathbf{x}(k)
+𝚿L(𝐱(k))T𝐏𝚿L(𝐱(k))\displaystyle+\mathbf{\Psi}_{L}(\mathbf{x}(k))^{T}\mathbf{P}\mathbf{\Psi}_{L}(\mathbf{x}(k)) (28)

Using equation (20) into (28),

ΔV(𝐱(k))\displaystyle\Delta V(\mathbf{x}(k)) =𝐱(k)T(𝐐+Δ𝐐)𝐱(k)\displaystyle=-\mathbf{x}(k)^{T}\left(\mathbf{Q}^{*}+\Delta\mathbf{Q}\right)\mathbf{x}(k)
+2𝚿L(𝐱(k))T𝐏𝚽L𝐱(k)\displaystyle+2\mathbf{\Psi}_{L}(\mathbf{x}(k))^{T}\mathbf{P}\mathbf{\Phi}_{L}\mathbf{x}(k)
+𝚿L(𝐱(k))T𝐏𝚿L(𝐱(k))\displaystyle+\mathbf{\Psi}_{L}(\mathbf{x}(k))^{T}\mathbf{P}\mathbf{\Psi}_{L}(\mathbf{x}(k)) (29)

Define χ(𝐱(k))\chi(\mathbf{x}(k)) as

χ(𝐱(k)):=\displaystyle\chi(\mathbf{x}(k)):= 𝐱(k)T(Δ𝐐)𝐱(k)2𝚿L(𝐱(k))T𝐏𝚽L𝐱(k)\displaystyle\mathbf{x}(k)^{T}\left(\Delta\mathbf{Q}\right)\mathbf{x}(k)-2\mathbf{\Psi}_{L}(\mathbf{x}(k))^{T}\mathbf{P}\mathbf{\Phi}_{L}\mathbf{x}(k)
𝚿L(𝐱(k))T𝐏𝚿L(𝐱(k))\displaystyle-\mathbf{\Psi}_{L}(\mathbf{x}(k))^{T}\mathbf{P}\mathbf{\Psi}_{L}(\mathbf{x}(k)) (30)

Using equation (30) into (29),

ΔV(𝐱(k))\displaystyle\Delta V(\mathbf{x}(k)) =𝐱(k)T(𝐐)𝐱(k)χ(𝐱(k))\displaystyle=-\mathbf{x}(k)^{T}\left(\mathbf{Q}^{*}\right)\mathbf{x}(k)-\chi(\mathbf{x}(k)) (31)

If Ω\Omega is chosen such that

χ(𝐱(k))0\displaystyle\chi(\mathbf{x}(k))\geq 0 (32)

then

ΔV(𝐱(k))𝐱(k)T(𝐐)𝐱(k)\displaystyle\Delta V(\mathbf{x}(k))\leq-\mathbf{x}(k)^{T}\left(\mathbf{Q}^{*}\right)\mathbf{x}(k) (33)

Summing the inequality (33) over the interval, [k+N,),[k+N,\infty), it follows that

V(𝐱(k+N))i=k+N𝐱(i)T𝐐𝐱(i)V(\mathbf{x}(k+N))\geq\sum_{i=k+N}^{\infty}\mathbf{x}(i)^{T}\mathbf{Q}^{*}\mathbf{x}(i) (34)

Using (19) and (26), inequality (34) becomes identical to (22) and holds true for any 𝐱(k+N))Ω\mathbf{x}(k+N))\in\Omega.

Lemma 3.2.

Suppose that assumptions D1 to D7 are satisfied. Let 𝐖~x>𝐖x\widetilde{\mathbf{W}}_{x}>\mathbf{W}_{x} and 𝐖~u>𝐖u\widetilde{\mathbf{W}}_{u}>\mathbf{W}_{u} be positive definite matrices. Let matrix 𝐋\mathbf{L} and 𝐏\mathbf{P} denote the steady state solution of the following modified Lyapunov equations:

𝚽LT𝐏𝚽L𝐏=(𝐖~x+𝐋T𝐖~u𝐋)\displaystyle\mathbf{\Phi}_{L}^{T}\mathbf{P}\mathbf{\Phi}_{L}-\mathbf{P}=-\left(\widetilde{\mathbf{W}}_{x}+\mathbf{L}^{T}\widetilde{\mathbf{W}}_{u}\mathbf{L}\right) (35)
𝐋=(𝐖~u+𝚪T𝐏𝚪)1𝚪T𝐏𝚽\displaystyle\mathbf{L}=\left(\widetilde{\mathbf{W}}_{u}+\mathbf{\Gamma}^{T}\mathbf{P}\mathbf{\Gamma}\right)^{-1}\mathbf{\Gamma}^{T}\mathbf{P}\mathbf{\Phi} (36)

Then there exists a constant α>0\alpha>0 which defines an ellipsoid of the form (21) such that Ω\Omega is an invariant set for the nonlinear system given by (2) with linear controller 𝐮(k)=𝐋𝐱(k)\mathbf{u}(k)=-\mathbf{Lx}(k). Additionally, for any 𝐳(k+N)Ω\mathbf{z}(k+N)\in\Omega the inequality given by (22) holds true.

Proof 3.3.

Define the following matrices,

Δ𝐐=Δ𝐖x+𝐋TΔ𝐖u𝐋\Delta\mathbf{Q}=\Delta\mathbf{W}_{x}+\mathbf{L}^{T}\Delta\mathbf{W}_{u}\mathbf{L} (37)

with

Δ𝐖x:=𝐖~x𝐖x\displaystyle\Delta\mathbf{W}_{x}:=\widetilde{\mathbf{W}}_{x}-\mathbf{W}_{x} (38)
Δ𝐖u:=𝐖~u𝐖u\displaystyle\Delta\mathbf{W}_{u}:=\widetilde{\mathbf{W}}_{u}-\mathbf{W}_{u} (39)

Consider a Lyapunov candidate defined as

V(𝐱(k))=𝐱(k)T𝐏𝐱(k)V(\mathbf{x}(k))=\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k) (40)

The difference of Lyapunov candidate V(𝐱(k))V(\mathbf{x}(k)) can be expressed as follows:

ΔV(𝐱(k))\displaystyle\Delta V(\mathbf{x}(k)) =V(𝐱(k+1))V(𝐱(k))\displaystyle=V(\mathbf{x}(k+1))-V(\mathbf{x}(k))
=𝐱(k+1)T𝐏𝐱(k+1)𝐱(k)T𝐏𝐱(k)\displaystyle=\mathbf{x}(k+1)^{T}\mathbf{P}\mathbf{x}(k+1)-\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k) (41)

Substituting from (35), (37), (38), and (39) into (41) and upon simplification,

ΔV(𝐱(k))𝐱(k)T(𝐐)𝐱(k)\displaystyle\Delta V(\mathbf{x}(k))\leq-\mathbf{x}(k)^{T}\left(\mathbf{Q}^{*}\right)\mathbf{x}(k) (42)

Rest of the proof is similar to that of the proof of the lemma 1.

Lemma 3.4.

Let the assumptions D1-D7 hold true. For the nominal discrete time system, feasibility of QIH-NMPC formulation problem (4) at sampling instant k=0k=0 implies its feasibility for all k>0k>0.

Proof 3.5.

Proof is identical to the proof of the lemma 2 from [15].

Theorem 3.6.

Let a) Assumptions D1-D7 hold true and b) the discrete time QIH-NMPC problem be feasible at k=0k=0. The nominal nonlinear system (2) controlled with QIH-NMPC controller is asymptotically stable at the origin.

Proof 3.7.

From equation (26) from the lemma (1), consider the Lyapunov candidate function

V(𝐱(k))=𝐱(k)T𝐏𝐱(k)V(\mathbf{x}(k))=\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k) (43)

Consider the following three properties [23]:

  • V(𝟎)=(𝟎T)𝐏(𝟎)=0V(\mathbf{0})=(\mathbf{0}^{T})\mathbf{P}(\mathbf{0})=0.

  • Since 𝐏\mathbf{P} is a positive definite matrix, V(𝐱(k))=𝐱(k)T𝐏𝐱(k)>0V(\mathbf{x}(k))=\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)>0 for all 𝐱(k)𝟎\mathbf{x}(k)\neq\mathbf{0}.

  • Using (33) and 𝐐>0\mathbf{Q}^{*}>0 implies

    ΔV(𝐱(k))𝐱(k)T𝐐𝐱(k)<0\displaystyle\Delta V(\mathbf{x}(k))\leq-\mathbf{x}(k)^{T}\mathbf{Q}^{*}\mathbf{x}(k)<0 (44)

Thus, the candidate function V(𝐱(k))V(\mathbf{x}(k)) is a Lyapunov function for the nonlinear system for 𝐱Ω\mathbf{x}\in\Omega under QIH-NMPC controller. Hence, the closed loop system is asymptotically stable at the origin.

4 CHARACTERIZATION OF THE TERMINAL REGION

Lemma 1 or lemma 3.2 gave the conditions for explicit characterization of the terminal region. It is possible to compute the terminal region numerically and subsequently implement the QIH-NMPC controller.

Steps for characterization of the terminal region using arbitrary controller based approach and LQR based approach are given below:

S1

Computation of Upper Bound Set:
Compute the largest value of γ\gamma such that inputs constraints are satisfied in the set Ωγ\Omega_{\gamma} given by (45).

Ωγ{𝐱(k)nx|𝐱(k)T𝐏𝐱(k)γ,𝐊𝐱(k)𝒰}\Omega_{\gamma}\equiv\left\{\mathbf{x}(k)\in\mathbb{R}^{n_{x}}|\mathbf{x}(k)^{T}\mathbf{P}\mathbf{x}(k)\leq\gamma,-\mathbf{Kx}(k)\in\mathcal{U}\right\} (45)

This can be formulated as a Quadratic Programming (QP) problem if the constraints are defined by upper bound and lower bound on each of the input variables. Typically the set Ωγ\Omega_{\gamma} would be tangential to at least one of the input constraints.

S2

Computation of the Terminal Region using inequality based method:
Compute the largest α(0,γ]\alpha\in(0,\gamma] such that

[min𝐱(k)Ωχ(𝐱(k))]=0\displaystyle\left[\begin{array}[]{c}\min\\ \mathbf{x}(k)\in\Omega\end{array}\chi(\mathbf{x}(k))\right]=0 (48)

The condition given by (48) ensures that 𝚿(𝐱)>0\mathbf{\Psi}(\mathbf{x})>0 for all 𝐱Ω\mathbf{x}\in\Omega, which is the necessary condition to further establish the nominal asymptotic stability.

The step S2 is implemented as follows:
Initially α=γ\alpha=\gamma and condition (32) i.e. (χ(𝐱(k))0)(\chi(\mathbf{x}(k))\geq 0) is checked. If (32) is true, then α=γ\alpha=\gamma. If (32) is false i.e. χ(𝐱)(k)<0)\chi(\mathbf{x})(k)<0) for at least one 𝐱(k)Ω\mathbf{x}(k)\in\Omega, then the value of α\alpha is further reduced by a multiplicative factor β<1\beta<1 and β1\beta\approx 1. The process continues until condition (32) is satisfied.

Terminal region shape changes according to the computed 𝐏\mathbf{P} matrix and its size changes according to the value of α\alpha. Comparison of the terminal regions obtained using various approaches is carried out using measurement of the area for the system with a state dimension of 2. Area of the terminal region Ω\Omega defined by (21) is given by

A2=παdet(𝐏)\displaystyle A_{2}=\frac{\pi\alpha}{\sqrt{det(\mathbf{P})}} (49)

5 Simulation Case Study

Effectiveness of the proposed approaches for the terminal region characterization and its applicability to QIH-NMPC discrete time simulations is demonstrated using the benchmark two state system which is used by several researchers [19, 9, 18].

5.1 Choice of Tuning Matrices

According to the design of the arbitrary controller based approach, the gain matrix 𝐊\mathbf{K} can be any arbitrary stabilizing linear controller. However, in order to simplify the computations, simulation results are presented with the following choice. Identical linear controller gain is also used in the case of LQR based approach. Controller gain 𝐊\mathbf{K} is the steady state solution of the simultaneous equations (50) and (51).

𝚽LT𝐏𝚽L𝐏=(𝐖x+𝐋T𝐖u𝐋)\displaystyle\mathbf{\Phi}_{L}^{T}\mathbf{P}\mathbf{\Phi}_{L}-\mathbf{P}=-\left(\mathbf{W}_{x}+\mathbf{L}^{T}\mathbf{W}_{u}\mathbf{L}\right) (50)
𝐋=(𝐖u+𝚪T𝐏𝚪)1𝚪T𝐏𝚽\displaystyle\mathbf{L}=\left(\mathbf{W}_{u}+\mathbf{\Gamma}^{T}\mathbf{P}\mathbf{\Gamma}\right)^{-1}\mathbf{\Gamma}^{T}\mathbf{P}\mathbf{\Phi} (51)

The tuning matrix Δ𝐐\Delta\mathbf{Q} for the arbitrary controller based approach can be any positive definite matrix. However, in order to simplify and structure the computations of the terminal region, following parameterization is carried out:

Δ𝐐=𝐖~x+𝐊T𝐖~u𝐊\Delta\mathbf{Q}=\widetilde{\mathbf{W}}_{x}+\mathbf{K}^{T}\widetilde{\mathbf{W}}_{u}\mathbf{K} (52)

In order to further simplify the numerical computation of the terminal region using arbitrary controller based approach, additional parameterization is carried out and is given as follows:

𝐖~x=ρx𝐖x and 𝐖~u=ρu𝐖u\widetilde{\mathbf{W}}_{x}=\rho_{x}\mathbf{W}_{x}\text{ and }\widetilde{\mathbf{W}}_{u}=\rho_{u}\mathbf{W}_{u} (53)

Note that it is sufficient to have 𝐖~x>𝐖x\widetilde{\mathbf{W}}_{x}>\mathbf{W}_{x} or 𝐖~u>𝐖u\widetilde{\mathbf{W}}_{u}>\mathbf{W}_{u} to satisfy Δ𝐐>0\Delta\mathbf{Q}>0, however, usually both 𝐖~x>𝐖x\widetilde{\mathbf{W}}_{x}>\mathbf{W}_{x} and 𝐖~u>𝐖u\widetilde{\mathbf{W}}_{u}>\mathbf{W}_{u} is preferred in practice. Using the matrices (53) into (52),

Δ𝐐=ρx𝐖x+ρu𝐊T𝐖u𝐊\Delta\mathbf{Q}=\rho_{x}\mathbf{W}_{x}+\rho_{u}\mathbf{K}^{T}\mathbf{W}_{u}\mathbf{K} (54)

where ρx\rho_{x} and ρu\rho_{u} are the tuning scalars. For the arbitrary controller based approach, conditions ρx>0\rho_{x}>0, and ρu>0\rho_{u}>0 and for the LQR based approach, conditions ρx>1\rho_{x}>1 and ρu>1\rho_{u}>1 are imposed.

Efficacy of having two tuning parameters is efficiently demonstrated using the case study in the next sub-section. In the case study, in the first iteration, parameter ρx\rho_{x} is increased from 0 (for arbitrary controller based approach) or 11 (for LQR based approach) to a large value, keeping ρu\rho_{u} constant at 0 or 11 respectively. A value of ρx\rho_{x} is chosen which leads to larger terminal region in the first iteration. Subsequently, in the second iteration, parameter ρu\rho_{u} is increased from 0 (for arbitrary controller based approach) or 11 (for LQR based approach) to larger value, keeping ρx\rho_{x} as constant to the values obtained in the first iteration.

5.2 Two State System

Discrete time system dynamics are given by (55)-(56). These equations are obtained by using the continuous time equations given in [19, 9] with a simple first order Euler integration to obtain the discrete time equations identical to the ones given in [18].

x1(k+1)\displaystyle x_{1}(k+1) =x1(k)+T(x2(k)+u(k)(μ0+(1μ0)x1(k)))\displaystyle=x_{1}(k)+T\left(x_{2}(k)+u(k)\left(\mu_{0}+(1-\mu_{0})x_{1}(k)\right)\right) (55)
x2(k+1)\displaystyle x_{2}(k+1) =x2(k)+T(x1(k)+u(k)(μ04(1μ0)x2(k)))\displaystyle=x_{2}(k)+T\left(x_{1}(k)+u(k)\left(\mu_{0}-4(1-\mu_{0})x_{2}(k)\right)\right) (56)

where parameter μ0=0.5\mu_{0}=0.5, 𝐱(k)=[x1(k)x2(k)]T\mathbf{x}(k)=\left[x_{1}(k)~{}~{}x_{2}(k)\right]^{T} is the state vector, and u(k)u(k) is the input. The chosen equilibrium operating point is given as 𝐱(k)=[00]T\mathbf{x}(k)=\left[0~{}~{}0\right]^{T} and 𝐮(k)=[0]\mathbf{u}(k)=\left[0\right]. It may be noted that sampling interval chosen for computations is T=0.1T=0.1s, which is identical to the one chosen in [18]. The eigenvalues of the linearized matrix at the origin are (0.9048,1.1052)(0.9048,1.1052), indicating that the chosen operating point is unstable resulting in a challenging control problem.

The discrete time QIH-NMPC weighting matrices are chosen identical to the one given in [18] and are given as follows:

𝐖x=[1001] and 𝐖u=[0.5]\displaystyle\mathbf{W}_{x}=\left[\begin{array}[]{ccc}1&0\\ 0&1\end{array}\right]\text{ and }\mathbf{W}_{u}=\left[\begin{array}[]{cc}0.5\end{array}\right] (60)

The input constraint used in the QIH-NMPC formulation is identical to the one given in [9, 18] and is given as follows:

𝒰={𝐮(k)|2𝐮(k)2}\mathcal{U}=\left\{\mathbf{u}(k)\in\mathbb{R}~{}|~{}-2\leq\mathbf{u}(k)\leq 2\right\} (61)

Table 1 presents the terminal regions obtained using the arbitrary controller based approach when only one tuning parameter ρx\rho_{x} is varied for the two state system.

Table 1: Terminal Region for arbitrary controller based approach for Two State System for Varying ρx\rho_{x} only
ρx\rho_{x} γ\gamma α\alpha Area
0.1 9.8803 0.1537 0.0391
1 11.3196 0.1700 0.0300
5 17.7167 0.7247 0.0590
20 41.7057 4.1387 0.1174
50 89.6837 10.6782 0.1325
100 169.6470 22.2603 0.1428

From the table 1, it may be observed that the terminal region area has nearly saturated at ρx=100\rho_{x}=100 and is used as the value for the second iteration. Table 2 presents the terminal regions obtained using the arbitrary controller based approach when the second tuning parameter ρu\rho_{u} is varied, keeping first tuning parameter ρx\rho_{x} as constant. It may be noted that in this particular system, value of α\alpha which defines the terminal region boundary are found identical to its upper bound γ\gamma and it probably because of the mild nonlinearity of the system dynamics near the origin.

Table 2: Terminal Region for arbitrary controller based approach for Two State System for Varying ρu\rho_{u} only
ρu\rho_{u} γ\gamma α\alpha Volume
1 177.8 22.6 0.1416
5 210.3 28.2 0.1626
10 250.9 33.2 0.1750
100 981.8 106.5 0.2838
200 1793.9 131.2 0.2588

It can be noticed from the table 2 that the terminal region area has reached at ρu=100\rho_{u}=100.

Similar to the arbitrary controller based approach, LQR based approach is implemented. Table 3 presents the terminal regions obtained using the LQR based approach when only one tuning parameter ρx\rho_{x} is varied and using the inequality method for the two state system.

Table 3: Terminal Region for LQR based approach for Two State System for Varying ρx\rho_{x} only
ρx\rho_{x} γ\gamma α\alpha Area
5 7.9304 0.2574 0.0253
20 6.6979 1.2780 0.0467
50 6.6179 3.1971 0.0575
100 7.2567 4.7021 0.0479

From the table 3, terminal region with largest area is obtained at ρx=50\rho_{x}=50, which is the value used for the second iteration. Table 4 presents the terminal regions obtained using the LQR based approach when the second tuning parameter ρu\rho_{u} is varied, keeping first tuning parameter ρx\rho_{x} as constant.

Table 4: Terminal Region for LQR based approach for Two State System for Varying ρu\rho_{u} only
ρu\rho_{u} γ\gamma α\alpha Volume
1.1 7.832 5.159 0.0517
5 33.490 17.369 0.1269
10 72.006 27.836 0.1689
20 158.61 42.685 0.2093
50 450.21 71.597 0.2542
100 972.03 95.677 0.2576
200 2051.1 125.50 0.2501

It can be noticed from the table 4 that the terminal region area has reached maximum saturation around ρu=100\rho_{u}=100.

Table 5 presents a comparison of the terminal regions obtained using approach given in [18], Arbitrary Controller (AC) based approach, and LQR based approach for the two state system system. Table presents the tuning parameters, values of γ\gamma (upper bound on α\alpha) and α\alpha (which defines the terminal region boundary) and the area of each of the terminal regions.

Table 5: Comparison of Terminal Regions for Two State System using Various Approaches
Approach Parameter γ\gamma α\alpha Area
Yu et al. [18] κ=1/0.91\kappa=1/0.91 123.66 0.778 0.0271
AC ρx=100\rho_{x}=100, ρu=0\rho_{u}=0 169.65 22.26 0.1428
AC ρx=100\rho_{x}=100, ρu=100\rho_{u}=100 981.8 106.5 0.2838
LQR ρx=50\rho_{x}=50, ρu=1\rho_{u}=1 6.6179 3.1971 0.0575
LQR ρx=50\rho_{x}=50, ρu=100\rho_{u}=100 972.03 95.677 0.2576

From the table 5, it can be observed that the largest terminal regions obtained using the arbitrary controller based approach and LQR based approach are approximately 10.4723 and 9.5055 times larger by area respectively when compared to the largest terminal region obtained using the approach given by Yu et al. in [18]. It also illustrates the significant impact of the second degree of freedom in the form of tuning parameter ρu\rho_{u} (or tuning parameter matrix 𝐖~u\widetilde{\mathbf{W}}_{u}) on the size of the terminal region.

The terminal region obtained using the arbitrary controller based approach (with ρx=100\rho_{x}=100 and ρu=100\rho_{u}=100) having largest area is Ω={𝐱T𝐏𝐱106.5}\Omega=\left\{\mathbf{x}^{T}\mathbf{P}\mathbf{x}\leq 106.5\right\} with an area of 0.28380.2838. Corresponding values of linear gain matrix 𝐊\mathbf{K} and terminal penalty matrix 𝐏\mathbf{P} are given as follows:

𝐊=[2.25342.2534]\mathbf{K}=\left[\begin{array}[]{cc}2.2534&2.2534\end{array}\right] (62)
𝐏=103×[1.52490.96770.96771.5249]\mathbf{P}=10^{3}\times\left[\begin{array}[]{cc}1.5249&0.9677\\ 0.9677&1.5249\end{array}\right] (63)

The terminal region obtained using the LQR based approach (with ρx=50\rho_{x}=50 and ρu=100\rho_{u}=100) having largest area is Ω={𝐱T𝐏𝐱95.677}\Omega=\left\{\mathbf{x}^{T}\mathbf{P}\mathbf{x}\leq 95.677\right\} with an area of 0.25760.2576. Corresponding values of linear gain matrix 𝐊\mathbf{K} and terminal penalty matrix 𝐏\mathbf{P} are given as follows:

𝐊=[2.25342.2534]\mathbf{K}=\left[\begin{array}[]{cc}2.2534&2.2534\end{array}\right] (64)
𝐏=103×[1.50980.95820.95821.5098]\mathbf{P}=10^{3}\times\left[\begin{array}[]{cc}1.5098&0.9582\\ 0.9582&1.5098\end{array}\right] (65)

Note that the gain matrix is identical in both the approaches by choice. Discrete time QIH-NMPC controller simulations are carried out using MATLAB software. Sampling interval used for computer simulations is 0.10.1 seconds. Initial condition for the state is identical to the one given in [18] and is given as 𝐱0=[32]T\mathbf{x}_{0}=[-3~{}~{}2]^{T}.

Table 6 gives minimum prediction horizon length required to satisfy the terminal constraint. It can be observed that the minimum prediction horizon required in the case of arbitrary controller based approach and LQR based approach is significantly smaller as compared to the prediction horizon required in the case of terminal constraint obtained using Yu et al.’s approach. It is well established in the literature that the computation time required for NMPC iterations increases exponentially with the prediction horizon length [4, 9, 6]. Results obtained effectively validate the advantage of having larger terminal regions.

Table 6: Minimum Prediction Horizon Length Required for Various Initial Conditions in the Two State System
Approach NN
Yu et al.’s [18] approach 28
Arbitrary controller based approach 11
LQR based approach 12

Figure 1 depicts the plot of the boundaries of the terminal regions obtained using three approaches namely a) approach given by Yu et al. in [18], b) LQR based approach, and c) Arbitrary controller based approach for the two state system for an identical sampling interval of 0.10.1s. It may be noted that the terminal regions obtained using arbitrary controller based approach and LQR based approach are significantly larger than the approach by Yu et al. In addition there are several states which lie inside or near to the larger terminal regions obtained using the proposed approaches resulting in feasible initial conditions with a very small prediction horizon length, which will require a relatively larger prediction horizon lengths when the terminal region given in [18] is used as terminal inequality constraint.

Refer to caption

Figure 1: Two states system: Graphical Comparison of the Terminal Regions

5.3 Conclusions

Approaches presented in the literature for the terminal region characterization for the discrete time QIH-NMPC formulations provide limited degrees of freedom and often result in a conservative terminal region. Two approaches are presented in this work namely an arbitrary controller based approach and an LQR based approach. Proposed approaches provide two additive matrices as tuning parameters for enlargement of the terminal region. Both the approaches are scalable to system of any state and input dimension.

Efficacy of the approaches is demonstrated using benchmark two state system. It is observed that the terminal region obtained using the arbitrary controller based approach and LQR based approach are approximately 10.4723 and 9.5055 times larger by area measure when compared to the largest terminal region obtained using the approach given by Yu et al. [18]. Future research would involve choosing a completely arbitrary linear controller gain and parameterizing the approaches using different combinations of additive tuning matrices.

References

  • [1] S. J. Qin and T. A. Badgwell, “A survey of industrial model predictive control technology,” Control Engineering Practice, vol. 11, no. 7, pp. 733–764, Jul 2003.
  • [2] E. F. Camacho and C. C. Bordons, Model predictive control.   Springer, 2007.
  • [3] J. Rawlings and K. Muske, “The stability of constrained receding horizon control,” IEEE Transactions on Automatic Control, vol. 38, no. 10, pp. 1512–1516, Oct 1993.
  • [4] D. Q. Mayne, J. B. Rawlings, C. V. Rao, and P. O. M. Scokaert, “Constrained model predictive control: Stability and optimality,” Automatica, vol. 36, no. 6, pp. 789–814, Jun 2000.
  • [5] D. Q. Mayne, “Model predictive control: Recent developments and future promise,” Automatica, vol. 50, no. 12, pp. 2967–2986, Dec 2014.
  • [6] J. B. Rawlings, D. Q. Mayne, and M. Diehl, Model predictive control: theory, computation, and design, 2nd ed.   Nob Hill Publishing, LLC, 2017.
  • [7] S. S. Keerthi and E. G. Gilbert, “Optimal infinite-horizon feedback laws for a general class of constrained discrete-time systems: Stability and moving-horizon approximations,” Journal of Optimization Theory and Applications, vol. 57, no. 2, pp. 265–293, may 1988.
  • [8] H. Michalska and D. Mayne, “Robust receding horizon control of constrained nonlinear systems,” IEEE Transactions on Automatic Control, vol. 38, no. 11, pp. 1623–1633, Nov 1993.
  • [9] H. Chen and F. Allgöwer, “A quasi-infinite horizon nonlinear model predictive control scheme with guaranteed stability,” Automatica, vol. 34, no. 10, pp. 1205–1217, Oct 1998.
  • [10] W.-H. Chen, J. O’Reilly, and D. J. Ballance, “On the terminal region of model predictive control for non-linear systems with input/state constraints,” International Journal of Adaptive Control and Signal Processing, vol. 17, no. 3, pp. 195–207, apr 2003.
  • [11] S. Lucia, P. Rumschinski, A. J. Krener, and R. Findeisen, “Improved Design of Nonlinear Model Predictive Controllers,” in 5th IFAC Conference on Nonlinear Model Predictive Control NMPC 2015, vol. 48.   Seville: IFAC-PapersOnLine, jan 2015, pp. 254–259.
  • [12] C. Rajhans, S. C. Patwardhan, and H. Pillai, “Two alternate approaches for characterization of the terminal region for continuous time quasi-infinite horizon NMPC,” in 2016 12th IEEE International Conference on Control and Automation (ICCA).   IEEE, Jun 2016, pp. 98–103.
  • [13] D. Limón Marruedo, “Control predictivo de sistemas no lineales con restricciones: estabilidad y robustez,” Ph.D. dissertation, University of Seville, 2002.
  • [14] T. A. Johansen, “Approximate explicit receding horizon control of constrained nonlinear systems,” Automatica, vol. 40, no. 2, pp. 293–300, Feb 2004.
  • [15] C. Rajhans, S. C. Patwardhan, and H. Pillai, “Discrete Time Formulation of Quasi Infinite Horizon Nonlinear Model Predictive Control Scheme with Guaranteed Stability,” IFAC-PapersOnLine, vol. 50, no. 1, pp. 7181–7186, Jul 2017.
  • [16] K. J. Astrom and B. Wittenmark, Computer-controlled systems: theory and design.   Prentice Hall, 1997.
  • [17] L. Grüne and J. Pannek, Nonlinear Model Predictive Control: Theory and Algorithms.   Springer-Verlag London, 2011. [Online]. Available: https://www.springer.com/gp/book/9780857295002
  • [18] S. Yu, T. Qu, F. Xu, H. Chen, and Y. Hu, “Stability of finite horizon model predictive control with incremental input constraints,” Automatica, vol. 79, pp. 265–272, May 2017.
  • [19] D. Mayne and H. Michalska, “Receding horizon control of nonlinear systems,” IEEE Transactions on Automatic Control, vol. 35, no. 7, pp. 814–824, jul 1990.
  • [20] D. Limon, T. Alamo, and E. F. Camacho, “Enlarging the domain of attraction of MPC controllers,” Automatica, vol. 41, no. 4, pp. 629–635, apr 2005.
  • [21] T. Kailath and P. Hall, Linear Systems, ser. Information and System Sciences Series.   Prentice-Hall, 1980. [Online]. Available: https://books.google.co.in/books?id=ggYqAQAAMAAJ
  • [22] D. E. Kirk, Optimal Control Theory An Introduction Englewood Cliffs New Jersey.   New Jersey: Prentice-Hall Inc., 1970.
  • [23] H. K. Khalil, Nonlinear systems.   Prentice Hall, 2002.