This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Stochastic Model Predictive Control for Linear Systems with Unbounded Additive Uncertainties

Fei Li, Huiping Li, , and Yuyao He This work was supported in part by the National Natural Science Foundation of China (NSFC) under Grant 61922068, 61733014; in part by Shaanxi Provincial Funds for Distinguished Young Scientists under Grant 2019JC-14; in part by Aoxiang Youth Scholar Program under Grant 20GH0201111. (Corresponding author: Huiping Li)F. Li, H. Li and Y. He are with the School of Marine Science and Technology, Northwestern Polytechnical University, Xi’an 710072, China (e-mail: lifei100400@mail.nwpu.edu.cn; lihuiping@nwpu.edu.cn; heyyao@nwpu.edu.cn).
Abstract

This paper presents two stochastic model predictive control methods for linear time-invariant systems subject to unbounded additive uncertainties. The new methods are developed by formulating the chance constraints into deterministic form, which are treated in analogy with robust constraints, by using the probabilistic reachable set. Firstly, the probabilistically resolvable time-varying tube-based stochastic model predictive control algorithm is designed by employing the time-varying probabilistic reachable sets as the tubes. Secondly, by utilizing the probabilistic positively invariant set, the probabilistically resolvable constant tube-based stochastic model predictive control algorithm is developed by employing the constantly tightened constraints in the entire prediction horizons. In addition, to enhance the feasibility of the algorithms, the soft constraints are imposed to the state initializations. The algorithm feasibility and closed-loop stability results are provided. The efficacy of the approaches are demonstrated by means of numerical simulations.

Index Terms:
Tube-based stochastic MPC, uncertain system, model predictive control, chance constraints, constrained control.

I Introduction

Model predictive control (MPC), also known as the receding horizon control, is inarguably the most widely implemented modern control strategy due to its conceptual simplicity and its eminent capability of handling complex dynamics and fulfilling system constraints, as well as compromising the optimal control performance and the computational load [1, 2, 3, 4, 5]. The control performance depends, to a great extent, on the accuracy of the model. Therefore, the traditional MPC actually could not obtain the real optimal control performance, because the uncertainties, such as external disturbance and model mismatch, are widely existing. By assuming the uncertainty is bounded and considering the worst-case, the robust MPC is proposed to deal with model uncertainty [6, 7, 8]. Among the robust MPC, the tube-based MPC attracts huge attention due to its simply handling of constraints in the optimal control problem. In 2001, two typical tube-based MPC approaches are developed almost simultaneously. One of them uses the current measured state as the initial nominal state [9], while the other one adopts the nominal state predicted at the first time instant [10]. Based on [10], the David Mayne’s tube-based MPC (DTMPC) [11] with flexible initialization method is presented to reduce the conservatism, where the nominal state can be searched near the actual state within an invariant set. However, all the robust MPC frameworks do not utilize the possibly existing statistical properties of the uncertainty, albeit the statistical information might be available in many cases and the worst-case scenarios unlikely occur in practice. Therefore, robust approaches may lead to overly conservatism [12]. By employing chance constraints, the stochastic MPC (SMPC), which allows for an admissible level of constraint violation in a probabilistic sense, exploits the stochastic properties of uncertainty to achieve less conservatism in constraints satisfaction and gain better control performance [13]. Generally, SMPC methods can be classified into the analytic approximation methods [14, 15, 16] and the scenario-based methods [17, 18, 19]. The former ones reformulate the chance constraints into equivalent deterministic form by means of constraint tightening, while the latter ones generate a sufficient number of randomly samples to satisfy the chance constraints [20].

In the case of bounded uncertainty, the recursive feasiblility and stability have been established in several SMPC methods. For instance, in [21] the authors propose an SMPC scheme of time-varying constraint tightening along the prediction horizon based on the tubes of fix-shaped cross section and variable scaling which is computed offline according to chance violation. The tubes in [22] are constructed directly by making explicit use of the distribution of the additive disturbance. A recursively feasible SMPC scheme is proposed in [23] by exerting additional constrains on the first predicted step. The work [24] guarantees recursive feasibility and asymptotic stability by computing the constraint tightening based on confidence region of uncertainty propagation and adopting the flexible initialization methods. When considering the unbounded stochastic uncertainty, it is typically impossible to guarantee recursive feasibility, since the uncertainty is indeed possibly (even with small probability) large enough to make the future optimization problem infeasible [13]. One straightforward approach for guaranteeing recursive feasibility of the SMPC problems with possibly unbounded uncertainties is to set the initial nominal state as the predicted value of the previous time instant. However, it disregards the most recent state measurement, without allowing for the feedback, which may degrade the closed-loop performance [20]. To cope with this issue, an improved method of choosing the initialization between a closed-loop strategy and an open-loop one online is proposed [25, 26]. The key idea in this approach is to choose the closed-loop strategy using the measurement when the problem is feasible, and to choose the open-loop strategy when infeasible. Although this approach guarantees recursive feasibility, it requires to solve two optimisation problems at any time instant to decide the choice between the two initializing strategies. Another recent work [27] sets the initial nominal state equal to the predicted value of the first time instant, and introduces the indirect feedback via the cost function, which facilitates the recursive feasibility analysis. A special approach guarantees recursive feasibility for the system with unbounded disturbances in the case of that the chance constraint is defined as a discounted sum of violation probabilities on an infinite horizon [28].

However, regardless of the different assumptions for uncertainty, the nominal state initializations of the aforementioned works are mainly chosen to be the current measured state as in a typical tube-based MPC [9], which may cause that the open-loop nominal state is not an accurate estimate of the actual closed-loop one, and more complex proofs of recursive feasibility and stability are required [29]. While some of the nominal state initializations are set as the predicted value of the previous time instant, which disregard the feedback of measured value, may degenerate the control performance.

In this paper, the model predictive control problem of linear time-invariant (LTI) system under unbounded additive uncertainties is studied. To solve the problem, we propose two tube-based SMPC schemes based on the framework of the DTMPC [11], which fall in the category of analytic approximation method. The proposed tube-based SMPC schemes can be combined with the probabilistic reachable set (PRS) generated by any method to deal with the chance constraints. The novelty of the proposed method lies in that the flexible state initialization technique is used to reduce the conservatism and force the actual state stay within the relevant PRS around the nominal state with a specified probability. To the best of our knowledge, this framework has not been adopted in previous approaches of tube-based SMPC for unbounded uncertainties. The main contributions of this paper are as follows:

  • A new SMPC algorithm for the LTI system subject to unbounded uncertainties termed as the probabilistically resolvable time-varying tube-based SMPC (pTTSMPC) is developed. In particular, the PRS built from the uncertainty propagation is treated as the tube, by which the actual states will be restricted in the vicinity of the nominal states. The PRS is time-varying along the prediction horizon, and renders the tightened constraints change accordingly, resulting in a less conservative tightening strategy. With these techniques, the pTTSMPC algorithm is then designed based on the framework of the DTMPC [11] which can provide accurate prediction errors between the actual and nominal states, and offer an easier way to guarantee the feasibility and stability properties [29].

  • The theoretical properties of the developed algorithms are analyzed. In particular, the probabilistic resolvability, closed-loop chance constraints satisfaction and system stability are proved. Furthermore, the soft constraint is applied to the initial state which strengthens the probabilistic resolvability to stronger algorithm feasibility.

The remainder of this paper is organized as follows. Section II introduces the problem to be solved and states the required definitions. Section III presents the pTTSMPC scheme, and Section IV develops the pCTSMPC approach. In Section V, the numerical examples and comparison studies are presented and the paper is concluded in Section VI.

Notations: \mathbb{R} represents the set of all real numbers. We denote the set of all integers which are equal or greater than ii by i\mathbb{N}_{i}, and the set of all consecutive integers {i,,j}\{i,\cdots,j\} by ij\mathbb{N}_{i}^{j}. xk{x}_{k} denotes the value of a variable xx at time kk, and xk|tx_{k|t} denotes the kk-step-ahead predicted value of xx at time tt. The probability of the occurrence of event AA is indicated with Pr(A)\textbf{Pr}(A). Q0Q\succ 0 refers to a positive definite matrix, and xQ2=xTQx\|x\|_{Q}^{2}=x^{T}Qx represents the weighted norm. For a random variable xx, its expected value is indicated with E(x)\textbf{E}(x). A random variable xx of distribution 𝒬x\mathcal{Q}^{x} is denoted by x𝒬xx\sim\mathcal{Q}^{x}, and a Gaussian distribution with mean μ\mu and variance WW is represented as 𝒩(μ,W)\mathcal{N}(\mu,W). The notation AB={a+b|aA,bB}A\oplus B=\{a+b|a\in A,b\in B\} denotes the Minkowski sum, and AB={aA|a+bA,bB}A\ominus B=\{a\in A|a+b\in A,\forall b\in B\} represents the Pontryagin set difference.

II Problem Statement and Preliminaries

II-A Problem Statement

Consider a discrete-time LTI system subject to unbound additive uncertainties

xk+1=Axk+Buk+wk,x_{k+1}=Ax_{k}+Bu_{k}+w_{k}, (1)

where xknx_{k}\in\mathbb{R}^{n}, ukmu_{k}\in\mathbb{R}^{m} and wknw_{k}\in\mathbb{R}^{n} are the system state, control input and uncertainty at time kk, respectively. It is assumed that all the system states are measured perfectly, and the unbounded uncertainty wk𝒬ww_{k}\sim\mathcal{Q}^{w} is independently and identically distributed (i.i.d.). Moreover, the system in (1) is subject to the following constraints on states and inputs

xk𝕏,uk𝕌,k0,x_{k}\in\mathbb{X},u_{k}\in\mathbb{U},k\in\mathbb{N}_{0}, (2)

where 𝕏n\mathbb{X}\subset\mathbb{R}^{n}, and 𝕌m\mathbb{U}\subset\mathbb{R}^{m} are compact and each set contains the origin in its interior.

Taking advantage of the stochastic nature of the system dynamics, the chance constraints for all the predicted states and inputs in future horizon, which allow constraint violation with a probability no greater than ε(0,1)\varepsilon\in(0,1), is adopted to reduce the conservatism. The objective of this paper is to design SMPC algorithms to stabilize the system in (1) under the following chance constraints:

Pr(xk𝕏)1ε,k0,\displaystyle\textbf{Pr}(x_{k}\in\mathbb{X})\geq 1-\varepsilon,k\in\mathbb{N}_{0}, (3a)
Pr(uk𝕌)1ε,k0.\displaystyle\textbf{Pr}(u_{k}\in\mathbb{U})\geq 1-\varepsilon,k\in\mathbb{N}_{0}. (3b)

II-B Preliminaries

In this subsection, we recall some definitions which will facilitate algorithms design.

Definition 1 (Robust Positively Invariant Set [30]).

A set ZZ is said to be a robust positively invariant (RPI) set for the discrete-time system xk+1=f(xk,wk)x_{k+1}=f(x_{k},w_{k}), if xk+1Zx_{k+1}\in Z, k0k\in\mathbb{N}_{0}, when x0Zx_{0}\in Z.

Definition 2 (Probabilistic Positively Invariant Set [31]).

A set 𝒟\mathcal{D} is said to be a probabilistic positively invariant (PPI) set of probability level pp for the discrete-time system xk+1=f(xk,wk)x_{k+1}=f(x_{k},w_{k}), if Pr(xk+1𝒟)p\textbf{Pr}(x_{k+1}\in\mathcal{D})\geqslant p, k0k\in\mathbb{N}_{0}, when x0𝒟x_{0}\in\mathcal{D}.

Definition 3 (Confidence Region [32]).

A set p\mathcal{E}_{p} is said to be a confidence region of probability level pp for a random variable xx, if Pr(xp)p\textbf{Pr}(x\in\mathcal{E}_{p})\geqslant p.

Definition 4 (nn-step Probabilistic Reachable Set [26, 27]).

A set n\mathcal{R}_{n} is an nn-step probabilistic reachable set (nn-step PRS) of probability level pp for the discrete-time system xk+1=f(xk,wk)x_{k+1}=f(x_{k},w_{k}), if Pr(xnn)p\textbf{Pr}(x_{n}\in\mathcal{R}_{n})\geqslant p, initialized from x0x_{0}.

Definition 5 (Probabilistic Resolvability [33]).

An optimization problem is probabilistically resolvable if it has feasible solutions at future time steps with a certain probability, given a feasible solution at the current time.

Definition 6 (Almost Surely Asymptotical Stability [34]).

A discrete-time system xk+1=f(xk,wk)x_{k+1}=f(x_{k},w_{k}) is almost surely asymptotically stable in the mean if Pr(limkE(xk)=0)=1\textbf{Pr}(\lim\limits_{k\rightarrow\infty}\textbf{E}(x_{k})=0)=1.

III Probabilistic Time-varying Tube-based SMPC

III-A Problem Decoupling

In order to derive a computationally tractable MPC algorithm, we make use of the techniques from DTMPC [11] to decouple the nominal trajectory planning and the uncertainty handling. Then, the system dynamics in (1) can be separated into a nominal dynamics and an error dynamics as

sk+1=Ask+Bvk,\displaystyle s_{k+1}=As_{k}+Bv_{k}, (4a)
ek+1=Aclek+wk,\displaystyle e_{k+1}=A_{cl}e_{k}+w_{k}, (4b)

where the nominal state is denoted by skns_{k}\in\mathbb{R}^{n} and the nominal input by vkmv_{k}\in\mathbb{R}^{m}. Without loss of generality, the feedback gain KK is chosen as the solution of the linear quadratic regulator (LQR) problem for the dynamics sk+1=Ask+Bvks_{k+1}=As_{k}+Bv_{k} to ensure that Acl=A+BKA_{cl}=A+BK is stable. The state feedback control law is used as in DTMPC

uk=Kek+vk.u_{k}=Ke_{k}+v_{k}. (5)

The state error eke_{k} between the actual state xkx_{k} and the nominal state sks_{k}, and the input error eku=Keke_{k}^{u}=Ke_{k} between the control input uku_{k} and the nominal input vkv_{k} are defined as

ek=xksk,\displaystyle e_{k}=x_{k}-s_{k}, (6a)
eku=ukvk.\displaystyle e_{k}^{u}=u_{k}-v_{k}. (6b)

III-B Probabilistic Reachable Set

Suppose that 1εw\mathcal{E}^{w}_{1-\varepsilon} is a confidence region of probability level 1ε1-\varepsilon for the uncertainty ww, where ε(0,1)\varepsilon\in(0,1). Then

Pr(w1εw)1ε\textbf{Pr}(w\in\mathcal{E}^{w}_{1-\varepsilon})\geq 1-\varepsilon (7)

follows from Definition 3. The formulation methods for confidence region of uncertainty, such as the Chebyshev inequality method [15], the Scenario generation method [20], the Box-shaped method and the Ellipsoidal method [32], can be incorporated with the proposed tube-based SMPC scheme.

Suppose that the confidence region 1εek\mathcal{E}^{e_{k}}_{1-\varepsilon} is a kk-step PRS of probability level 1ε1-\varepsilon for the state error dynamics in (4b), where ε(0,1)\varepsilon\in(0,1). Then,

Pr(ek1εek)1ε,k0,\textbf{Pr}(e_{k}\in\mathcal{E}^{e_{k}}_{1-\varepsilon})\geq 1-\varepsilon,k\in\mathbb{N}_{0}, (8)

follows from Definition 4 directly. The kk-step PRS of probability level 1ε1-\varepsilon for the input error can be calculated as K1εekK\mathcal{E}^{e_{k}}_{1-\varepsilon} due to eku=Keke_{k}^{u}=Ke_{k}. Then

Pr(ekuK1εek)1ε,k0,\textbf{Pr}(e_{k}^{u}\in K\mathcal{E}^{e_{k}}_{1-\varepsilon})\geq 1-\varepsilon,k\in\mathbb{N}_{0}, (9)

is derived.

Any method to compute nested PRS, namely 1εek1εek+1\mathcal{E}^{e_{k}}_{1-\varepsilon}\subseteq\mathcal{E}^{e_{k+1}}_{1-\varepsilon}, k0k\in\mathbb{N}_{0}, can be combined with the proposed tube-based SMPC scheme. For instance, the Ellipsoidal method and Half-space method [27] can be adopted to produce the PRS for uncertainties with mean and variance information. For uncertainty of any type, provided that samples can be obtained, the Scenario approach [35] can be used to formulate the PRS. And the above formulated kk-step PRS has the property that 1εek+1Acl1εek1εw\mathcal{E}^{e_{k+1}}_{1-\varepsilon}\subseteq A_{cl}\mathcal{E}^{e_{k}}_{1-\varepsilon}\oplus\mathcal{E}^{w}_{1-\varepsilon}, k0k\in\mathbb{N}_{0} [32].

Especially, consider that the uncertainty ww follows the normal distribution w𝒩(0,W)w\sim\mathcal{N}(0,W). Let 1ε1-\varepsilon be the confidence coefficient of the uncertainty, where ε(0,1)\varepsilon\in(0,1). Then the quantile value corresponding to 1ε1-\varepsilon is α=Φ1(1ε)\alpha=\Phi^{-1}(1-\varepsilon), where Φ1\Phi^{-1} represents the quantile function of the standard normal distribution [16]. Then, the uncertainty confidence region of probability level 1ε1-\varepsilon can be obtained as

1εw{w:αW1/2wαW1/2}.\mathcal{E}^{w}_{1-\varepsilon}\triangleq\{w:-\alpha W^{1/2}\leq w\leq\alpha W^{1/2}\}. (10)

Since the error dynamics in (4b) is linear, the distribution of eke_{k} is still normal. Then, the variance propagation of uncertainty can be evolved as

Σk+1=AclΣkAclT+W,k0,\Sigma_{k+1}=A_{cl}\Sigma_{k}A_{cl}^{T}+W,k\in\mathbb{N}_{0}, (11)

where Σ0=W\Sigma_{0}=W. In addition, Σk+1Σk\Sigma_{k+1}\geq\Sigma_{k} follows from the expanded representation of above equation. The kk-step PRS of probability level 1ε1-\varepsilon can be defined as

1εek{e:αΣk1/2eαΣk1/2},\mathcal{E}^{e_{k}}_{1-\varepsilon}\triangleq\{e:-\alpha\Sigma_{k}^{1/2}\leq e\leq\alpha\Sigma_{k}^{1/2}\}, (12)

which is the error confidence region of probability level 1ε1-\varepsilon, and 1εe0=1εw\mathcal{E}^{e_{0}}_{1-\varepsilon}=\mathcal{E}^{w}_{1-\varepsilon}. Allowing for Σk+1Σk\Sigma_{k+1}\geq\Sigma_{k}, the corresponding probabilistic reachable sets are nested, namely, 1εek1εek+1\mathcal{E}^{e_{k}}_{1-\varepsilon}\subseteq\mathcal{E}^{e_{k+1}}_{1-\varepsilon}. Another property of the kk-step PRS see the following lemma.

Lemma 1.

Consider w𝒩(0,W)w\sim\mathcal{N}(0,W). Let 1εw\mathcal{E}^{w}_{1-\varepsilon} be the uncertainty confidence region defined in (10). Let 1εek\mathcal{E}^{e_{k}}_{1-\varepsilon} be the kk-step PRS defined in (12). Then 1εek+1Acl1εek1εw\mathcal{E}^{e_{k+1}}_{1-\varepsilon}\subseteq A_{cl}\mathcal{E}^{e_{k}}_{1-\varepsilon}\oplus\mathcal{E}^{w}_{1-\varepsilon}, k0k\in\mathbb{N}_{0}, holds.

Proof:

According to (11) and (12), we get that
1εek+1={e:αΣk+11/2eαΣk+11/2}={e:α(AclΣkAclT+W)1/2eα(AclΣkAclT+W)1/2}\mathcal{E}^{e_{k+1}}_{1-\varepsilon}=\{e:-\alpha\Sigma_{k+1}^{1/2}\leq e\leq\alpha\Sigma_{k+1}^{1/2}\}=\\ \{e:-\alpha(A_{cl}\Sigma_{k}A_{cl}^{T}+W)^{1/2}\leq e\leq\alpha(A_{cl}\Sigma_{k}A_{cl}^{T}+W)^{1/2}\}.
According to (10) and (12), Acl1εek1εw={e:αAclΣk1/2αW1/2eαAclΣk1/2+αW1/2}A_{cl}\mathcal{E}^{e_{k}}_{1-\varepsilon}\oplus\mathcal{E}^{w}_{1-\varepsilon}=\\ \{e:-\alpha A_{cl}\Sigma_{k}^{1/2}-\alpha W^{1/2}\leq e\leq\alpha A_{cl}\Sigma_{k}^{1/2}+\alpha W^{1/2}\} is deduced. Then, 1εek+1Acl1εek1εw\mathcal{E}^{e_{k+1}}_{1-\varepsilon}\subseteq A_{cl}\mathcal{E}^{e_{k}}_{1-\varepsilon}\oplus\mathcal{E}^{w}_{1-\varepsilon} can be obtained. ∎

III-C Constraint Handling

Because 1εek+1Acl1εek1εw\mathcal{E}^{e_{k+1}}_{1-\varepsilon}\subseteq A_{cl}\mathcal{E}^{e_{k}}_{1-\varepsilon}\oplus\mathcal{E}^{w}_{1-\varepsilon}, k0k\in\mathbb{N}_{0}, then 1εeki=0kAcli1εw\mathcal{E}^{e_{k}}_{1-\varepsilon}\subseteq\sum\limits_{i=0}^{k}\oplus A_{cl}^{i}\mathcal{E}^{w}_{1-\varepsilon} follows from the recursively expanding. We define the relaxed PRS as

𝒟ki=0kAcli1εw,k0.\mathcal{D}_{k}\triangleq\sum\limits_{i=0}^{k}\oplus A_{cl}^{i}\mathcal{E}^{w}_{1-\varepsilon},k\in\mathbb{N}_{0}. (13)

Then, 1εek𝒟k𝒟k+1\mathcal{E}^{e_{k}}_{1-\varepsilon}\subseteq\mathcal{D}_{k}\subseteq\mathcal{D}_{k+1} follows directly. And

Pr(ek𝒟k)1ε,k0,\textbf{Pr}(e_{k}\in\mathcal{D}_{k})\geq 1-\varepsilon,k\in\mathbb{N}_{0}, (14)

can be gained from (8), and

Pr(ekuK𝒟k)1ε,k0,\textbf{Pr}(e_{k}^{u}\in K\mathcal{D}_{k})\geq 1-\varepsilon,k\in\mathbb{N}_{0}, (15)

can be inferred from (9).

To guarantee the chance constraints, we propose a strategy to formulate the chance constraint sets as the time-varying deterministic constraints by using the relaxed PRS in (13). Construct the time-varying tightened state constraint set at step kk as

𝒞~k𝕏𝒟k,k0.\tilde{\mathcal{C}}_{k}\triangleq\mathbb{X}\ominus\mathcal{D}_{k},k\in\mathbb{N}_{0}. (16)

Then, 𝒞~k+1𝒞~k\tilde{\mathcal{C}}_{k+1}\subseteq\tilde{\mathcal{C}}_{k} holds owing to 𝒟k𝒟k+1\mathcal{D}_{k}\subseteq\mathcal{D}_{k+1}. It can be seen that, if the nominal state sk𝒞~ks_{k}\in\tilde{\mathcal{C}}_{k}, then the state chance constraint in (3a) is satisfied, i.e., Pr(xk=sk+ek𝕏)1ε\textbf{Pr}(x_{k}=s_{k}+e_{k}\in\mathbb{X})\geq 1-\varepsilon can be guaranteed by (14).

The satisfaction of input chance constraint in (3b), i.e., Pr(uk=vk+eku𝕌)1ε\textbf{Pr}(u_{k}=v_{k}+e_{k}^{u}\in\mathbb{U})\geq 1-\varepsilon can be guaranteed by constructing the tightened input constraint set at step kk as

𝒱~k𝕌K𝒟k,k0.\tilde{\mathcal{V}}_{k}\triangleq\mathbb{U}\ominus K\mathcal{D}_{k},k\in\mathbb{N}_{0}. (17)

Similarly, if the nominal input vk𝒱~kv_{k}\in\tilde{\mathcal{V}}_{k}, then the input chance constraint in (3b) is satisfied according to (15).

Furthermore, let the minimal robust positively invariant (mRPI) set

Zi=0Acli1εwZ\triangleq\sum\limits_{i=0}^{\infty}\oplus A_{cl}^{i}\mathcal{E}^{w}_{1-\varepsilon} (18)

for the error dynamics in (4b) with w1εww\in\mathcal{E}^{w}_{1-\varepsilon} be computed as the outer approximation using the strategy in [36]. Then, 𝒟kZ\mathcal{D}_{k}\subseteq Z and K𝒟kKZK\mathcal{D}_{k}\subseteq KZ can be derived from (13).

III-D Formulation of pTTSMPC

For the cost function, we adopt the stage cost (sk,vk)=skQ2+vkR2\ell(s_{k},v_{k})=\|s_{k}\|_{Q}^{2}+\|v_{k}\|_{R}^{2}, and the terminal cost Vf=sP2V_{f}=\|s\|_{P}^{2}, where Q0Q\succ 0, R0R\succ 0, and PP is the solution of the algebraic Lyapunov equation

(A+BK)TP(A+BK)P=QKTRK.(A+BK)^{T}P(A+BK)-P=-Q-K^{T}RK.

To ensure the feasibility, we construct the terminal constraint set as the maximal output admissible set [37]

𝒳~f{sn:sk𝒞~k,Ksk𝒱~k,k0}.\tilde{\mathcal{X}}_{f}\triangleq\{s\in\mathbb{R}^{n}:s_{k}\in\tilde{\mathcal{C}}_{k},Ks_{k}\in\tilde{\mathcal{V}}_{k},k\in\mathbb{N}_{0}\}. (19)

In this case, 𝒳~f\tilde{\mathcal{X}}_{f} satisfies the axioms [38]:

  • A1:

    Acl𝒳~f𝒳~f,𝒳~f𝕏Z,K𝒳~f𝕌KZA_{cl}\tilde{\mathcal{X}}_{f}\subset\tilde{\mathcal{X}}_{f},\tilde{\mathcal{X}}_{f}\subset\mathbb{X}\ominus Z,K\tilde{\mathcal{X}}_{f}\subset\mathbb{U}\ominus KZ.

  • A2:

    Vf(Aclx)+(x,Kx)Vf(x),x𝒳~fV_{f}(A_{cl}x)+\ell(x,Kx)\leq V_{f}(x),\forall x\in\tilde{\mathcal{X}}_{f}.

On the basis of the time-varying constraint tightening method and the DTMPC framework in [11], the probabilistic time-varying tube-based stochastic finite horizon optimal control problem Nttsmpc(xt)\mathbb{P}^{ttsmpc}_{N}(x_{t}) to be solved at each time instant tt is formulated as follows:

mins0|t,vt\displaystyle\min_{s_{0|t},\textbf{{v}}_{t}} k=0N1(sk|tQ2+vk|tR2)+sN|tP2\displaystyle\sum\limits_{k=0}^{N-1}(\|s_{k|t}\|_{Q}^{2}+\|v_{k|t}\|_{R}^{2})+\|s_{N|t}\|_{P}^{2} (20a)
s.t.\displaystyle s.t.\quad sk|t=Ask|t+Bvk|t,\displaystyle s_{k|t}=As_{k|t}+Bv_{k|t}, (20b)
sk|t𝒞~k+t,k1N1,\displaystyle s_{k|t}\in\tilde{\mathcal{C}}_{k+t},k\in\mathbb{N}_{1}^{N-1}, (20c)
vk|t𝒱~k+t,k0N1,\displaystyle v_{k|t}\in\tilde{\mathcal{V}}_{k+t},k\in\mathbb{N}_{0}^{N-1}, (20d)
xts0|t𝒟t,\displaystyle x_{t}-s_{0|t}\in\mathcal{D}_{t}, (20e)
sN|t𝒳~f.\displaystyle s_{N|t}\in\tilde{\mathcal{X}}_{f}. (20f)

The optimal solution of Nttsmpc(xt)\mathbb{P}^{ttsmpc}_{N}(x_{t}) consists of nominal state s0|ts_{0|t}^{*} and input sequence v(xt)=[v0|t,v1|t,,vN1|t]\textbf{{v}}^{*}(x_{t})=[v_{0|t}^{*},v_{1|t}^{*},\cdots,v_{N-1|t}^{*}], and the associated optimal state sequence for the nominal system is s(xt)=[s0|t,s1|t,,sN|t]\textbf{{s}}^{*}(x_{t})=[s_{0|t}^{*},s_{1|t}^{*},\cdots,s_{N|t}^{*}].

At each time instant, Nttsmpc(xt)\mathbb{P}^{ttsmpc}_{N}(x_{t}) is solved to generate v(xt)\textbf{{v}}^{*}(x_{t}) and s0|ts_{0|t}^{*}, and the optimal control law is designed as:

u(xt)=K(xts0|t)+v0|t.u^{*}(x_{t})=K(x_{t}-s_{0|t}^{*})+v_{0|t}^{*}. (21)

The entire process of the probabilistic time-varying tube-based SMPC is repeated for all t0t\geqslant 0 to yield a receding horizon control strategy.

III-E Properties of pTTSMPC

In this section, the feasibility of the designed pTTSMPC scheme is first presented, then the closed-loop chance constraint satisfaction is verified, and the stability results of the closed-loop system are also provided. Before that, a lemma for developing the feasibility result is presented.

Lemma 2.

Let 1εet\mathcal{E}^{e_{t}}_{1-\varepsilon} be the tt-step PRS of probability level 1ε1-\varepsilon for the state error dynamics in (4b) with wt𝒬ww_{t}\sim\mathcal{Q}^{w}. For xt+1=Axt+But+wt,x_{t+1}=Ax_{t}+Bu_{t}+w_{t}, and sk+1|t=Ask|t+Bvk|ts_{k+1|t}=As_{k|t}+Bv_{k|t}, if xts0|t𝒟tx_{t}\in s_{0|t}\oplus\mathcal{D}_{t} and ut=K(xts0|t)+v0|tu_{t}=K(x_{t}-s_{0|t})+v_{0|t}, then Pr(xt+1s1|t𝒟t+1)1ε\textbf{Pr}(x_{t+1}\in s_{1|t}\oplus\mathcal{D}_{t+1})\geqslant 1-\varepsilon for all wt𝒬ww_{t}\sim\mathcal{Q}^{w}.

Proof:

The proof follows the similar line of reasoning as Proposition 1 in [10]. In (6a) we have defined et=xts0|te_{t}=x_{t}-s_{0|t} and et+1=xt+1s1|te_{t+1}=x_{t+1}-s_{1|t}. Due to xts0|t𝒟tx_{t}\in s_{0|t}\oplus\mathcal{D}_{t}, it follows that et=xts0|t𝒟te_{t}=x_{t}-s_{0|t}\in\mathcal{D}_{t}. Since, according to (4b), (7) and (13), Pr(et+1=Aclet+wtAcl𝒟t1εw)1ε\textbf{Pr}(e_{t+1}=A_{cl}e_{t}+w_{t}\in A_{cl}\mathcal{D}_{t}\oplus\mathcal{E}^{w}_{1-\varepsilon})\geqslant 1-\varepsilon and 𝒟t+1=Acl𝒟t1εw\mathcal{D}_{t+1}=A_{cl}\mathcal{D}_{t}\oplus\mathcal{E}^{w}_{1-\varepsilon} hold, then Pr(xt+1=s1|t+et+1s1|t𝒟t+1)1ε\textbf{Pr}(x_{t+1}=s_{1|t}+e_{t+1}\in s_{1|t}\oplus\mathcal{D}_{t+1})\geqslant 1-\varepsilon follows. ∎

Based on Lemma 2, the probabilistic resolvability result is reported as follows.

Theorem 1 (Probabilistic Resolvability).

Consider the optimal control problem Nttsmpc(xt)\mathbb{P}_{N}^{ttsmpc}(x_{t}) in (20) for the system dynamics in (1) under the control law in (21). If t=[s0|t,v0|t,v1|t,,vN1|t]\mathcal{M}_{t}=[s_{0|t}^{*},v_{0|t}^{*},v_{1|t}^{*},\cdots,v_{N-1|t}^{*}] is feasible for the problem at time instant tt, then applying the control law in (21) gives the probabilistic resolvability at time instant t+1t+1.

Proof:

The proof follows the similar line of reasoning as Proposition 3 in [11]. Since t\mathcal{M}_{t} is feasible for the problem Nttsmpc(xt)\mathbb{P}_{N}^{ttsmpc}(x_{t}), the constraints (20c)-(20f) are satisfied by v(xt)\textbf{{v}}^{*}(x_{t}) and s(xt)\textbf{{s}}^{*}(x_{t}). The shifted input sequence of v(xt)\textbf{{v}}^{*}(x_{t}) is denoted as v(xt)=[v1|t,,vN1|t,KsN|t]\vec{\textbf{{v}}}^{*}(x_{t})=[v_{1|t}^{*},\cdots,v_{N-1|t}^{*},Ks_{N|t}^{*}] and the shifted state sequence of s(xt)\textbf{{s}}^{*}(x_{t}) is denoted as s(xt)=[s1|t,,sN|t,AclsN|t]\vec{\textbf{{s}}}^{*}(x_{t})=[s_{1|t}^{*},\cdots,s_{N|t}^{*},A_{cl}s_{N|t}^{*}]. We choose the candidate solution t+1=[s1|t,v1|t,,vN1|t,KsN|t]\mathcal{M}_{t+1}=[s_{1|t}^{*},v_{1|t}^{*},\cdots,v_{N-1|t}^{*},Ks_{N|t}^{*}]. Hence, the first NN elements of s(xt)\vec{\textbf{{s}}}^{*}(x_{t}) satisfy the time-varying tightened state constraints (20c) owing to the fact that if sk+1|t𝒞~k+1+ts_{k+1|t}^{*}\in\tilde{\mathcal{C}}_{k+1+t}, then sk+1|t𝒞~k+ts_{k+1|t}^{*}\in\tilde{\mathcal{C}}_{k+t} as 𝒞~k+1+t𝒞~k+t\tilde{\mathcal{C}}_{k+1+t}\subseteq\tilde{\mathcal{C}}_{k+t}. And the first N1N-1 elements of v(xt)\vec{\textbf{{v}}}^{*}(x_{t}) satisfy the time-varying tightened input constraint (20d) analogously. Because sN|tX~fs_{N|t}^{*}\in\tilde{X}_{f}, it follows from A1 that AclsN|tX~fA_{cl}s_{N|t}^{*}\in\tilde{X}_{f} and KsN|t𝕌KZKs_{N|t}^{*}\in\mathbb{U}\ominus KZ. So the last element AclsN|tA_{cl}s_{N|t}^{*} of s(xt)\vec{\textbf{{s}}}^{*}(x_{t}) satisfies the terminal constraint (20f). Allowing for K𝒟t+NKZK\mathcal{D}_{t+N}\subseteq KZ, thus KsN|t𝕌KZ𝕌𝒟t+NKs_{N|t}^{*}\in\mathbb{U}\ominus KZ\subseteq\mathbb{U}\ominus\mathcal{D}_{t+N} follows. Therefore, the last element KsN|tKs_{N|t}^{*} of v(xt)\vec{\textbf{{v}}}^{*}(x_{t}) satisfies constraint (20d). Moreover, from Lemma 2 we have that if xts0|t𝒟tx_{t}\in s_{0|t}^{*}\oplus\mathcal{D}_{t}, then Pr(xt+1s1|t𝒟t+1)1ε\textbf{Pr}(x_{t+1}\in s_{1|t}^{*}\oplus\mathcal{D}_{t+1})\geqslant 1-\varepsilon for all wt𝒬ww_{t}\sim\mathcal{Q}^{w}, i.e., the initial constraint (20e) is satisfied with the probability no less than 1ε1-\varepsilon. Summarizing the satisfaction status of the state constraint (20c), the input constraint (20d), the initial constraint (20e), and the terminal constraint (20f), t+1\mathcal{M}_{t+1} is probabilistically resolvable for the problem Nttsmpc(xt+1)\mathbb{P}_{N}^{ttsmpc}(x_{t+1}). ∎

Suppose that the problem Nttsmpc(xt)\mathbb{P}_{N}^{ttsmpc}(x_{t}) is feasible, then the satisfaction of closed-loop chance constraints follows directly, which is shown in the Theorem 2.

Theorem 2 (Chance Constraint Satisfaction).

Consider the optimal control problem Nttsmpc(xt)\mathbb{P}_{N}^{ttsmpc}(x_{t}) in (20) for the system dynamics in (1) under the control law in (21). Suppose Nttsmpc(xt)\mathbb{P}_{N}^{ttsmpc}(x_{t}) is recursively feasible, then the closed-loop state chance constraint in (3a) and input chance constraint in (3b) are satisfied.

Proof:

From the feasibility of the optimal control problem Nttsmpc(xt)\mathbb{P}_{N}^{ttsmpc}(x_{t}) in (20) we have that et=xts0|t𝒟te_{t}=x_{t}-s_{0|t}^{*}\in\mathcal{D}_{t}, t0t\in\mathbb{N}_{0}. Using the fact 𝒟t+1=Acl𝒟t1εw\mathcal{D}_{t+1}=A_{cl}\mathcal{D}_{t}\oplus\mathcal{E}^{w}_{1-\varepsilon}, we can get Pr(Aclet+wt𝒟t+1|et𝒟t)1ε\textbf{Pr}(A_{cl}e_{t}+w_{t}\in\mathcal{D}_{t+1}|e_{t}\in\mathcal{D}_{t})\geq 1-\varepsilon for all wt𝒬ww_{t}\sim\mathcal{Q}^{w} following from (7). Since s1|t𝒞~t+1s_{1|t}^{*}\in\tilde{\mathcal{C}}_{t+1}, Pr(s1|t+Aclet+wt𝒞~t+1𝒟t+1|et𝒟t)1ε\textbf{Pr}(s_{1|t}^{*}+A_{cl}e_{t}+w_{t}\in\tilde{\mathcal{C}}_{t+1}\oplus\mathcal{D}_{t+1}|e_{t}\in\mathcal{D}_{t})\geq 1-\varepsilon holds. Consequently, as xt+1=s1|t+Aclet+wtx_{t+1}=s_{1|t}^{*}+A_{cl}e_{t}+w_{t} and 𝕏=𝒞~t+1𝒟t+1\mathbb{X}=\tilde{\mathcal{C}}_{t+1}\oplus\mathcal{D}_{t+1}, it follows that Pr(xt+1𝕏)1ε\textbf{Pr}(x_{t+1}\in\mathbb{X})\geq 1-\varepsilon, t0t\in\mathbb{N}_{0}. The same argument holds for the input constraint, namely, Pr(ut+1𝕌)1ε\textbf{Pr}(u_{t+1}\in\mathbb{U})\geq 1-\varepsilon, t0t\in\mathbb{N}_{0}. ∎

Finally, the stability result is presented in the following theorem.

Theorem 3 (Stability).

Consider the optimal control problem Nttsmpc(xt)\mathbb{P}_{N}^{ttsmpc}(x_{t}) in (20) for the system dynamics in (1) under the control law in (21). Suppose that ZZ is the mRPI set for the error dynamics in (4b) and Nttsmpc(xt)\mathbb{P}_{N}^{ttsmpc}(x_{t}) is recursively feasible, the following hold.

  • (a)

    ZZ is probabilistically stable for the closed-loop system.

  • (b)

    If the uncertainty has zero mean, the closed-loop system is almost surely asymptotically stable in the mean.

Proof:

Denote the cost function as J(st,vt)=k=0N1(sk|tQ2+vk|tR2)+sN|tP2J(\textbf{{s}}_{t},\textbf{{v}}_{t})=\sum\limits_{k=0}^{N-1}(\|s_{k|t}\|_{Q}^{2}+\|v_{k|t}\|_{R}^{2})+\|s_{N|t}\|_{P}^{2}, and the optimal cost value as J(st,vt)=minJ(st,vt)J^{*}(\textbf{{s}}_{t},\textbf{{v}}_{t})=\min J(\textbf{{s}}_{t},\textbf{{v}}_{t}). From the Proposition 3 in [11] we have that J(st+1,vt+1)J(st,vt)s0|tQ2v0|tR2J^{*}(\textbf{{s}}_{t+1},\textbf{{v}}_{t+1})-J^{*}(\textbf{{s}}_{t},\textbf{{v}}_{t})\leq-\|s_{0|t}\|_{Q}^{2}-\|v_{0|t}\|_{R}^{2}. Summing both sides of above inequality over all t0t\geq 0 gives the closed-loop performance bound as t=0(s0|tQ2+v0|tR2)J(s0,v0)J(s,v)\sum\limits_{t=0}^{\infty}(\|s_{0|t}\|_{Q}^{2}+\|v_{0|t}\|_{R}^{2})\leq J^{*}(\textbf{{s}}_{0},\textbf{{v}}_{0})-J^{*}(\textbf{{s}}_{\infty},\textbf{{v}}_{\infty}). The left hand side of the inequality is the cost value along the nominal trajectory. Since J(st,vt)0J^{*}(\textbf{{s}}_{t},\textbf{{v}}_{t})\geq 0 for all tt, it follows that t=0(s0|tQ2+v0|tR2)J(s0,v0)\sum\limits_{t=0}^{\infty}(\|s_{0|t}\|_{Q}^{2}+\|v_{0|t}\|_{R}^{2})\leq J^{*}(\textbf{{s}}_{0},\textbf{{v}}_{0}). Given that the optimal cost is necessarily finite if the problem Nttsmpc(xt)\mathbb{P}_{N}^{ttsmpc}(x_{t}) is feasible, and since each term on the left hand side of the above inequality is non-negative, limt(s0|tQ2+v0|tR2)=0\lim\limits_{t\rightarrow\infty}(\|s_{0|t}\|_{Q}^{2}+\|v_{0|t}\|_{R}^{2})=0 follows. Furthermore, since Q0Q\succ 0, R0R\succ 0, it implies that limksk|t=0\lim\limits_{k\rightarrow\infty}s_{k|t}=0 following from Theorem 2.7 in [1]. Consequently, the nominal origin s=0s=0 is exponentially stable for the nominal dynamics by following from Theorem 2.8 in [1]. Moreover, Pr(et+kZ)1ε\textbf{Pr}(e_{t+k}\in Z)\geqslant 1-\varepsilon holds, due to Pr(et+k𝒟t+k)1ε\textbf{Pr}(e_{t+k}\in\mathcal{D}_{t+k})\geqslant 1-\varepsilon and 𝒟t+kZ\mathcal{D}_{t+k}\subseteq Z, hence Pr(limkxt+k=limk(sk|t+et+k)=limket+kZ)1ε\textbf{Pr}(\lim\limits_{k\rightarrow\infty}x_{t+k}=\lim\limits_{k\rightarrow\infty}(s_{k|t}+e_{t+k})=\lim\limits_{k\rightarrow\infty}e_{t+k}\in Z)\geqslant 1-\varepsilon. This completes the proof for part (a).

The proof of part (b) is as follows. Since the uncertainty is zero-mean, allowing for the error dynamics in (4b), Pr(limkE(et+k)=0)=1\textbf{Pr}(\lim\limits_{k\rightarrow\infty}\textbf{E}(e_{t+k})=0)=1 is derived directly. Then Pr(limkE(xt+k)=0)=Pr(limkE(sk|t+et+k)=0)=Pr(limkE(et+k)=0)=1\textbf{Pr}(\lim\limits_{k\rightarrow\infty}\textbf{E}(x_{t+k})=0)=\textbf{Pr}(\lim\limits_{k\rightarrow\infty}\textbf{E}(s_{k|t}+e_{t+k})=0)=\textbf{Pr}(\lim\limits_{k\rightarrow\infty}\textbf{E}(e_{t+k})=0)=1. Thus, according to Definition 6, the closed-loop system is almost surely asymptotically stable in the mean. ∎

III-F Enhanced Feasibility

In order to avoid possible infeasibility when violation occurs, the slack variable can be imposed on the state constraints [40, 16, 39]. However, here, we only impose the slack variable λ\lambda on the flexible initial state constraint in (20e), such that

xts0|tλ𝒟t,λ1.x_{t}-s_{0|t}\in\lambda\mathcal{D}_{t},\lambda\geq 1. (22)

And a finite penalty term

ψ(λ)=γ(11+e(λ1)12)\psi(\lambda)=\gamma(\frac{1}{1+e^{-(\lambda-1)}}-\frac{1}{2}) (23)

is added to the cost function in (20a) to force λ\lambda to tend towards 1, so that the distance between the actual state and the nominal state will be as close as possible while maintaining the feasibility. The weight γ\gamma is chosen large enough so that ψ(λ)\psi(\lambda) makes the best of the penalty function [41]. As a result, the probabilistic resolvability of the optimal control problem Nttsmpc(xt)\mathbb{P}^{ttsmpc}_{N}(x_{t}) is enhanced to strong feasibility by using the soft initial state constraint in (22) and the penalty term in (23). Moreover, this approach allows for solving a single optimisation problem instead of having to verify feasibility and decide which MPC problem to be solved accordingly, as is the case in [25, 15, 26].

IV Probabilistic Constant Tube-based SMPC

IV-A Probabilistic Positively Invariant Set

The relationship between the RPI set and the PPI set is presented in the following proposition.

Proposition 1.

[32] Let ZZ be a RPI set for the error dynamics in (4b) with w1εww\in\mathcal{E}^{w}_{1-\varepsilon}, where 1εw\mathcal{E}^{w}_{1-\varepsilon} is the confidence region of probability 1ε1-\varepsilon for ww. If 1εek+1Acl1εek1εw\mathcal{E}^{e_{k+1}}_{1-\varepsilon}\subseteq A_{cl}\mathcal{E}^{e_{k}}_{1-\varepsilon}\oplus\mathcal{E}^{w}_{1-\varepsilon}, k0k\in\mathbb{N}_{0}, then ZZ is also a PPI set of probability level 1ε1-\varepsilon for the error dynamics in (4b) with w𝒬ww\sim\mathcal{Q}^{w}.

Let ZZ be the mRPI set for the error dynamics in (4b) with w1εww\in\mathcal{E}^{w}_{1-\varepsilon} computed as in (18). If 1εw\mathcal{E}^{w}_{1-\varepsilon} is a box-shaped or an ellipsoidal confidence region, then ZZ is also the minimal probabilistic positively invariant (mPPI) set of probability level 1ε1-\varepsilon for the error dynamics in (4b) with w𝒬ww\sim\mathcal{Q}^{w} [32].

Especially, when the uncertainty ww follows the normal distribution w𝒩(0,W)w\sim\mathcal{N}(0,W), the following property holds.

Corollary 1.

Let ZZ be the mRPI set for the error dynamics in (4b) with w1εww\in\mathcal{E}^{w}_{1-\varepsilon}, where 1εw\mathcal{E}^{w}_{1-\varepsilon} is defined in (10). Then ZZ is also the mPPI set of probability level 1ε1-\varepsilon for the error dynamics in (4b) with w𝒩(0,W)w\sim\mathcal{N}(0,W).

Proof:

According to Lemma 1, 1εek+1Acl1εek1εw\mathcal{E}^{e_{k+1}}_{1-\varepsilon}\subseteq A_{cl}\mathcal{E}^{e_{k}}_{1-\varepsilon}\oplus\mathcal{E}^{w}_{1-\varepsilon}, k0k\in\mathbb{N}_{0}. Thus, the claim follows from the Proposition 1. ∎

IV-B Formulation of pCTSMPC

Since ZZ is the mPPI set of probability level 1ε1-\varepsilon for the error dynamics in (4b) with w𝒬ww\sim\mathcal{Q}^{w}, then

Pr(ekZ)1ε,k0,\textbf{Pr}(e_{k}\in Z)\geq 1-\varepsilon,k\in\mathbb{N}_{0}, (24)

follows from Definition 2 directly. And

Pr(ekuKZ)1ε,k0,\textbf{Pr}(e_{k}^{u}\in KZ)\geq 1-\varepsilon,k\in\mathbb{N}_{0}, (25)

can be derived from eku=Keke_{k}^{u}=Ke_{k} subsequently.

Construct the constantly tightened state constraint set as

𝒞¯𝕏Z,\bar{\mathcal{C}}\triangleq\mathbb{X}\ominus Z, (26)

then, the chance constraint in (3a) is satisfied. The reason is that, if the nominal state sk𝒞¯s_{k}\in\bar{\mathcal{C}}, then, Pr(xk=sk+ek𝕏)1ε\textbf{Pr}(x_{k}=s_{k}+e_{k}\in\mathbb{X})\geq 1-\varepsilon can be guaranteed by (24).

The satisfaction of Pr(uk=vk+eku𝕌)1ε\textbf{Pr}(u_{k}=v_{k}+e_{k}^{u}\in\mathbb{U})\geq 1-\varepsilon can be guaranteed by constructing the constantly tightened input constraint set as

𝒱¯𝕌KZ,k0.\bar{\mathcal{V}}\triangleq\mathbb{U}\ominus KZ,k\in\mathbb{N}_{0}. (27)

Thus, if the nominal input vk𝒱¯v_{k}\in\bar{\mathcal{V}}, then the input chance constraint in (3b) is satisfied according to (25).

Furthermore, to ensure the feasibility, we construct the terminal constraint set as

X¯f{sn:sk𝒞¯,Ksk𝒱¯,k0},\bar{X}_{f}\triangleq\{s\in\mathbb{R}^{n}:s_{k}\in\bar{\mathcal{C}},Ks_{k}\in\bar{\mathcal{V}},k\in\mathbb{N}_{0}\}, (28)

which satisfies the axioms A1 and A2 similar with (19).

On the basis of the constant constraint tightening method and the DTMPC framework in [11], the probabilistic constant tube-based stochastic finite horizon optimal control problem Nctsmpc(xt)\mathbb{P}^{ctsmpc}_{N}(x_{t}) to be solved at each time instant tt is formulated as follows:

mins0|t,vt\displaystyle\min_{s_{0|t},\textbf{{v}}_{t}} k=0N1(sk|tQ2+vk|tR2)+sN|tP2\displaystyle\sum\limits_{k=0}^{N-1}(\|s_{k|t}\|_{Q}^{2}+\|v_{k|t}\|_{R}^{2})+\|s_{N|t}\|_{P}^{2} (29a)
s.t.\displaystyle s.t.\quad sk|t=Ask|t+Bvk|t,\displaystyle s_{k|t}=As_{k|t}+Bv_{k|t}, (29b)
sk|t𝒞¯,k1N1,\displaystyle s_{k|t}\in\bar{\mathcal{C}},k\in\mathbb{N}_{1}^{N-1}, (29c)
vk|t𝒱¯,k0N1,\displaystyle v_{k|t}\in\bar{\mathcal{V}},k\in\mathbb{N}_{0}^{N-1}, (29d)
xts0|tZ,\displaystyle x_{t}-s_{0|t}\in Z, (29e)
sN|t𝒳¯f.\displaystyle s_{N|t}\in\bar{\mathcal{X}}_{f}. (29f)

The optimal control law is designed as in (21). At each time instant, Nctsmpc(xt)\mathbb{P}^{ctsmpc}_{N}(x_{t}) is solved to generate input sequence v(xt)\textbf{{v}}^{*}(x_{t}) and nominal state s0|ts_{0|t}^{*}. The entire process of the probabilistic constant tube-based SMPC is repeated for all t0t\geqslant 0 to yield a receding horizon control strategy.

Using the similar argument as in pTTSMPC, we can prove that the optimal control problem Nctsmpc(xt)\mathbb{P}^{ctsmpc}_{N}(x_{t}) in (29) is probabilistically resolvable with the probability no less than 1ε1-\varepsilon. Suppose that the problem Nctsmpc(xt)\mathbb{P}^{ctsmpc}_{N}(x_{t}) is recursively feasible, then the satisfaction of closed-loop chance constraints can be guaranteed, and ZZ is probabilistically stable for the closed-loop system. Moreover, if the uncertainty has zero mean, the closed-loop system is almost surely asymptotically stable in the mean.

Besides, in order to avoid possible infeasibility when violation occurs, we impose the slack variable λ\lambda on the flexible initial state constraint in (29e), such that

xts0|tλZ,λ1.x_{t}-s_{0|t}\in\lambda Z,\lambda\geq 1. (30)

And a finite penalty term as in (23) is added to the cost function in (29a) to force λ\lambda to tend towards 1. As a result, the probabilistic resolvability of the optimal control problem Nctsmpc(xt)\mathbb{P}^{ctsmpc}_{N}(x_{t}) is enhanced to strong feasibility by using the soft initial state constraint in (30).

Remark 1.

In both the pTTSMPC and the pCTSMPC schemes, all the constraint sets can be calculated offline, therefore, the resulting SMPC algorithms require the comparable online computational load with the DTMPC algorithm.

V Numerical Simulation

In this section, the control effects of the proposed probabilistic tube-based SMPC schemes are first illustrated. Then, the chance constraint violation of the proposed methods are compared. In the simulations, the feasible regions in the figures are caculated by using the algorithm in [42], the implementations of the algorithms and the computations of 𝒞~k\tilde{\mathcal{C}}_{k}, 𝒱~k\tilde{\mathcal{V}}_{k}, 𝒞¯\bar{\mathcal{C}}, 𝒱¯\bar{\mathcal{V}} and ZZ are performed by using the MPT3 toolbox [43].

Consider a discrete-time LTI system in [30] subject to additive uncertainties with unbounded support

xk+1=[10.00750.1430.996]xk+[4.7980.115]uk+wk,\displaystyle x_{k+1}=\begin{bmatrix}1&0.0075\\ -0.143&0.996\end{bmatrix}x_{k}+\begin{bmatrix}4.798\\ 0.115\end{bmatrix}u_{k}+w_{k}, (31)

where 𝕏{x2:[23]x[23]}\mathbb{X}\triangleq\left\{x\in\mathbb{R}^{2}:\begin{bmatrix}-2\\ -3\end{bmatrix}\leq x\leq\begin{bmatrix}2\\ 3\end{bmatrix}\right\}, 𝕌{u:|u|0.2}\mathbb{U}\triangleq\{u\in\mathbb{R}:|u|\leq 0.2\}, and wk𝒩(wk,0.042)w_{k}\sim\mathcal{N}(w_{k},0.04^{2}). The state and input constraints are Pr(xk𝕏)0.8\textbf{Pr}(x_{k}\in\mathbb{X})\geq 0.8 and Pr(uk𝕌)0.8\textbf{Pr}(u_{k}\in\mathbb{U})\geq 0.8, k0k\in\mathbb{N}_{0}. The weights of the cost functions are Q=[10010]Q=\begin{bmatrix}1&0\\ 0&10\end{bmatrix} and R=1R=1, respectively. KK is the LQR feedback gain for the unconstrained optimal problem (A,B,Q,R)(A,B,Q,R). The prediction horizon is N=8N=8, the simulation step is Nsim=15N_{\rm sim}=15 and the initial state x0=[2.5,2.8]Tx_{0}=[2.5,2.8]^{T}. Set the weight γ=100\gamma=100 in the cost function of the strongly feasible SMPC approaches.

The simulation of the pTTSMPC, pTTSMPC with enhanced feasibility (denoted as pTTSMPC-en), pCTSMPC and pCTSMPC with enhanced feasibility (denoted as pCTSMPC-en) for the system in (31) are conducted. The results are as follows.

V-A Control Effects

Figs. 1-4 present the closed-loop state trajectories and the relevant initial tubes at time instant tt, t0Nsim1t\in\mathbb{N}_{0}^{N_{\rm sim}-1}, of the four probabilistic tube-based SMPC schemes, respectively. In the figures, the solid-circle line (red) represents the actual trajectories, while the dash-dot line (blue) is the nominal trajectories. The initial tubes s0|t1εets_{0|t}\oplus\mathcal{E}^{e_{t}}_{1-\varepsilon} and s0|tλ1εets_{0|t}\oplus\lambda\mathcal{E}^{e_{t}}_{1-\varepsilon} at time instant tt are shown as the small hexagons along the nominal trajectories in Fig. 1 and Fig. 2. It can be seen that the tubes are varying over time to guarantee the feasibility. And the tube of the pTTSMPC-en approach is almost the same as that of the pTTSMPC version at the same time instant, which means the slack variable λ\lambda is enforced to tend towards 1 at all the simulation time instants.

Refer to caption
Figure 1: Closed-loop state trajectories of the pTTSMPC.
Refer to caption
Figure 2: Closed-loop state trajectories of the pTTSMPC-en.

Analogously, the initial tubes s0|tZs_{0|t}\oplus Z and s0|tλZs_{0|t}\oplus\lambda Z are shown as the small hexagons along the nominal trajectories in Fig. 3 and Fig. 4, respectively. Here, the tubes of the pCTSMPC approach are constant over time. And the relevant time instant tubes of the pCTSMPC-en approach are almost the same as that of the pCTSMPC version, which means the slack variable λ\lambda is enforced to tend towards 1 at all the simulation time instants.

Refer to caption
Figure 3: Closed-loop state trajectories of the pCTSMPC.
Refer to caption
Figure 4: Closed-loop state trajectories of the pCTSMPC-en.

Furthermore, in order to confirm the feasibility of the proposed probabilistic tube-based SMPC schemes, we conduct Ns=10000N_{\rm s}=10000 simulations based on the same parameter setup. The feasibility ratio is defined as the comparative value of the number of feasibility nfn_{f} to the total simulation number NsN_{\rm s}, i.e.,

rf=nfNs×100%.r_{f}=\frac{n_{f}}{N_{\rm s}}\times 100\%.

Table I illustrates the feasibility ratio of the four approaches. Unexpectedly, the experimental feasibility ratios of all the probabilistic tube-based SMPC algorithms for the system in (31) are 100%, that is to say all the schemes possess good practical feasibility.

TABLE I: Feasibility Ratio of Probabilistic Tube-based SMPC Algorithms with 10000 Realizations
algorithm pTTSMPC pTTSMPC-en pCTSMPC pCTSMPC-en
rfr_{f} 100% 100% 100% 100%

V-B Constraint Violation

To verify the chance constraint satisfaction of the proposed probabilistic tube-based SMPC schemes, we introduce three definitions, namely, the empirical maximum constraint violation ratio, the empirical minimum constraint violation ratio and the empirical average constraint violation ratio. The number of violation at time instant tt over NsN_{\rm s} simulations is denoted as nv(t)n_{v}(t). Then the empirical maximum constraint violation ratio is defined as rvmaxmaxrv(t)r^{max}_{v}\triangleq\max r_{v}(t), t0Nsim1t\in\mathbb{N}_{0}^{N_{\rm sim}-1}, with the constraint violation ratio at time instant tt

rv(t)=nv(t)Ns×100%.r_{v}(t)=\frac{n_{v}(t)}{N_{\rm s}}\times 100\%.

The empirical minimum constraint violation ratio is defined as rvminminrv(t)r^{min}_{v}\triangleq\min r_{v}(t), t16t\in\mathbb{N}_{1}^{6}. And the empirical average constraint violation ratio r¯v\bar{r}_{v} of the proposed algorithms for the system in (31) are calculated as the mean of the constraint violation ratios at the first 6 time instants.

Table II illustrates the empirical average constraint violation ratio r¯v\bar{r}_{v}, the empirical maximum constraint violation ratio rvmaxr^{max}_{v}, the empirical minimum constraint violation ratio rvminr^{min}_{v} and the total time consumption of the probabilistic tube-based SMPC algorithms for Ns=10000N_{\rm s}=10000 simulations with the same parameter setup. From the table we can see that: 1) All the proposed probabilistic tube-based SMPC algorithms can guarantee the chance constraint satisfaction. 2) The ratios r¯v\bar{r}_{v}, rvmaxr^{max}_{v}, rvminr^{min}_{v} of the strongly feasible tube-based SMPC algorithms are smaller than those of the related probabilistically resolvable versions due to the slackness of the state initializations. 3) Because of the conservatism of the constant tightening, the constant tube-based SMPC approaches give smaller r¯v\bar{r}_{v}, rvmaxr^{max}_{v}, rvminr^{min}_{v} than the corresponding time-varying tube-based SMPC versions. 4) In spite of resulting in more conservatism, the constant tube-based SMPC approaches require less time consumptions than the corresponding time-varying tube-based SMPC approaches.

TABLE II: Constraint Violation and Time Consumption of Probabilistic Tube-based SMPC Algorithms with 10000 Realizations
algorithm pTTSMPC pTTSMPC-en pCTSMPC pCTSMPC-en
r¯v\bar{r}_{v} 19.71% 19.58% 18.09% 17.97%
rvmaxr^{max}_{v} 20.88% 20.45% 18.42% 18.08%
rvminr^{min}_{v} 18.66% 18.62% 17.92% 17.77%
time(s) 17042 18453 14853 13449

To clearly illustrate the constraint violation of the four algorithms, Figs. 5-8 report the simulation results of them with 100 realizations, respectively. For each figure, the left part demonstrates the closed-loop trajectories with 15-step simulation, while the right part exhibits the detailed constraint violation. It can be seen that, in addition to the satisfaction of chance constraint, the closed-loop trajectories of the actual state will stay in the mPPI set ultimately with a probability greater than the prescribed 80%.

Refer to caption
Figure 5: Chance constraint violation of pTTSMPC.
Refer to caption
Figure 6: Chance constraint violation of pTTSMPC-en.
Refer to caption
Figure 7: Chance constraint violation of pCTSMPC.
Refer to caption
Figure 8: Chance constraint violation of pCTSMPC-en.

V-C Comparison of pTTSMPC Schemes with Different Initialization Methods

To verify the advantage of the proposed flexible initialization method over the existing ones, we further compare the control performance of pTTSMPC schemes with different initialization methods. That is to say, the optimal control problem in (20) with different initial state constraints will be demonstrated for the system in (31) under the same parameter setup. Before that, we define four initialization methods as follows.

  • Case1:

    The proposed flexible initialization in (20e), namely xts0|t𝒟tx_{t}-s_{0|t}\in\mathcal{D}_{t}.

  • Case2:

    The initial nominal state is set to be the predicted value at the previous time instant, namely s0|t=s1|t1s_{0|t}=s_{1|t-1}.

  • Case3:

    The initialization for the nominal state is set as the predicted value at the first time instant as in [27], i.e., s0|t=st|0s_{0|t}=s_{t|0}.

  • Case4:

    The initialization for the nominal state is chosen from the alternative strategies as in [25, 15, 26]. That is, s0|t=xts_{0|t}=x_{t} whenever feasible and s0|t=s1|t1s_{0|t}=s_{1|t-1} otherwise.

Table III illustrates the empirical average constraint violation ratio r¯v\bar{r}_{v}, the empirical maximum constraint violation ratio rvmaxr^{max}_{v}, the empirical minimum constraint violation ratio rvminr^{min}_{v} and the total time consumption of the four pTTSMPC algorithms with different initialization methods based on Ns=10000N_{\rm s}=10000 simulations. From the table we can see that: 1) Case1 gives the largest empirical constraint violation ratios, which is less conservative than other cases. Meanwhile, the time consumption of Case1 is heavier than those of other cases. 2) For the other three cases, the computational loads are comparable. The violations of Case2 and Case3 are obviously lower than those of Case1 due to the lack of feedback information from the recently measured state. Case4 is the most conservative one in comparison with others and suffers possible infeasibility owing to the fully feedback.

TABLE III: Performance of pTTSMPC Schemes with Different Initialization Methods - 10000 Realizations
algorithm Case1 Case2 Case3 Case4
r¯v\bar{r}_{v} 19.71% 12.70% 6.90% 0.007%
rvmaxr^{max}_{v} 20.88% 16.32% 12.09% 0.02%
rvminr^{min}_{v} 18.66% 0% 0% 0%
time(s) 17042 13670 13677 14041

VI Conclusion

Utilizing the concept of PRS, two probabilistically resolvable tube-based stochastic MPC approaches have been developed for linear constrained system with additive unbounded stochastic uncertainties. The results on the feasibility and closed-loop stability are presented. In comparison with existing result, the proposed method can reduce the conservatism in chance constraints satisfaction. The simulation studies verified the feasibility and advantage of the proposed methods. The future work will be the stochastic MPC for nonlinear systems with unbounded disturbances.

References

  • [1] B. Kouvaritakis and M. Cannon, Model Predictive Control: Classical, Robust and Stochastic. Cham, Switzerland: Springer, 2016.
  • [2] D. Q. Mayne, “Model predictive control: Recent developments and future promise,” Automatica, vol. 50, no. 12, pp. 2967–2986, Dec. 2014.
  • [3] C. Huang, F. Naghdy, and H. Du, “Fault tolerant sliding mode predictive control for uncertain steer-by-wire system,” IEEE Trans. Cybern., vol. 49, no. 1, pp. 261–272, Jan. 2019.
  • [4] B. Karg and S. Lucia, “Efficient representation and approximation of model predictive control laws via deep learning,” IEEE Trans. Cybern., vol. 50, no. 9, pp. 3866–3878, Sep. 2020.
  • [5] L. Dong, J. Yan, X. Yuan, H. He, and C. Sun, “Functional nonlinear model predictive control based on adaptive dynamic programming,” IEEE Trans. Cybern., vol. 49, no. 12, pp. 4206–4218, Dec. 2019.
  • [6] A. Bemporad and M. Morari, “Robust model predictive control: A survey,” Robustness in Identif. Control, vol. 245, London: Springer, pp. 207–226, 1999.
  • [7] H. Li and Y. Shi, Robust Receding Horizon Control for Networked and Distributed Nonlinear Systems. Springer, 2017.
  • [8] C. Liu, J. Gao, H. Li, and D. Xu, “A periodic robust model predictive control for constrained continuous-time nonlinear systems: An event-triggered approach,” IEEE Trans. Cybern., vol. 48, no. 5, pp. 1397–1405, May 2018.
  • [9] L. Chisci, J. A. Rossiter, and G. Zappa, “Systems with persistent disturbances: Predictive control with restricted constraints,” Automatica, vol. 37, no. 7, pp. 1019–1028, Jul. 2001.
  • [10] D. Q. Mayne and W. Langson, “Robustifying model predictive control of constrained linear systems,” Electronics Letters, vol. 37, no. 23, pp. 1422–1423, 2001.
  • [11] D. Q. Mayne, M. M. Seron, and S. V. Raković, “Robust model predictive control of constrained linear systems with bounded disturbances,” Automatica, vol. 41, no. 2, pp. 219–224, Feb. 2005.
  • [12] J. B. Rawlings, D. Q. Mayne, and M. M. Diehl, Model Predictive Control: Theory, Computation, and Design. Santa Barbara, USA: Nob Hill Pub., 2017.
  • [13] A. Mesbah, “Stochastic model predictive control: An overview and perspectives for future research,” IEEE Control Syst. Magazine, vol. 36, no. 6, pp. 30–44, Dec. 2016.
  • [14] M. Korda, R. Gondhalekar, F. Oldewurtel, and C. N. Jones, “Stochastic MPC framework for controlling the average constraint violation,” IEEE Trans. Autom. Control, vol. 59, no. 7, pp. 1706–1721, Jul. 2014.
  • [15] M. Farina, L. Giulioni, L. Magni, and R. Scattolini, “An approach to output-feedback MPC of stochastic linear discrete-time systems,” Automatica, vol. 55, pp. 140–149, May 2015.
  • [16] D. Moser, R. Schmied, H. Waschl, and L. del Re, “Flexible spacing adaptive cruise control using stochastic model predictive control,” IEEE Trans. Control Syst. Tech., vol. 26, no. 1, pp. 114–127, Jan. 2018.
  • [17] G. Schildbach, L. Fagiano, C. Frei, and M. Morari, “The scenario approach for stochastic model predictive control with bounds on closed-loop constraint violations,” Automatica, vol. 50, no. 12, pp. 3009–3018, Dec. 2014.
  • [18] M. Lorenzen, F. Dabbene, R. Tempo, and F. Allgöwer, “Stochastic MPC with offline uncertainty sampling,” Automatica, vol. 81, pp. 176–183, Jul. 2017.
  • [19] J. Fleming and M. Cannon, “Stochastic tube MPC for additive and multiplicative uncertainty using sample approximations,” IEEE Trans. Autom. Control, vol. 64, no. 9, pp. 3883–3888, Sep. 2019.
  • [20] M. Farina, L. Giulioni, and R. Scattolini, “Stochastic linear model predictive control with chance constraints - A review,” J. Process Control, vol. 44, pp. 53–67, Aug. 2016.
  • [21] M. Cannon, B. Kouvaritakis, S. V. Raković, and Q. Cheng, “Stochastic tubes in model predictive control with probabilistic constraints,” IEEE Trans. Autom. Control, vol. 56, no. 1, pp. 194–200, Jan. 2011.
  • [22] B. Kouvaritakis, M. Cannon, S. V. Raković, and Q. Cheng, “Explicit use of probabilistic distributions in linear predictive control,” Automatica, vol. 46, no. 10, pp. 1719–1724, Oct. 2010.
  • [23] M. Lorenzen, F. Dabbene, R. Tempo, and F. Allgöwer, “Constraint tightening and stability in stochastic model predictive control,” IEEE Trans. Autom. Control, vol. 62, no. 7, pp. 3165–3177, Jul. 2017.
  • [24] F. Li, H. Li, and Y. He, “Stochastic model predictive control for linear systems with bounded additive uncertainties,” in Proc. 2020 Chinese Autom. Congress (CAC), Shanghai, China, 2020, in press.
  • [25] M. Farina, L. Giulioni, L. Magni, and R. Scattolini, “A probabilistic approach to model predictive control,” in Proc. IEEE Conf. Decision and Control (CDC), Florence, Italy, 2013, pp. 7734–7739.
  • [26] L. Hewing and M. N. Zeilinger, “Stochastic model predictive control for linear systems using probabilistic reachable sets,” in Proc. IEEE Conf. Decision and Control (CDC), Miami Beach, USA, 2018, pp. 5182–5188.
  • [27] L. Hewing, K. P. Wabersich, and M. N. Zeilinger, “Recursively feasible stochastic model predictive control using indirect feedback,” Automatica, vol. 119, Sep. 2020.
  • [28] S. Yan, P. Goulart, and M. Cannon, “Stochastic model predictive control with discounted probabilistic constraints,” in Proc. European Control Conf. (ECC), Limassol, Cyprus, 2018, pp. 1003–1008.
  • [29] D. Q. Mayne, “Competing methods for robust and stochastic MPC,” IFAC PapersOnLine, vol. 51, no. 20, pp. 169–174, 2018.
  • [30] U. Rosolia, X. Zhang, and F. Borrelli, “A stochastic MPC approach with application to iterative learning,” in Proc. IEEE Conf. Decision and Control (CDC), Miami Beach, USA, 2018, pp. 5152–5157.
  • [31] E. Kofman, J. A. De Doná, and M. M. Seron, “Probabilistic set invariance and ultimate boundedness,” Automatica, vol. 48, no. 10, pp. 2670–2676, Oct. 2012.
  • [32] L. Hewing, A. Carron, K. P. Wabersich, and M. N. Zeilinger, “On a correspondence between probabilistic and robust invariant sets for linear systems,” in Proc. European Control Conf. (ECC), Limassol, Cyprus, 2018, pp. 1642–1647.
  • [33] M. Ono, “Joint chance-constrained model predictive control with probabilistic resolvability,” in Proc. American Control Conf. (ACC), Montréal, Canada, 2012, pp. 435–441.
  • [34] T. Hashimoto, “Probabilistic constrained model predictive control for linear discrete-time systems with additive stochastic disturbances,” in Proc. IEEE Conf. Decision and Control (CDC), Florence, Italy, 2013, pp. 6434–6439.
  • [35] L. Hewing and M. N. Zeilinger, “Scenario-based probabilistic reachable sets for recursively feasible stochastic model predictive control,” IEEE Control Syst. Lett., vol. 4, no. 2, pp. 450–455, Apr. 2020.
  • [36] S. V. Raković, E. C. Kerrigan, K. I. Kouramas, and D. Q. Mayne, “Invariant approximations of the minimal robust positively invariant set,” IEEE Trans. Autom. Control, vol. 50, no. 3, pp. 406–410, Mar. 2005.
  • [37] I. Kolmanovsky and E. G. Gilbert, “Maximal output admissible sets for discrete-time systems with disturbance inputs,” in Proc. American Control Conf. (ACC), Seattle, USA, 1995, vol. 3, pp. 1995–1999.
  • [38] D. Q. Mayne, J. B. Rawlings, C. V. Rao, and P. O. M. Scokaert, “Constrained model predictive control: Stability and optimality,” Automatica, vol. 36, no. 6, pp. 789–814, Jun. 2000.
  • [39] J. A. Paulson, E. A. Buehler, R. D. Braatz, and A. Mesbah, “Stochastic model predictive control with joint chance constraints,” Int. J. of Control, vol. 93, no. 1, pp. 126–139, May 2020.
  • [40] M. Korda, R. Gondhalekar, J. Cigler, and F. Oldewurtel, “Strongly feasible stochastic model predictive control,” in Proc. IEEE Conf. Decision and Control (CDC) and European Control Conf. (ECC), Orlando, FL, USA, 2011, pp. 1245–1251.
  • [41] D. Muñoz-Carpintero, G. Hu, and C. J. Spanos, “Stochastic Model Predictive Control with adaptive constraint tightening for non-conservative chance constraints satisfaction,” Automatica, vol. 96, pp. 32–39, Oct. 2018.
  • [42] F. Blanchini and S. Miani, Set-Theoretic Methods in Control. Basel, Switzerland: Birkhäuser, 2015.
  • [43] M. Herceg, M. Kvasnica, C. N. Jones, and M. Morari, “Multi-Parametric Toolbox 3.0,” in Proc. European Control Conf. (ECC), Zurich, Switzerland, 2013, pp. 502–510.