This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

A Simulation Study on Turnpikes in Stochastic LQ Optimal Control

Ruchuan Ou    Michael Heinrich Baumann    Lars Grüne    Timm Faulwasser Institute for Energy Systems, Energy Efficiency and Energy Economics, TU Dortmund University, Germany (e-mail: [email protected]). Mathematical Institute, Universität Bayreuth, Bayreuth, Germany (e-mail: {\{michael.baumann,lars.gruene}\}@uni-bayreuth.de)
Abstract

This paper presents a simulation study on turnpike phenomena in stochastic optimal control problems. We employ the framework of Polynomial Chaos Expansions (PCE) to investigate the presence of turnpikes in stochastic LQ problems. Our findings indicate that turnpikes can be observed in the evolution of PCE coefficients as well as in the evolution of statistical moments. Moreover, the turnpike phenomenon can be observed for optimal realization trajectories and with respect to the optimal stationary distribution. Finally, while adding variance penalization to the objective alters the turnpike, it does not destroy the phenomenon.

keywords:
Stochastic optimal control, turnpike properties, stochastic uncertainty, polynomial chaos expansions

1 Introduction

The last decade has seen substantial progress in terms of optimal and predictive control. This includes the analysis of stochastic MPC for set-point stabilization and the understanding of deterministic economic MPC schemes, wherein the objective is more general than a penalization of the distance to a given set-point.

A crucial point in the analysis of economic MPC is the interplay of turnpike and dissipativity properties. The former refers to the phenomenon that for many Optimal Control Problems (OCPs), the optimal solutions for varying horizon and varying initial conditions are structurally similar. More precisely, the turnpike phenomenon refers to the fact that in the middle part of the horizons the optimal solutions stay within a neighborhood of the optimal steady state, see (Dorfman et al., 1958; McKenzie, 1976; Carlson et al., 1991) for classical references and (Trélat and Zuazua, 2015; Faulwasser et al., 2020; Damm et al., 2014) for more recent results. It is worth to be noted that the research on turnpike properties of OCPs originated in economics. The analysis of the interplay between turnpike and dissipativity notions of OCPs has been investigated in a number of papers; indeed it can be shown that under mild assumptions the turnpike property is equivalent to a certain strict dissipativity notion (Grüne and Müller, 2016; Faulwasser et al., 2017). Moreover, this close relation can be exploited in the analysis of economic MPC schemes, see (Faulwasser et al., 2018) for a recent overview. However, when it comes to economic MPC under uncertainties much less has been done in terms of analysis—see (Bayer et al., 2016)—despite the fact that in the economics literature a number of investigations on turnpike properties in stochastic problems have been conducted see (Marimon, 1989; Kolokoltsov and Yang, 2012).

Regarding numerical computations with stochastic uncertainties, there has been recent interest into Polynomial Chaos Expansions (PCEs). The core idea of PCE is that a random variable can be modeled as an L2L_{2} function in an appropriate Hilbert space and that in this space a polynomial basis can be used to parametrize the random variable by deterministic coefficients. The idea dates back to Wiener (1938). In recent years, PCE methods have been subject to renewed interest and have been widely investigated for uncertainty quantification (Sullivan, 2015). While in principle the number of terms in the series expansion is infinite, numerical implementation requires truncation. Recently, it was shown that for polynomial explicit mappings the truncation from applying Galerkin projection to the first L+1L+1 basis functions can be characterized in closed form, which enables to quantify the error and to choose sufficiently many basis functions such that the error vanishes (Mühlpfordt et al., 2018). In systems and control, PCE has been used in a number of papers, e.g., in (Paulson et al., 2014; Mesbah and Streif, 2015; Kim et al., 2013). The PCE approach is also considered for uncertainty quantification in electrical power systems (Mühlpfordt et al., 2017) and gas networks (Gerster et al., 2019). A major advantage of the PCE framework is that it allows the consideration of a large class of non-Gaussian random variables with finite variance.

The present note conducts a simulation study on turnpike properties in stochastic Linear-Quadratic (LQ) OCPs. Specifically, we employ the PolyChaos.jl package by Mühlpfordt et al. (2020) to solve example problems. The contribution is to demonstrate that in understanding turnpike properties in stochastic LQ OCPs, one needs to compare the stochastically optimal steady state with the optimal distribution in the middle of the optimization horizon. Moreover, our numerical experiments show that the deterministic PCE coefficients of the state and input variables also exhibit a turnpike phenomenon. Indeed one can also observe the turnpike phenomenon if the disturbance realization sequence is identical for different initial conditions. Finally, we also show that the phenomenon is robust under consideration of variance penalization in the objective. Our results illustrate the prospect of systematically using stochastic turnpikes in the analysis of stochastic MPC schemes.

The remainder of the paper is structured as follows: Section 2 recalls the turnpike phenomenon via an illustrative example and we introduce the considered problem set-up. Moreover, we recall the basics of PCE and how one can avoid PCE truncation errors in the LQ-setting. Section 3 presents several examples of stochastic LQ problems, including uncertainty in the initial condition and additive stochastic disturbances. We also present an example which goes beyond the usual minimization of expected value objective. Finally, Section 4 provides a concise summary.

Notation

Deterministic {\{state, input}\} variables are written as x(k),u(k)x(k),u(k) etc., while their stochastic counterparts are written by X(k),U(k)X(k),U(k). The expected value and variance are denoted as 𝔼\mathbb{E} and 𝕍\mathbb{V}. The deterministic PCE coefficients of random variables X(k)X(k), U(k)U(k) are written as x0\textsf{x}_{0}, x1\textsf{x}_{1}, \dots, respectively, as u0\textsf{u}_{0}, u1\textsf{u}_{1}, \dots. 𝕀[k1,k2]\mathbb{I}_{[k_{1},k_{2}]} denotes the set of positive integers {k1,k1+1,,k2}\{k_{1},k_{1}+1,\dots,k_{2}\}.

2 Problem Set-up

To motivate the considered problem setting and the later stochastic examples, we first consider a motivating deterministic example taken from (Grüne, 2013).

Example 1 (Motivating example)

Consider the following deterministic OCP

minu(),x()k=0N1u2(k)\displaystyle\min_{u(\cdot),x(\cdot)}\sum_{k=0}^{N-1}u^{2}(k)
subject to
x(k+1)\displaystyle x(k+1) =2x(k)+u(k),x(0)=x0\displaystyle=2x(k)+u(k),x(0)=x_{0}
x(k)\displaystyle x(k) [2,2]\displaystyle\in[-2,2]

We solve the problem for an increasing sequence of horizons N=3,6,,24N=3,6,...,24. The results are presented in Fig. 1. It can be seen that the optimal solutions all approach a neighborhood of (0,0)(0,0) and depart from it towards the end of the horizon. This phenomenon is known as turnpike property (McKenzie, 1976; Carlson et al., 1991). Subsequently, we are interested in exploring the turnpike phenomenon in stochastic LQ OCPs.

Refer to caption
Figure 1: Solutions of the motivating example.

2.1 Problem Statement

We consider stochastic LQ OCPs of the following form

minU(),X()\displaystyle\min_{U(\cdot),X(\cdot)} k=0N1(X(k),U(k))\displaystyle~{}\sum_{k=0}^{N-1}\ell(X(k),U(k)) (1a)
subject to
X(k+1)\displaystyle X(k+1) =AX(k)+BU(k)+EW(k),\displaystyle=AX(k)+BU(k)+EW(k), (1b)
X(0)\displaystyle X(0) =X0,\displaystyle=X_{0}, (1c)
[X(k)\displaystyle\mathbb{P}[X(k) 𝕏]εx,k𝕀[0,N],\displaystyle\in\mathbb{X}]\leq\varepsilon_{x},k\in\mathbb{I}_{[0,N]}, (1d)
[U(k)\displaystyle\mathbb{P}[U(k) 𝕌]εu,k𝕀[0,N1],\displaystyle\in\mathbb{U}]\leq\varepsilon_{u},k\in\mathbb{I}_{[0,N-1]}, (1e)

whereby at each time step kk, X(k)L2(Ω,k,μ;nx)X(k)\in L^{2}(\Omega,\mathcal{F}_{k},\mu;\mathbb{R}^{n_{x}}) is an i.i.d.i.i.d. random variable on the underlying filtered probability space (Ω,,(k)k=0,,N,μ)(\Omega,\mathcal{F},(\mathcal{F}_{k})_{k=0,\ldots,N},\mu), where Ω\Omega is the set of realizations, \mathcal{F} is a σ\sigma-algebra, (k)k=0,,N(\mathcal{F}_{k})_{k=0,\ldots,N} is a filtration, and μ\mu is the probability measure.

Note that the σ\sigma-algebras k\mathcal{F}_{k} are related by the time evolution of the information, i.e.

01N.\mathcal{F}_{0}\subseteq\mathcal{F}_{1}\subseteq\ldots\subseteq\mathcal{F}_{N}\subseteq\mathcal{F}.

We choose (k)k=0,,N(\mathcal{F}_{k})_{k=0,\ldots,N} as the smallest filtration such that XX is an adapted process (which results form the evolution of the dynamics (1b)), i.e.

k=σ(X(0),,X(k)),k=0,1,.\mathcal{F}_{k}=\sigma(X(0),\dots,X(k)),\quad k=0,1,\dots.

Then, the control at time kk is modeled as a stochastic process which is adapted to the filtration (k)k(\mathcal{F}_{k})_{k}, i.e. U(k)L2(Ω,k,μ;nu)U(k)\in L^{2}(\Omega,\mathcal{F}_{k},\mu;\mathbb{R}^{n_{u}}).

The concept of stochastic filtrations can be understood as a stochastic causality requirement, i.e. the the stochastic input process U(k)U(k) at time step kk may only depend on the realization of the random variables X(0),,X(k)X(0),\dots,X(k) up to time step kk. Note that the influence of the noise, which is also an adapted stochastic process, is handled implicitly via the state recursion. That is, U(k)U(k) depends on X(0),,X(k)X(0),\dots,X(k) and thus also on W(0),,W(k1)W(0),\dots,W(k-1). For details on stochastic filtrations we refer to (Fristedt and Gray, 2013).

The stage cost is given by

:L2(Ω,,μ;nx)×L2(Ω,,μ;nu).\ell:L^{2}(\Omega,\mathcal{F},\mu;\mathbb{R}^{n_{x}})\times L^{2}(\Omega,\mathcal{F},\mu;\mathbb{R}^{n_{u}})\to\mathbb{R}.

Indeed typical choice for \ell are a combination of expected value and variance of some underlying deterministic stage cost function. The considered dynamics are a time-discrete stochastic system subject to noise W()W(\cdot) modelled as a stochastic process. The constraints are written as chance constraints for states and inputs, whereby the underlying sets 𝕏\mathbb{X} and 𝕌\mathbb{U} are assumed to be closed. Moreover, εx\varepsilon_{x} and εu\varepsilon_{u} specify the probabilities with which the chance constraints shall be satisfied.

2.2 Basics of Polynomial Chaos Expansion

In order to obtain a tractable reformulation of the stochastic LQ OCP (1) we consider the framework of Polynomial Chaos Expansions (PCE). For an in-depth introduction we refer to (Sullivan, 2015). The underlying idea of PCE is that random variables from some L2(Ω,,μ;m)L^{2}(\Omega,\mathcal{F},\mu;\mathbb{R}^{m}) with m{nx,nu,nw}m\in\{n_{x},n_{u},n_{w}\} can be described using an appropriate basis. To this end, we consider an orthogonal polynomial basis {ϕi}i=0\{\phi_{i}\}_{i=0}^{\infty} which spans L2(Ω,,μ;m)L^{2}(\Omega,\mathcal{F},\mu;\mathbb{R}^{m}).

Definition 1 (Polynomial chaos expansion)

The polynomial chaos expansion of a real-valued random variable XL2(Ω,,μ;nx)X\in L^{2}(\Omega,\mathcal{F},\mu;\mathbb{R}^{n_{x}}) is

X=i=0xiϕi,withxi=X,ϕiϕi,ϕi,X=\sum_{i=0}^{\infty}\textsf{x}_{i}\phi_{i},\quad\text{with}\quad\textsf{x}_{i}=\frac{\langle X,\phi_{i}\rangle}{\langle\phi_{i},\phi_{i}\rangle}, (2)

where xi\textsf{x}_{i}\in\mathbb{R} is called the iith coefficient and

ϕi,ϕjΩϕi(ω)ϕj(ω)dμ(ω),\langle\phi_{i},\phi_{j}\rangle\doteq\int_{\Omega}\phi_{i}(\omega)\phi_{j}(\omega)\mathrm{d}\mu(\omega),

see (Sullivan, 2015; Xiu and Karniadakis, 2002).

In order to obtain a computationally tractable formulation, one has to truncate the PCE series after L+1L+1 terms

Xi=0Lxiϕi.{X}\approx\sum_{i=0}^{L}\textsf{x}_{i}\phi_{i}. (3)

The choice of the basis polynomials can be inferred via the Wiener-Askey scheme (Sullivan, 2015). For example, in case of a standard Gaussian random variable one would consider Hermite polynomials as they allow modeling the Gaussian with the first two terms of the PCE series. Moreover, in case of explicit polynomial maps—e.g., consider state transitions X(k)=AkX(0)X(k)=A^{k}X(0)—of finite degree, one can quantify the truncation errors arising from considering only the first L+1L+1 PCE coefficients. One may also infer LL\in\mathbb{N} such that no truncation error arises. We refer to Mühlpfordt et al. (2018) for details.

Finally, we remark that whenever the truncated series representation (3) is exact, the first two moments of a random variable XX can be computed in terms of PCE coefficients as follows

𝔼[X]\displaystyle\mathbb{E}[X] =x0 and 𝕍[X]=i=1Lxi2ϕi,ϕi.\displaystyle=\textsf{x}_{0}\text{ and }\mathbb{V}[X]={\sum_{i=1}^{L}}\textsf{x}_{i}^{2}\langle\phi_{i},\phi_{i}\rangle.

3 Simulation Study

We consider numerical examples that show the turnpike phenomena in the stochastic setting: The first example is an extension of the motivating example but with uncertain initial condition and additive Gaussian noise. The second example considers a linearized chemical reactor subject to non-Gaussian noise. The third example extends the second one via variance penalization in the objective.

3.1 Scalar Dynamics with Noise

Consider the stochastic variant of the motivating example

X(k+1)\displaystyle X(k+1) =2X(k)+U(k)+W(k),X(0)=X0,\displaystyle=2X(k)+U(k)+W(k),\quad X(0)=X_{0}, (4a)

where X0X_{0} denotes the initial random variable with known probability distribution pX0p_{X_{0}}, W(k)W(k) denotes system noise modeled as a white Gaussian noise such that all W(k)W(k) have an identical known probability distribution pWp_{W}. We arrive at the following stochastic LQ OCP

minU(),X()k=0N1𝔼[U2(k)]\displaystyle\min_{U(\cdot),X(\cdot)}\quad\sum_{k=0}^{N-1}\mathbb{E}[U^{2}(k)] (5a)
s.t. (4),k𝕀[0,N1]\displaystyle(\ref{equ:example1_dynamic}),\;\forall k\in\mathbb{I}_{[0,N-1]} (5b)
[xminX(k)xmax]εx,k𝕀[0,N]\displaystyle{\mathbb{P}\left[{x}_{min}\leq X(k)\leq{x}_{max}\right]\geq\varepsilon_{x},\;\forall k\in\mathbb{I}_{[0,N]}} (5c)
X0pX0,W(k)pW,k𝕀[0,N1]\displaystyle X_{0}\sim p_{X_{0}},\quad W(k)\sim p_{W},\;\forall k\in\mathbb{I}_{[0,N-1]} (5d)

where xmin=2{x}_{min}=-2 and xmax=2{x}_{max}=2. We approximate the chance constraint as

xmin𝔼[X(k)]±λ(εx)𝕍[X(k)]xmax{x}_{min}\leq\mathbb{E}[X(k)]\pm\lambda(\varepsilon_{x})\sqrt{\mathbb{V}[X(k)]}\leq{x}_{max} (6)

with λ=(1+εx)/(1εx)\lambda=\sqrt{(1+\varepsilon_{x})/(1-\varepsilon_{x})}, for a derivation see (Farina et al., 2013).

We consider X0X_{0} to follow a uniform distribution with the support [x0min,x0max]=[0.6,1.4][{x}_{0min},{x}_{0max}]=[0.6,1.4]. The noise at time k𝕀[0,N1]k\in\mathbb{I}_{[0,N-1]}, W(k)W(k), is a Gaussian distribution with mean 𝔼[W(k)]=0\mathbb{E}[W(k)]=0 and variance 𝕍[W(k)]=0.52\mathbb{V}[W(k)]=0.5^{2}. Additionally, εx\varepsilon_{x} is set to 0.8 and thus we have λ=3\lambda=3.

Without the noise, considering a first-order PCE (L=1L=1) for X(k)X(k) and U(k)U(k) with identical basis functions, exactness of the PCE representation of X(k+1)X(k+1) is guaranteed since the system dynamic is linear, see Mühlpfordt et al. (2018). Therefore, including noise, the PCE dimension needed for an exact representation is determined by the horizon NN. More precisely, L=N+1L=N+1, where two PCE terms are induced by the uncertainty of the initial condition and the rest is caused by the noise. The PCE basis and coefficients read

X(k)=i=01xi(k)ϕi+i=0N1xiw(k)ϕiw\displaystyle X(k)=\sum_{i=0}^{1}\textsf{x}_{i}(k)\phi_{i}+\sum_{i=0}^{N-1}\textsf{x}_{i}^{w}(k)\phi_{i}^{w} (7a)
U(k)=i=01ui(k)ϕi+i=0N1uiw(k)ϕiw\displaystyle U(k)=\sum_{i=0}^{1}\textsf{u}_{i}(k)\phi_{i}+\sum_{i=0}^{N-1}\textsf{u}_{i}^{w}(k)\phi_{i}^{w} (7b)
W(k)=i=0N1wiw(k)ϕiw\displaystyle W(k)=\sum_{i=0}^{N-1}\textsf{w}_{i}^{w}(k)\phi_{i}^{w} (7c)
X0=i=01x0iϕi\displaystyle X_{0}=\sum_{i=0}^{1}\textsf{x}_{0i}\phi_{i} (7d)
with
ϕ0=1,ϕ1=ξ0.5,ϕiw=θi,i𝕀[0,N1]\displaystyle\phi_{0}=1,\;\phi_{1}=\xi-0.5,\;\phi_{i}^{w}=\theta_{i},i\in\mathbb{I}_{[0,N-1]} (7e)
wkw(k)=𝕍[W(k)],wiw(k)=0,i𝕀[0,N1]{k}\displaystyle\textsf{w}_{k}^{w}(k)=\sqrt{\mathbb{V}[W(k)]},\;\textsf{w}_{i}^{w}(k)=0,i\in\mathbb{I}_{[0,N-1]}\setminus\{k\} (7f)

where ξ\xi is a standard uniformly distributed random variable while θi\theta_{i} are independent Gaussian random variables. Additionally, x00=(x0min+x0max)/2\textsf{x}_{00}=({x}_{0min}+{x}_{0max})/2 and x01=x0maxx0min\textsf{x}_{01}={x}_{0max}-{x}_{0min}, since X0X_{0} is uniformly distributed.

Expressing Stochastic Filtrations in the PCE Framework

Stochastic modeling via adapted filtrations expresses the idea that the noise W(k)W(k) at time kk influences the next state X(k+1)X(k+1) and input U(k+1)U(k+1) and the subsequent time instances i>ki>k but not at iki\leq k. In terms of PCE representation this implies that

[Z(0)Z(1)Z(2)Z(M)]=[z0(0)z1(0)z0(1)z1(1)z0(2)z1(2)z0(M)z1(M)][ϕ0ϕ1]+Zw[ϕ0wϕ1wϕ2wϕM1w]\begin{bmatrix}Z(0)\\ Z(1)\\ Z(2)\\ \vdots\\ Z(M)\end{bmatrix}=\begin{bmatrix}\textsf{z}_{0}(0)&\textsf{z}_{1}(0)\\ \textsf{z}_{0}(1)&\textsf{z}_{1}(1)\\ \textsf{z}_{0}(2)&\textsf{z}_{1}(2)\\ \vdots&\vdots\\ \textsf{z}_{0}(M)&\textsf{z}_{1}(M)\end{bmatrix}\begin{bmatrix}\phi_{0}\\ \phi_{1}\end{bmatrix}+\textsf{Z}^{w}\begin{bmatrix}\phi_{0}^{w}\\ \phi_{1}^{w}\\ \phi_{2}^{w}\\ \vdots\\ \phi_{M-1}^{w}\end{bmatrix} (8)

with

Zw[000z0w(1)00z0w(2)z1w(2)0z0w(M)z1w(M)zM1w(M)](M+1)×M,\textsf{Z}^{w}\doteq\begin{bmatrix}0&0&\cdots&0\\ \textsf{z}_{0}^{w}(1)&0&\cdots&0\\ \textsf{z}_{0}^{w}(2)&\textsf{z}_{1}^{w}(2)&\cdots&0\\ \vdots&\vdots&\ddots&\vdots\\ \textsf{z}_{0}^{w}(M)&\textsf{z}_{1}^{w}(M)&\cdots\ &\textsf{z}_{M-1}^{w}(M)\end{bmatrix}\in\mathbb{R}^{(M+1)\times M},

and where, respectively, Z{X,U}Z\in\{X,U\} and z{x,u}\textsf{z}\in\{\textsf{x},\textsf{u}\} are placeholders and M=NM=N for XX and M=N1M=N-1 for UU.

Results

To illustrate turnpike behavior, we solve the stochastic OCP (5) as given above over different optimization horizons N=3,6,,24N=3,6,...,24 using PolyChaos.jl (Mühlpfordt et al., 2020). Fig. 2 shows the trajectories of the optimal solutions for a total of 16 realizations of the uncertainties. At first glance, the realizations of the solutions are noisy and appear not to exhibit the turnpike property.

Hence, in order to uncover the turnpike, we consider the trajectories of the PCE coefficients. Note that each PCE basis ϕiw\phi_{i}^{w} induced by the noise is a standard Gaussian distributed random variable, the sum i=0k1ziw(k)ϕiw,z{x,u}\sum_{i=0}^{k-1}\textsf{z}_{i}^{w}(k)\phi_{i}^{w},\textsf{z}\in\{\textsf{x},\textsf{u}\} is equal to a new Gaussian distributed random variable with zero mean and variance. Hence, for the sake of simplified illustration, we consider

zΣw(k)ϕΣw=i=0k1ziw(k)ϕiw\textsf{z}_{\Sigma}^{w}(k)\phi_{\Sigma}^{w}=\sum\nolimits_{i=0}^{k-1}\textsf{z}_{i}^{w}(k)\phi_{i}^{w} (9a)
with
ϕΣw=θΣ,zΣw(k)=i=0k1(ziw(k))2.\phi_{\Sigma}^{w}=\theta_{\Sigma},\quad\textsf{z}_{\Sigma}^{w}(k)=\sqrt{\sum\nolimits_{i=0}^{k-1}(\textsf{z}_{i}^{w}(k))^{2}}. (9b)
Refer to caption
Figure 2: 16 sample realization trajectories of optimal solutions to OCP (8).
Refer to caption
Figure 3: PCE coefficients solutions to OCP (8).

Here ϕΣ\phi_{\Sigma} is a random variable with standard Gaussian distribution (zero mean). Therefore, instead of ziw(k),i𝕀[0,N1]\textsf{z}_{i}^{w}(k),i\in\mathbb{I}_{[0,N-1]}, only one PCE coefficient zΣw(k)\textsf{z}_{\Sigma}^{w}(k) suffices to represent the uncertainty caused by noise. Note that this transformation is used only for illustration priposes and not in the underlying computation.

Using the PCE reformulation detailed above, Fig. 3 illustrates that the turnpike phenomenon occurs in terms of PCE coefficients. Actually, the turnpike property of PCE coefficients suggests that the optimal steady-state is a random variable with stationary distribution. This distribution can be calculated from the optimal steady-state problem formulated via PCE. Doing so, i.e. solving

minX¯,U¯\displaystyle\min_{\bar{X},\bar{U}} U¯2\displaystyle~{}\bar{U}^{2}
subject to
X¯\displaystyle\bar{X} =2X¯+U¯+W¯\displaystyle=2\bar{X}+\bar{U}+\bar{W}
xmin\displaystyle{x}_{min} 𝔼[X¯]±λ(εx)𝕍[X¯]xmax,\displaystyle\leq\mathbb{E}[\bar{X}]\pm\lambda(\varepsilon_{x})\sqrt{\mathbb{V}[\bar{X}]}\leq{x}_{max},

we obtain the stationary distribution depicted in Fig. 4. This figure also shows the histogram at k=25k=25 obtained from sampling 10410^{4} realizations of the uncertainty from the optimal PCE solution for N=50N=50. As one can see, the behavior in the middle of the horizon corresponds to the solution obtained for the steady state problem.

The time evolution of the state histograms and the distributions obainted via PCE is shown in Fig. 5 for k=0,10,20,30,40,50k=0,10,20,30,40,50. It is not surprising that the histograms follow the calculated PDF perfectly and the state X(k)X(k) keeps the same distribution in the middle of trajectory. This illustrates the turnpike phenomenon in the distributions.

Refer to caption
Figure 4: Comparison between PDF of the optimal steady state and the histogram of state trajectories at k=25k=25.
Refer to caption
Figure 5: Comparison of PDF and histogram of 10410^{4} samples for OCP (8) with N=50N=50.

3.2 Stochastic LQ OCP for a CSTR

As a second example, we consider a linearized CSTR reactor which appeared in several papers such as (Zanon and Faulwasser, 2018). The expected value quadratic stage cost and the linear discrete-time system with noise read

(X,U)\displaystyle\ell(X,U) =𝔼[XA+0.1U2+24XB0.5U]\displaystyle=\mathbb{E}[X_{A}+0.1U^{2}+24X_{B}-0.5U] (10a)
f(X,U)\displaystyle f(X,U) =[0.7600.120.88][XAXB]+[0.0050.005]U+[WAWB]\displaystyle=\begin{bmatrix}0.76&0\\ 0.12&0.88\end{bmatrix}\begin{bmatrix}X_{A}\\ X_{B}\end{bmatrix}+\begin{bmatrix}\phantom{-}0.005\\ -0.005\end{bmatrix}U+\begin{bmatrix}W_{A}\\ W_{B}\end{bmatrix} (10b)

where WA,WBW_{A},W_{B} are modeled as independent uniformly distributed random variables with support [0.1,0.1][-0.1,0.1]. The initial state X(0)X(0) is a Gaussian distributed random vector with known mean (0.5,0.8)(0.5,0.8) and variance (0.052,0.082)(0.05^{2},0.08^{2}). We solve the stochastic OCP via PCE over different horizons N=10,20,,80N=10,20,...,80. We obtain the trajectories of the optimal solutions for a total 1616 realization sequences of the uncertainties, see Fig. 6. As the dimension of PCE is quite large, we plot the first two moments instead of PCE coefficients, i.e. mean and the variance of state and input random variables, see Fig. 7. Similar to the previous example, the trajectories of mean and variance exhibit the turnpike property. Similar to before, Fig. 8 depicts the histograms of the state XAX_{A} at k=0,10,20,30,40,50k=0,10,20,30,40,50 for the considered realizations and the PDF obtained via PCE, see Fig. 8.

Refer to caption
Figure 6: Sample trajectories for the OCP with (10).
Refer to caption
Figure 7: Mean and variance obtained from the PCE coefficients for the OCP with (10).
Refer to caption
Figure 8: Comparison of PDF and histogram of 10410^{4} samples for the OCP with (10) N=50N=50.

As an additional means of assessing the turnpike phenomenon via simulation, we consider the following numerical setting:

  • Compute a random realization of the disturbance W(k)W(k), for k𝕀[0,M]k\in\mathbb{I}_{[0,M]} denoted as {W(k;ωk)}\{W(k;\omega_{k})\}

  • Pick horizons N1,,NsMN_{1},...,N_{s}\leq M and corresponding realizations of the initial condition X0(ω1),,X0(ωs)X_{0}(\omega_{1}),\dots,X_{0}(\omega_{s}).

  • For all horizons N1,,NsN_{1},\dots,N_{s} and the initial condition X0(ω1),,X0(ωs)X_{0}(\omega_{1}),\dots,X_{0}(\omega_{s}), simulate the response of the dynamics under the optimal input policy, while the disturbance sequence is fixed to {W(k;ωk)}\{W(k;\omega_{k})\} (or a subpart thereof).

The results of this numerical experiment are depicted in Figure 9.

Refer to caption
Figure 9: System response and corresponding input for the OCP with (10): different initial conditions and horizons with one identical noise realization.

As one can see, all the trajectories approach the same solution after some time. One can understand this solution as a time-varying path of a stationary turnpike solution, whose shape is governed by the considered disturbance sequence. Observe the difference to Figure 6, wherein for each trajectory a different disturbance realization sequence is considered.

3.3 Stochastic LQ OCP with Variance Penalization

What could we do if we want to get a optimal steady-state with a narrow distribution, or in other words, with small variance? Involving variance penalization in the objective function is one option. We consider the previous example augmented with a penalty of the variance of the state XAX_{A} in the stage cost

(X,U)=𝔼[XA+0.1U2+24XB0.5U]+γ𝕍[XA]\ell(X,U)=\mathbb{E}[X_{A}+0.1U^{2}+24X_{B}-0.5U]+\gamma\mathbb{V}[X_{A}] (11)

Here we choose γ=104\gamma=10^{4} and solve the stochastic OCP via PCE over different horizons N=10,20,,80N=10,20,...,80. The trajectories of the optimal solutions for a total 1616 realization sequences of the uncertainties are shown in Fig. 10. Observe that the state xAx_{A} is much less effected by the noise. It can also be seen in Fig. 11 that the variance of state XAX_{A} is smaller than in Example 2, while the variance of state XBX_{B} and the variance of the input UU increase. Fig. 12 also shows that the optimal steady state XAX_{A} has a narrower distribution.

Refer to caption
Figure 10: Sample trajectories for the OCP with (11).
Refer to caption
Figure 11: Mean and variance obtained from the PCE coefficients for the OCP with (11).
Refer to caption
Figure 12: Comparison of PDF and histogram of 10410^{4} samples for the OCP with (11) N=50N=50.

4 Summary

This paper has conducted a simulation study on turnpike properties in stochastic OCPs. It has presented three examples of stochastic LQ OCPs all of which exhibit the turnpike phenomenon. Indeed, the examples demonstrate that turnpike phenomena can be observed in different contexts:

  • in terms for statistical moments (or PCE coefficients which can be mapped to moments),

  • in terms of probability distributions of state and input vairables staying close to their optimal stationary distributions, and

  • in terms of the realization trajectories staying close to an orbit defined by the noise realization.

Moreover, our simulation study demonstrates that beyond the usual minimization of expected values, the turnpike phenomenon is also present in combination of expected value and min-variance objectives.

While this note did merely present simulation results, there is a clear prospect of extending the established notations of turnpike properties to stochastic OCPs and corresponding stochastic MPC formulations. Yet, at this stage, there is also an evident need for analytic results to understand the turnpike phenomenon in stochastic settings.

References

  • Bayer et al. (2016) Bayer, F., Lorenzen, M., Müller, M., and Allgöwer, F. (2016). Robust economic model predictive control using stochastic information. Automatica, 74, 151–161.
  • Carlson et al. (1991) Carlson, D., Haurie, A., and Leizarowitz, A. (1991). Infinite Horizon Optimal Control: Deterministic and Stochastic Systems. Springer Verlag.
  • Damm et al. (2014) Damm, T., Grüne, L., Stieler, M., and Worthmann, K. (2014). An exponential turnpike theorem for dissipative optimal control problems. SIAM Journal on Control and Optimization, 52(3), 1935–1957.
  • Dorfman et al. (1958) Dorfman, R., Samuelson, P., and Solow, R. (1958). Linear Programming and Economic Analysis. McGraw-Hill, New York.
  • Farina et al. (2013) Farina, M., Giulioni, L., Magni, L., and Scattolini, R. (2013). A probabilistic approach to model predictive control. In 52nd IEEE Conference on Decision and Control, 7734–7739. IEEE.
  • Faulwasser et al. (2020) Faulwasser, T., Grüne, L., Humaloja, J.P., and Schaller, M. (2020). The interval turnpike property for adjoints. Pure and Applied Functional Analysis. Accepted.
  • Faulwasser et al. (2018) Faulwasser, T., Grüne, L., and Müller, M. (2018). Economic nonlinear model predictive control: Stability, optimality and performance. Foundations and Trends in Systems and Control, 5(1), 1–98. 10.1561/2600000014.
  • Faulwasser et al. (2017) Faulwasser, T., Korda, M., Jones, C., and Bonvin, D. (2017). On turnpike and dissipativity properties of continuous-time optimal control problems. Automatica, 81, 297–304. 10.1016/j.automatica.2017.03.012.
  • Fristedt and Gray (2013) Fristedt, B. and Gray, L. (2013). A modern approach to probability theory. Springer Science & Business Media.
  • Gerster et al. (2019) Gerster, S., Herty, M., Chertkov, M., Vuffray, M., and Zlotnik, A. (2019). Polynomial chaos approach to describe the propagation of uncertainties through gas networks. In Progress in Industrial Mathematics at ECMI 2018, 59–65. Springer.
  • Grüne (2013) Grüne, L. (2013). Economic receding horizon control without terminal constraints. Automatica, 49(3), 725–734.
  • Grüne and Müller (2016) Grüne, L. and Müller, M. (2016). On the relation between strict dissipativity and turnpike properties. Sys. Contr. Lett., 90, 45 – 53.
  • Kim et al. (2013) Kim, K., Shen, D., Nagy, Z., and Braatz, R. (2013). Wiener’s polynomial chaos for the analysis and control of nonlinear dynamical systems with probabilistic uncertainties. IEEE Control Systems, 33(5), 58–67.
  • Kolokoltsov and Yang (2012) Kolokoltsov, V. and Yang, W. (2012). Turnpike theorems for Markov games. Dyn. Games Appl., 2(3), 294–312.
  • Marimon (1989) Marimon, R. (1989). Stochastic turnpike property and stationary equilibrium. J. Econom. Theory, 47(2), 282–306.
  • McKenzie (1976) McKenzie, L. (1976). Turnpike theory. Econometrica: Journal of the Econometric Society, 44(5), 841–865.
  • Mesbah and Streif (2015) Mesbah, A. and Streif, S. (2015). A probabilistic approach to robust optimal experiment design with chance constraints. IFAC-PapersOnLine, 48(8), 100–105.
  • Mühlpfordt et al. (2017) Mühlpfordt, T., Faulwasser, T., Roald, L., and Hagenmeyer, V. (2017). Solving optimal power flow with non-gaussian uncertainties via polynomial chaos expansion. In Proc. of 56th IEEE Conference on Decision and Control, 4490–4496. Melbourne, Australia. 10.1109/CDC.2017.8264321.
  • Mühlpfordt et al. (2018) Mühlpfordt, T., Findeisen, R., Hagenmeyer, V., and Faulwasser, T. (2018). Comments on quantifying truncation errors for polynomial chaos expansions. IEEE Control Systems Letters, 2(1), 169–174. 10.1109/LCSYS.2017.2778138.
  • Mühlpfordt et al. (2020) Mühlpfordt, T., Zahn, F., Hagenmeyer, V., and Faulwasser, T. (2020). PolyChaos.jl – a julia package for polynomial chaos in systems and control. In Proceedings of the 21. IFAC World Congress.
  • Paulson et al. (2014) Paulson, J., Mesbah, A., Streif, S., Findeisen, R., and Braatz, R. (2014). Fast stochastic model predictive control of high-dimensional systems. In 53rd IEEE Conference on decision and Control, 2802–2809. IEEE.
  • Sullivan (2015) Sullivan, T.J. (2015). Introduction to uncertainty quantification, volume 63. Springer.
  • Trélat and Zuazua (2015) Trélat, E. and Zuazua, E. (2015). The turnpike property in finite-dimensional nonlinear optimal control. Journal of Differential Equations, 258(1), 81–114.
  • Wiener (1938) Wiener, N. (1938). The homogeneous chaos. American Journal of Mathematics, 897–936.
  • Xiu and Karniadakis (2002) Xiu, D. and Karniadakis, G.E. (2002). The Wiener-Askey polynomial chaos for stochastic differential equations. SIAM Journal on Scientific Computing, 24(2), 619–644.
  • Zanon and Faulwasser (2018) Zanon, M. and Faulwasser, T. (2018). Economic MPC without terminal constraints: Gradient-correcting end penalties enforce stability. Journal of Process Control, 63, 1–14. 10.1016/j.jprocont.2017.12.005.