A Simulation Study on Turnpikes in Stochastic LQ Optimal Control
Abstract
This paper presents a simulation study on turnpike phenomena in stochastic optimal control problems. We employ the framework of Polynomial Chaos Expansions (PCE) to investigate the presence of turnpikes in stochastic LQ problems. Our findings indicate that turnpikes can be observed in the evolution of PCE coefficients as well as in the evolution of statistical moments. Moreover, the turnpike phenomenon can be observed for optimal realization trajectories and with respect to the optimal stationary distribution. Finally, while adding variance penalization to the objective alters the turnpike, it does not destroy the phenomenon.
keywords:
Stochastic optimal control, turnpike properties, stochastic uncertainty, polynomial chaos expansions
1 Introduction
The last decade has seen substantial progress in terms of optimal and predictive control. This includes the analysis of stochastic MPC for set-point stabilization and the understanding of deterministic economic MPC schemes, wherein the objective is more general than a penalization of the distance to a given set-point.
A crucial point in the analysis of economic MPC is the interplay of turnpike and dissipativity properties. The former refers to the phenomenon that for many Optimal Control Problems (OCPs), the optimal solutions for varying horizons and varying initial conditions are structurally similar. More precisely, the turnpike phenomenon refers to the fact that in the middle part of the horizon the optimal solutions stay within a neighborhood of the optimal steady state, see (Dorfman et al., 1958; McKenzie, 1976; Carlson et al., 1991) for classical references and (Trélat and Zuazua, 2015; Faulwasser et al., 2020; Damm et al., 2014) for more recent results. It is worth noting that the research on turnpike properties of OCPs originated in economics. The interplay between turnpike and dissipativity notions of OCPs has been investigated in a number of papers; indeed, it can be shown that under mild assumptions the turnpike property is equivalent to a certain strict dissipativity notion (Grüne and Müller, 2016; Faulwasser et al., 2017). Moreover, this close relation can be exploited in the analysis of economic MPC schemes, see (Faulwasser et al., 2018) for a recent overview. However, when it comes to economic MPC under uncertainties, much less has been done in terms of analysis, see (Bayer et al., 2016), despite the fact that a number of investigations on turnpike properties in stochastic problems have been conducted in the economics literature, see (Marimon, 1989; Kolokoltsov and Yang, 2012).
Regarding numerical computations with stochastic uncertainties, there has been recent interest in Polynomial Chaos Expansions (PCEs). The core idea of PCE is that a random variable can be modeled as a function in an appropriate Hilbert space and that in this space a polynomial basis can be used to parametrize the random variable by deterministic coefficients. The idea dates back to Wiener (1938). In recent years, PCE methods have been subject to renewed interest and have been widely investigated for uncertainty quantification (Sullivan, 2015). While in principle the number of terms in the series expansion is infinite, numerical implementation requires truncation. Recently, it was shown that for explicit polynomial mappings the truncation error from applying Galerkin projection to the first basis functions can be characterized in closed form, which makes it possible to quantify the error and to choose sufficiently many basis functions such that the error vanishes (Mühlpfordt et al., 2018). In systems and control, PCE has been used in a number of papers, e.g., in (Paulson et al., 2014; Mesbah and Streif, 2015; Kim et al., 2013). The PCE approach is also considered for uncertainty quantification in electrical power systems (Mühlpfordt et al., 2017) and gas networks (Gerster et al., 2019). A major advantage of the PCE framework is that it allows the consideration of a large class of non-Gaussian random variables with finite variance.
The present note conducts a simulation study on turnpike properties in stochastic Linear-Quadratic (LQ) OCPs. Specifically, we employ the PolyChaos.jl package by Mühlpfordt et al. (2020) to solve example problems. The contribution is to demonstrate that in understanding turnpike properties in stochastic LQ OCPs, one needs to compare the stochastically optimal steady state with the optimal distribution in the middle of the optimization horizon. Moreover, our numerical experiments show that the deterministic PCE coefficients of the state and input variables also exhibit a turnpike phenomenon. Indeed one can also observe the turnpike phenomenon if the disturbance realization sequence is identical for different initial conditions. Finally, we also show that the phenomenon is robust under consideration of variance penalization in the objective. Our results illustrate the prospect of systematically using stochastic turnpikes in the analysis of stochastic MPC schemes.
The remainder of the paper is structured as follows: Section 2 recalls the turnpike phenomenon via an illustrative example and introduces the considered problem set-up. Moreover, it recalls the basics of PCE and how one can avoid PCE truncation errors in the LQ setting. Section 3 presents several examples of stochastic LQ problems, including uncertainty in the initial condition and additive stochastic disturbances. We also present an example which goes beyond the usual minimization of an expected-value objective. Finally, Section 4 provides a concise summary.
Notation
Deterministic state, input variables are written as etc., while their stochastic counterparts are written by . The expected value and variance are denoted as and . The deterministic PCE coefficients of random variables , are written as , , , respectively, as , , . denotes the set of positive integers .
2 Problem Set-up
To motivate the considered problem setting and the later stochastic examples, we first consider a deterministic example taken from (Grüne, 2013).
Example 1 (Motivating example)
Consider the following deterministic OCP
subject to | |||
We solve the problem for an increasing sequence of horizons . The results are presented in Fig. 1. It can be seen that the optimal solutions all approach a neighborhood of and depart from it towards the end of the horizon. This phenomenon is known as the turnpike property (McKenzie, 1976; Carlson et al., 1991). Subsequently, we are interested in exploring the turnpike phenomenon in stochastic LQ OCPs.
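The approach-and-depart behavior described above can be reproduced with a small sketch. The problem below is not Example 1 itself (its OCP is not reproduced here); it is a hypothetical unconstrained scalar LQ problem, and the parameters `a`, `b`, `q`, `r`, `x_ref` as well as the function name are assumptions chosen for illustration. The problem is solved by a backward dynamic-programming recursion for a value function that is quadratic with a linear term.

```python
def solve_scalar_lq(x0, N, a=2.0, b=1.0, q=1.0, r=1.0, x_ref=1.0):
    """Minimize sum_{k=0}^{N-1} q*(x_k - x_ref)**2 + r*u_k**2
    subject to x_{k+1} = a*x_k + b*u_k (no constraints, no terminal cost).

    All parameter values are illustrative assumptions.
    """
    # Value function V_k(x) = p*x**2 + 2*s*x + const, starting from V_N = 0.
    p, s = 0.0, 0.0
    gains = []
    for _ in range(N):
        d = r + p * b * b
        # Optimal input at this step: u = -K*x - kappa
        gains.append((p * a * b / d, s * b / d))
        p, s = q + a * a * p * r / d, -q * x_ref + a * s * r / d
    gains.reverse()  # gains[k] now belongs to time step k

    xs = [x0]
    for K, kappa in gains:
        u = -K * xs[-1] - kappa
        xs.append(a * xs[-1] + b * u)
    return xs


# For the assumed parameters the optimal steady state is x_s = 0.5, u_s = -0.5.
for N in (10, 20, 30):
    xs = solve_scalar_lq(x0=2.0, N=N)
    print(N, round(xs[N // 2], 4), round(xs[-1], 4))
```

For each horizon, the printed mid-horizon state lies close to the optimal steady state of the assumed problem, whereas the final state has moved away from it, since the last input is not worth its cost when no terminal penalty is present.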

2.1 Problem Statement
We consider stochastic LQ OCPs of the following form
(1a) | ||||
subject to | ||||
(1b) | ||||
(1c) | ||||
(1d) | ||||
(1e) |
whereby at each time step , is an random variable on the underlying filtered probability space , where is the set of realizations, is a -algebra, is a filtration, and is the probability measure.
Note that the -algebras are related by the time evolution of the information, i.e.
We choose as the smallest filtration such that is an adapted process (which results from the evolution of the dynamics (1b)), i.e.
Then, the control at time is modeled as a stochastic process which is adapted to the filtration , i.e. .
The concept of stochastic filtrations can be understood as a stochastic causality requirement, i.e. the stochastic input process at time step may only depend on the realization of the random variables up to time step . Note that the influence of the noise, which is also an adapted stochastic process, is handled implicitly via the state recursion. That is, depends on and thus also on . For details on stochastic filtrations we refer to (Fristedt and Gray, 2013).
The stage cost is given by
Typical choices for are combinations of the expected value and the variance of some underlying deterministic stage cost function. The considered dynamics are a discrete-time stochastic system subject to noise modeled as a stochastic process. The constraints are written as chance constraints for states and inputs, whereby the underlying sets and are assumed to be closed. Moreover, and specify the probabilities with which the chance constraints shall be satisfied.
2.2 Basics of Polynomial Chaos Expansion
In order to obtain a tractable reformulation of the stochastic LQ OCP (1) we consider the framework of Polynomial Chaos Expansions (PCE). For an in-depth introduction we refer to (Sullivan, 2015). The underlying idea of PCE is that random variables from some with can be described using an appropriate basis. To this end, we consider an orthogonal polynomial basis which spans .
Definition 1 (Polynomial chaos expansion)
In order to obtain a computationally tractable formulation, one has to truncate the PCE series after terms
(3) |
The choice of the basis polynomials can be inferred via the Wiener-Askey scheme (Sullivan, 2015). For example, in case of a standard Gaussian random variable one would consider Hermite polynomials as they allow modeling the Gaussian with the first two terms of the PCE series. Moreover, in case of explicit polynomial maps—e.g., consider state transitions —of finite degree, one can quantify the truncation errors arising from considering only the first PCE coefficients. One may also infer such that no truncation error arises. We refer to Mühlpfordt et al. (2018) for details.
Finally, we remark that whenever the truncated series representation (3) is exact, the first two moments of a random variable can be computed in terms of PCE coefficients as follows
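As a minimal sketch of this moment computation, assume the standard PCE convention that the zeroth basis function is the constant 1 and that the basis is orthogonal. Then the mean is the zeroth coefficient and the variance is the sum of the squared higher-order coefficients, each weighted by the squared norm of its basis function. The function name and the two test distributions are illustrative choices, not taken from the paper.

```python
def pce_moments(coeffs, norms):
    """Mean and variance of a random variable from its PCE coefficients.

    coeffs[j] is the coefficient of basis function phi^j and
    norms[j] = <phi^j, phi^j>. Assumes phi^0 = 1 and an orthogonal basis.
    """
    mean = coeffs[0]
    var = sum(c * c * n for c, n in zip(coeffs[1:], norms[1:]))
    return mean, var


# Gaussian with mean 1 and standard deviation 2 in the Hermite basis
# (phi^1 = xi with xi ~ N(0, 1), so <phi^1, phi^1> = 1).
gauss = pce_moments([1.0, 2.0], [1.0, 1.0])

# Uniform on [0, 1] in the Legendre basis
# (phi^1 = theta with theta ~ U[-1, 1], so <phi^1, phi^1> = 1/3).
unif = pce_moments([0.5, 0.5], [1.0, 1.0 / 3.0])
```

Both cases need only two coefficients, which matches the remark that a Gaussian is represented by the first two terms of a Hermite expansion.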
3 Simulation Study
We consider numerical examples that show the turnpike phenomena in the stochastic setting: The first example is an extension of the motivating example but with uncertain initial condition and additive Gaussian noise. The second example considers a linearized chemical reactor subject to non-Gaussian noise. The third example extends the second one via variance penalization in the objective.
3.1 Scalar Dynamics with Noise
Consider the stochastic variant of the motivating example
(4a) |
where denotes the initial random variable with known probability distribution , denotes system noise modeled as a white Gaussian noise such that all have an identical known probability distribution . We arrive at the following stochastic LQ OCP
(5a) | ||||
s.t. | (5b) | |||
(5c) | ||||
(5d) |
where and . We approximate the chance constraint as
(6) |
with , for a derivation see (Farina et al., 2013).
We consider to follow a uniform distribution with the support . The noise at time , , is a Gaussian distribution with mean and variance . Additionally, is set to 0.8 and thus we have .
Without the noise, considering a first-order PCE () for and with identical basis functions, exactness of the PCE representation of is guaranteed since the system dynamics are linear, see Mühlpfordt et al. (2018). When noise is included, the PCE dimension needed for an exact representation is therefore determined by the horizon . More precisely, , where two PCE terms are induced by the uncertainty of the initial condition and the rest is caused by the noise. The PCE basis and coefficients read
(7a) | |||
(7b) | |||
(7c) | |||
(7d) | |||
with | |||
(7e) | |||
(7f) |
where is a standard uniformly distributed random variable while are independent Gaussian random variables. Additionally, and , since is uniformly distributed.
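How such coefficients evolve under linear dynamics can be sketched as follows. Galerkin projection of a linear map acts on each basis function separately, so every PCE coefficient obeys the same recursion as the state itself. The scalar system, the linear feedback `K`, all numerical values, the zero-mean noise, and the basis ordering are assumptions made for illustration; they are not the data of problem (5).

```python
def propagate_pce(N, a=0.9, b=1.0, K=0.4, x0_mean=1.0, x0_half_width=0.5, w_std=0.2):
    """Propagate PCE coefficients of X_{k+1} = a*X_k + b*U_k + W_k with U_k = -K*X_k.

    Assumed (hypothetical) basis ordering: phi^0 = 1, phi^1 = theta with
    theta ~ U[-1, 1], and phi^(i+2) = xi_i with xi_i ~ N(0, 1) for the noise W_i.
    """
    L = N + 2  # in this sketch: two terms for X_0 plus one per noise variable
    norms = [1.0, 1.0 / 3.0] + [1.0] * N  # <phi^j, phi^j>
    x = [[0.0] * L for _ in range(N + 1)]
    x[0][0], x[0][1] = x0_mean, x0_half_width
    for k in range(N):
        for j in range(L):
            # The zero-mean noise W_k only has a coefficient on phi^(k+2).
            w_j = w_std if j == k + 2 else 0.0
            x[k + 1][j] = (a - b * K) * x[k][j] + w_j
    return x, norms


def variance(coeffs, norms):
    """Variance from PCE coefficients, assuming phi^0 = 1."""
    return sum(c * c * n for c, n in zip(coeffs[1:], norms[1:]))


x, norms = propagate_pce(N=5)
variances = [variance(row, norms) for row in x]
```

The variances obtained from the coefficients agree with the scalar recursion `v_next = (a - b*K)**2 * v + w_std**2`, which is one way to check that the expansion is exact for linear dynamics.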
Expressing Stochastic Filtrations in the PCE Framework
Stochastic modeling via adapted filtrations expresses the idea that the noise at time influences the next state and input and the subsequent time instances but not at . In terms of PCE representation this implies that
(8) |
with
and where, respectively, and are placeholders and for and for .
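This causality requirement can be encoded as a sparsity pattern on the coefficient arrays. The sketch below assumes a hypothetical basis ordering in which index 0 carries the mean, index 1 the initial-condition uncertainty, and index `i + 2` the noise at time `i`; the function name is likewise an assumption.

```python
def causal_mask(N):
    """mask[k][j] is True if the coefficient of phi^j may be nonzero at time k.

    At time k, only the mean (j = 0), the initial condition (j = 1), and the
    noise terms W_0, ..., W_{k-1} (j = 2, ..., k + 1) may enter.
    """
    L = N + 2
    return [[j < k + 2 for j in range(L)] for k in range(N + 1)]


mask = causal_mask(3)
free_coeffs = [sum(row) for row in mask]  # number of free coefficients per time step
```

In an optimization over PCE coefficients, the entries where the mask is `False` would be fixed to zero, so the number of decision variables grows by one per time step.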
Results
To illustrate turnpike behavior, we solve the stochastic OCP (5) as given above over different optimization horizons using PolyChaos.jl (Mühlpfordt et al., 2020). Fig. 2 shows the trajectories of the optimal solutions for a total of 16 realizations of the uncertainties. At first glance, the realizations of the solutions are noisy and appear not to exhibit the turnpike property.
Hence, in order to uncover the turnpike, we consider the trajectories of the PCE coefficients. Note that each PCE basis function induced by the noise is a standard Gaussian random variable, so the sum is equal to a new Gaussian distributed random variable with zero mean and variance. Hence, for the sake of simplified illustration, we consider
(9a) | |||
with | |||
(9b) |


Here is a random variable with standard Gaussian distribution (zero mean). Therefore, instead of , only one PCE coefficient suffices to represent the uncertainty caused by noise. Note that this transformation is used only for illustration purposes and not in the underlying computation.
Using the PCE reformulation detailed above, Fig. 3 illustrates that the turnpike phenomenon occurs in terms of PCE coefficients. Actually, the turnpike property of PCE coefficients suggests that the optimal steady-state is a random variable with stationary distribution. This distribution can be calculated from the optimal steady-state problem formulated via PCE. Doing so, i.e. solving
subject to | |||
we obtain the stationary distribution depicted in Fig. 4. This figure also shows the histogram at obtained from sampling realizations of the uncertainty from the optimal PCE solution for . As one can see, the behavior in the middle of the horizon corresponds to the solution obtained for the steady state problem.
The time evolution of the state histograms and the distributions obtained via PCE is shown in Fig. 5 for . It is not surprising that the histograms follow the calculated PDF perfectly and the state keeps the same distribution in the middle of the trajectory. This illustrates the turnpike phenomenon in the distributions.


3.2 Stochastic LQ OCP for a CSTR
As a second example, we consider a linearized continuous stirred-tank reactor (CSTR), which has appeared in several papers such as (Zanon and Faulwasser, 2018). The expected value quadratic stage cost and the linear discrete-time system with noise read
(10a) | ||||
(10b) |
where are modeled as independent uniformly distributed random variables with support . The initial state is a Gaussian distributed random vector with known mean and variance . We solve the stochastic OCP via PCE over different horizons . We obtain the trajectories of the optimal solutions for a total realization sequences of the uncertainties, see Fig. 6. As the dimension of PCE is quite large, we plot the first two moments instead of PCE coefficients, i.e. the mean and the variance of state and input random variables, see Fig. 7. Similar to the previous example, the trajectories of mean and variance exhibit the turnpike property. As before, Fig. 8 depicts the histograms of the state at for the considered realizations and the PDF obtained via PCE.



As an additional means of assessing the turnpike phenomenon via simulation, we consider the following numerical setting:
- Compute a random realization of the disturbance , for denoted as
- Pick horizons and corresponding realizations of the initial condition .
- For all horizons and the initial condition , simulate the response of the dynamics under the optimal input policy, while the disturbance sequence is fixed to (or a subpart thereof).
The results of this numerical experiment are depicted in Figure 9.

As one can see, all the trajectories approach the same solution after some time. One can understand this solution as a time-varying path of a stationary turnpike solution, whose shape is governed by the considered disturbance sequence. Observe the difference to Figure 6, wherein for each trajectory a different disturbance realization sequence is considered.
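The mechanism behind this observation can be sketched on a toy problem. The code below is not the CSTR example: it uses a hypothetical scalar system, and a fixed linear feedback `K` stands in for the optimal input policy. All numerical values, the seed, and the noise support are assumptions for illustration.

```python
import random


def simulate(x0, w_seq, a=0.9, b=1.0, K=0.4):
    """Closed-loop response of x_{k+1} = a*x_k + b*u_k + w_k for a given disturbance sequence."""
    xs = [x0]
    for w in w_seq:
        u = -K * xs[-1]  # hypothetical linear feedback, a stand-in for the optimal policy
        xs.append(a * xs[-1] + b * u + w)
    return xs


rng = random.Random(0)
w_seq = [rng.uniform(-0.1, 0.1) for _ in range(30)]  # one fixed disturbance realization

# Same disturbance sequence, different initial conditions.
trajs = [simulate(x0, w_seq) for x0 in (-1.0, 0.0, 2.0)]
spread = [max(t[k] for t in trajs) - min(t[k] for t in trajs) for k in range(31)]
```

Because the disturbance enters all trajectories identically, the difference between any two of them evolves without noise and contracts with the closed-loop factor `a - b*K`. The common path they approach is still time-varying, since it is driven by `w_seq`.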
3.3 Stochastic LQ OCP with Variance Penalization
How can one obtain an optimal steady state with a narrow distribution or, in other words, with small variance? Including a variance penalization in the objective function is one option. We consider the previous example augmented with a penalty on the variance of the state in the stage cost
(11) |
Here we choose and solve the stochastic OCP via PCE over different horizons . The trajectories of the optimal solutions for a total realization sequences of the uncertainties are shown in Fig. 10. Observe that the state is much less affected by the noise. It can also be seen in Fig. 11 that the variance of state is smaller than in Example 2, while the variance of state and the variance of the input increase. Fig. 12 also shows that the optimal steady state has a narrower distribution.
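Why a variance penalty leaves the problem structure intact can be seen from a short worked identity. It is stated here for a generic scalar random variable $X$ with an exact PCE, $\phi^0 = 1$, and hypothetical weights $q$ and $\gamma$; it is not the stage cost (11) itself. Since $\mathbb{E}[X^2] = \mathbb{E}[X]^2 + \mathbb{V}[X]$, one obtains

```latex
\mathbb{E}\big[q X^2\big] + \gamma\, \mathbb{V}[X]
  = q\,(\mathsf{x}^0)^2
  + (q + \gamma) \sum_{j=1}^{L-1} (\mathsf{x}^j)^2 \,\langle \phi^j, \phi^j \rangle .
```

Under these assumptions, the penalty only increases the weight on the higher-order coefficients, so the reformulated problem remains quadratic in the PCE coefficients.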



4 Summary
This paper has conducted a simulation study on turnpike properties in stochastic OCPs. It has presented three examples of stochastic LQ OCPs all of which exhibit the turnpike phenomenon. Indeed, the examples demonstrate that turnpike phenomena can be observed in different contexts:
- in terms of statistical moments (or PCE coefficients which can be mapped to moments),
- in terms of probability distributions of state and input variables staying close to their optimal stationary distributions, and
- in terms of the realization trajectories staying close to an orbit defined by the noise realization.
Moreover, our simulation study demonstrates that beyond the usual minimization of expected values, the turnpike phenomenon is also present for combinations of expected-value and minimum-variance objectives.
While this note merely presented simulation results, there is a clear prospect of extending the established notions of turnpike properties to stochastic OCPs and corresponding stochastic MPC formulations. Yet, at this stage, there is also an evident need for analytic results to understand the turnpike phenomenon in stochastic settings.
References
- Bayer et al. (2016) Bayer, F., Lorenzen, M., Müller, M., and Allgöwer, F. (2016). Robust economic model predictive control using stochastic information. Automatica, 74, 151–161.
- Carlson et al. (1991) Carlson, D., Haurie, A., and Leizarowitz, A. (1991). Infinite Horizon Optimal Control: Deterministic and Stochastic Systems. Springer Verlag.
- Damm et al. (2014) Damm, T., Grüne, L., Stieler, M., and Worthmann, K. (2014). An exponential turnpike theorem for dissipative optimal control problems. SIAM Journal on Control and Optimization, 52(3), 1935–1957.
- Dorfman et al. (1958) Dorfman, R., Samuelson, P., and Solow, R. (1958). Linear Programming and Economic Analysis. McGraw-Hill, New York.
- Farina et al. (2013) Farina, M., Giulioni, L., Magni, L., and Scattolini, R. (2013). A probabilistic approach to model predictive control. In 52nd IEEE Conference on Decision and Control, 7734–7739. IEEE.
- Faulwasser et al. (2020) Faulwasser, T., Grüne, L., Humaloja, J.P., and Schaller, M. (2020). The interval turnpike property for adjoints. Pure and Applied Functional Analysis. Accepted.
- Faulwasser et al. (2018) Faulwasser, T., Grüne, L., and Müller, M. (2018). Economic nonlinear model predictive control: Stability, optimality and performance. Foundations and Trends in Systems and Control, 5(1), 1–98. 10.1561/2600000014.
- Faulwasser et al. (2017) Faulwasser, T., Korda, M., Jones, C., and Bonvin, D. (2017). On turnpike and dissipativity properties of continuous-time optimal control problems. Automatica, 81, 297–304. 10.1016/j.automatica.2017.03.012.
- Fristedt and Gray (2013) Fristedt, B. and Gray, L. (2013). A modern approach to probability theory. Springer Science & Business Media.
- Gerster et al. (2019) Gerster, S., Herty, M., Chertkov, M., Vuffray, M., and Zlotnik, A. (2019). Polynomial chaos approach to describe the propagation of uncertainties through gas networks. In Progress in Industrial Mathematics at ECMI 2018, 59–65. Springer.
- Grüne (2013) Grüne, L. (2013). Economic receding horizon control without terminal constraints. Automatica, 49(3), 725–734.
- Grüne and Müller (2016) Grüne, L. and Müller, M. (2016). On the relation between strict dissipativity and turnpike properties. Sys. Contr. Lett., 90, 45 – 53.
- Kim et al. (2013) Kim, K., Shen, D., Nagy, Z., and Braatz, R. (2013). Wiener’s polynomial chaos for the analysis and control of nonlinear dynamical systems with probabilistic uncertainties. IEEE Control Systems, 33(5), 58–67.
- Kolokoltsov and Yang (2012) Kolokoltsov, V. and Yang, W. (2012). Turnpike theorems for Markov games. Dyn. Games Appl., 2(3), 294–312.
- Marimon (1989) Marimon, R. (1989). Stochastic turnpike property and stationary equilibrium. J. Econom. Theory, 47(2), 282–306.
- McKenzie (1976) McKenzie, L. (1976). Turnpike theory. Econometrica: Journal of the Econometric Society, 44(5), 841–865.
- Mesbah and Streif (2015) Mesbah, A. and Streif, S. (2015). A probabilistic approach to robust optimal experiment design with chance constraints. IFAC-PapersOnLine, 48(8), 100–105.
- Mühlpfordt et al. (2017) Mühlpfordt, T., Faulwasser, T., Roald, L., and Hagenmeyer, V. (2017). Solving optimal power flow with non-Gaussian uncertainties via polynomial chaos expansion. In Proc. of 56th IEEE Conference on Decision and Control, 4490–4496. Melbourne, Australia. 10.1109/CDC.2017.8264321.
- Mühlpfordt et al. (2018) Mühlpfordt, T., Findeisen, R., Hagenmeyer, V., and Faulwasser, T. (2018). Comments on quantifying truncation errors for polynomial chaos expansions. IEEE Control Systems Letters, 2(1), 169–174. 10.1109/LCSYS.2017.2778138.
- Mühlpfordt et al. (2020) Mühlpfordt, T., Zahn, F., Hagenmeyer, V., and Faulwasser, T. (2020). PolyChaos.jl – a Julia package for polynomial chaos in systems and control. In Proceedings of the 21. IFAC World Congress.
- Paulson et al. (2014) Paulson, J., Mesbah, A., Streif, S., Findeisen, R., and Braatz, R. (2014). Fast stochastic model predictive control of high-dimensional systems. In 53rd IEEE Conference on Decision and Control, 2802–2809. IEEE.
- Sullivan (2015) Sullivan, T.J. (2015). Introduction to uncertainty quantification, volume 63. Springer.
- Trélat and Zuazua (2015) Trélat, E. and Zuazua, E. (2015). The turnpike property in finite-dimensional nonlinear optimal control. Journal of Differential Equations, 258(1), 81–114.
- Wiener (1938) Wiener, N. (1938). The homogeneous chaos. American Journal of Mathematics, 897–936.
- Xiu and Karniadakis (2002) Xiu, D. and Karniadakis, G.E. (2002). The Wiener-Askey polynomial chaos for stochastic differential equations. SIAM Journal on Scientific Computing, 24(2), 619–644.
- Zanon and Faulwasser (2018) Zanon, M. and Faulwasser, T. (2018). Economic MPC without terminal constraints: Gradient-correcting end penalties enforce stability. Journal of Process Control, 63, 1–14. 10.1016/j.jprocont.2017.12.005.