
Minimum-Time Trajectory Optimization With
Data-Based Models: A Linear Programming Approach

Nan Li [email protected]    Ehsan Taheri [email protected]    Ilya Kolmanovsky [email protected]    Dimitar Filev [email protected]
Department of Aerospace Engineering, Auburn University, Auburn, AL, USA
Department of Aerospace Engineering, University of Michigan, Ann Arbor, MI, USA
Hagler Institute for Advanced Study, Texas A&M University, College Station, TX, USA
Abstract

In this paper, we develop a computationally-efficient approach to minimum-time trajectory optimization using input-output data-based models, to produce an end-to-end data-to-control solution to time-optimal planning/control of dynamic systems and hence facilitate their autonomous operation. The approach integrates a non-parametric data-based model for trajectory prediction and a continuous optimization formulation based on an exponential weighting scheme for minimum-time trajectory planning. The optimization problem in its final form is a linear program and is easy to solve. We validate the approach and illustrate its application with a spacecraft relative motion planning problem.

keywords:
Trajectory Optimization, Time-Optimal Control, Data-Driven Control, Linear Programming
This paper was not presented at any IFAC meeting. Corresponding author N. Li. Email: [email protected]


1 Introduction

Minimum-time trajectory optimization (also known as “time-optimal control”) is frequently involved in robot path planning and tracking (Lepetič et al., 2003; Verscheure et al., 2009), space mission design and operation (Shirazi et al., 2018), and other disciplines wherever a task is desired to be accomplished in the least amount of time, typically subject to limited resources. Due to its significant relevance, the investigation into this topic has been extensive over decades (Kalman and Bertram, 1959; LaSalle, 1959; O’Reilly, 1981).

The formulation of a minimum-time trajectory optimization problem can be either in continuous time or in discrete time. In continuous time, solution techniques are usually based on indirect methods (also known as “variational methods”): One first applies Pontryagin’s maximum principle to reduce the trajectory optimization problem to a two-point boundary value problem (TPBVP), and then solves the TPBVP using a shooting method (Trélat, 2012; Taheri et al., 2017). In discrete time, in contrast, direct methods are more often considered. Various direct methods for minimum-time trajectory optimization in discrete time have been proposed in the literature: In the approach of Carvallo et al. (1990), a minimum-time problem is reformulated into a mixed-integer program, where a set of Boolean variables are used to indicate if a given target condition is reached at each time step over the planning horizon. In the approaches of Van den Broeck et al. (2011) and Zhang et al. (2014), a minimum-time problem is solved using a bi-level algorithm, where the lower level solves a fixed-horizon trajectory planning problem treating the target condition as a terminal constraint and the upper level adjusts the planning horizon of the lower-level problem to find the minimum horizon length such that the lower-level problem admits a feasible solution. In the approaches of Rösmann et al. (2015) and Wang et al. (2017), the time is scaled and the scaling factor is treated as an optimization variable. This way, minimizing the final time of the original free-final-time problem is achieved through minimizing the scaling factor in a related fixed-final-time problem. However, in such an approach based on time scaling, the optimization problem is necessarily nonlinear and nonconvex, even for linear systems. In the approach of Verschueren et al. (2017), the deviation from the target condition is penalized with a weight that increases exponentially with time. It is shown that a minimum-time trajectory solution can be obtained if the weight parameter is chosen to be sufficiently high. The approaches reviewed above typically use a state-space model of the considered dynamic system to predict the trajectories. A common approach to obtaining a state-space model, to be used for trajectory optimization, is to first derive the model from first principles and then calibrate its parameters according to prior knowledge and/or data of the system.

Recently, a non-parametric modeling approach based on behavioral systems theory is gaining attention due to its unique applicability to emerging data-driven paradigms (Markovsky and Dörfler, 2021). This approach uses input-output time-series data of a given system to build up a (non-parametric) predictive model, which is referred to as a data-based model in this paper, rather than using the data to identify/calibrate a (parametric) state-space model. The integration of such data-based models into predictive control algorithms has been investigated by various researchers, e.g., Coulson et al. (2019, 2021); Berberich et al. (2020); Baros et al. (2022); Huang et al. (2023), and has demonstrated superior performance compared to conventional control methods in a number of applications (Elokda et al., 2021; Huang et al., 2021; Chinde et al., 2022). Along with bypassing the step of state-space model identification/calibration, such a predictive control algorithm using an input-output data-based model determines control input values directly based on output measurements and does not require state estimation using an observer. Therefore, it provides an “end-to-end” data-to-control solution, which may be simpler to implement from the point of view of practitioners and may facilitate fully autonomous operation of future intelligent systems.

In the above context, the goal of this paper is to develop an approach to minimum-time trajectory optimization based on input-output data-based models, to produce an end-to-end data-to-control solution to many time-optimal planning and control problems in robotics, aerospace, and other disciplines. To the best of our knowledge, there has been no previous work addressing this topic. We focus on linear time-invariant systems, for which we adopt the approach based on behavioral systems theory to build up a non-parametric data-based model for trajectory prediction. We then exploit an exponential weighting scheme extended from the one of Verschueren et al. (2017) to solve for minimum-time trajectories. We show that the optimization problem in its final form is a linear program and hence is easy to solve. The main contributions of this paper are as follows:

  • We develop an approach to minimum-time trajectory optimization based on input-output data-based models, which is the first such approach in the literature.

  • We extend the trajectory prediction method in previous predictive control algorithms using data-based models (e.g., the ones in Coulson et al. (2019, 2021); Berberich et al. (2020); Baros et al. (2022); Huang et al. (2023)) to enable predicting trajectories over an extended planning horizon without relying on a high-dimensional model. This extension is especially relevant to trajectory optimization because a long planning horizon is not rare in a trajectory optimization setting. Our method is inspired by the multiple shooting method (Trélat, 2012).

  • Using an exponential weighting scheme extended from that of Verschueren et al. (2017), we formulate the minimum-time trajectory optimization problem as a linear program, which is easy to solve. We prove that minimum-time trajectories can be obtained with the linear program if a weight parameter is chosen to be sufficiently high. In particular, we make the following extensions to the exponential weighting scheme: 1) The scheme was used in Verschueren et al. (2017) for minimum-time trajectory planning based on a state-space model; we extend the scheme to apply it to minimum-time trajectory planning based on an input-output data-based model. 2) The scheme was used in Verschueren et al. (2017) for point-to-point trajectory planning; we extend the scheme to more general point-to-set trajectory planning. To enable these extensions, different assumptions are made, and, correspondingly, new proofs are developed to show the ability of this exponential weighting scheme to produce minimum-time trajectory solutions.

  • We validate the developed approach to minimum-time trajectory optimization and illustrate its application with an aerospace example.

Organization: The minimum-time trajectory optimization problem addressed in this paper is described in Section 2. We introduce the method using a non-parametric input-output data-based model to predict trajectories over an extended planning horizon in Section 3. We reformulate the minimum-time trajectory optimization problem into a linear program based on an exponential weighting scheme and prove its theoretical properties in Section 4. A spacecraft relative motion planning problem is considered in Section 5 to illustrate the approach. The paper is concluded in Section 6.

Notations: The symbol \mathbb{R} denotes the set of real numbers and \mathbb{Z} the set of integers; \mathbb{R}^{n} denotes the set of n-dimensional real vectors, \mathbb{R}^{n\times m} the set of n-by-m real matrices, and \mathbb{Z}_{\geq a} the set of integers that are greater than or equal to a; I_{n} denotes the n-dimensional identity matrix, 0_{n,m} the n-by-m zero matrix, and 1_{n} the n-dimensional column vector of ones. Given multiple vectors v_{k}\in\mathbb{R}^{n_{k}} or matrices with the same number of columns M_{k}\in\mathbb{R}^{n_{k}\times m}, k=1,...,K, the operator \text{col}(\cdot) stacks them on top of one another, i.e., \text{col}(v_{1},...,v_{K})=[v_{1}^{\top},...,v_{K}^{\top}]^{\top} and \text{col}(M_{1},...,M_{K})=[M_{1}^{\top},...,M_{K}^{\top}]^{\top}. For a discrete-time signal z(\cdot):\mathbb{Z}\to\mathbb{R}^{n}, we use {\bf z}_{[a:b]}, with a,b\in\mathbb{Z} and a\leq b, to denote \text{col}(z(a),...,z(b)). We call both the sequence z(a),...,z(b) and the column vector {\bf z}_{[a:b]}=\text{col}(z(a),...,z(b)) a trajectory (of length l=b-a+1).

2 Problem Statement

We study trajectory optimization problems associated with finite-dimensional linear time-invariant systems which can be represented in state-space form as

x(t+1)=Ax(t)+Bu(t) (1a)
y(t)=Cx(t)+Du(t) (1b)

where t\in\mathbb{Z} denotes the discrete time step, x(t)\in\mathbb{R}^{n} represents the system state at time t, u(t)\in\mathbb{R}^{m} is the control input, y(t)\in\mathbb{R}^{p} is the output, and A, B, C and D are matrices of appropriate dimensions. We make the following assumption about the system:

Assumption 1: The system is controllable and observable.

Given a state-space model (1) of the system, we can write the following equation that relates an input trajectory of length l, l\in\mathbb{Z}_{\geq 1}, to its corresponding output trajectory:

{\bf y}_{[0:l-1]}=\mathcal{O}_{l}x(0)+\mathcal{C}_{l}{\bf u}_{[0:l-1]} (2)

where x(0) is the system state at the initial time of the trajectory, and \mathcal{O}_{l} and \mathcal{C}_{l} are matrices defined as follows:

\mathcal{O}_{l}=\begin{bmatrix}C\\ CA\\ \vdots\\ CA^{l-1}\end{bmatrix}\quad\mathcal{C}_{l}=\begin{bmatrix}D&&&\\ CB&D&&\\ \vdots&\ddots&\ddots&\\ CA^{l-2}B&\cdots&CB&D\end{bmatrix} (3)

The smallest integer l such that the matrix \mathcal{O}_{l} defined above has full rank is called the lag of the system and denoted as l_{\min}. Under Assumption 1, l_{\min} exists and satisfies 1\leq l_{\min}\leq n.

Given an arbitrary pair of input trajectory {\bf u}_{[0:l-1]} and output trajectory {\bf y}_{[0:l-1]}, both of length l, if (2) holds for some x(0)\in\mathbb{R}^{n}, then the pair ({\bf u}_{[0:l-1]},{\bf y}_{[0:l-1]}) is called admissible by the system. In particular, an admissible pair of input-output trajectories of length l, ({\bf u}_{[0:l-1]},{\bf y}_{[0:l-1]}), with l\geq l_{\min}, corresponds to a unique initial state x(0), which can be determined according to (2) as follows:

x(0)=\mathcal{O}_{l}^{\dagger}({\bf y}_{[0:l-1]}-\mathcal{C}_{l}{\bf u}_{[0:l-1]}) (4)

where (\cdot)^{\dagger} denotes the Moore-Penrose pseudoinverse.
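To illustrate (2)–(4), the following is a minimal sketch (Python with NumPy; the randomly generated system and all dimensions are illustrative placeholders, not the example of Section 5) that builds \mathcal{O}_{l} and \mathcal{C}_{l} and recovers x(0) from an admissible input-output pair via the pseudoinverse in (4).

```python
import numpy as np

# Illustrative system matrices (placeholders); any observable (A, C) works here.
rng = np.random.default_rng(0)
n, m, p, l = 4, 2, 2, 6          # state, input, output dims; trajectory length
A = rng.normal(size=(n, n)) * 0.3
B = rng.normal(size=(n, m))
C = rng.normal(size=(p, n))
D = np.zeros((p, m))

def observability_matrix(A, C, l):
    # O_l = col(C, CA, ..., CA^{l-1}) as in (3)
    return np.vstack([C @ np.linalg.matrix_power(A, k) for k in range(l)])

def impulse_toeplitz(A, B, C, D, l):
    # C_l: block lower-triangular Toeplitz matrix of Markov parameters as in (3)
    p, m = C.shape[0], B.shape[1]
    T = np.zeros((p * l, m * l))
    for i in range(l):
        for j in range(i + 1):
            blk = D if i == j else C @ np.linalg.matrix_power(A, i - j - 1) @ B
            T[i*p:(i+1)*p, j*m:(j+1)*m] = blk
    return T

# Simulate one admissible pair of input-output trajectories of length l
x0 = rng.normal(size=n)
u_traj = rng.normal(size=(l, m))
x, ys = x0.copy(), []
for t in range(l):
    ys.append(C @ x + D @ u_traj[t])
    x = A @ x + B @ u_traj[t]
y_vec = np.concatenate(ys)              # stacked y_[0:l-1]
u_vec = u_traj.reshape(-1)              # stacked u_[0:l-1]

# Recover x(0) via (4); exact whenever l is at least the lag of the system
O_l = observability_matrix(A, C, l)
C_l = impulse_toeplitz(A, B, C, D, l)
x0_hat = np.linalg.pinv(O_l) @ (y_vec - C_l @ u_vec)
print(np.allclose(x0_hat, x0))          # True when O_l has full column rank
```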

In this paper, we focus on minimum-time trajectory optimization problems given in the following form:

\min_{u(\cdot),y(\cdot),T\geq 0} \quad T (5a)
s.t. \quad ({\bf u}_{[-K_{i}:-1]},{\bf y}_{[-K_{i}:-1]})=({\bf u}_{i},{\bf y}_{i}) (5b)
\quad {\bf y}_{[T:T+K_{f}-1]}\in Y_{f} (5c)
\quad c(u(t),y(t))\leq 0,\quad t=0,...,T+K_{f}-1 (5d)

where K_{i}\geq l_{\min}; {\bf u}_{i}\in\mathbb{R}^{mK_{i}} and {\bf y}_{i}\in\mathbb{R}^{pK_{i}} represent given initial conditions for the input and output trajectories; K_{f}\geq 1; Y_{f}\subset\mathbb{R}^{pK_{f}} represents a target set for the output trajectory; and c(u(t),y(t))\leq 0 represents prescribed path constraints for the trajectory to satisfy. The goal represented by the cost function in (5a) and the constraint in (5c) is to drive the output trajectory to reach the target set Y_{f} in the minimum time, starting with the initial condition in (5b), while satisfying the path constraints in (5d). We first make the following assumption about the initial condition ({\bf u}_{i},{\bf y}_{i}):

Assumption 2: The pair ({\bf u}_{i},{\bf y}_{i}) represents an admissible pair of input-output trajectories (of length K_{i}) that satisfies the path constraints c(u(t),y(t))\leq 0 at all times.

Assumption 2 is reasonable because {\bf u}_{i} and {\bf y}_{i}, according to (5b), represent the input and output trajectories of the system over the past K_{i} time steps, and hence the pair ({\bf u}_{i},{\bf y}_{i}) should be admissible and satisfy any path constraints. Under Assumption 2, given a state-space model (1) of the system, (5b) corresponds to the following initial condition for the state:

x(0)=A^{K_{i}}\mathcal{O}_{K_{i}}^{\dagger}({\bf y}_{i}-\mathcal{C}_{K_{i}}{\bf u}_{i})+\begin{bmatrix}A^{K_{i}-1}B&\cdots&AB&B\end{bmatrix}{\bf u}_{i} (6)

where \mathcal{O}_{K_{i}} and \mathcal{C}_{K_{i}} are the matrices defined in (3) with l=K_{i}.

We consider a polyhedral target set Y_{f}\subset\mathbb{R}^{pK_{f}} and linear-inequality path constraints c(u(t),y(t))\leq 0, i.e., they can be written as:

Y_{f}=\{{\bf y}_{f}\in\mathbb{R}^{pK_{f}}:G{\bf y}_{f}\leq g,\,H{\bf y}_{f}=h\} (7a)
c(u(t),y(t))=\begin{bmatrix}S_{u}&S_{y}\end{bmatrix}\begin{bmatrix}u(t)\\ y(t)\end{bmatrix}-s\leq 0 (7b)

where G,H,S_{u},S_{y} and g,h,s are matrices and vectors of compatible dimensions. Furthermore, we make the following “controlled invariance” assumption about Y_{f}:

Assumption 3: Let \bar{K}_{f}=\max(K_{f},l_{\min}). For any admissible pair of input-output trajectories of length \bar{K}_{f}, ({\bf u}_{[0:\bar{K}_{f}-1]},{\bf y}_{[0:\bar{K}_{f}-1]}), that satisfies the path constraints c(u(t),y(t))\leq 0 for t=0,...,\bar{K}_{f}-1 and the target condition {\bf y}_{[\bar{K}_{f}-K_{f}:\bar{K}_{f}-1]}\in Y_{f}, there exist an input u(\bar{K}_{f}) and its corresponding output y(\bar{K}_{f}) such that they satisfy c(u(\bar{K}_{f}),y(\bar{K}_{f}))\leq 0 and {\bf y}_{[\bar{K}_{f}-K_{f}+1:\bar{K}_{f}]}\in Y_{f}.

In Assumption 3, because \bar{K}_{f}\geq l_{\min}, an admissible pair of input-output trajectories ({\bf u}_{[0:\bar{K}_{f}-1]},{\bf y}_{[0:\bar{K}_{f}-1]}) corresponds to a unique initial state x(0) and a unique state trajectory x(0),x(1),...,x(\bar{K}_{f}). Hence, for an input u(\bar{K}_{f}), the corresponding output y(\bar{K}_{f}) is also unique. Assumption 3 means that for any admissible input-output trajectory pair whose output enters the target set Y_{f} while satisfying the path constraints, there exists a control that maintains the output trajectory in Y_{f} while continuing to satisfy the path constraints. Hence, it specifies a “controlled invariance” property of Y_{f} with respect to the system dynamics and the path constraints.

Remark 1: While in many conventional trajectory optimization problem formulations the initial condition is a given value x_{i} for the state at the initial time t=0, a given value for the output, or a given input-output pair, at a single time is not sufficient for uniquely determining the trajectory. Therefore, we consider a pair of input-output trajectories of length K_{i}, with K_{i}\geq l_{\min}, as the initial condition, which uniquely determines the initial state according to (6) and hence the trajectory. For the terminal condition, we consider a target set Y_{f} instead of a single point to represent a larger class of problems, which includes a single target point as a special case.

Lastly, we assume that a state-space model of the considered system (i.e., the matrices (A,B,C,D) in (1)) is not given and only input-output trajectory data of the system are available. For instance, we may deal with a real system that has uncertain parameters but can generate input-output data (see the example in Section 5). The goal of this paper is to develop a computationally-efficient approach to the minimum-time trajectory optimization problem (5) for such a setting. We note that although it is possible to first identify the matrices (A,B,C,D) using input-output data and system identification techniques and then solve the problem (5) based on the identified state-space model, this two-step approach may be cumbersome from the point of view of practitioners, and the (A,B,C,D) that is compatible with given data is in general not unique. Therefore, we pursue an end-to-end solution – directly from input-output data to a solution to (5).

3 Data-based Model for Long-term Trajectory Prediction

Assume that we have input-output trajectory data of length M:

\mathcal{D}_{M}=\left\{\begin{bmatrix}u^{d}(0)\\ y^{d}(0)\end{bmatrix},\begin{bmatrix}u^{d}(1)\\ y^{d}(1)\end{bmatrix},...,\begin{bmatrix}u^{d}(M-1)\\ y^{d}(M-1)\end{bmatrix}\right\} (8)

where the superscript d indicates “data.” Note that these input-output pairs (u^{d}(t),y^{d}(t)), t=0,...,M-1, should be sampled from a single trajectory (if the data are from multiple trajectories, then the inputs shall satisfy a collectively persistently exciting condition (van Waarde et al., 2020)). Construct the following Hankel data matrices:

\mathcal{H}_{L}(u^{d})=\begin{bmatrix}u^{d}(0)&u^{d}(1)&\cdots&u^{d}(M-L)\\ u^{d}(1)&u^{d}(2)&\cdots&u^{d}(M-L+1)\\ \vdots&\vdots&\ddots&\vdots\\ u^{d}(L-1)&u^{d}(L)&\cdots&u^{d}(M-1)\end{bmatrix} (9a)
\mathcal{H}_{L}(y^{d})=\begin{bmatrix}y^{d}(0)&y^{d}(1)&\cdots&y^{d}(M-L)\\ y^{d}(1)&y^{d}(2)&\cdots&y^{d}(M-L+1)\\ \vdots&\vdots&\ddots&\vdots\\ y^{d}(L-1)&y^{d}(L)&\cdots&y^{d}(M-1)\end{bmatrix} (9b)

where L indicates the number of stacks of the signal u^{d} or y^{d} in each column. The control input trajectory u^{d}(0),u^{d}(1),...,u^{d}(M-1) is said to be persistently exciting of order L if \mathcal{H}_{L}(u^{d}) has full rank.
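As a concrete illustration of (9) and the rank condition, the following sketch (Python/NumPy; the random data merely stand in for a measured trajectory) builds the Hankel matrices and checks persistent excitation of a given order.

```python
import numpy as np

def block_hankel(signal, L):
    """Build H_L(z) as in (9): each column stacks L consecutive samples.

    signal: array of shape (M, q) containing z(0), ..., z(M-1).
    Returns an array of shape (q*L, M-L+1).
    """
    M, q = signal.shape
    cols = [signal[j:j+L].reshape(-1) for j in range(M - L + 1)]
    return np.stack(cols, axis=1)

def persistently_exciting(u_data, L):
    # u_data: (M, m) inputs; PE of order L iff H_L(u) has full (row) rank m*L
    H = block_hankel(u_data, L)
    return np.linalg.matrix_rank(H) == H.shape[0]

# Illustrative data (random inputs are generically persistently exciting)
rng = np.random.default_rng(1)
M, m, p, L = 200, 2, 3, 10
u_d = rng.normal(size=(M, m))
y_d = rng.normal(size=(M, p))            # placeholder for measured outputs

H_u, H_y = block_hankel(u_d, L), block_hankel(y_d, L)
print(H_u.shape, H_y.shape)               # (m*L, M-L+1), (p*L, M-L+1)
print(persistently_exciting(u_d, L))      # True (generically)
```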

Our approach uses a result known as the fundamental lemma of Willems et al. (2005). The following lemma is an equivalent statement of Willems’ fundamental lemma in a state-space context (van Waarde et al., 2020):

Lemma 1: If the system (1) is controllable and the control input trajectory u^{d}(0),u^{d}(1),...,u^{d}(M-1) is persistently exciting of order L+n, then a pair of input-output trajectories of length L, ({\bf u}_{[t:t+L-1]},{\bf y}_{[t:t+L-1]}), is admissible by the system (1) if and only if

\begin{bmatrix}{\bf u}_{[t:t+L-1]}\\ {\bf y}_{[t:t+L-1]}\end{bmatrix}=\begin{bmatrix}\mathcal{H}_{L}(u^{d})\\ \mathcal{H}_{L}(y^{d})\end{bmatrix}\zeta (10)

for some vector \zeta\in\mathbb{R}^{M-L+1}.
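Lemma 1 can be used directly as a predictor: fixing some rows of (10) to measured or chosen values and solving the resulting linear equation for \zeta yields the remaining rows. The following sketch (Python/NumPy; the toy second-order system and all dimensions are illustrative, not the paper's example) predicts future outputs from the past K_{i} input-output samples and a candidate future input sequence in exactly this way.

```python
import numpy as np

rng = np.random.default_rng(2)

# Illustrative LTI system (toy example, not the spacecraft model of Section 5)
A = np.array([[0.9, 0.2], [0.0, 0.8]])
B = np.array([[0.0], [1.0]])
C = np.array([[1.0, 0.0]])
n, m, p = 2, 1, 1

def simulate(x0, u_seq):
    x, ys = x0.copy(), []
    for u in u_seq:
        ys.append(C @ x)
        x = A @ x + B @ u
    return np.array(ys), x

def block_hankel(signal, L):
    M = signal.shape[0]
    return np.stack([signal[j:j+L].reshape(-1) for j in range(M - L + 1)], axis=1)

# Offline data: one long trajectory under persistently exciting (random) inputs
M_data, Ki, L = 300, 2, 12              # L = Ki + prediction window
u_d = rng.normal(size=(M_data, m))
y_d, _ = simulate(rng.normal(size=n), u_d)
H = np.vstack([block_hankel(u_d, L), block_hankel(y_d, L)])   # col(H_L(u), H_L(y))

# Online: the past Ki samples fix the initial condition; future inputs are chosen
x_now = np.array([1.0, -0.5])
u_past = rng.normal(size=(Ki, m))
y_past, x0 = simulate(x_now, u_past)              # (u_i, y_i) as in (5b)
u_future = 0.1 * np.ones((L - Ki, m))             # candidate future inputs

# Solve (10) for zeta: all input rows and the past-output rows are known; the
# remaining output rows then give the predicted future outputs.
known_rows = np.concatenate([H[:m*L], H[m*L : m*L + p*Ki]])
rhs = np.concatenate([u_past.reshape(-1), u_future.reshape(-1), y_past.reshape(-1)])
zeta, *_ = np.linalg.lstsq(known_rows, rhs, rcond=None)
y_future_pred = (H[m*L + p*Ki:] @ zeta).reshape(L - Ki, p)

# Compare against the true response of the system from the same underlying state
y_future_true, _ = simulate(x0, u_future)
print(np.allclose(y_future_pred, y_future_true, atol=1e-6))   # True
```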

Assume L>K_{i} and let t=-K_{i}. Then, (10) can be written as

\begin{bmatrix}{\bf u}_{[-K_{i}:-1]}\\ {\bf u}_{[0:L-K_{i}-1]}\\ {\bf y}_{[-K_{i}:-1]}\\ {\bf y}_{[0:L-K_{i}-1]}\end{bmatrix}=\begin{bmatrix}\mathcal{H}_{L}(u^{d})\\ \mathcal{H}_{L}(y^{d})\end{bmatrix}\zeta (11)

Given ({\bf u}_{[-K_{i}:-1]},{\bf y}_{[-K_{i}:-1]})=({\bf u}_{i},{\bf y}_{i}), (11) is a linear equation of variables ({\bf u}_{[0:L-K_{i}-1]},{\bf y}_{[0:L-K_{i}-1]},\zeta)\in\mathbb{R}^{m(L-K_{i})}\times\mathbb{R}^{p(L-K_{i})}\times\mathbb{R}^{M-L+1}. In Coulson et al. (2019), (11) is used as a model for predicting the outputs y(t) over the entire planning horizon t=0,...,N-1. This requires L=N+K_{i}. When the planning horizon N is large, which is not rare in a trajectory optimization setting, such a strategy requires high-dimensional data matrices (\mathcal{H}_{L}(u^{d}),\mathcal{H}_{L}(y^{d})) and requires the control input trajectory u^{d}(0),u^{d}(1),...,u^{d}(M-1) to be persistently exciting of order L+n=N+K_{i}+n. Inspired by the multiple shooting method for trajectory optimization over an extended planning horizon (Trélat, 2012), we consider partitioning the entire trajectory of length N=K(L-K_{i}) into K segments:

{\bf u}_{[0:N-1]}=\begin{bmatrix}{\bf u}_{[0:L-K_{i}-1]}\\ {\bf u}_{[L-K_{i}:2(L-K_{i})-1]}\\ \vdots\\ {\bf u}_{[(K-1)(L-K_{i}):N-1]}\end{bmatrix} (12a)
{\bf y}_{[0:N-1]}=\begin{bmatrix}{\bf y}_{[0:L-K_{i}-1]}\\ {\bf y}_{[L-K_{i}:2(L-K_{i})-1]}\\ \vdots\\ {\bf y}_{[(K-1)(L-K_{i}):N-1]}\end{bmatrix} (12b)

For each segment k, k=1,...,K, we stack its previous K_{i} points on top to get the following vectors:

{\sf u}_{k}=\begin{bmatrix}{\bf u}_{[(k-1)(L-K_{i})-K_{i}:(k-1)(L-K_{i})-1]}\\ {\bf u}_{[(k-1)(L-K_{i}):k(L-K_{i})-1]}\end{bmatrix}\in\mathbb{R}^{mL} (13a)
{\sf y}_{k}=\begin{bmatrix}{\bf y}_{[(k-1)(L-K_{i})-K_{i}:(k-1)(L-K_{i})-1]}\\ {\bf y}_{[(k-1)(L-K_{i}):k(L-K_{i})-1]}\end{bmatrix}\in\mathbb{R}^{pL} (13b)

According to Lemma 1, for each k=1,...,K, the pair ({\sf u}_{k},{\sf y}_{k}) is an admissible pair of input-output trajectories if and only if

\begin{bmatrix}{\sf u}_{k}\\ {\sf y}_{k}\end{bmatrix}=\begin{bmatrix}\mathcal{H}_{L}(u^{d})\\ \mathcal{H}_{L}(y^{d})\end{bmatrix}\zeta_{k} (14)

for some \zeta_{k}\in\mathbb{R}^{M-L+1}. We refer to (14) for k=1,...,K as equality dynamic constraints with \zeta_{k} as auxiliary variables. Then, we impose the following equality matching conditions for k=1,...,K-1 to piece together the segments and form a long admissible trajectory:

\begin{bmatrix}I_{mK_{i}}\\ 0_{m(L-K_{i}),mK_{i}}\end{bmatrix}^{\top}{\sf u}_{k+1}=\begin{bmatrix}0_{m(L-K_{i}),mK_{i}}\\ I_{mK_{i}}\end{bmatrix}^{\top}{\sf u}_{k} (15a)
\begin{bmatrix}I_{pK_{i}}\\ 0_{p(L-K_{i}),pK_{i}}\end{bmatrix}^{\top}{\sf y}_{k+1}=\begin{bmatrix}0_{p(L-K_{i}),pK_{i}}\\ I_{pK_{i}}\end{bmatrix}^{\top}{\sf y}_{k} (15b)

i.e., we enforce the first K_{i} points of {\sf u}_{k+1} (resp. {\sf y}_{k+1}) to be equal to the last K_{i} points of {\sf u}_{k} (resp. {\sf y}_{k}). Note that because K_{i}\geq l_{\min}, an admissible pair of input-output trajectories of length K_{i} corresponds to a unique state trajectory of length K_{i}+1. Specifically, given a state-space model (1) of the system, for an admissible pair of input-output trajectories of length K_{i}, with K_{i}\geq l_{\min}, the unique corresponding initial state can be determined by (4) with l=K_{i}, and then the states over the next K_{i} steps are uniquely determined by this initial state and the given input trajectory. Therefore, matching the first K_{i} points of ({\sf u}_{k+1},{\sf y}_{k+1}) to the last K_{i} points of ({\sf u}_{k},{\sf y}_{k}) creates a continuous state trajectory x((k-1)(L-K_{i})-K_{i}),...,x((k+1)(L-K_{i})).

 

{\bf u}_{[-K_{i}:N-1]}=\text{col}\left(\begin{bmatrix}I_{mK_{i}}\\ 0_{m(L-K_{i}),mK_{i}}\end{bmatrix}^{\top}{\sf u}_{1},\begin{bmatrix}0_{mK_{i},m(L-K_{i})}\\ I_{m(L-K_{i})}\end{bmatrix}^{\top}{\sf u}_{1},...,\begin{bmatrix}0_{mK_{i},m(L-K_{i})}\\ I_{m(L-K_{i})}\end{bmatrix}^{\top}{\sf u}_{K}\right) (18a)
{\bf y}_{[-K_{i}:N-1]}=\text{col}\left(\begin{bmatrix}I_{pK_{i}}\\ 0_{p(L-K_{i}),pK_{i}}\end{bmatrix}^{\top}{\sf y}_{1},\begin{bmatrix}0_{pK_{i},p(L-K_{i})}\\ I_{p(L-K_{i})}\end{bmatrix}^{\top}{\sf y}_{1},...,\begin{bmatrix}0_{pK_{i},p(L-K_{i})}\\ I_{p(L-K_{i})}\end{bmatrix}^{\top}{\sf y}_{K}\right) (18b)

 

Figure 1: Illustration of the structure of the variables.

The initial condition for the input-output trajectories in (5b) can be imposed through the following equality constraints on ({\sf u}_{1},{\sf y}_{1}):

\begin{bmatrix}I_{mK_{i}}\\ 0_{m(L-K_{i}),mK_{i}}\end{bmatrix}^{\top}{\sf u}_{1}={\bf u}_{i} (16a)
\begin{bmatrix}I_{pK_{i}}\\ 0_{p(L-K_{i}),pK_{i}}\end{bmatrix}^{\top}{\sf y}_{1}={\bf y}_{i} (16b)

Also, the path constraints in (5d) can be imposed through the following inequality constraints for k=1,...,K and l=K_{i}+1,...,L:

S_{u}\begin{bmatrix}0_{m(l-1),m}\\ I_{m}\\ 0_{m(L-l),m}\end{bmatrix}^{\top}{\sf u}_{k}+S_{y}\begin{bmatrix}0_{p(l-1),p}\\ I_{p}\\ 0_{p(L-l),p}\end{bmatrix}^{\top}{\sf y}_{k}\leq s (17)

The input-output trajectories over the entire planning horizon t=-K_{i},...,N-1 can be constructed from the vectors {\sf u}_{k} and {\sf y}_{k}, k=1,...,K, through the equations in (18). We thus arrive at the following result:

Lemma 2: Suppose the system (1) is controllable and the control input trajectory u^{d}(0),u^{d}(1),...,u^{d}(M-1) is persistently exciting of order L+n. Then, a pair of input-output trajectories ({\bf u}_{[-K_{i}:N-1]},{\bf y}_{[-K_{i}:N-1]}), where N can be written as N=K(L-K_{i}) for some K\in\mathbb{Z}_{\geq 1}, is admissible by the system (1) and satisfies the initial condition in (5b) and the path constraints in (5d) if and only if there exist vectors ({\sf u}_{k},{\sf y}_{k},\zeta_{k})\in\mathbb{R}^{mL}\times\mathbb{R}^{pL}\times\mathbb{R}^{M-L+1}, k=1,...,K, that satisfy the constraints in (14), (15), (16) and (17), and ({\bf u}_{[-K_{i}:N-1]},{\bf y}_{[-K_{i}:N-1]}) relates to the vectors ({\sf u}_{k},{\sf y}_{k}), k=1,...,K, according to (18).

Proof: This result follows from Lemma 1 and the constructions of the vectors ({\sf u}_{k},{\sf y}_{k}), k=1,...,K, in (12)–(13) and the constraints in (14)–(17). \blacksquare
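To make the segmenting-and-matching construction of this section concrete, here is a minimal sketch (Python/NumPy; a scalar integrator x(t+1)=x(t)+u(t) with y=x and all dimensions are illustrative choices) that builds two Hankel-admissible segments, enforces the matching condition (15) on their K_i-sample overlap, stitches them per (18), and verifies that the stitched pair is a single admissible trajectory.

```python
import numpy as np

rng = np.random.default_rng(6)

# Toy data: scalar integrator x+ = x + u, y = x (illustrative only)
M, L, Ki = 200, 8, 1
u_d = rng.uniform(-1.0, 1.0, size=M)
y_d = np.concatenate([[0.0], np.cumsum(u_d)[:-1]])
hankel = lambda z: np.stack([z[j:j+L] for j in range(M - L + 1)], axis=1)
Hu, Hy = hankel(u_d), hankel(y_d)
H = np.vstack([Hu, Hy])

# Segment 1: any admissible pair of length L (here from an arbitrary zeta_1)
zeta1 = rng.normal(size=H.shape[1])
u1, y1 = Hu @ zeta1, Hy @ zeta1

# Segment 2: choose new future inputs, but enforce the matching condition (15):
# its first Ki input/output samples equal the last Ki samples of segment 1.
u2_future = rng.uniform(-1.0, 1.0, size=L - Ki)
known = np.vstack([Hu, Hy[:Ki]])                       # inputs + first Ki outputs
rhs = np.concatenate([u1[-Ki:], u2_future, y1[-Ki:]])
zeta2, *_ = np.linalg.lstsq(known, rhs, rcond=None)
u2, y2 = Hu @ zeta2, Hy @ zeta2

# Stitch per (18): keep the overlapping Ki samples only once
u_full = np.concatenate([u1, u2[Ki:]])
y_full = np.concatenate([y1, y2[Ki:]])

# For the integrator, admissibility of the stitched pair means y(t+1) = y(t) + u(t)
print(np.allclose(y_full[1:], y_full[:-1] + u_full[:-1], atol=1e-8))   # True
```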

4 Linear Program for Minimum-time Trajectory Optimization

Now we deal with the target condition {\bf y}_{[T:T+K_{f}-1]}\in Y_{f}. Recall that the goal is to minimize the time T at which the output trajectory reaches the target set Y_{f}=\{{\bf y}_{f}\in\mathbb{R}^{pK_{f}}:G{\bf y}_{f}\leq g,\,H{\bf y}_{f}=h\}.

Assume that the minimum time T^{*} is in the range [T_{0},T_{1}]. For a given problem, such a range may be estimated based on prior knowledge or set to be sufficiently large (e.g., T_{0}=0 and T_{1} being a large number). It will become clear soon that a smaller range (i.e., a closer estimate of T^{*}) makes the formulated problem simpler in terms of having fewer decision variables and constraints. For each t\in[T_{0},T_{1}], define a slack variable \varepsilon_{t}\in\mathbb{R}^{q_{g}+q_{h}}, where q_{g} is the dimension of g and q_{h} is that of h. Impose the following constraints for \varepsilon_{t}:

G{\bf y}_{[t:t+K_{f}-1]}-g\leq\varepsilon_{t,1:q_{g}} (19a)
H{\bf y}_{[t:t+K_{f}-1]}-h=\varepsilon_{t,q_{g}+1:q_{g}+q_{h}} (19b)

where \varepsilon_{t,1:q_{g}} denotes the first q_{g} rows of \varepsilon_{t} and \varepsilon_{t,q_{g}+1:q_{g}+q_{h}} denotes the remaining rows of \varepsilon_{t}. Then, {\bf y}_{[t:t+K_{f}-1]}\in Y_{f} if and only if 0 is a feasible value for \varepsilon_{t}.

Consider the following function:

J=\sum_{t=T_{0}}^{T_{1}}\theta^{t-T_{0}}\|\varepsilon_{t}\|_{1} (20)

where \theta>1 is a sufficiently large constant and \|\cdot\|_{1} denotes the 1-norm. The analysis in what follows shows that minimizing (20) can lead to a minimum-time trajectory solution:

Assume T^{*}\in[T_{0},T_{1}] is known and consider the following problem parameterized by \eta=\{\eta_{t}\}_{t=T^{*},...,T_{1}}:

\min_{u(\cdot),y(\cdot),\varepsilon_{(\cdot)}} \quad J_{0}(\theta,\{\varepsilon_{t}\}_{t=T_{0},...,T^{*}-1})=\sum_{t=T_{0}}^{T^{*}-1}\theta^{t-T_{0}}\|\varepsilon_{t}\|_{1} (21a)
s.t. \quad ({\bf u}_{[-K_{i}:-1]},{\bf y}_{[-K_{i}:-1]})=({\bf u}_{i},{\bf y}_{i}) (21b)
\quad c(u(t),y(t))\leq 0,\quad t=0,...,N-1 (21c)
\quad G{\bf y}_{[t:t+K_{f}-1]}-g\leq\varepsilon_{t,1:q_{g}} (21d)
\quad H{\bf y}_{[t:t+K_{f}-1]}-h=\varepsilon_{t,q_{g}+1:q_{g}+q_{h}},\quad t=T_{0},...,T_{1} (21e)
\quad \varepsilon_{t}=\eta_{t},\quad t=T^{*},...,T_{1} (21f)

where N represents a planning horizon that satisfies N\geq T_{1}+K_{f}, and \eta_{t}\in\mathbb{R}^{q_{g}+q_{h}}, t=T^{*},...,T_{1}, are parameters with the nominal value \eta_{t}^{*}=0. The following result clarifies the relation between the minimum-time problem of interest, (5), and the above problem (21):

Theorem 1: (i) Suppose Assumptions 2 and 3 hold. Let ({\bf u}_{[-K_{i}:T^{*}+K_{f}-1]}^{*},{\bf y}_{[-K_{i}:T^{*}+K_{f}-1]}^{*},T^{*}) be an optimal solution to the minimum-time problem (5) with T^{*}\in[T_{0},T_{1}]. Then, (21) with \eta=0 has a feasible solution ({\bf u}_{[-K_{i}:N-1]}^{\prime},{\bf y}_{[-K_{i}:N-1]}^{\prime},\{\varepsilon_{t}^{\prime}\}_{t=T_{0},...,T_{1}}) that satisfies ({\bf u}_{[-K_{i}:T^{*}+K_{f}-1]}^{\prime},{\bf y}_{[-K_{i}:T^{*}+K_{f}-1]}^{\prime})=({\bf u}_{[-K_{i}:T^{*}+K_{f}-1]}^{*},{\bf y}_{[-K_{i}:T^{*}+K_{f}-1]}^{*}) and \varepsilon_{t}^{\prime}=0 for t=T^{*},...,T_{1}.

(ii) Suppose (5) is feasible and has a minimum time T^{*}\in[T_{0},T_{1}]. Let ({\bf u}_{[-K_{i}:N-1]}^{\prime},{\bf y}_{[-K_{i}:N-1]}^{\prime},\{\varepsilon_{t}^{\prime}\}_{t=T_{0},...,T_{1}}) be an arbitrary feasible solution to (21) with \eta=0. Then, the triple ({\bf u}_{[-K_{i}:T^{*}+K_{f}-1]}^{\prime},{\bf y}_{[-K_{i}:T^{*}+K_{f}-1]}^{\prime},T^{*}) is an optimal solution to (5).

Proof: For (i), let ({\bf u}_{[-K_{i}:T^{*}+K_{f}-1]}^{*},{\bf y}_{[-K_{i}:T^{*}+K_{f}-1]}^{*},T^{*}) be an optimal solution to (5). Because K_{i}\geq l_{\min} and T^{*}\geq 0, \bar{K}_{f}=\max(K_{f},l_{\min}) must satisfy -K_{i}\leq T^{*}+K_{f}-\bar{K}_{f}\leq T^{*}+K_{f}-1. Then, under Assumption 2, ({\bf u}_{[T^{*}+K_{f}-\bar{K}_{f}:T^{*}+K_{f}-1]}^{*},{\bf y}_{[T^{*}+K_{f}-\bar{K}_{f}:T^{*}+K_{f}-1]}^{*}) is an admissible pair of input-output trajectories of length \bar{K}_{f} that satisfies c(u(t),y(t))\leq 0 for t=T^{*}+K_{f}-\bar{K}_{f},...,T^{*}+K_{f}-1 and {\bf y}_{[T^{*}:T^{*}+K_{f}-1]}\in Y_{f}. In this case, under Assumption 3, there exist inputs u^{\prime}(T^{*}+K_{f}),...,u^{\prime}(N-1) and the corresponding outputs y^{\prime}(T^{*}+K_{f}),...,y^{\prime}(N-1) such that c(u(t),y(t))\leq 0 for t=T^{*}+K_{f},...,N-1 and {\bf y}_{[t:t+K_{f}-1]}\in Y_{f} for t=T^{*}+1,...,T_{1}. Now let {\bf u}_{[-K_{i}:N-1]}^{\prime}=\text{col}({\bf u}_{[-K_{i}:T^{*}+K_{f}-1]}^{*},u^{\prime}(T^{*}+K_{f}),...,u^{\prime}(N-1)) and {\bf y}_{[-K_{i}:N-1]}^{\prime}=\text{col}({\bf y}_{[-K_{i}:T^{*}+K_{f}-1]}^{*},y^{\prime}(T^{*}+K_{f}),...,y^{\prime}(N-1)), and let \varepsilon_{t}^{\prime} be defined according to \varepsilon_{t,1:q_{g}}^{\prime}=G{\bf y}_{[t:t+K_{f}-1]}^{\prime}-g and \varepsilon_{t,q_{g}+1:q_{g}+q_{h}}^{\prime}=H{\bf y}_{[t:t+K_{f}-1]}^{\prime}-h for t=T_{0},...,T^{*}-1 and \varepsilon_{t}^{\prime}=0 for t=T^{*},...,T_{1}. Then, the triple ({\bf u}_{[-K_{i}:N-1]}^{\prime},{\bf y}_{[-K_{i}:N-1]}^{\prime},\{\varepsilon_{t}^{\prime}\}_{t=T_{0},...,T_{1}}) is a feasible solution to (21) with \eta=0. This proves part (i).

For part (ii), let ({\bf u}_{[-K_{i}:N-1]}^{\prime},{\bf y}_{[-K_{i}:N-1]}^{\prime},\{\varepsilon_{t}^{\prime}\}_{t=T_{0},...,T_{1}}) be a feasible solution to (21) with \eta=0. Due to the constraints in (21d)–(21f), this solution satisfies {\bf y}_{[T^{*}:T^{*}+K_{f}-1]}^{\prime}\in Y_{f}. Therefore, the triple ({\bf u}_{[-K_{i}:T^{*}+K_{f}-1]}^{\prime},{\bf y}_{[-K_{i}:T^{*}+K_{f}-1]}^{\prime},T^{*}) is an optimal solution to the minimum-time problem (5). \blacksquare

Remark 2: From the proof of Theorem 1, the necessity of both Assumptions 2 and 3 for guaranteeing the result of part (i) should be clear. For instance, suppose the second half of Assumption 2 does not hold, i.e., the pair ({\bf u}_{i},{\bf y}_{i}) does not satisfy the path constraints c(u(t),y(t))\leq 0 at all times, and suppose K_{f}<l_{\min} and T^{*}<l_{\min}-K_{f}. Then, the pair ({\bf u}_{[T^{*}+K_{f}-\bar{K}_{f}:T^{*}+K_{f}-1]}^{*},{\bf y}_{[T^{*}+K_{f}-\bar{K}_{f}:T^{*}+K_{f}-1]}^{*}) may not satisfy c(u(t),y(t))\leq 0 for all t=T^{*}+K_{f}-\bar{K}_{f},...,T^{*}+K_{f}-1. Consequently, for such a pair of input-output trajectories of length \bar{K}_{f}, there may not exist an input u^{\prime}(T^{*}+K_{f}) that is able to maintain the output trajectory in Y_{f} while satisfying the path constraints, even if Assumption 3 holds. In contrast, the result of part (ii) does not rely on Assumptions 2 and 3.

Theorem 1 indicates that minimum-time trajectory solutions (i.e., optimal solutions to (5)) can be obtained through (21). However, the formulation of (21) relies on exact knowledge of the minimum time T^{*}, which is typically not known a priori (otherwise the problem reduces to a fixed-time trajectory planning problem). In what follows we show that an optimal solution to (21) can be obtained through a related problem whose formulation does not rely on exact knowledge of T^{*}. The technique originates from exact penalty methods for handling constraints in constrained optimization.

Now let ({\bf u}_{[-K_{i}:N-1]}^{*}(\theta),{\bf y}_{[-K_{i}:N-1]}^{*}(\theta),\{\varepsilon_{t}^{*}(\theta)\}_{t=T_{0},...,T_{1}}) be an optimal solution to (21) with \eta=\{\eta_{t}\}_{t=T^{*},...,T_{1}}=0 and a certain value of \theta. Let \lambda_{k}(\theta)\in\mathbb{R}^{q_{g}+q_{h}} be the Lagrange multiplier associated with the constraint \varepsilon_{k}=\eta_{k}, k=T^{*},...,T_{1}. Its value satisfies

\lambda_{k,i}(\theta)=\frac{\text{d}J_{0}(\theta,\{\varepsilon_{t}^{*}(\theta,\eta_{k,i})\}_{t=T_{0},...,T^{*}-1})}{\text{d}\eta_{k,i}},\quad i=1,...,q_{g}+q_{h} (22)

where \varepsilon_{t}^{*}(\theta,\eta_{k,i}) denotes the perturbed value of \varepsilon_{t}^{*}(\theta) due to a perturbation \eta_{k,i}\in\mathbb{R} to the i-th entry of the parameter \eta_{k}, i.e., the Lagrange multiplier represents the sensitivity of the cost to perturbations to the constraint (Büskens and Maurer, 2001). We make the following assumption about the optimal solution ({\bf u}_{[-K_{i}:N-1]}^{*}(\theta),{\bf y}_{[-K_{i}:N-1]}^{*}(\theta),\{\varepsilon_{t}^{*}(\theta)\}_{t=T_{0},...,T_{1}}) to (21):

Assumption 4: For sufficiently large \theta and t=T_{0},...,T^{*}-1, the sensitivity of \varepsilon_{t}^{*}(\theta) to perturbations to the constraints \varepsilon_{k}=\eta_{k}, k=T^{*},...,T_{1}, is bounded by a constant R, i.e.,

\left\|\frac{\text{d}\varepsilon_{t}^{*}(\theta,\eta_{k,i})}{\text{d}\eta_{k,i}}\right\|_{1}\leq R,\quad i=1,...,q_{g}+q_{h} (23)

where \|\cdot\|_{1} denotes the 1-norm.

Under Assumption 4, we can derive the following bound on \lambda_{k,i}(\theta):

|\lambda_{k,i}(\theta)|=\left|\frac{\text{d}J_{0}(\theta,\{\varepsilon_{t}^{*}(\theta,\eta_{k,i})\}_{t=T_{0},...,T^{*}-1})}{\text{d}\eta_{k,i}}\right|\leq\sum_{t=T_{0}}^{T^{*}-1}\left|\frac{\partial J_{0}}{\partial\varepsilon_{t}^{*}}\cdot\frac{\partial\varepsilon_{t}^{*}}{\partial\eta_{k,i}}\right|\leq\sum_{t=T_{0}}^{T^{*}-1}\theta^{t-T_{0}}\left\|\frac{\text{d}\varepsilon_{t}^{*}}{\text{d}\eta_{k,i}}\right\|_{1}\leq R\sum_{t=T_{0}}^{T^{*}-1}\theta^{t-T_{0}}\leq\frac{R}{\theta-1}\theta^{T^{*}-T_{0}} (24)

Hence, if \theta\geq R+1, we have

\|\lambda_{k}(\theta)\|_{\infty}=\max_{i}|\lambda_{k,i}(\theta)|\leq\theta^{T^{*}-T_{0}} (25)

for k=T^{*},...,T_{1}, where \|\cdot\|_{\infty} denotes the sup-norm. We arrive at the following result:

Theorem 2: Suppose (5) is feasible and has a minimum time T^{*}\in[T_{0},T_{1}]. Let ({\bf u}_{[-K_{i}:N-1]}^{*}(\theta),{\bf y}_{[-K_{i}:N-1]}^{*}(\theta),\{\varepsilon_{t}^{*}(\theta)\}_{t=T_{0},...,T_{1}}) be an optimal solution to (21) with \eta=0 and \theta>1. Suppose Assumption 4 holds and \theta is sufficiently large. Then, ({\bf u}_{[-K_{i}:N-1]}^{*}(\theta),{\bf y}_{[-K_{i}:N-1]}^{*}(\theta),\{\varepsilon_{t}^{*}(\theta)\}_{t=T_{0},...,T_{1}}) is an optimal solution to the following problem, the formulation of which does not rely on T^{*}:

\min_{u(\cdot),y(\cdot),\varepsilon_{(\cdot)}} \quad J=\sum_{t=T_{0}}^{T_{1}}\theta^{t-T_{0}}\|\varepsilon_{t}\|_{1} (26a)
s.t. \quad ({\bf u}_{[-K_{i}:-1]},{\bf y}_{[-K_{i}:-1]})=({\bf u}_{i},{\bf y}_{i}) (26b)
\quad c(u(t),y(t))\leq 0,\quad t=0,...,N-1 (26c)
\quad G{\bf y}_{[t:t+K_{f}-1]}-g\leq\varepsilon_{t,1:q_{g}} (26d)
\quad H{\bf y}_{[t:t+K_{f}-1]}-h=\varepsilon_{t,q_{g}+1:q_{g}+q_{h}},\quad t=T_{0},...,T_{1} (26e)

Proof: The cost function of (26) can be written as J=J_{0}+\sum_{t=T^{*}}^{T_{1}}\theta^{t-T_{0}}\|\varepsilon_{t}\|_{1}, where J_{0} is the cost function of (21). If Assumption 4 holds and \theta\geq R+1, then according to (25), for t=T^{*},...,T_{1}, the Lagrange multiplier \lambda_{t}(\theta) associated with the constraint \varepsilon_{t}=\eta_{t}=0 of (21) satisfies \|\lambda_{t}(\theta)\|_{\infty}\leq\theta^{T^{*}-T_{0}}\leq\theta^{t-T_{0}}. This implies that the term \theta^{t-T_{0}}\|\varepsilon_{t}\|_{1} in the cost function of (26) is an exact penalty for the constraint \varepsilon_{t}=0. Therefore, (26) is a reformulation of (21) obtained by replacing all constraints in (21f) with exact penalties, and hence the result follows (Han and Mangasarian, 1979). \blacksquare

Theorem 2 indicates that optimal solutions to (21), which, according to Theorem 1, are optimal solutions to (5) (i.e., minimum-time trajectories), can be obtained through (26). The cost function of (26), which is non-smooth due to the 1-norms, can be readily converted into a smooth function using a linear programming reformulation. We combine the results of the previous section on data-based trajectory prediction and of this section on minimum-time trajectory planning and arrive at the following problem formulation:

\min_{\{{\sf u}_{k},{\sf y}_{k},\zeta_{k}\}_{k=1,...,K},\{\varepsilon_{t}\geq 0\}_{t=T_{0},...,T_{1}}} \quad J=\sum_{t=T_{0}}^{T_{1}}\theta^{t-T_{0}}(1_{q_{g}+q_{h}}^{\top}\varepsilon_{t}) (27)
s.t.
dynamic constraints: (14) for k=1,...,K
matching conditions: (15) for k=1,...,K-1
initial condition: (16)
path constraints: (17) for k=1,...,K and l=K_{i}+1,...,L
terminal condition:
G{\bf y}_{[t:t+K_{f}-1]}-g\leq\varepsilon_{t,1:q_{g}}
H{\bf y}_{[t:t+K_{f}-1]}-h\leq\varepsilon_{t,q_{g}+1:q_{g}+q_{h}}
h-H{\bf y}_{[t:t+K_{f}-1]}\leq\varepsilon_{t,q_{g}+1:q_{g}+q_{h}}\quad\text{for }t=T_{0},...,T_{1}

in which the dimensions of the decision variables are {\sf u}_{k}\in\mathbb{R}^{mL}, {\sf y}_{k}\in\mathbb{R}^{pL}, \zeta_{k}\in\mathbb{R}^{M-L+1}, \varepsilon_{t}\in\mathbb{R}^{q_{g}+q_{h}}, and the number K should be chosen as K=\lceil\frac{T_{1}+K_{f}}{L-K_{i}}\rceil, where \lceil\cdot\rceil means rounding up to the nearest integer. As discussed in the second paragraph of this section, the range [T_{0},T_{1}] may be estimated based on prior knowledge about the problem or set as [T_{0},T_{1}]=[0,T^{\prime}] with T^{\prime} sufficiently large. The cost function of (27) is a linear function of the decision variables and all constraints of (27) are either linear equalities or linear inequalities in the decision variables. Hence, (27) is a linear program (LP) and can be easily solved using off-the-shelf LP solvers.
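As a concrete (and deliberately small) illustration of the resulting linear program, the following sketch (Python with NumPy and SciPy's linprog; a scalar integrator with a point target and a single segment K=1, none of which is the paper's spacecraft example) assembles the exponentially weighted cost, the data-based dynamic constraint (14), the initial condition (16), input bounds, and the slack-based terminal condition, and then reads the minimum time off the slack values.

```python
import numpy as np
from scipy.optimize import linprog

rng = np.random.default_rng(0)

# Toy "actual system" (illustrative only): scalar integrator x+ = x + u, y = x.
def simulate(x0, u_seq):
    x, ys = x0, []
    for u in u_seq:
        ys.append(x)
        x = x + u
    return np.array(ys)

# Offline data and the Hankel model; a single segment (K = 1), so L = Ki + N.
M, L, Ki = 200, 11, 1                         # N = L - Ki = 10 planning steps
u_d = rng.uniform(-1.0, 1.0, size=M)
y_d = simulate(0.0, u_d)
hankel = lambda z: np.stack([z[j:j+L] for j in range(M - L + 1)], axis=1)
H = np.vstack([hankel(u_d), hankel(y_d)])     # col(H_L(u^d), H_L(y^d))

# Problem data: start from y = 0, reach the point target y_f = 1, |u| <= 0.25.
u_i, y_i, y_f, u_max = 0.0, 0.0, 1.0, 0.25    # Kf = 1: single-sample target
T0, T1, theta = 0, 8, 2.0
nz = H.shape[1]                               # number of auxiliary variables zeta
n_eps = T1 - T0 + 1
nvar = 2 * L + nz + n_eps                     # decision vector [u; y; zeta; eps]

# Cost (20): exponentially weighted slacks.
c = np.zeros(nvar)
c[2*L + nz:] = theta ** np.arange(n_eps)

# Equalities: data-based dynamics [u; y] = H zeta (14) and initial condition (16).
A_eq = np.zeros((2*L + 2, nvar)); b_eq = np.zeros(2*L + 2)
A_eq[:2*L, :2*L] = np.eye(2*L); A_eq[:2*L, 2*L:2*L + nz] = -H
A_eq[2*L, 0] = 1.0;     b_eq[2*L] = u_i       # u(-1) = u_i
A_eq[2*L + 1, L] = 1.0; b_eq[2*L + 1] = y_i   # y(-1) = y_i

# Inequalities: |y(t) - y_f| <= eps_t for t = T0,...,T1 (terminal condition with slack).
A_ub, b_ub = [], []
for k, t in enumerate(range(T0, T1 + 1)):
    y_idx, e_idx = L + Ki + t, 2*L + nz + k
    row = np.zeros(nvar); row[y_idx] = 1.0; row[e_idx] = -1.0
    A_ub.append(row); b_ub.append(y_f)        #  y(t) - y_f <= eps_t
    row = np.zeros(nvar); row[y_idx] = -1.0; row[e_idx] = -1.0
    A_ub.append(row); b_ub.append(-y_f)       #  y_f - y(t) <= eps_t
A_ub, b_ub = np.array(A_ub), np.array(b_ub)

# Bounds: path constraint |u| <= u_max, eps >= 0; y and zeta are free.
bounds = ([(-u_max, u_max)] * L + [(None, None)] * L
          + [(None, None)] * nz + [(0, None)] * n_eps)

sol = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq,
              bounds=bounds, method="highs")
eps = sol.x[2*L + nz:]
T_star = T0 + next(k for k, e in enumerate(eps) if abs(e) < 1e-6)
print("minimum time:", T_star)                # 4 steps for this toy setup (1 / 0.25)
```

In the full formulation (27), the same pattern is repeated per segment k, with the matching conditions (15) and the path constraints (17) added as further linear equalities and inequalities.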

Remark 3: In (27), the auxiliary variables \zeta_{k}\in\mathbb{R}^{M-L+1} only appear in the dynamic constraints (14). Using the following approach we can drop these variables from the problem: Let {\sf v}_{k}=\text{col}({\sf u}_{k},{\sf y}_{k})\in\mathbb{R}^{(m+p)L} and \mathcal{H}=\text{col}(\mathcal{H}_{L}(u^{d}),\mathcal{H}_{L}(y^{d}))\in\mathbb{R}^{(m+p)L\times(M-L+1)}. Then, (14) can be written as {\sf v}_{k}=\mathcal{H}\zeta_{k}. Assume \text{rank}(\mathcal{H})=r. We partition the rows of \mathcal{H} into two groups: \mathcal{H}_{1}\in\mathbb{R}^{r\times(M-L+1)} collects r linearly independent rows and \mathcal{H}_{2}\in\mathbb{R}^{((m+p)L-r)\times(M-L+1)} collects the remaining rows. Since \text{rank}(\mathcal{H})=r, the rows of \mathcal{H}_{2} are linear combinations of the rows of \mathcal{H}_{1}, i.e., \mathcal{H}_{2} can be written as \mathcal{H}_{2}=\Gamma\mathcal{H}_{1}, where \Gamma\in\mathbb{R}^{((m+p)L-r)\times r} is uniquely determined by \Gamma=\mathcal{H}_{2}\mathcal{H}_{1}^{\dagger}. Then, (14) can be written as

\begin{bmatrix}{\sf v}_{k,1}\\ {\sf v}_{k,2}\end{bmatrix}=\begin{bmatrix}\mathcal{H}_{1}\\ \mathcal{H}_{2}\end{bmatrix}\zeta_{k}=\begin{bmatrix}\mathcal{H}_{1}\zeta_{k}\\ \Gamma\mathcal{H}_{1}\zeta_{k}\end{bmatrix}=\begin{bmatrix}\mathcal{H}_{1}\zeta_{k}\\ \Gamma{\sf v}_{k,1}\end{bmatrix} (28)

where {\sf v}_{k,1} (resp. {\sf v}_{k,2}) collects the entries of {\sf v}_{k} corresponding to the rows of \mathcal{H}_{1} (resp. \mathcal{H}_{2}), and the last equality is obtained by substituting {\sf v}_{k,1}=\mathcal{H}_{1}\zeta_{k} from the first row into the second row. Recall that, according to Lemma 1, {\sf v}_{k}=\text{col}({\sf u}_{k},{\sf y}_{k}) represents an admissible pair of input-output trajectories if and only if there exists \zeta_{k}\in\mathbb{R}^{M-L+1} such that (14) (equivalently, (28)) holds. Since {\sf v}_{k,1}\in\mathbb{R}^{r} and \mathcal{H}_{1} has a rank of r, for any {\sf v}_{k,1} there exists \zeta_{k} such that the equation {\sf v}_{k,1}=\mathcal{H}_{1}\zeta_{k} in the first row of (28) is satisfied. In this case, there exists \zeta_{k}\in\mathbb{R}^{M-L+1} such that (28) holds if and only if {\sf v}_{k,1} and {\sf v}_{k,2} satisfy the equation {\sf v}_{k,2}=\Gamma{\sf v}_{k,1} in the second row of (28). Therefore, we can replace the dynamic constraints (14) in (27) with the linear-equality constraints {\sf v}_{k,2}=\Gamma{\sf v}_{k,1} for k=1,...,K, after which we obtain a linear program that is equivalent to (27) while not involving the auxiliary variables \zeta_{k}. Note that the partition of \mathcal{H}=\text{col}(\mathcal{H}_{L}(u^{d}),\mathcal{H}_{L}(y^{d})) into \mathcal{H}_{1} and \mathcal{H}_{2} and the calculation of \Gamma=\mathcal{H}_{2}\mathcal{H}_{1}^{\dagger} can be done offline, and they are independent of k.
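A minimal sketch of the elimination described in Remark 3 (Python with NumPy/SciPy; the toy integrator data and the use of a pivoted QR factorization to select the independent rows are illustrative choices, not prescribed by the paper):

```python
import numpy as np
from scipy.linalg import qr

rng = np.random.default_rng(4)

# Illustrative data from a toy system (scalar integrator x+ = x + u, y = x)
M, L = 200, 8
u_d = rng.uniform(-1.0, 1.0, size=M)
y_d = np.concatenate([[0.0], np.cumsum(u_d)[:-1]])
hankel = lambda z: np.stack([z[j:j+L] for j in range(M - L + 1)], axis=1)
H = np.vstack([hankel(u_d), hankel(y_d)])               # (m+p)L x (M-L+1)

# Partition the rows of H into r independent rows (H1) and the rest (H2)
r = np.linalg.matrix_rank(H)
_, _, piv = qr(H.T, pivoting=True)                      # pivoted QR over the rows of H
idx1, idx2 = np.sort(piv[:r]), np.sort(piv[r:])
H1, H2 = H[idx1], H[idx2]
Gamma = H2 @ np.linalg.pinv(H1)                         # Gamma = H2 H1^dagger (offline)

# Any admissible stacked pair v_k = H zeta_k then satisfies v_{k,2} = Gamma v_{k,1},
# so the LP can impose this low-dimensional equality instead of carrying zeta_k.
zeta = rng.normal(size=H.shape[1])
v = H @ zeta
print(np.allclose(v[idx2], Gamma @ v[idx1], atol=1e-6))  # True
```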

5 Example: Spacecraft Relative Motion Planning

To illustrate the approach, we consider a motion planning problem for a low-thrust spacecraft relative to a target body on a nominal circular orbit. The continuous-time dynamics are represented by the Clohessy-Wiltshire-Hill (CWH) equations:

\dot{x}=A_{c}x+B_{c}u (29)

with

A_{c}=\begin{bmatrix}0&0&0&1&0&0\\ 0&0&0&0&1&0\\ 0&0&0&0&0&1\\ 3\omega^{2}&0&0&0&2\omega&0\\ 0&0&0&-2\omega&0&0\\ 0&0&-\omega^{2}&0&0&0\end{bmatrix}\quad B_{c}=\begin{bmatrix}0_{3,3}\\ \frac{T_{\max}}{m_{s}}I_{3}\end{bmatrix} (30)

where \omega=\sqrt{\mu/r_{o}^{3}} is the orbital rate of the target (in radians/s), \mu=398,600 km^3/s^2 is the gravitational parameter, r_{o}=6,928 km is the radius of the target's circular orbit, m_{s}=50 kg is the mass of the ego spacecraft, and T_{\max}=2\times 10^{-4} kN represents the ego spacecraft's maximum thrust. The first three entries of the state vector x\in\mathbb{R}^{6} represent the relative positions of the ego spacecraft with respect to the target body along the {\sf x}-, {\sf y}-, and {\sf z}-axes of the CWH frame (in km), and the last three entries represent the relative velocity components (in km/s). The entries of the control input vector u\in\mathbb{R}^{3} represent the thrust level components of the ego spacecraft along the {\sf x}-, {\sf y}-, and {\sf z}-axes. For simplicity, we discretize the continuous-time model (29) using the forward Euler method with a sampling period of \Delta t=10 s and obtain the discrete-time model:

x(t+1)=\underbrace{(I_{6}+A_{c}\Delta t)}_{A}x(t)+\underbrace{(B_{c}\Delta t)}_{B}u(t) (31)

We treat the discrete-time model (31) as the actual system and use it for both data generation and simulation. Also for simplicity, we assume a sup-norm bound on the control input vector, \|u(t)\|_{\infty}\leq 1, which can be equivalently expressed as individual bounds on each thrust level component:

-1\leq u_{i}(t)\leq 1,\quad i\in\{x,y,z\} (32)

We remark that higher-fidelity modeling is possible: For piecewise-constant or piecewise-linear control inputs, exact discrete-time models can be obtained through the zero- or first-order hold method. For a 2-norm bound on the thrust level vector, \|u(t)\|_{2}\leq 1, one can use a polygonal approximation (Blackmore et al., 2012). The considered task is for the ego spacecraft to travel from the initial condition x_{i}=(-1,0,-1,0,0,0) to the target condition x_{f}=(0,0,0,0,0,0) in the minimum time.

Assume we do not have the model (31). This may be due to uncertainty about the target's orbital rate \omega, or uncertainty about the ego spacecraft's precise mass m_{s}, the maximum thrust T_{\max} its propulsion system produces, or thruster alignment. Further assume that we are only able to measure relative positions, i.e.,

y(t)=Cx(t)=\begin{bmatrix}I_{3}&0_{3,3}\end{bmatrix}x(t) (33)

We apply the approach developed in this paper to solve the minimum-time trajectory planning problem for the ego spacecraft in such a setting.
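The following sketch (Python/NumPy) constructs the CWH model with the parameter values given above, applies the forward-Euler discretization (31) and the position-only measurement (33), and generates an input-output data trajectory; the random-input excitation and the zero initial state used for data generation are illustrative choices.

```python
import numpy as np

# CWH model with the parameter values stated above
mu, r_o = 398600.0, 6928.0          # km^3/s^2, km
omega = np.sqrt(mu / r_o**3)        # rad/s
m_s, T_max, dt = 50.0, 2e-4, 10.0   # kg, kN, s

A_c = np.array([
    [0, 0, 0, 1, 0, 0],
    [0, 0, 0, 0, 1, 0],
    [0, 0, 0, 0, 0, 1],
    [3*omega**2, 0, 0, 0, 2*omega, 0],
    [0, 0, 0, -2*omega, 0, 0],
    [0, 0, -omega**2, 0, 0, 0],
], dtype=float)
B_c = np.vstack([np.zeros((3, 3)), (T_max / m_s) * np.eye(3)])

# Forward-Euler discretization (31) and position-only measurement (33)
A = np.eye(6) + A_c * dt
B = B_c * dt
C = np.hstack([np.eye(3), np.zeros((3, 3))])

# Generate an input-output data trajectory with inputs respecting |u_i| <= 1 (32)
rng = np.random.default_rng(5)
M = 10_000
u_d = rng.uniform(-1.0, 1.0, size=(M, 3))
x = np.zeros(6)                      # illustrative initial state for data collection
y_d = np.empty((M, 3))
for t in range(M):
    y_d[t] = C @ x
    x = A @ x + B @ u_d[t]
# (u_d, y_d) would then be used to build the Hankel matrices H_L(u^d), H_L(y^d).
```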

First, we build a data-based model (\mathcal{H}_{L}(u^{d}),\mathcal{H}_{L}(y^{d})) with L=40 using a pair of input-output trajectories (u^{d},y^{d}) of length M=10,000. We verified that this trajectory satisfies the persistent excitation condition of order L+n=46. The lag of this system is l_{\min}=2. Therefore, we choose K_{i}=2 and K_{f}=2 and consider the following initial condition for ({\bf u}_{[-2:-1]},{\bf y}_{[-2:-1]}) and terminal condition for {\bf y}_{[T:T+1]}:

{\bf u}_{i}=\text{col}((0,0,0),(0,0,0)),\quad{\bf y}_{i}=\text{col}((-1,0,-1),(-1,0,-1)) (34a)
Y_{f}={\bf y}_{f}=\text{col}((0,0,0),(0,0,0)) (34b)

These conditions equivalently express the initial and terminal conditions x_{i} and x_{f} for the state vector x. We estimate that the minimum time T^{*} is in the range [T_{0},T_{1}]=[100,140], and hence we choose K=\lceil\frac{T_{1}+K_{f}}{L-K_{i}}\rceil=4. In the LP formulation (27), we use \theta=2, which is shown to be sufficiently large to yield a minimum-time trajectory solution.

To validate the result obtained by our approach, we also implement a (state-space) model-based mixed-integer programming (MIP) approach for minimum-time trajectory optimization, which is modified from the approach of Carvallo et al. (1990). The MIP formulation for the considered spacecraft relative motion planning problem is given as:

\min_{u(\cdot),x(\cdot),\delta(\cdot)} \quad \sum_{t=T_{0}}^{T_{1}}t\,\delta(t) (35a)
s.t. dynamic constraints:
\quad x(t+1)=Ax(t)+Bu(t),\quad t=0,...,T_{1}-1 (35b)
initial condition: x(0)=x_{i} (35c)
path constraints:
\quad -1_{m}\leq u(t)\leq 1_{m},\quad t=0,...,T_{1}-1 (35d)
terminal condition:
\quad x(t)-x_{f}\leq(1-\delta(t))W1_{n}
\quad x_{f}-x(t)\leq(1-\delta(t))W1_{n} (35e)
\quad \delta(t)\in\{0,1\},\quad t=T_{0},...,T_{1}
\quad \sum_{t=T_{0}}^{T_{1}}\delta(t)=1 (35f)

where n=6, m=3, \delta(t)\in\{0,1\}, t=T_{0},...,T_{1}, are indicator variables, W>0 is a large constant, the constraints in (35e) mean that the state reaches the target condition x_{f} at time t if \delta(t)=1, the constraint in (35f) makes sure that a feasible trajectory must reach x_{f} at some t\in[T_{0},T_{1}], and the goal represented by the cost function in (35a) is to minimize the time to reach x_{f}.
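For comparison, here is a compact sketch of the MIP (35) (Python, using SciPy's milp, which requires SciPy 1.9 or later; the scalar-integrator instance and the big-M constant are illustrative, not the spacecraft problem).

```python
import numpy as np
from scipy.optimize import milp, LinearConstraint, Bounds

# Toy instance of the MIP (35): scalar integrator x+ = x + u, |u| <= 0.25,
# drive x from 0 to 1; the big-M constant W and the horizon are illustrative.
n_x, n_u = 1, 1
T0, T1 = 0, 8
u_max, x_i, x_f, W = 0.25, 0.0, 1.0, 10.0

nX = n_x * (T1 + 1)                      # x(0), ..., x(T1)
nU = n_u * T1                            # u(0), ..., u(T1-1)
nD = T1 - T0 + 1                         # delta(T0), ..., delta(T1)
nvar = nX + nU + nD
xi = lambda t: t                         # index of x(t)
ui = lambda t: nX + t                    # index of u(t)
di = lambda t: nX + nU + (t - T0)        # index of delta(t)

rows, lb, ub = [], [], []
def add(coefs, lo, hi):
    row = np.zeros(nvar)
    for j, v in coefs: row[j] = v
    rows.append(row); lb.append(lo); ub.append(hi)

# Dynamics (35b) and initial condition (35c) as equalities
for t in range(T1):
    add([(xi(t+1), 1.0), (xi(t), -1.0), (ui(t), -1.0)], 0.0, 0.0)
add([(xi(0), 1.0)], x_i, x_i)

# Terminal big-M constraints (35e): |x(t) - x_f| <= (1 - delta(t)) W
for t in range(T0, T1 + 1):
    add([(xi(t), 1.0), (di(t), W)], -np.inf, x_f + W)
    add([(xi(t), -1.0), (di(t), W)], -np.inf, -x_f + W)

# Exactly one delta(t) equals one (35f)
add([(di(t), 1.0) for t in range(T0, T1 + 1)], 1.0, 1.0)

constraints = LinearConstraint(np.array(rows), np.array(lb), np.array(ub))

# Cost (35a): sum_t t * delta(t)
c = np.zeros(nvar)
for t in range(T0, T1 + 1): c[di(t)] = t

# Bounds (35d) and integrality of delta
lower = np.concatenate([-np.inf*np.ones(nX), -u_max*np.ones(nU), np.zeros(nD)])
upper = np.concatenate([ np.inf*np.ones(nX),  u_max*np.ones(nU), np.ones(nD)])
integrality = np.concatenate([np.zeros(nX + nU), np.ones(nD)])

res = milp(c, constraints=constraints, integrality=integrality,
           bounds=Bounds(lower, upper))
delta = res.x[nX + nU:]
print("minimum time:", T0 + int(np.argmax(delta)))   # 4 for this toy instance
```

On this toy instance, the MIP returns the same minimum time (4 steps) as the LP sketch given after (27).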

Figure 2: Spacecraft position trajectories from our LP approach (black solid) versus from the model-based MIP approach (red dotted).
Figure 3: Thrust level vector time histories from our LP approach (black solid) versus from the model-based MIP approach (red dotted).

Figs. 2–4 illustrate the trajectory solutions from our LP approach based on the data-based model (black solid curves) and the MIP approach based on the state-space model (red dotted curves). Fig. 2 shows the ego spacecraft relative position trajectories, and Fig. 3 shows the thrust level vector time histories. It can be seen that the trajectories from the two approaches are different, especially in the z-direction. However, the two trajectories have the same time-of-flight, TOF=T^{*}\times\Delta t=124\times 10=1240 s, where T^{*}=124 is the minimum (discrete) time obtained by both approaches and \Delta t=10 s is the sampling period, i.e., the two trajectories are both minimum-time trajectories. It is well-known that in a discrete-time setting minimum-time trajectories are typically not unique. While the MIP approach (35) returns an arbitrary minimum-time solution, our LP approach (27) returns a minimum-time solution that also minimizes the secondary objective function J_{0}=\sum_{t=T_{0}}^{T^{*}-1}\theta^{t-T_{0}}\|{\bf y}_{[t:t+1]}-{\bf y}_{f}\|_{1} (roughly, the cumulative distance from the target condition). Correspondingly, it can be seen from Fig. 3 that the thrust vector profile obtained by our LP approach is smoother than that from the MIP approach (to avoid obtaining a non-smooth control profile due to the non-uniqueness of minimum-time trajectories, one can add a small regularization term to the cost function (Carvallo et al., 1990); in our LP approach, the secondary objective function J_{0} plays the role of such a regularization term). Fig. 4 shows the values of the slack variables: \varepsilon_{t} in our LP approach (27) and \delta(t) in the MIP approach (35). At t=T^{*}=124 (corresponding to continuous time T^{*}\times\Delta t=1240 s), \varepsilon_{t} converges to zero (according to the criterion \|\varepsilon_{t}\|_{1}<10^{-3}) and \delta(t) equals one, indicating that both approaches capture the minimum time T^{*}=124.

Figure 4: Slack variable values: \varepsilon_{t} in our LP approach (black solid) versus \delta(t) in the model-based MIP approach (red dotted).

In terms of computation, the difference between the two approaches is significant: We solve the problems in the MATLAB environment running on a Windows 10 Enterprise desktop with an Intel Xeon Gold 2.30 GHz processor and 32 GB of RAM. It takes the LP approach 76 milliseconds (averaged over 10 experiments) to find a minimum-time solution (after elimination of the auxiliary variables \zeta_{k} using the approach in Remark 3, solved with the MATLAB linprog function and default settings), while it takes the MIP approach 4.43 seconds to find one (solved with the MATLAB intlinprog function and default settings) – our LP approach based on the data-based model is approximately 58 times faster than the MIP approach based on the state-space model.

6 Conclusions

In this paper, we developed an LP-based approach to minimum-time trajectory optimization using input-output data-based models. The approach was based on an effective integration of non-parametric data-based models for trajectory prediction and a continuous optimization formulation using an exponential weighting scheme for minimum-time trajectory planning. We proved that minimum-time trajectories could be obtained with the approach if the weight parameter was chosen to be sufficiently high. We validated the approach and illustrated its application with an aerospace example. Future work includes integrating the approach into a model predictive control framework to achieve repeated replanning and closed-loop control, where real-time data may be used to update the model, and extending the approach to handle noisy data and nonlinear systems.

References

  • Baros et al. (2022) Baros, S., Chang, C.-Y., Colon-Reyes, G. E., Bernstein, A., 2022. Online data-enabled predictive control. Automatica 138, 109926.
  • Berberich et al. (2020) Berberich, J., Köhler, J., Müller, M. A., Allgöwer, F., 2020. Data-driven model predictive control with stability and robustness guarantees. IEEE Transactions on Automatic Control 66 (4), 1702–1717.
  • Blackmore et al. (2012) Blackmore, L., Açıkmeşe, B., Carson III, J. M., 2012. Lossless convexification of control constraints for a class of nonlinear optimal control problems. Systems & Control Letters 61 (8), 863–870.
  • Büskens and Maurer (2001) Büskens, C., Maurer, H., 2001. Sensitivity analysis and real-time optimization of parametric nonlinear programming problems. Online Optimization of Large Scale Systems, 3–16.
  • Carvallo et al. (1990) Carvallo, F. D., Westerberg, A. W., Morari, M., 1990. MILP formulation for solving minimum time optimal control problems. International Journal of Control 51 (4), 943–947.
  • Chinde et al. (2022) Chinde, V., Lin, Y., Ellis, M. J., 2022. Data-enabled predictive control for building HVAC systems. Journal of Dynamic Systems, Measurement, and Control 144 (8), 081001.
  • Coulson et al. (2019) Coulson, J., Lygeros, J., Dörfler, F., 2019. Data-enabled predictive control: In the shallows of the DeePC. In: 18th European Control Conference (ECC). IEEE, pp. 307–312.
  • Coulson et al. (2021) Coulson, J., Lygeros, J., Dörfler, F., 2021. Distributionally robust chance constrained data-enabled predictive control. IEEE Transactions on Automatic Control 67 (7), 3289–3304.
  • Elokda et al. (2021) Elokda, E., Coulson, J., Beuchat, P. N., Lygeros, J., Dörfler, F., 2021. Data-enabled predictive control for quadcopters. International Journal of Robust and Nonlinear Control 31 (18), 8916–8936.
  • Han and Mangasarian (1979) Han, S. P., Mangasarian, O. L., 1979. Exact penalty functions in nonlinear programming. Mathematical Programming 17, 251–269.
  • Huang et al. (2021) Huang, L., Coulson, J., Lygeros, J., Dörfler, F., 2021. Decentralized data-enabled predictive control for power system oscillation damping. IEEE Transactions on Control Systems Technology 30 (3), 1065–1077.
  • Huang et al. (2023) Huang, L., Zhen, J., Lygeros, J., Dörfler, F., 2023. Robust data-enabled predictive control: Tractable formulations and performance guarantees. IEEE Transactions on Automatic Control 68 (5), 3163–3170.
  • Kalman and Bertram (1959) Kalman, R. E., Bertram, J., 1959. General synthesis procedure for computer control of single-loop and multiloop linear systems (an optimal sampling system). Transactions of the American Institute of Electrical Engineers, Part II: Applications and Industry 77 (6), 602–609.
  • LaSalle (1959) LaSalle, J., 1959. Time optimal control systems. Proceedings of the National Academy of Sciences 45 (4), 573–577.
  • Lepetič et al. (2003) Lepetič, M., Klančar, G., Škrjanc, I., Matko, D., Potočnik, B., 2003. Time optimal path planning considering acceleration limits. Robotics and Autonomous Systems 45 (3-4), 199–210.
  • Markovsky and Dörfler (2021) Markovsky, I., Dörfler, F., 2021. Behavioral systems theory in data-driven analysis, signal processing, and control. Annual Reviews in Control 52, 42–64.
  • O’Reilly (1981) O’Reilly, J., 1981. The discrete linear time invariant time-optimal control problem–an overview. Automatica 17 (2), 363–370.
  • Rösmann et al. (2015) Rösmann, C., Hoffmann, F., Bertram, T., 2015. Timed-elastic-bands for time-optimal point-to-point nonlinear model predictive control. In: European Control Conference. IEEE, pp. 3352–3357.
  • Shirazi et al. (2018) Shirazi, A., Ceberio, J., Lozano, J. A., 2018. Spacecraft trajectory optimization: A review of models, objectives, approaches and solutions. Progress in Aerospace Sciences 102, 76–98.
  • Taheri et al. (2017) Taheri, E., Li, N. I., Kolmanovsky, I., 2017. Co-state initialization for the minimum-time low-thrust trajectory optimization. Advances in Space Research 59 (9), 2360–2373.
  • Trélat (2012) Trélat, E., 2012. Optimal control and applications to aerospace: Some results and challenges. Journal of Optimization Theory and Applications 154, 713–758.
  • Van den Broeck et al. (2011) Van den Broeck, L., Diehl, M., Swevers, J., 2011. A model predictive control approach for time optimal point-to-point motion control. Mechatronics 21 (7), 1203–1212.
  • van Waarde et al. (2020) van Waarde, H. J., De Persis, C., Camlibel, M. K., Tesi, P., 2020. Willems’ fundamental lemma for state-space systems and its extension to multiple datasets. IEEE Control Systems Letters 4 (3), 602–607.
  • Verscheure et al. (2009) Verscheure, D., Demeulenaere, B., Swevers, J., De Schutter, J., Diehl, M., 2009. Time-optimal path tracking for robots: A convex optimization approach. IEEE Transactions on Automatic Control 54 (10), 2318–2327.
  • Verschueren et al. (2017) Verschueren, R., Ferreau, H. J., Zanarini, A., Mercangöz, M., Diehl, M., 2017. A stabilizing nonlinear model predictive control scheme for time-optimal point-to-point motions. In: 56th Conference on Decision and Control (CDC). IEEE, pp. 2525–2530.
  • Wang et al. (2017) Wang, Z., Liu, L., Long, T., 2017. Minimum-time trajectory planning for multi-unmanned-aerial-vehicle cooperation using sequential convex programming. Journal of Guidance, Control, and Dynamics 40 (11), 2976–2982.
  • Willems et al. (2005) Willems, J. C., Rapisarda, P., Markovsky, I., De Moor, B. L., 2005. A note on persistency of excitation. Systems & Control Letters 54 (4), 325–329.
  • Zhang et al. (2014) Zhang, X., Fang, Y., Sun, N., 2014. Minimum-time trajectory planning for underactuated overhead crane systems with state and control constraints. IEEE Transactions on Industrial Electronics 61 (12), 6915–6925.