Curves of Minimax Curvature

C. Yalçın Kaya¹¹1Mathematics, UniSA STEM, University of South Australia, Mawson Lakes, S.A. 5095, Australia. E-mail: [email protected] . Lyle Noakes²²2Department of Mathematics and Statistics, The University of Western Australia, Nedlands WA 6009, Australia. Email: [email protected] . Philip Schrader³³3School of Mathematics, Statistics, Chemistry and Physics, Murdoch University, Murdoch WA 6150, Australia. Email: [email protected]

Abstract

We consider the problem of finding curves of minimum pointwise-maximum curvature, i.e., curves of minimax curvature, among planar curves of fixed length with prescribed endpoints and tangents at the endpoints. We reformulate the problem in terms of optimal control and use the maximum principle, as well as some geometrical arguments, to produce a classification of the types of solutions. Using the classification, we devise a numerical method which reduces the infinite-dimensional optimization problem to a finite-dimensional problem with just six variables. The solution types, together with some further observations on optimality, are illustrated via numerical examples.

Key words. Minimax curvature, Markov–Dubins path, Optimal control, Singular control, Bang–bang control.

AMS subject classifications. Primary 49J15, 49K15 Secondary 65K10, 90C30

1 Introduction

We are interested in finding a curve $z:[0,t_{f}]\longrightarrow{\rm{I\ \kern-5.39993ptR}}^{2}$ which minimises the almost everywhere (a.e.) pointwise maximum curvature of $z(t)$ , i.e. the $L^{\infty}$ -norm of the curvature, among all curves having fixed length; prescribed endpoints $z_{0}$ and $z_{f}$ at $0$ and $t_{f}$ respectively; and prescribed tangents $v_{0}$ and $v_{f}$ at the endpoints. Since the curvature is independent of parametrisation we will consider only curves which are parametrised with respect to arc length so that the curvature is $\|\ddot{z}(t)\|$ , and the parameter $t_{f}$ is the length of the whole curve (and therefore $t_{f}$ is fixed). The problem can then be posed as

\mbox{(P)}\left\{\begin{array}[]{rl}\displaystyle\min_{z(\cdot)}&\ \displaystyle\max_{t\in[0,t_{f}]}\|\ddot{z}(t)\|\\[11.38109pt] \mbox{s.t.}&\ z(0)=z_{0}\,,\ z(t_{f})=z_{f}\,,\\[5.69054pt] &\ \dot{z}(0)=v_{0}\,,\ \dot{z}(t_{f})=v_{f}\,,\\[5.69054pt] &\ \|\dot{z}(t)\|=1\,,\mbox{ for a.e. }t\in[0,t_{f}]\,,\end{array}\right.

where $\dot{z}=dz/dt$ , $\ddot{z}=d^{2}z/dt^{2}$ , $\|\cdot\|$ is the Euclidean norm, and $v_{0}$ and $v_{f}$ are given such that $\|v_{0}\|=\|v_{f}\|=1$ .

If the $L^{\infty}$ -norm is replaced by the squared $L^{2}$ -norm, of $\ddot{z}$ , namely $\int_{0}^{t_{f}}\|\ddot{z}(t)\|^{2}\,dt$ , then critical curves for Problem (P) become the celebrated (fixed-length) Euler’s elastica. The $L^{p}$ -norm generalizations of the Euler elastica problem for $p\in[1,\infty)$ , where $\int_{0}^{t_{f}}\|\ddot{z}(t)\|^{2}\,dt$ is replaced by $\int_{0}^{t_{f}}\|\ddot{z}(t)\|^{p}\,dt$ , have been referred to as the $p$ -elastica problem in the literature—see [9]. Similarly, potential minimisers of Problem (P) have been called the $\infty$ -elastica in [20]. Here we refer to (P) as the problem of finding curves of minimum pointwise-maximum curvature, or simply curves of minimax curvature, because it is more descriptive and we are ultimately focused on finding global minimisers.

Another problem related to (P) is the Markov–Dubins problem, in which one is to minimize this time the (free) length $t_{f}$ of the curve of interest subject to a fixed bound $a$ imposed on the curvature, while the remaining constraints appearing in (P) are kept in place. This problem was first proposed and some instances studied by Andrey Markov in 1889 [17] (also see [16]) but fully solved by Lester Dubins only in 1957 [5]. Dubins’ result characterizes minimum-length solutions in terms of concatenations of straight lines and circular arcs: Let a straight line segment be denoted by an $S$ and a circular arc of the maximum allowed curvature $a$ (or, turning radius $1/a$ ) by a $C$ , then the shortest curve is a concatenation of type $CCC$ , or of type $CSC$ , or a subset thereof. See [10] and the references therein for a literature survey of the Markov–Dubins problem.

The first published work on Problem (P) appears to be that of Moser [20] where it is studied for curves in $\mathbb{R}^{n}$ , followed by the extension to Riemannian manifolds in [7]. In [20] $\infty$ -elastica are defined as minimisers of the sum of maximum curvature and a penalty term which is a kind of $L^{2}$ -distance between curves. In order to obtain differential equations which characterise $\infty$ -elastica, Moser studies the Euler–Lagrange equations for the sum of $p$ -elastic energy with the same penalty term, in the limit $p\to\infty$ . From the differential equations Moser obtains a classification of $\infty$ -elastica as concatenations of circular arcs and straight lines, similar to the solutions of the Markov-Dubins problem, except that $\infty$ -elastica can contain more than three pieces, and closed loops are allowed. If we denote by $O$ a full circle with the maximum curvature, and $C,S$ as before, the conclusion is that $\infty$ -elastica can be of type $CSOSOC$ , perhaps with more or less loops and straight line segments, or of type $CC\ldots C$ with any number of $C$ segments and such that the interior segments have equal length and alternating orientation.

The present work takes a fundamentally different approach to that of Moser: we show that by using the same technique applied in [15], Problem (P) can be reformulated as an optimal control problem without any approximation by $p$ -elastic energy. As usual, the maximum principle then provides us with necessary conditions for optimality in the form of differential equations for the state and costate variables, and constraints on the control. A careful analysis of these conditions – in particular of the phase portrait of the switching function, which is very similar to the analysis of the Markov–Dubins problem in [10, 11] – allows us to draw similar conclusions to Moser’s about which types of curves can be optimal.

The necessary conditions obtained from the maximum principle are then combined with a series of geometric arguments, such as Lemma 21 showing that a curve of type $CCCC$ cannot be optimal because it can be deformed into a feasible curve with the same maximum curvature, but which does not satisfy the necessary conditions. These observations lead to our main result, a refined classification in Theorem 22: any solution to Problem (P) is of type $CCC$ , or $CSC$ , or $COC$ , or $SOS$ , or a subset thereof, with all circular arcs having the same maximum possible radius (or minimum possible maximum curvature). So we have that, just like in the Markov–Dubins problem, solutions of problem (P) cannot consist of more than three segments. However, unlike in the Markov–Dubins problem, under specific conditions (Propositions 23 and 25), they can contain closed circular loops.

Since Problem (P) is related to the Markov–Dubins problem by interchange of constraint and cost, it is perhaps not surprising that the solutions are so similar. Indeed, a relationship between solutions of the two problems was established in [20, Proposition 5]: a solution of the Markov–Dubins problem is a solution to Problem (P). In Theorem 5 we give an alternative proof of this fact in terms of the optimal control formulation, and then demonstrate that the converse does not hold (Remark 6).

We devise a numerical method, which, using the classification of the solution curves stated in Theorem 22 and switching time optimization techniques, reduces the infinite-dimensional optimization problem (P) to a finite-dimensional optimization problem (Ps) with just six variables. We carry out extensive numerical experiments. We especially focus on solutions containing a loop to exemplify the specific results involving loops, as well as gain insights about those instances for which no specific results are available.

The paper is organized as follows. In Section 2, we reformulate Problem (P) as an optimal control problem. In Section 3, we write down the maximum principle for the optimal control problem. In Section 4, we classify the types of solutions for Problem (P) and provide the main theoretical result as Theorem 22. In Section 5, we develop a numerical method for solving Problem (P), and provide example applications and numerical solutions using off-the-shelf optimization software, so as to illustrate the theoretical results. Finally, in Section 6, concluding remarks and possible directions for future work are provided.

2 Reformulations

Obviously, if the distance separating $z_{0}$ and $z_{f}$ is more than the fixed length $t_{f}$ then Problem (P) has no solution. In fact, with $t_{f}=\|z_{0}-z_{f}\|$ , a feasible solution may still not exist. Consider the points $z_{0}=(0,0)$ and $z_{f}=(1,0)$ . With $t_{f}=1$ , unless $v_{0}=v_{f}=(1,0)$ , there exists no feasible curve between $z_{0}$ and $z_{f}$ . Therefore we make the following assumption.

Assumption 1

$t_{f}>\|z_{0}-z_{f}\|$ .

Before reformulating (P) as an optimal control problem and classifying the minimisers, we first address the existence of solutions.

Proposition 2

(Existence of minimisers.) Given initial and final points $z_{0},z_{f}$ and directions $v_{0},v_{f}$ as in Problem (P), there exists a minimiser $z\in W^{2,\infty}((0,t_{f}),\mathbb{R}^{2})$ which is a solution to Problem (P).

Since the proof of Proposition 2 is somewhat long and does not contribute directly to our main result, we defer it until Section 4.1.

Problem (P) is nonsmooth because of the $\max$ operator appearing in its objective functional. On the other hand, Problem (P) can be transformed into a smooth variational problem by using a standard technique from nonlinear programming, in the same way it was also done in [15]:

\mbox{(P1)}\left\{\begin{array}[]{rl}\displaystyle\min_{a,z(\cdot)}&\ a\\[5.69054pt] \mbox{s.t.}&\ z(0)=z_{0}\,,\ z(t_{f})=z_{f}\,,\\[5.69054pt] &\ \dot{z}(0)=v_{0}\,,\ \dot{z}(t_{f})=v_{f}\,,\\[5.69054pt] &\ \|\ddot{z}(t)\|\leq a\,,\ \ \ \|\dot{z}(t)\|=1\,,\mbox{ for a.e. }t\in[0,t_{f}]\,,\end{array}\right.

where the bound $a\geq 0$ is a new optimization variable of the problem.

Remark 3

One has the solution $a=0$ if and only if the curve joining $z_{0}$ and $z_{f}$ is a straight line. This is possible only in the case when $v_{0}=v_{f}$ and $v_{0}$ and $v_{f}$ are colinear with $z_{f}-z_{0}$ . $\Box$

Problem (P) can equivalently be cast as an optimal control problem as follows. Let $z(t):=(x(t),y(t))\in{\rm{I\ \kern-5.39993ptR}}^{2}$ , with $\dot{x}(t):=\cos\theta(t)$ and $\dot{y}(t):=\sin\theta(t)$ , where $\theta(t)$ is the angle the velocity vector $\dot{z}(t)$ of the curve $z(t)$ makes with the horizontal. These definitions verify that $\|\dot{z}(t)\|=1$ . Moreover,

\|\ddot{z}\|^{2}=\ddot{x}^{2}+\ddot{y}^{2}=\dot{\theta}^{2}\,.

Therefore, $|\dot{\theta}(t)|$ is nothing but the curvature. In fact, $\dot{\theta}(t)$ itself, which can be positive or negative, is referred to as the signed curvature. For example, consider a vehicle travelling along a circular path. If $\dot{\theta}(t)>0$ then the vehicle travels in the counter-clockwise direction, i.e., it turns left, and if $\dot{\theta}(t)<0$ then the vehicle travels in the clockwise direction, i.e., it turns right. If $\dot{\theta}(t)=0$ then the vehicle travels along a straight line.

Suppose that the directions at the points $z_{0}$ and $z_{f}$ are denoted by the angles $\theta_{0}$ and $\theta_{f}$ , respectively. The curvature minimizing problem (P1), or equivalently Problem (P), can then be re-written as a (parametric) optimal control problem, where the objective functional is the maximum curvature $a$ (the parameter), and $x$ , $y$ and $\theta$ are the state variables and $u$ is the control variable :

\mbox{(Pc)}\left\{\begin{array}[]{rll}\displaystyle\min_{a,u(\cdot)}&\ \displaystyle a&\\[5.69054pt] \mbox{s.t.}&\ \dot{x}(t)=\cos\theta(t)\,,&x(0)=x_{0}\,,\ x(t_{f})=x_{f}\,,\\[5.69054pt] &\ \dot{y}(t)=\sin\theta(t)\,,&y(0)=y_{0}\,,\ y(t_{f})=y_{f}\,,\\[5.69054pt] &\ \dot{\theta}(t)=u(t)\,,&\theta(0)=\theta_{0}\,,\ \theta(t_{f})=\theta_{f}\,,\\[5.69054pt] &&|u(t)|\leq a\,,\mbox{ for a.e. }t\in[0,t_{f}]\,.\end{array}\right.

Recall by Remark 3 that the solution to Problem (P), equivalently (Pc), with $a=0$ constitutes a special configuration of the given oriented endpoints: the angle $\theta(t)=\theta_{0}=\theta_{f}=\arctan((y_{f}-y_{0})/(x_{f}-x_{0}))$ for all $t\in[0,t_{f}]$ , and so the curve $z(t)$ is a straight line from $(x_{0},y_{0})$ to $(x_{f},y_{f})$ .

Note that with the unit speed constraint $\|\dot{z}(t)\|=1$ in Problem (P) the solution curve $z(t)$ is parametrized with respect to its length. Therefore the terminal time $t_{f}$ is nothing but the length of the curve $z(t)$ . When the maximum allowed curvature $a$ is fixed, the length $t_{f}$ is allowed to vary, and the objective functional $a$ in Problem (Pc) is replaced by $t_{f}$ , Problem (Pc) becomes the celebrated Markov–Dubins problem [5, 16, 17], where one looks for the shortest curve between the oriented points $(x_{0},y_{0},\theta_{0})$ and $(x_{f},y_{f},\theta_{f})$ , with a prescribed bound $a$ on its curvature:

\mbox{(MD)}\left\{\begin{array}[]{rll}\displaystyle\min_{t_{f},u(\cdot)}&\ \displaystyle t_{f}&\\[5.69054pt] \mbox{s.t.}&\mbox{Constraints in Problem~{}(Pc) with $a$ fixed}.\end{array}\right.

Let a straight line segment be denoted by an $S$ and a circular arc segment of curvature $a$ (or, turning radius $1/a$ ) by a $C$ . Dubins’ result is stated in terms of concatenations of type $S$ and type $C$ curve segments in the following theorem.

Theorem 4 (Dubins’ theorem [5])

A solution to Problem (MD) exists, and any such solution, that is to say, any $C^{1}$ and piecewise- $C^{2}$ curve of minimum length in the plane between two prescribed endpoints, where the slopes of the curve at these endpoints as well as a bound on the pointwise curvature are also prescribed, is of type $CSC$ , or of type $CCC$ , or a subset thereof.

In the following theorem, we establish a relationship between Problems (MD) and (Pc). In fact, the proof of the theorem utilizes Theorem 4 as well as the following simple observation: Problem (MD) minimizes the length $t_{f}$ while the bound $a$ on the curvature is fixed, and Problem (Pc) minimizes $a$ while $t_{f}$ is kept fixed.

Theorem 5 (Solutions of (MD) and (Pc))

Any solution $t_{f}^{*}$ to Problem (MD) with $a=a^{*}$ fixed is the solution $a^{*}$ to Problem (Pc) with $t_{f}=t_{f}^{*}$ fixed.

Proof. Suppose that, with $a=a^{*}$ fixed, the solution of Problem (MD) yields $t_{f}^{*}$ . By Theorem 4, either $|\dot{\theta}^{*}(t)|=a^{*}$ or $\dot{\theta}^{*}(t)=0$ , a.e. $t\in[0,t_{f}^{*}]$ ; in other words, with the optimal length $t_{f}^{*}$ , the optimal angular velocity $\dot{\theta}^{*}$ has to satisfy

|\dot{\theta}^{*}(t)|\notin(0,a^{*})\,,\ \ \mbox{a.e.}\ \ t\in[0,t_{f}^{*}].

(1)

Now suppose that, with $t_{f}=t_{f}^{*}$ fixed, solution of Problem (Pc) yields $\widehat{a}$ . Obviously, $0\leq\widehat{a}\leq a^{*}$ . We can obtain a contradiction for the case when $\widehat{a}<a^{*}$ as follows.

(i)

$\widehat{a}\neq 0$ : Since this solution of Problem (Pc) satisfies the constraints of Problem (MD) with $a=a^{*}$ , and has the optimal length $t_{f}^{*}$ for Problem (MD), it must also be a solution of (MD). Then it must satisfy (1). However, since it is a solution of Problem (Pc) it also satisfies $0\leq|\dot{\theta}(t)|\leq\widehat{a}$ for a.e. $t\in[0,t_{f}^{*}]$ , and so we cannot have $\widehat{a}<a^{*}$ unless $\widehat{a}=0$ .
(ii)

$\widehat{a}=0$ : In this case, we have the trivial case $\theta_{0}=\theta_{f}=\arctan((y_{f}-y_{0})/(x_{f}-x_{0}))$ , which yields $\dot{\theta}(t)=0$ , a.e. $t\in[0,t_{f}^{*}]$ , as the solution of both Problems (Pc) and (MD), with $\widehat{a}=a^{*}=0$ .

Therefore, with $t_{f}=t_{f}^{*}$ fixed, the solution of Problem (Pc) yields $\widehat{a}=a^{*}$ . So, if the pair $(t_{f}^{*},a^{*})$ solves Problem (MD), then it also solves Problem (Pc). $\Box$

Remark 6

The converse of Theorem 5 is not true: Consider the oriented points $(x_{0},y_{0},\theta_{0})=(0,0,0)$ and $(x_{f},y_{f},\theta_{f})=(1,0,0)$ . Fix $t_{f}=\widehat{t}_{f}=2>\|x_{f}-x_{0}\|=1$ . Then clearly the solution $\widehat{a}$ to Problem (Pc) is bounded away from 0, as $\widehat{a}=0$ is the solution where $x_{0}$ and $x_{f}$ are joined by a straight line with $t_{f}=1<2$ . On the other hand, with the curvature bound $a=\widehat{a}$ fixed, Problem (MD) yields the solution $t_{f}^{*}=1$ , and $\theta^{*}(t)=0$ for all $t\in[0,t_{f}^{*}]$ (the straight line joining $x_{0}$ and $x_{f}$ ). So one gets $t_{f}^{*}<\widehat{t}_{f}$ , furnishing a counterexample for the converse of Theorem 5. $\Box$

3 Maximum Principle

In this section, we study the optimality conditions for a re-formulation of Problem (Pc) in a classical form. Let us redefine the control variable $u$ such that $\alpha(t)\,v(t):=\dot{\theta}(t)$ , where $\alpha(t):=a$ and so $\dot{\alpha}(t):=0$ , for a.e. $t\in[0,t_{f}]$ . Problem (Pc), or equivalently Problem (P), can then be re-written as the following optimal control problem:

\mbox{(OC)}\left\{\begin{array}[]{rll}\displaystyle\min_{v(\cdot)}&\ \displaystyle\int_{0}^{t_{f}}\alpha(t)\,dt&\\[11.38109pt] \mbox{s.t.}&\ \dot{x}(t)=\cos\theta(t)\,,&x(0)=x_{0}\,,\ x(t_{f})=x_{f}\,,\\[5.69054pt] &\ \dot{y}(t)=\sin\theta(t)\,,&y(0)=y_{0}\,,\ y(t_{f})=y_{f}\,,\\[5.69054pt] &\ \dot{\theta}(t)=\alpha(t)\,v(t)\,,&\theta(0)=\theta_{0}\,,\ \theta(t_{f})=\theta_{f}\,,\ |v(t)|\leq 1\,,\mbox{ for a.e. }t\in[0,t_{f}]\,,\\[5.69054pt] &\ \dot{\alpha}(t)=0\,,&\mbox{for a.e. }t\in[0,t_{f}]\,.\end{array}\right.

Define the Hamiltonian function for Problem (OC) as

H(x,y,\theta,\alpha,\lambda_{0},\lambda_{1},\lambda_{2},\lambda_{3},\lambda_{4},v):=\lambda_{0}\,\alpha+\lambda_{1}\,\cos\theta+\lambda_{2}\,\sin\theta+\lambda_{3}\,\alpha\,v+\lambda_{4}\cdot 0\,,

(2)

where $\lambda_{0}\geq 0$ is a scalar (multiplier) parameter and $\lambda_{i}:[0,t_{f}]\to{\rm{I\ \kern-5.39993ptR}}$ , $i=1,2,3,4$ , are the adjoint (or costate) variables. Let

H[t]:=H(x(t),y(t),\theta(t),\alpha(t),\lambda_{0},\lambda_{1}(t),\lambda_{2}(t),\lambda_{3}(t),\lambda_{4}(t),v(t))\,.

The adjoint variables are required to satisfy

	$\displaystyle\dot{\lambda}_{1}(t):=-H_{x}[t]=0\,,$		(3)
	$\displaystyle\dot{\lambda}_{2}(t):=-H_{y}[t]=0\,,$		(4)
	$\displaystyle\dot{\lambda}_{3}(t):=-H_{\theta}[t]=\lambda_{1}(t)\,\sin\theta(t)-\lambda_{2}(t)\,\cos\theta(t)\,,$		(5)
	$\displaystyle\dot{\lambda}_{4}(t):=-H_{\alpha}[t]=-\lambda_{0}-\lambda_{3}(t)\,v(t)\,,\ \ \lambda_{4}(0)=0\,,\ \lambda_{4}(t_{f})=0\,,$		(6)

where $H_{x}=\partial H/\partial x$ , etc. By these definitions, the state and adjoint variables verify a Hamiltonian system in that, in addition to (3)–(6), one has $\dot{x}(t)=H_{\lambda_{1}}[t]$ , $\dot{y}(t)=H_{\lambda_{2}}[t]$ , $\dot{\theta}(t)=H_{\lambda_{3}}[t]$ and $\dot{\alpha}(t)=H_{\lambda_{4}}[t]$ .

Note that (3)–(4) imply that $\lambda_{1}(t)=\overline{\lambda}_{1}$ and $\lambda_{2}(t)=\overline{\lambda}_{2}$ for all $t\in[0,t_{f}]$ , where $\overline{\lambda}_{1}$ and $\overline{\lambda}_{2}$ are constants. Define new constants

\rho:=\sqrt{\overline{\lambda}_{1}^{2}+\overline{\lambda}_{2}^{2}}\,,\qquad\tan\phi:=\frac{\overline{\lambda}_{2}}{\overline{\lambda}_{1}}\,.

(7)

Then (2) and (5) can respectively be re-written as

H[t]=a\,\lambda_{0}+\rho\,\cos(\theta(t)-\phi)+a\,\lambda_{3}(t)\,v(t)\,,

(8)

where we have also used $\alpha(t)=a$ , for all $t\in[0,t_{f}]$ , and

\dot{\lambda}_{3}(t)=\rho\,\sin(\theta(t)-\phi)\,.

(9)

Since $H$ does not depend on $t$ explicitly,

H[t]=h\,,

(10)

where $h$ is some real constant.

The maximum principle [21, Theorem 1] for Problem (OC) can simply be stated as follows. Suppose that $x,y,\theta,\alpha\in W^{1,\infty}(0,t_{f};{\rm{I\ \kern-5.39993ptR}})$ and $v\in L^{\infty}(0,t_{f};{\rm{I\ \kern-5.39993ptR}})$ solve Problem (OC). Then there exist a number $\lambda_{0}\geq 0$ and functions $\lambda_{i}\in W^{1,\infty}(0,t_{f};{\rm{I\ \kern-5.39993ptR}})$ , $i=1,2,3,4$ , such that $\lambda(t):=(\lambda_{0},\lambda_{1}(t),\lambda_{2}(t),\lambda_{3}(t),\lambda_{4}(t))\neq\bf 0$ , for every $t\in[0,t_{f}]$ , and, in addition to the state differential equations and other constraints given in Problem (OC) and the adjoint differential equations (3)–(4), (6) and (9), the following condition holds:

u(t)\in\operatorname*{argmin}_{|w|\leq 1}H(x(t),y(t),\theta(t),\alpha(t),\lambda_{0},\lambda_{1}(t),\lambda_{2}(t),\lambda_{3}(t),\lambda_{4}(t),w)\,,

(11)

Using the definition in (2), (11) can be concisely written as

v(t)\in\operatorname*{argmin}_{|w|\leq 1}\ a\,\lambda_{3}(t)\,w\,,

(12)

which yields the optimal control as

v(t)=\left\{\begin{array}[]{ll}\ \ 1\,,&\mbox{if}\ a\,\lambda_{3}(t)<0\,,\\[8.53581pt] -1\,,&\mbox{if}\ a\,\lambda_{3}(t)>0\,,\\[8.53581pt] \mbox{undetermined}\,,&\mbox{if}\ a\,\lambda_{3}(t)=0\,.\end{array}\right.

(13)

Furthermore, (8) and (10) give

a\,\lambda_{3}(t)\,v(t)+\rho\,\cos(\theta(t)-\phi)+a\,\lambda_{0}=:h\,.

(14)

The control $v(t)$ to be chosen for the case when $a\,\lambda_{3}(t)=0$ for a.e. $t\in[\zeta_{1},\zeta_{2}]\subset[0,t_{f}]$ is referred to as singular control, because (12) does not yield any further information. On the other hand, when $a\,\lambda_{3}(t)\neq 0$ for a.e. $t\in[0,t_{f}]$ (which implies that $a>0$ ), i.e., it is possible to have $\lambda_{3}(t)=0$ only for isolated values of $t$ , the control $v(t)$ is said to be nonsingular. It should be noted that, if $\lambda_{3}(\tau)=0$ only at an isolated point $\tau$ , the optimal control at this isolated point can be chosen as $v(\tau)=-1$ or $v(\tau)=1$ , conveniently. Therefore, if the control $v(t)$ is nonsingular, it will take on either the value $-1$ or $1$ , the bounds on the control variable. In this case, the control $v(t)$ is referred to as bang–bang. Since the sign of $\lambda_{3}(t)$ determines the value of the optimal control $v(t)$ , $\lambda_{3}$ is referred to as the switching function.

Lemma 7 (Singularity and Straight Line Segments)

Suppose that the optimal control $v(t)$ for Problem (OC) is singular over an interval $[\zeta_{1},\zeta_{2}]\subset[0,t_{f}]$ . Then $\theta(t)$ is constant for all $t\in[\zeta_{1},\zeta_{2}]$ . Moreover, if $a>0$ then $v(t)=0$ for all $t\in[\zeta_{1},\zeta_{2}]$ .

Proof. Suppose that the optimal control $v(t)$ is singular, i.e., $a\,\lambda_{3}(t)=0$ , for a.e. $t\in[\zeta_{1},\zeta_{2}]\subset[0,t_{f}]$ . If $a=0$ , then $\dot{\theta}(t)=0$ and so $\theta(t)$ is constant. If $a>0$ then $\dot{\lambda}_{3}(t)=0$ for a.e. $t\in[\zeta_{1},\zeta_{2}]\subset[0,t_{f}]$ . In other words, from (9), $\sin(\theta(t)-\phi)=0$ , which implies that $\theta(t)$ is constant, i.e., $\dot{\theta}(t)=u(t)=0$ , for all $t\in[\zeta_{1},\zeta_{2}]$ . $\Box$

Remark 8

Using Lemma 7, for $a>0$ , we can rewrite the optimal control in (13) as

v(t)=-\operatorname*{sgn}(\lambda_{3}(t))\,,\mbox{ a.e. }t\in[0,t_{f}]\,.

(15)

When $a\,\lambda_{3}(t)=0$ over an interval of time, $\theta(t)$ is constant, so the solution curve in that interval is a straight line. When $a\,\lambda_{3}(t)\neq 0$ , i.e., $\lambda_{3}(t)\neq 0$ , over an interval of time the solution curve in that interval is a circular arc segment with radius $1/a$ . $\Box$

By Lemma 7, we have established that the only straight-line solution of Problem (OC) is a singular control solution. In the rest of this section we will assume that $a>0$ , as there is nothing more to say for the simple case of $a=0$ , as far as the maximum principle is concerned.

Assumption 9

$a>0$ .

The problems that yield a solution with $\lambda_{0}=0$ are referred to as abnormal, so are their solutions, in the optimal control theory literature, for which the conditions obtained from the maximum principle are independent of the objective functional $\int_{0}^{t_{f}}\alpha(t)\,dt$ in Problem (OC) and therefore not sufficiently informative. The problems that yield $\lambda_{0}>0$ , and their solutions, are referred to as normal. Lemma 10 below states that we can rule out abnormality for Problem (OC).

Lemma 10 (Normality of Solutions)

Problem (OC) is normal, i.e., one can set $\lambda_{0}:=1$ .

Proof. For contradiction purposes, suppose that $\lambda_{0}=0$ and recall the assumption that $a>0$ . Then, from (6) and (15), $\dot{\lambda}_{4}(t)=|\lambda_{3}(t)|\geq 0$ , for every $t\in[0,t_{f}]$ , which, along with the boundary conditions $\lambda_{4}(0)=\lambda_{4}(t_{f})=0$ , implies that $\lambda_{4}(t)=\lambda_{3}(t)=0$ for every $t\in[0,t_{f}]$ , giving rise to a totally singular solution. This in turn implies that $v(t)=0$ and so $\dot{\theta}(t)=0$ , for every $t\in[0,t_{f}]$ , and thus $a=0$ , which is a contradiction. Therefore $\lambda_{0}>0$ . Then, without loss of generality, one can set $\lambda_{0}=1$ . $\Box$

Remark 11

We recall that as proved in [10] a solution to a similar optimal control reformulation of the Markov–Dubins problem (MD) can be abnormal, while we prove in Lemma 10 that there are no abnormal solutions to Problem (OC). $\Box$

Lemma 12 (Singularity and $\rho$ )

Suppose that the optimal control $v(t)$ for Problem (OC) is singular over an interval $[\zeta_{1},\zeta_{2}]\subset[0,t_{f}]$ . Then $\rho=|a-h|\neq 0$ , with $h$ as defined in (14).

Proof. Suppose the optimal control $u(t)$ is singular over $[\zeta_{1},\zeta_{2}]\subset[0,t_{f}]$ . Then $\lambda_{3}(t)=\dot{\lambda}_{3}(t)=0$ for a.e. $t\in[\zeta_{1},\zeta_{2}]$ . From (9), $\sin(\theta(t)-\phi)=0$ , which implies that $\cos(\theta(t)-\phi)=1$ or $-1$ . Then, substituting $\lambda_{3}(t)=0$ , $\cos(\theta(t)-\phi)=\pm 1$ , and (from Lemma 10) $\lambda_{0}=1$ , into (14), one gets $\rho=\mp(a-h)\geq 0$ .

Next we show that $\rho\neq 0$ . For contradiction purposes, suppose that $\rho=0$ , i.e., $\lambda_{1}(t)=\lambda_{2}(t)=0$ , for every $t\in[0,t_{f}]$ , and recall the assumption that $a>0$ . Then $\dot{\lambda}_{3}(t)=0$ and so $\lambda_{3}(t)=0$ , for every $t\in[0,t_{f}]$ . Now from (6), we get $\dot{\lambda}_{4}(t)=-1$ , for every $t\in[0,t_{f}]$ , which does not have a solution with $\lambda_{4}(0)=\lambda_{4}(t_{f})=0$ . $\Box$

Lemma 13

The adjoint variable $\lambda_{3}$ for Problem (OC) solves the differential equation

\dot{\lambda}_{3}^{2}(t)+\left(a\,|\lambda_{3}(t)|-a+h\right)^{2}=\rho^{2}\,.

(16)

Proof. From (9),

\dot{\lambda}_{3}^{2}(t)=\rho^{2}\,\sin^{2}(\theta(t)-\phi)=\rho^{2}-\rho^{2}\,\cos^{2}(\theta(t)-\phi)\,.

(17)

Using $v(t)=-\operatorname*{sgn}(\lambda_{3}(t))$ and $\lambda_{0}=1$ in (14), one gets

\rho\,\cos(\theta(t)-\phi)=a\,|\lambda_{3}(t)|-a+h\,.

Substituting this into the right-hand side of (17) and rearranging give (16). $\Box$

In the rest of the paper, we will at times not show dependence of variables on $t$ for clarity of presentation.

Refer to caption — Figure 1: Phase portrait of (16) for the switching function $\lambda_{3}$ .

Remark 14 (Phase Portrait for Switchings)

Note that the differential equation (16) is given in terms of the phase variables $\lambda_{3}$ and $\dot{\lambda}_{3}$ , and can be put into the form

\left(\lambda_{3}\pm\frac{|a-h|}{a}\right)^{2}+\frac{\dot{\lambda}_{3}^{2}}{a^{2}}=\frac{\rho^{2}}{a^{2}}\,,

(18)

where the “ $+$ ” sign in the first square term stands for the case when $\lambda_{3}<0$ and the “ $-$ ” sign for $\lambda_{3}>0$ . Equation (18) clearly tells us that the trajectories in the phase plane for $\lambda_{3}$ (the $\lambda_{3}$ $\dot{\lambda}_{3}$ -plane) will be pieces or concatenations of pieces of concentric ellipses centred at $(-|a-h|/a,0)$ for $v(t)=1$ and $(|a-h|/a,0)$ for $v(t)=-1$ , as shown in Figure 1. Based on the phase plane diagram, also referred to as the phase portrait, of $\lambda_{3}$ depicted in Figure 1, the following observations are made.

(i)

When $\rho>|a-h|$ the trajectories are concatenations of (pieces of) ellipses, examples of which are shown by (dark blue) solid curves in Figure 1. The ellipses are concatenated at the switching points $(0,\sqrt{\rho^{2}-(a-h)^{2}})$ and $(0,-\sqrt{\rho^{2}-(a-h)^{2}})$ , where the value of the bang–bang control $v(t)$ switches from $1$ to $-1$ or from $-1$ to $1$ , respectively. The concatenated ellipses cross the $\lambda_{3}$ -axis at two points, $(-(\rho+|a-h|)/a,0)$ and $((\rho+|a-h|)/a,0)$ . One can promptly deduce from the diagram that if the bang–bang control has two switchings, the second arc must have a length $t_{f}$ strictly greater than $\pi/a$ , in other words, the smallest possible curvature $a$ is $\pi/t_{f}$ . The diagram, however, does not tell as to how many switchings the optimal control must have.
(ii)

Recall, by Lemma 12, that $\rho=|a-h|\neq 0$ for singular control. The case when only a part of the trajectory is singular, i.e., partially singular, referred to as a bang–singular trajectory, is represented by the two unique (red) dashed elliptic curves in Figure 1. Note that singular control takes place only at the origin $(0,0)$ of the phase plane, with $v(t)=0$ . At any other point, the control trajectory is of bang–bang type. We also note that a trajectory is totally singular if, and only if, $a=0$ , the solution given in Remark 3.
(iii)

For the case when $0\leq\rho<|a-h|$ , example elliptic trajectories are shown with (black) dotted curves in Figure 1. The trajectories cross the $\lambda_{3}$ -axis at four distinct points $(-(|a-h|+\rho)/a,0)$ and $(-(|a-h|-\rho)/a,0)$ when $\lambda_{3}<0$ , and $((|a-h|+\rho)/a,0)$ and $((|a-h|-\rho)/a,0)$ when $\lambda_{3}>0$ . However, the trajectories no longer intercept the $\dot{\lambda}_{3}$ -axis; therefore they represent bang–bang control with no switchings, i.e., either $v(t)=1$ for all $t\in[0,t_{f}]$ or $v(t)=-1$ for all $t\in[0,t_{f}]$ .

An optimal path will in general be a concatenation of straight lines (i.e., singular arcs, where $v(t)=0$ ) and circular arcs (i.e., nonsingular arcs, where $v(t)=1$ or $-1$ ). $\Box$

Lemma 15

Suppose that optimal control $v(t)$ for Problem (OC) is nonsingular over an interval $[\zeta_{3},\zeta_{4}]\subset[0,t_{f}]$ . Then

|\lambda_{3}(t)|=\frac{1}{a}\left[\rho\,\cos(\theta(t)-\phi)+a-h\right]\,,\ \ \mbox{ for a.e.\ }t\in[\zeta_{3},\zeta_{4}]\subset[0,t_{f}]\,.

(19)

Proof. Substitution of $v(t)=-\operatorname*{sgn}(\lambda_{3}(t))$ and $\lambda_{0}=1$ into (14) and re-arranging yield the required expression. $\Box$

Lemma 16 (Nonsingular Curves)

Consider Problem (OC) and the maximum principle for it.

(a)

If $\rho=0$ , then either $v(t)=1$ or $v(t)=-1$ , for all $t\in[0,t_{f}]$ .
(b)

If $0<\rho\neq|a-h|$ , then $v(t)$ is of bang–bang type.

Proof. (a) Suppose that $\rho=0$ . Then, from Lemma 15, $|\lambda_{3}(t)|=(a-h)/a$ , $\lambda_{3}(t)$ is a nonzero constant, i.e., $v(t)$ is either $1$ or $-1$ , for all $t\in[0,t_{f}]$ .
(b) Suppose that $0<\rho\neq|a-h|$ . Then, the contrapositive of Lemma 12 states that the optimal control is not singular, i.e., bang–bang. $\Box$

Remark 17

The constant optimal control in Lemma 16(a), $v(t)=1$ or $v(t)=-1$ , and the associated $|\lambda_{3}(t)|=(a-h)/a$ , can be viewed graphically as the limiting case when $\rho\to 0$ in Figure 1, i.e., the “size” of the black-dotted ellipse shrinks down to zero. $\Box$

4 Classification of Optimal Paths

Denote a straight line segment by $S$ and a circular arc by $C$ , so that a trajectory satisfying the necessary conditions can be described as being of type $C,S,CSC,\ldots$ etc. In [10, Lemma 6] the analog of Figure 1 is used to conclude that an optimal path for (MD) contains a straight line segment $S$ then it must be of type $CSC,CS,SC,$ or $S$ . Part of the argument is that a trajectory containing e.g. $SCS$ must traverse a full circle and is therefore not optimal because the objective in the Markov–Dubins problem is to minimize length.

For Problem (OC) we can also conclude from Figure 1 and (9) that $SCS$ must traverse a full circle, but it is no longer obvious that such a path cannot be optimal. In fact we will show that in some cases it is optimal ( Proposition 23, Proposition 25) while in others it is not (Corollary 20). It will be helpful to introduce the notation $O$ to represent a full circle once covered, and reserve $C$ for circular arcs which are not closed.

Lemma 18

If $\mathbf{x}^{*}(t):=(x^{*}(t),y^{*}(t),\theta^{*}(t))$ with $\alpha^{*}(t)=a^{*}$ is an optimal path for Problem (OC) then any sub-path of $\mathbf{x}^{*}$ is also optimal for the inherited constraints. That is, for any $[t_{1},t_{2}]\subset[0,t_{f}]$ the restriction $\mathbf{x}^{*}|[t_{1},t_{2}]$ is optimal for problem (OC) with the constraints replaced by $x(t_{i})=x^{*}(t_{i})$ , $y(t_{i})=y^{*}(t_{i})$ , $\theta(t_{i})=\theta^{*}(t_{i})$ for $i=1,2$ , with $t\in[t_{1},t_{2}]$ .

Proof. If the sub-path is a line then it is optimal. Assume it is not a line, and suppose it is not optimal. Then there exists a path satisfying the inherited constraints and having $0<a<a^{*}$ , meaning it contains circular arcs with lower curvature than those in $\mathbf{x}^{*}$ . We can insert it between $\mathbf{x}^{*}([0,t_{1}])$ and $\mathbf{x}^{*}([t_{2},t_{f}])$ to obtain a new path $\tilde{\mathbf{x}}(t)$ . Now either $\mathbf{x}^{*}$ and $\tilde{\mathbf{x}}$ have the same maximum curvature, or the maximum curvature of $\tilde{\mathbf{x}}$ is smaller. The latter contradicts our assumption that $\mathbf{x}^{*}$ is optimal. If they have the same maximum curvature, then $\tilde{\mathbf{x}}$ contains at least one circular arc with curvature $a^{*}$ and at least one circular arc with curvature $a$ , and so it does not satisfy the necessary conditions for optimality (specifically (13)). It then follows that $a^{*}$ cannot be a minimum, again contradicting our assumption on $\mathbf{x}$ . $\Box$

Lemma 19

Any path of type $XCY$ , where $X,Y$ are $C,O$ or $S$ but not both $C$ , is not optimal.

Proof. Since at least one of $X$ and $Y$ is $S$ or $O$ , in order for the path to be optimal it must have switching function on the (red) dashed ellipses in Figure 1. But since the middle $C$ in $XCY$ does not traverse a full loop it is impossible for the switching function to traverse a full ellipse and switch to $S$ or to $C$ or $O$ with the opposite orientation. (Note that we are assuming that, for example, in $CCS$ the two arcs have opposite orientation, otherwise this path would be written as $CS$ or $OCS$ ). In the case of $CCO$ (similarly $OCC$ ) this is perhaps not immediately obvious because the middle $C$ and the $O$ may have the same orientation, and then the switching function would not need to return to the origin of the phase portrait in Figure 1. However, we can reflect the loop about the endpoint with no change in maximum curvature, and the resulting path again is not compatible with the necessary conditions on the switching function. $\Box$

Corollary 20

Any path of type $SOSC$ or any permutation thereof, or type $COCC$ or any permutation thereof, cannot be optimal.

Proof. Note that the loop can be shifted anywhere along the path without changing the maximum curvature. For example a path of type $SOSC$ has the same maximum curvature as the path of type $SCO$ obtained by shifting the loop to the end. The latter is not optimal by Lemma 19, and therefore neither is the former. Similarly, the other permutations of $SOSC$ and permutations of $COCC$ are not optimal. $\Box$

Lemma 21

A $CCCC$ trajectory is not a solution to (Pc).

Sketch of the proof. Following the observations in Remark 14, a $CCCC$ trajectory must be of the type pictured in Figure 2, with the second and third circular arcs having the same length, which is greater than $\pi/a$ . The idea behind this proof is very simple: beginning with the $CCCC$ trajectory, we roll the circle $c_{1}$ around the circle $c_{0}$ while allowing the third arc to be ‘pulled-tight’ and allowing the fourth arc to relax to the circle centred at $\tilde{c}_{2}$ , to obtain the dashed trajectory in Figure 2. This new trajectory doesn’t satisfy the necessary conditions for optimality because the second circular arc doesn’t traverse a full circle (see Remark 14). However, it has the same maximum curvature as the $CCCC$ trajectory we started with, and therefore this maximum curvature cannot be minimal.

The reason the proof becomes somewhat technical below is that it does not seem to be easy to solve explicitly for the rolling angle $\beta_{0}$ which ensures that the dashed curve satisfies the length constraint. Instead we will prove (taking care with a couple of slightly different cases) that the length depends continuously on $\beta_{0}$ , and that there exist values of $\beta_{0}$ such that the length is greater than and less than the length of the original $CCCC$ trajectory.

Proof.

Figure 2: Diagram for the proof of Lemma 21, case 1: trajectory of type

CCCC

(solid) which satisfies the necessary conditions from the maximum principle, and another of type

CCSC

(dashed) with the same maximum curvature which does not.

Referring to Figure 2, we consider the points $c_{i}$ to be in the complex plane with $c_{0}=0$ , and assume that the radius of each of the circles is 1.

Case 1a: $\alpha_{3}<\pi/2$ and $\gamma<\pi/2$ . We reflect $c_{3}$ across $\theta_{f}$ and rotate $c_{1}$ about $c_{0}$ until we obtain the $CCSC$ trajectory pictured in Figure 2. Supposing there exists $\beta_{0}$ such that the pictured trajectory satisfies our length constraint, then it satisfies all the constraints of Problem (Pc). As explained in the sketch above, this second trajectory has the same maximum curvature as the original $CCCC$ but cannot be optimal, and therefore the $CCCC$ trajectory is not optimal.

It remains to prove that there exists $\beta_{0}$ such that the $CCSC$ trajectory satisfies the length constraint. Referring to Figure 2 we have

	$\displaystyle\tilde{c}_{1}$	$\displaystyle=2e^{i\beta_{0}}$
	$\displaystyle\tilde{c}_{2}$	$\displaystyle=z_{f}+e^{i(\pi/2+\theta_{f})}$
	$\displaystyle L$	$\displaystyle=\alpha_{0}+\beta_{0}+\beta_{1}+\ell+\beta_{2}$
	$\displaystyle\beta_{0}-\beta_{1}+\beta_{2}$	$\displaystyle=\theta_{f}-\pi/2$

where $L$ is the total length. If we let

w=\tilde{c}_{2}-\tilde{c}_{1}=z_{f}+e^{i(\pi/2+\theta_{f})}-2e^{i\beta_{0}}

then

$\displaystyle\ell$	$\displaystyle=\sqrt{\|w\|^{2}-2}$
$\displaystyle\delta$	$\displaystyle=\cos^{-1}(2/\|w\|)$
$\displaystyle\beta_{1}$	$\displaystyle=\delta-\arg(w)-\pi-\beta_{0}=\cos^{-1}(2/\|w\|)-\arg(w)-\pi-\beta_{0}$
$\displaystyle\beta_{2}$	$\displaystyle=\theta_{f}-\pi/2+\beta_{1}-\beta_{0}=\theta_{f}-3\pi/2+\cos^{-1}(2/\|w\|)-\arg(w)-2\beta_{0}$
$\displaystyle L$	$\displaystyle=\alpha_{0}+2\beta_{1}+\ell+\theta_{f}-\pi/2=\alpha_{0}+2\cos^{-1}(2/\|w\|)-2\arg(w)-2\pi-2\beta_{0}+\ell+\theta_{f}-\pi/2$	(20)

Since $w$ is determined by the boundary conditions and $\beta_{0}$ , it follows that each of $\ell,\delta,\beta_{1},\beta_{2}$ are determined by the the boundary conditions and $\beta_{0}$ and in fact $L$ depends continuously on $\beta_{0}$ . If we can prove that there are values of $\beta_{0}$ for which $L$ is above and below the length constraint, then by the intermediate value theorem a trajectory satisfying the length constraint exists.

Note that we can make $L$ as large as we like by increasing $\beta_{0}$ , and wrapping the trajectory around the circle centred at $c_{0}$ if necessary. To prove that $L<t_{f}$ can be achieved we consider two cases. First we assume that as in Figure 2 the reflected circle centred at $\tilde{c}_{2}$ does not intersect the circle centred at $c_{1}$ , and so we can take $\beta_{0}=0$ and observe that the new $CCSC$ trajectory, where the $S$ is a common tangent to $c_{1}$ and $\tilde{c}_{2}$ , is shorter. In the case where the reflected circle does intersect the circle centred at $c_{1}$ , we consider the $CCC$ trajectory pictured in Figure 3. Here $\ell=0$ and $\beta_{0}$ is at its minimum.

Figure 3: Left: trajectory of type

CCCC

(solid) which satisfies the necessary conditions for optimality, and another of type

CCC

(dashed) which we prove is shorter, and can therefore be extended to a trajectory of type

CCSC

(as in Figure 2) with the correct length. Right: enlargement of the internal triangles from the

CCC

trajectory.

The length of the original trajectory is $L_{1}=\alpha_{0}+2\pi+2\gamma+\alpha_{3}$ , and from (20) the length of the $CCC$ trajectory is $L_{2}=\alpha_{0}+2\beta_{1}+\theta_{f}-\pi/2$ . Hence, using $\alpha_{3}=\frac{\pi}{2}-\theta_{f}$

L_{1}-L_{2}=2\pi+2\gamma+2\alpha_{3}-2\beta_{1}

and our aim is to prove that this quantity is positive. With the angles labelled as in Figure 3, noting that $\gamma=\pi-\bar{\gamma}$ and $\beta_{1}=2\pi-\phi$ , we need to prove that $\bar{\gamma}<\phi+\alpha_{3}$ . For this, defining $s:=|c_{3}-c_{0}|$ note that

|\tilde{c}_{2}-c_{0}|=8-8\cos\phi=4+s^{2}-2s\cos(\theta_{0}-\alpha_{3})

and that this is still true if $\alpha_{3}>\theta_{0}$ . Supposing we allow $\alpha_{3}$ to vary, then we determine that

\frac{d\phi}{d\beta_{0}}=-\frac{s\sin(\theta_{0}-\alpha_{3})}{4\sin\phi}\frac{d\alpha_{3}}{d\beta_{0}}

Note that as $\beta_{0}$ increases so too does $\alpha_{3}$ , and so $\frac{d\phi}{d\beta_{0}}$ is negative for $\alpha_{3}<\theta_{0}$ and positive for $\alpha_{3}>\theta_{0}$ . It follows that

\frac{d}{d\beta_{0}}(\phi+\alpha_{3})=\frac{d\phi}{d\beta_{0}}\left(1-\frac{s\sin(\theta_{0}-\alpha_{3})}{4\sin\phi}\right).

Since $\phi+\alpha_{3}=\bar{\gamma}$ at $\beta_{0}=0$ , if we can show that the above quantity is always positive then we have the required result: $\bar{\gamma}<\phi+\alpha_{3}$ . When $\alpha_{3}>\theta_{0}$ this is immediate: all the quantities on the right hand side are positive. For $\alpha_{3}<\theta_{0}$ we refer to the right hand side of Figure 3 and observe that

\frac{\sin(\theta_{0}-\alpha_{3})}{q}=\frac{\sin\psi}{2}=\frac{\sin\phi}{p}

and therefore $\frac{\sin(\theta_{0}-\alpha_{3})}{\sin(\phi)}=\frac{q}{p}$ . Now $\frac{d}{d\beta_{0}}(\phi+\alpha_{3})$ will be positive if $4p>qs$ . Comparing with Figure 3, we know $q<1$ and since $\bar{\gamma}>\pi/2$ (by assumption) we have that $p>s/4$ , so we are done with this case.

Case 1b: $\alpha_{3}\geq\pi/2$ and $\gamma<\pi/2$ . In this case we cannot reflect $c_{3}$ about $\theta_{f}$ . We set $\tilde{c}_{2}$ to $c_{3}+2i$ instead (i.e. vertically above $c_{3}$ ), and then construct a trajectory of type $CCSCC$ . This is not very different to the previous case.

Figure 4: Diagram for Case 2 (

\gamma\geq\frac{\pi}{2}

) in the proof of Lemma 21: a trajectory of type

CCCC

(solid) which satisfies the necessary conditions from the maximum principle, and another of type

CCSC

(dashed) with the same maximum curvature which does not.

Case 2: $\gamma\geq\pi/2$ . Referring to Figure 4, let $\tilde{w}:=c_{3}-\tilde{c}_{1}$ and then we have

	$\displaystyle\tilde{c}_{1}$	$\displaystyle=2e^{i\beta_{0}}$
	$\displaystyle c_{3}$	$\displaystyle=z_{f}+e^{i(\theta_{f}-\pi/2)}$
	$\displaystyle l$	$\displaystyle=\|\tilde{w}\|$
	$\displaystyle L_{2}$	$\displaystyle=\alpha_{0}+\beta_{0}+\beta_{1}+\ell+\beta_{2}+\alpha_{3}$
		$\displaystyle=\alpha_{0}+2\beta_{0}+2\pi+\|\tilde{w}\|+\alpha_{3}$

Arguing as in case 1, we have that $L_{2}$ depends continuously on $\beta_{0}$ and it is enough to show that $L_{2}<L_{1}$ when $\beta_{0}=0$ , where $L_{1}=\alpha_{0}+2\pi+2\gamma+\alpha_{3}$ is the length of the original trajectory. Indeed, at $\beta_{0}$ we have $L_{1}-L_{2}=2\gamma-|\tilde{w}|$ . Noting that $|\tilde{w}|^{2}=8+8\cos\gamma$ and recalling that we have assumed $\gamma>\pi/2$ we have that $|\tilde{w}|^{2}\leq 8<(2\gamma)^{2}$ and therefore $L_{1}>L_{2}$ at $\beta_{0}=0$ . $\Box$

Theorem 22

Any solution to problem (P) is of type $CXC$ or sub-path thereof, where $X$ can be $S$ , $C$ , or $O$ ; or of type $SOS$ or sub-path thereof. The radii of all the $C$ and $O$ arcs in a solution are the same.

Proof. First note that any path containing more than one loop cannot be optimal. The loops can be brought together without changing the maximum curvature, but a sub-path of type $OO$ is not optimal - it can be expanded into a single loop with lower curvature. Consider all paths of type $SXYZ$ , where $X,Y,Z$ can each be $C,O,S$ or void,

•

$X=C$ : $SC$ can be optimal, $SCY$ cannot, by Lemma 19.
•

$X=O$ : $SO$ can be optimal and so can $SOS$ , but $SOO$ cannot by the argument above, and $SOC$ is excluded by Lemma 20. Similarly, we rule out $SOSO$ and $SOSC$ .
•

A single straight line segment $S$ can be optimal (similarly $C$ and $O$ ).

By Lemma 18 we have exhausted the possibilities for optimal paths beginning with $S$ . Next consider paths of type $CXYZ$ :

•

$X=C$ : $CC$ and $CCC$ can be optimal, $CCO$ and $CCS$ are not optimal by Lemma 19. $CCCC$ , $CCCO$ and $CCCS$ are excluded by Lemmas 21, Corollary 20 and Lemma 19 respectively.
•

$X=O$ : $CO$ and $COC$ can be optimal, $COS$ is ruled out by Corollary 20 along with $COCC$ and $COCS$ .
•

$X=S$ : $CS$ and $CSC$ are possible, $CSO$ and $CSCO$ are not optimal by Corollary 20, and Lemma 19 rules out $CSCS$ and $CSCC$ .

Finally, for paths of type $OXYZ$ : $OC$ can be optimal but $OCC$ and $OCS$ are not by Lemma 19 and Corollary 20 respectively, $OS$ can be optimal but $OSC$ is not by Lemma 20.

$\Box$

The boundary conditions that allow for paths of types $SOS$ and $COC$ are quite specific, and so we are able to prove a little more about the conditions under which these paths may be optimal.

Proposition 23

Consider an $SOS$ path as per Figure 5, and let $d=\|z(0)-z(t_{f})\|$ be the distance between the initial and final points. If $\frac{d}{t_{f}}>b$ where $b$ is the solution of $\operatorname{sinc}^{-1}(b)(1-b)=\pi/2$ , then the path is not optimal.

Proof.

Figure 5: Diagram for the proof of Lemma 23: trajectory of type

SCS

, and a trajectory of type

CCC

with lower maximum curvature.

We show that with the assumption on $d/t_{f}$ , the proposed trajectory of type $CCC$ in Figure 5 has $r>R$ . Note that $d=4r\sin(\alpha)$ (still true if $\alpha$ is obtuse) and $t_{f}=4r\alpha$ gives $d/t_{f}=\operatorname{sinc}(\alpha)$ . $d/t_{f}$ is at most $1$ and so $\operatorname{sinc}^{-1}(d/t_{f})$ has a solution between $0$ and $\pi$ and $\alpha=\operatorname{sinc}^{-1}(d/t_{f})$ . Moreover from $2\pi R+d=t_{f}$ we then have

R=\frac{t_{f}-d}{2\pi}=\frac{4r(\alpha-\sin\alpha)}{2\pi}

and therefore $r>R$ while $2(\alpha-\sin\alpha)/\pi<1$ . Rewriting this inequality in terms of $d/t_{f}$ gives

\operatorname{sinc}^{-1}\left(\frac{d}{t_{f}}\right)\left(1-\frac{d}{t_{f}}\right)<\frac{\pi}{2}

and on the domain $[0,1)$ the left hand side of the inequality is decreasing.

$\Box$

Remark 24

Note that the full circular arc, or the loop, in Figure 5 can be placed anywhere on the straight line $S$ without changing the maximum curvature or the boundary conditions and so the above result also applies to paths of type $OS$ and $SO$ . $\Box$

Note that a path of type $COC$ cannot be optimal if the $C$ ’s have different orientations. If they do then the loop can be shifted to the terminal point to obtain a path of type $CCO$ with the same maximum curvature, but such a path is not optimal by Lemma 19.

Proposition 25

Consider a path of type $COC$ and of length $t_{f}=\beta/a+2\,\pi$ as in Figure 6 (solid). Such a path can be optimal only if $\beta\leq\frac{\pi}{2}$ .

Proof. For comparison we consider the $CCC$ path also pictured in Figure 6 (dashed). Assuming that the circular arcs all have radius 1, the length of the $COC$ path is $L_{1}=\beta+2\pi$ , and the length of the $CCC$ is $L_{2}=2\alpha+2\pi-\beta=4\pi-3\beta$ . Therefore if $\beta>\frac{\pi}{2}$ then $L_{2}<L_{1}$ and then, adjusting the angles as necessary, the radii of each of the circular arcs in the $CCC$ curve can be increased (reducing the maximum curvature) until the curve has the correct length $\beta+2\pi$ , and so the original $COC$ is not a minimiser. $\Box$

Figure 6: Diagram for Proposition 25: A path of type

COC

(solid) and shorter curve of type

CCC

(dashed) with the same maximum curvature.

Remark 26

Note that the full circular arc, or the loop, in Figure 6 can be placed anywhere on the circular arc $C$ between $z_{0}$ and $z_{f}$ without changing the maximum curvature or the boundary conditions and so the above result also applies to paths of type $OC$ and $CO$ . $\Box$

The numerical experiments in Section 5.1, specifically Examples 1 and 2, suggest that the converses to Propositions 23 and 25 are true. It may be possible to verify this by direct comparison with each of the potential minimisers allowed by Theorem 22 which satisfy the constraints (at least in the $SOS$ case there do not seem to be too many of these), but we will not attempt to do so here.

4.1 Proof of Proposition 2

Proof. The proof is sketched in [7] but for the convenience of the reader we fill out the details. For any $p\in[1,\infty)$ , consider the $p$ -elastic energy $\int_{0}^{t_{f}}\|\ddot{z}(t)\|^{p}dt$ for $z$ in the Sobolev space $W^{2,p}((0,t_{f}),\mathbb{R}^{2})$ and satisfying the constraints of Problem (P). The existence of minimisers of the $p$ -elastic energy subject to Dirichlet boundary conditions is proved using the direct method in [19, Proposition 4.1], and the same proof works for the boundary conditions we have here. We may therefore consider a sequence $(z_{p})$ where each $z_{p}$ is a minimiser of the $p$ -elastic energy subject to the constraints of problem (P), and let $z_{*}$ be any element of $W^{2,\infty}((0,t_{f}),\mathbb{R}^{2})$ satisfying the constraints. Then for any $p\geq q$ , using the minimality of $z_{q}$ and $z_{p}$ and the embedding $L^{p}\subset L^{q}$ we have

{\|\ddot{z}_{q}\|}_{L^{q}}\leq{\|\ddot{z}_{p}\|}_{L^{q}}\leq c{\|\ddot{z}_{p}\|}_{L^{p}}\leq c{\|\ddot{z}_{*}\|}_{L^{p}}\leq{t_{f}}^{\tfrac{1}{q}}{\|\ddot{z}_{*}\|}_{L^{\infty}}

(21)

where $c={t_{f}}^{\tfrac{1}{q}-\tfrac{1}{p}}$ . Note also that for any $z\in W^{2,q}((0,t_{f}),\mathbb{R}^{2})$ satisfying the constraints of (P), since $\|\dot{z}\|=1$ we have ${\|\dot{z}\|}_{L^{q}}={t_{f}}^{\frac{1}{q}}$ , and if $q\geq 2$ then, using the fundamental theorem of calculus, Cauchy-Schwarz’ inequality and Young’s inequality with $\varepsilon$ :

\displaystyle\|z(t)\|^{2}=\|z(0)\|^{2}+\int_{0}^{t}\frac{d}{d\tau}\|z(\tau)\|^{2}d\tau=\|z(0)\|^{2}+\int_{0}^{t}\langle z(\tau),\dot{z}(\tau)\rangle d\tau\leq\|z(0)\|^{2}+\frac{1}{2\varepsilon}\|z\|^{2}_{L^{2}}+\frac{t\varepsilon}{2}.

Rearranging and integrating over $(0,t_{f})$ gives

\displaystyle\|z\|^{2}_{L^{2}}-\frac{t_{f}}{2\varepsilon}\|z\|^{2}_{L^{2}}

\displaystyle\leq t_{f}\|z(0)\|^{2}+t_{f}^{2}\varepsilon,

and then setting $\varepsilon=t_{f}$ gives

\|z\|_{L^{q}}\leq c\|z\|^{2}_{L^{2}}\leq 2c(t_{f}\|z(0)\|^{2}+t_{f}^{3}).

for $q\geq 2$ . Combining these lower order estimates with (21), we have that for each $2\leq q<\infty$ the subsequence $(z_{p})_{p\geq q}$ is bounded in $W^{2,q}((0,t_{f}),\mathbb{R}^{2})$ , and therefore has a subsequence which converges weakly in $W^{2,q}((0,t_{f}),\mathbb{R}^{2})$ (by the local weak compactness of reflexive Banach spaces, [23, p. 126]) and strongly in $C^{1}([0,t_{f}],\mathbb{R}^{2})$ (by the Rellich-Kondrachov theorem, [1, Theorem 6.3]). By induction we can construct successive subsequences $(z_{p_{2}(i)})(z_{p_{3}(i)}),\ldots(z_{p_{k}(i)}),\ldots$ , such that $(z_{p_{k+1}(i)})$ is a subsequence of $(z_{p_{k}(i)})$ which converges weakly in $W^{2,k+1}$ . Then the diagonal sequence $(z_{\tilde{p}(i)})$ where $\tilde{p}(i)=p_{i}(i)$ converges weakly in $W^{2,k}$ for every $k\geq 2$ . Indeed, since weak convergence in $W^{2,k}$ implies weak convergence in $W^{2,q}$ for all $q\leq k$ by the embedding $W^{2,q}\subset W^{2,k}$ , we have that $z_{\tilde{p}(i)}$ converges weakly in $W^{2,q}$ (and strongly in $C^{1}$ ) for every $q\geq 2$ to $z_{\infty}\in\bigcap_{q\geq 2}W^{2,q}((0,t_{f}),\mathbb{R}^{2})$ . By the strong $C^{1}$ convergence, $z_{\infty}$ satisfies the constraints. Since $\|\ddot{z}_{\infty}\|_{L^{q}}$ is uniformly bounded for all $q$ , we have $z_{\infty}\in W^{2,\infty}((0,t_{f}),\mathbb{R}^{2})$ and $\lim_{p\to\infty}{\|\ddot{z}_{\infty}\|}_{L^{p}}={\|\ddot{z}_{\infty}\|}_{L^{\infty}}$ by [1, Theorem 2.14], and also $\lim_{p\to\infty}{\|\ddot{z}_{p}\|}_{L^{p}}={\|\ddot{z}_{\infty}\|}_{L^{\infty}}$ . Finally, $z_{\infty}$ is a minimiser of maximum curvature, because if not then there exists $z$ and $\varepsilon>0$ such that ${\|\ddot{z}\|}_{L^{\infty}}={\|\ddot{z}_{\infty}\|}_{L^{\infty}}-\varepsilon$ , and then choosing $p$ sufficiently large:

{t_{f}}^{-\frac{1}{p}}{\|\ddot{z}_{p}\|}_{L^{p}}>{\|\ddot{z}_{\infty}\|}_{L^{\infty}}-\tfrac{\varepsilon}{2}>{\|\ddot{z}\|}_{L^{\infty}}\geq{t_{f}}^{-\frac{1}{p}}{\|\ddot{z}\|}_{L^{p}}

which contradicts the assumption that $z_{p}$ is a minimiser of the $p$ -elastic energy. $\Box$

5 A Numerical Method

In this section, we provide a numerical method utilizing arc, or switching time, parametrization techniques earlier used for optimal control problems exhibiting discontinuous controls in [13, 14, 12, 18]. Theorem 22 asserts that a curve of minimax curvature can be of type (described by the strings) $CCC$ , or $COC$ , or $CSC$ , or $SOS$ , or a substring thereof, and nothing else. Then the problem of finding such a curve can be reduced to finding a correct concatenation/configuration of circular arcs ( $C$ ), straight lines ( $S$ ) and a loop ( $O$ ) as listed, as well as finding the length of each arc involved. Recall that a loop is nothing but a circular arc with length $2\,\pi/a$ . Therefore, for computational purposes, we will treat a loop as any other circular arc $C$ with an unknown length. We also note that if there are more than one circular arc in a string then they must have the same curvature $a$ , i.e., the same radius.

A circular arc $C$ can either be a left-turn or a right-turn arc, which we denote by $L$ and $R$ , respectively. Then a curve of type $CCC$ can be represented by the string $LRLR$ . Clearly, at least one of the arcs in $LRLR$ must be of zero length in order to represent $CCC$ or a substring thereof, and this should be determined by the numerical method to be proposed. A curve of type $CSC$ can similarly be represented by $LRSLR$ , again with at least one of the arcs of type $L$ or $R$ being of zero length. Note that if the length of the straight line (S) is zero, then $LRSLR$ effectively reduces to $LRLR$ representing $CCC$ . Therefore, the string $LRSLR$ is general enough to represent both of the types $CCC$ and $CSC$ . The type $SCS$ (or equivalently $SOS$ ), on the other hand, can be represented by the string $SLRS$ . Recall that, by Remark 24, any solution of type $SOS$ can be obtained from a solution of type $SO$ or $OS$ , simply by placing the loop anywhere on the straight line segment between the endpoints. Therefore either the string $SLR$ or the string $LRS$ will suffice to represent $SCS$ . As a result, one would cover all the solution types in Theorem 22 by using the string $LRSLR$ .

Let $L_{\xi_{1}}$ denote a left-turn arc with length $\xi_{1}$ , $R_{\xi_{2}}$ a right-turn arc with length $\xi_{2}$ and $S_{\xi_{3}}$ a straight line with length $\xi_{3}$ . Similarly, let $L_{\xi_{4}}$ denote a left-turn arc with length $\xi_{4}$ and $R_{\xi_{5}}$ a right-turn arc with length $\xi_{5}$ . Then the types of solution curves described in Theorem 22 can all be represented by the string

L_{\xi_{1}}R_{\xi_{2}}S_{\xi_{3}}L_{\xi_{4}}R_{\xi_{5}}\,.

For example, the type $RLR$ is given with $\xi_{1}=\xi_{3}=0$ and $\xi_{2},\xi_{4},\xi_{5}>0$ . On the other hand, the type $LR$ is obtained either with $\xi_{3}=\xi_{4}=\xi_{5}=0$ and $\xi_{1},\xi_{2}>0$ or with $\xi_{1}=\xi_{2}=\xi_{3}=0$ and $\xi_{4},\xi_{5}>0$ .

Let the initial time $t_{0}:=0$ and the terminal time $t_{5}:=t_{f}$ . Also define the switching times $t_{j}$ , $j=1,\ldots 4$ , such that

\xi_{j}:=t_{j}-t_{j-1}\,,\quad\mbox{for }j=1,\ldots,5\,.

(22)

The lengths $\xi_{j}$ may also be referred to as arc durations. Note that along an arc of type $L$ or $R$ the optimal control is bang–bang; in particular, $v(t)\equiv 1$ along an arc of type $L$ and $v(t)\equiv-1$ along an arc of type $R$ . Along an arc of type $S$ , on the other hand, optimal control is singular; in particular, $v(t)\equiv 0$ , by Lemma 7. Since $v(t)$ is constant ( $1$ , $-1$ or 0) for $t_{j-1}\leq t<t_{j}$ (along the $j$ th arc), the ODEs in Problem (OC) with $\alpha(t)=a$ , or equivalently Problem (Pc) with $u(t)=a\,v(t)$ , can be solved as follows.

	$\displaystyle\theta(t)=\theta(t_{j-1})+a\,v(t)\,(t-t_{j-1})\,,\quad\mbox{ if }j=1,\ldots,5\,,$		(23)
	$\displaystyle x(t)=\left\{\begin{array}[]{ll}x(t_{j-1})+\big{(}\sin\theta(t)-\sin\theta(t_{j-1})\big{)}/\big{(}a\,v(t)\big{)}\,,&\mbox{ if }j=1,2,4,5\,,\\[2.84526pt] x(t_{j-1})+\cos\theta(t)\,(t-t_{j-1})\,,&\mbox{ if }j=3\,,\end{array}\right.$		(26)
	$\displaystyle y(t)=\left\{\begin{array}[]{ll}y(t_{j-1})+\big{(}\cos\theta(t)-\cos\theta(t_{j-1})\big{)}/\big{(}a\,v(t)\big{)}\,,&\mbox{ if }j=1,2,4,5\,,\\[2.84526pt] y(t_{j-1})+\sin\theta(t)\,(t-t_{j-1})\,,&\mbox{ if }j=3\,,\end{array}\right.$		(29)

where

v(t)=\left\{\begin{array}[]{rl}1\,,&\mbox{ if }j=1,4\,,\\[2.84526pt] -1\,,&\mbox{ if }j=2,5\,,\\[2.84526pt] 0\,,&\mbox{ if }j=3\,,\end{array}\right.

(30)

for $t_{j-1}\leq t<t_{j}$ . We have used the solution for $\theta(t)$ in (23) in obtaining the solutions for $x(t)$ and $y(t)$ in (26)–(29). Note that the control variable $v(t)$ is a piecewise constant function, which takes here the sequence of values $\{1,-1,0,1,-1\}$ . It should be clear from the context that the control variable $u(t)$ defined in Problem (Pc) takes the sequence of values $\{a,-a,0,a,-a\}$ . After evaluating the state variables in (23)–(29) at the switching times and carrying out algebraic manipulations, one can equivalently rewrite Problem (Pc), or Problem (OC), as follows.

\mbox{(Ps)}\left\{\begin{array}[]{rl}\min&\ a\\[11.38109pt] \mbox{s.t.}&\displaystyle\ x_{0}-x_{f}+\frac{1}{a}\big{(}-\!\sin\theta_{0}+2\,\sin\theta_{1}-2\,\sin\theta_{2}+2\,\sin\theta_{4}-\sin\theta_{f}\big{)}+\xi_{3}\,\cos\theta_{2}=0\,,\\[8.53581pt] &\displaystyle\ y_{0}-y_{f}+\frac{1}{a}\big{(}\cos\theta_{0}-2\,\cos\theta_{1}+2\,\cos\theta_{2}-2\,\cos\theta_{4}+\cos\theta_{f}\big{)}+\xi_{3}\,\sin\theta_{2}=0\,,\\[8.53581pt] &\displaystyle\ \sin\theta_{f}=\sin\theta_{5}\,,\ \ \cos\theta_{f}=\cos\theta_{5}\,,\\[5.69054pt] &\displaystyle t_{f}=\sum_{j=1}^{5}\,\xi_{j}\,,\qquad\xi_{j}\geq 0\,,\ \mbox{ for }j=1,\ldots,5\,,\end{array}\right.

where

\theta_{1}=\theta_{0}+a\,\xi_{1}\,,\qquad\theta_{2}=\theta_{1}-a\,\xi_{2}\,,\qquad\theta_{4}=\theta_{2}+a\,\xi_{4}\,,\qquad\theta_{5}=\theta_{4}-a\,\xi_{5}\,.

(31)

Substitution of $\theta_{1}$ , $\theta_{2}$ , $\theta_{4}$ and $\theta_{5}$ in (31) into Problem (Ps) yields a finite-dimensional nonlinear optimization problem in just six variables, $\xi_{j}$ , $j=1,\ldots,5$ , and $a$ .

Remark 27

With the constraints $\sin\theta_{f}=\sin\theta_{5}$ and $\cos\theta_{f}=\cos\theta_{5}$ , we make sure that we satisfy the slope condition at the terminal point. Otherwise, the constraint $\theta_{f}=\theta_{5}$ is stronger, and imposing it might result in missing some of the feasible solutions. $\Box$

Remark 28

Problem (Ps) can be solved by standard optimization methods and software, for example, Algencan [2, 3], which implements augmented Lagrangian techniques, or Ipopt [22], which implements an interior point method, or SNOPT [8], which implements a sequential quadratic programming algorithm, or Knitro [4], which implements various interior point and active set algorithms to choose from. For general nonconvex optimization problems like Problem (Ps), what one can hope for, by using these software, is to get (at best) a locally optimal solution. $\Box$

5.1 Numerical experiments

For computations numerically solving Problem (Ps), or equivalently Problem (P), we have employed the AMPL–Knitro computational suite: AMPL is an optimization modelling language [6] and Knitro is a popular optimization software [4] (Version 3.0.1 is used here). In all runs, we have used the Knitro options ‘alg=0 feastol=1e-12 opttol=1e-12’ in AMPL; in other words, we have set both the feasibility and optimality tolerances at $10^{-12}$ , and the particular algorithm to be employed was left to be chosen by Knitro itself.

In what follows, we present example instances of problems for which we solve Problem (Ps) by using the AMPL–Knitro suite and display the solution curves. The best numerical solution we have identified for an instance is depicted by a solid curve in the graphs. The other solutions reported by the suite as “locally optimal” are depicted by dotted curves, and referred to here as “critical”, as they satisfy the necessary conditions of optimality furnished by the maximum principle.

5.1.1 Example 1

Recall from Corollary 23 that $b$ is the solution of the equation $\operatorname{sinc}^{-1}(b)(1-b)=\pi/2$ . After rearranging this equation one gets $f(b):=b-\operatorname{sinc}\big{(}\pi/(2\,(1-b))\big{)}=0$ . A numerical solution to $f(b)=0$ can be obtained as $b\approx 0.319966693534110$ (correct to 15 dp) in seven iterations by using the secant method starting with initial points $0.1$ and $0.2$ . The uniqueness of this solution can be established by observing that the graph of $f$ crosses the $b$ -axis only once (not shown here).

Figure 7 shows curves of minimax curvature between the oriented points $(x_{0},y_{0},\theta_{0})=(0,0,0)$ and $(x_{f},y_{f},\theta_{f})=(0,1,0)$ , which is a similar situation to that in the diagram in Figure 5. The orientation of the endpoints of the curves in Figure 7 are depicted by (black) arrows. We note that, for the two given endpoints, $d=1$ .

Proposition 23 asserts that the $SOS$ type of trajectory is not optimal (nor $OS$ or $SO$ , by Remark 24) for $t_{f}<1/b$ , i.e., $t_{f}\leq 3.12532$ (correct to 5 dp). This is exemplified in Figures 7(a) and 7(b) respectively for $t_{f}=1.5$ and $t_{f}=3$ , where it is numerically verified that the CCC concatenation of curves is optimal: For $t_{f}=1.5$ , the solution curve has $a=3.9887508486$ (correct to 10 dp) and is of type $R_{\xi_{2}}L_{\xi_{4}}R_{\xi_{5}}$ (with $\xi_{1}=\xi_{3}=0$ ), where the arc lengths $\xi_{2}=0.375$ , $\xi_{4}=0.75$ , and $\xi_{5}=0.375$ . For $t_{f}=3$ , $a=3.0384835468$ (correct to 10 dp) and the solution is of the same type, with $\xi_{2}=0.75$ , $\xi_{4}=1.5$ , and $\xi_{5}=0.75$ . We point to the symmetry in the arc lengths in both cases, presumably because of the colinear configuration of the endpoints.

The converse of Proposition 23 for this example instance (which is not proved) is that the $SOS$ (or $OS$ or $SO$ , by Remark 24) concatenation is optimal for $t_{f}\geq 1/b$ , i.e., $t_{f}\geq 3.12533$ (correct to 5 dp). This converse is exemplified in Figures 7(c) and 7(d) respectively for $t_{f}=5$ and $t_{f}=7$ . For $t_{f}=5$ , the solution curve has $a=\pi/2$ and is of type $L_{\xi_{1}}S_{\xi_{3}}$ (with $\xi_{2}=\xi_{4}=\xi_{5}=0$ ), where $\xi_{1}=4$ , and $\xi_{3}=1$ . For $t_{f}=5$ , $a=\pi/3$ and the solution is of the same type, with $\xi_{1}=6$ , and $\xi_{3}=1$ .

Each of the solution curves presented in Figure 7 can be “flipped over” or “reflected about” the $x$ -axis to get “symmetric” solution curves: For example when an $LS$ curve is reflected one gets an $RS$ curve which has the same length, oriented endpoints and maximum curvature. An $RLR$ curve can also be reflected in the same way to get an $LRL$ curve which has the same length, oriented endpoints and maximum curvature. These reflected curves have also been obtained as computational solutions to Problem (Ps).

5.1.2 Example 2

Suppose that the endpoints are placed on a unit circle with the end velocities chosen tangent to the circle. Recall from Proposition 25 that a curve of minimax curvature between these endpoints can be of type $COC$ (or equivalently type $OC$ or type $CO$ , as the loop $O$ can be shifted anywhere on a circular arc $C$ ) only in the case when the endpoints are separated by at most $\pi/2$ radians, i.e., they are separated by an angle $\beta\leq\pi/2$ (see Figure 6), and $t_{f}$ is chosen as $\beta+2\,\pi$ . Here we numerically illustrate Proposition 25 and explore instances with various other values of the separation angle $\beta$ and with $t_{f}=\beta+2\,\pi$ —see Figure 8(a)–(d).

We place the pair of oriented endpoints on a unit circle, and impose $t_{f}=\beta+2\,\pi$ (so that a loop can be observed at least in a critical curve), in each part of Figure 8, and obtain optimal solutions as follows.

(a)

$\beta=\pi/3<\pi/2$ , i.e., $t_{f}=7\,\pi/3$ : $(-1/2,-\sqrt{3}/2,-\pi/6)$ and $(1/2,-\sqrt{3}/2,\pi/6)$ . The optimal curve is of type $L_{\xi_{4}}R_{\xi_{5}}$ (with $\xi_{1}=\xi_{2}=\xi_{3}=0$ ), where $a=1$ , $\xi_{4}=\pi/3$ , and $\xi_{5}=2\,\pi$ .
(b)

$\beta=2\,\pi/3$ , i.e., $t_{f}=8\,\pi/3$ : $(-\sqrt{3}/2,-1/2,-\pi/3)$ and $(\sqrt{3}/2,-1/2,\pi/3)$ . The optimal curve is of type $R_{\xi_{2}}L_{\xi_{4}}R_{\xi_{5}}$ (with $\xi_{1}=\xi_{3}=0$ ), where $a=0.8174619979$ , $\xi_{2}=1.4538775285$ , $\xi_{4}=5.4698253525$ , and $\xi_{5}=1.4538775285$ , all correct to 10 dp.
(c)

$\beta=4\,\pi/3$ , i.e., $t_{f}=10\,\pi/3$ : $(-\sqrt{3}/2,1/2,-2\,\pi/3)$ and $(\sqrt{3}/2,1/2,2\,\pi/3)$ . The optimal curve is of type $R_{\xi_{2}}L_{\xi_{4}}R_{\xi_{5}}$ , where $a=0.5245840019$ , $\xi_{2}=0.6217500975$ , $\xi_{4}=9.2284753170$ , and $\xi_{5}=0.6217500975$ , all correct to 10 dp.
(d)

$\beta=5\,\pi/3$ , i.e., $t_{f}=11\,\pi/3$ : $(-1/2,\sqrt{3}/2,-5\,\pi/6)$ and $(1/2,\sqrt{3}/2,5\,\pi/6)$ . The optimal curve is of type $R_{\xi_{2}}L_{\xi_{4}}R_{\xi_{5}}$ , where $a=0.5026360614$ , $\xi_{2}=0.2755293870$ , $\xi_{4}=10.9681142892$ , and $\xi_{5}=0.2755293870$ , all correct to 10 dp.

In each of the problem instances, one curve of $CO$ type, two curves of $CCC$ type, and one curve of $CSC$ type have been found as solutions of Problem (Ps) by the AMPL–Knitro suite. Of these four distinct solutions in each case, the curve with the smallest $a$ was identified as the optimal curve for that case.

Similarly to Example 5.1.1, the loop in each of the $LR$ curves presented in Figure 8 can be “flipped over” or “reflected about” the circle to get “symmetric” curves of type $L$ with $\xi_{1}=\beta+2\,\pi$ , with the same endpoints and maximum curvatures. These reflected curves have also been encountered as computational solutions to Problem (Ps), but are not being shown in Figure 8.

We note that if $t_{f}\neq\beta+\pi/2$ then one does not encounter a solution curve (optimal or only critical) which contains a loop.

5.1.3 Example 3

Consider the problem of finding a curve of minimax curvature between the oriented points $(0,0,-\pi/3)$ and $(0.4,0.4,-\pi/6)$ —note that the same oriented endpoints were used in [10] for finding the shortest curve of bounded curvature, i.e., the Markov–Dubins path. For various lengths, namely for $t_{f}=0.8,1.3,2$ and $2.5$ , here we find curves minimizing the bound $a$ on the curvature. Figure 9(a)–(d) depict these curves found as a solution of Problem (Ps) by employing the AMPL–Knitro software suite.

In Figure 9(a)–(d), the optimal curves are shown as solid curves. These curves have been identified amongst all the critical solutions to Problem (Ps) (of the allowed types) that the AMPL–Knitro suite could find, as the ones with the smallest curvature, or indeed with the biggest turning radius. All other critical curves are shown as dotted curves in Figure 9(a)–(d). A summary of all these solutions in each part of Figure 9 can be given as follows, with arc lengths and curvatures correct up to 10 dp.

(a)

$t_{f}=0.8$ : The optimal curve has $a=8.3661513485$ and is of type $L_{\xi_{1}}S_{\xi_{3}}R_{\xi_{5}}$ (with $\xi_{2}=\xi_{4}=0$ ), where $\xi_{1}=0.3141136578$ , $\xi_{3}=0.2343580660$ , $\xi_{5}=0.2515282761$ . Only one other critical curve is found which has $a=24.6216106492$ and is of type $RSR$ .
(b)

$t_{f}=1.3$ : The optimal curve has $a=6.0477371511$ and is of type $R_{\xi_{2}}L_{\xi_{4}}R_{\xi_{5}}$ with $\xi_{2}=0.0684530840$ , $\xi_{4}=0.6932888171$ , $\xi_{5}=0.5382580988$ . Four other critical curves are found which respectively have $a=7.8842574114$ , $a=15.7179879207$ , $a=9.3041564227$ , $a=6.4570352671$ , and are of types $RSR$ , $RSL$ , $LSL$ , $LRL$ .
(c)

$t_{f}=2$ : The optimal curve has $a=4.0557744873$ and is of type $R_{\xi_{2}}S_{\xi_{3}}R_{\xi_{5}}$ with $\xi_{2}=1.1520596173$ , $\xi_{3}=0.5799046398$ , $\xi_{5}=0.2680357429$ . Five other critical curves are found which respectively have $a=8.1329787696$ , $a=4.1795061273$ , $a=4.7799043550$ , $a=4.9575871447$ , $a=5.0128303633$ and are of types $RSL$ , $RLR$ , $LSL$ , $LRL$ , $LRL$ .
(d)

$t_{f}=2.5$ : The optimal curve has $a=3.0172721767$ and is of type $R_{\xi_{2}}S_{\xi_{3}}R_{\xi_{5}}$ with $\xi_{2}=1.5726285431$ , $\xi_{3}=0.5911279480$ , $\xi_{5}=0.3362435090$ . Six other critical curves are found which respectively have $a=6.0662510507$ , $a=3.0516505811$ , $a=3.8632611845$ , $a=3.5528730863$ , $a=4.1925834507$ , $a=3.6102865904$ and are of types $RSL$ , $RLR$ , $RLR$ , $LSL$ , $LRL$ , $LRL$ .

6 Conclusion

We have presented a classification of the solutions to the problem of finding planar curves of minimax curvature between two endpoints with the endpoint tangents specified (Problem (P)). We devised a numerical procedure for finding the critical points of Problem (P), or the $\infty$ -elastica, of the classification types we have presented. Of these critical solutions the one with the smallest curvature is the (global) solution to Problem (P), which we refer to as a curve of minimax curvature.

A natural extension of this paper would be the study of interpolating curves which not only satisfy the endpoint constraints but also must pass through a number of specified intermediate points.

It would also be interesting to study another extension of Problem (P), from curves in ${\rm{I\ \kern-5.39993ptR}}^{2}$ to curves in ${\rm{I\ \kern-5.39993ptR}}^{n}$ . In ${\rm{I\ \kern-5.39993ptR}}^{3}$ alone, it is highly likely that one would encounter curve segments of types different from just circular and straight line segments.

A further extension from the Euclidean space to curved spaces would be of great interest where the curves of minimax curvature would be in a Riemannian manifold.

References

[1] R. A. Adams, J. J. F. Fournier, Sobolev Spaces, Second Edition, Elsevier, 2003.
[2] R. Andreani, E. G. Birgin, J. M. Martínez, and M. L. Schuverdt, On augmented Lagrangian methods with general lower-level constraints, SIAM J. Optim., 18(4) (2008), pp. 1286–1309.
[3] E. G. Birgin and J. M. Martínez, Practical Augmented Lagrangian Methods for Constrained Optimization, SIAM Publications, 2014.
[4] R. H. Byrd, J. Nocedal, R. A. Waltz, KNITRO: An integrated package for nonlinear optimization. In: G. di Pillo and M. Roma, eds., Large-Scale Nonlinear Optimization, Springer, New York, 35–59, 2006. https://doi.org/10.1007/0-387-30065-1_4
[5] L. E. Dubins, On curves of minimal length with a constraint on average curvature and with prescribed initial and terminal positions and tangents, Amer. J. Math., 79 (1957), pp. 497–516.
[6] R. Fourer, D. M. Gay, and B. W. Kernighan, AMPL: A Modeling Language for Mathematical Programming, Second Edition, Brooks/Cole Publishing Company / Cengage Learning, 2003.
[7] E. Gallagher and R. Moser, The $\infty$ -elastica problem on a Riemannian manifold, J. Geom. Anal., 33(7) (2023), 226.
[8] P. E. Gill, W. Murray, and M. A. Saunders, SNOPT: an SQP algorithm for large-scale constrained optimization, SIAM Rev., 47(1) (2005), 99–131.
[9] R. Huang, A note on the $p$ -elastica in a constant sectional curvature manifold, J. Geom. Phys., 49(3–4) (2004), 343–-349.
[10] C. Y. Kaya, Markov–Dubins path via optimal control theory, Comput. Optim. Appl., 68(3) (2017), 719–747.
[11] C. Y. Kaya, Markov–Dubins interpolating curves, Comput. Optim. Appl., 73(2) (2019), 647–677.
[12] C. Y. Kaya, S. K. Lucas, and S. T. Simakov, Computations for bang–bang constrained optimal control using a mathematical programming formulation, Opt. Contr. Appl. Meth., 25 (2004), 295–308.
[13] C. Y. Kaya and J. L. Noakes, Computations and time‐optimal controls, Opt. Contr. Appl. Meth., 17 (1996), 171–185.
[14] C. Y. Kaya and J. L. Noakes, Computational algorithm for time-optimal switching control, J. Optim. Theory App., 117 (2003), 69–92.
[15] C. Y. Kaya and J. L. Noakes, Finding interpolating curves minimizing $L^{\infty}$ acceleration in the Euclidean space via optimal control theory, SIAM J. Control Optim., 51(1) (2013), 442–464.
[16] M. G. Kreĭn and A. A. Nudel’man, The Markov Moment Problem and Extremal Problems, Translations of Mathematical Monographs, Amer. Math. Soc., 1977.
[17] A. A. Markov, Some examples of the solution of a special kind of problem on greatest and least quantities, Soobscenija Charkovskogo Matematiceskogo Obscestva, 2-1(5,6), 250–276, 1889 (in Russian).
[18] H. Maurer, C. Büskens, J.-H. R. Kim, and C. Y. Kaya, Optimization methods for the verification of second order sufficient conditions for bang–bang controls, Opt. Cont. Appl. Meth., 26 (2005), 129–156.
[19] T. Miura and K. Yoshizawa, Pinned planar p-elasticae, arXiv preprint arXiv:2209.05721 (2022).
[20] R. Moser, Structure and classification results for the $\infty$ -elastica problem, Amer. J. Math., 144(5) (2022), 1299–1329.
[21] L. S. Pontryagin, V. G. Boltyanskii, R. V. Gamkrelidze, and E. F. Mishchenko, The Mathematical Theory of Optimal Processes (Russian), English translation by K. N. Trirogoff, ed. by L. W. Neustadt, Interscience Publishers, New York, 1962.
[22] A. Wächter and L. T. Biegler, On the implementation of a primal-dual interior point filter line search algorithm for large-scale nonlinear programming, Math. Program., 106 (2006), 25–57.
[23] K. Yosida, Functional Analysis, Fifth Edition, Springer-Verlag, 1978.