This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Curves of Minimax Curvature

C. Yalçın Kaya111Mathematics, UniSA STEM, University of South Australia, Mawson Lakes, S.A. 5095, Australia. E-mail: [email protected] .    Lyle Noakes222Department of Mathematics and Statistics, The University of Western Australia, Nedlands WA 6009, Australia. Email: [email protected]  .    Philip Schrader333School of Mathematics, Statistics, Chemistry and Physics, Murdoch University, Murdoch WA 6150, Australia. Email: [email protected]
Abstract

We consider the problem of finding curves of minimum pointwise-maximum curvature, i.e., curves of minimax curvature, among planar curves of fixed length with prescribed endpoints and tangents at the endpoints. We reformulate the problem in terms of optimal control and use the maximum principle, as well as some geometrical arguments, to produce a classification of the types of solutions. Using the classification, we devise a numerical method which reduces the infinite-dimensional optimization problem to a finite-dimensional problem with just six variables. The solution types, together with some further observations on optimality, are illustrated via numerical examples.

Key words. Minimax curvature, Markov–Dubins path, Optimal control, Singular control, Bang–bang control.

AMS subject classifications. Primary 49J15, 49K15  Secondary 65K10, 90C30

1 Introduction

We are interested in finding a curve z:[0,tf]IR2z:[0,t_{f}]\longrightarrow{\rm{I\ \kern-5.39993ptR}}^{2} which minimises the almost everywhere (a.e.) pointwise maximum curvature of z(t)z(t), i.e. the LL^{\infty}-norm of the curvature, among all curves having fixed length; prescribed endpoints z0z_{0} and zfz_{f} at 0 and tft_{f} respectively; and prescribed tangents v0v_{0} and vfv_{f} at the endpoints. Since the curvature is independent of parametrisation we will consider only curves which are parametrised with respect to arc length so that the curvature is z¨(t)\|\ddot{z}(t)\|, and the parameter tft_{f} is the length of the whole curve (and therefore tft_{f} is fixed). The problem can then be posed as

(P){minz()maxt[0,tf]z¨(t)s.t.z(0)=z0,z(tf)=zf,z˙(0)=v0,z˙(tf)=vf,z˙(t)=1, for a.e. t[0,tf],\mbox{(P)}\left\{\begin{array}[]{rl}\displaystyle\min_{z(\cdot)}&\ \displaystyle\max_{t\in[0,t_{f}]}\|\ddot{z}(t)\|\\[11.38109pt] \mbox{s.t.}&\ z(0)=z_{0}\,,\ z(t_{f})=z_{f}\,,\\[5.69054pt] &\ \dot{z}(0)=v_{0}\,,\ \dot{z}(t_{f})=v_{f}\,,\\[5.69054pt] &\ \|\dot{z}(t)\|=1\,,\mbox{ for a.e. }t\in[0,t_{f}]\,,\end{array}\right.

where z˙=dz/dt\dot{z}=dz/dt, z¨=d2z/dt2\ddot{z}=d^{2}z/dt^{2}, \|\cdot\| is the Euclidean norm, and v0v_{0} and vfv_{f} are given such that v0=vf=1\|v_{0}\|=\|v_{f}\|=1.

If the LL^{\infty}-norm is replaced by the squared L2L^{2}-norm, of z¨\ddot{z}, namely 0tfz¨(t)2𝑑t\int_{0}^{t_{f}}\|\ddot{z}(t)\|^{2}\,dt, then critical curves for Problem (P) become the celebrated (fixed-length) Euler’s elastica. The LpL^{p}-norm generalizations of the Euler elastica problem for p[1,)p\in[1,\infty), where 0tfz¨(t)2𝑑t\int_{0}^{t_{f}}\|\ddot{z}(t)\|^{2}\,dt is replaced by 0tfz¨(t)p𝑑t\int_{0}^{t_{f}}\|\ddot{z}(t)\|^{p}\,dt, have been referred to as the pp-elastica problem in the literature—see [9]. Similarly, potential minimisers of Problem (P) have been called the \infty-elastica in [20]. Here we refer to (P) as the problem of finding curves of minimum pointwise-maximum curvature, or simply curves of minimax curvature, because it is more descriptive and we are ultimately focused on finding global minimisers.

Another problem related to (P) is the Markov–Dubins problem, in which one is to minimize this time the (free) length tft_{f} of the curve of interest subject to a fixed bound aa imposed on the curvature, while the remaining constraints appearing in (P) are kept in place. This problem was first proposed and some instances studied by Andrey Markov in 1889 [17] (also see [16]) but fully solved by Lester Dubins only in 1957 [5]. Dubins’ result characterizes minimum-length solutions in terms of concatenations of straight lines and circular arcs: Let a straight line segment be denoted by an SS and a circular arc of the maximum allowed curvature aa (or, turning radius 1/a1/a) by a CC, then the shortest curve is a concatenation of type CCCCCC, or of type CSCCSC, or a subset thereof. See [10] and the references therein for a literature survey of the Markov–Dubins problem.

The first published work on Problem (P) appears to be that of Moser [20] where it is studied for curves in n\mathbb{R}^{n}, followed by the extension to Riemannian manifolds in [7]. In [20] \infty-elastica are defined as minimisers of the sum of maximum curvature and a penalty term which is a kind of L2L^{2}-distance between curves. In order to obtain differential equations which characterise \infty-elastica, Moser studies the Euler–Lagrange equations for the sum of pp-elastic energy with the same penalty term, in the limit pp\to\infty. From the differential equations Moser obtains a classification of \infty-elastica as concatenations of circular arcs and straight lines, similar to the solutions of the Markov-Dubins problem, except that \infty-elastica can contain more than three pieces, and closed loops are allowed. If we denote by OO a full circle with the maximum curvature, and C,SC,S as before, the conclusion is that \infty-elastica can be of type CSOSOCCSOSOC, perhaps with more or less loops and straight line segments, or of type CCCCC\ldots C with any number of CC segments and such that the interior segments have equal length and alternating orientation.

The present work takes a fundamentally different approach to that of Moser: we show that by using the same technique applied in [15], Problem (P) can be reformulated as an optimal control problem without any approximation by pp-elastic energy. As usual, the maximum principle then provides us with necessary conditions for optimality in the form of differential equations for the state and costate variables, and constraints on the control. A careful analysis of these conditions – in particular of the phase portrait of the switching function, which is very similar to the analysis of the Markov–Dubins problem in [10, 11] – allows us to draw similar conclusions to Moser’s about which types of curves can be optimal.

The necessary conditions obtained from the maximum principle are then combined with a series of geometric arguments, such as Lemma 21 showing that a curve of type CCCCCCCC cannot be optimal because it can be deformed into a feasible curve with the same maximum curvature, but which does not satisfy the necessary conditions. These observations lead to our main result, a refined classification in Theorem 22: any solution to Problem (P) is of type CCCCCC, or CSCCSC, or COCCOC, or SOSSOS, or a subset thereof, with all circular arcs having the same maximum possible radius (or minimum possible maximum curvature). So we have that, just like in the Markov–Dubins problem, solutions of problem (P) cannot consist of more than three segments. However, unlike in the Markov–Dubins problem, under specific conditions (Propositions 23 and 25), they can contain closed circular loops.

Since Problem (P) is related to the Markov–Dubins problem by interchange of constraint and cost, it is perhaps not surprising that the solutions are so similar. Indeed, a relationship between solutions of the two problems was established in [20, Proposition 5]: a solution of the Markov–Dubins problem is a solution to Problem (P). In Theorem 5 we give an alternative proof of this fact in terms of the optimal control formulation, and then demonstrate that the converse does not hold (Remark 6).

We devise a numerical method, which, using the classification of the solution curves stated in Theorem 22 and switching time optimization techniques, reduces the infinite-dimensional optimization problem (P) to a finite-dimensional optimization problem (Ps) with just six variables. We carry out extensive numerical experiments. We especially focus on solutions containing a loop to exemplify the specific results involving loops, as well as gain insights about those instances for which no specific results are available.

The paper is organized as follows. In Section 2, we reformulate Problem (P) as an optimal control problem. In Section 3, we write down the maximum principle for the optimal control problem. In Section 4, we classify the types of solutions for Problem (P) and provide the main theoretical result as Theorem 22. In Section 5, we develop a numerical method for solving Problem (P), and provide example applications and numerical solutions using off-the-shelf optimization software, so as to illustrate the theoretical results. Finally, in Section 6, concluding remarks and possible directions for future work are provided.

2 Reformulations

Obviously, if the distance separating z0z_{0} and zfz_{f} is more than the fixed length tft_{f} then Problem (P) has no solution. In fact, with tf=z0zft_{f}=\|z_{0}-z_{f}\|, a feasible solution may still not exist. Consider the points z0=(0,0)z_{0}=(0,0) and zf=(1,0)z_{f}=(1,0). With tf=1t_{f}=1, unless v0=vf=(1,0)v_{0}=v_{f}=(1,0), there exists no feasible curve between z0z_{0} and zfz_{f}. Therefore we make the following assumption.

Assumption 1

tf>z0zft_{f}>\|z_{0}-z_{f}\|.

Before reformulating (P) as an optimal control problem and classifying the minimisers, we first address the existence of solutions.

Proposition 2

(Existence of minimisers.) Given initial and final points z0,zfz_{0},z_{f} and directions v0,vfv_{0},v_{f} as in Problem (P), there exists a minimiser zW2,((0,tf),2)z\in W^{2,\infty}((0,t_{f}),\mathbb{R}^{2}) which is a solution to Problem (P).

Since the proof of Proposition 2 is somewhat long and does not contribute directly to our main result, we defer it until Section 4.1.

Problem (P) is nonsmooth because of the max\max operator appearing in its objective functional. On the other hand, Problem (P) can be transformed into a smooth variational problem by using a standard technique from nonlinear programming, in the same way it was also done in [15]:

(P1){mina,z()as.t.z(0)=z0,z(tf)=zf,z˙(0)=v0,z˙(tf)=vf,z¨(t)a,z˙(t)=1, for a.e. t[0,tf],\mbox{(P1)}\left\{\begin{array}[]{rl}\displaystyle\min_{a,z(\cdot)}&\ a\\[5.69054pt] \mbox{s.t.}&\ z(0)=z_{0}\,,\ z(t_{f})=z_{f}\,,\\[5.69054pt] &\ \dot{z}(0)=v_{0}\,,\ \dot{z}(t_{f})=v_{f}\,,\\[5.69054pt] &\ \|\ddot{z}(t)\|\leq a\,,\ \ \ \|\dot{z}(t)\|=1\,,\mbox{ for a.e. }t\in[0,t_{f}]\,,\end{array}\right.

where the bound a0a\geq 0 is a new optimization variable of the problem.

Remark 3

One has the solution a=0a=0 if and only if the curve joining z0z_{0} and zfz_{f} is a straight line. This is possible only in the case when v0=vfv_{0}=v_{f} and v0v_{0} and vfv_{f} are colinear with zfz0z_{f}-z_{0}. \Box

Problem (P) can equivalently be cast as an optimal control problem as follows. Letz(t):=(x(t),y(t))IR2z(t):=(x(t),y(t))\in{\rm{I\ \kern-5.39993ptR}}^{2}, with x˙(t):=cosθ(t)\dot{x}(t):=\cos\theta(t) and y˙(t):=sinθ(t)\dot{y}(t):=\sin\theta(t), where θ(t)\theta(t) is the angle the velocity vector z˙(t)\dot{z}(t) of the curve z(t)z(t) makes with the horizontal. These definitions verify that z˙(t)=1\|\dot{z}(t)\|=1. Moreover,

z¨2=x¨2+y¨2=θ˙2.\|\ddot{z}\|^{2}=\ddot{x}^{2}+\ddot{y}^{2}=\dot{\theta}^{2}\,.

Therefore, |θ˙(t)||\dot{\theta}(t)| is nothing but the curvature. In fact, θ˙(t)\dot{\theta}(t) itself, which can be positive or negative, is referred to as the signed curvature. For example, consider a vehicle travelling along a circular path. If θ˙(t)>0\dot{\theta}(t)>0 then the vehicle travels in the counter-clockwise direction, i.e., it turns left, and if θ˙(t)<0\dot{\theta}(t)<0 then the vehicle travels in the clockwise direction, i.e., it turns right. If θ˙(t)=0\dot{\theta}(t)=0 then the vehicle travels along a straight line.

Suppose that the directions at the points z0z_{0} and zfz_{f} are denoted by the angles θ0\theta_{0} and θf\theta_{f}, respectively. The curvature minimizing problem (P1), or equivalently Problem (P), can then be re-written as a (parametric) optimal control problem, where the objective functional is the maximum curvature aa (the parameter), and xx, yy and θ\theta are the state variables and uu is the control variable :

(Pc){mina,u()as.t.x˙(t)=cosθ(t),x(0)=x0,x(tf)=xf,y˙(t)=sinθ(t),y(0)=y0,y(tf)=yf,θ˙(t)=u(t),θ(0)=θ0,θ(tf)=θf,|u(t)|a, for a.e. t[0,tf].\mbox{(Pc)}\left\{\begin{array}[]{rll}\displaystyle\min_{a,u(\cdot)}&\ \displaystyle a&\\[5.69054pt] \mbox{s.t.}&\ \dot{x}(t)=\cos\theta(t)\,,&x(0)=x_{0}\,,\ x(t_{f})=x_{f}\,,\\[5.69054pt] &\ \dot{y}(t)=\sin\theta(t)\,,&y(0)=y_{0}\,,\ y(t_{f})=y_{f}\,,\\[5.69054pt] &\ \dot{\theta}(t)=u(t)\,,&\theta(0)=\theta_{0}\,,\ \theta(t_{f})=\theta_{f}\,,\\[5.69054pt] &&|u(t)|\leq a\,,\mbox{ for a.e. }t\in[0,t_{f}]\,.\end{array}\right.

Recall by Remark 3 that the solution to Problem (P), equivalently (Pc), with a=0a=0 constitutes a special configuration of the given oriented endpoints: the angle θ(t)=θ0=θf=arctan((yfy0)/(xfx0))\theta(t)=\theta_{0}=\theta_{f}=\arctan((y_{f}-y_{0})/(x_{f}-x_{0})) for all t[0,tf]t\in[0,t_{f}], and so the curve z(t)z(t) is a straight line from (x0,y0)(x_{0},y_{0}) to (xf,yf)(x_{f},y_{f}).

Note that with the unit speed constraint z˙(t)=1\|\dot{z}(t)\|=1 in Problem (P) the solution curve z(t)z(t) is parametrized with respect to its length. Therefore the terminal time tft_{f} is nothing but the length of the curve z(t)z(t). When the maximum allowed curvature aa is fixed, the length tft_{f} is allowed to vary, and the objective functional aa in Problem (Pc) is replaced by tft_{f}, Problem (Pc) becomes the celebrated Markov–Dubins problem [5, 16, 17], where one looks for the shortest curve between the oriented points (x0,y0,θ0)(x_{0},y_{0},\theta_{0}) and (xf,yf,θf)(x_{f},y_{f},\theta_{f}), with a prescribed bound aa on its curvature:

(MD){mintf,u()tfs.t.Constraints in Problem (Pc) with a fixed.\mbox{(MD)}\left\{\begin{array}[]{rll}\displaystyle\min_{t_{f},u(\cdot)}&\ \displaystyle t_{f}&\\[5.69054pt] \mbox{s.t.}&\mbox{Constraints in Problem~{}(Pc) with $a$ fixed}.\end{array}\right.

Let a straight line segment be denoted by an SS and a circular arc segment of curvature aa (or, turning radius 1/a1/a) by a CC. Dubins’ result is stated in terms of concatenations of type SS and type CC curve segments in the following theorem.

Theorem 4 (Dubins’ theorem [5])

A solution to Problem (MD) exists, and any such solution, that is to say, any C1C^{1} and piecewise-C2C^{2} curve of minimum length in the plane between two prescribed endpoints, where the slopes of the curve at these endpoints as well as a bound on the pointwise curvature are also prescribed, is of type CSCCSC, or of type CCCCCC, or a subset thereof.

In the following theorem, we establish a relationship between Problems (MD) and (Pc). In fact, the proof of the theorem utilizes Theorem 4 as well as the following simple observation: Problem (MD) minimizes the length tft_{f} while the bound aa on the curvature is fixed, and Problem (Pc) minimizes aa while tft_{f} is kept fixed.

Theorem 5 (Solutions of (MD) and (Pc))

Any solution tft_{f}^{*} to Problem (MD) with a=aa=a^{*} fixed is the solution aa^{*} to Problem (Pc) with tf=tft_{f}=t_{f}^{*} fixed.

Proof. Suppose that, with a=aa=a^{*} fixed, the solution of Problem (MD) yields tft_{f}^{*}. By Theorem 4, either |θ˙(t)|=a|\dot{\theta}^{*}(t)|=a^{*} or θ˙(t)=0\dot{\theta}^{*}(t)=0, a.e. t[0,tf]t\in[0,t_{f}^{*}]; in other words, with the optimal length tft_{f}^{*}, the optimal angular velocity θ˙\dot{\theta}^{*} has to satisfy

|θ˙(t)|(0,a),a.e.t[0,tf].|\dot{\theta}^{*}(t)|\notin(0,a^{*})\,,\ \ \mbox{a.e.}\ \ t\in[0,t_{f}^{*}]. (1)

Now suppose that, with tf=tft_{f}=t_{f}^{*} fixed, solution of Problem (Pc) yields a^\widehat{a}. Obviously, 0a^a0\leq\widehat{a}\leq a^{*}. We can obtain a contradiction for the case when a^<a\widehat{a}<a^{*} as follows.

  1. (i)

    a^0\widehat{a}\neq 0 : Since this solution of Problem (Pc) satisfies the constraints of Problem (MD) with a=aa=a^{*}, and has the optimal length tft_{f}^{*} for Problem (MD), it must also be a solution of (MD). Then it must satisfy (1). However, since it is a solution of Problem (Pc) it also satisfies 0|θ˙(t)|a^0\leq|\dot{\theta}(t)|\leq\widehat{a} for a.e. t[0,tf]t\in[0,t_{f}^{*}], and so we cannot have a^<a\widehat{a}<a^{*} unless a^=0\widehat{a}=0.

  2. (ii)

    a^=0\widehat{a}=0 : In this case, we have the trivial case θ0=θf=arctan((yfy0)/(xfx0))\theta_{0}=\theta_{f}=\arctan((y_{f}-y_{0})/(x_{f}-x_{0})), which yields θ˙(t)=0\dot{\theta}(t)=0, a.e. t[0,tf]t\in[0,t_{f}^{*}], as the solution of both Problems (Pc) and (MD), with a^=a=0\widehat{a}=a^{*}=0.

Therefore, with tf=tft_{f}=t_{f}^{*} fixed, the solution of Problem (Pc) yields a^=a\widehat{a}=a^{*}. So, if the pair (tf,a)(t_{f}^{*},a^{*}) solves Problem (MD), then it also solves Problem (Pc). \Box

Remark 6

The converse of Theorem 5 is not true: Consider the oriented points (x0,y0,θ0)=(0,0,0)(x_{0},y_{0},\theta_{0})=(0,0,0) and (xf,yf,θf)=(1,0,0)(x_{f},y_{f},\theta_{f})=(1,0,0). Fix tf=t^f=2>xfx0=1t_{f}=\widehat{t}_{f}=2>\|x_{f}-x_{0}\|=1. Then clearly the solution a^\widehat{a} to Problem (Pc) is bounded away from 0, as a^=0\widehat{a}=0 is the solution where x0x_{0} and xfx_{f} are joined by a straight line with tf=1<2t_{f}=1<2. On the other hand, with the curvature bound a=a^a=\widehat{a} fixed, Problem (MD) yields the solution tf=1t_{f}^{*}=1, and θ(t)=0\theta^{*}(t)=0 for all t[0,tf]t\in[0,t_{f}^{*}] (the straight line joining x0x_{0} and xfx_{f}). So one gets tf<t^ft_{f}^{*}<\widehat{t}_{f}, furnishing a counterexample for the converse of Theorem 5. \Box

3 Maximum Principle

In this section, we study the optimality conditions for a re-formulation of Problem (Pc) in a classical form. Let us redefine the control variable uu such that α(t)v(t):=θ˙(t)\alpha(t)\,v(t):=\dot{\theta}(t), where α(t):=a\alpha(t):=a and so α˙(t):=0\dot{\alpha}(t):=0, for a.e. t[0,tf]t\in[0,t_{f}]. Problem (Pc), or equivalently Problem (P), can then be re-written as the following optimal control problem:

(OC){minv()0tfα(t)𝑑ts.t.x˙(t)=cosθ(t),x(0)=x0,x(tf)=xf,y˙(t)=sinθ(t),y(0)=y0,y(tf)=yf,θ˙(t)=α(t)v(t),θ(0)=θ0,θ(tf)=θf,|v(t)|1, for a.e. t[0,tf],α˙(t)=0,for a.e. t[0,tf].\mbox{(OC)}\left\{\begin{array}[]{rll}\displaystyle\min_{v(\cdot)}&\ \displaystyle\int_{0}^{t_{f}}\alpha(t)\,dt&\\[11.38109pt] \mbox{s.t.}&\ \dot{x}(t)=\cos\theta(t)\,,&x(0)=x_{0}\,,\ x(t_{f})=x_{f}\,,\\[5.69054pt] &\ \dot{y}(t)=\sin\theta(t)\,,&y(0)=y_{0}\,,\ y(t_{f})=y_{f}\,,\\[5.69054pt] &\ \dot{\theta}(t)=\alpha(t)\,v(t)\,,&\theta(0)=\theta_{0}\,,\ \theta(t_{f})=\theta_{f}\,,\ |v(t)|\leq 1\,,\mbox{ for a.e. }t\in[0,t_{f}]\,,\\[5.69054pt] &\ \dot{\alpha}(t)=0\,,&\mbox{for a.e. }t\in[0,t_{f}]\,.\end{array}\right.

Define the Hamiltonian function for Problem (OC) as

H(x,y,θ,α,λ0,λ1,λ2,λ3,λ4,v):=λ0α+λ1cosθ+λ2sinθ+λ3αv+λ40,H(x,y,\theta,\alpha,\lambda_{0},\lambda_{1},\lambda_{2},\lambda_{3},\lambda_{4},v):=\lambda_{0}\,\alpha+\lambda_{1}\,\cos\theta+\lambda_{2}\,\sin\theta+\lambda_{3}\,\alpha\,v+\lambda_{4}\cdot 0\,, (2)

where λ00\lambda_{0}\geq 0 is a scalar (multiplier) parameter and λi:[0,tf]IR\lambda_{i}:[0,t_{f}]\to{\rm{I\ \kern-5.39993ptR}}, i=1,2,3,4i=1,2,3,4, are the adjoint (or costate) variables. Let

H[t]:=H(x(t),y(t),θ(t),α(t),λ0,λ1(t),λ2(t),λ3(t),λ4(t),v(t)).H[t]:=H(x(t),y(t),\theta(t),\alpha(t),\lambda_{0},\lambda_{1}(t),\lambda_{2}(t),\lambda_{3}(t),\lambda_{4}(t),v(t))\,.

The adjoint variables are required to satisfy

λ˙1(t):=Hx[t]=0,\displaystyle\dot{\lambda}_{1}(t):=-H_{x}[t]=0\,, (3)
λ˙2(t):=Hy[t]=0,\displaystyle\dot{\lambda}_{2}(t):=-H_{y}[t]=0\,, (4)
λ˙3(t):=Hθ[t]=λ1(t)sinθ(t)λ2(t)cosθ(t),\displaystyle\dot{\lambda}_{3}(t):=-H_{\theta}[t]=\lambda_{1}(t)\,\sin\theta(t)-\lambda_{2}(t)\,\cos\theta(t)\,, (5)
λ˙4(t):=Hα[t]=λ0λ3(t)v(t),λ4(0)=0,λ4(tf)=0,\displaystyle\dot{\lambda}_{4}(t):=-H_{\alpha}[t]=-\lambda_{0}-\lambda_{3}(t)\,v(t)\,,\ \ \lambda_{4}(0)=0\,,\ \lambda_{4}(t_{f})=0\,, (6)

where Hx=H/xH_{x}=\partial H/\partial x, etc. By these definitions, the state and adjoint variables verify a Hamiltonian system in that, in addition to (3)–(6), one has x˙(t)=Hλ1[t]\dot{x}(t)=H_{\lambda_{1}}[t], y˙(t)=Hλ2[t]\dot{y}(t)=H_{\lambda_{2}}[t], θ˙(t)=Hλ3[t]\dot{\theta}(t)=H_{\lambda_{3}}[t] and α˙(t)=Hλ4[t]\dot{\alpha}(t)=H_{\lambda_{4}}[t].

Note that (3)–(4) imply that λ1(t)=λ¯1\lambda_{1}(t)=\overline{\lambda}_{1} and λ2(t)=λ¯2\lambda_{2}(t)=\overline{\lambda}_{2} for all t[0,tf]t\in[0,t_{f}], where λ¯1\overline{\lambda}_{1} and λ¯2\overline{\lambda}_{2} are constants. Define new constants

ρ:=λ¯12+λ¯22,tanϕ:=λ¯2λ¯1.\rho:=\sqrt{\overline{\lambda}_{1}^{2}+\overline{\lambda}_{2}^{2}}\,,\qquad\tan\phi:=\frac{\overline{\lambda}_{2}}{\overline{\lambda}_{1}}\,. (7)

Then (2) and (5) can respectively be re-written as

H[t]=aλ0+ρcos(θ(t)ϕ)+aλ3(t)v(t),H[t]=a\,\lambda_{0}+\rho\,\cos(\theta(t)-\phi)+a\,\lambda_{3}(t)\,v(t)\,, (8)

where we have also used α(t)=a\alpha(t)=a, for all t[0,tf]t\in[0,t_{f}], and

λ˙3(t)=ρsin(θ(t)ϕ).\dot{\lambda}_{3}(t)=\rho\,\sin(\theta(t)-\phi)\,. (9)

Since HH does not depend on tt explicitly,

H[t]=h,H[t]=h\,, (10)

where hh is some real constant.

The maximum principle [21, Theorem 1] for Problem (OC) can simply be stated as follows. Suppose that x,y,θ,αW1,(0,tf;IR)x,y,\theta,\alpha\in W^{1,\infty}(0,t_{f};{\rm{I\ \kern-5.39993ptR}}) and vL(0,tf;IR)v\in L^{\infty}(0,t_{f};{\rm{I\ \kern-5.39993ptR}}) solve Problem (OC). Then there exist a number λ00\lambda_{0}\geq 0 and functions λiW1,(0,tf;IR)\lambda_{i}\in W^{1,\infty}(0,t_{f};{\rm{I\ \kern-5.39993ptR}}), i=1,2,3,4i=1,2,3,4, such that λ(t):=(λ0,λ1(t),λ2(t),λ3(t),λ4(t))𝟎\lambda(t):=(\lambda_{0},\lambda_{1}(t),\lambda_{2}(t),\lambda_{3}(t),\lambda_{4}(t))\neq\bf 0, for every t[0,tf]t\in[0,t_{f}], and, in addition to the state differential equations and other constraints given in Problem (OC) and the adjoint differential equations (3)–(4), (6) and (9), the following condition holds:

u(t)argmin|w|1H(x(t),y(t),θ(t),α(t),λ0,λ1(t),λ2(t),λ3(t),λ4(t),w),u(t)\in\operatorname*{argmin}_{|w|\leq 1}H(x(t),y(t),\theta(t),\alpha(t),\lambda_{0},\lambda_{1}(t),\lambda_{2}(t),\lambda_{3}(t),\lambda_{4}(t),w)\,, (11)

Using the definition in (2), (11) can be concisely written as

v(t)argmin|w|1aλ3(t)w,v(t)\in\operatorname*{argmin}_{|w|\leq 1}\ a\,\lambda_{3}(t)\,w\,, (12)

which yields the optimal control as

v(t)={1,ifaλ3(t)<0,1,ifaλ3(t)>0,undetermined,ifaλ3(t)=0.v(t)=\left\{\begin{array}[]{ll}\ \ 1\,,&\mbox{if}\ a\,\lambda_{3}(t)<0\,,\\[8.53581pt] -1\,,&\mbox{if}\ a\,\lambda_{3}(t)>0\,,\\[8.53581pt] \mbox{undetermined}\,,&\mbox{if}\ a\,\lambda_{3}(t)=0\,.\end{array}\right. (13)

Furthermore, (8) and (10) give

aλ3(t)v(t)+ρcos(θ(t)ϕ)+aλ0=:h.a\,\lambda_{3}(t)\,v(t)+\rho\,\cos(\theta(t)-\phi)+a\,\lambda_{0}=:h\,. (14)

The control v(t)v(t) to be chosen for the case when aλ3(t)=0a\,\lambda_{3}(t)=0 for a.e. t[ζ1,ζ2][0,tf]t\in[\zeta_{1},\zeta_{2}]\subset[0,t_{f}] is referred to as singular control, because (12) does not yield any further information. On the other hand, when aλ3(t)0a\,\lambda_{3}(t)\neq 0 for a.e. t[0,tf]t\in[0,t_{f}] (which implies that a>0a>0), i.e., it is possible to have λ3(t)=0\lambda_{3}(t)=0 only for isolated values of tt, the control v(t)v(t) is said to be nonsingular. It should be noted that, if λ3(τ)=0\lambda_{3}(\tau)=0 only at an isolated point τ\tau, the optimal control at this isolated point can be chosen as v(τ)=1v(\tau)=-1 or v(τ)=1v(\tau)=1, conveniently. Therefore, if the control v(t)v(t) is nonsingular, it will take on either the value 1-1 or 11, the bounds on the control variable. In this case, the control v(t)v(t) is referred to as bang–bang. Since the sign of λ3(t)\lambda_{3}(t) determines the value of the optimal control v(t)v(t), λ3\lambda_{3} is referred to as the switching function.

Lemma 7 (Singularity and Straight Line Segments)

Suppose that the optimal control v(t)v(t) for Problem (OC) is singular over an interval [ζ1,ζ2][0,tf][\zeta_{1},\zeta_{2}]\subset[0,t_{f}]. Then θ(t)\theta(t) is constant for all t[ζ1,ζ2]t\in[\zeta_{1},\zeta_{2}]. Moreover, if a>0a>0 then v(t)=0v(t)=0 for all t[ζ1,ζ2]t\in[\zeta_{1},\zeta_{2}].

Proof. Suppose that the optimal control v(t)v(t) is singular, i.e., aλ3(t)=0a\,\lambda_{3}(t)=0, for a.e. t[ζ1,ζ2][0,tf]t\in[\zeta_{1},\zeta_{2}]\subset[0,t_{f}]. If a=0a=0, then θ˙(t)=0\dot{\theta}(t)=0 and so θ(t)\theta(t) is constant. If a>0a>0 then λ˙3(t)=0\dot{\lambda}_{3}(t)=0 for a.e. t[ζ1,ζ2][0,tf]t\in[\zeta_{1},\zeta_{2}]\subset[0,t_{f}]. In other words, from (9), sin(θ(t)ϕ)=0\sin(\theta(t)-\phi)=0, which implies that θ(t)\theta(t) is constant, i.e., θ˙(t)=u(t)=0\dot{\theta}(t)=u(t)=0, for all t[ζ1,ζ2]t\in[\zeta_{1},\zeta_{2}]. \Box

Remark 8

Using Lemma 7, for a>0a>0, we can rewrite the optimal control in (13) as

v(t)=sgn(λ3(t)), a.e. t[0,tf].v(t)=-\operatorname*{sgn}(\lambda_{3}(t))\,,\mbox{ a.e. }t\in[0,t_{f}]\,. (15)

When aλ3(t)=0a\,\lambda_{3}(t)=0 over an interval of time, θ(t)\theta(t) is constant, so the solution curve in that interval is a straight line. When aλ3(t)0a\,\lambda_{3}(t)\neq 0, i.e., λ3(t)0\lambda_{3}(t)\neq 0, over an interval of time the solution curve in that interval is a circular arc segment with radius 1/a1/a. \Box

By Lemma 7, we have established that the only straight-line solution of Problem (OC) is a singular control solution. In the rest of this section we will assume that a>0a>0, as there is nothing more to say for the simple case of a=0a=0, as far as the maximum principle is concerned.

Assumption 9

a>0a>0.

The problems that yield a solution with λ0=0\lambda_{0}=0 are referred to as abnormal, so are their solutions, in the optimal control theory literature, for which the conditions obtained from the maximum principle are independent of the objective functional 0tfα(t)𝑑t\int_{0}^{t_{f}}\alpha(t)\,dt in Problem (OC) and therefore not sufficiently informative. The problems that yield λ0>0\lambda_{0}>0, and their solutions, are referred to as normal. Lemma 10 below states that we can rule out abnormality for Problem (OC).

Lemma 10 (Normality of Solutions)

Problem (OC) is normal, i.e., one can set λ0:=1\lambda_{0}:=1.

Proof. For contradiction purposes, suppose that λ0=0\lambda_{0}=0 and recall the assumption that a>0a>0. Then, from (6) and (15), λ˙4(t)=|λ3(t)|0\dot{\lambda}_{4}(t)=|\lambda_{3}(t)|\geq 0, for every t[0,tf]t\in[0,t_{f}], which, along with the boundary conditions λ4(0)=λ4(tf)=0\lambda_{4}(0)=\lambda_{4}(t_{f})=0, implies that λ4(t)=λ3(t)=0\lambda_{4}(t)=\lambda_{3}(t)=0 for every t[0,tf]t\in[0,t_{f}], giving rise to a totally singular solution. This in turn implies that v(t)=0v(t)=0 and so θ˙(t)=0\dot{\theta}(t)=0, for every t[0,tf]t\in[0,t_{f}], and thus a=0a=0, which is a contradiction. Therefore λ0>0\lambda_{0}>0. Then, without loss of generality, one can set λ0=1\lambda_{0}=1. \Box

Remark 11

We recall that as proved in [10] a solution to a similar optimal control reformulation of the Markov–Dubins problem (MD) can be abnormal, while we prove in Lemma 10 that there are no abnormal solutions to Problem (OC). \Box

Lemma 12 (Singularity and ρ\rho)

Suppose that the optimal control v(t)v(t) for Problem (OC) is singular over an interval [ζ1,ζ2][0,tf][\zeta_{1},\zeta_{2}]\subset[0,t_{f}]. Then ρ=|ah|0\rho=|a-h|\neq 0, with hh as defined in (14).

Proof. Suppose the optimal control u(t)u(t) is singular over [ζ1,ζ2][0,tf][\zeta_{1},\zeta_{2}]\subset[0,t_{f}]. Then λ3(t)=λ˙3(t)=0\lambda_{3}(t)=\dot{\lambda}_{3}(t)=0 for a.e. t[ζ1,ζ2]t\in[\zeta_{1},\zeta_{2}]. From (9), sin(θ(t)ϕ)=0\sin(\theta(t)-\phi)=0, which implies that cos(θ(t)ϕ)=1\cos(\theta(t)-\phi)=1 or 1-1. Then, substituting λ3(t)=0\lambda_{3}(t)=0, cos(θ(t)ϕ)=±1\cos(\theta(t)-\phi)=\pm 1, and (from Lemma 10) λ0=1\lambda_{0}=1, into (14), one gets ρ=(ah)0\rho=\mp(a-h)\geq 0.

Next we show that ρ0\rho\neq 0. For contradiction purposes, suppose that ρ=0\rho=0, i.e., λ1(t)=λ2(t)=0\lambda_{1}(t)=\lambda_{2}(t)=0, for every t[0,tf]t\in[0,t_{f}], and recall the assumption that a>0a>0. Then λ˙3(t)=0\dot{\lambda}_{3}(t)=0 and so λ3(t)=0\lambda_{3}(t)=0, for every t[0,tf]t\in[0,t_{f}]. Now from (6), we get λ˙4(t)=1\dot{\lambda}_{4}(t)=-1, for every t[0,tf]t\in[0,t_{f}], which does not have a solution with λ4(0)=λ4(tf)=0\lambda_{4}(0)=\lambda_{4}(t_{f})=0. \Box

Lemma 13

The adjoint variable λ3\lambda_{3} for Problem (OC) solves the differential equation

λ˙32(t)+(a|λ3(t)|a+h)2=ρ2.\dot{\lambda}_{3}^{2}(t)+\left(a\,|\lambda_{3}(t)|-a+h\right)^{2}=\rho^{2}\,. (16)

Proof. From (9),

λ˙32(t)=ρ2sin2(θ(t)ϕ)=ρ2ρ2cos2(θ(t)ϕ).\dot{\lambda}_{3}^{2}(t)=\rho^{2}\,\sin^{2}(\theta(t)-\phi)=\rho^{2}-\rho^{2}\,\cos^{2}(\theta(t)-\phi)\,. (17)

Using v(t)=sgn(λ3(t))v(t)=-\operatorname*{sgn}(\lambda_{3}(t)) and λ0=1\lambda_{0}=1 in (14), one gets

ρcos(θ(t)ϕ)=a|λ3(t)|a+h.\rho\,\cos(\theta(t)-\phi)=a\,|\lambda_{3}(t)|-a+h\,.

Substituting this into the right-hand side of (17) and rearranging give (16). \Box

In the rest of the paper, we will at times not show dependence of variables on tt for clarity of presentation.

Refer to caption
Figure 1: Phase portrait of (16) for the switching function λ3\lambda_{3}.
Remark 14 (Phase Portrait for Switchings)

Note that the differential equation (16) is given in terms of the phase variables λ3\lambda_{3} and λ˙3\dot{\lambda}_{3}, and can be put into the form

(λ3±|ah|a)2+λ˙32a2=ρ2a2,\left(\lambda_{3}\pm\frac{|a-h|}{a}\right)^{2}+\frac{\dot{\lambda}_{3}^{2}}{a^{2}}=\frac{\rho^{2}}{a^{2}}\,, (18)

where the “++” sign in the first square term stands for the case when λ3<0\lambda_{3}<0 and the “-” sign for λ3>0\lambda_{3}>0. Equation (18) clearly tells us that the trajectories in the phase plane for λ3\lambda_{3} (the λ3\lambda_{3}λ˙3\dot{\lambda}_{3}-plane) will be pieces or concatenations of pieces of concentric ellipses centred at (|ah|/a,0)(-|a-h|/a,0) for v(t)=1v(t)=1 and (|ah|/a,0)(|a-h|/a,0) for v(t)=1v(t)=-1, as shown in Figure 1. Based on the phase plane diagram, also referred to as the phase portrait, of λ3\lambda_{3} depicted in Figure 1, the following observations are made.

  • (i)

    When ρ>|ah|\rho>|a-h| the trajectories are concatenations of (pieces of) ellipses, examples of which are shown by (dark blue) solid curves in Figure 1. The ellipses are concatenated at the switching points (0,ρ2(ah)2)(0,\sqrt{\rho^{2}-(a-h)^{2}}) and (0,ρ2(ah)2)(0,-\sqrt{\rho^{2}-(a-h)^{2}}), where the value of the bang–bang control v(t)v(t) switches from 11 to 1-1 or from 1-1 to 11, respectively. The concatenated ellipses cross the λ3\lambda_{3}-axis at two points, ((ρ+|ah|)/a,0)(-(\rho+|a-h|)/a,0) and ((ρ+|ah|)/a,0)((\rho+|a-h|)/a,0). One can promptly deduce from the diagram that if the bang–bang control has two switchings, the second arc must have a length tft_{f} strictly greater than π/a\pi/a, in other words, the smallest possible curvature aa is π/tf\pi/t_{f}. The diagram, however, does not tell as to how many switchings the optimal control must have.

  • (ii)

    Recall, by Lemma 12, that ρ=|ah|0\rho=|a-h|\neq 0 for singular control. The case when only a part of the trajectory is singular, i.e., partially singular, referred to as a bang–singular trajectory, is represented by the two unique (red) dashed elliptic curves in Figure 1. Note that singular control takes place only at the origin (0,0)(0,0) of the phase plane, with v(t)=0v(t)=0. At any other point, the control trajectory is of bang–bang type. We also note that a trajectory is totally singular if, and only if, a=0a=0, the solution given in Remark 3.

  • (iii)

    For the case when 0ρ<|ah|0\leq\rho<|a-h|, example elliptic trajectories are shown with (black) dotted curves in Figure 1. The trajectories cross the λ3\lambda_{3}-axis at four distinct points ((|ah|+ρ)/a,0)(-(|a-h|+\rho)/a,0) and ((|ah|ρ)/a,0)(-(|a-h|-\rho)/a,0) when λ3<0\lambda_{3}<0, and ((|ah|+ρ)/a,0)((|a-h|+\rho)/a,0) and ((|ah|ρ)/a,0)((|a-h|-\rho)/a,0) when λ3>0\lambda_{3}>0. However, the trajectories no longer intercept the λ˙3\dot{\lambda}_{3}-axis; therefore they represent bang–bang control with no switchings, i.e., either v(t)=1v(t)=1 for all t[0,tf]t\in[0,t_{f}] or v(t)=1v(t)=-1 for all t[0,tf]t\in[0,t_{f}].

An optimal path will in general be a concatenation of straight lines (i.e., singular arcs, where v(t)=0v(t)=0) and circular arcs (i.e., nonsingular arcs, where v(t)=1v(t)=1 or 1-1). \Box

Lemma 15

Suppose that optimal control v(t)v(t) for Problem (OC) is nonsingular over an interval [ζ3,ζ4][0,tf][\zeta_{3},\zeta_{4}]\subset[0,t_{f}]. Then

|λ3(t)|=1a[ρcos(θ(t)ϕ)+ah], for a.e. t[ζ3,ζ4][0,tf].|\lambda_{3}(t)|=\frac{1}{a}\left[\rho\,\cos(\theta(t)-\phi)+a-h\right]\,,\ \ \mbox{ for a.e.\ }t\in[\zeta_{3},\zeta_{4}]\subset[0,t_{f}]\,. (19)

Proof. Substitution of v(t)=sgn(λ3(t))v(t)=-\operatorname*{sgn}(\lambda_{3}(t)) and λ0=1\lambda_{0}=1 into (14) and re-arranging yield the required expression. \Box

Lemma 16 (Nonsingular Curves)

Consider Problem (OC) and the maximum principle for it.

  1. (a)

    If ρ=0\rho=0, then either v(t)=1v(t)=1 or v(t)=1v(t)=-1, for all t[0,tf]t\in[0,t_{f}].

  2. (b)

    If 0<ρ|ah|0<\rho\neq|a-h|, then v(t)v(t) is of bang–bang type.

Proof. (a) Suppose that ρ=0\rho=0. Then, from Lemma 15, |λ3(t)|=(ah)/a|\lambda_{3}(t)|=(a-h)/a, λ3(t)\lambda_{3}(t) is a nonzero constant, i.e., v(t)v(t) is either 11 or 1-1, for all t[0,tf]t\in[0,t_{f}].
(b) Suppose that 0<ρ|ah|0<\rho\neq|a-h|. Then, the contrapositive of Lemma 12 states that the optimal control is not singular, i.e., bang–bang. \Box

Remark 17

The constant optimal control in Lemma 16(a), v(t)=1v(t)=1 or v(t)=1v(t)=-1, and the associated |λ3(t)|=(ah)/a|\lambda_{3}(t)|=(a-h)/a, can be viewed graphically as the limiting case when ρ0\rho\to 0 in Figure 1, i.e., the “size” of the black-dotted ellipse shrinks down to zero. \Box

4 Classification of Optimal Paths

Denote a straight line segment by SS and a circular arc by CC, so that a trajectory satisfying the necessary conditions can be described as being of type C,S,CSC,C,S,CSC,\ldots etc. In [10, Lemma 6] the analog of Figure 1 is used to conclude that an optimal path for (MD) contains a straight line segment SS then it must be of type CSC,CS,SC,CSC,CS,SC, or SS. Part of the argument is that a trajectory containing e.g. SCSSCS must traverse a full circle and is therefore not optimal because the objective in the Markov–Dubins problem is to minimize length.

For Problem (OC) we can also conclude from Figure 1 and (9) that SCSSCS must traverse a full circle, but it is no longer obvious that such a path cannot be optimal. In fact we will show that in some cases it is optimal ( Proposition 23, Proposition 25) while in others it is not (Corollary 20). It will be helpful to introduce the notation OO to represent a full circle once covered, and reserve CC for circular arcs which are not closed.

Lemma 18

If 𝐱(t):=(x(t),y(t),θ(t))\mathbf{x}^{*}(t):=(x^{*}(t),y^{*}(t),\theta^{*}(t)) with α(t)=a\alpha^{*}(t)=a^{*} is an optimal path for Problem (OC) then any sub-path of 𝐱\mathbf{x}^{*} is also optimal for the inherited constraints. That is, for any [t1,t2][0,tf][t_{1},t_{2}]\subset[0,t_{f}] the restriction 𝐱|[t1,t2]\mathbf{x}^{*}|[t_{1},t_{2}] is optimal for problem (OC) with the constraints replaced by x(ti)=x(ti)x(t_{i})=x^{*}(t_{i}), y(ti)=y(ti)y(t_{i})=y^{*}(t_{i}), θ(ti)=θ(ti)\theta(t_{i})=\theta^{*}(t_{i}) for i=1,2i=1,2, with t[t1,t2]t\in[t_{1},t_{2}].

Proof. If the sub-path is a line then it is optimal. Assume it is not a line, and suppose it is not optimal. Then there exists a path satisfying the inherited constraints and having 0<a<a0<a<a^{*}, meaning it contains circular arcs with lower curvature than those in 𝐱\mathbf{x}^{*}. We can insert it between 𝐱([0,t1])\mathbf{x}^{*}([0,t_{1}]) and 𝐱([t2,tf])\mathbf{x}^{*}([t_{2},t_{f}]) to obtain a new path 𝐱~(t)\tilde{\mathbf{x}}(t). Now either 𝐱\mathbf{x}^{*} and 𝐱~\tilde{\mathbf{x}} have the same maximum curvature, or the maximum curvature of 𝐱~\tilde{\mathbf{x}} is smaller. The latter contradicts our assumption that 𝐱\mathbf{x}^{*} is optimal. If they have the same maximum curvature, then 𝐱~\tilde{\mathbf{x}} contains at least one circular arc with curvature aa^{*} and at least one circular arc with curvature aa, and so it does not satisfy the necessary conditions for optimality (specifically (13)). It then follows that aa^{*} cannot be a minimum, again contradicting our assumption on 𝐱\mathbf{x}. \Box

Lemma 19

Any path of type XCYXCY, where X,YX,Y are C,OC,O or SS but not both CC, is not optimal.

Proof. Since at least one of XX and YY is SS or OO, in order for the path to be optimal it must have switching function on the (red) dashed ellipses in Figure 1. But since the middle CC in XCYXCY does not traverse a full loop it is impossible for the switching function to traverse a full ellipse and switch to SS or to CC or OO with the opposite orientation. (Note that we are assuming that, for example, in CCSCCS the two arcs have opposite orientation, otherwise this path would be written as CSCS or OCSOCS). In the case of CCOCCO (similarly OCCOCC) this is perhaps not immediately obvious because the middle CC and the OO may have the same orientation, and then the switching function would not need to return to the origin of the phase portrait in Figure 1. However, we can reflect the loop about the endpoint with no change in maximum curvature, and the resulting path again is not compatible with the necessary conditions on the switching function. \Box

Corollary 20

Any path of type SOSCSOSC or any permutation thereof, or type COCCCOCC or any permutation thereof, cannot be optimal.

Proof. Note that the loop can be shifted anywhere along the path without changing the maximum curvature. For example a path of type SOSCSOSC has the same maximum curvature as the path of type SCOSCO obtained by shifting the loop to the end. The latter is not optimal by Lemma 19, and therefore neither is the former. Similarly, the other permutations of SOSCSOSC and permutations of COCCCOCC are not optimal. \Box

Lemma 21

A CCCCCCCC trajectory is not a solution to (Pc).

Sketch of the proof. Following the observations in Remark 14, a CCCCCCCC trajectory must be of the type pictured in Figure 2, with the second and third circular arcs having the same length, which is greater than π/a\pi/a. The idea behind this proof is very simple: beginning with the CCCCCCCC trajectory, we roll the circle c1c_{1} around the circle c0c_{0} while allowing the third arc to be ‘pulled-tight’ and allowing the fourth arc to relax to the circle centred at c~2\tilde{c}_{2}, to obtain the dashed trajectory in Figure 2. This new trajectory doesn’t satisfy the necessary conditions for optimality because the second circular arc doesn’t traverse a full circle (see Remark 14). However, it has the same maximum curvature as the CCCCCCCC trajectory we started with, and therefore this maximum curvature cannot be minimal.

The reason the proof becomes somewhat technical below is that it does not seem to be easy to solve explicitly for the rolling angle β0\beta_{0} which ensures that the dashed curve satisfies the length constraint. Instead we will prove (taking care with a couple of slightly different cases) that the length depends continuously on β0\beta_{0}, and that there exist values of β0\beta_{0} such that the length is greater than and less than the length of the original CCCCCCCC trajectory.

Proof.

z0,θ0z_{0},\theta_{0}zf,θfz_{f},\theta_{f}α0\alpha_{0}γ\gammaγ\gammaα3\alpha_{3}c1c_{1}c2c_{2}c3c_{3}c0c_{0}c~1\tilde{c}_{1}c~2\tilde{c}_{2}β1\beta_{1}β0\beta_{0}β2\beta_{2}\ellδ\deltaδ\delta
Figure 2: Diagram for the proof of Lemma 21, case 1: trajectory of type CCCCCCCC (solid) which satisfies the necessary conditions from the maximum principle, and another of type CCSCCCSC (dashed) with the same maximum curvature which does not.

Referring to Figure 2, we consider the points cic_{i} to be in the complex plane with c0=0c_{0}=0, and assume that the radius of each of the circles is 1.

Case 1a: α3<π/2\alpha_{3}<\pi/2 and γ<π/2\gamma<\pi/2. We reflect c3c_{3} across θf\theta_{f} and rotate c1c_{1} about c0c_{0} until we obtain the CCSCCCSC trajectory pictured in Figure 2. Supposing there exists β0\beta_{0} such that the pictured trajectory satisfies our length constraint, then it satisfies all the constraints of Problem (Pc). As explained in the sketch above, this second trajectory has the same maximum curvature as the original CCCCCCCC but cannot be optimal, and therefore the CCCCCCCC trajectory is not optimal.

It remains to prove that there exists β0\beta_{0} such that the CCSCCCSC trajectory satisfies the length constraint. Referring to Figure 2 we have

c~1\displaystyle\tilde{c}_{1} =2eiβ0\displaystyle=2e^{i\beta_{0}}
c~2\displaystyle\tilde{c}_{2} =zf+ei(π/2+θf)\displaystyle=z_{f}+e^{i(\pi/2+\theta_{f})}
L\displaystyle L =α0+β0+β1++β2\displaystyle=\alpha_{0}+\beta_{0}+\beta_{1}+\ell+\beta_{2}
β0β1+β2\displaystyle\beta_{0}-\beta_{1}+\beta_{2} =θfπ/2\displaystyle=\theta_{f}-\pi/2

where LL is the total length. If we let

w=c~2c~1=zf+ei(π/2+θf)2eiβ0w=\tilde{c}_{2}-\tilde{c}_{1}=z_{f}+e^{i(\pi/2+\theta_{f})}-2e^{i\beta_{0}}

then

\displaystyle\ell =|w|22\displaystyle=\sqrt{|w|^{2}-2}
δ\displaystyle\delta =cos1(2/|w|)\displaystyle=\cos^{-1}(2/|w|)
β1\displaystyle\beta_{1} =δarg(w)πβ0=cos1(2/|w|)arg(w)πβ0\displaystyle=\delta-\arg(w)-\pi-\beta_{0}=\cos^{-1}(2/|w|)-\arg(w)-\pi-\beta_{0}
β2\displaystyle\beta_{2} =θfπ/2+β1β0=θf3π/2+cos1(2/|w|)arg(w)2β0\displaystyle=\theta_{f}-\pi/2+\beta_{1}-\beta_{0}=\theta_{f}-3\pi/2+\cos^{-1}(2/|w|)-\arg(w)-2\beta_{0}
L\displaystyle L =α0+2β1++θfπ/2=α0+2cos1(2/|w|)2arg(w)2π2β0++θfπ/2\displaystyle=\alpha_{0}+2\beta_{1}+\ell+\theta_{f}-\pi/2=\alpha_{0}+2\cos^{-1}(2/|w|)-2\arg(w)-2\pi-2\beta_{0}+\ell+\theta_{f}-\pi/2 (20)

Since ww is determined by the boundary conditions and β0\beta_{0}, it follows that each of ,δ,β1,β2\ell,\delta,\beta_{1},\beta_{2} are determined by the the boundary conditions and β0\beta_{0} and in fact LL depends continuously on β0\beta_{0}. If we can prove that there are values of β0\beta_{0} for which LL is above and below the length constraint, then by the intermediate value theorem a trajectory satisfying the length constraint exists.

Note that we can make LL as large as we like by increasing β0\beta_{0}, and wrapping the trajectory around the circle centred at c0c_{0} if necessary. To prove that L<tfL<t_{f} can be achieved we consider two cases. First we assume that as in Figure 2 the reflected circle centred at c~2\tilde{c}_{2} does not intersect the circle centred at c1c_{1}, and so we can take β0=0\beta_{0}=0 and observe that the new CCSCCCSC trajectory, where the SS is a common tangent to c1c_{1} and c~2\tilde{c}_{2}, is shorter. In the case where the reflected circle does intersect the circle centred at c1c_{1}, we consider the CCCCCC trajectory pictured in Figure 3. Here =0\ell=0 and β0\beta_{0} is at its minimum.

θ0α3\theta_{0}-\alpha_{3}c3c_{3}c0c_{0}β0+θ0\beta_{0}+\theta_{0}ϕ\phic~2\tilde{c}_{2}ψ\psippqqγ¯\overline{\gamma}α3\alpha_{3}c3c_{3}c0c_{0}θ0\theta_{0}β0\beta_{0}ϕ\phic~2\tilde{c}_{2}
Figure 3: Left: trajectory of type CCCCCCCC (solid) which satisfies the necessary conditions for optimality, and another of type CCCCCC (dashed) which we prove is shorter, and can therefore be extended to a trajectory of type CCSCCCSC (as in Figure 2) with the correct length. Right: enlargement of the internal triangles from the CCCCCC trajectory.

The length of the original trajectory is L1=α0+2π+2γ+α3L_{1}=\alpha_{0}+2\pi+2\gamma+\alpha_{3}, and from (20) the length of the CCCCCC trajectory is L2=α0+2β1+θfπ/2L_{2}=\alpha_{0}+2\beta_{1}+\theta_{f}-\pi/2. Hence, using α3=π2θf\alpha_{3}=\frac{\pi}{2}-\theta_{f}

L1L2=2π+2γ+2α32β1L_{1}-L_{2}=2\pi+2\gamma+2\alpha_{3}-2\beta_{1}

and our aim is to prove that this quantity is positive. With the angles labelled as in Figure 3, noting that γ=πγ¯\gamma=\pi-\bar{\gamma} and β1=2πϕ\beta_{1}=2\pi-\phi, we need to prove that γ¯<ϕ+α3\bar{\gamma}<\phi+\alpha_{3}. For this, defining s:=|c3c0|s:=|c_{3}-c_{0}| note that

|c~2c0|=88cosϕ=4+s22scos(θ0α3)|\tilde{c}_{2}-c_{0}|=8-8\cos\phi=4+s^{2}-2s\cos(\theta_{0}-\alpha_{3})

and that this is still true if α3>θ0\alpha_{3}>\theta_{0}. Supposing we allow α3\alpha_{3} to vary, then we determine that

dϕdβ0=ssin(θ0α3)4sinϕdα3dβ0\frac{d\phi}{d\beta_{0}}=-\frac{s\sin(\theta_{0}-\alpha_{3})}{4\sin\phi}\frac{d\alpha_{3}}{d\beta_{0}}

Note that as β0\beta_{0} increases so too does α3\alpha_{3}, and so dϕdβ0\frac{d\phi}{d\beta_{0}} is negative for α3<θ0\alpha_{3}<\theta_{0} and positive for α3>θ0\alpha_{3}>\theta_{0}. It follows that

ddβ0(ϕ+α3)=dϕdβ0(1ssin(θ0α3)4sinϕ).\frac{d}{d\beta_{0}}(\phi+\alpha_{3})=\frac{d\phi}{d\beta_{0}}\left(1-\frac{s\sin(\theta_{0}-\alpha_{3})}{4\sin\phi}\right).

Since ϕ+α3=γ¯\phi+\alpha_{3}=\bar{\gamma} at β0=0\beta_{0}=0, if we can show that the above quantity is always positive then we have the required result: γ¯<ϕ+α3\bar{\gamma}<\phi+\alpha_{3}. When α3>θ0\alpha_{3}>\theta_{0} this is immediate: all the quantities on the right hand side are positive. For α3<θ0\alpha_{3}<\theta_{0} we refer to the right hand side of Figure 3 and observe that

sin(θ0α3)q=sinψ2=sinϕp\frac{\sin(\theta_{0}-\alpha_{3})}{q}=\frac{\sin\psi}{2}=\frac{\sin\phi}{p}

and therefore sin(θ0α3)sin(ϕ)=qp\frac{\sin(\theta_{0}-\alpha_{3})}{\sin(\phi)}=\frac{q}{p}. Now ddβ0(ϕ+α3)\frac{d}{d\beta_{0}}(\phi+\alpha_{3}) will be positive if 4p>qs4p>qs. Comparing with Figure 3, we know q<1q<1 and since γ¯>π/2\bar{\gamma}>\pi/2 (by assumption) we have that p>s/4p>s/4, so we are done with this case.

Case 1b: α3π/2\alpha_{3}\geq\pi/2 and γ<π/2\gamma<\pi/2. In this case we cannot reflect c3c_{3} about θf\theta_{f}. We set c~2\tilde{c}_{2} to c3+2ic_{3}+2i instead (i.e. vertically above c3c_{3}), and then construct a trajectory of type CCSCCCCSCC. This is not very different to the previous case.

x0,θ0x_{0},\theta_{0}xf,θfx_{f},\theta_{f}α0\alpha_{0}γ\gammaγ\gammaα3\alpha_{3}c1c_{1}c2c_{2}c3c_{3}c0c_{0}c~1\tilde{c}_{1}β1\beta_{1}β0\beta_{0}β2\beta_{2}\ell
Figure 4: Diagram for Case 2 (γπ2\gamma\geq\frac{\pi}{2}) in the proof of Lemma 21: a trajectory of type CCCCCCCC (solid) which satisfies the necessary conditions from the maximum principle, and another of type CCSCCCSC (dashed) with the same maximum curvature which does not.

Case 2: γπ/2\gamma\geq\pi/2. Referring to Figure 4, let w~:=c3c~1\tilde{w}:=c_{3}-\tilde{c}_{1} and then we have

c~1\displaystyle\tilde{c}_{1} =2eiβ0\displaystyle=2e^{i\beta_{0}}
c3\displaystyle c_{3} =zf+ei(θfπ/2)\displaystyle=z_{f}+e^{i(\theta_{f}-\pi/2)}
l\displaystyle l =|w~|\displaystyle=|\tilde{w}|
L2\displaystyle L_{2} =α0+β0+β1++β2+α3\displaystyle=\alpha_{0}+\beta_{0}+\beta_{1}+\ell+\beta_{2}+\alpha_{3}
=α0+2β0+2π+|w~|+α3\displaystyle=\alpha_{0}+2\beta_{0}+2\pi+|\tilde{w}|+\alpha_{3}

Arguing as in case 1, we have that L2L_{2} depends continuously on β0\beta_{0} and it is enough to show that L2<L1L_{2}<L_{1} when β0=0\beta_{0}=0, where L1=α0+2π+2γ+α3L_{1}=\alpha_{0}+2\pi+2\gamma+\alpha_{3} is the length of the original trajectory. Indeed, at β0\beta_{0} we have L1L2=2γ|w~|L_{1}-L_{2}=2\gamma-|\tilde{w}|. Noting that |w~|2=8+8cosγ|\tilde{w}|^{2}=8+8\cos\gamma and recalling that we have assumed γ>π/2\gamma>\pi/2 we have that |w~|28<(2γ)2|\tilde{w}|^{2}\leq 8<(2\gamma)^{2} and therefore L1>L2L_{1}>L_{2} at β0=0\beta_{0}=0. \Box

Theorem 22

Any solution to problem (P) is of type CXCCXC or sub-path thereof, where XX can be SS, CC, or OO; or of type SOSSOS or sub-path thereof. The radii of all the CC and OO arcs in a solution are the same.

Proof. First note that any path containing more than one loop cannot be optimal. The loops can be brought together without changing the maximum curvature, but a sub-path of type OOOO is not optimal - it can be expanded into a single loop with lower curvature. Consider all paths of type SXYZSXYZ, where X,Y,ZX,Y,Z can each be C,O,SC,O,S or void,

  • X=CX=C: SCSC can be optimal, SCYSCY cannot, by Lemma 19.

  • X=OX=O: SOSO can be optimal and so can SOSSOS, but SOOSOO cannot by the argument above, and SOCSOC is excluded by Lemma 20. Similarly, we rule out SOSOSOSO and SOSCSOSC.

  • A single straight line segment SS can be optimal (similarly CC and OO).

By Lemma 18 we have exhausted the possibilities for optimal paths beginning with SS. Next consider paths of type CXYZCXYZ:

  • X=CX=C: CCCC and CCCCCC can be optimal, CCOCCO and CCSCCS are not optimal by Lemma 19. CCCCCCCC, CCCOCCCO and CCCSCCCS are excluded by Lemmas 21, Corollary 20 and Lemma 19 respectively.

  • X=OX=O: COCO and COCCOC can be optimal, COSCOS is ruled out by Corollary 20 along with COCCCOCC and COCSCOCS.

  • X=SX=S: CSCS and CSCCSC are possible, CSOCSO and CSCOCSCO are not optimal by Corollary 20, and Lemma 19 rules out CSCSCSCS and CSCCCSCC.

Finally, for paths of type OXYZOXYZ: OCOC can be optimal but OCCOCC and OCSOCS are not by Lemma 19 and Corollary 20 respectively, OSOS can be optimal but OSCOSC is not by Lemma 20.

\Box

The boundary conditions that allow for paths of types SOSSOS and COCCOC are quite specific, and so we are able to prove a little more about the conditions under which these paths may be optimal.

Proposition 23

Consider an SOSSOS path as per Figure 5, and let d=z(0)z(tf)d=\|z(0)-z(t_{f})\| be the distance between the initial and final points. If dtf>b\frac{d}{t_{f}}>b where bb is the solution of sinc1(b)(1b)=π/2\operatorname{sinc}^{-1}(b)(1-b)=\pi/2, then the path is not optimal.

Proof.

z0z_{0}zfz_{f}RRα\alphaα\alpha2α2\alpharr
Figure 5: Diagram for the proof of Lemma 23: trajectory of type SCSSCS, and a trajectory of type CCCCCC with lower maximum curvature.

We show that with the assumption on d/tfd/t_{f}, the proposed trajectory of type CCCCCC in Figure 5 has r>Rr>R. Note that d=4rsin(α)d=4r\sin(\alpha) (still true if α\alpha is obtuse) and tf=4rαt_{f}=4r\alpha gives d/tf=sinc(α)d/t_{f}=\operatorname{sinc}(\alpha). d/tfd/t_{f} is at most 11 and so sinc1(d/tf)\operatorname{sinc}^{-1}(d/t_{f}) has a solution between 0 and π\pi and α=sinc1(d/tf)\alpha=\operatorname{sinc}^{-1}(d/t_{f}). Moreover from 2πR+d=tf2\pi R+d=t_{f} we then have

R=tfd2π=4r(αsinα)2πR=\frac{t_{f}-d}{2\pi}=\frac{4r(\alpha-\sin\alpha)}{2\pi}

and therefore r>Rr>R while 2(αsinα)/π<12(\alpha-\sin\alpha)/\pi<1. Rewriting this inequality in terms of d/tfd/t_{f} gives

sinc1(dtf)(1dtf)<π2\operatorname{sinc}^{-1}\left(\frac{d}{t_{f}}\right)\left(1-\frac{d}{t_{f}}\right)<\frac{\pi}{2}

and on the domain [0,1)[0,1) the left hand side of the inequality is decreasing.

\Box

Remark 24

Note that the full circular arc, or the loop, in Figure 5 can be placed anywhere on the straight line SS without changing the maximum curvature or the boundary conditions and so the above result also applies to paths of type OSOS and SOSO. \Box

Note that a path of type COCCOC cannot be optimal if the CC’s have different orientations. If they do then the loop can be shifted to the terminal point to obtain a path of type CCOCCO with the same maximum curvature, but such a path is not optimal by Lemma 19.

Proposition 25

Consider a path of type COCCOC and of length tf=β/a+2πt_{f}=\beta/a+2\,\pi as in Figure 6 (solid). Such a path can be optimal only if βπ2\beta\leq\frac{\pi}{2}.

Proof. For comparison we consider the CCCCCC path also pictured in Figure 6 (dashed). Assuming that the circular arcs all have radius 1, the length of the COCCOC path is L1=β+2πL_{1}=\beta+2\pi, and the length of the CCCCCC is L2=2α+2πβ=4π3βL_{2}=2\alpha+2\pi-\beta=4\pi-3\beta. Therefore if β>π2\beta>\frac{\pi}{2} then L2<L1L_{2}<L_{1} and then, adjusting the angles as necessary, the radii of each of the circular arcs in the CCCCCC curve can be increased (reducing the maximum curvature) until the curve has the correct length β+2π\beta+2\pi, and so the original COCCOC is not a minimiser. \Box

z0z_{0}zfz_{f}α\alphaβ\beta
Figure 6: Diagram for Proposition 25: A path of type COCCOC (solid) and shorter curve of type CCCCCC (dashed) with the same maximum curvature.
Remark 26

Note that the full circular arc, or the loop, in Figure 6 can be placed anywhere on the circular arc CC between z0z_{0} and zfz_{f} without changing the maximum curvature or the boundary conditions and so the above result also applies to paths of type OCOC and COCO. \Box

The numerical experiments in Section 5.1, specifically Examples 1 and 2, suggest that the converses to Propositions 23 and 25 are true. It may be possible to verify this by direct comparison with each of the potential minimisers allowed by Theorem 22 which satisfy the constraints (at least in the SOSSOS case there do not seem to be too many of these), but we will not attempt to do so here.

4.1 Proof of Proposition 2

Proof. The proof is sketched in [7] but for the convenience of the reader we fill out the details. For any p[1,)p\in[1,\infty), consider the pp-elastic energy 0tfz¨(t)p𝑑t\int_{0}^{t_{f}}\|\ddot{z}(t)\|^{p}dt for zz in the Sobolev space W2,p((0,tf),2)W^{2,p}((0,t_{f}),\mathbb{R}^{2}) and satisfying the constraints of Problem (P). The existence of minimisers of the pp-elastic energy subject to Dirichlet boundary conditions is proved using the direct method in [19, Proposition 4.1], and the same proof works for the boundary conditions we have here. We may therefore consider a sequence (zp)(z_{p}) where each zpz_{p} is a minimiser of the pp-elastic energy subject to the constraints of problem (P), and let zz_{*} be any element of W2,((0,tf),2)W^{2,\infty}((0,t_{f}),\mathbb{R}^{2}) satisfying the constraints. Then for any pqp\geq q, using the minimality of zqz_{q} and zpz_{p} and the embedding LpLqL^{p}\subset L^{q} we have

z¨qLqz¨pLqcz¨pLpcz¨Lptf1qz¨L{\|\ddot{z}_{q}\|}_{L^{q}}\leq{\|\ddot{z}_{p}\|}_{L^{q}}\leq c{\|\ddot{z}_{p}\|}_{L^{p}}\leq c{\|\ddot{z}_{*}\|}_{L^{p}}\leq{t_{f}}^{\tfrac{1}{q}}{\|\ddot{z}_{*}\|}_{L^{\infty}} (21)

where c=tf1q1pc={t_{f}}^{\tfrac{1}{q}-\tfrac{1}{p}}. Note also that for any zW2,q((0,tf),2)z\in W^{2,q}((0,t_{f}),\mathbb{R}^{2}) satisfying the constraints of (P), since z˙=1\|\dot{z}\|=1 we have z˙Lq=tf1q{\|\dot{z}\|}_{L^{q}}={t_{f}}^{\frac{1}{q}}, and if q2q\geq 2 then, using the fundamental theorem of calculus, Cauchy-Schwarz’ inequality and Young’s inequality with ε\varepsilon:

z(t)2=z(0)2+0tddτz(τ)2𝑑τ=z(0)2+0tz(τ),z˙(τ)𝑑τz(0)2+12εzL22+tε2.\displaystyle\|z(t)\|^{2}=\|z(0)\|^{2}+\int_{0}^{t}\frac{d}{d\tau}\|z(\tau)\|^{2}d\tau=\|z(0)\|^{2}+\int_{0}^{t}\langle z(\tau),\dot{z}(\tau)\rangle d\tau\leq\|z(0)\|^{2}+\frac{1}{2\varepsilon}\|z\|^{2}_{L^{2}}+\frac{t\varepsilon}{2}.

Rearranging and integrating over (0,tf)(0,t_{f}) gives

z2L2tf2εz2L2\displaystyle\|z\|^{2}_{L^{2}}-\frac{t_{f}}{2\varepsilon}\|z\|^{2}_{L^{2}} tfz(0)2+tf2ε,\displaystyle\leq t_{f}\|z(0)\|^{2}+t_{f}^{2}\varepsilon,

and then setting ε=tf\varepsilon=t_{f} gives

zLqcz2L22c(tfz(0)2+tf3).\|z\|_{L^{q}}\leq c\|z\|^{2}_{L^{2}}\leq 2c(t_{f}\|z(0)\|^{2}+t_{f}^{3}).

for q2q\geq 2. Combining these lower order estimates with (21), we have that for each 2q<2\leq q<\infty the subsequence (zp)pq(z_{p})_{p\geq q} is bounded in W2,q((0,tf),2)W^{2,q}((0,t_{f}),\mathbb{R}^{2}), and therefore has a subsequence which converges weakly in W2,q((0,tf),2)W^{2,q}((0,t_{f}),\mathbb{R}^{2}) (by the local weak compactness of reflexive Banach spaces, [23, p. 126]) and strongly in C1([0,tf],2)C^{1}([0,t_{f}],\mathbb{R}^{2}) (by the Rellich-Kondrachov theorem, [1, Theorem 6.3]). By induction we can construct successive subsequences (zp2(i))(zp3(i)),(zpk(i)),(z_{p_{2}(i)})(z_{p_{3}(i)}),\ldots(z_{p_{k}(i)}),\ldots, such that (zpk+1(i))(z_{p_{k+1}(i)}) is a subsequence of (zpk(i))(z_{p_{k}(i)}) which converges weakly in W2,k+1W^{2,k+1}. Then the diagonal sequence (zp~(i))(z_{\tilde{p}(i)}) where p~(i)=pi(i)\tilde{p}(i)=p_{i}(i) converges weakly in W2,kW^{2,k} for every k2k\geq 2. Indeed, since weak convergence in W2,kW^{2,k} implies weak convergence in W2,qW^{2,q} for all qkq\leq k by the embedding W2,qW2,kW^{2,q}\subset W^{2,k}, we have that zp~(i)z_{\tilde{p}(i)} converges weakly in W2,qW^{2,q} (and strongly in C1C^{1}) for every q2q\geq 2 to zq2W2,q((0,tf),2)z_{\infty}\in\bigcap_{q\geq 2}W^{2,q}((0,t_{f}),\mathbb{R}^{2}). By the strong C1C^{1} convergence, zz_{\infty} satisfies the constraints. Since z¨Lq\|\ddot{z}_{\infty}\|_{L^{q}} is uniformly bounded for all qq, we have zW2,((0,tf),2)z_{\infty}\in W^{2,\infty}((0,t_{f}),\mathbb{R}^{2}) and limpz¨Lp=z¨L\lim_{p\to\infty}{\|\ddot{z}_{\infty}\|}_{L^{p}}={\|\ddot{z}_{\infty}\|}_{L^{\infty}} by [1, Theorem 2.14], and also limpz¨pLp=z¨L\lim_{p\to\infty}{\|\ddot{z}_{p}\|}_{L^{p}}={\|\ddot{z}_{\infty}\|}_{L^{\infty}} . Finally, zz_{\infty} is a minimiser of maximum curvature, because if not then there exists zz and ε>0\varepsilon>0 such that z¨L=z¨Lε{\|\ddot{z}\|}_{L^{\infty}}={\|\ddot{z}_{\infty}\|}_{L^{\infty}}-\varepsilon, and then choosing pp sufficiently large:

tf1pz¨pLp>z¨Lε2>z¨Ltf1pz¨Lp{t_{f}}^{-\frac{1}{p}}{\|\ddot{z}_{p}\|}_{L^{p}}>{\|\ddot{z}_{\infty}\|}_{L^{\infty}}-\tfrac{\varepsilon}{2}>{\|\ddot{z}\|}_{L^{\infty}}\geq{t_{f}}^{-\frac{1}{p}}{\|\ddot{z}\|}_{L^{p}}

which contradicts the assumption that zpz_{p} is a minimiser of the pp-elastic energy. \Box

5 A Numerical Method

In this section, we provide a numerical method utilizing arc, or switching time, parametrization techniques earlier used for optimal control problems exhibiting discontinuous controls in [13, 14, 12, 18]. Theorem 22 asserts that a curve of minimax curvature can be of type (described by the strings) CCCCCC, or COCCOC, or CSCCSC, or SOSSOS, or a substring thereof, and nothing else. Then the problem of finding such a curve can be reduced to finding a correct concatenation/configuration of circular arcs (CC), straight lines (SS) and a loop (OO) as listed, as well as finding the length of each arc involved. Recall that a loop is nothing but a circular arc with length 2π/a2\,\pi/a. Therefore, for computational purposes, we will treat a loop as any other circular arc CC with an unknown length. We also note that if there are more than one circular arc in a string then they must have the same curvature aa, i.e., the same radius.

A circular arc CC can either be a left-turn or a right-turn arc, which we denote by LL and RR, respectively. Then a curve of type CCCCCC can be represented by the string LRLRLRLR. Clearly, at least one of the arcs in LRLRLRLR must be of zero length in order to represent CCCCCC or a substring thereof, and this should be determined by the numerical method to be proposed. A curve of type CSCCSC can similarly be represented by LRSLRLRSLR, again with at least one of the arcs of type LL or RR being of zero length. Note that if the length of the straight line (S) is zero, then LRSLRLRSLR effectively reduces to LRLRLRLR representing CCCCCC. Therefore, the string LRSLRLRSLR is general enough to represent both of the types CCCCCC and CSCCSC. The type SCSSCS (or equivalently SOSSOS), on the other hand, can be represented by the string SLRSSLRS. Recall that, by Remark 24, any solution of type SOSSOS can be obtained from a solution of type SOSO or OSOS, simply by placing the loop anywhere on the straight line segment between the endpoints. Therefore either the string SLRSLR or the string LRSLRS will suffice to represent SCSSCS. As a result, one would cover all the solution types in Theorem 22 by using the string LRSLRLRSLR.

Let Lξ1L_{\xi_{1}} denote a left-turn arc with length ξ1\xi_{1}, Rξ2R_{\xi_{2}} a right-turn arc with length ξ2\xi_{2} and Sξ3S_{\xi_{3}} a straight line with length ξ3\xi_{3}. Similarly, let Lξ4L_{\xi_{4}} denote a left-turn arc with length ξ4\xi_{4} and Rξ5R_{\xi_{5}} a right-turn arc with length ξ5\xi_{5}. Then the types of solution curves described in Theorem 22 can all be represented by the string

Lξ1Rξ2Sξ3Lξ4Rξ5.L_{\xi_{1}}R_{\xi_{2}}S_{\xi_{3}}L_{\xi_{4}}R_{\xi_{5}}\,.

For example, the type RLRRLR is given with ξ1=ξ3=0\xi_{1}=\xi_{3}=0 and ξ2,ξ4,ξ5>0\xi_{2},\xi_{4},\xi_{5}>0. On the other hand, the type LRLR is obtained either with ξ3=ξ4=ξ5=0\xi_{3}=\xi_{4}=\xi_{5}=0 and ξ1,ξ2>0\xi_{1},\xi_{2}>0 or with ξ1=ξ2=ξ3=0\xi_{1}=\xi_{2}=\xi_{3}=0 and ξ4,ξ5>0\xi_{4},\xi_{5}>0.

Let the initial time t0:=0t_{0}:=0 and the terminal time t5:=tft_{5}:=t_{f}. Also define the switching times tjt_{j}, j=1,4j=1,\ldots 4, such that

ξj:=tjtj1,for j=1,,5.\xi_{j}:=t_{j}-t_{j-1}\,,\quad\mbox{for }j=1,\ldots,5\,. (22)

The lengths ξj\xi_{j} may also be referred to as arc durations. Note that along an arc of type LL or RR the optimal control is bang–bang; in particular, v(t)1v(t)\equiv 1 along an arc of type LL and v(t)1v(t)\equiv-1 along an arc of type RR. Along an arc of type SS, on the other hand, optimal control is singular; in particular, v(t)0v(t)\equiv 0, by Lemma 7. Since v(t)v(t) is constant (11, 1-1 or 0) for tj1t<tjt_{j-1}\leq t<t_{j} (along the jjth arc), the ODEs in Problem (OC) with α(t)=a\alpha(t)=a, or equivalently Problem (Pc) with u(t)=av(t)u(t)=a\,v(t), can be solved as follows.

θ(t)=θ(tj1)+av(t)(ttj1), if j=1,,5,\displaystyle\theta(t)=\theta(t_{j-1})+a\,v(t)\,(t-t_{j-1})\,,\quad\mbox{ if }j=1,\ldots,5\,, (23)
x(t)={x(tj1)+(sinθ(t)sinθ(tj1))/(av(t)), if j=1,2,4,5,x(tj1)+cosθ(t)(ttj1), if j=3,\displaystyle x(t)=\left\{\begin{array}[]{ll}x(t_{j-1})+\big{(}\sin\theta(t)-\sin\theta(t_{j-1})\big{)}/\big{(}a\,v(t)\big{)}\,,&\mbox{ if }j=1,2,4,5\,,\\[2.84526pt] x(t_{j-1})+\cos\theta(t)\,(t-t_{j-1})\,,&\mbox{ if }j=3\,,\end{array}\right. (26)
y(t)={y(tj1)+(cosθ(t)cosθ(tj1))/(av(t)), if j=1,2,4,5,y(tj1)+sinθ(t)(ttj1), if j=3,\displaystyle y(t)=\left\{\begin{array}[]{ll}y(t_{j-1})+\big{(}\cos\theta(t)-\cos\theta(t_{j-1})\big{)}/\big{(}a\,v(t)\big{)}\,,&\mbox{ if }j=1,2,4,5\,,\\[2.84526pt] y(t_{j-1})+\sin\theta(t)\,(t-t_{j-1})\,,&\mbox{ if }j=3\,,\end{array}\right. (29)

where

v(t)={1, if j=1,4,1, if j=2,5,0, if j=3,v(t)=\left\{\begin{array}[]{rl}1\,,&\mbox{ if }j=1,4\,,\\[2.84526pt] -1\,,&\mbox{ if }j=2,5\,,\\[2.84526pt] 0\,,&\mbox{ if }j=3\,,\end{array}\right. (30)

for tj1t<tjt_{j-1}\leq t<t_{j}. We have used the solution for θ(t)\theta(t) in (23) in obtaining the solutions for x(t)x(t) and y(t)y(t) in (26)–(29). Note that the control variable v(t)v(t) is a piecewise constant function, which takes here the sequence of values {1,1,0,1,1}\{1,-1,0,1,-1\}. It should be clear from the context that the control variable u(t)u(t) defined in Problem (Pc) takes the sequence of values {a,a,0,a,a}\{a,-a,0,a,-a\}. After evaluating the state variables in (23)–(29) at the switching times and carrying out algebraic manipulations, one can equivalently rewrite Problem (Pc), or Problem (OC), as follows.

(Ps){minas.t.x0xf+1a(sinθ0+2sinθ12sinθ2+2sinθ4sinθf)+ξ3cosθ2=0,y0yf+1a(cosθ02cosθ1+2cosθ22cosθ4+cosθf)+ξ3sinθ2=0,sinθf=sinθ5,cosθf=cosθ5,tf=j=15ξj,ξj0, for j=1,,5,\mbox{(Ps)}\left\{\begin{array}[]{rl}\min&\ a\\[11.38109pt] \mbox{s.t.}&\displaystyle\ x_{0}-x_{f}+\frac{1}{a}\big{(}-\!\sin\theta_{0}+2\,\sin\theta_{1}-2\,\sin\theta_{2}+2\,\sin\theta_{4}-\sin\theta_{f}\big{)}+\xi_{3}\,\cos\theta_{2}=0\,,\\[8.53581pt] &\displaystyle\ y_{0}-y_{f}+\frac{1}{a}\big{(}\cos\theta_{0}-2\,\cos\theta_{1}+2\,\cos\theta_{2}-2\,\cos\theta_{4}+\cos\theta_{f}\big{)}+\xi_{3}\,\sin\theta_{2}=0\,,\\[8.53581pt] &\displaystyle\ \sin\theta_{f}=\sin\theta_{5}\,,\ \ \cos\theta_{f}=\cos\theta_{5}\,,\\[5.69054pt] &\displaystyle t_{f}=\sum_{j=1}^{5}\,\xi_{j}\,,\qquad\xi_{j}\geq 0\,,\ \mbox{ for }j=1,\ldots,5\,,\end{array}\right.

where

θ1=θ0+aξ1,θ2=θ1aξ2,θ4=θ2+aξ4,θ5=θ4aξ5.\theta_{1}=\theta_{0}+a\,\xi_{1}\,,\qquad\theta_{2}=\theta_{1}-a\,\xi_{2}\,,\qquad\theta_{4}=\theta_{2}+a\,\xi_{4}\,,\qquad\theta_{5}=\theta_{4}-a\,\xi_{5}\,. (31)

Substitution of θ1\theta_{1}, θ2\theta_{2}, θ4\theta_{4} and θ5\theta_{5} in (31) into Problem (Ps) yields a finite-dimensional nonlinear optimization problem in just six variables, ξj\xi_{j}, j=1,,5j=1,\ldots,5, and aa.

Remark 27

With the constraints sinθf=sinθ5\sin\theta_{f}=\sin\theta_{5} and cosθf=cosθ5\cos\theta_{f}=\cos\theta_{5}, we make sure that we satisfy the slope condition at the terminal point. Otherwise, the constraint θf=θ5\theta_{f}=\theta_{5} is stronger, and imposing it might result in missing some of the feasible solutions. \Box

Remark 28

Problem (Ps) can be solved by standard optimization methods and software, for example, Algencan [2, 3], which implements augmented Lagrangian techniques, or Ipopt [22], which implements an interior point method, or SNOPT [8], which implements a sequential quadratic programming algorithm, or Knitro [4], which implements various interior point and active set algorithms to choose from. For general nonconvex optimization problems like Problem (Ps), what one can hope for, by using these software, is to get (at best) a locally optimal solution. \Box

5.1 Numerical experiments

For computations numerically solving Problem (Ps), or equivalently Problem (P), we have employed the AMPL–Knitro computational suite: AMPL is an optimization modelling language [6] and Knitro is a popular optimization software [4] (Version 3.0.1 is used here). In all runs, we have used the Knitro options ‘alg=0 feastol=1e-12 opttol=1e-12’ in AMPL; in other words, we have set both the feasibility and optimality tolerances at 101210^{-12}, and the particular algorithm to be employed was left to be chosen by Knitro itself.

In what follows, we present example instances of problems for which we solve Problem (Ps) by using the AMPL–Knitro suite and display the solution curves. The best numerical solution we have identified for an instance is depicted by a solid curve in the graphs. The other solutions reported by the suite as “locally optimal” are depicted by dotted curves, and referred to here as “critical”, as they satisfy the necessary conditions of optimality furnished by the maximum principle.

5.1.1 Example 1

Recall from Corollary 23 that bb is the solution of the equation sinc1(b)(1b)=π/2\operatorname{sinc}^{-1}(b)(1-b)=\pi/2. After rearranging this equation one gets f(b):=bsinc(π/(2(1b)))=0f(b):=b-\operatorname{sinc}\big{(}\pi/(2\,(1-b))\big{)}=0. A numerical solution to f(b)=0f(b)=0 can be obtained as b0.319966693534110b\approx 0.319966693534110 (correct to 15 dp) in seven iterations by using the secant method starting with initial points 0.10.1 and 0.20.2. The uniqueness of this solution can be established by observing that the graph of ff crosses the bb-axis only once (not shown here).

Figure 7 shows curves of minimax curvature between the oriented points (x0,y0,θ0)=(0,0,0)(x_{0},y_{0},\theta_{0})=(0,0,0) and (xf,yf,θf)=(0,1,0)(x_{f},y_{f},\theta_{f})=(0,1,0), which is a similar situation to that in the diagram in Figure 5. The orientation of the endpoints of the curves in Figure 7 are depicted by (black) arrows. We note that, for the two given endpoints, d=1d=1.

Refer to caption

(a) tf=1.5t_{f}=1.5: Solid curve RLR (red) with a3.989a\approx 3.989; dotted curve LS (blue) with a=4πa=4\,\pi.

Refer to caption

(b) tf=3t_{f}=3: Solid curve RLR (red) with a3.038a\approx 3.038; dotted curve LS (blue) with a=πa=\pi.

Refer to caption

(c) tf=5t_{f}=5: Solid curve LS (blue) with a=π/2a=\pi/2; dotted curve RLR (red) with a2.077a\approx 2.077.

Refer to caption

(d) tf=7t_{f}=7: Solid curve LS (blue) with a=π/3a=\pi/3; dotted curve RLR (red) with a1.565a\approx 1.565.

Figure 7: Example 1—Problem instances for Proposition 23 and its converse: (a)–(b) for d/tf>bd/t_{f}>b (tf3.12532t_{f}\leq 3.12532) and (c)–(d) for d/tfbd/t_{f}\leq b (tf3.12533t_{f}\geq 3.12533)—Curves of minimax curvature between the oriented points (0,0,0)(0,0,0) and (0,1,0)(0,1,0), for various tft_{f}. The solid curves are optimal (identified by numerical experiments) while the dotted ones are only critical.

Proposition 23 asserts that the SOSSOS type of trajectory is not optimal (nor OSOS or SOSO, by Remark 24) for tf<1/bt_{f}<1/b, i.e., tf3.12532t_{f}\leq 3.12532 (correct to 5 dp). This is exemplified in Figures 7(a) and 7(b) respectively for tf=1.5t_{f}=1.5 and tf=3t_{f}=3, where it is numerically verified that the CCC concatenation of curves is optimal: For tf=1.5t_{f}=1.5, the solution curve has a=3.9887508486a=3.9887508486 (correct to 10 dp) and is of type Rξ2Lξ4Rξ5R_{\xi_{2}}L_{\xi_{4}}R_{\xi_{5}} (with ξ1=ξ3=0\xi_{1}=\xi_{3}=0), where the arc lengths ξ2=0.375\xi_{2}=0.375, ξ4=0.75\xi_{4}=0.75, and ξ5=0.375\xi_{5}=0.375. For tf=3t_{f}=3, a=3.0384835468a=3.0384835468 (correct to 10 dp) and the solution is of the same type, with ξ2=0.75\xi_{2}=0.75, ξ4=1.5\xi_{4}=1.5, and ξ5=0.75\xi_{5}=0.75. We point to the symmetry in the arc lengths in both cases, presumably because of the colinear configuration of the endpoints.

The converse of Proposition 23 for this example instance (which is not proved) is that the SOSSOS (or OSOS or SOSO, by Remark 24) concatenation is optimal for tf1/bt_{f}\geq 1/b, i.e., tf3.12533t_{f}\geq 3.12533 (correct to 5 dp). This converse is exemplified in Figures 7(c) and 7(d) respectively for tf=5t_{f}=5 and tf=7t_{f}=7. For tf=5t_{f}=5, the solution curve has a=π/2a=\pi/2 and is of type Lξ1Sξ3L_{\xi_{1}}S_{\xi_{3}} (with ξ2=ξ4=ξ5=0\xi_{2}=\xi_{4}=\xi_{5}=0), where ξ1=4\xi_{1}=4, and ξ3=1\xi_{3}=1. For tf=5t_{f}=5, a=π/3a=\pi/3 and the solution is of the same type, with ξ1=6\xi_{1}=6, and ξ3=1\xi_{3}=1.

Each of the solution curves presented in Figure 7 can be “flipped over” or “reflected about” the xx-axis to get “symmetric” solution curves: For example when an LSLS curve is reflected one gets an RSRS curve which has the same length, oriented endpoints and maximum curvature. An RLRRLR curve can also be reflected in the same way to get an LRLLRL curve which has the same length, oriented endpoints and maximum curvature. These reflected curves have also been obtained as computational solutions to Problem (Ps).

5.1.2 Example 2

Suppose that the endpoints are placed on a unit circle with the end velocities chosen tangent to the circle. Recall from Proposition 25 that a curve of minimax curvature between these endpoints can be of type COCCOC (or equivalently type OCOC or type COCO, as the loop OO can be shifted anywhere on a circular arc CC) only in the case when the endpoints are separated by at most π/2\pi/2 radians, i.e., they are separated by an angle βπ/2\beta\leq\pi/2 (see Figure 6), and tft_{f} is chosen as β+2π\beta+2\,\pi. Here we numerically illustrate Proposition 25 and explore instances with various other values of the separation angle β\beta and with tf=β+2πt_{f}=\beta+2\,\pi—see Figure 8(a)–(d).

We place the pair of oriented endpoints on a unit circle, and impose tf=β+2πt_{f}=\beta+2\,\pi (so that a loop can be observed at least in a critical curve), in each part of Figure 8, and obtain optimal solutions as follows.

  • (a)

    β=π/3<π/2\beta=\pi/3<\pi/2, i.e., tf=7π/3t_{f}=7\,\pi/3: (1/2,3/2,π/6)(-1/2,-\sqrt{3}/2,-\pi/6) and (1/2,3/2,π/6)(1/2,-\sqrt{3}/2,\pi/6). The optimal curve is of type Lξ4Rξ5L_{\xi_{4}}R_{\xi_{5}} (with ξ1=ξ2=ξ3=0\xi_{1}=\xi_{2}=\xi_{3}=0), where a=1a=1, ξ4=π/3\xi_{4}=\pi/3, and ξ5=2π\xi_{5}=2\,\pi.

  • (b)

    β=2π/3\beta=2\,\pi/3, i.e., tf=8π/3t_{f}=8\,\pi/3: (3/2,1/2,π/3)(-\sqrt{3}/2,-1/2,-\pi/3) and (3/2,1/2,π/3)(\sqrt{3}/2,-1/2,\pi/3). The optimal curve is of type Rξ2Lξ4Rξ5R_{\xi_{2}}L_{\xi_{4}}R_{\xi_{5}} (with ξ1=ξ3=0\xi_{1}=\xi_{3}=0), where a=0.8174619979a=0.8174619979, ξ2=1.4538775285\xi_{2}=1.4538775285, ξ4=5.4698253525\xi_{4}=5.4698253525, and ξ5=1.4538775285\xi_{5}=1.4538775285, all correct to 10 dp.

  • (c)

    β=4π/3\beta=4\,\pi/3, i.e., tf=10π/3t_{f}=10\,\pi/3: (3/2,1/2,2π/3)(-\sqrt{3}/2,1/2,-2\,\pi/3) and (3/2,1/2,2π/3)(\sqrt{3}/2,1/2,2\,\pi/3). The optimal curve is of type Rξ2Lξ4Rξ5R_{\xi_{2}}L_{\xi_{4}}R_{\xi_{5}}, where a=0.5245840019a=0.5245840019, ξ2=0.6217500975\xi_{2}=0.6217500975, ξ4=9.2284753170\xi_{4}=9.2284753170, and ξ5=0.6217500975\xi_{5}=0.6217500975, all correct to 10 dp.

  • (d)

    β=5π/3\beta=5\,\pi/3, i.e., tf=11π/3t_{f}=11\,\pi/3: (1/2,3/2,5π/6)(-1/2,\sqrt{3}/2,-5\,\pi/6) and (1/2,3/2,5π/6)(1/2,\sqrt{3}/2,5\,\pi/6). The optimal curve is of type Rξ2Lξ4Rξ5R_{\xi_{2}}L_{\xi_{4}}R_{\xi_{5}}, where a=0.5026360614a=0.5026360614, ξ2=0.2755293870\xi_{2}=0.2755293870, ξ4=10.9681142892\xi_{4}=10.9681142892, and ξ5=0.2755293870\xi_{5}=0.2755293870, all correct to 10 dp.

Refer to caption

(a) β=π/3\beta=\pi/3: Solid curve LR (blue) with a=1a=1; dotted curves, RLR (red) with a1.246a\approx 1.246, LRL (purple) with a1.754a\approx 1.754, RSR (green) with a1.978a\approx 1.978.

Refer to caption

(b) β=2π/3\beta=2\,\pi/3: Solid curve RLR (red) with a0.817a\approx 0.817; dotted curves, LR (blue) with a=1a=1, LRL (purple) with a1.620a\approx 1.620, RSR (green) with a1.836a\approx 1.836.

Refer to caption

(c) β=4π/3\beta=4\,\pi/3: Solid curve RLR (red) with a0.525a\approx 0.525; dotted curves, LR (blue) with a=1a=1, RSR (green) with a1.157a\approx 1.157, LRL (purple) with a1.514a\approx 1.514.

Refer to caption

(d) β=5π/3\beta=5\,\pi/3: Solid curve RLR (red) with a0.503a\approx 0.503; dotted curves, RSR (green) with a0.792a\approx 0.792, RLR (brown) with a0.798a\approx 0.798, LR (blue) with a=1a=1.

Figure 8: Example 2—Problem instances for Corollary 25: Curves of minimax curvature (and of particular length) between oriented points placed on a unit circle. The solid curves are optimal (identified by numerical experiments) while the dotted ones are only critical.

In each of the problem instances, one curve of COCO type, two curves of CCCCCC type, and one curve of CSCCSC type have been found as solutions of Problem (Ps) by the AMPL–Knitro suite. Of these four distinct solutions in each case, the curve with the smallest aa was identified as the optimal curve for that case.

Similarly to Example 5.1.1, the loop in each of the LRLR curves presented in Figure 8 can be “flipped over” or “reflected about” the circle to get “symmetric” curves of type LL with ξ1=β+2π\xi_{1}=\beta+2\,\pi, with the same endpoints and maximum curvatures. These reflected curves have also been encountered as computational solutions to Problem (Ps), but are not being shown in Figure 8.

We note that if tfβ+π/2t_{f}\neq\beta+\pi/2 then one does not encounter a solution curve (optimal or only critical) which contains a loop.

5.1.3 Example 3

Consider the problem of finding a curve of minimax curvature between the oriented points (0,0,π/3)(0,0,-\pi/3) and (0.4,0.4,π/6)(0.4,0.4,-\pi/6)—note that the same oriented endpoints were used in [10] for finding the shortest curve of bounded curvature, i.e., the Markov–Dubins path. For various lengths, namely for tf=0.8,1.3,2t_{f}=0.8,1.3,2 and 2.52.5, here we find curves minimizing the bound aa on the curvature. Figure 9(a)–(d) depict these curves found as a solution of Problem (Ps) by employing the AMPL–Knitro software suite.

In Figure 9(a)–(d), the optimal curves are shown as solid curves. These curves have been identified amongst all the critical solutions to Problem (Ps) (of the allowed types) that the AMPL–Knitro suite could find, as the ones with the smallest curvature, or indeed with the biggest turning radius. All other critical curves are shown as dotted curves in Figure 9(a)–(d). A summary of all these solutions in each part of Figure 9 can be given as follows, with arc lengths and curvatures correct up to 10 dp.

Refer to caption

(a) tf=0.8t_{f}=0.8: Solid curve LSR with a8.366a\approx 8.366.

Refer to caption

(b) tf=1.3t_{f}=1.3: Solid curve RLR with a6.048a\approx 6.048.

Refer to caption

(c) tf=2t_{f}=2: Solid curve RSR with a4.055a\approx 4.055.

Refer to caption

(d) tf=2.5t_{f}=2.5: Solid curve RSR with a3.017a\approx 3.017.

Figure 9: Example 3—Curves of minimax curvature between the oriented points (0,0,π/3)(0,0,-\pi/3) and (0.4,0.4,π/6)(0.4,0.4,-\pi/6). The solid (blue) curves are optimal (identified by numerical experiments) while the dotted ones (in various other colours) are only critical.
  • (a)

    tf=0.8t_{f}=0.8: The optimal curve has a=8.3661513485a=8.3661513485 and is of type Lξ1Sξ3Rξ5L_{\xi_{1}}S_{\xi_{3}}R_{\xi_{5}} (with ξ2=ξ4=0\xi_{2}=\xi_{4}=0), where ξ1=0.3141136578\xi_{1}=0.3141136578, ξ3=0.2343580660\xi_{3}=0.2343580660, ξ5=0.2515282761\xi_{5}=0.2515282761. Only one other critical curve is found which has a=24.6216106492a=24.6216106492 and is of type RSRRSR.

  • (b)

    tf=1.3t_{f}=1.3: The optimal curve has a=6.0477371511a=6.0477371511 and is of type Rξ2Lξ4Rξ5R_{\xi_{2}}L_{\xi_{4}}R_{\xi_{5}} withξ2=0.0684530840\xi_{2}=0.0684530840, ξ4=0.6932888171\xi_{4}=0.6932888171, ξ5=0.5382580988\xi_{5}=0.5382580988. Four other critical curves are found which respectively have a=7.8842574114a=7.8842574114, a=15.7179879207a=15.7179879207, a=9.3041564227a=9.3041564227, a=6.4570352671a=6.4570352671, and are of types RSRRSR, RSLRSL, LSLLSL, LRLLRL.

  • (c)

    tf=2t_{f}=2: The optimal curve has a=4.0557744873a=4.0557744873 and is of type Rξ2Sξ3Rξ5R_{\xi_{2}}S_{\xi_{3}}R_{\xi_{5}} withξ2=1.1520596173\xi_{2}=1.1520596173, ξ3=0.5799046398\xi_{3}=0.5799046398, ξ5=0.2680357429\xi_{5}=0.2680357429. Five other critical curves are found which respectively have a=8.1329787696a=8.1329787696, a=4.1795061273a=4.1795061273, a=4.7799043550a=4.7799043550, a=4.9575871447a=4.9575871447, a=5.0128303633a=5.0128303633 and are of types RSLRSL, RLRRLR, LSLLSL, LRLLRL, LRLLRL.

  • (d)

    tf=2.5t_{f}=2.5: The optimal curve has a=3.0172721767a=3.0172721767 and is of type Rξ2Sξ3Rξ5R_{\xi_{2}}S_{\xi_{3}}R_{\xi_{5}} withξ2=1.5726285431\xi_{2}=1.5726285431, ξ3=0.5911279480\xi_{3}=0.5911279480, ξ5=0.3362435090\xi_{5}=0.3362435090. Six other critical curves are found which respectively have a=6.0662510507a=6.0662510507, a=3.0516505811a=3.0516505811, a=3.8632611845a=3.8632611845, a=3.5528730863a=3.5528730863, a=4.1925834507a=4.1925834507, a=3.6102865904a=3.6102865904 and are of types RSLRSL, RLRRLR, RLRRLR, LSLLSL, LRLLRL, LRLLRL.

6 Conclusion

We have presented a classification of the solutions to the problem of finding planar curves of minimax curvature between two endpoints with the endpoint tangents specified (Problem (P)). We devised a numerical procedure for finding the critical points of Problem (P), or the \infty-elastica, of the classification types we have presented. Of these critical solutions the one with the smallest curvature is the (global) solution to Problem (P), which we refer to as a curve of minimax curvature.

A natural extension of this paper would be the study of interpolating curves which not only satisfy the endpoint constraints but also must pass through a number of specified intermediate points.

It would also be interesting to study another extension of Problem (P), from curves in IR2{\rm{I\ \kern-5.39993ptR}}^{2} to curves in IRn{\rm{I\ \kern-5.39993ptR}}^{n}. In IR3{\rm{I\ \kern-5.39993ptR}}^{3} alone, it is highly likely that one would encounter curve segments of types different from just circular and straight line segments.

A further extension from the Euclidean space to curved spaces would be of great interest where the curves of minimax curvature would be in a Riemannian manifold.

References

  • [1] R. A. Adams, J. J. F. Fournier, Sobolev Spaces, Second Edition, Elsevier, 2003.
  • [2] R. Andreani, E. G. Birgin, J. M. Martínez, and M. L. Schuverdt, On augmented Lagrangian methods with general lower-level constraints, SIAM J. Optim., 18(4) (2008), pp. 1286–1309.
  • [3] E. G. Birgin and J. M. Martínez, Practical Augmented Lagrangian Methods for Constrained Optimization, SIAM Publications, 2014.
  • [4] R. H. Byrd, J. Nocedal, R. A. Waltz, KNITRO: An integrated package for nonlinear optimization. In: G. di Pillo and M. Roma, eds., Large-Scale Nonlinear Optimization, Springer, New York, 35–59, 2006. https://doi.org/10.1007/0-387-30065-1_4
  • [5] L. E. Dubins, On curves of minimal length with a constraint on average curvature and with prescribed initial and terminal positions and tangents, Amer. J. Math., 79 (1957), pp. 497–516.
  • [6] R. Fourer, D. M. Gay, and B. W. Kernighan, AMPL: A Modeling Language for Mathematical Programming, Second Edition, Brooks/Cole Publishing Company / Cengage Learning, 2003.
  • [7] E. Gallagher and R. Moser, The \infty-elastica problem on a Riemannian manifold, J. Geom. Anal., 33(7) (2023), 226.
  • [8] P. E. Gill, W. Murray, and M. A. Saunders, SNOPT: an SQP algorithm for large-scale constrained optimization, SIAM Rev., 47(1) (2005), 99–131.
  • [9] R. Huang, A note on the pp-elastica in a constant sectional curvature manifold, J. Geom. Phys., 49(3–4) (2004), 343–-349.
  • [10] C. Y. Kaya, Markov–Dubins path via optimal control theory, Comput. Optim. Appl., 68(3) (2017), 719–747.
  • [11] C. Y. Kaya, Markov–Dubins interpolating curves, Comput. Optim. Appl., 73(2) (2019), 647–677.
  • [12] C. Y. Kaya, S. K. Lucas, and S. T. Simakov, Computations for bang–bang constrained optimal control using a mathematical programming formulation, Opt. Contr. Appl. Meth., 25 (2004), 295–308.
  • [13] C. Y. Kaya and J. L. Noakes, Computations and time‐optimal controls, Opt. Contr. Appl. Meth., 17 (1996), 171–185.
  • [14] C. Y. Kaya and J. L. Noakes, Computational algorithm for time-optimal switching control, J. Optim. Theory App., 117 (2003), 69–92.
  • [15] C. Y. Kaya and J. L. Noakes, Finding interpolating curves minimizing LL^{\infty} acceleration in the Euclidean space via optimal control theory, SIAM J. Control Optim., 51(1) (2013), 442–464.
  • [16] M. G. Kreĭn and A. A. Nudel’man, The Markov Moment Problem and Extremal Problems, Translations of Mathematical Monographs, Amer. Math. Soc., 1977.
  • [17] A. A. Markov, Some examples of the solution of a special kind of problem on greatest and least quantities, Soobscenija Charkovskogo Matematiceskogo Obscestva, 2-1(5,6), 250–276, 1889 (in Russian).
  • [18] H. Maurer, C. Büskens, J.-H. R. Kim, and C. Y. Kaya, Optimization methods for the verification of second order sufficient conditions for bang–bang controls, Opt. Cont. Appl. Meth., 26 (2005), 129–156.
  • [19] T. Miura and K. Yoshizawa, Pinned planar p-elasticae, arXiv preprint arXiv:2209.05721 (2022).
  • [20] R. Moser, Structure and classification results for the \infty-elastica problem, Amer. J. Math., 144(5) (2022), 1299–1329.
  • [21] L. S. Pontryagin, V. G. Boltyanskii, R. V. Gamkrelidze, and E. F. Mishchenko, The Mathematical Theory of Optimal Processes (Russian), English translation by K. N. Trirogoff, ed. by L. W. Neustadt, Interscience Publishers, New York, 1962.
  • [22] A. Wächter and L. T. Biegler, On the implementation of a primal-dual interior point filter line search algorithm for large-scale nonlinear programming, Math. Program., 106 (2006), 25–57.
  • [23] K. Yosida, Functional Analysis, Fifth Edition, Springer-Verlag, 1978.