This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

An Optimal Control Theory for the Traveling Salesman Problem and Its Variants

I. M. Ross111Distinguished Professor & Program Director, Control and Optimization R. J. Proulx222Research Professor, Control and Optimization Laboratories M. Karpenko333Research Associate Professor & Associate Director, Control and Optimization Laboratories Naval Postgraduate School, Monterey, CA 93943
Abstract

We show that the traveling salesman problem (TSP) and its many variants may be modeled as functional optimization problems over a graph. In this formulation, all vertices and arcs of the graph are functionals; i.e., a mapping from a space of measurable functions to the field of real numbers. Many variants of the TSP, such as those with neighborhoods, with forbidden neighborhoods, with time-windows and with profits, can all be framed under this construct. In sharp contrast to their discrete-optimization counterparts, the modeling constructs presented in this paper represent a fundamentally new domain of analysis and computation for TSPs and their variants. Beyond its apparent mathematical unification of a class of problems in graph theory, the main advantage of the new approach is that it facilitates the modeling of certain application-specific problems in their home space of measurable functions. Consequently, certain elements of economic system theory such as dynamical models and continuous-time cost/profit functionals can be directly incorporated in the new optimization problem formulation. Furthermore, subtour elimination constraints, prevalent in discrete optimization formulations, are naturally enforced through continuity requirements. The price for the new modeling framework is nonsmooth functionals. Although a number of theoretical issues remain open in the proposed mathematical framework, we demonstrate the computational viability of the new modeling constructs over a sample set of problems to illustrate the rapid production of end-to-end TSP solutions to extensively-constrained practical problems.

keywords:
traveling salesman problems, functionals, optimal control theory, neighborhoods, forbidden neighborhoods, profits, time window, routing
journal: Journal of  Templates

1 Introduction

Recently, we showed that a continuous-variable, nonlinear static optimization problem can be framed as a dynamic optimization problem [21]. In this theory, a generic point-to-set algorithmic map is defined in terms of a controllable continuous-time444Time is just a proxy for an independent variable. trajectory, where, the decision variable is a continuous-time search vector. Starting with this simple idea, many well-known algorithms, such as the gradient method and Newton’s method, can be derived as optimal controllers over certain metric spaces [21]. If the control is set to the acceleration of a double-integrator model, then a similar theory[20] generates accelerated optimization techniques such as Polyak’s heavy ball method[17] and Nesterov’s accelerated gradient method[16]. In this paper, we further the theory of framing optimization problems in terms of an optimal control problem. More specifically, we use a menu of modifications to the traveling salesman problem (TSP) and its variants [4, 12] to address a class of combinatorial optimization problems. To motivate the new mathematical paradigm, consider the following questions:

Question 1.

How do we define the distance between two sets if the vertices in a TSP graph are cities equipped with the power of the continuum? Assume the cities to be disjoint sets; see Fig. 1.

Refer to caption

Figure 1: A candidate distance function d(A,B)d(A,B) between two disjoint sets, AA and BB.

A version of this question was posed in [2] more than two decades ago. It continues to be an active research topic; see for example, [8] and [30]. To appreciate this question, consider a distance function defined by,

d(A,B):=missingmin𝒙A,𝒚Bd(𝒙,𝒚)=missingmin𝒙A,𝒚B𝒙𝒚2d(A,B):=\displaystyle\mathop{\text{missing}}{min}_{\boldsymbol{x}\in A,{\boldsymbol{y}}\in B}d(\boldsymbol{x},{\boldsymbol{y}})=\displaystyle\mathop{\text{missing}}{min}_{\boldsymbol{x}\in A,{\boldsymbol{y}}\in B}\left\|\boldsymbol{x}-{\boldsymbol{y}}\right\|_{2} (1)

Besides the fact that d(A,B)d(A,B) is not a metric, if (1) is used to construct the edge weights in the TSP graph, the resulting solution may comprise disconnected segments because the entry and exit points for a city may not necessarily be connected. Adding a “continuity segment” a posteriori does not generate an optimal solution as indicated by the three-city tour illustrated in Fig. 2.

Refer to caption

Figure 2: Illustrating how the shortest distance between two cities does not produce the shortest tour.

This solution is clearly not optimal because the triangle tour shown in Fig. 2 is shorter. In other words, an optimal tour is obtained by not using the shortest distance between two cities.

Remark 1.

It is critically important to note that Question 1 is not centered on solving the problem of the type illustrated in Fig. 2; rather, this question and others to follow, are focused more fundamentally on simply framing the problem mathematically.

Remark 2.

It is apparent by a cursory examination of Fig. 2 that the values of the arc weights are not independent of the path. That is, the objective function in the TSP must somehow account for the functional dependence of the sequence of cities in the computation of the distance between any two cites.

Question 2.

How do we define distances between two cities if the entry and exit points are constrained by some angle requirement?

This question was first addressed in [1] for the case when the cities are points; the new question posed here pertains to the the additional issues when the cities are not points as in Fig. 2.

Question 3.

How do we define distances between two cities in the presence of a no-drive, no-fly zone? See Fig. 3.

Refer to caption

Figure 3: Illustrating some problems in defining distance between cities in the presence of obstacles.

This question was addressed in [6] for the case of point-cities. It is apparent that any difficulty encountered in answering Questions 2 and 3 is further amplified by the issues resulting from the discussions related to Question 1; see also Remarks 1 and 2.

Question 4.

How do we define distances between neighborhoods that are in deterministic motion?

This question is related to the one posed in [14] with respect to the dynamic TSP. It remains a problem of ongoing interest; see, for example [3].

Question 5.

Assuming we can answer the preceding questions, is distance a correct measure for a minimum-time TSP (with neighborhoods)? If not, what is the proper mathematical problem formulation for a minimum-time TSP?

While limited versions of the aforementioned questions have been addressed in the literature as noted in the preceding paragraphs, the totality of Questions 1–5 appear in many practical and emerging mathematical problems that lie at the intersection of physics, operations research and engineering science. For example, the problem of touring the 79 moons of Jupiter [31] by a remote sensing spacecraft has all the elements of Questions 1–5. Relative orbits around the moons are the “cities” and the spacecraft is the “traveling salesman;” see Fig. 4.

Refer to caption

Figure 4: A rendition of a Jovian grand tour mission with only 7 of the 79 moons illustrated for clarity. Figure is not to scale.

The moons are in various non-circular orbits. The measure of “distance” (i.e., weights) is the amount of propellant it takes to transfer the spacecraft between two (moving) relative orbits. Not all visits to the moons are valued equally by the science team. The objective of a grand tour mission is to maximize the science return by orbiting around as many moons as possible under various constraints arising from the physics of gravitational motion, electromagnetic instrumentation, thermodynamics, electrical power, dollar cost and lifetime of the spacecraft. It is clear that modeling this optimization problem using the available constructs of a TSP [4] is neither apparent nor easy. In fact, the computation of the weights associated with the arcs of the graph (that represent the transfer trajectories) involve solving a constrained, nonlinear optimal control problem with variable endpoints [22, 28]. Consequently, even generating the data555To produce the TSP data, it is necessary to solve 𝒪(N2)\mathcal{O}(N^{2}) optimal control problems, where NN is the number of moons. to define this problem as a standard TSP is a nontrivial task.

The main contribution of this paper is a new mathematical problem formulation for addressing a class of information-rich, operations-research-type mathematical problems such as the modified TSPs discussed in the preceding paragraphs.

2 A New Mathematical Paradigm

Throughout this paper we use the word functional in the sense of mathematical analysis: a mapping from a space of measurable functions to the field of real numbers.

Definition 1 (\mathcal{F}-graph).

An \mathcal{F}-graph is a finite collection of functionals that constitute the arcs (edges) and vertices of a graph.

Let Vi:dom(Vi),i+V^{i}:\text{dom}(V^{i})\to\mathbb{R},i\in\mathbb{N}_{+} and Ek:dom(Ek),k+E^{k}:\text{dom}(E^{k})\to\mathbb{R},k\in\mathbb{N}_{+} be a finite collection of functionals, where dom()\text{dom}(\cdot) is the domain of ()(\cdot). Let \mathcal{F} be an \mathcal{F}-graph whose vertices and arcs/edges are given by ViV^{i} and EkE^{k} respectively. From standard graph theory, a walk in \mathcal{F} may be defined in terms of an alternating sequence of VV- and EE-functionals. In order to perform evaluations in \mathcal{F}, it is necessary to define some new constructs.

Definition 2 (\mathcal{F}-control).

An \mathcal{F}-control is a sequence of functions

ψ0,ψ1,,ψnn\langle\psi_{0},\psi_{1},\ldots,\psi_{n}\rangle\quad n\in\mathbb{N}

where each ψj,j=0,,n\psi_{j},\ j=0,\ldots,n is selected from the domain of ViV^{i} or the domain of EkE^{k}.

Remark 3.

An \mathcal{F}-control involves two simultaneous actions: selecting functions from the domain of the functionals that comprise \mathcal{F}, and ordering the selections in some sequence. Thus, different selections ordered the same way is a different \mathcal{F}-control. Conversely, the same selection ordered differently is a also a different \mathcal{F}-control (provided the reordering is consistent with the selection process).

Definition 3 (Control Walk).

A control walk (in \mathcal{F}) is an \mathcal{F}-control with the following property:

ψjdom(Ek)ψj1dom(Vl)and ψj+1dom(Vm)\psi_{j}\in\text{dom}(E^{k})\Rightarrow\psi_{j-1}\in\text{dom}(V^{l})\ \text{and }\psi_{j+1}\in\text{dom}(V^{m})

and EkE^{k} is the arc/edge that joins VlV^{l} to VmV^{m}.

The preceding definition of a control walk requires a sequence of at least three functions. In order to complete this definition and accommodate certain special situations we define a trivial control walk as follows:

Definition 4 (Trivial Control Walk).

A control walk ωc\omega_{c} is said to be trivial if:

  1. 1.

    ωc=ψ0\omega_{c}=\langle\psi_{0}\rangle and ψ0dom(Vi)\psi_{0}\in\text{dom}(V^{i}) for some ii\in\mathbb{N}; or,

  2. 2.

    ωc=ψ0,ψ1\omega_{c}=\langle\psi_{0},\psi_{1}\rangle and

    (ψ0dom(Vi),ψ1dom(Ek))(ψ0dom(Ek),ψ1dom(Vi))\big{(}\psi_{0}\in\text{dom}(V^{i}),\ \psi_{1}\in\text{dom}(E^{k})\big{)}\vee\big{(}\psi_{0}\in\text{dom}(E^{k}),\ \psi_{1}\in\text{dom}(V^{i})\big{)}

    for some i,ki,k in +\mathbb{N}_{+} and EkE^{k} joins ViV^{i} with itself or with some other vertex; or

  3. 3.

    ωc=ψ0,ψ1,ψ2\omega_{c}=\langle\psi_{0},\psi_{1},\psi_{2}\rangle and

    ψ0dom(Ek),ψ1dom(Vi),ψ2dom(Em)\psi_{0}\in\text{dom}(E^{k}),\psi_{1}\in\text{dom}(V^{i}),\psi_{2}\in\text{dom}(E^{m})

    for some i,k,mi,k,m in +\mathbb{N}_{+}, where EkE^{k} and EmE^{m} join ViV^{i} with itself or with some other vertex.

Remark 4.

Let GG be a graph that is isomorphic to an \mathcal{F}-graph. Furthermore, let GG be such that its vertices and arcs are a selection of functions from the domains of the functionals the constitute the \mathcal{F}-graph. Then, by construction, there is an uncountably infinite set of isomorphic graphs GG. A control walk may be interpreted as a walk in one of these isomorphic graphs.

It follows from Remarks 3 and 4 that a control walk involves the simultaneous action of choosing an isomorphism and a walk in the chosen isomorphic graph, GG.

Definition 5 (Objective Functional).

Let ωc\omega_{c} be a control walk defined over an \mathcal{F}-graph. An objective functional is a functional, ωc\omega_{c}\mapsto\mathbb{R}, defined over all possible control walks.

Definition 6 (Optimal Control Walk).

An optimal control walk is a control walk that optimizes an objective functional.

Remark 5.

From Remark 3, it follows that an optimal control walk jointly optimizes the selection of functions from the domains of the functionals that constitute an \mathcal{F}-graph as well as the walk itself. This feature of the optimal control walk directly addresses the comments in Remark 2 in the context of Fig. 2.

For the remainder of this paper, we will limit the discussions to a special type of an \mathcal{F}-graph defined over some measure space 𝕃\mathbb{L}.

Definition 7 (Label Space).

A measure space 𝕃\mathbb{L} is called a label space if 𝕃i,i=1,,|Nv|+\mathbb{L}^{i},i=1,\ldots,|N_{v}|\in\mathbb{N}_{+} are a given finite collection of disjoint measurable subsets of 𝕃\mathbb{L}.

Suppose we are given a label space. Let,

𝕃a\displaystyle\mathbb{L}^{a} :=𝕃i=1|Nv|𝕃i\displaystyle:=\mathbb{L}\setminus\displaystyle\bigcup_{i=1}^{|N_{v}|}\mathbb{L}^{i} (2a)
dom(Vi)\displaystyle\text{dom}(V^{i}) :={𝕃i:𝕃i is measurable}\displaystyle:=\left\{\mathbb{R}\to\mathbb{L}^{i}:\ \mathbb{R}\to\mathbb{L}^{i}\text{ is measurable}\right\} (2b)
dom(Ea)\displaystyle\text{dom}(E^{a}) :={𝕃a:𝕃a is measurable}\displaystyle:=\left\{\mathbb{R}\to\mathbb{L}^{a}:\ \mathbb{R}\to\mathbb{L}^{a}\text{ is measurable}\right\} (2c)
Definition 8 (𝒯\mathcal{T}-graph).

Let El,mE^{l,m} denote the arcs of an \mathcal{F}-graph with the property that El,mE^{l,m} joins VlV^{l} to VmV^{m} for all ll and mm in NvN_{v}. Set El,m=EaE^{l,m}=E^{a}, where the domain of EaE^{a} is given by (2c). Set the domains of ViV^{i} according to (2b). The resulting \mathcal{F}-graph is called a 𝒯\mathcal{T}-graph.

Definition 9 (Label Space Trajectory).

A label space trajectory is a measurable function 𝐥():[t0,tf]t𝕃{\boldsymbol{l}}(\cdot):\mathbb{R}\supseteq[t_{0},t_{f}]\ni t\mapsto\mathbb{L} where tft0>0t_{f}-t_{0}>0 is some non-zero time interval in \mathbb{R}.

Proposition 1.

A label space trajectory is an \mathcal{F}-control for the 𝒯\mathcal{T}-graph.

Proof.

Proof: Let 𝒍():[t0,tf]𝕃{\boldsymbol{l}}(\cdot):[t_{0},t_{f}]\to\mathbb{L} be a label space trajectory. Because 𝕃a\mathbb{L}^{a} and 𝕃i\mathbb{L}^{i} are all disjoint sets, we have,

𝒍(t)𝕃1𝕃2𝕃|Nv|𝕃a for a.a. t[t0,tf]{\boldsymbol{l}}(t)\in\mathbb{L}^{1}\vee\mathbb{L}^{2}\vee\ldots\vee\mathbb{L}^{|N_{v}|}\vee\mathbb{L}^{a}\text{ for a.a. }t\in[t_{0},t_{f}] (3)

As a result, we can perform the following construction: Let,

t0<t1,<,<tn<tn+1=tft_{0}<t_{1},<\ldots,<t_{n}<t_{n+1}=t_{f}

be an increasing sequence of clock times such that for each j=0,1,,n,nj=0,1,\ldots,n,\ n\in\mathbb{N}, (tj+1tj)(t_{j+1}-t_{j}) is the maximum time duration for which we have,

𝒍(t){𝕃i for some iNvand for a.a. t(tj,tj+1)or𝕃afor a.a. t(tj,tj+1){\boldsymbol{l}}(t)\in\left\{\begin{array}[]{ll}\mathbb{L}^{i}\text{ for some }i\in N_{v}&\hbox{and for a.a. }t\in(t_{j},t_{j+1})\\ &\text{or}\\ \mathbb{L}^{a}&\hbox{for a.a. }t\in(t_{j},t_{j+1})\end{array}\right. (4)

The feasibility of constructing (4) follows from (3). Let,

ψj:=𝒍()|(tj,tj+1),j=0,1,,n\psi_{j}:={\boldsymbol{l}}(\cdot)\big{|}_{(t_{j},t_{j+1})},\ j=0,1,\ldots,n (5)

be the restrictions of 𝒍(){\boldsymbol{l}}(\cdot) to (tj,tj+1)(t_{j},t_{j+1}) for j=0,1,,nj=0,1,\ldots,n. Then, by (4) and (5), we have,

Im(ψj)𝕃i𝕃a for some iNv and j=0,1,,n\text{Im}(\psi_{j})\subseteq\mathbb{L}^{i}\vee\mathbb{L}^{a}\text{ for some }i\in N_{v}\text{ and }\forall\ j=0,1,\ldots,n (6)

Hence from (2), it follows that,

ψjdom(Vi)dom(Ea), for some iNv\psi_{j}\in\text{dom}(V^{i})\vee\text{dom}(E^{a}),\text{ for some }i\in N_{v}

Consequently the sequence ψ0,,ψn\langle\psi_{0},\ldots,\psi_{n}\rangle is an \mathcal{F}-control for the 𝒯\mathcal{T}-graph. ∎

Definition 10 (Trivial Label Space Trajectory).

A label space trajectory is said to be trivial if Im(𝐥())𝕃a\textrm{Im}({\boldsymbol{l}}(\cdot))\subseteq\mathbb{L}^{a}.

Theorem 1.

A nontrivial, continuous label space trajectory generates a control walk in a 𝒯\mathcal{T}-graph.

Proof.

Proof: Let ψj,j=0,,n\psi_{j},j=0,\ldots,n\in\mathbb{N} be the restrictions of 𝒍(){\boldsymbol{l}}(\cdot) as defined by (5). By Proposition 1, ψ0,ψ1,,ψn\langle\psi_{0},\psi_{1},\ldots,\psi_{n}\rangle is an \mathcal{F}-control; hence, any given ψj\psi_{j} is either in the domain of EaE^{a}, or of ViV^{i}, for some iNvi\in N_{v}. The rest of the proof is broken down to three cases:

Case(a): n2n\geq 2 and ψjdom(Ea)\psi_{j}\in\text{dom}(E^{a}) for some 0<j<n0<j<n.
From (4) and (5) we get the following conditions:

  1. 1.

    ψj1dom(Vl)\psi_{j-1}\in\text{dom}(V^{l}) for some lNvl\in N_{v}

  2. 2.

    ψj+1dom(Vm)\psi_{j+1}\in\text{dom}(V^{m}) for some mNvm\in N_{v}

Hence, ψj1,ψj,ψj+1\langle\psi_{j-1},\psi_{j},\psi_{j+1}\rangle is a control subwalk.

Case(b): n2n\geq 2 and ψjdom(Vi)\psi_{j}\in\text{dom}(V^{i}) for some 0<j<n0<j<n and some iNvi\in N_{v}.
By continuity of 𝒍(){\boldsymbol{l}}(\cdot), we have,

limttk+1ψk(t)=limttk+1ψk+1(t),k=0,,n\lim_{t\uparrow t_{k+1}}\psi_{k}(t)=\lim_{t\downarrow t_{k+1}}\psi_{k+1}(t),\quad k=0,\ldots,n (7)

Setting k=j1k=j-1 and k=jk=j in (7) we get the two continuity conditions,

limttjψj1(t)\displaystyle\lim_{t\uparrow t_{j}}\psi_{j-1}(t) =limttjψj(t)\displaystyle=\lim_{t\downarrow t_{j}}\psi_{j}(t) (8a)
limttj+1ψj(t)\displaystyle\lim_{t\uparrow t_{j+1}}\psi_{j}(t) =limttj+1ψj+1(t)\displaystyle=\lim_{t\downarrow t_{j+1}}\psi_{j+1}(t) (8b)

Hence, from (2c), (2b), (4) and (5), we have,

ψj1dom(Ea) and ψj+1dom(Ea)\psi_{j-1}\in\text{dom}(E^{a})\text{ and }\psi_{j+1}\in\text{dom}(E^{a}) (9)

If n=2n=2 (j=1\Rightarrow j=1), we get a trivial control walk. If j>1j>1, by using the same arguments as in Case(a), we get ψj2dom(Vk) for some lNv and ψj+2dom(Vm) for some mNv\psi_{j-2}\in\text{dom}(V^{k})\text{ for some }l\in N_{v}\text{ and }\psi_{j+2}\in\text{dom}(V^{m})\text{ for some }m\in N_{v}; hence, ψj2,ψj1,ψj,ψj+1,ψj+2\langle\psi_{j-2},\psi_{j-1},\psi_{j},\psi_{j+1},\psi_{j+2}\rangle is a control subwalk.

Case(c): n<2n<2.
By similar arguments as in Case (a) and (b), it is straightforward to show that ψ0,ψ1\langle\psi_{0},\psi_{1}\rangle is trivial control walk.

From Cases (a), (b) and (c), it follows that ψ0,ψ1,,ψn,n\langle\psi_{0},\psi_{1},\ldots,\psi_{n}\rangle,n\in\mathbb{N} is a control walk. ∎

3 Two 𝒯\mathcal{T}-Graph-Based Formulations of a TSP

Let 𝕃\mathbb{L} be a finite dimensional normed space. If we set 𝕃i\mathbb{L}^{i} as the “cities” in 𝕃\mathbb{L}, then a TSP and its many variants may be framed in terms of finding nontrivial optimal label-space trajectories. To illustrate the theoretical simplicity of our approach, we allow 𝕃i\mathbb{L}^{i} to be time dependent (i.e., in deterministic motion) so that some answers to the questions posed in Section 1 are readily apparent.

Definition 11 (Atomic Return Function).

For a fixed time tt, an atomic return function Ra:(𝐥,𝕃i(t))R_{a}:({\boldsymbol{l}},\mathbb{L}^{i}(t))\mapsto\mathbb{R} is defined by,

Ra(𝒍,𝕃i(t)){0if𝒍𝕃i(t)=0otherwiseR_{a}({\boldsymbol{l}},\mathbb{L}^{i}(t))\left\{\begin{array}[]{lll}\neq 0&\hbox{if}\ {\boldsymbol{l}}\in\mathbb{L}^{i}(t)\\ =0&\hbox{otherwise}\end{array}\right. (10)
Definition 12 (Atomic Return Functional).

An atomic return functional RiR^{i} is defined by,

Ri[𝒍(),t0,tf]:=t0tfRa(𝒍(t),𝕃i(t))𝑑tR^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]:=\int_{t_{0}}^{t_{f}}R_{a}\big{(}{\boldsymbol{l}}(t),\mathbb{L}^{i}(t)\big{)}dt (11)

where tft0>0t_{f}-t_{0}>0 is the time horizon of interest.

Atomic return functionals may be used as vertex functionals whenever 𝒍(t)𝕃i(t){\boldsymbol{l}}(t)\in\mathbb{L}^{i}(t). Whether or not a vertex functional is defined for a given problem, we define the Kronecker indicator function,

(𝒍,𝕃i(t)):={1if 𝒍𝕃i(t)0otherwise\mathcal{I}({\boldsymbol{l}},\mathbb{L}^{i}(t)):=\left\{\begin{array}[]{ll}1&\hbox{if }{\boldsymbol{l}}\in\mathbb{L}^{i}(t)\\ 0&\hbox{otherwise}\end{array}\right. (12)

as a fundamental return function. Equation (12) thus generates by way of (11) a fundamental return functional:

Definition 13 (Time-on-Task Functional).

A time-on-task functional is defined by,

Ti[𝒍(),t0,tf]:=t0tf(𝒍(t),𝕃i(t))𝑑tT^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]:=\displaystyle\int_{t_{0}}^{t_{f}}\mathcal{I}({\boldsymbol{l}}(t),\mathbb{L}^{i}(t))\,dt (13)

Because (13) generates a dwell time over vertex ii, we define a visit in the context of a walk in the following manner:

Definition 14 (Vertex Visit).

The vertex ii is said to have been visited in [t0,tf][t_{0},t_{f}] if

Ti[𝒍(),t0,tf]0T^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]\neq 0 (14)

Correspondingly, we say the vertex ii has not been visited if Ti[𝐥(),t0,tf]=0T^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]=0.

In the context of a 𝒯\mathcal{T}-graph formulation of a basic TSP,666By a basic TSP, we mean a problem that does not come with any additional qualifiers. a vertex functional ViV^{i} assigns a value of one for a visit, zero otherwise. Furthermore, an arc functional EaE^{a} is any functional that assigns a numerical value (of “distance”) for segments of t𝒍t\mapsto{\boldsymbol{l}} that are not associated with a vertex. If a visit is preceded and followed by an arc functional with no other visits of a vertex in between (including itself), we say vertex ii has been visited once. In this context, we say 𝒍(){\boldsymbol{l}}(\cdot) is Hamiltonian if all vertices are visited exactly once.

3.1 A Derivative-Based Formulation of a TSP

Lemma 1.

Let Di[𝐥(),t0,tf]D^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}] be the functional defined by,

Di[𝒍(),t0,tf]:=t0tf|dt(𝒍(t),𝕃i(t))|𝑑tD^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]:=\int_{t_{0}}^{t_{f}}\left|d_{t}\mathcal{I}\left({\boldsymbol{l}}(t),\mathbb{L}^{i}(t)\right)\right|dt (15)

where, dtd_{t} denotes the distributional derivative with respect to tt. Let 𝐥():[t0,tf]𝕃{\boldsymbol{l}}(\cdot):[t_{0},t_{f}]\to\mathbb{L} be a continuous label space trajectory such that Ti[𝐥(),t0,tf]0T^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]\neq 0. Furthermore, let 𝐥(t0)𝕃i(t0){\boldsymbol{l}}(t_{0})\not\in\mathbb{L}^{i}(t_{0}) and 𝐥(tf)𝕃i(tf){\boldsymbol{l}}(t_{f})\not\in\mathbb{L}^{i}(t_{f}). Then,

Di[𝒍(),t0,tf]2+D^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]\in 2\mathbb{N}_{+} (16)
Proof.

Proof: Consider the time intervals Δtout\Delta t_{out} and Δtin\Delta t_{in} defined by,

Δtout:={t[t0,tf]:(𝒍(t),𝕃i(t)=0}Δtin:={t[t0,tf]:(𝒍(t),𝕃i(t)=1}\Delta t_{out}:=\left\{t\in[t_{0},t_{f}]:\ \mathcal{I}({\boldsymbol{l}}(t),\mathbb{L}^{i}(t)=0\right\}\quad\Delta t_{in}:=\left\{t\in[t_{0},t_{f}]:\ \mathcal{I}({\boldsymbol{l}}(t),\mathbb{L}^{i}(t)=1\right\} (17)

By assumption Ti[𝒍(),t0,tf]0T^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]\neq 0; hence, we have μ(Δtin)>0\mu\left(\Delta t_{in}\right)>0. Likewise, we have μ(Δtout)>0\mu\left(\Delta t_{out}\right)>0. Hence, we can write,

d(𝒍(t),𝕃i(t))dt=0t{int(Δtout)}{int(Δtin)}\frac{d\mathcal{I}\left({\boldsymbol{l}}(t),\mathbb{L}^{i}(t)\right)}{dt}=0\quad\forall\ t\in\left\{\text{int}\left(\Delta t_{out}\right)\right\}\cup\left\{\text{int}\left(\Delta t_{in}\right)\right\}

where, int()(\cdot) denotes the interior of the set ()(\cdot). Let txΔtintt_{x}\in\partial\Delta t_{int} where ()\partial(\cdot) denotes the boundary of the set ()(\cdot). Then, the distributional derivative of the function t(𝒍(t),𝕃i(t))t\mapsto\mathcal{I}\left({\boldsymbol{l}}(t),\mathbb{L}^{i}(t)\right) may be written as,

dt(𝒍(t),𝕃i(t))=δ(ttx)δ(ttx)d_{t}\mathcal{I}\left({\boldsymbol{l}}(t),\mathbb{L}^{i}(t)\right)=-\delta(t-t_{x})\vee\delta(t-t_{x}) (18)

where, δ(ttx)\delta(t-t_{x}) is the Dirac delta function centered at t=txt=t_{x}. Hence,

|dt(𝒍(t),𝕃i(t))|=δ(ttx)\left|d_{t}\mathcal{I}\left({\boldsymbol{l}}(t),\mathbb{L}^{i}(t)\right)\right|=\delta(t-t_{x}) (19)

Because 𝒍(t0){\boldsymbol{l}}(t_{0}) and 𝒍(tf){\boldsymbol{l}}(t_{f}) are not in 𝕃i(t0)\mathbb{L}^{i}(t_{0}) and 𝕃i(tf)\mathbb{L}^{i}(t_{f}) respectively, integrating (19) we get,

t0tf|dt(𝒍(t),𝕃i(t))|𝑑t2+\int_{t_{0}}^{t_{f}}\left|d_{t}\mathcal{I}\left({\boldsymbol{l}}(t),\mathbb{L}^{i}(t)\right)\right|dt\in 2\mathbb{N}_{+}

where, we have used the fact that the integral of a delta function is unity. ∎

Corollary 1.

If 𝐥(t0)𝕃i(t0){\boldsymbol{l}}(t_{0})\in\mathbb{L}^{i}(t_{0}) and 𝐥(tf)𝕃i(tf){\boldsymbol{l}}(t_{f})\in\mathbb{L}^{i}(t_{f}), then (16) generalizes to Di[𝐥(),t0,tf]2D^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]\in 2\mathbb{N}. If 𝐥(t0)𝕃i(t0){\boldsymbol{l}}(t_{0})\in\mathbb{L}^{i}(t_{0}) or 𝐥(tf)𝕃i(tf){\boldsymbol{l}}(t_{f})\in\mathbb{L}^{i}(t_{f}), then the statement of Lemma 1 may be further generalized to Di[𝐥(),t0,tf]D^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]\in\mathbb{N}.

Theorem 2.

Let 𝐥():[t0,tf]𝕃{\boldsymbol{l}}(\cdot):[t_{0},t_{f}]\to\mathbb{L} be a continuous label space trajectory. Then, 𝐥(){\boldsymbol{l}}(\cdot) is Hamiltonian if and only if

Di[𝒍(),t0,tf]=2iNvD^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]=2\quad\forall\ i\in N_{v} (20)
Proof.

Proof: If 𝒍(){\boldsymbol{l}}(\cdot) is Hamiltonian, then every vertex has been visited just once; hence, (20) follows from Lemma 1. If (20) holds, then every vertex has been visited just once in [t0,tf][t_{0},t_{f}] (with the understanding of a visit given by (14)). ∎

Although continuity of a label space trajectory is sufficiently smooth to generate a Hamiltonian cycle (Cf. Theorem 1), we now consider the space of absolutely continuous functions for the convenience of using the derivative of t𝒍t\mapsto{\boldsymbol{l}} to define arc lengths. To this end, let 𝒍˙(t)\dot{\boldsymbol{l}}(t) denote the time derivative of 𝒍(t){\boldsymbol{l}}(t); then, a distance functional may be defined according to,

Jdist[𝒍(),t0,tf]:=t0tf𝒍˙(t)𝑑tJ_{dist}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]:=\displaystyle\int_{t_{0}}^{t_{f}}\left\|\dot{\boldsymbol{l}}(t)\right\|dt (21)

The integrand in (21) is any finite-dimensional norm. If the two-norm is used, then the numerical value of JdistJ_{dist} is consistent with the notion of Euclidean distance illustrated in Fig. 2. Combining (21) with Theorem 2, a shortest distance TSP can be formulated as,

(D-TSP){MinimizeJdist[𝒍(),t0,tf]:=t0tf𝒍˙(t)𝑑tSubject to Di[𝒍(),t0,tf]=2iNv𝒍(tf)=𝒍(t0)\displaystyle\big{(}\text{$D$-TSP}\big{)}\left\{\begin{array}[]{lll}\text{Minimize}&J_{dist}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]:=\displaystyle\int_{t_{0}}^{t_{f}}\left\|\dot{\boldsymbol{l}}(t)\right\|dt\\[11.00008pt] \text{Subject to }&\displaystyle D^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]=2\quad\forall\ i\in N_{v}\\[6.99997pt] &{\boldsymbol{l}}(t_{f})={\boldsymbol{l}}(t_{0})\end{array}\right. (25)

The last constraint equation in (25) simply ensures that the resulting label-space trajectory is a closed control-walk.

In comparing it with the various discrete variable optimization formulations of a TSP [5, 7, 15], it is apparent that (25) contains no explicit subtour-type elimination constraints. This is because the (absolute) continuity of the label space trajectory ensures (by Theorem 1) that the closed control-walk generated by (25) is a single Hamilton cycle.

3.2 An Integral-Based Formulation of a TSP

A TSP may also be formulated (in the sense of a 𝒯\mathcal{T}-graph) using the time-on-task functional to construct a new indicator-type functional.

Definition 15 (Control-Walk Indicator Functional).

A control-walk indicator functional is defined by,

Wi[𝒍(),t0,tf]:={1if Ti[𝒍(),t0,tf]00if Ti[𝒍(),t0,tf]=0W^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]:=\left\{\begin{array}[]{lll}1&\hbox{if }\ T^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]\neq 0\\[11.00008pt] 0&\hbox{if }\ T^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]=0\end{array}\right. (26)

Using WiW^{i} as an indicator of a vertex visit, we arrive at an alternative formulation of a TSP:

(I-TSP){MinimizeJdist[𝒍(),t0,tf]:=t0tf𝒍˙(t)𝑑tSubject to Wi[𝒍(),t0,tf]=1iNv𝒍(tf)=𝒍(t0)\displaystyle\big{(}\text{$I$-TSP}\big{)}\left\{\begin{array}[]{lll}\text{Minimize}&J_{dist}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]:=\displaystyle\int_{t_{0}}^{t_{f}}\left\|\dot{\boldsymbol{l}}(t)\right\|dt\\[11.00008pt] \text{Subject to }&\displaystyle W^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]=1\quad\forall\ i\in N_{v}\\[6.99997pt] &{\boldsymbol{l}}(t_{f})={\boldsymbol{l}}(t_{0})\end{array}\right. (30)

Equations (30) and (25) are identical except for the imposition of degree constraints. Formulation (II-TSP) requires that the vertex be visited at least once.

No Entity Discrete Optimization 𝒯\mathcal{T}-Graph Formulations
1 optimization variable discrete continuous function
2 cost matrix; i.e., explicit arc/edge weights required not required
3 explicit degree constraints required required in (25); not in (30)
4 explicit subtour elimination constraints required not required
Table 1: A discriminating comparison between a discrete-variable and 𝒯\mathcal{T}-graph formulations of a TSP

A comparison between the 𝒯\mathcal{T}-graph and discrete-variable problem formulations is summarized in Table 1. As noted in the second row of Table 1, the 𝒯\mathcal{T}-graph formulations do not require the explicit construction of the arc/edge weights; i.e., the cost matrix for the computation of the objective function in the discrete-variable formulation. This data is “generated” simultaneously via (21) as part of the process of solving the problem. That is, in terms of the concepts introduced in Section 2, the 𝒯\mathcal{T}-graph formulation incorporates the simultaneous selection of the function sequence and the functions themselves from the domains of the vertex and arc functionals.

Remark 6.

The 𝒯\mathcal{T}-graph formulations presented in this section are not transformations of the well-established discrete-optimization models of TSPs; rather, they are a realization of a fundamentally new domain of analysis.

4 Sample 𝒯\mathcal{T}-Graph Formulations of Several Variants of a TSP

Because of the large number of variants of a TSP, we limit the scope of this section to a small sample of selective problems to illustrate the new modeling framework.

4.1 An Orienteering Problem

Let σi>0\sigma^{i}>0 be the score associated with each 𝕃i(t)\mathbb{L}^{i}(t). Define a score functional according to:

Si[𝒍(),t0,tf]:=σiWi[𝒍(),t0,tf]S^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]:=\sigma^{i}W^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]

where, WiW^{i} is given by (26). An orienteering problem (OP) may now be defined as,

(OP){MaximizeiNvSi[𝒍(),t0,tf]Subject tot0tf1𝑑ttmax𝒍(t0)𝕃0𝒍(tf)𝕃|Nv|\displaystyle\big{(}\text{OP}\big{)}\left\{\begin{array}[]{lll}\text{Maximize}&\displaystyle\sum_{i\in N_{v}}S^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]\\[11.00008pt] \text{Subject to}&\displaystyle\int_{t_{0}}^{t_{f}}1\,dt\leq t_{max}\\[6.99997pt] &{\boldsymbol{l}}(t_{0})\in\mathbb{L}^{0}\\ &{\boldsymbol{l}}(t_{f})\in\mathbb{L}^{|N_{v}|}\end{array}\right. (35)

The payoff functional is the sum of the score functionals. The maximum allowable time is tmax>0t_{max}>0. The time constraint in (35) is written in terms of the integral of one to merely illustrate the fact that the resource constraint is a functional. The domain of 𝒍(){\boldsymbol{l}}(\cdot) in (35) is the space of continuous functions in accordance with Theorem 1.

4.2 An Orienteering Problem With Neighborhoods

Problem (OP) given by (35) was motivated by the usual orienteering problem discussed in the literature [11]. Using the atomic return function concept introduced in (10), we may now score a traversal through a neighborhood based on the values of the atomic return functional (Cf. (11)). Although there are many ways to score a traversal, we illustrate a problem formulation based on the following construction,

Snbdi[𝒍(),t0,tf]:={σiifRi[𝒍(),t0,tf]ri0ifRi[𝒍(),t0,tf]<riS^{i}_{nbd}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]:=\left\{\begin{array}[]{lll}\sigma^{i}&\hbox{if}\ R^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]\geq r^{i}\\ 0&\hbox{if}\ R^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]<r^{i}\end{array}\right. (36)

In (36), ri>0r^{i}>0 is a required “revenue” from a visit to 𝕃i(t)\mathbb{L}^{i}(t). If a label-space traversal t𝒍t\mapsto{\boldsymbol{l}} across the city does not generate a revenue of at least rir^{i} as measured by the return functional RiR^{i}, then the salesman gets no commission (i.e., zero score). On the other hand, if the salesman performs a judicious travel through the city to generate a revenue of at least rir^{i}, then, he is rewarded by σi>0\sigma^{i}>0. The salesman makes no extra commission for generating a revenue greater than rir^{i} at 𝕃i\mathbb{L}^{i}; i.e., he is encouraged to expand the market by visiting a different city. This situation arises in astronomy [4] where a telescope is required to scan a portion of the sky to collect a specific frequency of electromagnetic (EM) radiation [18]. Scientific value is generated based on the integration of EM collects. A critical amount of EM collects (rir^{i}) is necessary to perform useful science. No extra science is generated once the targeted amount rir^{i} is reached; hence, the telescope is awarded σi\sigma^{i} for performing a task successfully with no “extra” credit. Thus, the orienteering problem with neighborhoods may be defined according to,

(OP-nbd){MaximizeiNvSnbdi[𝒍(),t0,tf]Subject to Ck[𝒍(),t0,tf]CmaxkkNC𝒍(t0)𝕃0𝒍(tf)𝕃|Nv|\displaystyle\big{(}\text{OP-nbd}\big{)}\left\{\begin{array}[]{lll}\text{Maximize}&\displaystyle\sum_{i\in N_{v}}S^{i}_{nbd}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]\\[11.00008pt] \text{Subject to }&\displaystyle C^{k}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]\leq C^{k}_{max}\quad\forall\ k\in N_{C}\\[6.99997pt] &{\boldsymbol{l}}(t_{0})\in\mathbb{L}^{0}\\ &{\boldsymbol{l}}(t_{f})\in\mathbb{L}^{|N_{v}|}\end{array}\right. (41)

In (41), the constraints Ck[𝒍(),t0,tf]Cmaxk,kNCC^{k}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]\leq C^{k}_{max},\ \forall\ k\in N_{C} are NCN_{C}\subset\mathbb{N} generic resource constraint, where Cmaxk>0C^{k}_{max}>0 is the maximum “capacity” associated with the kk-th resource.

4.3 A Fast TSP

Minimizing distance traveled does not necessarily equate to minimum time; this fact has been known since Bernoulli posed his famous Brachistochrone problem as a mathematical challenge in the year 1696 [22, 28]. In framing a more interesting minimum-time TSP, we let τi>0\tau^{i}>0 be an additional constraint of a minimum required dwell-time over 𝕃i(t)\mathbb{L}^{i}(t). In this case, a fast (i.e., minimum-time) TSP may be framed as follows:

(fastTSP){MinimizeJtime[𝒍(),t0,tf]:=t0tf1𝑑tSubject toTi[𝒍(),t0,tf]τiiNv𝒍(tf)=𝒍(t0)\displaystyle\big{(}\text{fastTSP}\big{)}\left\{\begin{array}[]{lll}\text{Minimize}&J_{time}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]:=\displaystyle\int_{t_{0}}^{t_{f}}1\,dt\\[11.00008pt] \text{Subject to}&T^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]\geq\tau^{i}\qquad\forall\ i\in N_{v}\\[6.99997pt] &{\boldsymbol{l}}(t_{f})={\boldsymbol{l}}(t_{0})\end{array}\right. (45)

For the same reason as the formulation of the constraint in (35), the cost function in (45) is written as an integral to emphasize the fact that the travel-time is a functional. The functional TiT^{i} in (45) is given by (13). Furthermore, while the dwell-time constraint in (45) may, in principle, be written as an equality, Ti[𝒍(),t0,tf]=τiT^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]=\tau^{i}, the advantage of an inequality is that it allows the solution to satisfy the stipulated requirement without incurring a penalty in performance for “navigating through the city” to find an optimal exit point at an optimal exit time.

4.4 Dynamic TSP with Time Windows

Because 𝕃i(t),iNv\mathbb{L}^{i}(t),i\in N_{v} are “moving cities,” dynamic TSPs are are explicitly incorporated in all of the preceding problem formulations. These formulations have also implicitly incorporated time windows because of the following argument: Let tai[t0,tf]t_{a}^{i}\in[t_{0},t_{f}] and tbi[t0,tf]t_{b}^{i}\in[t_{0},t_{f}] be two given clock-times associated with each 𝕃i(t)\mathbb{L}^{i}(t) with tai<tbit_{a}^{i}<t_{b}^{i}. Consider the augmented set defined by,

𝕃augi(t):=𝕃i(t)×[tai,tbi]\mathbb{L}^{i}_{aug}(t):=\mathbb{L}^{i}(t)\times[t_{a}^{i},t_{b}^{i}] (46)

Then, it is clear that the space defined by,

𝕃aug(t):=𝕃(t)×[t0,tf]\mathbb{L}_{aug}(t):=\mathbb{L}(t)\times[t_{0},t_{f}]

is a label space, whose disjoint sets are defined by (46). Hence, by defining a new label-space variable 𝒍aug:=(𝒍,t){\boldsymbol{l}}_{aug}:=({\boldsymbol{l}},t), it is apparent that no additional theoretical developments are necessary to incorporate time-window constraints in the proposed 𝒯\mathcal{T}-graph formulations. Furthermore, note that the time window in (46) may itself also vary with respect to time.

In view of economic considerations that go beyond distance and time, a significantly greater degree of flexibility in modeling can be obtained by transforming the preceding 𝒯\mathcal{T}-graph “variational” formulations to their optimal-control versions.

5 A 𝒯\mathcal{T}-Graph Optimal Control Framework

By limiting the space of allowable label space trajectories to the space of absolutely continuous functions, it is possible to frame a significantly richer class of traveling-salesman-type problems. To develop this transformed framework, we first set the derivative of 𝒍(t){\boldsymbol{l}}(t) as a candidate optimization variable,

𝒍˙(t)=𝒘(t)\dot{\boldsymbol{l}}(t)={\boldsymbol{w}}(t) (47)

where, 𝒘Nl,Nl{\boldsymbol{w}}\in{\mathbb{R}}^{N_{l}},N_{l}\in\mathbb{N} is the label-space tangent control variable. Thus, for example, (45) transforms according to,

(fastTSP: OC-ver){missingMinimizeJtime[𝒍(),𝒘(),t0,tf]:=tft0Subject to 𝒍˙(t)=𝒘(t)Ti[𝒍(),t0,tf]τiiNv𝒍(tf)=𝒍(t0)\displaystyle\big{(}\text{fastTSP: OC-ver}\big{)}\left\{\begin{array}[]{lll}\displaystyle\mathop{\text{missing}}{Minimize}&\displaystyle J_{time}[{\boldsymbol{l}}(\cdot),{\boldsymbol{w}}(\cdot),t_{0},t_{f}]:=t_{f}-t_{0}\\[11.00008pt] \text{Subject to }&\dot{\boldsymbol{l}}(t)={\boldsymbol{w}}(t)\\[6.99997pt] &T^{i}[{\boldsymbol{l}}(\cdot),t_{0},t_{f}]\geq\tau^{i}\qquad\forall\ i\in N_{v}\\[6.99997pt] &{\boldsymbol{l}}(t_{f})={\boldsymbol{l}}(t_{0})\end{array}\right. (52)

In comparing it with (45), note that (52) contains an additional decision variable 𝒘(){\boldsymbol{w}}(\cdot) and an additional constraint given by (47). This aspect of the transformation may be used advantageously; for example, if the objective functional in (52) is replaced by,

Jdist[𝒍(),𝒘(),t0,tf]:=t0tf𝒘(t)𝑑tJ_{dist}[{\boldsymbol{l}}(\cdot),{\boldsymbol{w}}(\cdot),t_{0},t_{f}]:=\displaystyle\int_{t_{0}}^{t_{f}}\left\|{\boldsymbol{w}}(t)\right\|dt

then, the resulting problem transforms to yet another formulation of a minimum-distance TSP (Cf. Section 3). In fact, the most important aspect of (52) is that it provides a clear avenue for generalization that is conducive to modeling traveling-salesman-type problems driven by constrained, nonlinear dynamical systems such as those illustrated in Fig. 4.

5.1 Generalization Based on Nonlinear Dynamics

In many applications such as robotics [13], the natural home space for the “salesman” is some state space 𝕏\mathbb{X} that is not necessarily the label space. Furthermore, the dynamics of the salesman is given by some nonlinear controllable ordinary differential equation of the type,

𝒙˙=𝒇(𝒙,𝒖,t)\dot{\boldsymbol{x}}={\boldsymbol{f}}(\boldsymbol{x},{\boldsymbol{u}},t) (53)

where, 𝒙Nx\boldsymbol{x}\in{\mathbb{R}}^{N_{x}} is a state variable, 𝒖Nu{\boldsymbol{u}}\in{\mathbb{R}}^{N_{u}} is the control variable and 𝒇:Nx×Nu×Nx{\boldsymbol{f}}:{\mathbb{R}}^{N_{x}}\times{\mathbb{R}}^{N_{u}}\times\mathbb{R}\to{\mathbb{R}}^{N_{x}} is the nonlinear dynamics function. Because all aspects of the problem definition so far are described in terms of a label space, it is necessary to connect it to the state space. Let this connection be given by some algebraic equation,

𝒈(𝒍,𝒙,t)=𝟎{\boldsymbol{g}}({\boldsymbol{l}},\boldsymbol{x},t)={\bf 0} (54)

where, 𝒈:Nl×Nx×Ng{\boldsymbol{g}}:{\mathbb{R}}^{N_{l}}\times{\mathbb{R}}^{N_{x}}\times\mathbb{R}\to{\mathbb{R}}^{N_{g}} is called a connexion function. An example of a physical description of state space, label space and the connexion function is illustrated in Fig. 5.

Refer to caption

Figure 5: A conceptual illustration of state space, label space and the connexion function for a drone tasking problem[24].

In this example, an uninhabited aerial vehicle (UAV) is tasked to collect over a geographic region [9, 24]. The areas of interest vary from a point area to a broad area of scan. Technical properties associated with the areas of interest are defined in label space. The state variables of the UAV is given in terms of of its position, velocity and the orientation of the maneuverable camera. The connexion function is the mathematical model that connects the label space variables to the state space variables based on the precise position and orientation of the camera at a given instant of time.

From (53) and (54), it follows that (47) may be written implicitly as a differential algebraic equation,

𝒙˙=𝒇(𝒙,𝒖,t)𝟎=𝒈(𝒍,𝒙,t)}𝒍˙=𝒘{\left.\begin{array}[]{l}\dot{\boldsymbol{x}}={\boldsymbol{f}}(\boldsymbol{x},{\boldsymbol{u}},t)\\ {\bf 0}={\boldsymbol{g}}({\boldsymbol{l}},\boldsymbol{x},t)\end{array}\right\}}\quad\Longrightarrow\quad\dot{\boldsymbol{l}}={\boldsymbol{w}} (55)

The significance of replacing (47) by the state-space representation is that it facilitates a direct means to incorporate the full nonlinear dynamics of a salesman in the optimization problem formulation.

5.2 Generalization Based on Economics-Driven Cost/Payoff/Constraint Functionals

As a result of (55), the decision variables expand to the tuple, (𝒍(),𝒙(),𝒖(),t0,tf)({\boldsymbol{l}}(\cdot),\boldsymbol{x}(\cdot),{\boldsymbol{u}}(\cdot),t_{0},t_{f}). This can be taken to great advantage by formulating cost, payoff and/or constraint functionals in their more natural “home spaces.” For example, in the electric vehicle-routing problem [25], the capacity constraint of the battery is more naturally expressed as a functional constraint in terms of the state 𝒙\boldsymbol{x} of the vehicle [26],

t0tfS(𝒙(t))𝑑tCsafe\int_{t_{0}}^{t_{f}}S(\boldsymbol{x}(t))\,dt\geq C_{safe} (56)

In (56), SS is the state-of-charge and CsafeC_{safe} is a battery charge required for safe operations. In the absence of extending the decision variables to incorporate the state vector, (56) would need to be transformed to a label space constraint using (54). This difficult task is completely circumvented by incorporating the state vector as part of the optimization variable. The alternative is to generate proxy models [25] as a means to extend the scope of vehicle routing problems [29].

In (45), the travel time is an implicit functional of the label-space trajectory 𝒍(){\boldsymbol{l}}(\cdot). Replacing (47) by (55), the travel time now becomes an implicit functional of 𝒍(),𝒙(){\boldsymbol{l}}(\cdot),\boldsymbol{x}(\cdot) and 𝒖(){\boldsymbol{u}}(\cdot). That is, the functional,

Jtime[𝒍(),𝒙(),𝒖(),t0,tf]:=tft0J_{time}[{\boldsymbol{l}}(\cdot),\boldsymbol{x}(\cdot),{\boldsymbol{u}}(\cdot),t_{0},t_{f}]:=t_{f}-t_{0} (57)

with constraints given by (55) offers a more accurate model of travel time with obvious business implications in practical routing problems. Note that (57) also allows the clock time to be optimized. Equation (57) may be further modified to take into account gas/power consumption depending upon the vehicle type (standard/hybrid/electric). Analogous to (56), gas/power consumption may be written in terms of an integral,

t0tff0(𝒙(t),𝒖(t),t)𝑑t\displaystyle\int_{t_{0}}^{t_{f}}f_{0}\big{(}\boldsymbol{x}(t),{\boldsymbol{u}}(t),t\big{)}\,dt (58)

where, f0f_{0} is a function that models the time-varying gas/power consumption rate. This function may be well-modeled using the physics of the automobile power train [26]. A convex combination of (58) with (57) generates the functional,

Jhybrid[𝒍(),𝒙(),𝒖(),t0,tf]:=α(tft0)+(1α)t0tff0(𝒙(t),𝒖(t),t)𝑑tα[0,1]J_{hybrid}[{\boldsymbol{l}}(\cdot),\boldsymbol{x}(\cdot),{\boldsymbol{u}}(\cdot),t_{0},t_{f}]:=\alpha(t_{f}-t_{0})\\ +(1-\alpha)\displaystyle\int_{t_{0}}^{t_{f}}f_{0}\big{(}\boldsymbol{x}(t),{\boldsymbol{u}}(t),t\big{)}\,dt\qquad\alpha\in[0,1] (59)

The parameter α\alpha in (59) offers a sliding scale over the “trade-space” of time and energy consumption.

5.3 An Optimal Control Framework for a TSP and its Variants

Substituting (55) in (52) while simultaneously adding additional levels of abstraction for the functionals associated with the vertices and arcs of the underlying 𝒯\mathcal{T}-graph, we arrive at:

(𝒯𝒳P){missingMinimizeJ[𝒍(),𝒙(),𝒖(),t0,tf]Subject to 𝒙˙(t)=𝒇(𝒙(t),𝒖(t),t)𝒈(𝒍(t),𝒙(t),t)=𝟎Km[𝒍(),𝒙(),𝒖(),t0,tf]{0,mN0=0,mN=0(𝒍(t0),𝒍(tf))𝕃b𝕃𝒙(t)𝕏(t)𝒖(t)𝕌(t,𝒙(t))\displaystyle\big{(}\mathcal{T_{X}}P\big{)}\left\{\begin{array}[]{lll}\displaystyle\mathop{\text{missing}}{Minimize}&\displaystyle J[{\boldsymbol{l}}(\cdot),\boldsymbol{x}(\cdot),{\boldsymbol{u}}(\cdot),t_{0},t_{f}]\\ \text{Subject to }&\dot{\boldsymbol{x}}(t)={\boldsymbol{f}}(\boldsymbol{x}(t),{\boldsymbol{u}}(t),t)\\ &{\boldsymbol{g}}({\boldsymbol{l}}(t),\boldsymbol{x}(t),t)={\bf 0}\\ &K^{m}\left[{\boldsymbol{l}}(\cdot),\boldsymbol{x}(\cdot),{\boldsymbol{u}}(\cdot),t_{0},t_{f}\right]\left\{\begin{array}[]{ll}\leq 0,&\forall\ m\in N_{\geq 0}\\ =0,&\forall\ m\in N_{=0}\end{array}\right.\\[1.00006pt] &\big{(}{\boldsymbol{l}}(t_{0}),{\boldsymbol{l}}(t_{f})\big{)}\in\mathbb{L}^{b}\subseteq\mathbb{L}\\ &\boldsymbol{x}(t)\in\mathbb{X}(t)\\ &{\boldsymbol{u}}(t)\in\mathbb{U}(t,\boldsymbol{x}(t))\end{array}\right. (69)

In (69), the objective function is simply stated in terms of an abstract functional JJ in order to facilitate the formulation of a generic cost functional that go beyond those discussed in Sec. 5.2. Likewise, the functionals KmK^{m} in (69) may be time-on-task functionals, walk-indicator functionals, degree functionals, capacity-constraint functionals or some other functionals. Furthermore, because each vertex may have several constraints, the total number of these constraints may be greater than |Nv|\left|N_{v}\right|. The index sets N=0N_{=0} and N0N_{\geq 0} simply organize the constraint functionals into equalities and inequalities. The set 𝕃b𝕃\mathbb{L}^{b}\subseteq\mathbb{L} stipulates the boundary conditions for the label space trajectory. Also included in (69) are state variable constraints 𝒙(t)𝕏(t)\boldsymbol{x}(t)\in\mathbb{X}(t) and state-dependent control constraints given by 𝒖(t)𝕌(t,𝒙(t)){\boldsymbol{u}}(t)\in\mathbb{U}(t,\boldsymbol{x}(t)). Such constraints are included in Problem (𝒯𝒳P)(\mathcal{T_{X}}P) because they are critically important in practical applications[22, 28] as well as coordinate transformations of optimal control problems[22].

Because (69) is a generalization of the problems discussed in Sections 3–5, it is evident that a fairly large class of traveling-salesman-like problems can be modeled as instances of Problem (𝒯𝒳P)(\mathcal{T_{X}}P).

6 Illustrative Numerical Examples

A motorized TSP was first introduced in [27]. In essence, a motor is a differential equation; hence, a motorized TSP is one with dynamical constraints. Here, we combine this problem with several other variants of the TSP [10, 6] to generate a minimum-time close-enough motorized traveling salesman problem with forbidden neighborhoods given by:

𝒍4,𝒙4,𝒖2\displaystyle{\boldsymbol{l}}\in{\mathbb{R}}^{4},\quad\boldsymbol{x}\in{\mathbb{R}}^{4},\quad{\boldsymbol{u}}\in{\mathbb{R}}^{2}
(fastCEMTSPFN-1){missingMinimizeJ[𝒍(),𝒙(),𝒖(),t0,tf]:=tft0Subject to x˙1(t)=x3(t),|x3(t)|1t[t0,tf]x˙2(t)=x4(t),|x4(t)|1t[t0,tf]x˙3(t)=u1(t),|u1(t)|1t[t0,tf]x˙4(t)=u2(t),|u2(t)|1t[t0,tf]li(t)xi(t)=0,t[t0,tf]i=1,2,3,4Ti[𝒍(),t0,tf]τmini,i=1,,Nv(x1(t)x1iai)2+(x2(t)x2ibi)21,i=1,Nobs(𝒍(t0),𝒍(tf))=(𝟎,𝟎)\displaystyle\big{(}\textit{fastCEMTSPFN-1}\big{)}\left\{\begin{array}[]{lll}\displaystyle\mathop{\text{missing}}{Minimize}&\displaystyle J[{\boldsymbol{l}}(\cdot),\boldsymbol{x}(\cdot),{\boldsymbol{u}}(\cdot),t_{0},t_{f}]:=t_{f}-t_{0}\\ \text{Subject to }&\dot{x}_{1}(t)=x_{3}(t),\quad\left|x_{3}(t)\right|\leq 1\quad\forall\ t\in[t_{0},t_{f}]\\ &\dot{x}_{2}(t)=x_{4}(t),\quad\left|x_{4}(t)\right|\leq 1\quad\forall\ t\in[t_{0},t_{f}]\\ &\dot{x}_{3}(t)=u_{1}(t),\quad\left|u_{1}(t)\right|\leq 1\quad\forall\ t\in[t_{0},t_{f}]\\ &\dot{x}_{4}(t)=u_{2}(t),\quad\left|u_{2}(t)\right|\leq 1\quad\forall\ t\in[t_{0},t_{f}]\\ &l_{i}(t)-x_{i}(t)=0,\quad\forall\ t\in[t_{0},t_{f}]\\ &\qquad i=1,2,3,4\\ &T^{i}\left[{\boldsymbol{l}}(\cdot),t_{0},t_{f}\right]\geq\tau^{i}_{min},\quad\forall\ i=1,\ldots,N_{v}\\[5.0pt] &\displaystyle\left(\frac{x_{1}(t)-x_{1}^{i}}{a^{i}}\right)^{2}+\left(\frac{x_{2}(t)-x_{2}^{i}}{b^{i}}\right)^{2}\geq 1,\\ &\qquad\forall\ i=1,\ldots N_{obs}\\ &\big{(}{\boldsymbol{l}}(t_{0}),{\boldsymbol{l}}(t_{f})\big{)}=({\bf 0},{\bf 0})\end{array}\right. (81)

It is apparent that (81) is a special case of (69). In fact, it is a generalization of the fast TSP posed in (52). Equation (81) is motorized by the four differential equations resulting from an elementary application of Newtonian dynamics: The position and velocity of the salesman are (x1,x2)2(x_{1},x_{2})\in{\mathbb{R}}^{2} and (x3,x4)2(x_{3},x_{4})\in{\mathbb{R}}^{2} respectively. Both the velocity and acceleration (u1,u2)2(u_{1},u_{2})\in{\mathbb{R}}^{2} vectors are constrained in the \ell_{\infty} norm (i.e., components are constrained in terms of their absolute values). The NobsN_{obs} ellipsoidal constraints in (81) are the forbidden neighborhoods whose centroids are given by (x1i,x2i),i=1,,Nobs(x_{1}^{i},x_{2}^{i}),\ i=1,\ldots,N_{obs}. The parameters ai>0a^{i}>0 and bi>0b^{i}>0 determine the shape of the ellipse. The last constraint in (81) requires the salesman to start and end at the origin.

A sample data set for (81) with Nv=11N_{v}=11 and Nobs=2N_{obs}=2 is shown in Fig. 6.

Refer to caption

Figure 6: Solution to Problem (fastCEMTSPFN-1) for 11 cities and 2 forbidden neighborhoods marked FN1 and FN2.

Also shown in Fig. 6 is the solution obtained by solving (81) using DIDO’s implementation[19] of pseudospectral optimal control theory [23]. Some key points to note regarding the solution presented in Fig. 6 are:

  1. 1.

    Nearly all the “arcs” between city-pairs are curvilinear. This is because the salesman’s trajectory is required to satisfy the Newtonian dynamical constraints as well as the instantaneous \ell_{\infty} bounds on velocity and acceleration as dictated in (81).

  2. 2.

    The city-pair arcs as well as the entry and exit points to the city are a natural outcome of solving the posed problem (Cf. (81)). That is, the city-pair arcs were not determined either a priori or a posteriori to the determination of the city sequence. See also Remark 2.

  3. 3.

    The ellipsoidal forbidden neighborhood marked FN2 overlaps neighborhood No. 9. Thus, the allowable region for neighborhood No. 9 is nonconvex.

Next, consider a “variant” of (81) obtained by replacing the \ell_{\infty} constraints on velocity and acceleration by its 2\ell_{2} version,

(fastCEMTSPFN-2):x32(t)+x42(t)1,u12(t)+u22(t)1t[t0,tf](\textit{fastCEMTSPFN-2}):\quad\sqrt{x_{3}^{2}(t)+x_{4}^{2}(t)}\leq 1,\ \sqrt{u_{1}^{2}(t)+u_{2}^{2}(t)}\leq 1\qquad\forall\ t\in[t_{0},t_{f}] (82)

That is, Problem (fastCEMTSPFN-2) is identical to Problem (fastCEMTSPFN-1) except for the additional nonlinear constraint given by (82). A solution to this problem is shown in Fig. 7.

Refer to caption

Figure 7: Solution to Problem (fastCEMTSPFN-2) for the same data shown in Fig. 6.

It is apparent that Fig. 7 differs from Fig. 6 in several ways:

  1. 1.

    The turn at neighborhood No. 6 in Fig. 6 is significantly sharper than the corresponding one in Fig. 7. The curvilinear turn in Fig. 7 is a clear and direct result of the 2\ell_{2} constraint given by (82).

  2. 2.

    The sequence of visits in Fig. 7 is different from that of Fig. 6; for example, compare the visit to neighborhood No. 8.

  3. 3.

    The entry and exit points to the same numbered cities in Figs. 6 and 7 are different; for example, compare neighborhoods Nos. 8, 11 and 4.

For the third and final case, we change the problem “data set” with a random distribution of a large number of forbidden zones as shown in Fig. 8. Only the neighborhood marked “FNB” was not randomly selected; rather, its size and location were purposefully set as indicated in Fig. 8 to generate a more interesting case study.

Refer to caption

Figure 8: Solution to Problem (fastCEMTSPFN-2) for a new data set of nonconvex allowable and forbidden neighborhoods.

Also shown in Fig. 8 is a solution to the problem whose vehicle is constrained by (82). Beyond the apparent efficacy of the process in obtaining a solution, noteworthy points pertaining to Fig. 8 are:

  1. 1.

    Some of the forbidden neighborhoods are nonconvex. The nonconvex sets are generated randomly in the sense that the circular neighborhoods are allowed to overlap.

  2. 2.

    Some of the allowable neighborhoods are nonconvex because the forbidden circular disks overlap the allowable regions.

  3. 3.

    Comparing Figs. 8 and 7, it is clear that the addition of forbidden neighborhoods has drastically altered the sequence of visits.

7 Conclusions

Many types of objective functions and constraints in emerging variations of the traveling salesman problem (TSP) are more naturally defined in their continuous-time home space. Modeling these variants of the TSP using a classic discrete optimization framework is neither straightforward nor easy. By inverting the traditional modeling process, that is, by formulating the naturally discrete quantities in terms of continuous-time nonsmooth functions, it is possible to generate a new framework for the TSP and some of its variants. In this context, the discrete-optimization formalism of a TSP may be viewed as a problem formulation in the co-domain of the functionals that constitute its 𝒯\mathcal{T}-graph. In sharp contrast, the problem formulation presented in this paper is in the domain of the functionals of the TSP 𝒯\mathcal{T}-graph. This perspective suggests that the discrete-optimization- and the variational formulations are effectively two sides of the same 𝒯\mathcal{T}-graph constructs introduced in this paper.

There is no doubt that there are a vast number of open theoretical issues in the proposed framework. Nonetheless, we have demonstrated, by way of a fast, close-enough motorized TSP with forbidden neighborhoods, that it is indeed possible to solve some challenging problems using the new formalisms. One of the interesting revelations indicated by the numerical studies is the big impact of seemingly small changes in the motion-constraints of the traveling salesman. This suggests that an exploitation of nonlinear models may provide a discriminating edge to a business/engineering operation by way of a non-intuitive economical utilization of its end-to-end systems.

Acknowledgments

Funding from the Defense Advanced Research Projects Agency, the Center for Multi-INT Studies, and the Department of Defense (DoD) are gratefully acknowledged. The views, opinions and/or findings expressed are those of the authors and should not be interpreted as representing the official views or policies of the DoD or the U.S. Government.

Strong support from the Secretary of the Navy and United States Navy’s Judge Advocate General’s Office is gratefully acknowledged. Portions of this work are protected by the following patents issued by the United States Patent and Trademark Office: Patent Nos. US 9,983,585 B1; US 10,476,584 B1; and 62/971,068 (patent pending).

References

  • Aggarwal et al. [1999] Aggarwal A, Coppersmith D, Khanna, S, Motwani R, Schieber B (1999) The angular-metric traveling salesman problem. SIAM J. Comput. 29(3):697–711.
  • Arkin and Hassin [1994] Arkin EM, Hassin R (1994) Approximation algorithms for the goemetric covering salesman problem. Discrete Appl. Math. 55(3):197–218.
  • Chowdhury et al. [2019] Chowdhury S, Marufuzzaman M, Tunc H, Bian L, Bullington W (2019) A modified ant colony optimization algorithm to solve a dynamic traveling salesman problem: A case study with drones for wildlife surveillance. J. Comp. Design and Engr. 6(3):368–386.
  • Cook [2012] Cook WJ (2012) In Pursuit of the Traveling Salesman: Mathematics at the Limits of Computation (Princeton University Press, Princeton, NJ).
  • Dantzig, Fulkerson and Johnson [1954] Dantzig G, Fulkerson R, Johnson S (1954) Solution of a large-scale traveling-salesman problem. J. Oper. Res. Soc. of America 2(4):393–410.
  • Fischer and Hungerlander [2017] Fischer A, Hungerlander P (2017) The traveling salesman problem on grids with forbidden neighborhoods. J. Combinatorial Optimization 34(3):891–915.
  • Flood [1956] Flood MM (1956) The traveling-salesman problem. Oper. Res. 4(1):61–75.
  • Gentilini, Margot and Shimada [2013] Gentilini I, Margot F, Shimada K (2013) The travelling salesman problem with neighbourhoods: MINLP solution. Optimization Methods and Software 28(2): 364–378.
  • Gottlieb and Shima [2015] Gottlieb Y, Shima T (2015) UAVs task and motion planning in the presence of obstacles and prioritized targets. Sensors 15:29734–29764.
  • Gulczynski, Heath and Price [2006] Gulczynski DJ, Heath JW, Price CC (2006) The close enough traveling salesman problem: A discusion of several heuristics. Alt FB, Fu MC Golden BL ed. Perspectives in Op. Res.: Papers in Honor of Saul Gass’ 80th Birthday (Springer, Boston) 271–283
  • Gunawan, Hoong and Vansteenwegen [2016] Gunawan A, Hoong CL, Vansteenwegen P (2016) Orienteering Problem: A survey of recent variants, solution approaches and applications. European J. Op. Res. 255:315–332.
  • Gutin and Punnen [2007] Gutin G, Punnen AP (2007) The Traveling Salesman Problem and its Variations (Springer, New York)
  • Ma and Castanon [2006] Ma X, Castanon DA (2006) Receding horizon planning for Dubins traveling salesman problems. Proc. 45th IEEE CDC 5453–5458.
  • Psaraftis [1988] Psaraftis, HN (1988) Dynamic vehicle routing problems. B. Golden, A. Assad (Editors), Vehicle routing: methods and studies, (Elsevier Science Publishers BV) 223–248.
  • Miller, Tucker and Zemlin [1960] Miller CE, Tucker AW, Zemlin RA (1960) Integer programming formulations and traveling salesman problems. J. ACM 7, 326–329.
  • Nesterov [1983] Nesterov YE A method of solving a convex programming problem with convergence rate 𝒪(1/k2)\mathcal{O}(1/k^{2}), Soviet Math. Dokl., 27/2 371–376 (Translated by A. Rosa).
  • Polyak [1964] Polyak BT (1964) Some methods of speeding up the convergence of iteration methods, USSR Computational Math. and Math. Phys., 4/5, 1–17 (Translated by H. F. Cleaves).
  • Rana, Anand and Bose [2019] Rana J, Anand S, Bose S (2019) Optimal search strategy for finding transients in large-sky error regions under realistic constraints. The Astrophysical Journal 876(2):104.
  • Ross [2020] Ross IM (2020) Enhancements to the DIDO© optimal control toolbox. arXiv preprint. arXiv:2004.13112.
  • [20] Ross IM (2019) An optimal control theory for accelerated optimization. arXiv preprint arXiv:1902.09004.
  • Ross [2019] Ross IM (2019) An optimal control theory for nonlinear optimization. J. Computational and Appl. Math. 354:39–51.
  • [22] Ross IM (2015) A Primer on Pontryagin’s Principle in Optimal Control, Second Edition, Collegiate Publishers, San Francisco, CA.
  • Ross and Karpenko [2012] Ross IM, Karpenko M (2012) A review of pseudospectral optimal control: From theory to flight. Annual Reviews in Control 36:182–197.
  • Ross, Proulx and Karpenko [2019] Ross IM, Proulx RJ, Karpenko M (2019) Autonomous UAV sensor planning, scheduling and maneuvering: An obstacle engagement technique. ACC 65–70.
  • Schneider, Stenger and Goeke [2014] Schneider M, Stenger A, Goeke D (2014) The electric vehicle-routing problem with time windows and recharging stations. Transportation Sc. 48(4): 500–520.
  • Sciarretta, Back and Guzzella [2004] Sciarretta A, Back M, Guzzella L (2004) Optimal control of parallel hybrid electric vehicles. IEEE Tran. Control Sys. Tech. 12(3): 352–363.
  • von Stryk and Glocker [2001] von Stryk O, Glocker M (2001) Numerical mixed-integer optimal control and motorized traveling salesmen problems. J. Européen des Systèmes Automatisés 35(4): 519–533.
  • Vinter [2000] Vinter RB (2000) Optimal Control (Birkhäuser, Boston)
  • Vidal [2017] Vidal T (2017) Node, edge, arc routing and turn penalties: Multiple problems–one neighborhood extension. Op. Res. 65(4): 992–1010.
  • Wang, Golden and Wasil [2019] Wang X, Golden B, Wasil E (2019) A Steiner zone variable neighborhood search heuristic for the close-enough traveling salesman problem. Computers and Op. Res. 101: 200–219.
  • Witze [2018] Witze A (2018) Jupiter has 10 more moons we didn’t know about — and they’re weird. Nature 559:312–313.