This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Incentive Mechanism and Path Planning for UAV Hitching over Traffic Networks

Ziyi Lu, Na Yu and Xuehe Wang Z. Lu, N. Yu and X. Wang (corresponding author) are with the School of Artificial Intelligence, Sun Yat-sen University, Zhuhai 519082, China (E-mail: [email protected], [email protected], [email protected]). X. Wang is also with the Guangdong Key Laboratory of Big Data Analysis and Processing, Guangzhou 510006, China.
Abstract

Package delivery via the UAVs is a promising transport mode to provide efficient and green logistic services, especially in urban areas or complicated topography. However, the energy storage limit of the UAV makes it difficult to perform long-distance delivery tasks. In this paper, we propose a novel multimodal logistics framework, in which the UAVs can call on ground vehicles to provide hitch services to save their own energy and extend their delivery distance. This multimodal logistics framework is formulated as a two-stage model to jointly consider the incentive mechanism design for ground vehicles and path planning for UAVs. In Stage I, to deal with the motivations for ground vehicles to assist UAV delivery, a dynamic pricing scheme is proposed to best balance the vehicle response time and payments to ground vehicles. It shows that a higher price should be decided if the vehicle response time is long to encourage more vehicles to offer a ride. In Stage II, the task allocation and path planning of the UAVs over traffic network is studied based on the vehicle response time obtained in Stage I. To address pathfinding with restrictions and the performance degradation of the pathfinding algorithm due to the rising number of conflicts in multi-agent pathfinding, we propose the suboptimal conflict avoidance-based path search (CABPS) algorithm, which has polynomial time complexity. Finally, we validate our results via simulations. It is shown that our approach is able to increase the success rate of UAV package delivery. Moreover, we estimate the delivery time of the UAV in a pessimistic case, it is still twice as fast as the delivery time of the ground vehicle only.

Index Terms:
Crowdsourcing, UAV hitching, Minimal-connecting tours, Conflict avoidance.

I Introduction

I-A Background

In recent years, the continuous expansion of the e-commerce market has promoted the rapid development of the express logistics industry [1]. Especially during the epidemic, people’s living habits have changed dramatically in order to maintain social distance. New logistics service models have emerged in order to avoid face-to-face contact, such as non-contact delivery [2]. However, the rapid development of urban logistics has also brought a series of problems, such as traffic congestion, air pollution, low logistics efficiency [3]. Due to its agility and mobility, the delivery services enabled by the Unmanned Aerial Vehicle (UAV) are not restricted by terrain and traffic conditions, which can freely pass through urban region during rush hours to provide logistics services flexibly and efficiently. Many companies all over the world have launched commercial UAV delivery services, such as Amazon, Google, and UPS in the United States, DHL in Germany, and JD, SF-express in China [4]. However, due to the limitation of UAV energy capacity, current UAV logistics services are limited to short-distance delivery [5]. Therefore, how to expand the range of UAV delivery services is an urgent problem to be solved.

With the development of new technologies such as 5G, Internet of Things (IoT), and Artificial Intelligence (AI), crowdsourcing, as a mode of using large-scale network users to assist in specific tasks, has received more and more attention from different application platforms, and has been widely used in various scenarios, such as food delivery riders, car sharing, and data cleaning, etc [6]. Inspired by the crowdsourcing model, the ground vehicles such as trucks can be utilized to offer riding for the UAVs, which greatly saves the battery consumption of UAVs and extend their delivery distance. Amazon has proposed a UAV-truck riding strategy to allow UAVs to land on transportation vehicles from different shipping companies for temporary transport, by making an agreement with the owner of the transportation vehicles [7]. In addition, the technology of docking UAVs on stationary or high-speed moving vehicles is relatively mature [8, 9], which lays the foundation for the development of air-ground collaborative multimodal logistics system.

However, this new multimodal logistics system based on crowdsourcing faces the following challenges:

  • Incentives for ground vehicles’ cooperation. The provision of hitch services by ground vehicles is the basis for the implementation of multimodal logistics systems. However, the ground vehicles incur hitching costs (e.g., fuel consumption) when offer riding service, and they should be rewarded and well motivated to assist UAV delivery. Moreover, the ground vehicles are heterogeneous in nature. They will randomly arrive the targeting interchange point and their private costs for offering riding service are different and unknown. Thus, the pricing design under the incomplete vehicle information is challenging. In addition, a higher monetary reward leads to a shorter vehicle response time (i.e., UAV waiting time), which improves the delivery efficiency of multimodal logistics system. For sustainable management of the multimodal logistics system, the pricing strategy should be dynamic to best balance the payment to ground vehicles and their response time.

  • Crowdsourcing-based multimodal logistics system modeling under time-varying traffic networks and UAV energy constraint. By integrating the UAVs with the ground vehicles, the UAVs should decide where to hitch on ground vehicles and whether hopping between different vehicles to complete one delivery task, which creates large number of interchange points on the basis of the road network for path planning. Moreover, due to the mobility of ground vehicles and the instability of traffic conditions, the modeling of the integrated system is challenging.

  • Design of fast and efficient task allocation and path planning algorithms for the multimodal logistics system. The ground vehicles’ arrivals and response time in the vicinity of the UAVs at a certain time are unknown. In addition, the path planning of the UAVs should be carefully designed to prevent conflicts between UAVs, such as multiple UAVs landing at the same vehicle at the same time. Such combinatorial optimization problems are often NP-hard problems and challenging to solve.

I-B Main Contributions

To tackle the above challenges, a two-stage model is proposed to jointly analyze the incentive mechanism design for ground vehicles and path planning for UAVs. We summarize our main contributions as follows:

  • Crowdsourcing-based multimodal logistics system modeling. To our best knowledge, this paper is the first work studying the crowdsourcing-based multimodal logistics system, in which the ground vehicles are invited to offer hitching services for UAVs. Such multimodal logistics system effectively extends the range and efficiency of UAV deliveries. We formulate this system as a two-stage model to jointly study the incentive mechanism design for ground vehicles and path finding for UAVs. In Stage I, a dynamic pricing scheme is proposed to motivate the ground vehicles to assist UAV delivery under the incomplete information about the ground vehicles’ random arrivals and private hitching costs. In Stage II, the task allocation and path planning of the UAVs over traffic network by considering the UAV energy constraint and the conflicts between UAVs.

  • Dynamic pricing for ground vehicle under incomplete information. To balance the payment to ground vehicles and their response time for efficient delivery, a optimal dynamic pricing scheme is proposed under incomplete information case regarding ground vehicles’ random arrivals and private hitching costs. We prove that if the vehicle response time is long, a high price offer should be decided to encourage the ground vehicles to assist delivery. According to the proposed dynamic pricing, the vehicle response time for each interchange station can be predicted and utilized for the path planning in Stage II.

  • Algorithm Design for fast and efficient task allocation and path planning. To reduce the computation complexity, the deployment of UAV delivery tasks is split into two layers. In task allocation layer, we propose a near-optimal algorithm with polynomial computation complexity to generate the delivery sequence. In path planning layer, to avoid the computational burden due to the order-of-magnitude increase in conflicts, we propose a conflict avoidance-based path search algorithm and prove that the bounded suboptimal paths for all UAVs can be obtained in polynomial time.

I-C Related work

The access path planning problem for UAVs can be viewed as the Traveling Salesman Problem (TSP) [10, 11]. This type of combinatorial optimization problem is a classical NP-hard problem. Once the order of package delivery for all UAVs is found, the system begins to find the optimal delivery path for each UAV. Classical algorithms such as A\text{A}* or Dijkstra for solving the shortest path are often used to solve the optimal UAV trajectory problem [12, 13]. In addition, control theory is also utilized for modeling and solving UAV path planning problems [14, 15]. The constraints of UAV energy storage and conflicts between UAVs make the UAV path planning problem more challenging, which is NP-hard to get the optimal solution. Some methods such as Conflict-based search (CBS) [16], have been shown to be efficient methods to solve this Multi-Agent Path Finding (MAPF) problem.

Utilizing the ground vehicles to extend the UAVs’ delivery range has attracted wide attentions from both academia and industry. The researchers in Stanford University have shown that combining UAVs with the existing transit network can increase the delivery coverage of UAVs by up to 360%360\% [17]. However, using buses for UAV delivery lacks fault tolerance. For example, the bus schedules may not be accurate. If the UAV happens to miss the arrival time of the current bus, the UAV needs to wait longer for the next bus to arrive, or has to change its route, which may cause the delivery task to fail. In the crowdsourcing-based system considered in this paper, the request is sent to the ground vehicle at the time when the UAV needs the hitching service and any passing-by ground vehicle along the UAV’s routes can be the UAV carrier, so there is no case of missing a vehicle. E-commerce giant Amazon has proposed in its patent that commercial vans from different operators can be used to assist UAVs in delivery services [7]. However, this is also based on the pre-signed agreement with the van operators. The number of available commercial vans is limited and cannot cover the entire transportation network, which will greatly reduce the efficiency of UAV delivery. In this paper, the ground vehicles (such as trucks, buses) with wide coverage are added to the logistics network based on crowdsourcing technology, which helps keep the waiting time for UAVs to a minimum.

The rest of this paper is organized as follows. The system model is given in Section II, in which the multimodal logistics system is formulated as two stages. In Section III, the incentive mechanism for the crowdsourcing-based hitching services is designed in Stage I. Sections IV and V discuss the task allocation and multi-agent path finding for the UAVs in Stage II. Section VI validates our results via simulations. Section VII concludes this paper.

II System Model

In this paper, we aim to solve the UAV’s limited flight range problem caused by its energy constraint in the process of delivery. We propose an efficient multimodal logistics system to improve the delivery range of UAVs, in which a crowdsourcing mechanism is implemented by inviting the ground vehicles (trucks, buses, etc.) to offer the hitch service for UAVs.

II-A Problem

Consider a logistics system with NN UAVs performing delivery tasks to MM known delivery addresses from KK depots. Denote the set of MM delivery addresses as VG={g1,,gM}2V_{G}=\{g_{1},\dots,g_{M}\}\subset\mathbb{R}^{2} and the set of KK depot locations as VD={d1,,dK}2V_{D}=\{d_{1},\dots,d_{K}\}\subset\mathbb{R}^{2}. We assume that any package can be dispatched from any depot and the depot can also charge or change the battery of the UAV quickly. When a UAV delivers a package from a depot to a delivery location, the UAV can call a vehicle on the ground to provide a hitch service, which significantly extends the UAV’s delivery range. In order to facilitate management, we specify that the UAV can stop and wait for ground vehicles only at specific interchange points equipped with surveillance cameras for safety’s sake. We generate LL potential interchange routes on the traffic network starting and ending at certain interchange points, where the UAV can wait for ground vehicles while performing delivery tasks. The set of interchange points is denoted as VI={i1,,iL}2V_{I}=\{i_{1},\dots,i_{L^{{}^{\prime}}}\}\subset\mathbb{R}^{2} (LLL^{{}^{\prime}}\leq L).

In the crowdsourcing-based multimodal logistics system, the goal is to (i) design the optimal dynamic pricing to deal with the tradeoff between the payment to ground vehicles and UAVs’ waiting time; (ii) find the delivery path of each UAV to deliver all packages in a reasonable combination of UAVs’ own flight path and transit routes on the traffic network, while minimizing the maximum delivery time for each UAV without exceeding the storage capacity limit of the UAV.

Refer to caption

Figure 1: Two-stages multimodal logistics system: In stage I, the incentive mechanism for the crowdsourcing-based hitching services is designed, and the expected vehicle response time is predicted for the following path finding algorithm design. In stage II, the path planning problem is decomposed into two layers: first assign delivery tasks to all UAVs in Layer 1, and then find the shortest time-consuming path for each UAV on the basis of the vehicle response time in Layer 2. In the UAVs path finding section, we use the solid point to represent the interchange point, the dotted line to represent the flight route of UAVs and the solid line to represent the transit route of UAVs.

II-B Approach

In a crowdsourcing-based air-ground collaborative logistics system, the behavioral decisions of the ground vehicles will affect the UAV transportation efficiency. If the UAV waits long time for the ground vehicles’ response to assist in transportation, the passage time of the corresponding road section will be prolonged, which will lead to the delay of UAV mission completion time. Therefore, it is crucial to design an effective crowdsourcing incentive mechanism to encourage the ground vehicles to assist delivery. The passage time of each road section will also influence the design of subsequent task allocation and path planning algorithms. However, higher incentives can shorten the UAV waiting time, but cause an increase in UAV platform’s cost. Therefore, we will design a dynamic pricing scheme to best balance the reward expenditure to ground vehicles and the expected vehicle response time of each road section.

Completing all package delivery requests means that each package needs to be assigned to one and only one UAV. So this problem can be viewed as a variation of the traveler problem. Because of the limitations of UAV carrying capacity and energy storage, we specify that a UAV can only deliver one package at a time and return to some depot for charging after completing the delivery task (charging time for UAVs is ignored here). The delivery time of a package by a UAV consists of the UAV flight time and the ride time including the vehicle response time (i.e., UAV waiting time) and the vehicle traveling time when carrying the UAV. We describe the energy storage limit of each UAV as a maximum flight time constraint, and each UAV must not exceed the maximum flight time when performing its delivery tasks. In addition, to avoid collisions between UAVs, multiple UAVs can’t land at the same interchange point at the same time. The above joint optimization problem for UAV-vehicle path planning can be modeled as mixed integer linear programming. However, most of the existing mixed integer linear programming algorithms can only handle small-scale models, which are not suitable for large number of UAVs/packages and vast traffic network scenarios. We decompose the joint path planning problem into two layers of decision models for discussion, and design algorithms with low computational complexity at each layer, respectively.

We formulate our multimodal logistics system as two stages as shown in Fig. 1. In stage I, we predict the vehicle response time at each interchange point based on our designed dynamic pricing scheme, and update this information into the time weights of the interchange routes in the traffic network for the following path planning. In Stage II, we first analyze the task allocation of UAVs to solve the problem of which UAV delivers which packages and the order of delivery between packages, and then the path finding for each delivery subtask. To be specific, for task allocation in Layer 1, we temporarily ignore the specific route of the UAV from the depot to the package delivery address. We only decide which packages are sent from which depot and which UAV is responsible for which series of packages to be delivered, depending on the estimated time (the shortest path from a certain depot to the destination) between the depot and delivery location. We use linear programming algorithms with relaxation conditions that accomplish the task allocation problem in polynomial time. For path finding in Layer 2, we take one UAV to deliver a package from a depot and then return to a depot as a subtask, and perform route planning for the UAV fleet to execute the allocated delivery subtask. To avoid collisions between multiple UAVs, only a limited number of UAVs are allowed to stay at each interchange. We adopt the idea of CBS to design an efficient MAPF algorithm called conflict avoidance-based path search (CABPS), i.e., each UAV jointly maintains the same list of interchange occupancy information and performs its own path planning based on this constraint.

II-C Framework And Workflow

In Stage I, we motivate the ground vehicles to offer riding for UAVs by proper incentive mechanism design, e.g., monetary reward. However, the ground vehicles are mobile and their private costs for carrying the UAVs for certain distance is unknown. Under this circumstance, the waiting time for the UAVs to call for ground vehicles at different interchange points at different time is variable and unknown. In the following, we design a dynamic pricing scheme to balance the response time of ground vehicles and the monetary payment under the incomplete information about the ground vehicles’ random arrivals at certain interchange point and private carrying costs. On the basis of the designed dynamic pricing scheme, the expected response time of ground vehicles at each interchange can be generated, which will be taken into consideration when performing path planning for each UAV.

Algorithm 1 UAVTaskAllocationPathPlanning.
0:  Network Graph GTG_{T}; Location of depots VDV_{D}; Location of packages VGV_{G}; Number of UAV NN; Waiting time for each interchange point WW;
0:  Paths of each UAV PP;
1:  Initiate constraints on interchange points IcI_{c}; Paths of each UAV PP; Each UAV delivery order OTaskAllocation(GT,VD,VG,N)O\leftarrow\text{TaskAllocation}(G_{T},V_{D},V_{G},N); i0i\leftarrow 0; Sum of delivered packages s0s\leftarrow 0;
2:  while s<|O|s<|O| do
3:     for each UAV order n=1:Nn=1:N in OO do
4:        PniCABPS(GT,Oni,W,Ic)P_{n}^{i}\leftarrow\text{CABPS}(G_{T},O_{n}^{i},W,I_{c});
5:        update constraints on interchange points IcI_{c} based on current path of UAV;
6:        ss+1s\leftarrow s+1;
7:     end for
8:     ii+1i\leftarrow i+1;
9:  end while
10:  return  PP

In Stage II, we propose a comprehensive algorithm UAV-Task-Allocation-Path-Planning for building an efficient air-ground cooperative multimodal logistics system. It guarantees a near-optimal solution in polynomial time. Our goal is to plan delivery paths for NN UAVs after accepting requests for MM packages.

The detailed design is given in Algorithm 1. Denote the traffic network graph as GT=(VT,ET)G_{T}=(V_{T},E_{T}), where the vertex set VT=VGVDVIV_{T}=V_{G}\cup V_{D}\cup V_{I}, and each directed edge (u,v)(u,v) for every u,vVTu,v\in V_{T} contains a weight wuvw_{uv} denotes the passage time from uu to vv and an attribute auva_{uv} denotes the power consumption of the UAV from uu to vv. We divide the directed edges (u,v)(u,v) into two types, one is the flight edge, which indicates that the UAV flies from uu to vv. The other is the transit edge, which indicates that the UAV hitches a ground vehicle from uu to vv. It is worth mentioning that if the directed edge (u,v)(u,v) is a transit edge, its power consumption auva_{uv} is 0. Relatively, it has a longer passage time wuvw_{uv} (the passage time from uu to vv includes the UAV’s waiting time and the carrying time from uu to vv) than the flight edge.

Layer 1 (line 1): Task Allocation is the first layer of our algorithm, which takes the traffic network graph GTG_{T}, the set of depot location VDV_{D}, the set of package locations VGV_{G}, and the number of UAVs NN as inputs, returns the set of UAV paths OO, where OniO_{n}^{i} indicates the ithi^{th} delivery subtask for the nthn^{th} UAV. Each subtask OniO_{n}^{i} consists of three values: starting depot, package address and returning depot.

Layer 2 (lines 2-10): The second layer Multi-Agent-Path Finding uses the Conflict Avoidance Based Path Search (CABPS) algorithm to calculate the path for each UAV separately. CABPS takes the ithi^{th} subtask OniO_{n}^{i} delivered by the nthn^{th} UAV, the current waiting time estimate for each interchange and the occupancy time of each interchange as inputs, it gives the specific delivery path PniP_{n}^{i} for each UAV’s subtask OniO_{n}^{i}.

Our system eventually returns the set of delivery path PP for all UAVs. PniP_{n}^{i} refers to the path planned by the nthn^{th} UAV performing its own ithi^{th} delivery subtask, and we use Time(Pn)\text{Time}({P_{n}}) to indicate the total time taken by the nthn^{th} UAV to complete all its delivery tasks. The purpose of our algorithm is to minimize maxnNTime(Pn){\max}_{n\in N}\text{Time}({P_{n}}). Note that in the second layer of the algorithm, the paths for all delivery subtasks of the UAVs are not computed simultaneously. The delivery subtask of each UAV is computed successively once the previous subtask is accomplished, by considering the conflicts between UAVs. In the following sections, we will describe the design of the dynamic pricing mechanism and the two-layer algorithms in more detail.

III Stage I: Mechanism Design for Ground Vehicle Crowdsourcing

In this section, we will discuss the incentive mechanism design for crowdsourcing at a typical interchange point. Consider a discrete time horizon with time slot t=0,,Twt=0,...,T_{w}. When the UAV arrives at the interchange station, it sends the request and pricing reward to nearby vehicles. We use s(t)=1s(t)=1 to denote appropriate crowdsourcing vehicles’ (such as trucks, buses, etc.) arrivals at certain interchange point at time tt, and s(t)=0s(t)=0 otherwise. The probability of s(t)=1s(t)=1 can be viewed as the traffic density α\alpha (e.g., number of vehicles per meter, veh/mveh/m) at this interchange point. The vehicle decides whether to accept the current price p(t)p(t) and provide hitching service according to its private cost c[0,b]c\in[0,b] for carrying the UAVs, where the upper bound bb is estimated from historical data. In this paper, we consider non-trivial traffic density α\alpha. Otherwise, the UAV platform should always set the price p(t)p(t) to be the upperbound bb to encourage the rarely arriving ground vehicles to provide hitching services.

The vehicle response time W(t)W(t) at time tt starting from the moment the UAV sent hitching requests (i.e., t=0t=0) is affected by the traffic density α\alpha and the dynamic pricing p(t)p(t). If a ground vehicle appears at time tt (i.e., s(t)=1s(t)=1) and its carrying cost is less than the offered price p(t)p(t) (i.e., cp(t)c\leq p(t)), the vehicle will provide hitching service. Otherwise, the vehicle response time W(t+1)W(t+1) at time t+1t+1 increases from W(t)W(t) to W(t)+1W(t)+1. Therefore, starting from time slot 0 with initial UAV waiting time W(0)=0W(0)=0, the dynamics of the vehicle response time is as follows:

W(t+1)={W(t),cp(t)&s(t)=1,W(t)+1, otherwise. {W}(t+1)=\left\{\begin{aligned} &{W}(t),\qquad&c\leq p(t)\&s(t)=1,\\ &{W}(t)+1,&\text{ otherwise. }\end{aligned}\right. (1)

According to the Cumulative Distribution Function F(c)F(c) of the vehicles’ private costs and the traffic density α\alpha at the interchange point, the probability of crowdsourcing vehicles accepting hitching service at time slot tt is αF(p(t))\alpha F(p(t)). Therefore, at time slot t+1t+1, the conditional expectation of the vehicle response time is:

E[W(t+1)W(t);p(t),α]\displaystyle\mathrm{E}[W(t+1)\mid W(t);p(t),\alpha] =W(t)αF(p(t))+(W(t)+1)\displaystyle=W(t)\alpha F(p(t))+(W(t)+1) (2)
(1αF(p(t)))\displaystyle\qquad(1-\alpha F(p(t)))
=W(t)+1αF(p(t)).\displaystyle=W(t)+1-\alpha F(p(t)).

Denote W¯(t+1):=E[W(t+1)W(t);p(t),α]\bar{W}(t+1):=\mathrm{E}[W(t+1)\mid W(t);p(t),\alpha]. Thus, the dynamics of the expected response time can be obtained as follows:

W¯(t+1)=W¯(t)+1αF(p(t)).\bar{W}(t+1)=\bar{W}(t)+1-\alpha F(p(t)). (3)

At each time slot tt, the expected payment to the ground vehicle is p(t)αF(p(t))p(t)\alpha F(p(t)). In order to maximize the benefits of the platform, the dynamic price p(t)p(t) should not exceed the maximum private cost bb of the vehicle. Consider the uniform CDF of vehicles’ private costs, i.e., F(p(t))=p(t)/bF(p(t))=p(t)/b. The goal of the UAV platform is to design dynamic prices p(t)[0,b]p(t)\in[0,b], t{0,,Tw}t\in\{0,\ldots,T_{w}\} to minimize the summation of total expected payment and vehicle response time:

U(T)=minp(t)[0,b],t{0,,Tw}t=0Twρt(W¯2(t)+αp2(t)b),U(T)=\min_{p(t)\in[0,b],t\in\{0,\ldots,T_{w}\}}\sum_{t=0}^{T_{w}}\rho^{t}\left(\bar{W}^{2}(t)+\frac{\alpha p^{2}(t)}{b}\right), (4)
 s.t. W¯(t+1)=W¯(t)+(1αp(t)b).\text{ s.t. }\bar{W}(t+1)=\bar{W}(t)+\left(1-\alpha\frac{p(t)}{b}\right). (5)

where ρ(0,1)\rho\in(0,1) is the discount factor.

The above dynamic programming problem is not easy to solve by considering the huge number of price combinations over time. In the following, we will analyze the optimal solution to this dynamic problem by using dynamic control techniques.

Proposition III.1

The optimal dynamic pricing p(t),t{0,,Tw}p(t),t\in\{0,\ldots,T_{w}\} as the optimal solution to the dynamic program (4)-(5) is monotonically increasing in W¯(t)\bar{W}(t) and given by

Refer to caption

Figure 2: Optimal dynamic pricing p(t)p(t) versus time tt with α=1\alpha=1 over Tw=100T_{w}=100 time slots.

Refer to caption

Figure 3: Vehicle response time (UAV waiting time) W¯(t)\bar{W}(t) versus time tt with α=1\alpha=1 over Tw=100T_{w}=100 time slots.
p(t)=ρMt+1+2ρQt+1(W¯(t)+1)2+2ρQt+1αb,p(t)=\frac{\rho M_{t+1}+2\rho Q_{t+1}(\bar{W}(t)+1)}{2+2\rho Q_{t+1}\frac{\alpha}{b}}, (6)

with p(Tw)=0p(T_{w})=0, and the vehicle response time W¯(t)\bar{W}(t) at time t,t[1,Tw]t,t\in\left[1,T_{w}\right] is

W¯(t)=2ρMtαb2+2ρQtαb+s=1t12ρMsαb2+2ρQsαbi=s+1t11+ρQiαb,\bar{W}(t)=\frac{2-\rho M_{t}\frac{\alpha}{b}}{2+2\rho Q_{t}\frac{\alpha}{b}}+\sum_{s=1}^{t-1}\frac{2-\rho M_{s}\frac{\alpha}{b}}{2+2\rho Q_{s}\frac{\alpha}{b}}\prod_{i=s+1}^{t}\frac{1}{1+\rho Q_{i}\frac{\alpha}{b}}, (7)

where

Qt=1+ρQt+11+ρQt+1αb,Q_{t}=1+\frac{\rho Q_{t+1}}{1+\rho Q_{t+1}\frac{\alpha}{b}}, (8)
Mt=ρ(Mt+1+2Qt+1)1+ρQt+1αb,M_{t}=\frac{\rho\left(M_{t+1}+2Q_{t+1}\right)}{1+\rho Q_{t+1}\frac{\alpha}{b}}, (9)

with W¯(0)=0\bar{W}(0)=0, QTw=1Q_{T_{w}}=1, MTw=0M_{T_{w}}=0 on the boundary.

Proof: Denote the cost objective function from initial time tt as

J(p,t)=s=tTwρst(W¯2(s)+αbp2(s)),J(p,t)=\sum_{s=t}^{T_{w}}\rho^{s-t}\left(\bar{W}^{2}(s)+\frac{\alpha}{b}p^{2}(s)\right), (10)

and the value function given the initial response time W¯(t)\bar{W}(t) as

V(W¯(t),t)=min{p(s)[0,b]}s=tTw(J(p,t)W¯(t)).V(\bar{W}(t),t)=\min_{\{p(s)\in[0,b]\}_{s=t}^{T_{w}}}(J(p,t)\mid\bar{W}(t)). (11)

Then, we have the dynamic programming equation at time tt:

V(W¯(t),t)=minp(t)[0,b]\displaystyle V(\bar{W}(t),t)=\min_{p(t)\in[0,b]} (W¯2(t)+αbp2(t))\displaystyle\left(\bar{W}^{2}(t)+\frac{\alpha}{b}p^{2}(t)\right) (12)
+ρV(W¯(t+1),t+1),\displaystyle+\rho V(\bar{W}(t+1),t+1),
 s.t. W¯(t+1)=W¯(t)+(1αp(t)b).\text{ s.t. }\bar{W}(t+1)=\bar{W}(t)+\left(1-\alpha\frac{p(t)}{b}\right). (5)

According to the first-order condition V(W¯(t),t)p(t)=0\frac{\partial V(\bar{W}(t),t)}{\partial p(t)}=0 when solving (12) in the backward induction process, we notice that p(t)p(t) is a linear function of W¯(t)\bar{W}(t). Thus, the value function in (12) should follow the following quadratic structure with respect to W¯(t)\bar{W}(t) :

V(W¯(t),t)=QtW¯2(t)+MtW¯(t)+St,V(\bar{W}(t),t)=Q_{t}\bar{W}^{2}(t)+M_{t}\bar{W}(t)+S_{t}, (13)

yet we still need to determine Qt,Mt,StQ_{t},M_{t},S_{t}. This will be accomplished by finding the recursion in the following.

First, we have QTw=1,MTw=0,STw=0Q_{T_{w}}=1,M_{T_{w}}=0,S_{T_{w}}=0 on the boundary due to V(W¯(Tw),Tw)=W¯2(Tw)V(\bar{W}(T_{w}),T_{w})=\bar{W}^{2}(T_{w}). Given V(W¯(t+1),t+1)=Qt+1W¯2(t+1)+Mt+1W¯(t+1)+St+1V(\bar{W}(t+1),t+1)=Q_{t+1}\bar{W}^{2}(t+1)+M_{t+1}\bar{W}(t+1)+S_{t+1} as in (13), the dynamic programming equation at time tt is

V(W¯(t),t)=\displaystyle V(\bar{W}(t),t)= minp(t)(W¯2(t)+αbp2(t)+ρQt+1W¯2(t+1)\displaystyle\min_{p(t)}\left(\bar{W}^{2}(t)+\frac{\alpha}{b}p^{2}(t)+\rho Q_{t+1}\bar{W}^{2}(t+1)\right. (14)
+ρMt+1W¯(t+1)+ρSt+1).\displaystyle\left.+\rho M_{t+1}\bar{W}(t+1)+\rho S_{t+1}\right).

Substitute W¯(t+1)\bar{W}(t+1) in (5) into (14) and let V(W¯(t),t)p(t)=0\frac{\partial V(\bar{W}(t),t)}{\partial p(t)}=0, we obtain the optimal price p(t)p(t) in (6). Then, we substitute p(t)p(t) in (6) into V(W¯(t),t)V(\bar{W}(t),t) in (14), and obtain V(W¯(t),t)V(\bar{W}(t),t) as a function of Qt+1,Mt+1,St+1Q_{t+1},M_{t+1},S_{t+1} and W¯(t)\bar{W}(t). Finally, by reformulating V(W¯(t),t)V(\bar{W}(t),t) in (14) and noting that V(W¯(t),t)=QtW¯2(t)+MtW¯(t)+StV(\bar{W}(t),t)=Q_{t}\bar{W}^{2}(t)+M_{t}\bar{W}(t)+S_{t} , we obtain the recursive functions of QtQ_{t} and MtM_{t} in (8) and (9). Substitute p(t)p(t) in (6) into (5), we obtain the expected age W¯(t)\bar{W}(t) in (7).\hfill\blacksquare

Figs. 2 and 3 simulate the system dynamics over time under the optimal dynamic price p(t)p(t) in (6). We can see that p(t)p(t) is low initially, allowing a modest increase in the response time to save the UAV platform’s costs. The price p(t)p(t) then increases with the increased response time to encourage the ground vehicles’ help until both of them reach steady states. This is consistent with the monotonically increasing relationship between p(t)p(t) and W¯(t)\bar{W}(t) in (6). However, when approaching the end of the time horizon Tw=100T_{w}=100, the price p(t)p(t) drops to 0 without worrying about its effect on the future waiting time. As a result, the vehicle response time W¯(t)\bar{W}(t) increases again but lasts only a few time slots.

Lemma III.1

As TwT_{w}\rightarrow\infty, QtQ_{t} in (8) and MtM_{t} in (9) respectively converge to the following steady-states:

Q=12(1b(1ρ)ρα+(1b(1ρ)ρα)2+4bρα),Q=\frac{1}{2}\left(1-\frac{b(1-\rho)}{\rho\alpha}\right.\\ \left.+\sqrt{\left(1-\frac{b(1-\rho)}{\rho\alpha}\right)^{2}+\frac{4b}{\rho\alpha}}\right), (15)
M=2ρQ1ρ+ρQαb.M=\frac{2\rho Q}{1-\rho+\rho Q\frac{\alpha}{b}}. (16)

Proof: Starting from the boundary condition QTw=1,MTw=0Q_{T_{w}}=1,M_{T_{w}}=0, according to Qt=1+ρQt+11+ρQt+1αbQ_{t}=1+\frac{\rho Q_{t+1}}{1+\rho Q_{t+1}\frac{\alpha}{b}} in (8) and Mt=ρ(Mt+1+2Qt+1)1+ρQt+1αbM_{t}=\frac{\rho\left(M_{t+1}+2Q_{t+1}\right)}{1+\rho Q_{t+1}\frac{\alpha}{b}} in (9), we can obtain QtQ_{t} and MtM_{t} backward given Qt+1bQ_{t+1}^{b} and Mt+1M_{t+1} at next time slot. Rewrite QtQ_{t} above as

Qt=1+ρ1Qt+1+ραb.Q_{t}=1+\frac{\rho}{\frac{1}{Q_{t+1}}+\rho\frac{\alpha}{b}}. (17)

Starting from QTw=1Q_{T_{w}}=1 and note that ρ1Qt+1+ραb>0\frac{\rho}{\frac{1}{Q_{t+1}}+\rho\frac{\alpha}{b}}>0, we have QTw1>1>QTwQ_{T_{w}-1}>1>Q_{T_{w}}. Note that (17) is increasing with Qt+1Q_{t+1} and limQt+1=1+bα<+\lim_{Q_{t+1}\rightarrow\infty}=1+\frac{b}{\alpha}<+\infty , we can conclude that {QTw,QTw1,,Q0}\left\{Q_{T_{w}},Q_{T_{w}-1},\ldots,Q_{0}\right\} is bounded increasing sequence in the reverse time order and will converge to the steady-state QQ , which can be solved as (15) by removing the time subscripts from (8).

By removing the time subscripts from (9), the stead-ystate MM is solved as M=2ρQ1ρ+ρQαbM=\frac{2\rho Q}{1-\rho+\rho Q\frac{\alpha}{b}} in (16). In the following, we will show that {MTw,MTw1b,,M0}\left\{M_{T_{w}},M_{T_{w}-1}^{b},\ldots,M_{0}\right\} is bounded increasing sequence in the reverse time order and converges to MM. According to Mt=ρ(Mt+1+2Qt+1)1+ρQt+1αbM_{t}=\frac{\rho\left(M_{t+1}+2Q_{t+1}\right)}{1+\rho Q_{t+1}\frac{\alpha}{b}} in (9), define y(Qt+1)y\left(Q_{t+1}\right) as the right-hand side of (9) given the steady-state MM, i.e.,

y(Qt+1)=ρ(M+2Qt+1)1+ρQt+1αb.y\left(Q_{t+1}\right)=\frac{\rho\left(M+2Q_{t+1}\right)}{1+\rho Q_{t+1}\frac{\alpha}{b}}. (18)

Refer to caption

Figure 4: QtQ_{t} in (8) and MtM_{t} in (9) versus time tt with Tw=100T_{w}=100 time slots, b=2b=2,α\alpha=1,ρ\rho=0.9.

Take the derivative of y(Qt+1)y\left(Q_{t+1}\right) with respect to Qt+1Q_{t+1}, we have

dy(Qt+1)dQt+1=2ρMρ2αb(1+ρQt+1αb)2.\frac{dy\left(Q_{t+1}\right)}{dQ_{t+1}}=\frac{2\rho-M\rho^{2}\frac{\alpha}{b}}{\left(1+\rho Q_{t+1}\frac{\alpha}{b}\right)^{2}}. (19)

According to M=2ρQ1ρ+ρQαbM=\frac{2\rho Q}{1-\rho+\rho Q\frac{\alpha}{b}}, we have Mραb=2ρQραb1ρ+ρQαb=2ρ1ρQραb+1<2M\rho\frac{\alpha}{b}=\frac{2\rho Q\rho\frac{\alpha}{b}}{1-\rho+\rho Q\frac{\alpha}{b}}=\frac{2\rho}{\frac{1-\rho}{Q\rho\frac{\alpha}{b}}+1}<2. Thus,dy(Qt+1)dQt+1>0\frac{dy\left(Q_{t+1}\right)}{dQ_{t+1}}>0 and y(Qt+1)y\left(Q_{t+1}\right) increases with Qt+1Q_{t+1}. Since {QTw,QTw1,,Q0}\left\{Q_{T_{w}},Q_{T_{w}-1},\ldots,Q_{0}\right\} is an increasing sequence and converges to QQ, y(QTw)<y(QTw1)<<y(Q)y\left(Q_{T_{w}}\right)<y\left(Q_{T_{w}-1}\right)<\cdots<y(Q).

Starting from MTw=0M_{T_{w}}=0, we have MTw1=2ρ1+ραb>0=MTwM_{T_{w}-1}=\frac{2\rho}{1+\rho\frac{\alpha}{b}}>0=M_{T_{w}}. Since 2ρQt+11+ρQt+1αb\frac{2\rho Q_{t+1}}{1+\rho Q_{t+1}\frac{\alpha}{b}} increases with Qt+1Q_{t+1}, the equation set {Mt=ρ(Mt+1+2Qt+1)1+ρQt+1αb}\left\{M_{t}=\frac{\rho\left(M_{t+1}+2Q_{t+1}\right)}{1+\rho Q_{t+1}\frac{\alpha}{b}}\right\} intersects with the vertical axis with 2ρQTw11+ρQTw1αb2ρQTw21+ρQTw2αb2ρQTw41+ρQTw4αb\frac{2\rho Q_{T_{w}-1}}{1+\rho Q_{T_{w}-1}\frac{\alpha}{b}}\leq\frac{2\rho Q_{T_{w}-2}}{1+\rho Q_{T_{w}-2}\frac{\alpha}{b}}\leq\cdots\leq\frac{2\rho Q_{T_{w}-4}}{1+\rho Q_{T_{w}-4}\frac{\alpha}{b}}. The slope of each equation satisfies 0<ρ1+ρQt+1αb<10<\frac{\rho}{1+\rho Q_{t+1}\frac{\alpha}{b}}<1. Therefore, starting from MTw1>0M_{T_{w}-1}>0, we have MTw2>MTw1M_{T_{w}-2}>M_{T_{w}-1} based on MTw2=ρ(MTw1+2QTw1)1+ρQTw1αbM_{T_{w}-2}=\frac{\rho\left(M_{T_{w}-1}+2Q_{T_{w}-1}\right)}{1+\rho Q_{T_{w}-1}\frac{\alpha}{b}}. Then, from MTw2M_{T_{w}-2}, we have MTw3>MTw2M_{T_{w}-3}>M_{T_{w}-2} based on MTw3=ρ(MTw3+2QTw3)1+ρQTw3αbM_{T_{w}-3}=\frac{\rho\left(M_{T_{w}-3}+2Q_{T_{w}-3}\right)}{1+\rho Q_{T_{w}-3}\frac{\alpha}{b}}. Thus, we can obtainMTw<MTw1<MTw2<MTw3<M_{T_{w}}<M_{T_{w}-1}<M_{T_{w}-2}<M_{T_{w}-3}<\cdotsuntil Mt=Mt+1=MM_{t}=M_{t+1}=M.\hfill\blacksquare

Fig. 4 simulates the dynamics of QtQ_{t} in (8) and MtM_{t} in (9). We can see that both QtQ_{t} and MtM_{t} fast converge to their steady-states in a few rounds. In the following, we will show the steady-state of the dynamic pricing p(t)p(t) in (6) for infinite time horizon case.

Remark III.1

For the infinite time horizon case, the optimal dynamic pricing is simplified from (6) to

p(t)=ρM+2ρQ(W¯(t)+1)2+2ρQαb,p^{\infty}(t)=\frac{\rho M+2\rho Q\left(\bar{W}^{\infty}(t)+1\right)}{2+2\rho Q\frac{\alpha}{b}}, (20)

and the vehicle response time W¯(t)\bar{W}^{\infty}(t) at time tt is

W¯(t)=2ρMαb2+2ρQαb1(11+ρQαb)t111+ρQαb.\bar{W}^{\infty}(t)=\frac{2-\rho M\frac{\alpha}{b}}{2+2\rho Q\frac{\alpha}{b}}\frac{1-\left(\frac{1}{1+\rho Q\frac{\alpha}{b}}\right)^{t}}{1-\frac{1}{1+\rho Q\frac{\alpha}{b}}}. (21)

As tt\rightarrow\infty, according to QQ in (15) and MM in (16), and note that 11+ρQαb<1\frac{1}{1+\rho Q\frac{\alpha}{b}}<1, the vehicle response time converge to the steady-state W¯\bar{W}:

W¯=limtW¯(t)=(1ρ)(1+ρQαb)ρQ(αb(1ρ)+ρQ(αb)2),\bar{W}=\lim_{t\rightarrow\infty}\bar{W}^{\infty}(t)=\frac{(1-\rho)\left(1+\rho Q\frac{\alpha}{b}\right)}{\rho Q\left(\frac{\alpha}{b}(1-\rho)+\rho Q\left(\frac{\alpha}{b}\right)^{2}\right)}, (22)

and the optimal dynamic pricing converges to

limtp(t)=bα.\lim_{t\rightarrow\infty}p^{\infty}(t)=\frac{b}{\alpha}. (23)

which shows that p(t)bp^{\infty}(t)\leq b always holds when α1\alpha\geq 1.

IV Stage II: Layer 1: Task Allocation

IV-A Algorithm Overview

This section focuses on the first layer of the stage II. We design the UAV task allocation scheme to traverse all packages and minimize the maximum delivery time of each UAV. Package nodes VGV_{G} and the depot nodes VDV_{D} in the traffic network graph GTG_{T} are picked out to generate the task allocation graph, which is denoted as GA=(VA,EA)G_{A}=(V_{A},E_{A}), where the set of vertices VA=VGVDV_{A}=V_{G}\cup V_{D}, and the weight wuvw_{uv} of directed edge (u,v)EA(u,v)\in E_{A} is the predicted passage time from position uu to position vv. We run the shortest path search algorithm in the traffic network graph GTG_{T} to obtain the predicted passage time wuvw_{uv} between all points of the task allocation graph GAG_{A}. We solve such a problem in GAG_{A} to find a shortest path to access all packages, which means we find the delivery sequence of all packages and it has the smallest total predicted passage time.

This layer aims to design the task allocation scheme Pn=d1g1d2g2dkgkdk+1,n{1,,N}P_{n}=d_{1}g_{1}d_{2}g_{2}\dots d_{k}g_{k}d_{k+1},n\in\{1,\dots,N\} to minimize the longest UAV task completion time, i.e., minmaxnNTime(Pn)\min{\max}_{n\in N}\text{Time}(P_{n}), where Time(Pn)\text{Time}(P_{n}) is the total time for the nthn^{th} UAV to complete the corresponding package delivery service and return to depot dk+1d_{k+1} under task allocation PnP_{n} from depot d1d_{1} in delivery order {g1,,gk}\{g_{1},\dots,g_{k}\}. Task allocation PnP_{n} contains the information of which UAV delivers which packages and the delivery order. To find the task allocation scheme for NN UAVs, we first consider the minimum connection tours (MCT) problem that connects all packages:

P1:min(u,v)EAwuvxuv,P1:\min{\sum}_{(u,v)\in E_{A}}w_{uv}x_{uv}, (24)
s.t.(g,d)EAxgd=\displaystyle s.t.{\sum}_{(g,d)\in E_{A}}x_{gd}= (d,g)EAxdg=1,\displaystyle{\sum}_{(d,g)\in E_{A}}x_{dg}=1, (25)
dVD,gVG,\displaystyle\forall d\in V_{D},g\in V_{G},
(v,d)EAxvd=(d,v)EAxdv=1,dVD,{\sum}_{(v,d)\in E_{A}}x_{vd}={\sum}_{(d,v)\in E_{A}}x_{dv}=1,\forall d\in V_{D}, (26)
xdg,xgd{0,1},dVD,gVG,x_{dg},x_{gd}\in\{0,1\},\forall d\in V_{D},g\in V_{G}, (27)
xdd0,d,dVD,x_{dd^{{}^{\prime}}}\in{\mathbb{N}}_{\geq 0},\forall d,d^{{}^{\prime}}\in V_{D}, (28)

where constraint (25) indicates that each package address gVGg\in V_{G} must be visited once and comes from one depot, and the UAV must return to one depot after completing delivery. Constraint (26) indicates that the UAVs entering and exiting each depot are equal, which paves the way for subsequently transforming this problem into a minimum cost circulation problem and designing algorithms in polynomial time. Constraint (27) indicates that the edge connected to the package address is used at most once. The last constraint (28) indicates that multiple round trips between depots are possible.

Algorithm 2 TaskAllocation.
0:  Network Graph GTG_{T}; Location of depots VDV_{D}; Location of packages VGV_{G}; Number of UAV NN;
0:  Each UAV packages delivery order OO;
1:  Allocation graph GASubGraph(GT,VD,VG)G_{A}\leftarrow\text{SubGraph}(G_{T},V_{D},V_{G});
2:  Edge selection vector XMCC(GA)X\leftarrow\text{MCC}(G_{A});
3:  Connected components CGetComponents(GA,X)C\leftarrow\text{GetComponents}(G_{A},X);
4:  while |C|>1|C|>1 do
5:     for c,cc,c^{{}^{\prime}}in CC do
6:        if wdd+wddw_{dd^{{}^{\prime}}}+w_{d^{{}^{\prime}}d}is minimum that dc,dcd\in c,d^{{}^{\prime}}\in c^{{}^{\prime}} then
7:           CMergeComponent(C,c,c)C\leftarrow\text{MergeComponent}(C,c,c^{{}^{\prime}});
8:        end if
9:     end for
10:  end while
11:  Get the tour that start from one depot then visit all the goods finally end at a depot TGetTours(C)T\leftarrow\text{GetTours}(C);
12:  Split Tour TT into NN orders OSplitTour(T,N)O\leftarrow\text{SplitTour}(T,N);
13:  return  OO

The detailed design is given in Algorithm 2, in which we aim to minimize the sum of the estimated passage time between depots and packages while ensuring that each package is dispatched by one and only one depot.

SubGraph. At the task allocation layer, we seek a set of NN paths that minimize the maximum delivery time of any UAV. Here we ignore the specific routes from the depot to the package and only consider how they are connected. A new task allocation graph GAG_{A} is generated from the network GTG_{T} based on the known geographic locations of depots and packages, GAG_{A} contains only the nodes of depots and packages, and the weights between them are the estimated passage times between the two locations.

MinimumCostCirculation (MCC). The problem P1P1 can be viewed as an NN minimal visiting paths problem[11] and is NP-hard. Note that the structure of the minimal visiting paths problem is very similar to the MCC problem, and the latter can be solved by a polynomial-time algorithm[18]. In problem P1P1, xuvx_{uv} indicates the number of times to select the edge (u,v)(u,v), which is an integer. To reduce the computational complexity of the linear programming problem, we transform the integer constraints of the combinatorial optimization problem P1P1 into real numbers, thus transforming the problem into a MCC problem for analytical solution. Finally, we divide the minimum connected path obtained by linear programming into NN paths for each UAV.

GetStrongConnectedComponent. The solution X={xuvX=\{x_{uv}, uu, vVA}v\in V_{A}\} given by MCC shows the number of times the edge (u,v)(u,v) is used in the task allocation graph GAG_{A}. The depot nodes are connected to the package nodes according to the edges used and forming multiple connected components. Then we combine all the connected components to generate the entire connected graph, which has edges covering the trips we need to traverse all the packages with the least cost.

SplitTours. After obtaining the trip that traverses all the packages with minimum cost, we distribute it evenly among all the UAVs. Evenly means that the estimated time of package delivery undertaken by each UAV is close to the total estimated time divided by the total number of UAVs NN. The estimated package delivery time is not equivalent to the actual UAV delivery time. The time spent by the UAV includes the flight time, carrying time (hitching time), waiting time (vehicle response time), and the time spent by the UAV for conflict resolution.

IV-B Computational complexity and optimality

Since we relax the variable X={xuv,u,vVA}X=\{{x_{uv},u,v\in V_{A}}\} in linear programming from an integer constraint to a real number constraint, the MCT problem is converted to a MCC problem. For the MCC problem, we can compute an integer optimal solution in polynomial time[18, 19].

Our goal is to minimize the maximum delivery time of UAVs, by returning the set P=P1,,PNP={P_{1},...,P_{N}}, where PnP_{n} indicates the sequence of package delivered by the nthn^{th} UAV. Then our objective can be expressed as

minimize maxnNTime(Pn).\text{minimize }{\max}_{n\in N}\text{Time}(P_{n}). (29)

Let the optimal task allocation set be P~={P1~,,PN~}\widetilde{P}=\{\widetilde{P_{1}},\dots,\widetilde{P_{N}}\}, where OPT:=maxnNTime(Pn~)OPT:={\max}_{n\in N}\text{Time}(\widetilde{P_{n}}). In the task allocation algorithm, linear programming usually returns more than one connected component, and we aim to find the minimum weights of the edges between depots to connect different connected components, which finally generates a strongly connected path over the graph. Such a path TT traverses all the packages from a depot and then back to a depot, and contains the newly added edges (the depot to another depot), which makes our result slightly deviate from the optimal solution. We observe that

|T|\displaystyle|T| nNTime(Pn~)+αmaxd,dVD(wdd+wdd)\displaystyle\leq{\sum}_{n\in N}\text{Time}(\widetilde{P_{n}})+\alpha\mathop{\max}_{d,d^{{}^{\prime}}\in V_{D}}(w_{dd^{{}^{\prime}}}+w_{d^{{}^{\prime}}d}) (30)
NmaxnNTime(Pn~)+αmaxd,dVD(wdd+wdd),\displaystyle\leq N\mathop{\max}_{n\in N}\text{Time}(\widetilde{P_{n}})+\alpha\mathop{\max}_{d,d^{{}^{\prime}}\in V_{D}}(w_{dd^{{}^{\prime}}}+w_{d^{{}^{\prime}}d}),

where |T||T| is the total weights of the paths TT and α\alpha is the number of times to merge the connected components. We assign a task to each UAV by specifying that its package delivery prediction time is less than the total prediction time divided by total number of UAVs NN. The worst outcome of this is that a UAV is assigned an additional task after it has been assigned exactly the average prediction time, such that we can derive that

maxnNTime(Pn)|T|/N+maxd,dVD,gVG(wdg+wgd).{\max}_{n\in N}\text{Time}(P_{n})\leq|T|/N+\mathop{\max}_{d,d^{{}^{\prime}}\in V_{D},g\in V_{G}}(w_{dg}+w_{gd^{{}^{\prime}}}). (31)

We find that the predicted time undertaken by any one UAV in this layer of the algorithm does not exceed the average predicted time plus the estimated time for one maximum package delivery time. Combining the above equations, we finally get the gap between our results and the optimal solution:

maxnNTime(Pn)OPT+β+γ,{\max}_{n\in N}\text{Time}(P_{n})\leq OPT+\beta+\gamma, (32)

where β:=aNmaxd,dVD(wdd+wdd)\beta:=\frac{a}{N}\mathop{\max}_{d,d^{{}^{\prime}}\in V_{D}}(w_{dd^{{}^{\prime}}}+w_{d^{{}^{\prime}}d}), γ:=maxd,dVD,gVG(wdg+wgd)\gamma:=\mathop{\max}_{d,d^{{}^{\prime}}\in V_{D},g\in V_{G}}(w_{dg}+w_{gd^{{}^{\prime}}}). Obviously, as the number NN of UAVs and the number MM of package delivery requests rises, our result gets closer to the optimal solution and solves in polynomial time.

V Stage II: Layer 2: Multi-Agent Path Finding

Refer to caption
(a)
Refer to caption
(b)
Refer to caption
(c)
Figure 5: Three types of UAV delivery networks, where the solid line represents the hitching route and the dotted line represents the flight route. (a) The UAV will not carry ground vehicles during delivery mission. (b) The UAV can only carry a ground vehicle once during a delivery mission. (c) The UAV can carry ground vehicles more than once during a delivery mission.

According to the task allocation results of the UAVs in the first layer, the task of each UAV can be decomposed into a series of dgddgd^{{}^{\prime}} delivery subtasks, i.e., delivering packages from depot dd to destination gg and back to the same or different depot dd^{{}^{\prime}}. The conflict between UAVs comes from the space limitation of the ground vehicles and interchange points. Here, we define as only one UAV can stay at an interchange point at the same time to wait for ground vehicles.111Our algorithms can be extended to the case with more than one UAV staying at an interchange point at the same time, and the simulation result is shown in Section VI. Based on whether the UAVs hitch on ground vehicles and whether hop between different ground vehicles for each subtask, we discuss the following three types of UAV delivery networks as shown in Fig. 5 (a) direct flight network (b) single-hop hitch network (c) multi-hop hitch network.

V-A Direct Flight Network

In a direct flight network, UAVs are utilized for delivering packages without the help of ground vehicles. In this case, we consider that the UAVs deliver packages by flying from the depot to the package address directly, and thus the delivery time of a UAV is equal to the straight-line distance from the depot to the package address divided by the UAV’s average flight speed. Though delivering directly by UAVs saves a lot of time especially during peak hours or in an a complex geographical environment, the energy limitation of the UAVs has restricted their service range. In the direct flight network, for each package delivery request executed by the UAV, we need to consider the constraint that whether the flight distance consumed in the whole journey exceeds its own limit. When the UAV cannot reach the designated location, we consider the delivery mission to fail. The multimodal logistics system can solve the flight distance limitation of direct flight network, and we compare the mission delivery failure rates of the three networks in Section VI later.

V-B Single-Hop Hitch Network

Algorithm 3 Single-Hop Path Finding.
0:  Departure location dd; End point location ee; Transit sections set PrP_{r};
0:  Path of the current subtask PP;
1:  Initialize maximum aircraft time for UAVs T¯\overline{T}; Minimum passage time tmin+t_{min}\leftarrow+\infty;
2:  for r in PrP_{r} do
3:     (i,j)r(i,j)\leftarrow r;
4:     if tdi+tjeT¯2t^{{}^{\prime}}_{di}+t^{{}^{\prime}}_{je}\leq\frac{\overline{T}}{2} and not exceeding the capacity of the interchange point ii then
5:        if tdi+tij+tjetmint^{{}^{\prime}}_{di}+t_{ij}+t^{{}^{\prime}}_{je}\leq t_{min} then
6:           tmintdi+tij+tjet_{min}\leftarrow t^{{}^{\prime}}_{di}+t_{ij}+t^{{}^{\prime}}_{je};
7:           P{d,i,j,e}P\leftarrow\{d,i,j,e\};
8:        end if
9:     end if
10:  end for
11:  return  PP

In a single-hop hitch network, the UAV hitch on a ground vehicle only once during the journey. For a delivery subtask of dgddgd^{{}^{\prime}}, the UAV can depart from the depot dd, fly to the interchange point ii to wait and hitch on the vehicle through the transit section r=(i,j)r=(i,j), and then fly from jj to the delivery destination gg. After completing the subtask, it returns to the depot dd^{{}^{\prime}} in order to perform the next subtask. Denote the passage time of the UAV from the depot to the package address be Tn,r,dg=tdi+tij+tjgT_{n,r,dg}=t^{{}^{\prime}}_{di}+t_{ij}+t^{{}^{\prime}}_{jg}, where tdit^{{}^{\prime}}_{di} and tjgt^{{}^{\prime}}_{jg} are the corresponding UAV flight time (which can be obtained from the UAV speed) and tijt_{ij} is the ride time. Based on the crowdsourcing incentive mechanism, the vehicle response time for the transit section r=(i,j)r=(i,j) is predicted to be W¯i\overline{W}_{i}. So the ride time is tij=W¯i+tijvt_{ij}=\overline{W}_{i}+t^{v}_{ij}, where tijvt^{v}_{ij} is the vehicle passage time for section rr (which can be obtained from the average vehicle speed). We decompose one delivery subtask into two segments dgdg and gdgd^{{}^{\prime}}. The transit sections rr and rr^{{}^{\prime}} can be the same or different sections. The transit sections are directed edges and the set of all transit sections is PrP_{r}. With the objective of minimizing delivery subtask completion time, the UAV hitch problem is modeled as

P2:minrPr(Tn,r,dgyn,r,dg+Tn,r,gdyn,r,gd),\displaystyle P2:\min{\sum}_{r\in P_{r}}(T_{n,r,dg}y_{n,r,dg}+T_{n,r,gd^{{}^{\prime}}}y_{n,r,gd^{{}^{\prime}}}), (33)
n{1,,N},\displaystyle\forall n\in\{1,\dots,N\},
s.t.n{1,,N}(yn,r,dg+yn,r,dg)Cr,rPr,s.t.{\sum}_{n\in\{1,\dots,N\}}(y_{n,r,dg}+y_{n,r,dg^{{}^{\prime}}})\leq C_{r},\forall r\in P_{r}, (34)
rPryn,r,dg=rPryn,r,gd=1,\displaystyle{\sum}_{r\in P_{r}}y_{n,r,dg}={\sum}_{r\in P_{r}}y_{n,r,gd^{{}^{\prime}}}=1, (35)
n{1,,N},\displaystyle\forall n\in\{1,\dots,N\},
rPr(tn,r,dgyn,r,dg+tn,r,gdyn,r,gd)T¯,\displaystyle{\sum}_{r\in P_{r}}({t^{{}^{\prime}}_{n,r,dg}}y_{n,r,dg}+{t^{{}^{\prime}}_{n,r,gd^{{}^{\prime}}}}y_{n,r,gd^{{}^{\prime}}})\leq\overline{T}, (36)
n{1,,N},\displaystyle\forall n\in\{1,\dots,N\},
yn,r,dg,yn,r,gd{0,1},rPr,n{1,,N},y_{n,r,dg},y_{n,r,gd^{{}^{\prime}}}\in\{0,1\},\forall r\in P_{r},\forall n\in\{1,\dots,N\}, (37)

where constraint (34) indicates that the UAV that selects a transit section rr cannot exceed its capacity CrC_{r} (no more than CrC_{r} UAVs can stay at an interchange point at the same time), thereby avoiding UAV conflicts. The constraint (35) indicates that only one transit section can be selected from dd to gg and from gg to dd^{{}^{\prime}}. And constraint (36) indicates that the total flight time of the UAV cannot exceed its maximum flight time T¯\overline{T}, where tn,r,dg{t^{{}^{\prime}}_{n,r,dg}} is the flight time of the UAV nn that selects section rr.

A delivery subtask of dgddgd^{{}^{\prime}} can be divided into two parts: from dd to gg and from gg to dd^{{}^{\prime}}. Thus each UAV needs to invoke the path finding algorithm twice with half the energy constraint each time to perform a subtask and the detail of single-hop path finding is given in Algorithm 3. The algorithm takes the departure location dd, destination ee and interchange sections PrP_{r} as inputs. Single-Hop Path Finding algorithm only has to traverse all the interchanges once to find the path that takes the shortest time and does not exceed the UAV’s flight limit.

V-C Multi-Hop Hitch Network

In multi-hop hitch networks, UAVs can extend their delivery range by hopping between more than one ground vehicles in one delivery task, which makes the path finding problem more challenging due to more combinations of transit sections. When performing path planning, UAVs need to consider their own flight distance limitations and the conflicts between UAVs. CBS can solve the conflict problem between UAVs. It first ignores the conflict between UAVs in the upper layer and finds the optimal path for each UAV, then detects whether there is a conflict between the paths of each UAV in the lower layer. If a conflict occurs, CBS adds a new restriction and runs the path planning algorithm in the upper layer again. However, multiple agents operate on the same traffic network, and whenever a conflict is detected between UAVs, a new conflict restriction needs to be added for each conflicting UAV and a new path is planned for each UAV again. As the number of UAVs NN and the number of package delivery requests MM increases, the number of conflicts between UAVs will become larger and larger. Thus, the system will frequently plan new routes for UAVs, leading to degradation of network performance [16].

Algorithm 4 CABPS.
0:  Network Graph GTG_{T}; Departure location dd; End point location ee; Vehicle response time for each interchange point W¯\overline{W}; Constraints on interchange points IcI_{c};
0:  Path of the current subtask PP;
1:  Initialize maximum aircraft time for UAVs T¯\overline{T}; Initialize an empty Priority Queue QQ;
2:  Generate a new subgraph GSSubGraph(GT,d,e)G_{S}\leftarrow\text{SubGraph}(G_{T},d,e);
3:  Push dd into Priority Queue QQ;
4:  while |Q|>0|Q|>0 do
5:     current node cpop(Q)c\leftarrow\text{pop}(Q);
6:     if c==ec==e then
7:        return  Path PP from departure point to cc;
8:     end if
9:     for ii in neighbors of point cc do
10:        if wci+w_{ci}+ total flight time cost from dd to c<T¯2c<\frac{\overline{T}}{2} then
11:           if edge(c,i)(c,i) in interchange edges of GSG_{S} then
12:              Consider response times W¯\overline{W} and occupancy conflicts IcI_{c} at interchange point;
13:           end if
14:           push ii into Priority Queue QQ;
15:        end if
16:     end for
17:  end while
18:  return  empty path PP

To deal with the performance degradation of CBS, we propose a conflict avoidance-based path search algorithm for the path optimization problem under multi-hop hitch network, which can be solved in polynomial time. Firstly, we search for the shortest path for UAVs by A\text{A}^{*} heuristic algorithm. When a UAV determines a clear path, we want to make sure other UAVs don’t clash with it. So the system will updates the information about the occupied interchange points in the network as it plans the path for each UAV, and that occupancy information needs to be taken into account when planning paths for other UAVs. Thus we are able to ensure that no conflicts occur in the network. The detail of CABPS is given in Algorithm 4. The inputs to the algorithm are the entire network graph GTG_{T}, the departure location dd, destination point ee, the vehicle response time W¯\overline{W} of each interchange predicted based on the crowdsourcing incentive mechanism and the information about the current timestamp IcI_{c} of each interchange being occupied.

Step 1 (lines 1-2): Construct a new path search graph GS=(VS,ES)G_{S}=(V_{S},E_{S}) from the original graph based on the departure node dd and end node ee of the current subtask, where VS=deVIV_{S}=d\cup e\cup V_{I}. We connect such edges like (u,v)ES(u,v)\in E_{S} that u,vVSu,v\in V_{S}, for any edge (u,v)(u,v) with two properties, one is the passage time spent from uu to vv, and the other is the UAV flight time consumed. We define the path search graph GSG_{S} as follows.

Definition V.1

There is only one depot node and one package node in the path search graph GSG_{S}, and the rest are interchange points. For an edge (u,v)(u,v), its properties are divided into the following cases: (i) If one of uu and vv is a depot or package node, the passage time of the edge is the straight-line distance from uu to vv divided by the average speed of the UAV, and the consumption time of the edge is the flight time of the UAV. (ii) If both uu and vv are interchange points and the edge (u,v)(u,v) is a transit route, the passage time of the edge is the distance of the journey from uu to vv divided by the average speed of the vehicle, and the consumption of flight time is 0. (iii) Both uu and vv are interchange points, but the edge (u,v)(u,v) is not a transit route. The passage time of this edge is the straight-line distance from uu to vv divided by the UAV speed and the consumption time of the edge is the flight time of the UAV.

Step 2 (lines 3-17): Use the A\text{A}^{*} heuristic algorithm[20] to search for the shortest path in the subgraph GSG_{S}. A\text{A}^{*} uses a priority queue to store the information of historical paths from start node to the candidate nodes, and the heuristic function is the shortest path search algorithm from the current node to the destination node. In the path search process, we save the paths that do not consume more flight than the limit, and the passage time consumed by the current path includes the delivery time, waiting time at the interchange and the additional waiting time due to conflict avoidance.

Step 3 (lines 18): If A\text{A}^{*} cannot find a path that matches the condition, we return an empty set to indicate that the delivery subtask failed.

V-D Feasibility and Time Complexity

For the direct flight network, the UAV only considers the straight-line distance from the origin location to the destination point, so its result is unique. As for the single-hop hitch network where the UAV only carries the ground vehicle once during the process from the delivery task, the path combination of the single-hop hitch network is limited and only related to the number of interchange routes in the network, thus we mainly discuss the multi-hop hitch network here.

V-D1 The shortest path in time

A\text{A}^{*} selects a new node when performing an iterative search based on the cost of the path and an estimate of the cost required to extend the path to the destination. A\text{A}^{*} selects the minimized path as

f(n)=g(n)+h(n),f(n)=g(n)+h(n), (38)

where nn is the candidate nodes on the path, g(n)g(n) is the passage time spent from the starting point to nn. h(n)h(n) is a heuristic function to estimate the cost of the shortest path from nn to the destination, which we represent by the time-consuming shortest path between two points calculated by Dijkstra’s algorithm. In the UAV path search, the actual resulting shortest path must not take less time than the unconstrained shortest path, considering the energy storage limitation of UAV, thus our heuristic function design is reasonable.

When A\text{A}^{*} stops searching, it indicates that it has found a path from the starting point to the goal that has an actual cost lower than the estimated cost of any other path. Thus A\text{A}^{*} will never overestimate the actual cost of reaching the destination, and is guaranteed to return the lowest cost path from the starting point to the destination.

V-D2 Avoiding conflicting paths

In CABPS, we ensure that each planned path will not be affected by the subsequent path. That is, each time a path is planned for a UAV, we no longer change it, but update the occupancy information of this path on the interchange points of the network for the subsequent path planning. The occupancy information is a time-stamped interval, its lower bound is the arrival time of the UAV at the site, and upper bound is the arrival time plus the response time of the vehicles on the ground. The situation is considered as a conflict if the UAV’s arrival time is within the interval of the interchange’s occupancy timestamp. Different from CBS in which the conflicting paths are jointly planned, in CABPS, when a conflict occurs, the original path will not be replanned, and the subsequent UAV needs to wait until another UAV leaves the interchange point, which means that more waiting time is introduced. In the following lemma, we show that even though CABPS may get longer delivery time by reducing the computational burden, the performance gap between CBS and CABPS is small.

Lemma V.1

quasiOPTquasiOPT is the approximate optimal solution of cbsOPTcbsOPT, where cbsOPTcbsOPT and quasiOPTquasiOPT are the passage time of the paths obtained by CBS and CABPS, respectively.

Proof: Suppose there are two paths pp, qq with conflicting in certain time stamp. Denote TrT_{r} as the time spent on path rr. When a conflict occurs between paths pp and qq, the problem is divided into two subproblems according to the definition of CBS[16], i.e., pp maintains the original path and qq adds restrictions to find a new path qq^{{}^{\prime}} or qq maintains the original path and pp adds restrictions. Then the extra time spent on the path after CBS resolution can be expressed as min(TqTq,TpTp)\min(T_{q^{{}^{\prime}}}-T_{q},T_{p^{{}^{\prime}}}-T_{p}). According to the CABPS definition, we always guarantee that a path is not subject to change. Now assume that the invariable path is pp. The occupancy time of path pp at the interchange point that conflicts with path qq is W¯p\overline{W}_{p}, and the extra time can be expressed as min(TqTq,W¯p)\min(T_{q^{{}^{\prime}}}-T_{q},\overline{W}_{p}). When the extra time is W¯p\overline{W}_{p}, it indicates that we maintain the original path qq, and the extra time for path qq is the waiting time for the UAV on path pp leaving the occupied interchange point, i.e., W¯p\overline{W}_{p}. Thus, we have

cbsOPT=Tp+Tq+min(TqTq,TpTp),cbsOPT=T_{p}+T_{q}+\min(T_{q^{{}^{\prime}}}-T_{q},T_{p^{{}^{\prime}}}-T_{p}), (39)
quasiOPT=Tp+Tq+min(TqTq,W¯p).quasiOPT=T_{p}+T_{q}+\min(T_{q^{{}^{\prime}}}-T_{q},\overline{W}_{p}). (40)

The different between CBS and CABPS is the extra time spent on conflict resolution and it can be observed that

quasiOPT=\displaystyle quasiOPT= cbsOPT+min(TqTq,W¯p)\displaystyle cbsOPT+\min(T_{q^{{}^{\prime}}}-T_{q},\overline{W}_{p}) (41)
min(TqTq,TpTp).\displaystyle-\min(T_{q^{{}^{\prime}}}-T_{q},T_{p^{{}^{\prime}}}-T_{p}).

Assuming that Tq<TqT_{q^{{}^{\prime}}}<T_{q}, then qq^{{}^{\prime}} will be the optimal path before the conflict occurs, which causes the contradiction. Similarly Tp>TpT_{p^{{}^{\prime}}}>T_{p}. So we have quasiOPT<=cbsOPT+min(TqTq,W¯p)quasiOPT<=cbsOPT+\min(T_{q^{{}^{\prime}}}-T_{q},\overline{W}_{p}). We note that quasiOPT<=cbsOPT+W¯pquasiOPT<=cbsOPT+\overline{W}_{p} holds constantly, where W¯p\overline{W}_{p} is the vehicle response time depending on the incentive mechanism and traffic density.

Now we extend the problem to NN paths in conflict. According to the definition of CBS, the conflicts of the first two paths are resolved first and branching is done based on their conflicts. Denote the time spent to resolve the conflicts of NN paths as i{1,,N1}cbsOPTi{\sum}_{i\in\{1,\dots,N-1\}}{cbsOPT}_{i}. The CABPS algorithm determines each path in order, so the time spent to resolve the conflicts of NN paths is i{1,,N1}quasiOPTi{\sum}_{i\in\{1,\dots,N-1\}}{quasiOPT}_{i}. In the worst case, we have

i{1,,N1}quasiOPTi\displaystyle{\sum}_{i\in\{1,\dots,N-1\}}{quasiOPT}_{i} (42)
i{1,,N1}cbsOPTi+(N1)maxrPrW¯r\displaystyle\leq{\sum}_{i\in\{1,\dots,N-1\}}{cbsOPT}_{i}+(N-1){\max}_{r\in P_{r}}\overline{W}_{r}
(1+C)i{1,,N1}cbsOPTi,\displaystyle\leq(1+C){\sum}_{i\in\{1,\dots,N-1\}}{cbsOPT}_{i},

where C=(N1)maxrPrW¯ri{1,,N1}cbsOPTiC=\frac{(N-1){\max}_{r\in P_{r}}\overline{W}_{r}}{{\sum}_{i\in\{1,\dots,N-1\}}{cbsOPT}_{i}} and PrP_{r} is the set of conflicting paths. According to the dynamic pricing, the maximum vehicle response time can be predicted. For small vehicle response time, the factor CC is small, which makes the bound on quasiOPTquasiOPT closer to the optimal solution cbsOPTcbsOPT.\hfill\blacksquare

TABLE I: The average computation time (seconds) of the task allocation algorithm on a fixed network GT=(VT,ET)G_{T}=(V_{T},E_{T}), where |VT|=846|V_{T}|=846, MM is the number of packages and KK is the number of depots.
M K=2 K=3 K=4 K=5 K=10 K=15 K=20
10 0.010 0.016 0.037 0.088 0.258 0.923 1.875
20 0.013 0.019 0.041 0.100 0.294 0.975 2.192
50 0.013 0.024 0.048 0.133 0.350 1.182 2.646
100 0.015 0.021 0.051 0.128 0.360 1.203 2.685
200 0.019 0.034 0.082 0.216 0.533 1.802 3.790
400 0.029 0.047 0.123 0.263 0.735 2.644 5.024
600 0.036 0.057 0.152 0.403 11.585 2.880 5.989

V-D3 Computational complexity

As mentioned above, in CABPS, the planned UAV route will not be changed, and later UAVs can only choose to replan route to avoid conflict or wait for the previous UAV to leave. Thus, the shortest path search algorithm only needs to be run once for each UAV. Noting that the UAV cannot exceed its own maximum flight distance during delivery, but adding this restriction to the shortest path problem makes it NP-hard. Motivated by [21], we extend the A\text{A}^{*} into MultiConstraint Shortest Path (A_MCSP\text{A}^{*}\_\text{MCSP}) algorithm and use it as a low-level search algorithm for CABPS. As A_MCSP\text{A}^{*}\_\text{MCSP} prunes the unnecessary search space by forward-looking information and eligibility tests, it significantly reduces the computational complexity and solve the problem in polynomial time. Thus, the time complexity of CABPS is also polynomial.

VI Simulations

VI-A Task Allocation

In this section, we validate our results via simulations. The traffic network is generated on the basis of some real-world scenarios.

In Section IV, we show that our method can obtain an near-optimal solution. In Table I, we verify the efficiency of the task allocation algorithm for different numbers of depots and package delivery requests. Our approach is to randomly select some nodes on the network as depots or packages, and get an evaluation matrix with size of (K+M)×(K+M)(K+M)\times(K+M), where KK is the number of depots and MM is the number of packages. Then we run the task allocation algorithm on this matrix. As shown in Table I, the task allocation running time of our algorithm grows polynomially as KK and MM increase.

VI-B Multi-Agent Path Finding

In this sub-section, we verify the efficiency of our results in MAPF layer, by examining the success rate of UAV deliveries, impacts of the interchange capacities, the number of UAVs and depots on the package delivery time, respectively.

Refer to caption

Figure 6: The average delivery failure rate of three UAV delivery networks vs. number of transit edges.

Refer to caption

Figure 7: The average delivery time vs. capacity of interchange point for different maximum vehicle response time.

Even though all the packages are allocated to certain UAVs at the task allocation layer, there are still some packages that the UAVs cannot deliver due to the energy constraint. If the CABPS algorithm cannot find a feasible path or its execution time exceeds 300300 seconds, we consider this package delivery task a failure. Fig. 6 compares the failure rates of UAVs delivering package under three types of UAV delivery networks. We use the number of transit edges in the networks as variables, and 5050 experiments are conducted for each number of transit edges. The network is randomly generated for each experiment, and we take the average of the final results. To demonstrate the effectiveness of our multi-hop hitch network in improving the range of UAV deliveries, we set the maximum flight distance of the UAV to be small. As shown in Fig. 6, the average package delivery failure rate for the multi-hop hitch network eventually drops to 0%0\%, comparing to 70%70\% and 65%65\% for the direct flight and single-hop hitch networks. It also shows that the performance of our multi-hop hitch network improves significantly as the number of interchange edges increased.

Refer to caption

Figure 8: Average delivery time vs. maximum flight distance for vehicles only and our multimodal logistics system.

We extend to the case with more than one UAV staying at an interchange point at the same time, which reduces the probability of conflict in the logistics system. Assuming that the capacity of each interchange point is C¯\bar{C} which means that up to C¯\bar{C} UAVs can stay at an interchange point simultaneously. Fig. 7 shows how the UAV’s delivery time changes with the capacity of the interchange point for different vehicle response time. For each setting, 5050 experiments are conducted. It shows that the delivery time reduces for smaller vehicle response time with less additional cost associated with conflict avoidance by UAVs. In addition, as the capacity of the interchange points increases, the delivery time decreases due to less conflicts.

We compare the average delivery time of vehicles only and our multimodal logistics system. Considering the speed limit of the vehicle in the urban area, we set the speed of the UAV to 1.31.3 times the speed of the vehicle. The delivery time of the vehicle is obtained based on the shortest path. We use the flight distance limit of the UAV as an experimental variable and set the number of interchange paths to 6060 in order to ensure that the UAV can deliver most of the packages. 5050 experiments are conducted separately for different flight limits. It is shown in Fig. 8 that, as the UAV’s maximum flight distance increase, the delivery time for the air-ground combination delivery method reduces. This is because, within large flight distance, the UAV relies less on the ground vehicles to complete the package delivery task, and thus takes less time.

TABLE II: The average time (in seconds) for sub-task path planning calculation time, sub-task delivery time and maximum delivery time for any UAV to complete a delivery task, where NN is the number of UAVs and K=|VD|K=|V_{D}| is the number of depots. 5050 experiments are conducted for each setting.
{ Avg Calculation Time, Avg Delivery Time, Max Delivery Time}
N K=1 K=3 K=5 K=10 K=20 K=30
1 {0.237, 52.0, 5202} {0.037, 27.8, 2779} {0.005, 15.4, 1541} {0.005, 12.9, 1289} {0.003, 9.5, 945} {0.0002, 7.5, 747}
5 {0.236, 52.1, 1244} {0.038, 27.8, 666} {0.005, 15.3, 404} {0.005, 12.7, 312} {0.003, 9.2, 220} {0.0001, 7.0, 170}
10 {0.243, 52.2, 688} {0.038, 27.8, 383} {0.004, 15.2, 224} {0.005, 12.6, 182} {0.002, 8.6, 128} {0.0002, 6.5, 96}
20 {0.244, 52.3, 392} {0.038, 27.7, 223} {0.004, 15.0, 139} {0.005, 12.3, 112} {0.002, 8.3, 78} {0.0001, 6.0, 60}
30 {0.245, 52.5, 301} {0.039, 27.7, 179} {0.004, 14.8, 108} {0.005, 12.1, 94} {0.002, 8.1, 64} {0.0001, 5.8, 50}

Finally, we verify the impact of the number of depots and UAVs on the network performance. The experimental results for different configurations are given in Table II. As the number of depots rises, the UAV can be allocated to the depots closer to it for delivery, leading to shorter delivery time. The increase in the number of UAVs allows us to distribute tasks out evenly, which reduces the time to complete all package delivery requests.

VII Conclusion

We propose an effective air-ground multimodal logistics system based on crowdsourcing. To deal with the UAVs’ energy storage limitation problem, the ground vehicles are invited to provide hitching services for UAVs. We formulate the crowdsourcing-based multimodal logistics problem by two-stages. In Stage I, a dynamic pricing scheme is proposed to motivate the ground vehicles to help UAV delivery, which balance the monetary reward and waiting time for UAVs for cost minimization. In Stage II, the path planning over the traffic network is discussed. We divide the problem into two layers to solve. In the first layer, we use an approximate optimal solution algorithm to distribute delivery tasks in polynomial time. In the second layer, a multi-UAV path search algorithm is proposed based on conflict avoidance, which is an approximate solution of CBS and maintains its performance in the case of a large number of conflicts. Finally, several simulations are conducted to verify that our approach is efficient. It is shown that our system substantially increases the delivery range of the UAV (its delivery success rate increased by 325%325\% in our experiments) and its delivery time is much less than vehicle deliveries.

References

  • [1] O. Hultkrantz and K. Lumsden, “E-commerce and consequences for the logistics industry,” in Proceedings for Seminar on “The Impact of E-Commerce on Transport.” Paris.   Citeseer, 2001.
  • [2] W. Liu, Y. Liang, X. Bao, J. Qin, and M. K. Lim, “China’s logistics development trends in the post covid-19 era,” International Journal of Logistics Research and Applications, pp. 1–12, 2020.
  • [3] J. Muñuzuri, J. Larrañeta, L. Onieva, and P. Cortés, “Solutions applicable by local administrations for urban logistics improvement,” Cities, vol. 22, no. 1, pp. 15–28, 2005.
  • [4] A. W. Sudbury and E. B. Hutchinson, “A cost analysis of amazon prime air (drone delivery),” Journal for Economic Educators, vol. 16, no. 1, pp. 1–12, 2016.
  • [5] N. Agatz, P. Bouman, and M. Schmidt, “Optimization approaches for the traveling salesman problem with drone,” Transportation Science, vol. 52, no. 4, pp. 965–981, 2018.
  • [6] M. Hossain and I. Kauranen, “Crowdsourcing: a comprehensive literature review,” Strategic Outsourcing: An International Journal, 2015.
  • [7] D. Buchmueller, S. A. Green, A. Kalyan, and G. Kimchi, “Communications and landings of unmanned aerial vehicles on transportation vehicles for transport,” Nov. 3 2020, uS Patent 10,822,081.
  • [8] A. Borowczyk, D.-T. Nguyen, A. Phu-Van Nguyen, D. Q. Nguyen, D. Saussié, and J. Le Ny, “Autonomous landing of a multirotor micro air vehicle on a high velocity ground vehicle,” Ifac-Papersonline, vol. 50, no. 1, pp. 10 488–10 494, 2017.
  • [9] D. Falanga, A. Zanchettin, A. Simovic, J. Delmerico, and D. Scaramuzza, “Vision-based autonomous quadrotor landing on a moving platform,” in 2017 IEEE International Symposium on Safety, Security and Rescue Robotics (SSRR), 2017, pp. 200–207.
  • [10] M. Jünger, G. Reinelt, and G. Rinaldi, “The traveling salesman problem,” Handbooks in operations research and management science, vol. 7, pp. 225–330, 1995.
  • [11] T. Bektas, “The multiple traveling salesman problem: an overview of formulations and solution procedures,” omega, vol. 34, no. 3, pp. 209–219, 2006.
  • [12] M. Naazare, D. Ramos, J. Wildt, and D. Schulz, “Application of graph-based path planning for uavs to avoid restricted areas,” in 2019 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR), 2019, pp. 139–144.
  • [13] A. Rovira-Sugranes and A. Razi, “Predictive routing for dynamic uav networks,” in 2017 IEEE International Conference on Wireless for Space and Extreme Environments (WiSEE), 2017, pp. 43–47.
  • [14] Z. Zhang, J. Li, and J. Wang, “Sequential convex programming for nonlinear optimal control problems in uav path planning,” Aerospace Science and Technology, vol. 76, pp. 280–290, 2018.
  • [15] M. Tuqan, N. Daher, and E. Shammas, “A simplified path planning algorithm for surveillance missions of unmanned aerial vehicles,” in 2019 IEEE/ASME International Conference on Advanced Intelligent Mechatronics (AIM), 2019, pp. 1341–1346.
  • [16] G. Sharon, R. Stern, A. Felner, and N. R. Sturtevant, “Conflict-based search for optimal multi-agent pathfinding,” Artificial Intelligence, vol. 219, pp. 40–66, 2015.
  • [17] S. Choudhury, K. Solovey, M. J. Kochenderfer, and M. Pavone, “Efficient large-scale multi-drone delivery using transit networks,” Journal of Artificial Intelligence Research, vol. 70, pp. 757–788, 2021.
  • [18] É. Tardos, “A strongly polynomial minimum cost circulation algorithm,” Combinatorica, vol. 5, no. 3, pp. 247–255, 1985.
  • [19] R. K. Ahuja, T. L. Magnanti, and J. B. Orlin, “Network flows: Theory,” Algorithms, and, 1993.
  • [20] P. E. Hart, N. J. Nilsson, and B. Raphael, “A formal basis for the heuristic determination of minimum cost paths,” IEEE transactions on Systems Science and Cybernetics, vol. 4, no. 2, pp. 100–107, 1968.
  • [21] Y. Li, J. Harms, and R. Holte, “Fast exact multiconstraint shortest path algorithms,” in 2007 IEEE International Conference on Communications, 2007, pp. 123–130.