This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

UAV-assisted Emergency Integrated Sensing and Communication Networks: A CNN-based Rapid Deployment Approach

Zao Wang1, Lianming Xu1, Luyang Hou1, Ruoguang Li2, and Li Wang1
1School of Computer Science, Beijing University of Posts and Telecommunications, Beijing, China
2National Mobile Communications Research Laboratory, Southeast University, Nanjing, China
{wangzao, xulianming, luyang.hou, liwang}@bupt.edu.cn, [email protected]
This work was supported in part by the National Key Research and Development Program of China under Grant 2020YFC1511801, and in part by National Natural Science Foundation of China under grants 62171054, 62201071, 62201089 and U2066201. (Corresponding author: Li Wang.)
Abstract

UAV-assisted integrated sensing and communication (ISAC) network is crucial for post-disaster emergency rescue. The speed of UAV deployment will directly impact rescue results. However, the ISAC UAV deployment in emergency scenarios is difficult to solve, which contradicts the rapid deployment. In this paper, we propose a two-stage deployment framework to achieve rapid ISAC UAV deployment in emergency scenarios, which consists of an offline stage and an online stage. Specifically, in the offline stage, we first formulate the ISAC UAV deployment problem and define the ISAC utility as the objective function, which integrates communication rate and localization accuracy. Secondly, we develop a dynamic particle swarm optimization (DPSO) algorithm to construct an optimized UAV deployment dataset. Finally, we train a convolutional neural network (CNN) model with this dataset, which replaces the time-consuming DPSO algorithm. In the online stage, the trained CNN model can be used to make quick decisions for the ISAC UAV deployment. The simulation results indicate that the trained CNN model achieves superior ISAC performance compared to the classic particle swarm optimization algorithm. Additionally, it significantly reduces the deployment time by more than 96%.

Index Terms:
Emergency rescue, ISAC, UAV deployment, CNN.

I Introduction

After natural disasters occur, the existing network infrastructure is severely damaged [1]. Building an emergency sensing and communication (ISAC) network is helpful for the smooth implementation of subsequent rescue operations, where the emergency ISAC network is an emergency task-oriented wireless communication network with sensing capabilities. Recently, unmanned aerial vehicles (UAV) have been widely used in the construction of emergency ISAC networks due to their high mobility and flexibility [2]. When building a UAV-assisted emergency network, there are two key issues to consider: deployment and resource allocation.

The UAV deployment and resource allocation has been investigated from several perspectives. Ref. [3] employed gibbs sampling and block coordinate descent to jointly optimize UAV locations, bandwidth, and power allocation to maximize network throughput while ensuring localization accuracy constraints are met. Ref. [4] proposed an approach based on genetic algorithms and multi-intelligence deep reinforcement learning to optimize UAV locations in order to maximize the ISAC performance. Ref. [5] develop a method based on the bipartite graph to solve the problem of optimal matching for a wireless communication network. Ref. [6] proposed a roth and vande vate-based distributed scheme to solve the dynamic matching problem in device-to-device communications network. The above-mentioned method requires a lot of time to compute UAV locations and resource allocation results. Whenever the input information of algorithms is changed, it needs to restart the computation. However, emergency scenarios are characterized by complex environments and dynamic user locations, which require the execution time of algorithms to be as short as possible. Therefore, the above-mentioned method is difficult to apply in emergency scenarios.

Several studies have also investigated rapid UAV deployment. Ref. [7] adopted KMeans to solve the UAV communication deployment problem. Ref. [8] developed a method based on virtual force field to optimize UAV locations with the aim of maximizing the network throughput. However, these methods did not yield satisfactory network performance. Ref. [9] proposed a UAV deployment algorithm based on deep neural networks (DNN). This algorithm builds the optimal UAV locations dataset through reinforcement learning, and then trains a DNN model to compute UAV locations. However, DNN cannot well understand the two-dimensional user distribution information. In [10], a learning framework based on CNN is designed to address the cache placement and trajectory optimization problem with the aim of maximizing the overall network throughput in vehicular networks. CNN can understand user distribution information, but CNN is primarily sensitive to image data. However, in emergency scenarios where personnel distribution is sparse, traditional CNNs have limitations. In addition, there is currently no rapid UAV deployment algorithm for ISAC network.

Based on aforementioned analysis, this paper solves the rapid ISAC UAV deployment problem by proposing a CNN-based algorithm. The main contributions of this paper are summarized as follows:

  • To solve the above problem in real time, we propose a two-stage framework based on CNN, which consists of an offline and an online stage. In the offline stage, we construct a UAV optimized deployment dataset for complex ISAC network scenarios and train a CNN deployment model based on this dataset. In the online stage, we utilize the trained CNN model to perform fast UAV deployment decisions.

  • In the offline stage, we design a DPSO algorithm to compute UAV locations. Specifically, we introduce a nonlinear weight and learning factor update strategy to speed up the convergence and prevents algorithm falling into local optima.

  • To solve the problem caused by sparse distribution of users in emergency scenario, we further propose a feature enhancement method based on gaussian distribution to make CNN understand user distribution better.

II System Model and problem formulation

As illustrated in Fig. 1, we consider a multi-UAV-assisted ISAC system for emergency rescue in a mountainous forest area, where sensing only considers localization. Specifically, the system consists of a control vehicle and MM UAVs, in which the control vehicle serves as a centralized controller to schedule all the UAVs, and UAVs provide simultaneous communication and localization service for NN on-ground affected users. In this system, a signal service cycle is divided into two phases in time order: the communication phase and the localization phase. During the communication phase, each user can only establish a communication connection with one UAV. During the localization phase, Each user establishes localization connections with three UAVs. All UAVs share the same frequency band. Each UAV BS provides communication and localization services for users by frequency division multiple access technology (FDMA). The UAV and user sets are denoted by ={1,2,,M}\mathcal{M}=\left\{1,2,\cdots,M\right\} and 𝒩={1,2,,N}\mathcal{N}=\left\{1,2,\cdots,N\right\}, respectively. The coordinates of mm-th UAV and ground nn-th user can be represented as 𝐮m=[xm,ym,zm]{\mathbf{u}}_{m}=[{x_{m}},{y_{m}},{z_{m}}] and 𝐨n=[xn,yn,0]{\mathbf{o}}_{n}=[{x_{n}},{y_{n}},0], respectively. The distance between mm-th UAV and nn-th user can be expressed as dm,n=𝐮m𝐨n2{{d}_{m,n}}={\left\|\mathbf{u}_{m}-\mathbf{o}_{n}\right\|}_{2}.

In practice, the location of UAVs is also subject to collision avoidance constraints and safety altitude constraints, i.e.,

dm1,m2dmin,m1,m2,m1m2,{{d}_{m_{1},m_{2}}}\geq{{d}_{min}},\forall m_{1}\in\mathcal{M},\forall m_{2}\in\mathcal{M},m_{1}\neq m_{2}\text{,} (1)
zmhmin,m,{{z}_{m}}\geq{{h}_{\min}},\forall m\in\mathcal{M}\text{,} (2)

where dmind_{min} denotes the minimum inter-UAV distance to ensure collision avoidance, hminh_{min} is the minimum flight altitude to prevent collision with the mountain.

The total path loss (in dB) between mm-th UAV and nn-th user in forest environments can be represented as follows, according to [11],

PLm,n=20lg(4πfd0c)+10ηlg(dm,nd0)+Xσ+PLm,nSlant,PL_{m,n}=20\lg\left(\frac{4\pi f{{d}_{0}}}{c}\right)+10\eta\lg\left(\frac{{{d}_{m,n}}}{{{d}_{0}}}\right)+{{X}_{\sigma}}+PL_{m,n}^{Slant}\text{,} (3)

where ff represents the carrier frequency, d0{{d}_{0}} is the reference distance, and cc represents the speed of signal. η\eta is the path loss factor, XσX_{\sigma} is the shadowing fading, which is a Gaussian random variable with zero-mean and standard deviation σ\sigma. PLm,nSlantPL_{m,n}^{Slant} represents the additional path loss in forest [12].

Therefore, the channel power gain between mm-th UAV and nn-th user can be expressed as

gm,n=10PLm,n10.{{g}_{m,n}}={{10}^{\frac{-{PL}_{m,n}}{10}}}\text{.} (4)

II-A Communication Model

During the communication phase, we define the matrix 𝐀{{\mathbf{A}}} to represent the communication association between UAVs and users. Let αm,n=1\alpha_{m,n}=1 if mm-th UAV provides communication service to nn-th user, otherwise αm,n=0\alpha_{m,n}=0. When αm,n=1\alpha_{m,n}=1, we consider the signal-to-interference-plus-noise ratio (SINR) of nn-th user receiving mm-th UAV as

γm,n=pm,ngm,nimipi,ngi,n+ψ,\gamma_{m,n}=\frac{{{p}_{m,n}}{{g}_{m,n}}}{\sum\nolimits_{i\neq m}^{i\in\mathcal{M}}{{{p}_{i,n}}{{g}_{i,n}}+\psi}}\text{,} (5)

where pm,n{{p}_{m,n}} is the transmitting power allocated by mm-th UAV to nn-th user, assuming that each UAV equally allocates the transmit power to the served users such that pm,n=/pmaxj𝒩αm,j{{p}_{m,n}}={}^{{{p}_{\max}}}/{}_{\sum\nolimits_{j\in\mathcal{N}}{\alpha_{m,j}}}, where pmaxp_{max} denotes the transmit power limit of the UAV. To ensure the communication signal quality for the served users, each UAV can service up to KC{{K}_{C}} users in communication phase and the SINR for each user should be greater than γC{{\gamma}_{C}}.

We use throughput to measure the communication performance, the throughput of nn-th user can be expressed as

Rn=bm,nlog(1+γm,n),{{R}_{n}}={{b}_{m,n}}\log\left(1+{{\gamma}_{m,n}}\right)\text{,} (6)

where bm,n{{b}_{m,n}} is the bandwidth allocated by mm-th UAV to nn-th user, assuming that each UAV distributes its total bandwidth bmaxb_{max} evenly across all the users it services such that bm,n=/bmaxj𝒩αm,j{{b}_{m,n}}={}^{b_{max}}/{}_{\sum\nolimits_{j\in\mathcal{N}}{\alpha_{m,j}}}.

II-B Localization Model

Refer to caption
Figure 1: An overview of the UAV-assisted ISAC network

During the localization phase, the ground device calculates its location by estimating the time difference of arrival (TDOA) of signals emitted by the UAVs. Similarly, we define a matrix 𝐁{{\mathbf{B}}} to represent the location association between the UAVs and the users, where βm,n=1\beta_{m,n}=1 when mm-th UAV provides location service for nn-th user; otherwise βm,n=0\beta_{m,n}=0. To ensure the localization signal quality for the served users, each UAV is assumed to be able to serve up to KP{K}_{P} users in the localization phase and the corresponding SINR must be greater than the threshold γP{\gamma}_{P}.

We adopt PDOP as the indicator to measure the localization accuracy. When the user’s location can be calculated, we define the UAVs localization service group for nn-th user as ϵn={mi,n|i{1,2,3}mi,n}\epsilon_{n}=\{{{m}_{i,n}}|i\in\{1,2,3\}\wedge{{m}_{i,n}}\in\mathcal{M}\}, the PDOP of nn-th user can be expressed as follows, according to [13],

ρn=tr{[𝐇nT𝐇n]1}.{{\rho}_{n}}=\sqrt{tr\{[\mathbf{H}_{n}^{T}\mathbf{H}_{n}]^{-1}\}}\text{.} (7)

Assume that mn,1{{m}_{n,1}}-th UAV serves as the TDOA reference station in ϵn\epsilon_{n}, for simplicity of exposition, we define unit vectors 𝐯m,n=𝐮m𝐨ndm,n\mathbf{v}_{m,n}=\frac{\mathbf{u}_{m}-\mathbf{o}_{n}}{d_{m,n}}, where 𝐇n\mathbf{H}_{n} is given by

𝐇n=[𝐯m2,n,n𝐯m1,n,n𝐯m3,n,n𝐯m1,n,n].\mathbf{H}_{n}=\begin{bmatrix}\mathbf{v}_{m_{2,n},n}-\mathbf{v}_{m_{1,n},n}\\ \mathbf{v}_{m_{3,n},n}-\mathbf{v}_{m_{1,n},n}\end{bmatrix}\text{.} (8)

We define a piecewise function Gn{{G}_{n}} to represent the localization performance of nn-th user

Gn={ρn,ρnρmax,ρmax,otherwise,{{G}_{n}}=\left\{\begin{matrix}{{\rho}_{n}}\text{,}&{{\rho}_{n}}\leq{{\rho}_{\max}}\text{,}\\ {{\rho}_{\max}}\text{,}&\text{otherwise}\text{,}\\ \end{matrix}\right. (9)

where ρmax{{\rho}_{\max}} is the upper limit threshold of PDOP.

II-C Problem Formulation

We consider a weighted sum utility function to measure the ISAC performance, which is given by

χn=(λRn+(1λ)1Gn),{{\chi}_{n}}=(\lambda{{R}_{n}}+(1-\lambda)\frac{1}{{{G}_{n}}})\text{,} (10)

where λ\lambda represents the weight of communication and localization performance. Because Rn{{R}_{n}} and 1Gn\frac{1}{{{G}_{n}}} have different dimensions and magnitudes, we use the max-min normalization method to standardize the two indicators.

We hope to optimize UAV locations under a known UAV-user association strategy. Let 𝒰={𝐮m|m}\mathcal{U}=\{{{\mathbf{u}}_{m}}|m\in\mathcal{M}\} represent the UAV locations, and 𝒜={𝐀,𝐁}\mathcal{A}=\{{{\mathbf{A}}},{{\mathbf{B}}}\} represent the association strategy between UAVs and users. The goal is to jointly optimize 𝒰\mathcal{U} and 𝒜\mathcal{A}, in order to maximize the ISAC performance of all users. This optimization problem can be formulated as

max𝒰,𝒜n𝒩χn\displaystyle\underset{\begin{smallmatrix}\mathcal{U},\mathcal{A}\end{smallmatrix}}{\mathop{\max}}\,\sum\nolimits_{n\in\mathcal{N}}{{{\chi}_{n}}} (11a)
s.t.\displaystyle s.t.\quad m=1Mαm,n1,n𝒩,\displaystyle\sum\limits_{m=1}^{M}{\alpha_{m,n}}\leq 1,\forall n\in\mathcal{N}\text{,} (11b)
m=1Mβm,n3,n𝒩,\displaystyle\sum\limits_{m=1}^{M}{\beta_{m,n}}\leq 3,\forall n\in\mathcal{N}\text{,} (11c)
n=1Nαm,nKC,m,\displaystyle\sum\limits_{n=1}^{N}{\alpha_{m,n}}\leq{{K}_{C}},\forall m\in\mathcal{M}\text{,} (11d)
n=1Nβm,nKP,m,\displaystyle\sum\limits_{n=1}^{N}{\beta_{m,n}}\leq{{K}_{P}},\forall m\in\mathcal{M}\text{,} (11e)
γm,n>γC,αm,n=1,\displaystyle\gamma_{m,n}>{{\gamma}_{C}},\alpha_{m,n}=1\text{,} (11f)
γm,n>γP,βm,n=1,\displaystyle\gamma_{m,n}>{{\gamma}_{P}},\beta_{m,n}=1\text{,} (11g)
zmhmin,m,\displaystyle{{z}_{m}}\geq{{h}_{\min}},\forall m\in\mathcal{M}\text{,} (11h)
dm1,m2dmin,m1,m2,m1m2,\displaystyle{{d}_{m_{1},m_{2}}}\geq{{d}_{min}},\forall m_{1}\in\mathcal{M},\forall m_{2}\in\mathcal{M},m_{1}\neq m_{2}\text{,} (11i)

where (11b) indicates that each user can be served by at most one UAV, (11c) represents that at most three UAVs collaborate in locating each user, (11d) and (11e) represent the maximum number of communication and localization users that can be accessed by each UAV, respectively, (11f) and (11g) represent the minimum SINR for communication and localization signals of users, respectively, (11h) represents the minimum flight altitude, and (11i) represents the minimum safety distance between UAVs.

III UAVs Deployment Based On CNN

The UAV deployment problem formulatd in Section II is a mixed integer nonlinear programming problem that is difficult to solve. Traditionally, solving this problem need to calculate ISAC performance of all users. However it will cost a lot of time. Therefore, We propose a new UAV deployment method named CNN-based Deployment Agent (CNNDA), which can achieve rapid UAV depolyment. Fig. 2 illustrates the processing flow of the proposed CNNDA, which consists of offline and online stage. In the offline stage, we design a DPSO algorithm to optimize the UAV locations. Based on this, we construct a dataset consisting of pairs of optimal UAV locations and user locations. Then, we train a CNN model using this dataset to learn the mapping function from users distribution to optimal UAV locations. In the online stage, we can quickly obtain UAV locations with the trained CNN model.

Refer to caption
Figure 2: Process flow of the CNNDA framework

III-A Dataset Construction

In the offline stage, we build an optimal UAV locations dataset to train CNN. Specifically, we first generate KD{K_{D}} pieces of ground user locations according to random distribution, and we define the kk-th data as 𝒪k={𝐨k,1,,𝐨k,N}{\mathcal{O}_{k}}=\{\mathbf{o}_{k,1},\cdots,\mathbf{o}_{k,N}\}. Then, we compute KD{K_{D}} corresponding optimal UAV locations according to the proposed DPSO. There are various methods to obtain the optimal UAV locations, such as exhaustive algorithm and heuristic algorithms. Considering exhaustive algorithm requires a lot of time and computational resources, we adopts a heuristic method to obtain the optimal UAV locations.

The UAV deployment problem formulatd in Section II can be divided into two subproblems: one is to determine the association between UAVs and users 𝒜\mathcal{A}, and another is to find the optimal UAV locaTtions 𝒰\mathcal{U}. When solving 𝒜\mathcal{A}, we need to ensure that 𝒰\mathcal{U} is fixed, and vice versa.

III-A1 UAV-UE Association

To solve the subproblem of association between UAVs and users, We propose the benefit based greedy association strategy (BGAS) that takes into account both service quality and UAV capacity constraints. Specifically, we first define the reward functions of communication and localization for user nn as qnCq^{C}_{n} and qnPq^{P}_{n}, respectively. Once 𝒰\mathcal{U} is determined, we calculate two benefits for all users within the communication or localization coverage range. Based on these benefits, we sort all users in descending order and associate the top-ranked user with corresponding UAV.

In the communication phase, we define the communication benefit qnCq_{n}^{C} for nn-th user, which is expressed as

qnC=γm,n,m=argmaxiSnγi,n,{q}_{n}^{C}={{\gamma}_{m,n}},\ m=\underset{i\in S_{n}}{\mathop{\arg\max}}\,\gamma_{i,n}\text{,} (12)
Sn={i|iγi,n>γCKC(i)>0},S_{n}=\left\{i|i\in\mathcal{M}\wedge\gamma_{i,n}>\gamma_{C}\wedge{{K}_{C}}(i)>0\right\}\text{,} (13)

where KC(i){{K}_{C}}(i) represents the remaining communication connections of ii-th UAV, and SnS_{n} represents the set of UAVs that can provide communication services for nn-th user.

In the localization phase, we define the localization benefit of nn-th user as qnPq_{n}^{P}, which is represented as

qnP=ρwn,n,wn=argminwnWnρwn,n,q_{n}^{P}={{\rho}_{{{w}_{n}},n}},w_{n}=\underset{w_{n}\in{{W}_{\text{n}}}}{\mathop{\arg\min}}\,{{\rho}_{w_{n},n}}\text{,} (14)
Wn={ϵi|mϵiγm,n>γPKP(m)>0},{{W}_{n}}=\left\{{\epsilon_{i}}|\forall m\in{\epsilon_{i}}\wedge{{\gamma}_{m,n}}>\gamma_{P}\wedge{{K}_{P}}(m)>0\right\}\text{,} (15)

where KP(m){{K}_{P}}(m) represents the remaining localization connection times for mm-th UAV, and Wn{{W}_{n}} represents the set of UAVs that can provide localization services for nn-th user.

III-A2 UAV deployment optimization

In the subproblem of optimizing UAV locations, we propose a DPSO algorithm address issues such as local optima and slow convergence in traditional PSO algorithm. In DPSO, each particle location ll represents the location of all UAVs: l={𝒖1,𝒖2,𝒖M}l=\left\{{{\bm{u}}_{1}},{{\bm{u}}_{2}},\cdots{{\bm{u}}_{M}}\right\}, the main process of DPSO is as follows:

2.a) Initialization Stage: The initial population contains kk particles, i.e., ={l1,l2,,lk}\ell=\left\{{{l}_{1}},{{l}_{2}},\cdots,{{l}_{k}}\right\} . We use the KMeans algorithm to initialize the location of a certain particle lil_{i}. Specifically, we calculate MM users cluster centers as the initial locations of the MM UAVs according to KMeans and set the value of lil_{i}. Other particles are initialized randomly. Finally, we evaluate all particles and set each individual’s optimal location pip_{i} and the population’s optimal location gg.

2.b) Evaluation Stage: Based on BGAS, we can get the UAV-user association 𝒜\mathcal{A} for each particle. We then calculate the fitness value for each particle. We define a fitness function F(l)F(l) to evaluate each particle, F(l)F(l) is related to the optimization objective and its constraints, and it can be expressed as

F(l)={(11a),(11h) and (11i),Fmin,otherwise,F(l)=\left\{\begin{matrix}(11a),&\text{(11h) and (11i),}\\ {{F}_{\min}},&otherwise\text{,}\\ \end{matrix}\right. (16)

where FminF_{min} is the minimum fitness in the current population.

1 Initialize DPSO parameters;
2for episode kk:=11,…,KDK_{D} do
3      
4      Generate user locations 𝒪k{\mathcal{O}_{k}} randomly;
5      Initialize particles by KMeans and Random;
6      for epoch tt:=11,…,tmaxt_{max} do
7            
8            Calculate UAV-UE association 𝒜\mathcal{A} by BGAS;
9            Calculate w(t)w(t) according to (19);
10            Calculate c1(t)c_{1}(t), c2(t)c_{2}(t) according to (20);
11            for each particle ii do
12                  
13                  Calculate fitness F(li)F(l_{i}) according to (16);
14                  Update viv_{i}, lil_{i} according to (17), (18) respectively;
15                  if F(li)>F(pi)F(l_{i})>F(p_{i}) then
16                        
17                        Update the individual best location pip_{i};
18                  
19                  if F(li)>F(g)F(l_{i})>F(g) then
20                        
21                        Update the population best location gg;
22                  
23            
24      
25      Save the users’ location 𝒪k{{\mathcal{O}}_{k}} and final UAVs’ location 𝒰k=g{{\mathcal{U}}_{k}}=g as a piece of dataset;
26
Get a dataset 𝐃\mathbf{D}.
Algorithm 1 Dataset Construction in the Offline stage

2.c) Update Stage: For particle ii, if F(li)F(l_{i}) is better than its individual best fitness F(pi)F(p_{i}), then update the individual best location pip_{i}. If F(li)F(l_{i}) is better than its population best fitness F(g)F(g), then update the population best location gg. The update strategy of velocity viv_{i} and location lil_{i} are expressed by

vi(t+1)=wvi(t)+c1(t)r1(pili(t))+c2(t)r2(gili(t)),v_{i}(t+1)=wv_{i}(t)+{{c}_{1}(t)}{{r}_{1}}({{p}_{i}}-l_{i}(t))+{{c}_{2}(t)}{{r}_{2}}({{g}_{i}}-l_{i}(t))\text{,} (17)
li(t+1)=li(t)+vi(t+1),l_{i}(t+1)=l_{i}(t)+v_{i}(t+1)\text{,} (18)

where tt represents the number of iterations, ww is the inertia weight, c1c_{1} and c2c_{2} are learning factors, r1r_{1} and r2r_{2} are randomly generated numbers between [0,1][0,1]. Distinguishing from the traditional PSO algorithm, we introduce a non-linearly decreasing w(t)w(t). In the early stages of the search, w(t)w(t) is a larger value, allowing particles to explore the search space globally. In the later stages, w(t)w(t) is a smaller value, facilitating local optimization, which is expressed as

w(t)=(winiwend0.2)exp(11+7t/tmax),{w(t)}=({{w}_{ini}}-{{w}_{end}}-0.2)\exp(\frac{1}{1+{7t}/{{{t}_{\max}}}})\text{,} (19)

where wini{w}_{ini} represents the initial value of the inertia weight, wend{w}_{end} represents the final value of the inertia weight, and tmaxt_{max} represents the maximum number of iterations.

For c1(t)c_{1}(t) and c2(t)c_{2}(t), when c1(t)c2(t)c_{1}(t)\geq c_{2}(t), the particle tends to move to the individual’s best location, conversely prefer population best. In DPSO, we make c1(t)c_{1}(t) non-linearly decreasing and c2(t)c_{2}(t) non-linearly increasing. This allows us to emphasize global search capability in the early stage and shift towards local search capability in the later stage. c1(t)c_{1}(t) and c1(t)c_{1}(t) share the same formula, which is expressed as

c(t)=cend+(cinicend)(tmaxt)1.2t1.2,{{c}(t)}={{c}_{end}}+({{c}_{ini}}-{{c}_{end}})\frac{{{({{t}_{\max}}-t)}^{1.2}}}{{{t}^{1.2}}}\text{,} (20)

where cini{c}_{ini} and cend{c}_{end} respectively represent the initial value and final value of the learning factors during the iteration process.

Algorithm 1 shows how to get a dataset. According to BGAS+DPSO, we can calculate the near-optimal UAVs deployment set 𝒰k{\mathcal{U}_{k}} for each user information 𝒪k{{\mathcal{O}}_{k}}. By repeating BGAS+DPSO KD{K_{D}} times, we can obtain the optimal strategy dataset 𝐃\mathbf{D} of KD{K_{D}} samples, which can be represented as

𝐃=[(𝒪1,𝒰1),,((𝒪KD,𝒰KD].\mathbf{D}=[({{\mathcal{O}}_{1}},{{\mathcal{U}}_{1}}),\cdots,(({{\mathcal{O}}_{{{K}_{D}}}},{{\mathcal{U}}_{{{K}_{D}}}}]\text{.} (21)

III-B Training the CNN

After obtaining the dataset 𝐃\mathbf{D}, we train a CNN model to obtain a mapping function from users to UAV locations. Our model consist of input layer, feature layer and output layer.

1) Input: We divide the ground area into L×LL\times L grids and define the kk-th data’s user distribution matrix OkO_{k}. In the input layer, we perform feature enhancement on the original user distribution data to solve gradient disappearance caused by the sparse users distribution. Specifically, for the grid position of nn-th user (in,jn)(i_{n},j_{n}), and we use the gaussian enhancement method to calculate the pixel value of the ii-th row and jj-th column of the matrix, which can be expressed as

𝐎k,(i,j)=max{exp((iin)2+(jjn)22ξ2)|n𝒩},{{\mathbf{O}}_{k,(i,j)}}=\max\left\{\exp(\frac{{{(i-{{i}_{n}})}^{2}}+{{(j-{{j}_{n}})}^{2}}}{-2{{\xi}^{2}}})|\forall n\in\mathcal{N}\right\}\text{,} (22)

where ξ\xi is the standard deviation of Gaussian enhancement. In this way, we enrich the image data without interfering with the original user distribution data, which is more conducive to the learning of CNN.

Refer to caption
Figure 3: The convergence of the proposed algorithm and its benchmarks

2) Network structure: The CNN model consists of a feature extraction layer and a decision mapping layer. The feature extraction layer consists of two convolutional layers. The first convolutional layer consists of a 64-channel 7×77\times 7 convolution kernel and a maximum pooling layer. The second The first convolutional layer consists of a 128-channel 3×33\times 3 convolutional kernel and a maximum pooling layer. The decision mapping layer uses a multi-layer perceptron, and its input layer, hidden layer, and output layer are composed of 1024, 2048, and 3M3M neurons respectively; all neurons in the network use ReLU as the activation function.

3) Output: The output of the CNN corresponds to all UAV locations, which can be represented as

𝐔=[x1,y1,z1;x2,y2,z2;;xM,yM,zM].\mathbf{U}=[{{x}_{1}},{{y}_{1}},{{z}_{1}};{{x}_{2}},{{y}_{2}},{{z}_{2}};\cdots;{{x}_{M}},{{y}_{M}},{{z}_{M}}]\text{.} (23)

We adopt Mean Squared Error (MSE) as the loss function[14]. Additionally, we set pruning probabilities for each neuron to avoid overfitting problem[15].

IV Simulation Results

In this section, we evaluate the performance of the proposed method through simulations. We consider a 55 km×\times 55 km area. The default values of main parameters about UAVs are: M=5M=5, KC=10K_{C}=10, KD=30K_{D}=30, hmin=50h_{min}=50 m, dmin=100d_{min}=100 m, γC=3\gamma_{C}=3 dB, γP=1\gamma_{P}=1 dB, pmax=3p_{max}=3 w and bmax=1b_{max}=1 MHz. For the forest channel, the default values of main parameters are: A=0.25A=0.25, C=0.39C=0.39, E=0.25E=0.25, G=0G=0, H=0.05H=0.05, σ=6\sigma=6, η=5\eta=5, ψ=140\psi=-140 dbm and f=1.4f=1.4 GHz. In the DPSO algorithm, the number of iteration is 200. We construct a dataset of 10,000 sample, and the ratio of training set to test set is 9:1. Since the CNNDA is a two-stage algorithm, we analyze and evaluate the algorithm performance from the offline and online stages.

Refer to caption
Figure 4: The ISAC performance versus the number of users.

For the offline stage, we compared three different algorithms, namely PSO, dynamic weight PSO (DWPSO), dynamic learning factor PSO (DCPSO) to verify the effectiveness of DPSO. Fig. 3 shows the convergence results of different PSO algorithms, and the vertical axis is the ISAC utility. After curve comparison, it can be observed that all algorithms converge before 100 generations, but compared with other three algorithm, DPSO can effectively help the algorithm jump out of the local optimal and improve the ISAC performance.

For the online stage, we considered the generalization ability of the CNN model under different user quantity scenarios from the perspectives of ISAC performance and algorithm execution time. We selected the heuristic algorithm PSO and the machine learning algorithm Kmeans, as well as the proposed DPSO algorithm during offline optimization, as benchmark algorithms. In terms of ISAC performance, Fig. 4 shows our CNN model outperforms PSO and Kmeans in different user quantity scenarios, and its performance is closed to DPSO, which effectively learns the optimization ability of DPSO. In terms of algorithm execution time, Fig. 5 shows that our CNN model exhibits a deployment time close to the Kmeans, and its CPU execution time remains between 0.3-0.4s. Compared with heuristic algorithms such as DPSO and PSO, the CPU computing time of the proposed method is reduced by more than 96%. Since the CNN model does not need to iteratively calculate the ISAC performance of each user, its deployment time is short and stable for any user scenario.

V Conclusions

This paper investigates ISAC UAV deployment a in emergency scenarios with the aim of achieving desirable performance with short deployment time. The proposed CNNDA algorithm takes user locations as input and quickly determines UAV locations through the trained CNN model. This algorithm avoids computing the ISAC performance for each user, which is a time-consuming process. The simulation results show that the CNN model can achieve excellent ISAC performance on the one hand, and on the other hand, its execution time is maintained between 0.3-0.4s, which reduces the deployment time by more than 96% compared with other iterative algorithms. Additionally, the CNN model exhibits excellent generalization ability, as its execution time remains stable and at a second-level across scenarios with varying numbers of users.

Refer to caption
Figure 5: Algorithm execution time versus the number of users.

References

  • [1] L. Wang, J. Zhang, J. Chuan, R. Ma and A. Fei, ”Edge Intelligence for Mission Cognitive Wireless Emergency Networks,” in IEEE Wireless Communications, vol. 27, no. 4, pp. 103-109, August 2020.
  • [2] W. Li, L. Wang and A. Fei, ”Minimizing Packet Expiration Loss With Path Planning in UAV-Assisted Data Sensing,” in IEEE Wireless Communications Letters, vol. 8, no. 6, pp. 1520-1523, Dec. 2019.
  • [3] S. Bi, J. Yu, Z. Yang, X. Lin and Y. Wu, ”Joint 3-D Deployment and Resource Allocation for UAV-Assisted Integrated Communication and Localization,” in IEEE Wireless Communications Letters, vol. 12, no. 10, pp. 1672-1676, Oct. 2023.
  • [4] K. Zhuang, L. Xu, L. Li, L. Wang and A. Fei, ”GA-MADDPG: A Demand-Aware UAV Network Adaptation Method for Joint Communication and Positioning in Emergency Scenarios,” 2023 IEEE Wireless Communications and Networking Conference (WCNC), pp. 1-6, 2023.
  • [5] L. Wang, H. Tang, H. Wu and G. L. Stüber, ”Resource Allocation for D2D Communications Underlay in Rayleigh Fading Channels,” in IEEE Transactions on Vehicular Technology, vol. 66, no. 2, pp. 1159-1170, Feb. 2017.
  • [6] L. Wang, H. Wu, Z. Han, P. Zhang and H. V. Poor, ”Multi-Hop Cooperative Caching in Social IoT Using Matching Theory,” in IEEE Transactions on Wireless Communications, vol. 17, no. 4, pp. 2127-2145, April 2018.
  • [7] L. D. Nguyen, K. K. Nguyen, A. Kortun and T. Q. Duong, ”Real-Time Deployment and Resource Allocation for Distributed UAV Systems in Disaster Relief,” 2019 IEEE 20th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), Cannes, France, pp. 1-5, 2019.
  • [8] H. Wang, H. Zhao, W. Wu, J. Xiong, D. Ma and J. Wei, ”Deployment Algorithms of Flying Base Stations: 5G and Beyond With UAVs,” in IEEE Internet of Things Journal, vol. 6, no. 6, pp. 10009-10027, Dec. 2019.
  • [9] J. Liu, Q. Wang, X. Li and W. Zhang, ”A Fast Deployment Strategy for UAV Enabled Network Based on Deep Learning,” 2020 IEEE 31st Annual International Symposium on Personal, Indoor and Mobile Radio Communications, pp. 1-6, 2020.
  • [10] H. Wu, F. Lyu, C. Zhou, J. Chen, L. Wang and X. Shen, ”Optimal UAV Caching and Trajectory in Aerial-Assisted Vehicular Networks: A Learning-Based Approach,” in IEEE Journal on Selected Areas in Communications, vol. 38, no. 12, pp. 2783-2797, Dec. 2020.
  • [11] C. R. Anderson, H. I. Volos and R. M. Buehrer, ”Characterization of Low-Antenna Ultrawideband Propagation in a Forest Environment,” in IEEE Transactions on Vehicular Technology, vol. 62, no. 7, pp. 2878-2895, Sept. 2013.
  • [12] L. Wang, X. Wu, Y. Wang, Z. Xiao, L. Li and A. Fei, ”On UAV Serving Node Deployment for Temporary Coverage in Forest Environment: A Hierarchical Deep Reinforcement Learning Approach,” in Chinese Journal of Electronics, vol. 32, no. 4, pp. 760-772, July 2023.
  • [13] S. Yuan, Y. Teng and H. Li, ”Comparison of Position Dilution of Precision (PDOP) at System and User Level in Multi-GNSS Positioning,” 2020 IEEE 3rd International Conference on Electronics and Communication Engineering (ICECE), pp. 47-50, 2020.
  • [14] I. Goodfellow, Y. Bengio, and A. Courville, Deep Learning. Cambridge, MA, USA: MIT Press, Nov. 2016.
  • [15] N Srivastava et al. , ”Dropout: A simple way to prevent neural networks from overfitting,” The journal of machine learning research, vol. 15, no. 1, pp. 1929-1958, Jan. 2014.