5G Air-to-Ground Network Design and Optimization: A Deep Learning Approach

Yun Chen¹, Xingqin Lin², Talha Khan², Mehrnaz Afshang², and Mohammad Mozaffari² This work was done while Yun Chen was with Ericsson Research, USA.
¹The University of Texas at Austin, USA, Email: [email protected]
²Ericsson Research, Santa Clara, CA, USA,
Emails: {xingqin.lin, talha.khan, mehrnaz.afshang, mohammad.mozaffari}@ericsson.com

Abstract

Direct air-to-ground (A2G) communications leveraging the fifth-generation (5G) new radio (NR) can provide high-speed broadband in-flight connectivity to aircraft in the sky. A2G network deployment entails optimizing various design parameters such as inter-site distances, number of sectors per site, and the up-tilt angles of sector antennas. The system-level design guidelines in the existing work on A2G network are rather limited. In this paper, a novel deep learning-based framework is proposed for efficient design and optimization of a 5G A2G network. The devised architecture comprises two deep neural networks (DNNs): the first DNN is used for approximating the 5G A2G network behavior in terms of user throughput, and the second DNN is developed as a function optimizer to find the throughput-optimal deployment parameters including antenna up-tilt angles and inter-site distances. Simulation results are provided to validate the proposed model and reveal system-level design insights.

I Introduction

Despite phenomenal advances in terrestrial mobile communications, providing broadband in-flight connectivity (IFC) to aircraft passengers remains a pain point [1]. IFC services can be provided using satellites or direct air-to-ground (A2G) communications. Satellites have the advantage of a global coverage spanning both land and sea, which makes them suitable for intercontinental flights. Satellite-based IFC services, however, suffer from a limited system capacity and long latency [2]. The alternative approach based on A2G communications leverages cellular technology to establish direct connectivity between terrestrial base-stations (BSs) and aircraft [3, 4, 5]. For instance, the European Aviation Network connects the European skies using satellites in combination with an A2G network based on long-term evolution (LTE) [3]. The Gogo Biz A2G network uses a variant of the third-generation (3G) code division multiple access (CDMA) 2000 technology to provide IFC in North America [3]. A2G networks offer a larger system capacity and shorter latencies than satellites [3] but their coverage is limited to over land or along coastal belts. Therefore, IFC solutions based on A2G and satellite communications complement each other.

The advent of 5G new radio (NR) offers new opportunities to enhance A2G performance. The existing A2G systems are based on older generations and suffer from a limited system capacity, resulting in lower data rates for the end users [3]. An NR-based A2G network can benefit from the large bandwidth, increased spectral efficiency, low latency, advanced antenna technologies, and ultra-lean design attributes of NR [6]. With this motivation, the 3rd generation partnership project (3GPP) has been developing specifications for NR-based non-terrestrial networks which include support for A2G communications.

There are several differences between an A2G network and a conventional cellular network although both employ terrestrial BSs. First, the traditional BSs use down-tilted antennas to serve terrestrial users whereas A2G BSs use up-tilted antennas to face the sky. Second, A2G networks typically have larger inter-site distances (ISD) than the traditional networks due to sparsely dispersed traffic demand. Third, the high aircraft speed in A2G networks poses stringent requirements on mobility. Therefore, the system-level design insights inherited from traditional terrestrial cellular networks are not directly applicable to A2G networks.

A2G network deployment entails optimizing various design parameters including the ISD, the number of sectors per BS site, and up-tilt angles for sector antennas. The optimal design depends on various factors such as traffic profile, altitude range, interference characteristics and cell load. The system design guidelines are rather obscure in the existing literature on A2G networks [7, 8, 9, 3]. In [7], several possibilities were outlined to enhance existing LTE systems for A2G communications. In [8], a performance comparison for IFC was presented for a system based on A2G and satellite communications. In [9], preliminary simulations results were provided for an NR-based A2G system. In [3], the technical issues facing an NR A2G system were discussed and its performance was evaluated in a range of frequency bands. In short, the prior work has largely been limited to A2G performance evaluation under different scenarios. This motivates the need for A2G network deployment optimization to enable broadband connectivity in the sky.

The problem of A2G network optimization with various interdependent parameters is challenging and requires efficient solutions. A promising approach for solving complex optimization problems is to exploit tools from deep learning [10, 11, 12]. In [10], an optimization framework that combined evolutionary search with deep neural networks (DNNs) was proposed for solving optimization problems within the class of the maximum satisfiability problem. A power allocation problem to maximize the signal-to-interference-plus-noise-ratio (SINR) was considered in [11], where the input and output of a resource allocation algorithm was treated as an unknown non-linear mapping and a fully-connected DNN was used to approximate it. In [12], a set of network features was identified via deep learning for a link scheduling problem. The final solution was obtained using branch and bound or dynamic programming methods. These methods are only suitable for specific models that rely on traditional optimization algorithms.

The main contribution of this paper is a novel deep learning-based framework developed for A2G network design and deployment optimization. We propose a bi-DNN architecture consisting of two DNNs to model the A2G network behavior and solve network optimization problems involving numerous parameters. Specifically, the first DNN is trained to approximate the user throughput in an A2G network by emulating a system-level A2G network simulator. The second DNN is designed to optimize the network design parameters including the antenna up-tilt angles and ISD. We provide system-level simulation results to validate our model and evaluate the downlink (DL) performance of an NR A2G network in terms of user throughput and SINR. The results reveal useful insights on system performance under different antenna configurations, ISDs, number of sectors, and traffic loads.

II System Model

We consider a dedicated A2G network consisting of a hexagonal tessellation of terrestrial BSs that exclusively serve multiple airborne aircraft within a two-dimensional plane at a certain altitude from the ground. We assume that all BSs are identical and each BS has S sectors where the antenna for sector $i\in\{0,\cdots,\textrm{S}-1\}$ is up-tilted with an angle $\Theta_{i}\in[\Theta_{\min},\,\,\,\Theta_{\max}]$ with $\Theta_{\min},\Theta_{\max}\in[0\degree,\,\,\,90\degree]$ . We define $\mathbf{\Theta}=[\Theta_{0},\cdots,\Theta_{\textrm{S}-1}]$ . We note that $\Theta_{i}=90\degree$ means that sector $i$ 's antenna boresight is pointed upward. All sectors are otherwise identical in terms of the antenna beam pattern and transmit power. The horizontal (azimuthal) plane is split uniformly among the sectors (see Fig. 1). At a given BS, the up-tilt angle for each sector can be set independently. Given the symmetrical tessellation, we assume that the same up-tilt angle configuration $\mathbf{\Theta}$ applies to all BSs. We focus on three practically relevant cases for the number of sectors per site, namely $\textrm{S}\in\{1,3,4\}$ . We use $d$ to denote the ISD of the hexagonal tessellation. We consider the 3GPP rural macrocell spatial channel model specialized to line-of-sight propagation conditions. We assume full frequency reuse such that the received DL signal at an aircraft is subjected to thermal noise as well as inter-cell interference. We summarize the notation used in this paper in Table I.

Refer to caption — Figure 1: Network model for A2G communications.

TABLE I: Summary of notations.

Notation	Definition
S	Number of sectors per site
$t_{X}$	$X$ th-percentile user throughput
$l$	DL higher layer traffic load
$d$	ISD
$\mathbf{\Theta}$	$1\times\textrm{S}$ vector of antenna up-tilt angles
$\mathcal{N}_{A}$	Approximator-NN approximates user throughput with $d$ , $\mathbf{\Theta}$ and $l$ as inputs
$\mathcal{N}_{D}$	Optimizer-NN outputs a batch of throughput-optimal ISD and antenna configurations
D	${n\times(\textrm{S}+2-c)}$ output from $\mathcal{N}_{D}$
$n$	Number of deployments in a batch output from $\mathcal{N}_{D}$
$c$	Number of fixed parameters
$\tau$	Number of training steps per epoch for training $\mathcal{N}_{D}$
$\epsilon_{\min}$	Minimum error for each epoch when training $\mathcal{N}_{D}$
$\epsilon^{*}$	Minimum error across all epochs when training $\mathcal{N}_{D}$
$k$	Number of iterations for which $\epsilon^{*}$ remains unchanged
$k_{\max}$	Maximum value for $k$ until the training for $\mathcal{N}_{D}$ stops

We aim to find the A2G network deployment configuration in terms of the ISD and up-tilt angles that maximizes the user throughput under the given set of system parameters. The user throughput is a random quantity that depends on the SINR which itself is a function of various factors such as antenna configuration, channel gain, user distribution, network geometry and traffic load. We consider the $X$ -th percentile user throughput $t_{X}$ in our objective function. For example, half of the aircraft in a cell experience a throughput of at least $t_{50}$ . Specifically, we treat the optimization problem:

\displaystyle\begin{array}[]{rrclcl}\displaystyle\max_{d,\mathbf{\Theta},l}&t_{X}\\ \textrm{s.t.}&\displaystyle l_{\min}\leq&l&{}\\ &\displaystyle d_{\min}\leq&d&\leq d_{\max}\\ &\displaystyle\Theta_{\min}\leq&\mathbf{\Theta}&\leq\Theta_{\max}\\ \end{array}

(5)

where $l\geq l_{\min}$ is the DL traffic load which captures the higher-layer traffic demand in the network. The traffic load is a proxy for the average physical layer resource utilization in a cell. A higher traffic load results in a higher resource utilization, increasing the inter-cell interference. We note that the constraints $d_{\min}$ and $d_{\max}$ on ISD are informed by economic factors such as capital and operating expenses for a site. In our model, we set $\Theta_{\min}=0\degree$ and $\Theta_{\max}=90\degree$ .

The optimization problem in (5) is non-convex and nonlinear. The objective function depends on a myriad of interdependent underlying factors which further compounds the problem. In practical scenarios, finding a tractable expression for the considered objective function is usually infeasible. Consequently, one has to resort to exhaustive system-level simulations which require time and compute resources. This calls for an efficient technique for modeling and solving the considered optimization problem for A2G networks.

III Bi-DNN Based Network Deployment Optimization

In this section, we describe the proposed bi-DNN architecture for A2G network deployment optimization. The bi-DNN model illustrated in Fig. 2 consists of two DNNs, namely, Approximator-NN and Optimizer-NN.

III-A Approximator-NN

Our goal is to replace the cumbersome system-level NR simulator with an agile entity to expedite the training process for Optimizer-NN. Approximator-NN, denoted as $\mathcal{N}_{A}(\cdot)$ , is a DNN designed to model the behavior of an NR A2G network and estimate its performance. Specifically, it approximates the $X$ th-percentile user throughput for a given set of input parameters consisting of ISD, antenna up-tilt angles and traffic load. All other system parameters such as the number of BSs and the number of sectors per site remain fixed. We use the data generated from a 5G NR system-level simulator for the training and validation of Approximator-NN. Formally,

\displaystyle t_{X}

\displaystyle=\mathcal{N}_{A}(d,\mathbf{\Theta},l;\textbf{w}_{A})

(6)

denotes the approximated user throughput output by Approximator-NN, and ${\textbf{w}_{A}}$ denotes the neural network (NN) parameters which are initialized randomly. As illustrated in Fig. 3a, the DNN consists of 3 fully connected hidden layers with a rectified linear unit (ReLU) [13] as the activation function. For training, we select the mean squared error (MSE) based on the throughput outputs of $\mathcal{N}_{A}(\cdot)$ and the NR simulator as the loss function, and use Adam [14] as the optimizer.

III-B Optimizer-NN

Optimizer-NN, denoted as $\mathcal{N}_{D}(\cdot)$ , is a DNN designed for solving A2G deployment optimization problem leveraging the results obtained by Approximator-NN. It outputs the throughput-optimal values for the A2G deployment parameters which are also fed as an input to Approximator-NN. A neural network is usually trained to minimize a certain error function. For Optimizer-NN, we set the error function equal to the negative of the average throughput of a batch of candidate deployment configurations. Consequently, minimizing the considered error function during the training process ends up maximizing the average throughput in the batch.

In this work, we implement Optimizer-NN using a word embedding algorithm [15]. Word embedding, usually used for solving natural language problems, aims to map the words in a word set to numerical vectors that represent the word features. In our model, we treat a batch consisting of $n$ candidate deployment configurations $\{[d_{i},\mathbf{\Theta}_{i},l_{i}]\}_{i=1}^{n}$ as the word set to be embedded, i.e., the embedding vector for the $i$ th word contains the features $[d_{i},\mathbf{\Theta}_{i},l_{i}]$ , same as the configuration parameters to be optimized. Hence, finding the optimal embedding vectors for the deployment configurations leads to throughput-optimal ISD and up-tilt angles.

The training process of Optimizer-NN is based on a modified version of word2vec [15]. The detailed architecture is shown in Fig. 3b. A Tanh layer is put before ReLU to normalize the output range from the network to [-1,1] so that after ReLU, the output is restricted to [0,1]. This enables proper processing of the outputs which further serve as inputs for Approximator-NN. After the convergence, the output from Optimizer-NN consists of a batch of well-tuned configurations, among which we can select the optimal deployment that maximizes the user throughput.

We next describe the modified version of word2vec algorithm for our use case. Traditionally, in a word embedding NN, the indices $\{n\}=\{0,...,n-1\}$ of the words in a given word set with cardinality $n$ constitute the input for the embedding layer. The desired outputs are the numerical embedding vectors representing either explicit or implicit features of each word in the word set. In this work, we treat a set of $n$ candidate configurations as a word set, i.e., Optimizer-NN takes the set of indices $\{n\}$ as an input, and outputs a matrix D containing vectors representing each deployment configuration:

\displaystyle=\mathcal{N}_{D}(\{n\};\textbf{w}_{D})

(7)

where the matrix $\textbf{w}_{D}$ contains the NN parameters. Furthermore, $\textbf{D}=\left[D_{0},\cdots,D_{n-1}\right]^{\textrm{T}}\in\mathds{R}^{n\times(\textrm{S}+2-c)}$ , where each row $D_{i}$ for $i\in\{n\}$ equals $[d_{i},\mathbf{\Theta}_{i},l_{i}]$ when $c=0$ , and T denotes the matrix transpose. The deployment configuration $D_{i}$ that, when input to Approximator-NN, yields the maximum user throughput is chosen as the optimal solution, i.e.,

\displaystyle D^{*}

\displaystyle=\operatorname*{argmax}_{D_{i},i\in\{n\}}\mathcal{N}_{A}({D}_{i};\textbf{w}_{A}).

(8)

For training Optimizer-NN, we define the loss function $L(\mathbf{w}_{D})$ as the negative of the batch average of the approximated user throughput, i.e.,

\displaystyle L(\mathbf{w}_{D})

\displaystyle=\frac{-\sum_{i=0}^{n-1}t_{X,i}}{n}=\frac{-\sum_{i=0}^{n-1}\mathcal{N}_{A}(D_{i};\textbf{w}_{A})}{n}.

(9)

We use the Adam optimizer [14] for minimizing $L(\mathbf{w}_{D})$ such that the output D from $\mathcal{N}_{D}(\{n\};\textbf{w}_{D})$ contains candidate configurations with potentially high throughputs. We describe the training process in Algorithm 1. We initialize the matrices $\mathbf{w}_{D}$ and D randomly and run $\tau$ iterations per epoch (lines 7-21). If we intend to fix (rather than optimize) any of the parameters $\{d,\mathbf{\Theta},l\}$ , the fixed parameter(s) can be concatenated with the output from Optimizer-NN (line 8). We compute the mean error for the loss function in line 15. The batch $\textbf{D}^{\prime}$ with the minimum error is stored for every epoch (lines 17-20) and the batch $\textbf{D}^{*}$ with the minimum error across all the epochs $\epsilon^{*}$ is retained (lines 22-28). Optimizer-NN converges when $\epsilon^{*}$ remains unchanged for more than $k_{\max}$ iterations. Finally, the throughput-optimal parameter values are obtained using (8) (line 30). The algorithm has a linear time complexity, which makes it attractive for implementation.

Algorithm 1 : Optimizer-NN training procedure

1:Initialization:

2:Choose S and

c

and find embedding size:

\textrm{S}+2-c

;

3:Choose the batch size

n

and parameters

\tau

and

k_{\max}

;

4:Set

\epsilon_{\min}=0

\epsilon^{*}=0

and

k=0

;

5:while

k<k_{\max}

\epsilon_{\min}=0

;

7: for

\tau_{i}=1:\tau

\textbf{D}=\mathcal{N}_{D}(\{n\};\textbf{w}_{D})\in\mathds{R}^{n\times(\textrm{S}+2-c)}

;

\textbf{D}^{\prime}\leftarrow\textbf{D}

;

10: if

c>0

then

11: Construct

\textbf{D}^{\prime}\in\mathds{R}^{n\times(\textrm{S}+2)}

by concatenating the

c

fixed

12: parameters to D;

13: end if

14: Calculate error

\upepsilon=-\mathcal{N}_{A}(\textbf{D}^{\prime};\textbf{w}_{A})\in\mathds{R}^{n\times 1}

;

15: Evaluate loss function:

L(\textbf{w}_{D})=\text{mean}(\upepsilon)

;

16: Optimize

L(\textbf{w}_{D})

using Adam optimizer;

17: if

\min(\upepsilon)<\epsilon_{\min}

then

18:

\textbf{D}^{{\dagger}}\leftarrow\textbf{D}^{\prime}

;

19:

\epsilon_{\min}\leftarrow\min(\upepsilon)

;

20: end if

21: end for

22: if

\epsilon_{\min}<\epsilon^{*}

then

23:

\epsilon^{*}\leftarrow\epsilon_{\min}

;

24:

\textbf{D}^{*}\leftarrow\textbf{D}^{{\dagger}}

;

25:

k\leftarrow 0

;

26: else

27:

k=k+1

;

28: end if

29:end while

30:

D^{*}=\operatorname*{argmax}\limits_{D_{i},i\in\{n\}}\mathcal{N}_{A}(\textbf{D}^{*};\mathbf{w}_{A})

;

IV Simulation Results and Discussions

We generate the dataset for our simulations using a system-level simulator for 5G NR. The simulator models the DL of an A2G network consisting of 19 BSs serving aircraft located randomly within the simulation area. The BS height is 35 m while the aircraft are at an altitude of 12 km. For each sector, we assume a transmit power of 49 dBm and a cross-polarized $4\times 4$ planar array with half-wavelength element spacing. We set the carrier frequency and system bandwidth to 3.5 GHz and 100 MHz. We assume full frequency reuse and consider wrap-around to account for inter-cell interference. We discuss the simulation results for Approximator-NN and Optimizer-NN in section IV-A and IV-B. For performance comparison, we use the optimal deployment configuration obtained from an exhaustive search of the dataset as a benchmark.

IV-A Approximator-NN training and validation

We train Approximator-NN to output the 50th percentile user throughput $t_{50}$ . We use the parameter values shown in Table II for data collection. For $S=1$ , the number of neurons in each hidden layer is $10$ . For $S\in\{3,4\}$ , each hidden layer has $16$ neurons to further improve the approximation accuracy. The training dataset consists of $760$ , $48905$ , and $30405$ entries for $S=1$ , $S=3$ , and $S=4$ , respectively.

TABLE II: Dataset parameter ranges for Approximator-NN.

Paramter	Range	Step size
ISD	[20, 160] km	20 km
Up-tilt angles	[0 $\degree$ , 90 $\degree$ ]	5 $\degree$ ( $S\in\{1,3\}$ ); 10 $\degree$ ( $S=4$ )
Traffic load	[10, 70] Mbps	20 Mbps

In Fig. 4, we show the convergence of the training process in terms of the MSE of the user throughput. In Fig. 5, we plot the cumulative distribution function (CDF) of the user throughput prediction error. We observe that for $S=1$ , 95% of the prediction errors are 6.5 Mbps or lower. For $S\in\{3,4\}$ , 95% of the prediction errors are within 4.5 Mbps thanks to larger datasets. We emphasize that our goal is to optimize A2G network parameters to achieve high throughput, which is typically on the order of hundreds of Mbps in the considered scenario. This suggests that the throughput approximation errors in Approximator-NN are relatively small compared to the anticipated throughput range of an A2G system. Therefore, Approximator-NN can provide satisfactory throughput approximations for a given input $[d,\mathbf{\Theta},l]$ .

IV-B A2G network deployment optimization

We obtain optimal deployment configuration using Algorithm 1 with parameter values $\tau=1000$ , $k_{\max}=20$ , $n=4$ for $S=1$ , and $n=8$ for $S\in\{3,4\}$ . For each $S\in\{1,3,4\}$ , the optimal configuration based on Optimizer-NN has an ISD of 20 km and a traffic load of 4 Mbps, same as the minimum allowed values in (5). The optimal configurations in the dataset have an ISD of 20 km and a traffic load of 10 Mbps, same as the minimum possible values in the dataset. This trend is plausible since a smaller ISD helps improve the received signal power while a lower traffic load reduces the inter-cell interference.

In Table III, we provide the optimal values for the antenna up-tilt angles with the corresponding user throughput for both Optimizer-NN and the dataset. The reported results for throughput are obtained directly from the simulator by plugging in the optimized configuration parameters. We find that the antennas need to be up-tilted somewhat aggressively (around 60 $\degree$ to 90 $\degree$ ) due to a relatively small ISD of 20 km. For the 4-sector case, one antenna needs to be in an almost upright position to boost throughput. This is because three sectors are sufficient to cover the entire cell due to its small size. With an almost upright fourth sector, the inter-cell interference is reduced as more power is directed to the region above the BS.

We note that the proposed algorithm learns the benefit of operating in a lightly-loaded system, as evident from the optimal load of 4 Mbps based on Optimizer-NN despite the minimum value of 10 Mbps considered in the dataset. Consequently, the throughput for the optimal configurations based on Optimizer-NN is higher than that based on the dataset.

We next consider another practically relevant scenario where the ISD is large. In Table IV, we provide the optimal antenna up-tilt angles for different number of sectors for a fixed ISD of 80 km. We observe that due to a larger ISD, it is optimal to have only moderately up-tilted antennas (around $30\degree$ ) for coverage extension. This finding holds for all considered values for the number of sectors. With a small ISD, the network is interference-limited and using highly up-tilted antennas (as in Table III) helps direct the signal power towards the aerial region above the BS, thus reducing inter-cell interference. As the cell size expands, the network becomes coverage-limited and the antennas need to be slanted to provide coverage throughout the cell. Furthermore, the larger ISD also results in a lower throughput. For instance, compared to the case with 20 km ISD, $t_{50}$ is reduced by 49%, 35% and 32% for $S=1$ , $S=3$ , and $S=4$ , respectively.

In Fig. 6, we examine the throughput variation for the optimal antenna configuration in Table III and the optimal ISD under different traffic conditions. A higher traffic load corresponds to a higher resource utilization. The plot reaffirms the intuition that a higher traffic load leads to a stronger inter-cell interference, thus reducing the average user throughput. We also observe that the throughput for the optimal configuration obtained using the proposed framework is almost indistinguishable from that obtained using the dataset. Another noticeable trend in Fig. 6 is the throughput variation for different sectors under varying traffic conditions. Under a low traffic load, the 1-sector case results in the lowest throughput among all cases. As the traffic load increases, the 1-sector case yields the highest throughput while the 4-sector case the lowest. This is because deploying more sectors per site reduces the cell size which increases the received signal power for a typical user. This, however, also increases the total radiated power per site, which manifests as inter-cell interference. Under a high traffic load, the degradation due to interference overshadows the improvement in the received signal power, thus reducing the SINR and throughput. Finally, in Fig. 7, we observe similar SINR CDFs for the optimal configurations based on Optimizer-NN and the dataset.

TABLE III: Up-tilt angles based on Optimizer-NN and dataset for the optimal ISD

d=20

km.

S	Optimizer-NN Dataset
S	$\mathbf{\Theta}$	$t_{50}$ (Mbps)	$\mathbf{\Theta}$	$t_{50}$ (Mbps)
1	78°	295.39	70°	287.27
3	[57°, 58°, 57°]	333.68	[60°, 60°, 60°]	305.51
4	[90°, 58°, 62°, 61°]	322.36	[50°, 80°, 60°, 50°]	295.35

TABLE IV: Up-tilt angles based on Optimizer-NN and dataset for a fixed ISD

d=80

km.

S	Optimizer-NN Dataset
S	$\mathbf{\Theta}$	$t_{50}$ (Mbps)	$\mathbf{\Theta}$	$t_{50}$ (Mbps)
1	35°	149.07	30°	138.76
3	[29°, 32°, 27°]	216.48	[30°, 30°, 30°]	198.18
4	[26°, 35°, 36°, 37°]	216.56	[30°, 30°, 30°, 30°]	195.68

V Conclusion

In this paper, we have developed a new deep learning-based framework for A2G network design and deployment optimization. Specifically, we have proposed a bi-DNN architecture for modeling and optimizing A2G networks that involve a wide range of parameters. In the proposed framework, the first DNN helps approximate the aircraft throughput and the second DNN determines the throughput-optimal network design parameters including the antenna up-tilt angles and the ISD. We have drawn several insights from the simulation results. It is throughput-optimal to aggressively up-tilt the antennas when the ISD is small. With a large ISD, the network gets coverage-limited and the antennas need to be slanted to provide coverage throughout the cell. Moreover, under a low traffic load, it is throughput-optimal to deploy a large number of sectors. Under a high traffic load, the network becomes interference-limited and operating with fewer sectors is beneficial as it reduces interference.

References

[1] J. P. Rula et al., ``Mile high WiFi: A first look at in-flight internet connectivity,'' in Proc. World Wide Web Conference, 2018, pp. 1449–1458.
[2] D. Krichene, M. Sliti, W. Abdallah, and N. Boudriga, ``An aeronautical visible light communication system to enable in-flight connectivity,'' in Proc. IEEE 17th International Conference on Transparent Optical Networks (ICTON), 2015, pp. 1–6.
[3] X. Lin, A. Furuskär, O. Liberg, and S. Euler, ``Sky high 5G: New Radio for air-to-ground communications,'' arXiv preprint arXiv:2003.06361, Mar. 2020.
[4] L. Afonso et al., ``Cellular for the skies: Exploiting mobile network infrastructure for low altitude air-to-ground communications,'' IEEE Aerosp. Electron. Syst. Mag., vol. 31, no. 8, pp. 4–11, Aug. 2016.
[5] M. Mozaffari, W. Saad, M. Bennis, Y. Nam, and M. Debbah, ``A tutorial on UAVs for wireless networks: Applications, challenges, and open problems,'' IEEE Commun. Surveys Tuts., vol. 21, no. 3, pp. 2334–2360, third quarter 2019.
[6] X. Lin et al., ``5G New Radio: Unveiling the essentials of the next generation wireless access technology,'' IEEE Commun. Standards Mag., vol. 3, no. 3, pp. 30–37, Sept. 2019.
[7] N. Tadayon, G. Kaddoum, and R. Noumeir, ``Inflight broadband connectivity using cellular networks,'' IEEE Access, vol. 4, pp. 1595–1606, Mar. 2016.
[8] M. Vondra et al., ``Performance study on seamless DA2GC for aircraft passengers toward 5G,'' IEEE Commun. Mag., vol. 55, no. 11, pp. 194–201, Nov. 2017.
[9] L. Liu, ``Performance evaluation of direct air-to-ground communication using New Radio (5G),'' M.S. thesis, School of Electrical Engineering, KTH, Aug. 2017. [Online]. Available: http://kth.diva-portal.org/smash/get/diva2:1129315/FULLTEXT01.pdf
[10] J. Caldwell et al., ``Deep optimisation: Solving combinatorial optimisation problems using deep neural networks,'' arXiv preprint arXiv:1811.00784, Nov. 2018.
[11] H. Sun et al., ``Learning to optimize: Training deep neural networks for wireless resource management,'' in Proc. IEEE 18th International Workshop on Signal Processing Advances in Wireless Communications (SPAWC), 2017, pp. 1–6.
[12] B. Yu et al., ``A deep learning approach for maximum activity links in D2D communications,'' Sensors, vol. 19, no. 13, p. 2941, July 2019.
[13] X. Glorot, A. Bordes, and Y. Bengio, ``Deep sparse rectifier neural networks,'' in Proc. 14th international conference on artificial intelligence and statistics, 2011, pp. 315–323.
[14] D. P. Kingma and J. Ba, ``Adam: A method for stochastic optimization,'' arXiv preprint arXiv:1412.6980, Dec. 2014.
[15] T. Mikolov, K. Chen, G. Corrado, and J. Dean, ``Efficient estimation of word representations in vector space,'' arXiv preprint arXiv:1301.3781, Jan. 2013.