Distributed Massive MIMO for LEO Satellite Networks

Mohammed Y. Abdelsadek, Gunes Karabulut Kurt,
and Halim Yanikomeroglu This article was presented in part at the IEEE International Conference on Communications (ICC) Workshops 2021 in [1].
M. Y. Abdelsadek and H. Yanikomeroglu are with the Department of Systems and Computer Engineering, Carleton University, Ottawa, ON K1S 5B6, Canada. Emails: {mohammedabdelsadek, halim}@sce.carleton.ca. M. Y. Abdelsadek is also with (on leave) the Department of Electrical Engineering, Assiut University, Assiut 71516, Egypt.G. Karabulut Kurt is with the Department of Electrical Engineering, Polytechnique Montreal, Montreal, QC H3T 1J4, Canada. Email: [email protected]. G. Karabulut Kurt is also an Adjunct Research Professor in the Department of Systems and Computer Engineering, Carleton University, Ottawa, ON K1S 5B6, Canada.

Abstract

The ultra-dense deployment of interconnected satellites will characterize future low Earth orbit (LEO) mega-constellations. Exploiting this towards a more efficient satellite network (SatNet), this paper proposes a novel LEO SatNet architecture based on distributed massive multiple-input multiple-output (DM-MIMO) technology allowing ground user terminals to be connected to a cluster of satellites. To this end, we investigate various aspects of DM-MIMO-based satellite network design, the benefits of using this architecture, the associated challenges, and the potential solutions. In addition, we propose a distributed joint power allocation and handover management (D-JPAHM) technique that jointly optimizes the power allocation and handover management processes in a cross-layer manner. This framework aims to maximize the network throughput and minimize the handover rate while considering the quality-of-service (QoS) demands of user terminals and the power capabilities of the satellites. Moreover, we devise an artificial intelligence (AI)-based solution to efficiently implement the proposed D-JPAHM framework in a manner suitable for real-time operation and the dynamic SatNet environment. To the best of our knowledge, this is the first work to introduce and study DM-MIMO technology in LEO SatNets. Extensive simulation results reveal the superiority of the proposed architecture and solutions compared to conventional approaches in the literature.

Index Terms:

Satellite communication networks, LEO constellations, cell-free massive MIMO, handover management, resource allocation.

I Introduction

Satellites are envisaged to play a critical role in future communication networks. Although satellite networks (SatNets) are considered to be one of the most promising solutions for connecting the unconnected in remote and rural areas, they can provide a plethora of additional applications and services elsewhere on earth and in space. In this regard, SatNets can be used to reinforce connectivity solutions in underserved areas [2], and there is a growing interest in utilizing satellite systems to realize ubiquitous Internet of Things (IoT) [3]. Moreover, SatNets can enable more efficient backhaul services [4], data offloading applications [5], space exploration [6], among other benefits and use cases.

The many use cases of SatNets have drawn attention from network operators, standardization bodies, and manufacturing companies. In this respect, the Third Generation Partnership Project (3GPP) has been studying the integration of a satellite component in terrestrial fifth-generation (5G) networks through several study items (SIs) in the recent releases. The SI in [7] was implemented in Release 15 to investigate the support of non-terrestrial networks (NTN) (satellites and high-altitude platform stations (HAPS) [8]) in the 5G New Radio (NR). This study has been expanded to several SIs in Releases 16 and 17 to study the following: 1) use cases and satellite access [2]; 2) integration scenarios and architectural aspects [9]; 3) management and monitoring of satellite components [10]; and 4) radio access network architecture and interface protocols [11]. The focus of standardization efforts in Release 17 is on transparent satellite architecture to enable broadband and backhauling use cases. Future releases will consider the regenerative payload architectures, IoT, and dual connectivity use cases.

Although geostationary Earth orbit (GEO) satellites have been used for a long time to offer connectivity and broadcast services, there are several issues associated with their services, such as long delay, high path loss, and over-subscription. These drawbacks are in addition to the high costs of manufacturing and deploying GEO satellites. Alternatively, low Earth orbit (LEO) satellites are characterized by low latency and low path loss communications due to lower deployment altitudes (i.e., as low as 300 km compared to $36,000$ km for GEOs). Besides, the deployment costs of LEOs are much lower than GEOs. This enables LEO SatNets to provide low-cost services with high quality. Therefore, thousands of LEO satellites are being launched to build mega-constellations orbiting the Earth by 2030, such as those built by SpaceX, Telesat, OneWeb, and Amazon, to name a few.

However, LEO SatNets suffer from several drawbacks due to their motion to ground user terminals (UTs). This relative motion causes UTs to switch links among different LEO satellites to maintain a network connection. This handover process is implemented at the link-layer and network-layer. The former used for switching over communication links from one satellite to another one in the UT’s visibility. By contrast, a network-layer handover is used for switching higher-layer protocols (e.g., transmission control protocol (TCP), user datagram protocol (UDP)) to a new Internet protocol (IP) address of a UT when it is connected to a different home network due to a satellite handover. The rate of this satellite handover process is high due to the fact that the LEO satellites are only visible to UTs for a few minutes at a time as they pass in orbit. This high rate of satellite handover entails high signalling overhead, throughput losses, processing delay, data forwarding, and location update issues [12]. Therefore, the mobility of LEO satellites can have a significant impact on the network performance, spectrum utilization, and users’ quality of service (QoS).

I-A Related Work

Several approaches have been proposed to address challenges associated with the high mobility of LEO satellites. In [13], the authors proposed a software-defined networking (SDN) architecture to control LEO satellites using a controller on the ground that connects to the LEOs via a GEO satellite. Along similar lines, the authors in [14] proposed an extensible architecture utilizing several layers of terrestrial relays (TRs), HAPSs, LEOs, and GEOs for relay purposes, and they studied the handover procedure among these different systems. However, using GEOs in the network, as investigated in these studies, entails a long-delay segment in the communication cycle that directly impacts the QoS of users and requires coordination between different satellite operators. In addition, these two studies, and other current approaches (e.g., [15, 16]), focused on the connectivity of UTs with a single satellite in their visibility. Therefore, the UT’s service time (i.e., connection time without handover interruption) is limited by the visibility of a single LEO satellite, which is only a few minutes (about 10 minutes in the Iridium system [14], for example). The handover rate is inevitably high, regardless of the adopted handover management technique. Moreover, most of these works are based on old LEO constellations (e.g., Iridium) and do not exploit the features of the new LEO mega-constellations, such as SpaceX’s Starlink and Telesat’s Lightspeed.

To overcome the challenges of single-satellite connectivity, the use of multiple-input multiple-output (MIMO) techniques in satellite communications, and in LEO SatNets in particular, has been investigated in a handful of works. Former studies, such as [17, 18, 19], investigated the connection of ground terminals to two GEO satellites, or to two antennas deployed on a single GEO satellite, for diversity purposes to address fading issues (e.g., in rainy and foggy weathers). As for LEO satellites, the authors in [20] focused on feeder links by studying the connection to multiple ground stations via optical links to realize site diversity in future LEO SatNets. In [21], the authors adopted a bipartite graph model for the ground gateway stations and the multiple LEO satellites visible to them. They solved this multi-connectivity problem by using maximum matching techniques. However, they utilized basic MIMO techniques (i.e., considering the general case that each ground station node can be connected to multiple satellite nodes and vice versa) and assumed that the ground stations had accurate information about the motion of the LEO satellites in their visibility. In [22], the authors considered a similar MIMO model and analyzed the capacity of the LEO-MIMO links by taking into consideration the Doppler shift due to the motion of the satellites. Nevertheless, these studies considered classical MIMO models to describe the connectivity of ground UTs with multiple satellites without investigating the details of the network architecture, channel estimation, precoding, and interference between users.

On another front, massive MIMO in LEO SatNets was studied in [23, 24]. In these works, the authors assumed that the LEO satellites use arrays of uniform planar antennas that can realize massive MIMO. However, due to the line-of-sight (LoS) connection to the ground terminals, the collocated satellite massive MIMO system cannot achieve the desired benefits as in terrestrial networks if the ground users are not sufficiently separated due to the so-called “unfavourable propagation” environment [25] that results in a “keyhole channel” matrix [26]. Therefore, collocated massive MIMO would not be suitable for single-user MIMO scenarios (sending multiple layers to the same user), for instance.

One significant difference between future LEO SatNets and old LEO constellations is ultra-dense deployment. That is, future LEO mega-constellations will include thousands of satellites. For example, SpaceX is planning to deploy $30,000$ LEO satellites for their second-generation constellation in addition to the current plan of around $12,000$ satellites [27]. This means that multiple LEO satellites will be visible to ground UTs simultaneously, which will open the door for advanced distributed MIMO techniques, such as cell-free massive MIMO (CF-mMIMO).

CF-mMIMO was recently proposed for next-generation terrestrial networks that build on coordinated multi-point (CoMP) and network MIMO techniques for large spectral efficiency, power efficiency, and network flexibility gains [28]. In terrestrial CF-mMIMO, multiple access points can be used to cooperatively communicate with users in a cell-free manner. This technique can be utilized in future LEO satellite networks exploiting an ultra-dense deployment, very high-speed inter-satellite links (ISLs), and LoS connections with ground UTs. Moreover, the CF-mMIMO architecture enables the cross-layer design to jointly optimize the upper and lower layers of networking. This cross-layer design is of utmost importance to LEO SatNets given that the network nodes are moving (i.e., the network topology is dynamic) and all links (with UTs, gateways, and other satellites) are wireless (radio frequency (RF) or free-space optical (FSO)). This means that the design of the lower layers has a significant impact on the performance of the upper ones. Therefore, CF-mMIMO can be leveraged for an efficient, resilient satellite network.

Noting that CF-mMIMO is a terrestrial technology that belongs to the distributed MIMO techniques, in this paper, we focus on the generalization of CF-mMIMO and propose a distributed massive MIMO (DM-MIMO) LEO satellite network architecture. Besides, we investigate the cross-layer design and an artificial intelligence (AI)-based implementation. To the best of our knowledge, this is the first work to introduce and study a DM-MIMO approach in LEO constellations.

I-B Paper Contributions and Structure

The major contributions of this paper can be summarized as follows:

•

We propose a LEO SatNet architecture based on DM-MIMO techniques. More specifically, we investigate the network topology, required ISLs, duplexing mode, beamforming, power control, frequency reuse, and handover management strategies. Moreover, we highlight the benefits of using this architecture and investigate the associated challenges and potential solutions to realize it.
•

We develop an optimized cross-layer design framework based on the proposed DM-MIMO-based architecture, such that the power allocation and handover management processes are jointly optimized. For this purpose, we describe the channel model, uplink training and channel estimation, downlink data transmission, and formulate a novel multi-objective optimization problem. It should be noted that it is not straightforward to establish a combined channel and data transmission models based on those from cell-free massive MIMO and satellite communications. In the optimization problem, the aggregate throughput is maximized while minimizing the handover rate. We refer to this optimized cross-layer design as the distributed joint power allocation and handover management (D-JPAHM) technique.
•

We introduce an AI-based implementation for the developed cross-layer control framework, which can be used in practical satellite systems. For this purpose, we leverage deep learning to provide accurate predictions for the solution of the formulated multi-objective optimization problem without actually solving the problem. Deep learning provides several benefits compared to traditional optimization-based and heuristic approaches. For instance, it exploits offline computations to reduce the computational complexity of the online operation. It also adapts to changing environments, which is vital for the dynamic satellite network. Furthermore, the scalability and support of distributed data processing and storage provided by deep learning techniques are crucial for LEO satellites that are less capable compared to GEO satellites.
•

We conduct extensive simulations to evaluate the performance of the proposed architecture, cross-layer design, and AI-based implementation. In addition, we compare the performance with that of conventional approaches and architectures from the literature. The simulation results show the superiority of the proposed DM-MIMO-based architecture and solutions compared to the traditional single-connectivity approach.

The remainder of this paper is organized as follows. In Section II, the proposed DM-MIMO-based LEO SatNet architecture is discussed, and several network design aspects are investigated. Then, in Section III, we highlight the benefits of using this architecture, the associated challenges and potential solutions. In Section IV, the optimized cross-layer design is detailed. In Section V, we discuss the AI-driven approach for implementing the proposed cross-layer optimization framework. In Section VI, we present and discuss the results of the simulations to evaluate the performance of the proposed DM-MIMO-based architecture and solutions in comparison with that of traditional single satellite connectivity. Finally, we conclude the paper in Section VII.

II DM-MIMO-Based SatNets

II-A Architecture

Refer to caption — Figure 1: DM-MIMO-based LEO SatNet. The user terminals are connected to a cluster of LEO satellite access points (SAPs) that are controlled by a central processing unit (CPU), which is deployed on a super satellite node (SSN).

Fig. 1 shows the proposed DM-MIMO-based LEO SatNet architecture. As we can see, the satellites are divided into clusters. To be consistent with the terrestrial CF-mMIMO terminology, each satellite in the cluster is called a satellite access point (SAP). In the event that a satellite has multiple antennas, each antenna would be considered a separate SAP. These SAPs are connected to a central processing unit (CPU) through ISLs for fronthaul purposes. This CPU can be deployed on a central satellite with more advanced computing capabilities. These central satellites are called super satellite nodes (SSNs).

II-B TDD Operation

To exploit the reciprocity of the uplink (UL) and downlink (DL) channels, time division duplexing (TDD) is the adopted duplexing mode in CF-mMIMO systems [28]. For the purpose of channel estimation, both UL and DL pilots can be used [29]. However, in most CF-mMIMO studies (e.g., [28, 30, 31]), only UL pilots are considered. This is also suitable for SatNets because the users do not need to estimate their effective channel gain and to use most of the TDD frame for data transmission. It is important to mention that the propagation delay and Doppler shift can be compensated in the time and frequency synchronization processes, given that the satellites move in a predetermined orbit, and their motion information can be known a priori [32]. Therefore, TDD was adopted in several satellite LEO communication studies [33] and systems [34, 35]. Accordingly, the TDD frame can be structured, as shown in Fig. 2.

The channel coherence interval is defined as the time-frequency interval during which the channel characteristics can be considered static. This coherence interval depends on the channel condition, the mobility of the satellite and the UT, and the carrier frequency. The coherence interval samples–or channel uses–designated $\tau_{c}$ , are divided into three parts: the initial $\tau^{p}_{u}$ samples are used for UL pilot transmission, the next $\tau^{d}_{u}$ samples are used for UL data transmission, and the last $\tau^{d}_{d}$ samples are reserved for DL data transmission. It is worth mentioning that the guard intervals are excluded from this coherence time interval. Utilizing the UL pilots, all the UL channels are estimated at the SAPs locally without forwarding them to the CPU. This supports the scalability of the network, since the signalling overhead is independent of the number of SAPs. Due to reciprocity, these channel estimates are valid for the DL direction as well. Therefore, the estimated channels are used for DL data precoding and UL data detection.

II-C Radio Resource Allocation

Efficient radio resource allocation techniques can exploit the advantages of the proposed architecture (including transmit and receive diversity) to achieve higher throughput for the connected terminals, ensuring their QoS satisfaction, minimizing interference, and minimizing the handover rate. In this subsection, we discuss different aspects of resource allocation for the proposed DM-MIMO-based SatNets architecture.

II-C1 Pilot assignment

UTs can be assigned mutually orthogonal UL pilots to minimize the interference between them. However, this requires the number of UL training samples, $\tau^{p}_{u}$ , to be more than the number of connected UTs, which is difficult in SatNets due to the large number of connected UTs. Therefore, every subset of the UTs can be assigned one pilot from the mutually orthogonal pilot set. This results in what is known as pilot contamination, which needs to be taken into consideration while designing the resource allocation procedure. The pilot assignment can be implemented locally at the SAPs in a distributed manner or centrally at the CPU. The pilot assignment information can be transmitted to the UTs over the random access channel during the random access process.

II-C2 Beamforming

Several beamforming techniques can be utilized for this DM-MIMO-based satellite networks. One of the widely used schemes in the literature (e.g., in [28, 30, 36]), is the maximum ratio processing (i.e., conjugate beamforming in the DL direction and matched filtering in the UL). This method can be employed to exploit the distributed channel estimation at the SAPs, which is considered one of the major benefits of using DM-MIMO, as this reduces the computational complexity and the required fronthaul signalling between the SAPs and the CPU [28]. However, other centralized beamforming techniques, such as zero-forcing (ZF) and minimum mean square error (MMSE) can be used [37, 31]. These schemes can be utilized to optimize the beamforming design at a global manner to improve performance. However, this centralized operation requires more fronthaul signalling between the SAPs and CPU to allow the CPU to collect the channel parameters from the SAPs.

II-C3 Frequency reuse

By using precoding in the DL and maximum ratio combining for the UL transmissions, full frequency reuse (FFR) can be realized instead of using conventional four colour frequency reuse patterns to mitigate the interference between the spot beams [38]. This leads to efficient spectrum usage since the whole allocated spectrum can be used anytime and anywhere while minimizing the interference among users.

II-C4 Power control

Power control plays an essential role in optimizing the cooperative transmission and reception of SAPs to maximize the network throughput and ensure users satisfaction. The power allocation should consider the interference between the UTs, the pilot assignment, and the achievable data rates. In addition, power allocation can be optimized to maximize the service time of the UTs, thereby minimizing the handover rate. This is the focus of Section IV.

II-D Handover Management

The traditional satellite handover process (i.e., based on single satellite connectivity) is depicted in Fig. 3a. In this case, when the signal level is below a certain threshold, the link is switched to the next LEO satellite in the cone visibility of the UT. This can be accomplished by using the satellite reference signals that are broadcast by the satellite. In the network layer, handover is required to forward the data arriving for the old address to the new home network, since the UT is given a new IP address in the new home network. As discussed above, the service time in this case is limited by the satellite visibility, which is a few minutes in LEO SatNets. However, using the proposed DM-MIMO architecture, the UT is connected to a cluster of satellites or SAPs. Consequently, the service time is limited by the visibility of the target cluster, which is longer than that of a single satellite. In addition, the resource allocation process can be optimized such that the service time is maximized. This minimizes the handover rate, associated losses, processing delays, and signalling overhead. In what follows, we discuss the link-layer and network-layer handover processes based on the proposed DM-MIMO architecture.

II-D1 Link-layer handover

In DM-MIMO-based SatNets, power allocation can be adjusted such that the service times of the ground UTs are maximized. This is because the UTs are served by all SAPs in the serving cluster. Therefore, the cooperative transmission of those SAPs can compensate for the signal level decaying due to the movement of the satellites. In addition, in the UL direction, the data is decoded on the basis of the received signals by all SAPs in the cluster. Nevertheless, a cluster handover is required to switch to the next cluster when resource allocation cannot satisfy the UT’s minimum required data rate level. This can be detected while allocating the radio resources (i.e., the transmit power), as detailed in Section IV. Besides, since the next serving cluster is known, the handover decision can be confirmed by the next cluster that detects the UL pilot from the ground UT by its edge SAPs. Fig. 3b shows the handover between satellite clusters when the minimum QoS level cannot be guaranteed by the old cluster’s established link due to moving far from the UT.

II-D2 Network-layer handover

When link handover is triggered and implemented, a network-layer handover is required to assign a new address to the UT and forward the incoming data to the new home network. In terrestrial IP networks, several mobility management protocols are adopted, such as Mobile IPv6 (MIPv6) [39] and Proxy Mobile IPv6 (PMIPv6) [40]. In these protocols, mobility anchors are used to establish tunnels to forward the data to the new network and update the binding cash. In the proposed DM-MIMO architecture, the addressing issues can be tackled by using cluster mobility anchors (CMAs) that are located in the SSNs of the clusters (i.e., along with the CPUs). When the link handover is implemented, the CMA of the new cluster provides a new home network prefix (HNP) to the UT and establishes a bidirectional tunnel with the old serving CMA such that the data are forwarded to the new cluster, as shown in Fig. 3b. In this scheme, the old IP address of the UT (HNP1::UT) becomes Deprecated, and the newly assigned address (HNP2::UT) becomes Preferred.

The link and network handover procedure for the proposed DM-MIMO can be implemented as depicted in Fig. 4. When a handover is triggered at the link level and the visibility of the UT by the next cluster is confirmed, a network handover procedure is implemented to create a tunnel between the old and new clusters. This allows the packets arriving for the UT with the old IP address to be directed to the new cluster. Then, cross-layer control is implemented to assign new pilot, power, beamforming vector, … etc., to the UT in the new cluster to resume the data transmission via that cluster of SAPs. The complete signal diagram is shown in Fig. 4.

III Benefits, Challenges, and Potential Solutions

Based on the previous discussion of the different design aspects of DM-MIMO-based SatNets, we highlight the major benefits of using this architecture, the main challenges associated, and potential solutions to address these challenges in this section.

III-A Benefits

The major benefits of using DM-MIMO in LEO SatNets can be summarized as follows:

•

Improved throughput: As indicated earlier, the dominant LoS communication between the ground UTs and LEO satellites imposes limitations on the use of MIMO techniques in LEO SatNets. This is due to the unfavourable propagation environment that lowers the rank of the channel matrix between the UTs and satellites [25]. Therefore, this limits the MIMO gain to the keyhole capacity as investigated in [26], except if proper user grouping and spacing are employed to alleviate this effect. The use of DM-MIMO realizes this spatial distribution among the serving satellites and could address the collocation issues. This is in addition to the full frequency reuse that enhances spectral efficiency. Therefore, the use of DM-MIMO could significantly improve the satellite network throughput.
•

Lower handover rate: As discussed in Section II-D, the use of DM-MIMO-based architecture enables the UTs to be served by a cluster of LEO satellites. Therefore, this extends the UT’s service time to the visibility period of a cluster of satellites instead of a single one. This directly reduces the handover rate of UTs and the issues associated with a high handover rate, such as signalling overhead, processing delay, data forwarding, tunnelling, IP addressing issues, and location update.
•

Flexible operation: The concept of CPUs and SAPs is similar to centralized units (CUs) and distributed units (DUs) that are used in the context of open radio access networks (O-RAN), as discussed in [41]. This opens the door to the advanced flexible O-RAN techniques [42] to be utilized in LEO SatNets.
•

Cross-layer design: As indicated in Section I, the nodes of the satellite network are moving and have wireless links among them and to the users on the ground. Therefore, the performance of lower layers highly impacts the upper ones. The use of DM-MIMO enables the cross-layer design of SatNets to jointly optimize upper and lower layers, which provides several benefits to SatNets [43]. In this regard, a cross-layer framework is proposed in Section IV to show the benefits of using this design.

III-B Challenges and Potential Solutions

Although the DM-MIMO-based architecture provides several benefits to LEO SatNets, several challenges need to be addressed to leverage this technology in SatNets. The major challenges and potential solutions can be summarized as follows:

•

Dynamic topology: Due to the dynamic environment of LEO satellites (due to movement at different directions and at different altitudes), the topology of network nodes changes with time. This entails issues with defining the cluster that serves the UTs (i.e., the number of SAPs and which SAPs serve each UT). However, this cluster formation process can be optimized by the SSN on the basis of different performance objectives and associated deployment costs. For instance, this depends on the amount of traffic per spot (e.g., hotspots should be served with a larger number of SAPs compared to cold spots), the capabilities of the serving SAPs (e.g., the power and computational resources), and the kind of traffic (e.g., narrowband versus broadband). An example of such a dynamic clustering approach is proposed in [44] for terrestrial cell-free massive MIMO networks. This concept can be adapted to DM-MIMO-based SatNets. Moreover, user grouping (see [45] for instance) can be employed to assign groups of ground UTs to dynamic clusters of SAPs to optimize user-cluster association.
•

Synchronization: To achieve such cooperative transmission between a cluster of satellites, synchronization in time, frequency, and phase is required. Several time synchronization techniques have been proposed in the literature to align the signals at the symbol level and achieve coherent transmission (see [46, 47] for instance). Besides, frequency and phase synchronization has been widely studied in the literature to achieve cooperative transmission or sensing as discussed in [48]. These synchronization techniques can be utilized and adapted to be applied in the considered scenario of DM-MIMO-based LEO SatNets.
•

Outdated channel information: As discussed in Section II, the channel knowledge (which is vital to several network management functionalities) is based on the estimated channel utilizing the UL pilots. However, due to the long propagation delay (around $7.9$ ms for LEOs), this channel information could be outdated. To overcome this issue, intelligent machine learning techniques can be utilized to estimate the channel while taking into account the propagation delay. An example of this approach is studied in [49], where the authors propose a deep learning model that is based on long short term with memory (LSTM) to predict the channels while exploiting their correlation to overcome channel aging issues in LEO SatNets.
•

Complexity of SSNs: The SSNs are used to control the SAPs in their clusters. Therefore, this requires additional computing and power resources for these satellites to be able to perform such control tasks. To overcome this issue, distributed processing can be utilized as much as possible. For example, as discussed in Section II, several network management functions (e.g., beamforming and pilot assignment) can be implemented in a distributed manner to reduce the computations on the SSNs. In addition, software operation can be exploited to flexibly implement the functions of the CPU without requiring extra hardware at the SSNs.
•

Signalling overhead: To implement the cooperative operation of SAPs, additional signalling between the CPU and SAPs is required. To overcome this signalling overhead, the exchange of the information between the SAPs and the CPU should be minimized; for example, by restricting it to payload data and power control coefficients as discussed in [28]. Besides, the use of high-speed FSO communications [50], which enables the information to be exchanged at very high rate with low latency, can be utilized to overcome this issue.

IV Optimized Cross-Layer Design

For the cross-layer design of the power allocation and handover management processes in the proposed DM-MIMO-based SatNet architecture, we start by discussing the channel model and estimation. We then derive the achievable data rates and formulate the optimization problem. The frequently used symbols are summarized in Table I.

TABLE I: Frequently Used Symbols

Symbol	Description
$\mathcal{K},~{}\mathcal{M}$	Sets of UTs and SAPs, respectively
$K,~{}M$	Cardinalities of $\mathcal{K},~{}\mathcal{M}$ , respectively
$\tau_{c},~{}\tau_{u}^{p}$	Length of coherence interval and UL pilot, respectively
$\tau_{u}^{d},~{}\tau_{d}^{d}$	Length of UL and DL data transmissions, respectively
$q_{k},~{}p_{m,k}$	UL pilot power and DL power factor of SAP $m$ to UT $k$
$h_{m,k}$	Channel coefficient between UT $k$ and SAP $m$
$\hat{h}_{m,k}$	Estimated channel coefficient between UT $k$ and SAP $m$
$L_{m,k}$	Large-scale fading and losses between UT $k$ and SAP $m$
$R_{k}$	Achievable DL data rate in bps/Hz for UT $k$
$R_{k}^{min}$	Minimum rate of UT $k$ in bps/Hz
$P_{m}^{max}$	Maximum power factor of SAP $m$
$\alpha$	Handover rate minimization priority factor

IV-A Channel Model

We consider a cluster of LEO satellites that includes a set of $M$ SAPs indexed by $\mathcal{M}=\{1,~{}2,\cdots,~{}m,\cdots,~{}M\}$ . This cluster serves a set of single-antenna ground UTs set, indexed by $\mathcal{K}=\{1,~{}2,\cdots,~{}k,\cdots,~{}K\}$ . Assume that the channel conditions are static in a coherence time interval of $\tau_{c}$ samples. Due to the strong LoS component between the UTs and SAPs, the channel between the $k$ th UT and the $m$ th SAP is modelled as Rician and can be calculated by [51]

\displaystyle h_{m,k}=\sqrt{L_{m,k}}\left(\sqrt{\frac{\kappa_{m,k}}{\kappa_{m,k}+1}}h^{\prime}_{m,k}+\sqrt{\frac{1}{\kappa_{m,k}+1}}h^{\prime\prime}_{m,k}\right),

(1)

where $\kappa_{m,k}$ is the Rician K-factor, $h^{\prime}_{m,k}$ and $h^{\prime\prime}_{m,k}$ represent the LoS and non-LoS (NLoS) components, respectively. The large scale fading and losses are represented by $L_{m,k}=\newline 10^{-(L^{dist}_{m,k}+L^{shad}_{m,k}+L^{angl}_{m,k})/10}$ , where $L^{dist}_{m,k}$ is the power loss (in dB) due to distance between UT $k$ and SAP $m$ , $L^{shad}_{m,k}\sim\mathcal{N}(0,\sigma_{sh}^{2})$ is the shadowing attenuation (in dB) with variance $\sigma_{sh}^{2}$ , and $L^{angl}_{m,k}$ is the loss due to the boresight angle and can be calculated (in dB) by [52]

\displaystyle L^{angl}_{m,k}=-10\log_{10}\left(\cos(\theta_{m,k})^{\eta}\frac{32\log 2}{2\left(2~{}\text{arccos}(\sqrt[\eta]{0.5})\right)^{2}}\right),

(2)

where $\theta_{m,k}$ is the boresight angle between the $k$ th UT and the $m$ th SAP, and $\eta$ is the antenna factor determining the coverage radius, assuming that the aperture efficiency is unity.

Suppose that the NLoS component, $h^{\prime\prime}_{m,k}$ , is a Rayleigh random variable, i.e., $h^{\prime\prime}_{m,k}\sim\mathcal{C}\mathcal{N}(0,1)$ . The LoS component is given by $h^{\prime}_{m,k}=e^{j\phi_{m,k}}$ , where $\phi_{m,k}\sim\mathcal{U}[-\pi,\pi]$ is a uniform random variable that represents the phase shift due to the mobility of the SAP and UT and propagation delay.

For simplicity, we rewrite (1) as follows:

\displaystyle h_{m,k}=\sqrt{\beta_{m,k}}e^{j\phi_{m,k}}+\tilde{h}_{m,k},

(3)

where

\displaystyle\beta_{m,k}=\frac{\kappa_{m,k}}{\kappa_{m,k}+1}L_{m,k}.

(4)

In (3), $\tilde{h}_{m,k}\sim\mathcal{C}\mathcal{N}(0,\lambda_{m,k})$ , $\lambda_{m,k}=L_{m,k}/(\kappa_{m,k}+1)$ , and $\beta_{m,k}\in\mathbb{R}$ . Therefore, $\beta_{m,k}$ and $\lambda_{m,k}$ are changing slowly [53] compared to small-scale fading that changes instantaneously. Since they mainly depend on the UT’s position, they can be calculated a priori. In addition, we assume that the propagation delay and Doppler shift are compensated in the time and frequency synchronization processes. This is a reasonable assumption since they primarily depend on the satellite and user location and velocity, which can be determined given the satellite orbital information [32]. In addition, other advanced techniques can be utilized to compensate for Doppler offset in real-time and achieve symbol synchronization without a priori satellite information (e.g., [32, 54]).

IV-B Uplink Training and Channel Estimation

As discussed in Section II, TDD is used with the frame structure shown in Fig. 2. Therefore, to estimate the UL channels at the SAPs, every UT transmits a pilot on the initial $\tau_{u}^{p}$ samples of the coherence block. Since we assume that the number of UTs is larger than the number of mutual orthogonal pilots (i.e., $K>\tau_{u}^{p}$ ), every subset of UTs is assigned the same pilot. The subset of UTs that are assigned the same pilot as UT $k$ is denoted by $\mathcal{C}_{k}$ . Define $\sqrt{q_{k}}\psi_{k}\in\mathbb{C}^{\tau_{u}^{p}\times 1}$ as the $\tau_{u}^{p}$ -length pilot sequence transmitted by the $k$ th UT, where $q_{k}$ is the pilot power and $\psi_{k}^{H}\psi_{k}=||\psi_{k}||^{2}=\tau_{u}^{p}$ . Therefore, the received signal vector at the $m$ th SAP, $\textbf{y}_{m}^{p}\in\mathbb{C}^{\tau_{u}^{p}\times 1}$ , from all $K$ UTs’ pilot transmissions is given thus:

\displaystyle\textbf{y}_{m}^{p}=\sum_{k=1}^{K}\sqrt{q_{k}}h_{m,k}\psi_{k}+\textbf{n}_{m}^{p},

(5)

where $\textbf{n}_{m}^{p}\sim\mathcal{CN}(\textbf{0}_{\tau_{u}^{p}},\sigma_{n^{p}}^{2}\textbf{I}_{\tau_{u}^{p}})$ is the additive white Gaussian noise (AWGN) vector.

To estimate the UL channel of UT $k$ , sufficient statistics are derived from the received signal by calculating the inner product between the received signal vector, $\textbf{y}_{m}^{p}$ , and $\psi_{k}$ , as follows:

	$\displaystyle y_{m,k}^{p}$	$\displaystyle=\psi_{k}^{H}\textbf{y}_{m}^{p}=\sum_{k^{\prime}=1}^{K}\sqrt{q_{k^{\prime}}}h_{m,k^{\prime}}\psi_{k}^{H}\psi_{k^{\prime}}+\psi_{k}^{H}\textbf{n}_{m}^{p}$		(6)
		$\displaystyle=\sqrt{q_{k}}\tau_{u}^{p}h_{m,k}+\sum_{k^{\prime}\in\mathcal{C}_{k}\backslash\{k\}}\sqrt{q_{k^{\prime}}}h_{m,k^{\prime}}\tau_{u}^{p}+\psi_{k}^{H}\textbf{n}_{m}^{p}.$		(7)

This is because

\displaystyle\psi_{k}^{H}\psi_{k^{\prime}}=\begin{cases}\tau_{u}^{p},&k^{\prime}\in\mathcal{C}_{k}\\ 0,&\text{otherwise}\\ \end{cases}.

(8)

This statistic can be used to estimate the UL channel, $h_{m,k}$ , at the $m$ th SAP using techniques such as MMSE and linear MMSE (LMMSE) estimators. We assume that a phase-aware MMSE channel estimator is used. Therefore, the estimated UL channel can be given accordingly [36]:

$\displaystyle\hat{h}_{m,k}$	$\displaystyle=\sqrt{\beta_{m,k}}e^{j\phi_{m,k}}+\frac{\sqrt{q_{k}}\lambda_{m,k}(y^{p}_{m,k}-\bar{y}^{p}_{m,k})}{\gamma_{m,k}},$	(9)
$\displaystyle\bar{y}^{p}_{m,k}$	$\displaystyle=\sum_{k^{\prime}\in\mathcal{C}_{k}}\sqrt{q_{k^{\prime}}}\tau_{u}^{p}\sqrt{\beta_{m,k^{\prime}}}e^{j\phi_{m,k^{\prime}}},$	(10)
$\displaystyle\gamma_{m,k}$	$\displaystyle=\sum_{k^{\prime}\in\mathcal{C}_{k}}q_{k^{\prime}}\tau_{u}^{p}\lambda_{m,k^{\prime}}+\sigma_{n^{p}}^{2},$	(11)

with the following statistics

	$\displaystyle\mathbb{E}\{\hat{h}_{m,k}\|\phi_{m,k}\}$	$\displaystyle=\sqrt{\beta_{m,k}}e^{j\phi_{m,k}},$		(12)
	$\displaystyle\text{Var}\{\hat{h}_{m,k}\|\phi_{m,k}\}$	$\displaystyle=\frac{q_{k}\tau_{u}^{p}\lambda_{m,k}^{2}}{\gamma_{m,k}},$		(13)

where $\mathbb{E}\{\cdot\},~{}\text{and}~{}\text{Var}\{\cdot\}$ are the expectation and variance operators, respectively.

IV-C Downlink Data Transmission

Given that most of the traffic is in the DL direction, we consider the DL power allocation and manage the handover process. In the DL, the SAPs transmit the same symbol to the UT in a cooperative manner. Assume that the symbol to be sent to UT $k$ is $s_{k}\in\mathbb{C}$ . Every symbol is precoded by a precoding vector $\textbf{v}_{k}=[v_{1,k},v_{2,k},\cdots,v_{M,k}]^{T}$ , where $v_{m,k}\in\mathbb{C}$ . Therefore, if the signal vector to be sent to the $K$ UTs is $\textbf{s}=[s_{1},s_{2},\cdots,s_{K}]^{T}$ , then the signal vector to be transmitted by the $M$ SAPs is given thus:

\displaystyle=\textbf{V}\textbf{s}=\textbf{v}_{1}s_{1}+\textbf{v}_{2}s_{2}+\cdots+\textbf{v}_{K}s_{K},

(14)

where $\textbf{V}=[\textbf{v}_{1},\textbf{v}_{2},\cdots,\textbf{v}_{K}]$ is an $M\times K$ matrix.

Therefore, the signal received by the $k$ th UT can be calculated by

	$\displaystyle y_{k}$	$\displaystyle=\textbf{h}_{k}^{H}\textbf{x}$		(15)
		$\displaystyle=\textbf{h}_{k}^{H}\textbf{v}_{k}s_{k}+\sum_{k^{\prime}\in\mathcal{K}\backslash k}\textbf{h}_{k}^{H}\textbf{v}_{k^{\prime}}s_{k^{\prime}}+n_{k},$		(16)

where $\textbf{h}_{k}=[h_{1,k},h_{2,k},\cdots,h_{M,k}]^{T}$ and $n_{k}\sim\mathcal{CN}(0,\sigma^{2}_{n})$ is the AWGN noise. Assuming that the UT approximates the precoded channel by the average value $\mathbb{E}\{\textbf{v}_{k}^{H}\textbf{h}_{k}\}$ , the signal-to-interference and noise (SINR) can be calculated by [36]

\displaystyle\text{SINR}_{k}=\frac{|\mathbb{E}\{\textbf{v}_{k}^{H}\textbf{h}_{k}\}|^{2}}{\sum_{i=1}^{K}\mathbb{E}\{|\textbf{v}_{i}^{H}\textbf{h}_{k}|^{2}\}-|\mathbb{E}\{\textbf{v}_{k}^{H}\textbf{h}_{k}\}|^{2}+\sigma_{n}^{2}}.

(17)

In this study, we adopt coherent beamforming as the technique used to minimize the interference between the UTs. Therefore, the precoding coefficient for the $k$ th UT and $m$ th SAP is $v_{m,k}=\sqrt{p_{m,k}}\hat{h}_{m,k}$ , where $p_{m,k}$ is a power scaling factor and $\hat{h}_{m,k}$ is the estimated UL channel, which is valid for the DL direction by virtue of channel reciprocity. That is, the precoding vector for the $k$ th UT is given by

\displaystyle\textbf{v}_{k}=\textbf{P}_{k}^{1/2}\hat{\textbf{h}}_{k},

(18)

where $\textbf{P}_{k}=\text{diag}\left(\frac{p_{1,k}}{\mathbb{E}\{|\hat{h}_{1,k}|^{2}\}},\frac{p_{2,k}}{\mathbb{E}\{|\hat{h}_{2,k}|^{2}\}},\cdots,\frac{p_{M,k}}{\mathbb{E}\{|\hat{h}_{M,k}|^{2}\}}\right)$ and $\hat{\textbf{h}}_{k}=[\hat{h}_{1,k},\hat{h}_{2,k},\cdots,\hat{h}_{M,k}]^{T}$ .

Accordingly, by using coherent beamforming as in (18), phase-aware MMSE channel estimation as in (9), and the SINR in (17), the SINR can be derived as in [36] as follows:

\displaystyle\text{SINR}_{k}^{\text{mmse}}=\frac{|\text{tr}(\textbf{P}_{k}^{1/2}\textbf{D}_{k})|^{2}}{\text{Den}},

(19)

$\displaystyle\text{Den}=$	$\displaystyle\sum_{k^{\prime}=1}^{K}\text{tr}(\textbf{P}_{k^{\prime}}\textbf{A}^{\prime}_{k}\textbf{D}_{k^{\prime}})$
	$\displaystyle+\sum_{k^{\prime}\in\mathcal{C}_{k}\backslash k}q_{k}q_{k^{\prime}}(\tau_{u}^{p})^{2}\|\text{tr}(\textbf{P}_{k^{\prime}}^{1/2}\textbf{A}_{k}\textbf{G}_{k^{\prime}}\textbf{A}_{k^{\prime}})\|^{2}$
	$\displaystyle-\text{tr}(\textbf{P}_{k}\textbf{B}_{k}^{2})+\sigma_{n}^{2},$	(20)

where

$\displaystyle\textbf{A}_{k}$	$\displaystyle=\text{diag}(\lambda_{1,k},\lambda_{2,k},\cdots,\lambda_{M,k}),$	(21)
$\displaystyle\textbf{A}^{\prime}_{k}$	$\displaystyle=\text{diag}(\lambda^{\prime}_{1,k},\lambda^{\prime}_{2,k},\cdots,\lambda^{\prime}_{M,k}),$	(22)
$\displaystyle\lambda^{\prime}_{m,k}$	$\displaystyle=\lambda_{m,k}+\beta_{m,k},$	(23)
$\displaystyle\textbf{B}_{k}$	$\displaystyle=\text{diag}(\beta_{1,k},\beta_{2,k},\cdots,\beta_{M,k}),$	(24)
$\displaystyle\textbf{D}_{k}$	$\displaystyle=q_{k}\tau_{u}^{p}\textbf{A}_{k}\textbf{G}_{k}\textbf{A}_{k}+\textbf{B}_{k},$	(25)
$\displaystyle\textbf{G}_{k}$	$\displaystyle=\text{diag}(\gamma_{1,k},\gamma_{2,k},\cdots,\gamma_{M,k})^{-1}.$	(26)

Therefore, the achievable DL data rate (in bps/Hz) of the $k$ th UT served by this cluster of SAPs can be calculated as

\displaystyle R_{k}=\frac{\tau_{d}^{d}}{\tau_{c}}\log_{2}\left(1+\text{SINR}_{k}\right)

(27)

IV-D Cross-Layer Problem Formulation

To optimize the power allocation and the handover decisions such that the cluster throughput is maximized and the handover rate is minimized, we formulate the power allocation and handover processes as a multi-objective optimization problem, where the objective functions to be maximized are the UTs’ aggregate data rate and their service time before being switched to another cluster. For the latter, we maximize the number of served UTs with a guaranteed minimum data rate based on their link conditions. When the link condition does not allow optimized power allocation to serve the UT with the minimum required data rate, a handover request is issued. Then, a handover decision is taken when this repeats, and the visibility of the UT is confirmed by the next serving cluster, as discussed in Section II. That is, the two objectives of the optimization problem at the $t$ th time slot are given as follows:

\displaystyle\max_{p_{m,k},I_{k}}~{}~{}\sum_{k=1}^{K}R_{k}[t]I_{k}[t]~{}~{}~{}\text{and}~{}~{}~{}\max_{p_{m,k},I_{k}}~{}~{}\sum_{k=1}^{K}I_{k}[t],

(28)

where $R_{k}[t]$ is the data rate of the $k$ th UT during the $t$ th time slot based on its channel conditions and power allocation during that time slot, as given in (27). $I_{k}[t]$ is an indicator variable that indicates whether the $k$ th UT can be served by the cluster during the $t$ th time slot with an acceptable data rate, through optimizing the power allocation, or it is infeasible and a handover might be considered. If a handover is triggered, then the implementation procedure would be as shown in Fig. 4. By introducing this indicator variable, we can jointly optimize the data rate of the UTs and their service time.

To deal with this multi-objective optimization problem, we construct a weighted sum of the two objectives to combine the two conflicting objectives into a single function. Thus, the complete handover-aware power allocation problem at time slot $t$ is formulated as follows:

	$\displaystyle\max_{p_{m,k},I_{k}}~{}~{}(1-\alpha)\sum_{k=1}^{K}R_{k}[t]I_{k}[t]+\alpha\sum_{k=1}^{K}I_{k}[t]$	(29)
$\displaystyle\mathrm{s.t.}~{}~{}$	$\displaystyle R_{k}[t]\geq R_{k}^{\text{min}}I_{k}[t],~{}\forall k\in\mathcal{K}$	(29a)
	$\displaystyle\sum_{k=1}^{K}p_{m,k}\leq P_{m}^{\text{max}},~{}\forall m\in\mathcal{M}$	(29c)
	$\displaystyle I_{k}[t]\in\{0,1\},~{}\forall k\in\mathcal{K}$	(29b)
	$\displaystyle p_{m,k}\geq 0,~{}\forall m\in\mathcal{M},~{}k\in\mathcal{K},$	(29d)

where $\alpha$ is a weighting coefficient that combines the two competing objectives and can be used to prioritize them. That is, by setting $\alpha=0$ , we target maximizing the aggregate UTs’ data rate only, and by setting $\alpha=1$ , we target minimizing the handover rate only. Constraint (29a) is used to ensure that the served UTs satisfy their minimum rate level, where $R_{k}^{min}$ is the required minimum rate of UT $k$ . Constraint (29c) is expressed to ensure that the total power scaling factors of every SAP are within the required range, where $P_{m}^{max}$ is the maximum total value of transmit power of the $m$ th SAP. The binary value of the indicator variable, $I_{k}[t]$ , and the non-negative value of $p_{u,k}$ are imposed by constraints (29b) and (29d), respectively.

Since the decision variables contain continuous (i.e., the power coefficient) and discrete (i.e., the handover indicator) ones and observing its structure, this optimization problem is modelled as a mixed-integer non-linear program (MINLP). This kind of optimization problem is known to be generally NP-hard due to its combinatorial behaviour [55]. Therefore, exponential computational complexity is required to solve the problem in (29). This means that solving this problem optimally cannot be done in real-time. In the next section, we leverage deep learning techniques to build a model that can predict efficient solutions to this problem in a computationally efficient manner. This is one of several benefits provided by AI techniques, as discussed in the next section.

V AI-Based Solution

In this section, we utilize data-driven techniques to implement the cross-layer control designed in Section IV. This can overcome the complexity of solving the optimization problem in (29) by moving most of the computations to be offline and brings the other benefits of AI to the satellite network management. For example, AI-based techniques adapt to the changing network conditions, which is one of the main characteristics of LEO SatNets. In addition, as multiple LEO satellites cooperate to serve the UTs as a cluster, distributed processing can be employed to reduce the processing load on the SSNs. In this regard, machine learning techniques support distributed implementation to a large extent [56]. Therefore, implementing the proposed cross-layer control utilizing AI techniques can help in designing a more resilient satellite network.

Deep learning is one of the most efficient machine learning techniques, since it provides multi-layered models that are capable of learning efficient data representations from unstructured, complex datasets. Therefore, considering the optimized solution of the problem in (29) as the output of a certain mapping function, $f(\cdot)$ , that maps the input of the optimization problem (i.e., user channel coefficients and QoS parameters), we design a deep neural network (DNN) model that can “learn” this mapping function utilizing labelled training examples. For the latter, we use the traditional mathematical optimization tools to solve the problem in (29) offline for different inputs, and use it as the desired output of the DNN model. In doing so, we move the complexity of solving the optimization problem offline. We can then use the “trained” DNN model to predict the optimized solution for new inputs without the need to mathematically solve the optimization problem again for the new inputs. It is worth noting that a trained DNN model can predict the output with much lower complexity compared to solving the optimization problem, since it is mainly executed as a multiplication of matrices, as will be discussed in the sequel. It should be also noted that model-free machine learning techniques (e.g., deep reinforcement learning) can be utilized to predict the solution of the problem in (29) by directly interacting with the network environment and learning from the decisions. However, by doing so, we would not utilize the labelled data that can be generated based on the slow-varying large-scale fading parameters. This is in addition to the other drawbacks of using model-free algorithms as discussed in [57].

We determine the input and output vectors of the optimization problem in (29) as follows. The input vector is $\textbf{x}\in\mathbb{R}^{U_{0}},~{}U_{0}=MK+K+M$ , and is given accordingly:

\displaystyle\textbf{x}=\left[\{L_{m,k}\}_{m=1,k=1}^{m=M,k=K},~{}\{R_{k}^{\text{min}}\}_{k=1}^{K},~{}\{P_{m}^{\text{max}}\}_{m=1}^{M}\right]^{T}.

(30)

This input is used by the DNN model to predict the desired output of the optimization problem, which is $\textbf{z}\in\mathbb{R}^{U_{E}},~{}U_{E}=MK+K$ , and is calculated thus:

	z	$\displaystyle=f(\textbf{x})$		(31)
		$\displaystyle=\left[\{p^{}_{m,k}\}_{m=1,k=1}^{m=M,k=K},~{}\{I_{k}^{}\}_{k=1}^{K}\right]^{T},$		(32)

where $f(\cdot)$ is the function that represents the mapping process of the optimization problem in (29), and $p^{*}_{m,k}$ and $I_{k}^{*}$ are the desired values of power allocation and handover indicator, respectively. Therefore, the output of the DNN model is the predicted value for this desired vector, z. That is, $\hat{\textbf{z}}=\hat{f}(\textbf{x})$ , where $\hat{f}(\cdot)$ is the approximated mapping function.

The adopted DNN model is shown in Fig. 5. The DNN model is composed of $E$ layers that include an output layer and $E-1$ hidden layers. Each layer has a number of neurons or units. Assume that the $e$ th layer incorporates $U_{e}$ units. The output of the DNN model, $\hat{\textbf{z}}$ , can be calculated using a multi-layered matrix multiplication process. That is, the output of the $e$ th layer can be calculated as follows:

\displaystyle\textbf{z}^{(e)}=g^{(e)}\left(\textbf{W}^{(e)}\textbf{z}^{(e-1)}+\textbf{b}^{(e)}\right),

(33)

where $\textbf{z}^{(e)}$ is the output of the $e$ th layer (i.e., $\hat{\textbf{z}}=\textbf{z}^{(E)}$ ) and $g^{(e)}(\cdot)$ is the activation function that maps the output of the previous layer to the input of the subsequent one. In addition to the activation function, the parameters $\textbf{b}^{(e)}$ and $\textbf{W}^{(e)}$ are used to calculate the input of the activation function based on the output of the previous layer, where $\textbf{b}^{(e)}\in\mathbb{R}^{U_{e}}$ is the bias vector of the $e$ th layer and $\textbf{W}^{(e)}\in\mathbb{R}^{U_{e}\times U_{e-1}}$ is its weights matrix such that $w^{(e)}_{ij}$ is the weight of the edge from the $j$ th unit in layer $e-1$ to the $i$ th unit in layer $e$ . These parameters ( $\textbf{b}^{(e)}$ and $\textbf{W}^{(e)}$ ) are optimized to provide accurate predictions for the desired output vector. This optimization process is implemented during the training phase.

To train the DNN model (i.e., to optimize $\textbf{b}^{(e)}$ and $\textbf{W}^{(e)}$ ), a labelled dataset is constructed by using the optimal solution of the optimization problem in (29) at different inputs. It is worth mentioning that, alternatively, a suboptimal solution can be used, in case the optimal solution is too expensive to calculate. However, the DNN model would mimic the suboptimal procedure. Assume that the training set is composed of $T$ tuples $\{(\textbf{x}^{\{1\}},\textbf{z}^{\{1\}}),(\textbf{x}^{\{2\}},\textbf{z}^{\{2\}}),(\textbf{x}^{\{3\}},\textbf{z}^{\{3\}}),\cdots,\newline (\textbf{x}^{\{T)},\textbf{z}^{\{T\}})\}$ . Therefore, the DNN model is trained by solving the following unconstrained optimization problem:

\displaystyle\min_{\{\textbf{W}^{(e)}\},\{\textbf{b}^{(e)}\}}~{}~{}\frac{1}{T}\sum_{i=1}^{T}\mathcal{L}\left(\textbf{z}^{\{i\}},\hat{\textbf{z}}^{\{i\}}\right),

(34)

where $\mathcal{L}(\cdot)$ is a loss function that is selected to represent the error between the prediction and the ground truth of the training examples.

After the offline training phase, the DNN model becomes ready to provide efficient predictions for the optimized power allocation and handover indicator online based on the new input. The new input is mainly composed of the values of the large-scale parameters at each time instant. This online operation does not need re-training the DNN model for those new values as the DNN model learns from the training examples during the training phase.

V-A Complexity Analysis

During online operation, the DNN model calculates predictions for the desired output vector for the input values based on the experience gained in the training phase. For this purpose, the output of each layer is calculated as in (33). Therefore, the computational complexity to calculate $\textbf{z}^{(e)}$ is due to the multiplication operation and the activation function. Using the big-O notation, this computational complexity is in the order of $\mathcal{O}(U_{e}U_{e-1}+U_{e})=\mathcal{O}(U_{e}U_{e-1})$ . Therefore, the complexity of calculating the output of the DNN model is $\sum_{e=1}^{E}\mathcal{O}(U_{e}U_{e-1})$ . For example, if two hidden layers are used, and each layer has the same number of units as the input, then the computational complexity can be calculated as follows:

$\displaystyle\sum_{e=1}^{3}\mathcal{O}(U_{e}U_{e-1})$	$\displaystyle=\mathcal{O}(U_{0}^{2})+\mathcal{O}(U_{0}^{2})+\mathcal{O}(U_{0}U_{E})$	(35)
	$\displaystyle=\mathcal{O}(U_{0}^{2})+\mathcal{O}(U_{0}U_{E})$	(36)
	$\displaystyle=\mathcal{O}((MK+K+M)^{2})+$	(37)
	$\displaystyle\mathcal{O}((MK+K+M)(MK+K))$
	$\displaystyle=\mathcal{O}(M^{2}K^{2}).$	(38)

This is much lower than the exponential complexity of solving an NP-hard problem, such as (29). In addition, other benefits of using machine learning-based techniques are utilized, such as adaptability to the dynamics of the satellite link and topology, scalability, and support of distributed data processing and storage.

VI Simulation Results

TABLE II: Simulation Parameters

Parameter	Value
Satellites altitude	$550$ km
Antenna factor ( $\eta$ )	20 [52]
Carrier frequency	$30$ GHz
Shadowing std	$5$ dB
Noise figure	$7$ dB
Noise power spectral density	$-174$ dBm/Hz
Sat. max power ( $P_{m}^{\text{max}}$ )	$15$ dBW
Sat. antenna gain	$30$ dB
UT antenna gain	$5$ dB
Pilot power ( $q_{k}$ )	$5$ dBW
Coherence intervals: $\tau_{c},~{}\tau_{u}^{p}$	$300,~{}30$ samples
Number of runs	$10$
Number of UTs ( $K$ )	$10$
Distribution of UTs	Uniform
Number of antennas per satellite	1
Priority factor ( $\alpha$ )	$0.5$

Here, we present and discuss the simulation results to evaluate the performance of the proposed DM-MIMO-based architecture, optimized cross-layer design, and AI-based implementation. In addition, we compare the performance with that of baseline techniques from the literature.

We consider a set of $M$ -LEO satellites that serves a set of UTs distributed uniformly over a $1,000\times 1,000~{}\text{km}^{2}$ area. The values of the adopted simulation parameters are summarized in Table II. For the proposed D-JPAHM approach, this set of satellites is considered a cluster of SAPs, and each UT is connected to the whole cluster in a DM-MIMO manner. The CPU directs the UTs’ downlink data to the SAPs and determines the power allocation for each SAP-UT link according to the optimization problem in (29). Without loss of generality, we assume that conjugate beamforming is used to determine the beamforming vectors, as been used in several studies in the literature (e.g.,[28, 30, 36]). To solve this optimization problem, we use Matlab’s genetic algorithm (GA) solver. It should be noted that the sub-objectives of the problem in (29) should be normalized. This normalization along with the weighting factor ( $\alpha$ ) ensure that the combined objectives are on the same scale. As a baseline, we compare the proposed DM-MIMO-based architecture with traditional single-satellite connectivity. For the handover scheme, we consider two well-known techniques. In the first scheme, each UT connects to the LEO satellite with the best channel condition and switches to a different one when the new satellite has a better link condition [15]. This approach maximizes the throughput of each UT and is referred to as BestChannel. In the second single-satellite connectivity technique, each UT remains connected to the LEO satellite in its visibility as long as its achievable rate is higher than the minimum acceptable value $R_{k}^{\text{min}}$ . When this minimum value can not be achieved by the current satellite connection, the UT switches to the best channel LEO satellite [15]. This technique maximizes the service time for each UT (i.e., minimizes its handover rate) and is referred to as MaxServTime.

The average service time of the UTs is plotted against the number of satellites, $M$ , using the three techniques in Fig. 6. As we can see, the MaxServTime technique achieves higher service time than BestChannel, which is as we would expect since MaxServTime prioritizes the longer service time, not the achievable data rate. However, the proposed D-JPAHM approach achieves a much higher value given that the visibility of a cluster of satellites is always longer than that of a single satellite. Therefore, the average service time increases with the increase of the number of SAPs in the cluster.

Fig. 7 shows the average spectral efficiency (in bps/Hz) versus the number of SAPs using the three approaches. As we can see, the cooperative transmission, full frequency reuse, and optimized power allocation of the DM-MIMO-based architecture improved spectral efficiency and outperformed conventional single-satellite connectivity techniques. Furthermore, this spectral efficiency gain increased with the increase of the number of SAPs. Although BestChannel prioritized spectral efficiency, it could not compete with the DM-MIMO architecture, as shown in the figure. This spectral efficiency improvement could be exploited towards direct broadband connectivity of handheld devices.

TABLE III: DNN Model Parameters

Parameter	Value
Dataset size	$33,000$ samples
Training dataset percentage	$70\%$
Optimizer	Adam (learning rate = $0.01$ )
Batch size	$32$
Number of hidden layers	$2$
Number of units per layer	$40,~{}50$
Activation functions	Hard-sigmoid
Loss functions	MSE, Binary Cross-entropy
Training epochs	$50$

To evaluate the accuracy of the proposed DNN model discussed in Section V, we generated a dataset with simulations that are implemented in Matlab. We simulate a cluster of four satellites that serves a set of six UTs, which were uniformly distributed in the service area, and the positions were changed randomly at each simulation run to generate a total of $33,000$ samples of channel realizations for the UTs. The simulation parameters are summarized in Table II. Given the channel conditions of the UTs at every time instant, the optimization problem in (29) was solved using Matlab’s GA solver to construct the input and corresponding output of the dataset, as discussed in Section V. This offline-generated dataset was then used for the training and testing of the DNN model. The DNN model was implemented and tested on Python using TensorFlow and Sklearn packages. The DNN model parameters are summarized in Table III.

To evaluate the accuracy of the predictions of the DNN model, we used the mean squared error (MSE) of the predicted power allocation (normalized to $P_{m}^{\text{max}}$ ) and the F1 score for the prediction of the handover indicator. The F1 score is an efficient metric that is used widely in data science and binary classification problems [58]. The F1 score can be calculated using the following expression:

\displaystyle\text{F1 Score}=\frac{2\times\text{Precision}\times\text{Recall}}{\text{Precision}+\text{Recall}},

(39)

where

	$\displaystyle\text{Precision}=\frac{\text{True Positives}}{\text{True Positives}+\text{False Positives}},$		(40)
	$\displaystyle\text{Recall}=\frac{\text{True Positives}}{\text{True Positives}+\text{False Negatives}}.$		(41)

Figure 8 shows the MSE of the predicted power allocation $\{\hat{p}_{m,k}\}$ (normalized to $P_{m}^{\text{max}}$ ) versus the number of training epochs to represent the learning curve of the DNN model. This was plotted for both the training and testing datasets. As we can see, the predictions achieved a high accuracy compared to the optimized solutions at the end of the training process. In addition, the gap between the testing and training MSE was small, which means that there is no overfitting problem in this DNN model.

Similarly, the Precision, Recall, and F1 score of the predicted handover indicator, $\{\hat{I}_{k}\}$ , were plotted against the training epochs in Fig. 9 to evaluate the accuracy of the prediction. Again, the DNN model achieved accurate predictions for the handover process. Therefore, this shows that the DNN model can effectively mimic the implicit mapping function of the optimized cross-layer design developed in Section IV. However, it operates online with much lower complexity compared to that of solving the NP-hard problem in (29) using traditional mathematical optimization techniques. This means that the proposed cell-free, optimized cross-layer design can be executed in practice utilizing deep learning techniques.

VII Conclusions and Future Work

In this paper, we introduced a novel DM-MIMO-based LEO satellite network architecture and discussed its various design aspects, benefits, associated challenges, and potential solutions. Based on the proposed architecture, we developed an optimized cross-layer control framework, where the power allocation and handover management processes were jointly optimized. In addition, we introduced an AI-based implementation suitable for real-time operation and the dynamic environment of the LEO satellite network. The simulation results demonstrated that the proposed DM-MIMO-based architecture achieves better spectral efficiency for the network than baseline techniques and reduces the handover rate of the user terminals, exploiting the ultra-dense deployment of the LEO satellites. Moreover, the results showed that the deep learning-based implementation can predict efficient and accurate solutions for the proposed optimization framework. Therefore, the proposed architecture and solutions can enable future ultra-dense LEO satellite networks to be more efficient, resilient, and intelligent.

In future work, we will investigate the other aspects of system design based on this architecture, such as optimizing the clustering process, the synchronization between transmissions, and the outdated channel information. The potential solutions that were proposed in the literature for these challenges were discussed in Section III-B. However, further optimization needs to be investigated.

Acknowledgment

This work has been supported by the National Research Council Canada’s (NRC) High Throughput Secure Networks program (CSTIP Grant #CH-HTSN-607) within the Optical Satellite Communications Consortium Canada (OSC) framework.

References

[1] M. Y. Abdelsadek, H. Yanikomeroglu, and G. K. Kurt, “Future ultra-dense LEO satellite networks: A cell-free massive MIMO approach,” in IEEE International Conference on Communications Workshops (ICC Workshops), pp. 1–6, Jun. 2021.
[2] 3GPP, “Study on using satellite access in 5G (Release 16),” TR 22.822, v16.0.0, Jun. 2018.
[3] M. Centenaro, C. E. Costa, F. Granelli, C. Sacchi, and L. Vangelista, “A survey on technologies, standards and open challenges in satellite IoT,” IEEE Communications Surveys & Tutorials, vol. 23, no. 3, pp. 1693–1720, Third Quarter 2021.
[4] M. Shaat, E. Lagunas, A. I. Perez-Neira, and S. Chatzinotas, “Integrated terrestrial-satellite wireless backhauling: Resource management and benefits for 5G,” IEEE Vehicular Technology Magazine, vol. 13, no. 3, pp. 39–47, Sep. 2018.
[5] B. Di, H. Zhang, L. Song, Y. Li, and G. Y. Li, “Ultra-dense LEO: Integrating terrestrial-satellite networks into 5G and beyond for data offloading,” IEEE Transactions on Wireless Communications, vol. 18, no. 1, pp. 47–62, Jan. 2019.
[6] L. Gu, L. Yu, W. Li, and K. Zhao, “A publish-subscribe networking architecture for future manned deep space exploration,” China Communications, vol. 17, no. 7, pp. 38–51, Jul. 2020.
[7] 3GPP, “Study on new radio (NR) to support non-terrestrial networks (Release 15),” TR 38.811, v15.4.0, Sep. 2020.
[8] G. K. Kurt, M. G. Khoshkholgh, S. Alfattani, A. Ibrahim, T. S. Darwish, M. S. Alam, H. Yanikomeroglu, and A. Yongacoglu, “A vision and framework for the high altitude platform station (HAPS) networks of the future,” IEEE Communications Surveys & Tutorials, vol. 23, no. 2, pp. 729–779, Second Quarter 2021.
[9] 3GPP, “Study on architecture aspects for using satellite access in 5G (Release 17),” TR 23.737, v17.2.0, Mar. 2021.
[10] 3GPP, “Study on management and orchestration aspects with integrated satellite components in a 5G network (Release 17),” TR 28.808, v17.0.0, Mar. 2021.
[11] 3GPP, “Solutions for NR to support non-terrestrial networks (Release 16),” TR 38.821, v16.1.0, Jun. 2021.
[12] T. Darwish, G. Kurt, H. Yanikomeroglu, G. Lamontagne, and M. Bellemare, “Location management in IP-based future LEO satellite networks: A review,” arXiv preprint arXiv:2101.08336, Jun. 2021.
[13] B. Yang, Y. Wu, X. Chu, and G. Song, “Seamless handover in software-defined satellite networking,” IEEE Communications Letters, vol. 20, no. 9, pp. 1768–1771, Sep. 2016.
[14] Y. Li, W. Zhou, and S. Zhou, “Forecast based handover in an extensible multi-layer LEO mobile satellite system,” IEEE Access, vol. 8, pp. 42768–42783, Mar. 2020.
[15] Z. Wu, F. Jin, J. Luo, Y. Fu, J. Shan, and G. Hu, “A graph-based satellite handover framework for LEO satellite communication networks,” IEEE Communications Letters, vol. 20, no. 8, pp. 1547–1550, May 2016.
[16] Y. Wu, G. Hu, F. Jin, and J. Zu, “A satellite handover strategy based on the potential game in LEO satellite networks,” IEEE Access, vol. 7, pp. 133641–133652, Sep. 2019.
[17] R. Schwarz, A. Knopp, B. Lankl, D. Ogermann, and C. Hofmann, “Optimum-capacity MIMO satellite broadcast system: Conceptual design for LOS channels,” in 4th IEEE Advanced Satellite Mobile Systems, pp. 66–71, Aug. 2008.
[18] F. Yamashita, K. Kobayashi, M. Ueba, and M. Umehira, “Broadband multiple satellite MIMO system,” in IEEE Vehicular Technology Conference, vol. 4, pp. 2632–2636, Sep. 2005.
[19] C.-I. Oh, S.-H. Choi, D.-I. Chang, and D.-g. Oh, “Analysis of the rain fading channel and the system applying MIMO,” in IEEE International Symposium on Communications and Information Technologies, pp. 507–510, Oct. 2006.
[20] E. Erdogan, I. Altunbas, G. Kurt, M. Bellemare, G. Lamontagne, and H. Yanikomeroglu, “Site diversity in downlink optical satellite networks through ground station selection,” IEEE Access, vol. 9, pp. 31179–31190, Feb. 2021.
[21] L. Feng, Y. Liu, L. Wu, Z. Zhang, and J. Dang, “A satellite handover strategy based on MIMO technology in LEO satellite networks,” IEEE Communications Letters, vol. 24, no. 7, pp. 1547–1550, Jul. 2020.
[22] D. Goto, H. Shibayama, F. Yamashita, and T. Yamazato, “LEO-MIMO satellite systems for high capacity transmission,” in IEEE Global Communications Conference (GLOBECOM), pp. 1–6, Dec. 2018.
[23] L. You, K.-X. Li, J. Wang, X. Gao, X.-G. Xia, and B. Ottersten, “Massive MIMO transmission for LEO satellite communications,” IEEE Journal on Selected Areas in Communications, vol. 38, no. 8, pp. 1851–1865, Aug. 2020.
[24] L. You, K.-X. Li, J. Wang, X. Gao, X.-G. Xia, and B. Otterstenx, “LEO satellite communications with massive MIMO,” in IEEE International Conference on Communications (ICC), pp. 1–6, Jun. 2020.
[25] J. Arnau and C. Mosquera, “Dissection of multibeam satellite communications with a large-scale antenna system toolbox,” in 20th European Wireless Conference, pp. 1–6, VDE, 2014.
[26] R. T. Schwarz, T. Delamotte, K.-U. Storek, and A. Knopp, “MIMO applications for multibeam satellites,” IEEE Transactions on Broadcasting, vol. 65, no. 4, pp. 664–681, 2019.
[27] FCC, “Request for Modification of the Authorization for the SpaceX NGSO Satellite System,” Apr. 2021. Accessed: June 2, 2022. [Online]. Available: https://docs.fcc.gov/public/attachments/FCC-21-48A1.pdf.
[28] H. Q. Ngo, A. Ashikhmin, H. Yang, E. G. Larsson, and T. L. Marzetta, “Cell-free massive MIMO versus small cells,” IEEE Transactions on Wireless Communications, vol. 16, no. 3, pp. 1834–1850, Jan. 2017.
[29] G. Interdonato, E. Björnson, H. Q. Ngo, P. Frenger, and E. G. Larsson, “Ubiquitous cell-free massive MIMO communications,” EURASIP Journal on Wireless Communications and Networking, vol. 2019, no. 1, p. 197, Dec. 2019.
[30] Z. Chen and E. Björnson, “Channel hardening and favorable propagation in cell-free massive MIMO with stochastic geometry,” IEEE Transactions on Communications, vol. 66, no. 11, pp. 5205–5219, Jun. 2018.
[31] E. Björnson and L. Sanguinetti, “Making cell-free massive MIMO competitive with MMSE processing and centralized implementation,” IEEE Transactions on Wireless Communications, vol. 19, no. 1, pp. 77–90, Sep. 2019.
[32] E. G. Peters and C. R. Benson, “A Doppler correcting software defined radio receiver design for satellite communications,” IEEE Aerospace and Electronic Systems Magazine, vol. 35, no. 2, pp. 38–48, 2020.
[33] T. Xia, M. M. Wang, J. Zhang, and L. Wang, “Maritime internet of things: Challenges and solutions,” IEEE Wireless Communications, vol. 27, no. 2, pp. 188–196, 2020.
[34] P. A. Monte, M. Louie, and R. Wiedeman, “Mobile telephony through LEO satellites: To OBP or not,” in NASA. Lewis Research Center, Space Communications Technology Conference: Onboard Processing and Switching, 1991.
[35] Z. Tan, H. Qin, L. Cong, and C. Zhao, “Positioning using IRIDIUM satellite signals of opportunity in weak signal environment,” Electronics, vol. 9, no. 1, p. 37, 2019.
[36] Ö. Özdogan, E. Björnson, and J. Zhang, “Performance of cell-free massive MIMO with Rician fading and phase shifts,” IEEE Transactions on Wireless Communications, vol. 18, no. 11, pp. 5299–5315, Aug. 2019.
[37] E. Nayebi, A. Ashikhmin, T. L. Marzetta, H. Yang, and B. D. Rao, “Precoding and power optimization in cell-free massive MIMO systems,” IEEE Transactions on Wireless Communications, vol. 16, no. 7, pp. 4445–4459, May 2017.
[38] M. A. Vazquez, A. Perez-Neira, D. Christopoulos, S. Chatzinotas, B. Ottersten, P.-D. Arapoglou, A. Ginesi, and G. Tarocco, “Precoding in multibeam satellite communications: Present and future challenges,” IEEE Wireless Communications, vol. 23, no. 6, pp. 88–95, Dec. 2016.
[39] C. Perkins, D. Johnson, and J. Arkko, “Mobility support in IPv6,” IETF RFC 6275, Jul. 2011.
[40] S. Gundavelli et al., “Proxy mobile IPv6,” IETF RFC 5213, Aug. 2008.
[41] V. Ranjbar, A. Girycki, M. A. Rahman, S. Pollin, M. Moonen, and E. Vinogradov, “Cell-free mMIMO support in the O-RAN architecture: A PHY layer perspective for 5G and beyond networks,” IEEE Communications Standards Magazine, vol. 6, no. 1, pp. 28–34, 2022.
[42] A. S. Abdalla, P. S. Upadhyaya, V. K. Shah, and V. Marojevic, “Toward next generation open radio access networks–what O-RAN can and cannot do!,” IEEE Network, 2022.
[43] G. Giambene and S. Kota, “Cross-layer protocol optimization for satellite communications networks: A survey,” International Journal of Satellite Communications and Networking, vol. 24, no. 5, pp. 323–341, 2006.
[44] E. Björnson and L. Sanguinetti, “Scalable cell-free massive MIMO systems,” IEEE Transactions on Communications, vol. 68, no. 7, pp. 4247–4261, 2020.
[45] K.-U. Storek and A. Knopp, “Fair user grouping for multibeam satellites with MU-MIMO precoding,” in IEEE Global Communications Conference, pp. 1–7, IEEE, 2017.
[46] L. Pan, T. Jiang, L. Zhou, H. Xu, and W. Chen, “A research on high-precision time-synchronization and ranging system between satellites,” in International Conference on Microwave and Millimeter Wave Technology, vol. 2, pp. 926–929, IEEE, 2008.
[47] X. Jiuling, Z. Chaojie, W. Chunhui, and J. Xiaojun, “Approach to inter-satellite time synchronization for micro-satellite cluster,” Journal of Systems Engineering and Electronics, vol. 29, no. 4, pp. 805–815, 2018.
[48] S. Jayaprakasam, S. K. A. Rahim, and C. Y. Leow, “Distributed and collaborative beamforming in wireless sensor networks: Classifications, trends, and research directions,” IEEE Communications Surveys & Tutorials, vol. 19, no. 4, pp. 2092–2116, 2017.
[49] Y. Zhang, Y. Wu, A. Liu, X. Xia, T. Pan, and X. Liu, “Deep learning-based channel prediction for LEO satellite massive MIMO communication system,” IEEE Wireless Communications Letters, vol. 10, no. 8, pp. 1835–1839, 2021.
[50] A. U. Chaudhry and H. Yanikomeroglu, “Free space optics for next-generation satellite networks,” IEEE Consumer Electronics Magazine, vol. 10, no. 6, pp. 21–31, Dec. 2021.
[51] H. Q. Ngo, H. Tataria, M. Matthaiou, S. Jin, and E. G. Larsson, “On the performance of cell-free massive MIMO in Ricean fading,” in IEEE Asilomar Conference on Signals, Systems, and Computers, pp. 980–984, Oct. 2018.
[52] Y. Li, N. Deng, and W. Zhou, “A hierarchical approach to resource allocation in extensible multi-layer LEO-MSS,” IEEE Access, vol. 8, pp. 18522–18537, Jan. 2020.
[53] W. Feng, N. Ge, and J. Lu, “Coordinated satellite-terrestrial networks: A robust spectrum sharing perspective,” in 26th Wireless and Optical Communication Conference (WOCC), pp. 1–5, IEEE, 2017.
[54] J. Li, Y. Zhang, Y. Zhang, W. Xiong, Y. Huang, and Z. Wang, “Fast tracking Doppler compensation for OFDM-based LEO satellite data transmission,” in IEEE International Conference on Computer and Communications (ICCC), pp. 1814–1817, 2016.
[55] C. H. Papadimitriou and K. Steiglitz, Combinatorial Optimization: Algorithms and Complexity. Courier Corporation, 1998.
[56] S. Hu, X. Chen, W. Ni, E. Hossain, and X. Wang, “Distributed machine learning for wireless communication networks: Techniques, architectures, and applications,” IEEE Communications Surveys & Tutorials, vol. 23, no. 3, pp. 1458–1493, Third Quarter 2021.
[57] K. Arulkumaran, M. P. Deisenroth, M. Brundage, and A. A. Bharath, “Deep reinforcement learning: A brief survey,” IEEE Signal Processing Magazine, vol. 34, no. 6, pp. 26–38, 2017.
[58] H. Huang, H. Xu, X. Wang, and W. Silamu, “Maximum F1-score discriminative training criterion for automatic mispronunciation detection,” IEEE/ACM Transactions on Audio, Speech, and Language Processing, vol. 23, no. 4, pp. 787–797, Apr. 2015.