Spectrum Learning-Aided Reconfigurable Intelligent Surfaces for ‘Green’ 6G Networks
Abstract
In the sixth-generation (6G) era, emerging large-scale computing based applications (for example processing enormous amounts of images in real-time in autonomous driving) tend to lead to excessive energy consumption for the end users, whose devices are usually energy-constrained. In this context, energy-efficiency becomes a critical challenge to be solved for harnessing these promising applications to realize ‘green’ 6G networks. As a remedy, reconfigurable intelligent surfaces (RIS) have been proposed for improving the energy efficiency by beneficially reconfiguring the wireless propagation environment. In conventional RIS solutions, however, the received signal-to-interference-plus-noise ratio (SINR) sometimes may even become degraded. This is because the signals impinging upon an RIS are typically contaminated by interfering signals which are usually dynamic and unknown. To address this issue, ‘learning’ the properties of the surrounding spectral environment is a promising solution, motivating the convergence of artificial intelligence and spectrum sensing, termed here as spectrum learning (SL). Inspired by this, we develop an SL-aided RIS framework for intelligently exploiting the inherent characteristics of the radio frequency (RF) spectrum for green 6G networks. Given the proposed framework, the RIS controller becomes capable of intelligently ‘think-and-decide’ whether to reflect or not the incident signals. Therefore, the received SINR can be improved by dynamically configuring the binary ON-OFF status of the RIS elements. The energy-efficiency benefits attained are validated with the aid of a specific case study. Finally, we conclude with a list of promising future research directions.
I Introduction
The emerging sixth-generation (6G) network concept has to be “green”, while supporting ubiquitous artificial intelligence (AI)-assisted applications at a high energy-efficiency, including autonomous driving, unmanned aerial vehicle (UAV) aided logistics, rescue and surveillance, multi-access/mobile edge computing (MEC), and smart manufacturing [1]. However, there are grave technical challenges ahead. Specifically, given the limited transmission powers, the received signal-to-interference-plus-noise ratio (SINR) will be considerably degraded due to severe signal fluctuations caused by both multipath fading and the presence of large obstacles. Although opting for millimeter wave (mm-Wave) and even Terahertz (THz) carrier frequencies has the potential of significantly improving the data rate, these frequencies inevitably suffer from high propagation losses, thereby they may result in an excessive energy consumption.
In this context, reconfigurable intelligent surfaces (RIS) have emerged as a promising 6G solution for improving the quality of wireless links by appropriately reflecting the incident signals with the aid of a large number of nearly-passive reflecting elements at a low power consumption [2]. Specifically, an array of passive scattering elements can be controlled in a software-defined manner to adjust the electromagnetic properties (such as the phase shifts and potentially the amplitude) of the incident radio frequency (RF) signals. As a result, the received signal strength can be enhanced via jointly optimizing the phase shifts of the reflecting elements of RISs [3]. However, the interfering signals tend to dynamically fluctuate and a conventional RIS ‘blindly’ reflects both the desired and interfering signals. This may precipitate severe interference at the desired receiver and cause a deleterious effect. To address this challenge, it turns out that AI has a beneficial ameliorating potential in support of spectrum sensing, termed as spectrum learning (SL). This is achieved by intelligently identifying the characteristics of the wireless spectral environment. Hence, we envision that the convergence of SL and RIS will simplify network planning, paving the way for a green 6G ecosystem.
In this article, we develop a dynamic SL-aided RIS framework for boosting the advantages of intelligent signal reflection through learning the inherent features of the incident RF signals at the RIS controller. The proposed framework ‘moves’ the complexity of online detection to offline training, thereby controlling the binary ON-OFF status of RISs in real-time and improving the energy-efficiency of 6G networks. The remainder of this article is outlined as follows. The following section provides an overview of green 6G services and its emerging applications empowered by RISs. Then, we present the design of the SL-aided RIS framework conceived for improving its energy-efficiency. Following that, we evaluate the performance of the proposed framework. Finally, we conclude this article with a list of compelling research opportunities.
II Reconfigurable Intelligent Surfaces Meet Green 6G Networks
II-A RIS-aided Green 6G Wireless Communication Systems
Recent years have witnessed immense research efforts dedicated to RIS-aided wireless designs for improving the link quality and network coverage. Sophisticated 3D index modulation, holographic MIMO schemes, mm-Wave and THz solutions, are expected to efficiently support high-quality services and seamless connectivity for a massive number of mobile terminals. However, their excessive energy consumption and hardware costs pose design challenges due to the relatively high operating frequency, especially when many RF chains are required. Again, holographic MIMO surfaces attracted attention as a possible solution for realizing massive MIMO systems at a reduced cost and power consumption [4]. Notably, mobile devices are usually energy-constrained in practice, and the direct line-of-sight (LoS) link is often unavailable due to the occlusion of obstacles, when again, RISs come to rescue.
Objective | Learning Approach | Model Architecture | Key Contributions |
---|---|---|---|
Channel estimation | Deep learning [11] | Multi-layer perceptron (MLP) with multiple layers | Jointly optimize the transmit beamforming matrix and the phase shifts together with only a few active elements on the RIS |
Joint beamforming and phase shift design | Deep reinforcement learning [12] | Deep deterministic policy gradient (DDPG) with -layered actor and critic networks | Jointly optimize the transmit beamforming matrix and the phase shifts |
Joint UAV trajectory and phase shift design | Deep learning [13] | Deep Q-network (DQN) with 3-layered convolutional neural network (CNN) | Jointly optimize the movement of the UAV, phase shifts of the RIS, and power allocation policy from the UAV to users. |
Secure beamforming | Reinforcement learning [14] | Model-free | Jointly optimize the anti-jamming power allocation and reflecting beamforming without knowing the jamming model |
Transmission strategy of each UAV-user pair | Multi-task learning [15] | Two-task (classification and regression) learning model with three hidden layers | Jointly infer the optimal RIS elements allocation and RIS phase shifts of each UAV-user pair for energy-efficiency |
As illustrated in Fig. 1, an RIS-aided wireless communication system designed for energy efficient green 6G services typically supports a number of mobile devices. Each RIS element can be reconfigured by the RIS controller, which is connected to a central server (e.g., base station or access point) via a dedicated channel. To explore the potential of RIS-aided reflections, the following three steps are generally sequentially performed:
-
•
Channel estimation: In RIS-aided systems, there exist three kinds of wireless channels between the transmitter (Tx) and the receiver (Rx). By considering uplink transmission as an example, we have to estimate the wireless channels, including the Tx-RIS link(s), the RIS-Rx link(s), and the direct Tx-Rx link(s). To fully explore the potential gains brought about by RISs, the acquisition of accurate channel state information (CSI) becomes crucial, yet practically challenging.
-
•
RIS reconfiguration: Following the acquisition of CSI, both the phase shift and the amplitude of RIS elements can be dynamically reconfigured, for example, based on positive-intrinsic-negative (PIN) diodes. This has been widely adopted in practical implementations as a benefit of its low energy consumption and hardware cost.
-
•
Incident signal reflection: After channel estimation and RIS reconfiguration, the RIS reflects the incident signal(s) with the optimized phase shift and amplitude. At the receiver, the direct and reflected signals are combined to improve the received signal power.
II-B Emerging 6G Applications Empowered by RIS
Conventional wireless communications are undergoing a significant transformation with the convergence of AI and 6G, evolving from the “Internet-of-information” to the “Internet-of-intelligence” [5]. This shall unleash the full potential of machine learning in innovative data-driven applications across a wide variety of industries, such as self-driving cars, UAV-aided surveillance, edge intelligence and smart factories. However, to support the aforementioned AI-based applications, huge amount of data is required for training. Against this background, we shall describe four typical RIS-empowered 6G applications, including terrestrial vehicular communications, drone communications, mobile edge computing, and IoT communications, as illustrated in Fig. 1.
II-B1 Autonomous Driving
Video analytics relying on AI becomes an indispensable technique in support of autonomous driving, where a large amount of information generated by diverse onboard sensors (such as cameras, LiDAR sensors and radar sensors) has to be processed in near-real-time for collision avoidance in extremely complex surroundings. However, promptly exploiting a deluge of delay-sensitive sensors’ data requires substantial onboard computation resources and incurs high energy consumption [6]. To maintain safety for self-driving, RISs usher in a paradigm shift from connected-vehicles to pervasive connected-intelligence by enhancing the link quality of cellular vehicular-to-everything (V2X) communications in complex urban environments at a low power.
II-B2 Unmanned Aerial Vehicles
In recent years, UAVs, also commonly known as drones, have been extensively used for civilian, commercial, and military services in combination with AI [7]. However, the provision of sufficient energy for propulsion is a challenge, which may be mitigated by joint energy and data networking and laser charging [8]. In this context, RISs show significant promise in terms of reducing the communications-related power of UAVs.
II-B3 Mobile/Multi-access Edge Computing
At the time of writing, MEC is considered as a promising technique for offloading the processing of data to edge servers having much higher computing capabilities, thus enabling low-latency joint processing and communications. As a result, the AI processing is relocated from the central cloud to the network edge [9]. This implies that a large amount of data has to be transferred from the end devices to the edge server via band-limited uplink channels, which may become the bottleneck in MEC. In this context, RISs play a crucial role in mitigating the propagation-induced impairments with the aid of signal reflection.
II-B4 Smart Industry
The smart industry, or “Industry 4.0”, integrating AI and the industrial IoT, paves the way for intelligent production by interconnecting isolated industrial assets [10]. The wireless links are typically of non-line-of-sight (NLOS) nature, thereby leading to vulnerable links in dynamic environments. To circumvent this challenge, RISs improve the propagation environment by constructing signal reflections, thereby enhancing the robustness of wireless communications.
II-C Challenge of Undesired-Reflections via RISs
Despite their potential, RISs face numerous challenges, including the acquisition of accurate CSI, their reflection coefficients optimization, multi-user mobility management, and implementation issues, just to name a few. In particular, due to the unpredictable nature of interference, undesired reflections become a critical challenge, which has not been resolved yet. Specifically, most of the existing treatises assume either that no interference exists, which rarely occurs in practice, or that the interference is already known. But naturally, it is not trivial to estimate the dynamically fluctuating interference in wireless networks. In reality, the SINR at the receiver might even be degraded compared to a system dispensing with RISs, unless specifically addressed. But the challenge is that RISs are nearly passive surfaces with no active sensing capabilities for channel and interference estimation. As a remedy, combining AI with spectrum sensing becomes attractive, which enables conventional RISs to learn the characteristics of the wireless environment. Hence, increasing research interests have been inspired by RIS-aided applications empowered by AI, which are contrasted in Table I.
In the following section, we shall present a dynamically configured ON-OFF111In this paper, the ON-OFF status is referred to having the entire set of elements of the RIS to be ON or OFF. RIS control scheme based on spectrum learning, thereby enabling RISs to make intelligent decisions whether to reflect or not the incident signals, depending on the wireless spectrum environment inferred. Naturally, this intelligent action is only possible at the cost of incorporating intelligence into the RISs.
III Spectrum Learning-Aided RISs for Green 6G: Principles and Procedures
III-A Design Principle
In a conventional system, the RISs passively and indiscriminately reflect all the incident RF signals. However, this kind of blind reflection sometimes contaminates the desired signal at the receiver. Fortunately, AI-aided spectrum sensing has the potential of intelligently orchestrating the incident signals by relying on an appropriately trained convolutional neural network (CNN) of the RIS controller.
More explicitly, the surrounding spectral environment is explored and learned by an intelligent classification algorithm, explicitly identifying the desired user, whose signal should be beneficially reflected, and the interfering users, whose signal should not be reflected. We arrange for having only a few active RIS elements for the baseband processing of the incident signal at the RIS controller at a low overhead222When the ratio of the active elements to the total number of elements is set to only and for a mmWave GHz scenario and for a low-frequency GHz scenario, respectively, the achievable rates can be maximized [11].. For simplicity, we assume that the base station (BS) of Fig. 2 has perfect CSI knowledge for all wireless channels, and that this CSI is fed back to the RIS controller via a dedicated control channel. In particular, when an RF signal impinges upon an RIS, the RIS controller identifies the users that are included in the aggregate signal by using a well-trained CNN. Based on the inferred estimate, the RIS controller determines the ON-OFF status of the RIS for maximizing the SINR at the tagged BS. For example, the entire RIS is ‘turned OFF’, if the received SINR at the tagged BS with respect to (wrt) the RIS-aided link is lower than the corresponding SINR wrt the direct LOS link. Otherwise, the entire RIS is ‘turned ON’.
In the following, we will introduce the two stages of the proposed SL-aided RIS framework: the offline training stage and the online inference stage.
III-B Offline Training Stage
In the offline training stage, we have a pair of functional modules in support of spectrum learning.
III-B1 RF Trace Collection Module
We collected RF traces by building a universal software radio peripheral (USRP2) based testbed, which is connected via the Gigabit Ethernet to a host computer, as illustrated in Fig. 2. The RF trace collection relies on a transmit unit and a receive unit that communicate over a MHz channel at a carrier frequency of GHz. The traces are collected over the signal-to-noise ratio (SNR) range spanning from to dB. Since each USRP2 unit has one transmit antenna, multiple interfering signals are obtained by letting multiple USRP2 units transmit at the same time. In particular, the transmit unit includes a transmitter host computer performing the baseband signal processing and a transmitter USRP2 for up-conversion, digital-to-analog (D/A) conversion, and wireless transmission. Upon receiving the RF signal via the wireless channel, the receiver USRP2 first processes the signals ahead from the radio interface and then performs A/D conversion and down-conversion. Finally, the required baseband processing is carried out by the host computer, where the Inphase (I) and Quadrature (Q) sequences are stored for a wide range of SNRs to account for different interference scenarios. The acquired I/Q samples capturing the features of the incident RF signal are used for training the CNN model.
III-B2 CNN Training Module
In the proposed framework, spectrum learning is considered as a classification task, which is fulfilled via an appropriately trained CNN model. Specifically, based on the acquired RF traces that amount to approximately one billion I and Q samples, the CNN is trained offline based on a GPU cluster (e.g., the NVIDIA Tesla P100 GPU computing processor illustrated in Fig. 2) by using the Adam algorithm that utilizes the cross-entropy as the loss function. The CNN model trained consists of four layers: two convolutional layers that use the rectified linear unit activation function, two dense fully-connected layers and the output layer that uses the softmax activation function.
III-C Online Inference Stage
In the online inference stage, the RIS controller can dynamically adjust the ON-OFF status of the entire RIS to maximize the received SINR at the BS of Fig. 2 based on the appropriately trained CNN model.
III-C1 Spectrum Learning Module
For the spectrum learning module, we can directly infer the spectrum access information related to the desired and interfering users by extracting the resultant I/Q samples from a copy of the impinging signals and feeding them into the appropriately trained CNN that is deployed at the RIS controller. Specifically, as soon as the RF signal arrives at the RIS elements, it undergoes A/D conversion and frequency down-conversion. Then, baseband processing is executed and the extracted I/Q sequence is fed into the CNN model for spectrum access inference.
III-C2 ON-OFF RIS Control Module
The ON-OFF RIS control module is trained for avoiding undesired reflections by dynamically setting the binary ON or OFF status of the RIS elements. In particular, we assume that both the CSI and phase shifts of the RIS can be perfectly estimated and optimized at the BS, which feeds this information back to the RIS controller via a dedicated control channel333Channel estimation and phase shifts optimization of RIS have been widely investigated, hence they are not the main focus of this paper.. By disabling and enabling the RIS (i.e., RIS is turned OFF and ON), the SINR that is achieved at the BS can be calculated respectively. If the calculated SINR with disabling the RIS is no worse than the corresponding SINR when the RIS is enabled, the entire RIS is turned OFF. Otherwise, the entire RIS is turned ON.
IV Case Study and Performance Evaluation
In this section, we characterize the experimental performance of the proposed SL-aided ON-OFF RIS control scheme.
IV-A Experimental Scenario
For characterizing the proposed SL-aided RIS framework, we study the experimental scenario of Fig. 3, where the RIS reflects the signal arriving from both the desired user (denoted as ) as well as from the interfering user (denoted as ). As a result, the signal received at the BS becomes a superposition of the desired signal from and the interfering signal from . The impact of the interference is, in particular, caused by undesired-reflections by the RIS and it is determined by the angle between the incident signal from the desired user and the incident signal from the interfering user (i.e., in Fig. 3). For example, the interference effect caused by at the BS may become more severe if is close to , i.e., becomes small. In this case, the proposed SL-aided RIS framework enables the RIS controller to intelligently control the ON-OFF status of the RIS according to the wireless environment.
In the scenario of two users ( and ), we have four combinations of signals arriving at the RIS since each user has a binary state, i.e., active (also transmitting) or inactive (also idle): (1) “Idle”, indicating that both and are inactive; (2) “”, indicating that only is active, i.e., no interference occurs; (3) “”, indicating that only is active; and (4) “”, indicating that both and are active. Upon receiving the superimposed incident signal, the RIS controller has to identify the signal’s composition, which is a four-class classification problem readily solved by the trained CNN model444Although a scenario with one interfering user is considered in this article for illustrative purposes, the proposed SL-aided RIS framework can be extended to more general scenarios where multiple interfering users coexist.. Therefore, the proposed SL-based framework ‘moves’ the algorithmic and computational complexity to offline training while the complexity during the online inference is quadratic with time as a function of , where indicates the number of layers of the trained CNN model, is the number of neurons and indicates the total number of RISs.
IV-B Performance Evaluation
In this part, we first analyze the inference accuracy of the CNN model with experimental data, and then we evaluate the performance of the proposed SL-aided ON-OFF RIS control scheme.
IV-B1 Correct Inference Probability
The correct spectrum inference probability of the trained CNN model is demonstrated by considering the experimental scenario of Fig. 2 supporting contaminated by . During the CNN model training, the number of time series I/Q samples is set to , , and . The experimental results show that the correct inference probability obtained via the CNN model for the Class-2 (“”), the Class-3 (“”) and the Class-4 (“”) is around . Notably, the trained CNN model attains a perfect accuracy for the Class-1 (“Idle”) prediction due to the pattern distinguishable from the rest of the classes. The relatively high inference probability of the trained CNN model results in typically feeding an accurate spectrum identification estimate to the ON-OFF RIS control module.
IV-B2 SL-enalbed ON-OFF RIS Control
In the considered scenario, there is one desired user (), one interfering user (), and RISs. We assume that the RISs are equally spaced by in the vertical direction, each having elements. The amplitude reflection coefficient is and the impact of interference is inversely proportional to . As illustrated in Fig. 3, the RIS1-, RIS1-, and RIS1-BS distances are , , and , respectively. Furthermore, the angle of incidence between the BS and at the RIS is , while between and is . To demonstrate the energy-efficiency of the proposed SL-aided ON-OFF RIS control scheme, we evaluate a pair of benchmark schemes: 1) RIS always ON, and 2) RIS always OFF, where the transmission power of is dBm and the noise power is - dBm.
The achievable SINR (in dB) versus the angle of incidence is depicted in Fig. 4, where a single RIS is considered, and the transmit power of the interfering user is dBm and dBm. We observe from Fig. 4 that the benchmark schemes exhibit an opposite trend as increases. In particular, as increases, the interference contaminating the desired receiver through a reflection from the RIS is reduced. Therefore, it becomes more likely that the RIS is ON, thereby increasing the achievable SINR of the conventional ‘RIS always ON’ scheme. By contrast, the achievable SINR of the ‘RIS always OFF’ scheme reduces with due to the reduction of the distance between and BS. In this case, the interference reflected by the RIS becomes more pronounced. It can be observed that the SINR obtained by the proposed SL-aided RIS control scheme is consistently higher than that of the two benchmark schemes, which is the benefit of intelligently controlling the status of the RIS according to the dynamically fluctuating environment. The improvement of SINR via the proposed scheme is at the expense of increasing the RIS controller’s complexity, i.e., the complexity introduced by the CNN online inference and the ON-OFF RIS state configuration. However, the associated complexity only grows linearly with the number of RISs, which is acceptable in practice.
The impact of the number of RISs () on the achievable SINR is evaluated in Fig. 5, where we set dBm and is randomly selected from . Intuitively, we observe that the SINR achieved by the ‘RIS always OFF’ benchmark scheme remains unchanged, while that of the other two schemes increases with the number of RISs. Moreover, the SINR obtained by the proposed SL-aided RIS control scheme is significantly higher than that of the two benchmark schemes. In particular, when we use the same transmission power, compared to the ‘RIS always OFF’ and ‘RIS always ON’ case studies, our proposed solution improves the received SINR from about dB and dB to dB for . This demonstrates the potential of the proposed SL-aided RIS framework in terms of improving the energy efficiency of future 6G networks.
V Conclusions and Promising Research Directions
In this article, we discussed the potential of spectrum-learning in addressing the critical challenges of RIS solutions in 6G. Explicitly, we proposed an ON-OFF RIS control scheme for dynamically configuring the binary status of RISs. In this context, the beneficial role of SL in improving the energy efficiency was validated by exemplifying the achievable SINR and was compared to a pair of benchmark schemes.
Nonetheless, the proposed SL-aided RIS communication framework still remains largely unexplored, in particular, from its energy efficiency perspective. Achieving reliable SL necessitates the collection of a large amount of labeled RF sequences, and the appropriately trained model sometimes has to be fine-tuned or even retained, when the environment significantly changes, especially in vehicular and drone applications. This will lead to high energy consumption. To facilitate SL relying on a limited number of labeled RF sequences in a dramatically changing environment, advanced learning techniques, such as deep reinforcement learning, transfer learning and self-supervised learning may be explored in future works.
References
- [1] K. B. Letaief, W. Chen, Y. Shi, J. Zhang and Y. A. Zhang, “The Roadmap to 6G: AI Empowered Wireless Networks,” IEEE Communications Magazine, vol. 57, no. 8, pp. 84-90, Aug. 2019.
- [2] C. Pan, H. Ren, K. Wang, M. Elkashlan, M. Chen, M. Di Renzo, Y. Hao, J. Wang, A.L. Swindlehurst, X. You, and L. Hanzo, “Reconfigurable intelligent surface for 6G and beyond: Motivations, principles, applications, and research directions,” arXiv preprint arXiv:2011.04300, Nov. 2020.
- [3] M. Di Renzo, A. Zappone, M. Debbah, M. S. Alouini, C. Yuen, J. De Rosny, and S. Tretyakov, “Smart radio environments empowered by reconfigurable intelligent surfaces: how it works, state of research, and the road ahead,” IEEE Journal on Selected Areas in Communications, vol. 38, no. 11, pp. 2450-2525, Nov. 2020.
- [4] C. Huang, S. Hu, G. C. Alexandropoulos, A. Zappone, C. Yuen, R. Zhang, M. Di Renzo, and M. Debbah, “Holographic MIMO surfaces for 6G wireless networks: opportunities, challenges, and trends,” IEEE Wireless Communications, vol. 27, no. 5, pp. 118-125, Oct. 2020.
- [5] Y. Wu, K. Zhang and Y. Zhang, “Digital twin networks: a survey,” IEEE Internet of Things Journal, early access, doi: 10.1109/JIOT.2021.3079510.
- [6] B. Yang, X. Cao, K. Xiong, C. Yuen, Y. L. Guan, S. Leng, L. Qian, and Z. Han, “Edge intelligence for autonomous driving in 6G wireless system: design challenges and solutions,” IEEE Wireless Communications, vol. 28, no. 2, pp. 40-47, Apr. 2021.
- [7] K. Zhang, D. Si, W. Wang, J. Cao, and Y. Zhang, “Transfer learning for distributed intelligence in aerial edge networks”, IEEE Wireless Communications, to be published.
- [8] J. Hu, K. Yang, G. Wen and L. Hanzo, “Integrated Data and Energy Communication Network: A Comprehensive Survey,” IEEE Communications Surveys Tutorials, vol. 20, no. 4, pp. 3169-3219, Fourthquarter 2018.
- [9] K. Zhang, J. Cao and Y. Zhang, “Adaptive digital twin and multi-agent deep reinforcement learning for vehicular edge computing and networks,” IEEE Transactions on Industrial Informatics, early access, doi: 10.1109/TII.2021.3088407.
- [10] B. Yang O. Fagbohungbe, X. Cao, C. Yuen, L. Qian, D. Niyato, and Y. Zhang, “A joint energy and latency framework for transfer learning over 5G industrial edge networks,” IEEE Transactions on Industrial Informatics, early access, doi: 10.1109/TII.2021.3075444.
- [11] A. Taha, M. Alrabeiah and A. Alkhateeb, “Enabling Large Intelligent Surfaces With Compressive Sensing and Deep Learning,” IEEE Access, vol. 9, pp. 44304-44321, Mar. 2021
- [12] C. Huang, R. Mo and C. Yuen, “Reconfigurable intelligent surface assisted multiuser MISO systems exploiting deep reinforcement learning,” IEEE Journal on Selected Areas in Communications, vol. 38, no. 8, pp. 1839-1850, Aug. 2020.
- [13] X. Liu, Y. Liu and Y. Chen, “Machine Learning Empowered Trajectory and Passive Beamforming Design in UAV-RIS Wireless Networks,” IEEE Journal on Selected Areas in Communications, vol. 39, no. 7, pp. 2042-2055, Jul. 2021.
- [14] H. Yang, Z. Xiong, J. Zhao, D. Niyato, Q. Wu, H. V. Poor, and M. Tornatore, “Intelligent reflecting surface assisted anti-jamming communications: a fast reinforcement learning approach,” IEEE Transactions on Wireless Communications, vol. 20, no. 3, pp. 1963-1974, Mar. 2021.
- [15] X. Cao, B. Yang, C. Huang, C. Yuen, M. Di Renzo, D. Niyato, and Z. Han, “Reconfigurable intelligent surface-assisted aerial-terrestrial communications via multi-task learning,” IEEE Journal on Selected Areas in Communications, early access, doi: 10.1109/JSAC.2021.3088634.