This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

High-Speed Resource Allocation Algorithm Using a Coherent Ising Machine for NOMA Systems

Teppei Otsuka, Aohan Li, , Hiroki Takesue,
Kensuke Inaba, Kazuyuki Aihara, Mikio Hasegawa
Teppei Otsuka, Aohan Li and Mikio Hasegawa are with the Department of Electrical Engineering, Tokyo University of Science, Tokyo, 1258585, Japan. (E-mail: [email protected], [email protected], [email protected]).Hiroki Takesue and Kensuke Inaba are with NTT Basic Research Laboratories, NTT Corporation, Atsugi, Japan. (E-mail:{hiroki.takesue.km, kensuke.inaba.yg}@hco.ntt.co.jp).Kazuyuki Aihara is with International Research Center for Neurointelligence, The University of Tokyo, Tokyo, Japan. (E-mail:[email protected]).Corresponding Author: Aohan LiManuscript received XX XX, 2022; revised XX XX, 2022.
Abstract

Non-orthogonal multiple access (NOMA) technique is important for achieving a high data rate in next-generation wireless communications. A key challenge to fully utilizing the effectiveness of the NOMA technique is the optimization of the resource allocation (RA), e.g., channel and power. However, this RA optimization problem is NP-hard, and obtaining a good approximation of a solution with a low computational complexity algorithm is not easy. To overcome this problem, we propose the coherent Ising machine (CIM) based optimization method for channel allocation in NOMA systems. The CIM is an Ising system that can deliver fair approximate solutions to combinatorial optimization problems at high speed (millisecond order) by operating optimization algorithms based on mutually connected photonic neural networks. The performance of our proposed method was evaluated using a simulation model of the CIM. We compared the performance of our proposed method to simulated annealing, a conventional-NOMA pairing scheme, deep Q learning based scheme, and an exhaustive search scheme. Simulation results indicate that our proposed method is superior in terms of speed and the attained optimal solutions.

Index Terms:
Non-orthogonal multiple access, resource allocation, coherent Ising machine, mutually connected neural network

I Introduction

With the rapid increase in the number of mobile devices, there is an urgent need to increase the spectrum efficiency of mobile communications[1]. A non-orthogonal multiple access (NOMA) technique has been proposed as one of the candidate technologies to meet the requirements of the next generation of wireless networks [2, 3]. Unlike conventional orthogonal multiple access (OMA) schemes, NOMA schemes can effectively increase the spectrum efficiency by introducing extra-power-domains, which enables the data of multiple different users to be multiplexed on the same channel [3]. More specifically, by applying superposition coding in the transmitter and successive interference cancellation (SIC) technology in the receiver, signals from multiple users that are multiplexed can be distinguished.

In a NOMA system, resources such as channels and power must be appropriately allocated to improve the system performance, such as the data rate, and avoid system outages such as an SIC error [4, 5]. Previous studies have shown that optimal resource allocation (RA) can increase the total achievable data rate of a system in comparison with random RA [7]. Specifically, channel allocation methods that consider the differences in channel gain between users, and power allocation methods that balance system data rates and user fairness, are key aspects in a NOMA system design. In addition, it is important to allocate resources to avoid the failure conditions of SIC and maximize the performance, including the total data rate, quality of service (QoS), or fairness of the NOMA system. However, joint channel and power allocation problems have been proven to be NP-hard [8], and their high complexity hinders an efficient and reliable RA, resulting in a low performance of NOMA systems.

TABLE I: The abbreviations used in this paper
Abbreviation Description Abbreviation Description
ANN Attention-based Neural Network FD-NOMA Full Duplex-NOMA
BS Base Station IPSA Ideal Pairing Search Algorithm
BPNN Backpropagation Neural Network ML Machine Learning
BHD Balanced Homodyne Detection NOMA Non-Orthogonal Multiple Access
CNR Carrier-to-Noise Ratio OPA Optimal Power Allocation
CIM Coherent Ising Machine OMA Orthogonal Multiple Access
C-NOMA Conventional-NOMA PSA Phase Sensitive Amplifier
DQN Deep Q Learning QoS Quality of Service
DRL Deep Reinforcement Learning RL Reinforcement Learning
DOPO Degenerate Optical Parametric Oscillator RA Resource Allocation
D2D Device-to-Device SA Simulated Annealing
ES Exhaustive Search SIC Successive Interference Cancellation
FPGA Field Programmable Gate Array WLAN Wireless LAN

To solve the NP-hard RA optimization problems in NOMA systems, many approaches such as machine learning (ML) and heuristic approaches have been proposed [12, 13, 14, 9, 10, 11, 5]. Optimization using ML has the advantage of a rapid RA in large-scale problems by using trained models that have been learned in advance. However, a trained model may no longer be usable and must be retrained when the system environment changes. This has a disadvantage that the RA may not be sufficiently fast when the communication environments change. By contrast, optimization using heuristic methods is faster owing to a lower computational complexity; however, a fundamental trade-off emerges between the solution accuracy and computational complexity. By contrast, hardware-based search algorithms, such as quantum computers and Ising machines, have been proposed and are expected to solve various optimization problems at high speeds[15, 16, 17, 18, 19]. As described in [20], quantum technologies are emerging technologies that are expected to be deployed in 6G networks around 2030, although it is still unclear when a practical quantum computer will be available. D-wave[15] is a quantum annealing machine that has already been commercialized and is available for general users. The coherent Ising machine (CIM) [16, 17, 18, 19] is an optimization machine based on laser networks, which can deliver fair approximate solutions to combinatorial optimization problems at high speed on the order of milliseconds. Many NP-complete and NP-hard problems can be transformed into ground-state search problems of the Ising model [21]. Here, the CIM and D-Wave are Ising machines that can rapidly obtain a state near the ground state of the Ising Hamiltonian which represents a good solution to the optimization problem. As the main difference between the D-Wave and CIM, D-Wave reproduces a coupling of magnetic spins with a chimera topology having sparse connections, whereas in CIM, the Ising network of optical pulses generated by a laser oscillator is fully coupled[16, 17].

A previous study [18] demonstrated that the CIM is more effective than D-Wave for large-scale large-density problems, and it will be more suitable for the next-generation large-scale wireless communications. In [19], the authors showed that 100,000 fully connected spins can be implemented using the CIM. In previous studies [22, 23, 24, 25], various wireless communication systems are optimized using CIMs. A CIM-based optimization approach has been applied to solve the NP-hard optimization problems in wireless communications. In [26], it is shown that the optimal solutions obtained can be empirically scaled as O(1)O(1) using the CIM. It is therefore possible to achieve a real-time optimization of RA in a large-scale NOMA system using the CIM. In this paper, we propose a RA method for NOMA systems using the CIM. The main contributions of this paper can be summarized as follows.

  • We propose a high-speed RA method using a CIM for NOMA systems, especially for the channel assignment. The proposed method is a hardware-based method with superior speed (milliseconds-order) compared to other existing methods.

  • To the best of our knowledge, previous works related to RA using the CIM in wireless communication assumed that only one user could access each channel simultaneously. In contrast to previous studies, this study focuses on the RA optimization problem using the CIM for NOMA systems where multiple users can be multiplexed in the same channel. Hence, novel solutions are necessary to solve such problems which are different from those in our previous work. For instance, dummy users need to be introduced and the constraints are different, which leads to different strength of the interaction between spins and that of the external magnetic field on each spin. In this study, we formulated the RA combinatorial optimization problem carefully to meet the characteristics of the RA in the NOMA system and the feasibility of solving it using the CIM.

  • To solve the RA optimization problem in NOMA systems using the CIM, it is necessary to derive the parameters of the Ising model and introduce them into the CIM. Therefore, we first transformed the formulated RA combinational problem into a ground state search problem of the Ising Hamiltonian. Then, we derived the parameters of the Ising model that needed to be introduced into the CIM to obtain the solutions, i.e., the interaction between spins and the external magnetic field.

  • To verify the effectiveness of the proposed method, we first evaluated the convergence of the proposed method. Simulation results demonstrated the convergence of the proposed methods. Then, we evaluated the performance of the proposed method in the total data rate and compared it with that of other methods including simulated annealing (SA), exhaustive search (ES), the conventional-NOMA (C-NOMA), deep Q learning (DQN), and random methods. The simulation results indicate that our proposed approach can achieve higher data rates than other methods. The computation time of the proposed, SA, DQN and ES methods was evaluated, which demonstrates that the proposed method can obtain better performance at faster speed.

The rest of this paper is organized as follows. Section \@slowromancapii@ reviews previous related studies. Section \@slowromancapiii@ introduces the system model and problem formulation. Section \@slowromancapiv@ introduces the principle of the CIM in solving the optimization problems. Section \@slowromancapv@ presents the proposed CIM-based RA method for NOMA systems. Section \@slowromancapvi@ demonstrates the simulation results. Section \@slowromancapvii@ describes future work. Section \@slowromancapviii@ provides some concluding remarks regarding this research. The abbreviations used in this paper are listed in Table I.

II Related Work

In this section, we review the related studies on RA. We first review the heuristic RA approaches and ML-based RA approaches for NOMA systems. We then review the hardware-based RA, particularly for the existing studies on CIM-based RA in wireless communications that are not limited to NOMA RA problems.

II-A Heuristic RA Approaches

The high complexity of the RA problem in NOMA systems can be reduced using heuristic methods by dividing the problem into sub-problems [12, 9, 10, 11, 5]. In [5], the authors propose low complexity and efficient method for RA in a downlink NOMA system. In this study, user fairness, data rate, and energy efficiency are optimized. The RA problem is divided into the sub-problems of power allocation and channel allocation. The power allocation solution is derived from a theoretical analysis. The channel allocation is carried out based on the derived power allocation solution. In [9], the RA in full duplex-NOMA (FD-NOMA) systems is studied. In this study, the RA problem is successfully solved with low computational complexity by dividing the channel and power problems based on the block coordinate descent method. In [10], an algorithm for downlink NOMA systems that analytically examines the optimal power allocation (OPA) and ideal pairing search algorithm (IPSA) in terms of fairness was proposed. Simulation results indicate that the computational complexity is significantly reduced in comparison to the ES. In [11], NOMA-based device-to-device (D2D) communications are considered. It is shown that the communication performance can be improved to a significant extent using the NOMA technique. The RA solution is obtained with low computational complexity by applying an optimal power allocation after a channel allocation algorithm that assigns sub-carriers to the D2D groups. In [12], the RA in a NOMA system is optimized using the SA, which is a meta-heuristic algorithm. In this study, a joint optimization of the channel allocation, sub-channel power allocation, and inter-user power allocation is considered. Simulation results indicate that a sufficiently reliable solution in terms of system throughput can be obtained with low computational complexity in comparison to an ES.

As discussed above, we can see that the heuristic RA approaches can obtain a solution with low computational complexity. However, there is a fundamental trade-off between the performance of the solution and the computational complexity, and there have been no previous studies that can quickly optimize on the order of milliseconds.

II-B ML Based RA Approaches

With the development of ML applications in wireless communications, ML solutions for RA optimization problems in NOMA systems have been proposed [4, 13, 14]. In [4], deep reinforcement learning (DRL) based RA scheme is proposed for NOMA systems. Here, the framework of the DRL method is designed based on an attention-based neural network (ANN). Specifically, the joint optimization problem of channel and power allocation is formulated, and the channel allocation is learned according to the policy learned from the ANN. The simulation results indicate that the proposed method can achieve the same performance for six users with lower computational complexity in comparison to the ES. In [13], a DQN is adopted to achieve a joint RA for a large number of users, where reinforcement learning (RL) frames are used for the joint optimization of clustering, power allocation, and beamforming. Specifically, the clustering is adjusted using a DQN, and a backpropagation neural network (BPNN) is designed to learn the power coefficients for each cluster. Simulation results indicate that the performance of the proposed method is close to that of the ES. In [14], a method based on DRL is proposed to assign users to time-varying channels. Recently, DRL has been expected to be applied as a solution for such real-time dynamic decision-making problems in practical environments where the conventional RA algorithms are difficult to be applied. A complexity analysis and simulation results show that the proposed approach provides a superior performance with lower computational complexity in comparison to conventional optimization algorithms.

Therefore, ML-based RA methods can obtain near-optimal solutions with lower computational complexity than the ES. However, to obtain near-optimal solutions, a proper training time is required.

II-C Hardware-based RA

Optimization problems such as channel and power allocation are important not only in NOMA systems but also in a variety of other wireless communication systems. Optimization problems in wireless communication systems have recently become increasingly complex, and the demand for efficient solutions to meet the requirements of next-generation wireless communications has increased [28]. Hardware-based approaches, such as quantum computers and Ising machines, have been expected to be potential methods for solving optimization problems at high speed [15, 16, 17, 18]. In previous studies, several RA optimization problems in wireless communications have been studied using the CIM [22, 23, 24, 25]. In [23], the CIM is applied to RA problems in wireless communication systems such as IEEE802.11 wireless LAN systems and D2D communications. In addition, it has been shown that the CIM can approximately solve these optimization problems within the order of milliseconds. In [24], the CIM is applied to the optimization of scheduling problems for distributed antenna systems. In [25], the CIM is applied to the optimization problem of channel assignment in dense wireless LAN (WLAN) systems. In this study, the maximization of the system throughput in high-density WLAN systems using the CIM is considered. The performance of the proposed CIM-based channel assignment approach in comparison to other optimization approaches such as the SA and greedy algorithms is evaluated. Simulation results indicate that the CIM can obtain the optimal solution well with constant computational time, whereas other optimization algorithms increase the computational time and decrease the accuracy of the obtained solution as the number of users in the system increases.

In these studies, the CIM is assumed to work either as a base station (BS) or as a controller in the centralized systems. The ability of the CIM to operate at room temperature [29] and its simple components make it suitable for integration into many wireless transmission systems. Hence, the CIM is significantly more suitable for large-scale wireless networks if the size of the problem is up to the number of spins that can be achieved with the CIM.

In summary, the CIM may obtain optimal solutions to combinatorial optimization problems without the need for complex computer calculations. The principle of searching for optimal solutions using the CIM is based on the spontaneous energy reduction in physical phenomena. Existing research has demonstrated the higher speed of a CIM-based optimization. On the contrary, DRL or other ML algorithms require training samples to obtain the optimal solution, and trial and error to increase performance. In a real communication environment, such as a vehicle network, the environment changes frequently, and real-time optimization may not be possible for the ML algorithms because retraining is needed when the environment changes. In [4, 14], the complexity due to training in the ML algorithm was studied. In contrast to the complexity of training presented in these papers, the proposed CIM-based RA method can always solve the optimization problem at constant high speed, i.e., millisecond order [26]. Therefore, real-time RA can be achieved using our proposed method, which may improve the spectrum efficiency under dynamic changing environments.

III System Model and Problem Formulation

Refer to caption
Figure 1: System model.

We consider a downlink NOMA system where the BS transmits data to multiple users, as shown in Fig. 1.

Both the BS and users have a single antenna. At the BS, the transmitter sends signals with multiplexing the data of different users on the same channel. Assume that the signal to each user is multiplexed in one channel at the same time. At the receiver side, the SIC technology is used to restore the signal of each user. The BS is equipped with the CIM. Once the required communication information is gathered in the BS side, the channel will be allocated by the CIM. Let the set of users in the NOMA system be 𝐔={1,2,,u,,Nu}{\bf U}=\{1,2,\ldots,u,\ldots,N_{u}\} and the set of sub-channels be 𝐂={1,2,,c,,Nc}{\bf C}=\{1,2,\ldots,c,\ldots,N_{c}\}, where NuN_{u} and NcN_{c} are the numbers of the users and channels in the NOMA system. Note that, Nu>2N_{u}>2. Assume that the total bandwidth is BB while the bandwidth is divided into NcN_{c} sub-channels equally. Hence, the bandwidth of the ccth channel is Bc=B/NcB_{c}=B/N_{c}. The multiplexed signal on the ccth channel can be expressed as follows:

xc=i=1McPicbi,x_{c}=\sum_{i=1}^{Mc}\sqrt{P_{i}^{c}}b_{i}, (1)

where McM_{c} is the number of users multiplexed on the ccth channel, PicP_{i}^{c} is the power allocated to the iith user transmitted on the ccth channel, and bib_{i} is the transmission symbols of the iith user. At the receiver side, the multiplexed signal of the uuth user in the ccth channel can be expressed as follows:

yuc=Puchucbu+i=1iuMcPichicbi+zuc,y_{u}^{c}=\sqrt{P_{u}^{c}}h_{u}^{c}b_{u}+\sum_{{\begin{subarray}{c}i=1\\ i\neq u\end{subarray}}}^{M_{c}}{\sqrt{P_{i}^{c}}h_{i}^{c}b_{i}}+z_{u}^{c}, (2)

where zucz^{c}_{u} is additive white Gaussian noise with zero mean and variance σzc2\sigma^{2}_{z_{c}}. huch^{c}_{u} is the channel response between the BS and the uuth user, which can be calculated as follows:

|huc|2=(guc)2duα,\left|h_{u}^{c}\right|^{2}=\left(g_{u}^{c}\right)^{2}d_{u}^{-\alpha}, (3)

where gucg^{c}_{u} is a coefficient that follows a Rayleigh distribution, dud_{u} is the distance between the BS and the uuth user, and α\alpha is the path loss coefficient. The carrier-to-noise ratio (CNR) in the ccth channel of the uuth user is defined as follows:

Γuc=|huc|2σzc2.\Gamma_{u}^{c}=\frac{\left|h_{u}^{c}\right|^{2}}{\sigma_{z_{c}}^{2}}. (4)

Herein, we assume that the CNR of each user assigned to a channel cc is Γ1c>>Γuc>>ΓMcc\Gamma_{1}^{c}>\ldots>\Gamma_{u}^{c}>\ldots>\Gamma_{M_{c}}^{c}. In the NOMA protocol, higher power is allocated to the users with lower CNR [5, 6], i.e., P1c<<Puc<<PMccP_{1}^{c}<\ldots<P_{u}^{c}<\ldots<P_{M_{c}}^{c}. In addition, SIC is performed at the receiver side, which successively removes the signal of the user with the higher power. Therefore, the signal of the uuth user can be decoded by removing the signal of the iith user when Pic>PucP_{i}^{c}>P_{u}^{c}. The signal of the uuth user is considered as the interference of the iith user at that time. Based on the principle discussed above, when the SIC is fully working in the power domain NOMA, i.e., when there is a sufficient difference in gain between users that are multiplexed in the same channel, the signal-to-interference-plus-noise ratio of the uuth user can be calculated as follows:

γuc=PucΓuc1+i=1u1PicΓuc.\gamma_{u}^{c}=\frac{P_{u}^{c}\Gamma_{u}^{c}}{1+\sum_{i=1}^{u-1}P_{i}^{c}\Gamma_{u}^{c}}. (5)

The data rate achievable by the uuth user in the ccth channel can then be calculated as follows:

Ruc=Bclog2(1+PucΓuc1+i=1u1PicΓuc).R_{u}^{c}=B_{c}\log_{2}{\left(1+\frac{P_{u}^{c}\Gamma_{u}^{c}}{1+\sum_{i=1}^{u-1}P_{i}^{c}\Gamma_{u}^{c}}\right)}. (6)

Assuming that the total transmission power from the BS is PTP_{T}, the relation between the power coefficient PucP_{u}^{c} and total transmission power PTP_{T} can then be expressed as follows:

u=1Nuc=1NcPucPT.\sum_{u=1}^{N_{u}}\sum_{c=1}^{N_{c}}P_{u}^{c}\leq P_{T}. (7)

As the number of users assigned to the same channel increases, the complexity of SIC decoding at the receiver side increases, which leads to high complexity of the power allocation scheme to efficiently avoid SIC errors [5, 14]. SIC and power allocation are needed to be carefully considered if we assume that more than 2 NOMA access the same channel simultaneously, which is beyond the scope of this paper. The contribution of this paper is to achieve fast RA using the CIM for NOMA systems. Therefore, we focus on introducing how to use the CIM to solve the RA problem in NOMA. In other words, we consider the scenario where Mc=2M_{c}=2 for c𝐂\forall c\in\bf{C}. Thus, the data rate that can be achieved by any user in any channel can be expressed

RNOMAijk={Bclog2(1+PikjΓij),ifΓij>Γkj,Bclog2(1+PikjΓij1+PkijΓij),otherwise,\displaystyle R_{NOMA}^{ijk}=\begin{dcases}B_{c}\log_{2}{\left(1+P^{j}_{ik}\Gamma_{i}^{j}\right)},\ &\mathrm{if}\ \Gamma_{i}^{j}>\Gamma_{k}^{j},\\ B_{c}\log_{2}{\left(1+\frac{P^{j}_{ik}\Gamma_{i}^{j}}{1+P^{j}_{ki}\Gamma_{i}^{j}}\right)},\ &\mathrm{otherwise},\end{dcases} (8)

where for i𝐔,k𝐔,andj𝐂,i\in{\bf U},\ k\in{\bf U},\text{and}\ j\in{\bf C}, RNOMAijkR_{NOMA}^{ijk} is the data rate achieved by the iith user when multiplexed with the kkth user in the jjth channel, and PikjP^{j}_{ik} is the transmission power allocated to the iith user when the iith user is multiplexed into the jjth channel with the kkth user. In addition, when there is only one user ii is assigned to channel jj, the data rate of the iith user can be expressed as follows:

ROMAij=Bclog2(1+PijΓij),R^{ij}_{OMA}=B_{c}\log_{2}{\left(1+P^{j}_{i}\Gamma^{j}_{i}\right)}, (9)

where PijP_{i}^{j} is the transmit power allocated to the iith user on the jjth channel. Here, the expression of the relationship between the number of users and that of channels below must be stratified to transform the formulated problem into an Ising problem. That is, Nu=2NcN_{u}=2N_{c}. However, we assume that up to 2 users can be multiplexed to the same channel, which means that one or no users may be assigned to the channel. The transmissions will be OMA when only one user is assigned to the channel. Therefore, in addition to the user set 𝐔\bf U, we define the dummy user set as 𝐃={1,,Nd}{\bf D}=\{1,\ldots,N_{d}\}. If the user from set 𝐔\bf U is multiplexed with the user from set 𝐃\bf D, OMA transmission will be carried out by the user from set 𝐔\bf U, which means that the user in the set 𝐃\bf D is not a really existing user in the real world. As described above, we can understand that the total number of users and that of the dummy users is twice the number of channels i.e., Nd+Nu=2NcN_{d}+N_{u}=2N_{c}. Hence, the throughput achieved by users in the NOMA system can be redefined as follows:

Rikj={RNOMAijk,ifi,k𝐔,ROMAij,ifi𝐔,k𝐃,0,ifi𝐃,k.\displaystyle R^{j}_{ik}=\begin{dcases}R_{NOMA}^{ijk},\ \ \ \ &\mathrm{if}\ \ i,k\in\mathrm{\bf{U}},\\ R^{ij}_{OMA},\ \ \ \ &\mathrm{if}\ \ i\in\mathrm{\bf{U}},\ k\in\mathrm{\bf{D}},\\ 0,\ \ \ \ &\mathrm{if}\ \ i\in\mathrm{\bf{D}},\ \forall k.\\ \end{dcases} (10)

In this paper, our goal is to maximize the total data rate of the NOMA system. The objective function of this paper is expressed as follows:

maxx\displaystyle\genfrac{}{}{0.0pt}{}{\text{max}}{x} i=1Nu+Ndj=1Nck=1kiNu+Nd(Rikj+Rkij)xijxkj\displaystyle\sum_{i=1}^{N_{u}+N_{d}}\sum_{j=1}^{N_{c}}\sum_{{\begin{subarray}{c}k=1\\ k\neq i\end{subarray}}}^{N_{u}+N_{d}}\left(R^{j}_{ik}+R^{j}_{ki}\right)x_{ij}x_{kj} (11)
s.t.Rikj(Rij)min,fork,\displaystyle{\it s.t.}\;\;\ R^{j}_{ik}\geq\;(R_{ij})_{min},\ \ \mathrm{for}\ \ \forall k, (11.a)
i=1Nu+Ndj=1Nc(Pikj+Pkij)xijxkjPT,\displaystyle\ \ \ \ \ \ \sum_{i=1}^{N_{u}+N_{d}}\sum_{j=1}^{N_{c}}\left(P_{ik}^{j}+P_{ki}^{j}\right)x_{ij}x_{kj}\leq P_{T}, (11.b)
j=1Ncxij=1,fori,\displaystyle\ \ \ \ \ \ \ \ \sum_{j=1}^{N_{c}}x_{ij}=1,\ \ \mathrm{for}\ \ \forall i, (11.c)
i=1Nu+Ndxij=2,forj,\displaystyle\ \ \ \ \ \ \sum_{i=1}^{N_{u}+N_{d}}x_{ij}=2,\ \ \mathrm{for}\ \ \forall j, (11.d)

where xijx_{ij} represents the channel assignment variable, which can be expressed

xij={1,iftheithuserusesthejthchannel,0,otherwise.\displaystyle x_{ij}=\begin{cases}1,\ \ \ \ \mathrm{if\ the}\ i\mathrm{th}\ \mathrm{user}\ \mathrm{uses}\ \mathrm{the}\ j\mathrm{th}\ \mathrm{channel},\\ 0,\ \ \ \ \ \mathrm{otherwise}.\end{cases} (12)

Constraint (11.a) indicates the minimum data rate that the user should achieve. In addition, (Rij)min(R_{ij})_{min} in constraint (11.a) is the QoS constraint, which represents the minimum data rate that the iith user using the jjth channel must achieve. Constraint (11.b) indicates the power constraint, i.e., the total power allocated to the users should not exceed the total transmission power from the BS. Constraints (11.c) and (11.d) indicate that one user, and up to two users, can be multiplexed into a single channel, respectively.

In this paper, power allocation is optimized using the method in [5], whereas channel allocation is optimized using the CIM. Specifically, we maximize Eq. (11) while satisfying constraints (11.c) and (11.d) using the CIM, and the power factor considering constraints (11.a) and (11.b) is then optimized using the method described in [5]. The method of power allocation in [5] can maximize the total data rate while satisfying the QoS constraints and total power constraint, i.e., constraints (11.a) and (11.b) can be satisfied by allocating power using the method described in [5]. Here, we assume that Γij>Γkj\Gamma_{i}^{j}>\Gamma_{k}^{j} and i,k𝐔i,k\in\mathrm{\bf{U}}. Then, the optimal PikjP_{ik}^{j} and PkijP_{ki}^{j} in Eq. (11) can be derived as

Pikj=ΓkjqjAkj+1AkjΓkj,Pkij=qjPikj,\begin{split}&P_{ik}^{j}=\frac{\Gamma_{k}^{j}q_{j}-A_{k}^{j}+1}{A_{k}^{j}\Gamma^{j}_{k}},\\ &P_{ki}^{j}=q_{j}-P_{ik}^{j},\end{split} (13)

where Akj=2(Rkj)minBc>2A_{k}^{j}=2^{\frac{(R_{kj})^{min}}{B_{c}}}>2, and qjq_{j} is the transmission power from the BS allocated to the jjth channel. By using this equation, the signals can be successfully decoded as avoiding SIC failure conditions is considered. Note that if user i𝐔i\in{\bf U} is multiplexed with dummy user k𝐃k\in{\bf D} for OMA communication, PikjP_{ik}^{j} is set as a constant qjq_{j} because the power allocated to the dummy user is zero. The value of qjq_{j} maximizing the total data rate can be obtained using the following equations:

qj=[BcΛAkjΓij+AkjΓkj1Γkj]γj,γc=Akj(Aij1)Γij+Akj1Γkj,\begin{split}&q_{j}=\left[\frac{B_{c}}{\Lambda}-\frac{A_{k}^{j}}{\Gamma_{i}^{j}}+\frac{A_{k}^{j}}{\Gamma_{k}^{j}}-\frac{1}{\Gamma_{k}^{j}}\right]^{\infty}_{\gamma_{j}},\\ &\gamma_{c}=\frac{A^{j}_{k}\left(A^{j}_{i}-1\right)}{\Gamma_{i}^{j}}+\frac{A^{j}_{k}-1}{\Gamma_{k}^{j}},\end{split} (14)

where Λ\Lambda needs to be selected to satisfy j=1Ncqj=PT\sum^{N_{c}}_{j=1}q_{j}=P_{T}. []\left[\bullet\right] is a well-known water-filling form [5]. It means the power allocation solution for the formulated combinatorial optimization problem, i.e., Eq. (11) is uniquely determined if qjq_{j} is in the range [γj,+)\left[\gamma_{j},+\infty\right). In the following, we introduce the optimal channel allocation using the CIM.

IV Coherent Ising Machine

In this section, we first introduce the Ising Hamiltonian. Then, we present the measurement feedback CIM, which is a measurement-feedback type CIM that has been proposed to easily increase the number of couplings between spins to solve larger-scale problems. Finally, we investigate the operation process of the measurement feedback CIM.

IV-A Ising Hamiltonian

The CIM is an Ising-type machine that can artificially reproduce an Ising model, a physical model of magnetic spins. The Ising model consists of two states of magnetic spins: upward and downward. Spins are mutually coupled and subject to interactions from other spins and effects from external magnetic fields. We consider Ising spins with an N×MN\times M two-dimensional structure. Here, the energy function of the Ising model, i.e., the Ising Hamiltonian is expressed as follows:

E(σ)=12i=1Nj=1Mk=1Nl=1MJijklσijσkl+i=1Nj=1Mλijσij,E(\sigma)=-\frac{1}{2}\sum_{i=1}^{N}\sum_{j=1}^{M}\sum_{k=1}^{N}\sum_{l=1}^{M}{J_{ijkl}\sigma_{ij}\sigma_{kl}}+\sum_{i=1}^{N}\sum_{j=1}^{M}{\lambda_{ij}\sigma_{ij}}, (15)

where σij{1,+1}\sigma_{ij}\in\{-1,+1\} is the spin direction of the (i,j)(i,j)th spin, JijklJ_{ijkl} is the strength of the interaction between the (i,j)(i,j)th and (k,l)(k,l)th spins, and λij\lambda_{ij} is the strength of the external magnetic field on the (i,j)(i,j)th spin. Here, the CIM is the machine that can obtain the ground state of the Ising Hamiltonian at a high speed. In other words, by setting JJ and λ\lambda to correspond to the optimal solution of the optimization problem to the minimum of Eq. (15), we can approximately obtain a fast optimal solution using the CIM.

Refer to caption
Figure 2: Coherent Ising Machine [16, 17].

IV-B Measurement Feedback Coherent Ising Machine

Initially, the CIM was a laser network consisting of one master laser and multiple slave lasers[27]. Currently, a measurement-feedback type CIM has been proposed to easily increase the number of couplings between spins to solve larger scale problems[16, 17]. A schematic diagram of the measurement feedback-type CIM is shown in Fig. 2. As shown in Fig. 2, parameters of the Ising model JijklJ_{ijkl} and the λij\lambda_{ij}, parameters of the Ising model are pre-configured in a field programmable gate array (FPGA) module. The FPGA module calculates the feedback value using the pre-configured parameters and feeds it back to the original pulse. Therefore, interactions between spins can be programmatically reproduced, which makes it possible to fully connect spins. In the measurement-feedback CIM, the phase of the degenerate optical parametric oscillator (DOPO) pulse circulating on an optical fiber represents Ising spins σij\sigma_{ij}. In [19], it is shown that 100,000 DOPO pulses can be realized over 5 km of polarization-maintaining optical fiber and reach the reference score of the MAX-CUT problem for 100,000 nodes within a time of 593 µs.

IV-C Operation Process of the Measurement Feedback CIM

The CIM is an Ising machine that can rapidly obtain the ground state of the Ising Hamiltonian by getting the combinations of the Ising spin directions. The set of the Ising spin directions that minimize the Ising Hamiltonian corresponds to the solution of the combinational optimization problem. The operation process of the CIM can be summarized as follows. As shown in Fig. 2, the initial pulses are first generated by a phase sensitive amplifier (PSA) on the optical fiber. The initial pulses are repeatedly amplified by PSA with a pump pulse each cycle and become a DOPO pulse. The phase of the DOPO pulse circulating on the fiber during oscillation is used to reproduce the Ising spin on the optical fiber in the CIM. Here, gradually increasing the pump pulse amplitude until the threshold is exceeded; the DOPO pulse will oscillate and take on a π\pi or 0 phase representing the orientation of the Ising spin σij\sigma_{ij}, where σij{1,+1}\sigma_{ij}\in\{-1,+1\}. 0 phase represents the up spin, i.e., σij=1\sigma_{ij}=1. Meanwhile, the π\pi phase represents the down spin, i.e., σij=1\sigma_{ij}=-1 While the DOPO pulse is being amplified, a part of the DOPO pulse on the fiber is separated by a coupler, and the in-phase amplitude is measured through a balanced homodyne detection (BHD). The measured in-phase amplitude σij\sigma_{ij} is input into the FPGA module. A feedback signal Jijklσij+λij-\sum{J_{ijkl}\sigma_{ij}}+\lambda_{ij} is calculated using the mutual coupling between Ising spins JijklJ_{ijkl} and external magnetic field λij\lambda_{ij}. Finally, the feedback signal is injected into the original DOPO pulse as a feedback pulse through the intensity modulator and push-pull modulator. By repeating the operation process of the CIM in this manner, we can eventually obtain a combination of σij\sigma_{ij} that minimizes Eq. (15), which corresponds to the optimal solution to the optimization problem.

From the above, to obtain the optimal solution of optimization problem using the CIM, we need to transform the objective function into the form of an Ising Hamiltonian, and then derive the corresponding JijklJ_{ijkl} and λij\lambda_{ij} of the objective function. Then, by pre-configuring the derived these parameters into the FPGA module of the CIM, we can search for the ground state of the Ising Hamiltonian, which corresponds to the optimal solution to the optimization problem. In the next section, we introduce the problem transformation and the derivation of the corresponding JijklJ_{ijkl} and λij\lambda_{ij} for our formulated objective function in Eq. (11).

V Applying the CIM to the RA Problem in NOMA

To solve the combinatorial optimization problem in the CIM, we must set JijklJ_{ijkl} and λij\lambda_{ij} in the FPGA module of the CIM. However, it is difficult to transform the formulated objective function, i.e., Eq. (11), directly into an Ising Hamiltonian using the {1,+1}\{-1,+1\} variables corresponding to the value of the Ising spin of the CIM. We therefore first express Eq. (11) using binary variables of {0,1}\{0,1\}, and then transform it into the form of an Ising Hamiltonian using the {1,+1}\{-1,+1\} variables. In this study, we use a mutually connected neural network to represent Eq. (15) using binary variables {0,1}\{0,1\}. Mutually connected neural networks have been proposed as algorithms for solving various minimization problems. In practice, the traveling salesman problem is solved using this type of neural network in[30], and can also be applied to the problem defined in Eq. (11). The mutually connected neural network and the Ising Hamiltonian have similar energy structures; therefore, after formulating Eq. (11) in the mutually connected neural network, we can derive the parameters of the Ising Hamiltonian, JijklJ_{ijkl} and λij\lambda_{ij}, by converting the output from {0,1}\{0,1\} into {1,+1}\{-1,+1\}. In the following, we first introduce mutually connected neural networks. We then present how to solve RA in NOMA using a mutually connected neural network. Finally, we present the RA in a NOMA system using the CIM.

V-A Mutually Connected Neural Network

Let us consider a neural network with a two-dimensional structure of N×MN\times M; the mutually connected neural network has the following energy structure:

E(x)=12i=1Nj=1Mk=1Nl=1Mwijklxijxkl+i=1Nj=1Mθijxij,E(x)=-\frac{1}{2}\sum_{i=1}^{N}\sum_{j=1}^{M}\sum_{k=1}^{N}\sum_{l=1}^{M}{w_{ijkl}x_{ij}x_{kl}}+\sum_{i=1}^{N}\sum_{j=1}^{M}{\theta_{ij}x_{ij}}, (16)

where xij{0,1}x_{ij}\in\{0,1\} is the state of the (i,j)(i,j)th neuron, wijklw_{ijkl} is the coupling weight between the (i,j)(i,j)th and (k,l)(k,l)th neurons, and θij\theta_{ij} is the firing threshold of the (i,j)(i,j)th neuron. From the similarity of the energy structure of the Ising Hamiltonian and that of the mutually connected neural network, i.e., Eqs. (15) and (16), the parameters of the CIM, JijklJ_{ijkl} and λij\lambda_{ij}, can be derived using the parameters (i.e., wijklw_{ijkl} and θij\theta_{ij}) of the mutually coupled neural network. Here, the state of a neuron in a network is defined by the following equation:

xij(t+1)=u[k=1Nl=1Mwijklxklθij],x_{ij}(t+1)=u\left[\sum_{k=1}^{N}\sum_{l=1}^{M}{w_{ijkl}x_{kl}-\theta_{ij}}\right], (17)

where uu is the Heaviside step function, namely, u[y]=0u[y]=0 for y0y\leq 0, and u[y]=1u[y]=1 for y>0y>0. The energy function of the mutually connected neural network, i.e., Eq. (16) always decreases and converges to a locally minimum value by updating the neurons iteratively according to Eq. (17) if the following conditions are satisfied. 1). The self-connections of all neurons are zero, i.e., wijij=0w_{ijij}=0. 2). The mutual connections are symmetric, i.e., wijkl=wklijw_{ijkl}=w_{klij}.

To solve the optimization problem using a mutually connected neural network, we must formulate the optimization problem as an objective function using a neuron xijx_{ij}. Then, the formulated objective function is compared with Eq. (16) to derive wijklw_{ijkl} and θij\theta_{ij}. By iteratively updating the neurons according to Eq. (17) using the derived parameters wijklw_{ijkl} and θij\theta_{ij}, Eq. (16) converges to a local minimum value, and the states of the neurons at that time correspond to the approximate optimal solution to the optimization problem. In the next subsection, we define neurons xijx_{ij} and derive wijklw_{ijkl} and θij\theta_{ij} for optimizing the RA in a NOMA system.

V-B Mutually Connected Neural Network to Solve RA in NOMA System

To optimize the RA problem in a NOMA system using a mutually connected neural network, we define the binary variables in Eq. (12) as neurons. Then, our formulated object function Eq. (11) can be transformed into the energy function of the mutually connected neural network, which is expressed as follows:

E1=i=1Nu+Ndj=1Nck=1kiNu+Nd(Rikj+Rkij)xijxkj=i=1Nu+Ndj=1Nck=1Nu+Ndl=1Ncδjl(1δik)(Rikj+Rkil)xijxkl,\begin{split}E_{1}&=\sum_{i=1}^{N_{u}+N_{d}}\sum_{j=1}^{N_{c}}\sum_{{\begin{subarray}{c}k=1\\ k\neq i\end{subarray}}}^{N_{u}+N_{d}}{-\left(R_{ik}^{j}+R_{ki}^{j}\right)}x_{ij}x_{kj}\\ &=\sum_{i=1}^{N_{u}+N_{d}}\sum_{j=1}^{N_{c}}\sum_{k=1}^{N_{u}+N_{d}}\sum_{l=1}^{N_{c}}{-\delta_{jl}\left(1-\delta_{ik}\right)\left(R_{ik}^{j}+R_{ki}^{l}\right)x_{ij}x_{kl}},\end{split} (18)

where δij\delta_{ij} is Kronecker’s delta, namely, δij=1\delta_{ij}=1 when i=ji=j, and δij=0\delta_{ij}=0, otherwise. In addition, the constraints (8.c) and (8.d) can be formulated as constraint terms using neurons, which can be expressed as follows:

E2=i=1Nu+Nd(j=1Ncxij1)2=i=1Nu+Ndj=1Nck=1Nu+Ndl=1Ncδik(1δjl)xijxkli=1Nu+Ndj=1Ncxij,\begin{split}E_{2}&=\sum_{i=1}^{N_{u}+N_{d}}\left(\sum_{j=1}^{N_{c}}x_{ij}-1\right)^{2}\\ &=\sum_{i=1}^{N_{u}+N_{d}}\sum_{j=1}^{N_{c}}\sum_{k=1}^{N_{u}+N_{d}}\sum_{l=1}^{N_{c}}{\delta_{ik}\left(1-\delta_{jl}\right)x_{ij}x_{kl}}-\sum_{i=1}^{N_{u}+N_{d}}\sum_{j=1}^{N_{c}}{x_{ij}},\end{split} (19)
E3=j=1Nc(i=1Nu+Ndxij2)2=i=1Nu+Ndj=1Nck=1Nu+Ndl=1Ncδjl(1δik)xijxkl3i=1Nu+Ndj=1Ncxij.\begin{split}E_{3}&=\sum_{j=1}^{N_{c}}\left(\sum_{i=1}^{N_{u}+N_{d}}{x_{ij}-2}\right)^{2}\\ &=\sum_{i=1}^{N_{u}+N_{d}}\sum_{j=1}^{N_{c}}\sum_{k=1}^{N_{u}+N_{d}}\sum_{l=1}^{N_{c}}{\delta_{jl}\left(1-\delta_{ik}\right)x_{ij}x_{kl}}-3\sum_{i=1}^{N_{u}+N_{d}}\sum_{j=1}^{N_{c}}{x_{ij}}.\end{split} (20)

Note that the above constraints term can be fully satisfied even if the number of users in the cell NuN_{u} is less than 2Nc2N_{c}, since we define number of the dummy users NdN_{d} to always satisfy Nu+Nd=2NcN_{u}+N_{d}=2N_{c}. Using E1E_{1}, E2E_{2}, and E3E_{3}, the energy function of the RA optimization problem in a NOMA system can be obtained as follows:

ENOMA=ϵE1+ζE2+ηE3,E_{NOMA}=\epsilon E_{1}+\zeta E_{2}+\eta E_{3}, (21)

where ϵ\epsilon, ζ\zeta, and η\eta are parameters that adjust the scaling of each term, which are used to adjust the objective and constraint terms, causing the energy structure to more likely reach the optimal solution. By comparing Eq. (21) with Eq. (16), the coupling weight wijklw_{ijkl} and firing threshold θij\theta_{ij} for solving the RA optimization problem in a NOMA system using mutually connected neural networks can be obtained as follows:

wijkl=2{ϵδjl(1δik)(Rijk+Rkli)ζδik(1δjl)ηδjl(1δik)},\begin{split}w_{ijkl}=&2\{{\epsilon\delta}_{jl}\left(1-\delta_{ik}\right)\left(R_{ijk}+R_{kli}\right)-\zeta\delta_{ik}\left(1-\delta_{jl}\right)\\ &-\eta\delta_{jl}\left(1-\delta_{ik}\right)\},\end{split} (22)
θij=(ζ+3η).\theta_{ij}=-\left(\zeta+3\eta\right). (23)

Using wijklw_{ijkl} and θij\theta_{ij} obtained in Eqs. (22) and (23), the neurons were updated according to Eq. (17). Thereby, the combination of neurons minimizes the energy function in Eq. (16), i.e., the optimal solution to the NOMA RA problem can be obtained. However, owing to the monotonically decreasing nature of the energy function, this neural network is likely to fall into a local minimum of the network state. Hence, the optimal solutions may not be achieved using the mutually connected neural network. Further, it has been shown that the Ising Hamiltonian near the optimal solution can be reached with high probability by amplifying the DOPO pulse to near the oscillation threshold in CIM [31]. Therefore, in the next subsection, we will use the derived parameters wijklw_{ijkl} and θij\theta_{ij} to derive JijklJ_{ijkl} and λij\lambda_{ij} which are the parameters used to solve the optimization problem in the CIM.

V-C CIM Used to Solve RA in NOMA System

As previously noted, the Ising Hamiltonian of the CIM and the energy function of the mutually connected neural network have similar energy structures. As a difference, the neuron xij{0,1}x_{ij}\in\{0,1\} in the mutually connected neural network has states of 0 and 1, whereas the Ising-spin σij{1,+1}\sigma_{ij}\in\{-1,+1\} in the Ising model has states of -1 and +1. To solve the RA problem using the CIM, we redefine the output of the neurons in a mutually connected neural network as -1 and +1. Using the neuron x~ij{1,+1}\tilde{x}_{ij}\in\{-1,+1\} with a redefined output, the update equation for the neuron is expressed as follows:

x~ij(t+1)=r[k=1Nl=1Nw~ijklx~klθ~ij],\tilde{x}_{ij}(t+1)=r\left[\sum_{k=1}^{N}\sum_{l=1}^{N}{\tilde{w}_{ijkl}\tilde{x}_{kl}-\tilde{\theta}_{ij}}\right], (24)

where r[y]=1r[y]=-1 for y0y\leq 0, and r[y]=+1r[y]=+1 for y>0y>0, and w~ijkl\tilde{w}_{ijkl} and θ~ij\tilde{\theta}_{ij} are the coupling weight and firing threshold of the neural network with output {1,+1}\{-1,+1\}, respectively. We then can obtain the following equation by transforming the internal state of Eq. (17) using x~ij=2xij1\tilde{x}_{ij}=2x_{ij}-1:

k=1Nl=1Nwijklxklθij=k=1Nl=1Nwijkl2x~kl(θijk=1Nl=1Nwijkl2).\begin{split}&\sum_{k=1}^{N}\sum_{l=1}^{N}{w_{ijkl}x_{kl}-\theta_{ij}}\\ &=\sum_{k=1}^{N}\sum_{l=1}^{N}{\frac{w_{ijkl}}{2}\tilde{x}_{kl}}-\left(\theta_{ij}-\sum_{k=1}^{N}\sum_{l=1}^{N}{\frac{w_{ijkl}}{2}}\right).\end{split} (25)

By comparing Eqs. (16) and (25), we can obtain the following equations for w~ijkl\tilde{w}_{ijkl} and θ~ij\tilde{\theta}_{ij}, respectively.

w~ijkl=Jijkl=wijkl2,\tilde{w}_{ijkl}=J_{ijkl}=\frac{w_{ijkl}}{2}, (26)
θ~ij=λij=θijk=1Nl=1Nwijkl2.\tilde{\theta}_{ij}=\lambda_{ij}=\theta_{ij}-\sum_{k=1}^{N}\sum_{l=1}^{N}{\frac{w_{ijkl}}{2}}. (27)

By setting these derived parameters in the FPGA module, a fast RA optimization of the NOMA system using the CIM can be performed.

Refer to caption
Figure 3: Overview of our proposed CIM-based RA method

Then, we present the details of the optimization procedure of RA using the CIM in NOMA systems. In this study, the BS performs channel allocation using the CIM for the NOMA system to achieve the optimal total reachable data rate. A schematic of the optimization procedure of the NOMA system is shown in Fig. 3. For simplicity, we present a NOMA system with 4 users and 2 channels as an example. As shown in Fig. 3, each user transmits the data to the BS. Then, the channel assignment is determined by the CIM equipped in the BS. There are DOPO pulses on the optical fiber of the CIM, each of which behaves as an Ising spin. Because this problem involves 4 users and 2 channels, the RA problem can be solved using the CIM implemented with 2×4=82\times 4=8 DOPO pulses on the fiber. When the CIM is operated, the states of the in-phase components shown on the left side of Fig. 3 are obtained. Here, the cycle number represents the number of times the DOPO pulse travels around the optical fiber of the CIM. As the cycle count increases, the in-phase component of the amplitude of the DOPO pulses increases and eventually splits into positive or negative values, i.e., 0 phase or π\pi phase. In the CIM, the spin orientation is determined by the in-phase components of the amplitude. Here, when user ii uses channel jj, the spin is +1, i.e., an upward spin. Thus, the obtained spin combination corresponds to the channel assignment. Finally, based on this channel assignment, the BS sends a signal to the users after assigning the optimal power using Eq. (13). The pseudo-code for the algorithm is presented as Algorithm 1.

Algorithm 1 Optimization of RA Using the CIM in NOMA System
0:  Distance between user ii and the BS did_{i}, and the channel coefficient of user ii in channel jj gijg_{i}^{j}
0:  channel allocation σij\sigma_{ij}, power allocation qjq_{j}, PikjP_{ik}^{j}.
1:  Initialize:
2:      Initialize the channel response hijh_{i}^{j} and the CNR Γij\Gamma_{i}^{j} according to (3), (4)
3:      Set the power allocation PikjP_{ik}^{j} according to (13) with qj=1.0jq_{j}=1.0\ \ \forall j.
4:      Calculate data rate for users according to (10).
5:  Channel allocation by the CIM:
6:      Calculate the JijklJ_{ijkl} and λij\lambda_{ij} according to (26), (27) and setting these parameters to FPGA.
7:      Obtaining spins that minimizes the Ising Hamiltonian σij\sigma_{ij} by the CIM as described in Section IV.
8:  Optimal power allocation:
9:      qjq_{j} is obtained by water filling algorithms [5] and re-calculate PikjP_{ik}^{j}.

V-D Complexity Analysis of the CIM

In this subsection, we discuss the time and computational complexity of the proposed method. In [26], it is demonstrated that the CIM takes a constant amount of time to obtain the ground state as long as the fiber length is constant. That is, the time complexity of the CIM is 𝒪(1)\mathcal{O}(1) experimentally. The reason is that the computation time of one cycle for the CIM is related to the length of a fiber. For the CIM with 2000 spins and 1km fiber, 1000 cycles running of the CIM can be achieved within 5 ms while the spins can converge to the ground state within 1000 cycles by amplifying pump pulse amplitude appropriately. In our paper, the problems can be solved using the CIM with 2000 spins. Hence, the computation time is within 5 ms.

Then, we discuss the computation complexity of the main computation part of the CIM, i.e., FPGA. Ref. [16] shows that the memory resources required for matrix computation and power consumption in FPGAs are scaled by 𝒪(N2)\mathcal{O}(N^{2}). A typical CPU runs 10-100Gflop per second, so its power efficiency is often below 1 Gop/J. Similarly, GPUs also suffer from high power consumption. Hence, CPUs and GPUs are difficult to apply to applications that require low power consumption [39]. In addition, although parallel processing is possible for modern CPUs with multi-core, unnecessary parts also need to be parallelized since the entire core is used. In the case of FPGA, only necessary processing needs to be programmed. Hence, parallel processing can be performed without wasting computation resources, increasing the computation efficiency. Moreover, owing to the merit of the parallel processing of the FPGA, the FPGA uses less circuit area when performing the same processing compared to the CPU/GPU and can perform processing without resource waste. It is natural with high power efficiency. In summary, our proposed method may be superior to the resource allocation method computed using the traditional computer in computation complexity.

VI Performance Evaluation

In this section, we evaluate the performance in the convergence, data rate, and computation time of our proposed CIM-based RA for a NOMA system and compare it with the SA (a meta-heuristic optimization method), a conventional-NOMA pairing scheme (C-NOMA)[7], a DRL and the ES methods. Specifically, the RA is applied as follows. First, the channels are allocated using each method. The optimal power allocation is then applied using Eqs. (13) and (14). The following of this section is organized as follows. First, the simulation settings is introduced. Then, the comparison methods are presented. Next, the performance in convergence, data rate, computation time of our proposed method and the comparison with the other methods are evaluated.

VI-A Simulation Settings

In the simulation, a circular cell of a NOMA system with a radius of 500 m is considered. A BS is placed at the center of the cell. Users are randomly placed in the cell. Table II shows the detailed parameter setting used in our simulation.

TABLE II: Simulation parameters
Parameters Values
Total Bandwidth BB 5.0 MHz
Minimal data rate of each users RminR_{min} 2 bps/Hz
Cell radius 500 m
Minimal distance between user and BS 50 m
Maximum number of multiplexes users 2 users
Noise power spectral density -170 dBm/Hz
Refer to caption
Refer to caption
Refer to caption
Figure 4: Convergence of in-phase components of DOPO pulse (left) and the corresponding Ising Hamiltonian at that time (center) and comparison of the total data rate obtained by the CIM-based RA with the optimal and the OMA methods at that time (right) in the NOMA system with Nu=12N_{u}=12 users and Nc=6N_{c}=6 sub-channels.

We use the following simulation model of the CIM to evaluate our proposed method:

dcijdt=(1+pcij2sij2)cij+k=1Nl=1NJijklcklλij,\frac{dc_{ij}}{dt}=\left(-1+p-c_{ij}^{2}-s_{ij}^{2}\right)c_{ij}+\sum_{k=1}^{N}\sum_{l=1}^{N}{J_{ijkl}c_{kl}}-\lambda_{ij}, (28)
dsijdt=(1pcij2sij2)sij+k=1Nl=1NJijklsklλij,\frac{ds_{ij}}{dt}=\left(-1-p-c_{ij}^{2}-s_{ij}^{2}\right)s_{ij}+\sum_{k=1}^{N}\sum_{l=1}^{N}{J_{ijkl}s_{kl}}-\lambda_{ij}, (29)

where cijc_{ij} and sijs_{ij} are the in-phase and quadrature-phase amplitude of the (i,j)(i,j)th optical pulses, respectively, and pp is the pump pulse, which is used to amplify cijc_{ij}. By running the simulation with a gradual increase in pp, the in-phase and quadrature components of the DOPO pulse can be obtained. As described in Section \@slowromancapiv@, the measurement feedback of the CIM reproduces the Ising spins using the amplitudes of DOPO pulses cycling over long fibers. In other words, the simulation model can be used to reproduce the behavior of the CIM and obtain the combination of spins that minimizes the Hamiltonian. Ising spin σij\sigma_{ij} is implemented by cijc_{ij}: If cij<0c_{ij}<0, then σij=1\sigma_{ij}=-1, whereas if cij>0c_{ij}>0, then σij=+1\sigma_{ij}=+1.

VI-B Compared Methods

VI-B1 SA-based RA

In SA-based RA, the Boltzmann machine model[32] is used as a computational model. The Boltzmann machine is a type of mutually connected neural network that incorporates the statistical behavior. Specifically, the output function of the mutually connected neural network is a sigmoid function with temperature TT introduced, and the probability that the value of neuron X{0,1}X\in\{0,1\} changes with the temperature TT. In our simulation, the initial temperature of the SA was set as Tini=5.0T_{ini}=5.0, and is cooled according to the following equation for each iteration tt as described in [33]:

T(t)=Tinilog(1+t),T(t)=\frac{T_{ini}}{log(1+t)}, (30)

where T(t)T(t) is the temperature of the SA at iteration tt. The number of SA iterations is measured as 10,000.

VI-B2 C-NOMA

In the C-NOMA, users are divided into a set of near-users that are located near the center of the cell and far-users that are located far from the center of the cell based on criteria such as the distance or channel gain. Thereafter, channels are allocated by pairing one user from the near-user set and one user from the far-user set, and assigning them to the same channel.

VI-B3 DRL-based RA method

As a DRL-based RA method for comparison, we designed a DQN method to solve our formulated RA optimization problem shown in Eq. (11). In the designed DQN method, channel assignment is based on the following information at the decision time tt. CSI matrix information CSItNu×Nc\textbf{CSI}_{t}\in\mathbb{R}^{N_{u}\times N_{c}}, user assignment variables xtNu×Nc\textbf{x}_{t}\in\mathbb{R}^{N_{u}\times N_{c}}, user to be assigned to the channel in the next step AtNu\textbf{A}_{t}\in\mathbb{R}^{N_{u}}, the achieved data rate of each user in the last step RatetNu\textbf{Rate}_{t}\in\mathbb{R}^{N_{u}}. That is, the state at the decision time tt, which is denoted as StS_{t} is defined as St=[CSIt,xt1,At,Rt1]S_{t}=[\textbf{CSI}_{t},\textbf{x}_{t-1},\textbf{A}_{t},\textbf{R}_{t-1}]. Here, the action is defined as the channel allocation, i.e., the channel allocated to which user. The policy used in our designed method is ϵ\epsilon-greedy method while the reward is defined as the data rate [14]. The Q network and Target network of DQN consist of 3 layers, and the size of each layer is |St||S_{t}|, 256, and NcN_{c}, respectively. RMSProp is used to update the parameters of neural network, and the learning rate is set to 10410^{-4}.

VI-B4 ES

In the ES, the channel allocation is the optimal solution by exhaustive searching.

Refer to caption
Figure 5: Total data rate versus number of users NuN_{u} in the NOMA system with total transmit power PT=12P_{T}=12 from the BS.
Refer to caption
Figure 6: Total data rate versus total transmit power PTP_{T} from the BS with number of users Nu=12N_{u}=12 and number of sub-channels Nc=6N_{c}=6 in the NOMA system.

VI-C Convergence of the Proposed Method

First, we evaluated the convergence of the proposed method in the NOMA system with 12 users and 6 subchannels, i.e., Nu=12N_{u}=12 and Nc=6N_{c}=6. Figure 4 shows the time evolution of the in-phase components of the DOPO pulses for each cycle and the corresponding energy of the Ising Hamiltonian at that time. Here, for the optimization time, we used the time required for 1000 cycles on an actual CIM machine as shown in [16].

The subfigures on the left side, in the center, on the right side of Fig. 4 are the in-phase amplitude of the spins, the corresponding Ising Hamiltonian, and the total data rate of the three RA methods in the NOMA system, respectively. From Fig. 4, we can see that when the in-phase amplitude of the spins converges, the minimum Ising Hamiltonian reaches, while the total data rate of the CIM-based RA achieves the optimal value. This means that in the process of the CIM operations, the spins spontaneously selected a combination of spins that exhibited the ground state of the Ising Hamiltonian, which corresponds to the solutions to the formulated optimization problem. Hence, the convergence of our proposed method has been verified.

In addition, it is shown in [19] that for certain MAX-CUT problems, the real CIM machines can provide a stable state approximately 1000 times faster than SA using state-of-the-art CPUs. Thus, our proposed method has a significant advantage in convergence speed. Moreover, in [40], it has been demonstrated that the phase of the DOPO pulse during oscillation converges to the ground state within 5 ms by amplifying the DOPO pulse with an appropriate pump pulse amplitude. The setting of the pump pulse amplitude in real CIM is out of the scope of this manuscript, while considering the space of the manuscript, we omit the detailed analytical proof process, which can be found in [19], [32], [40]. In this paper, we set the pump pulse amplitude, i.e., pp in equations (28) and (29), to an appropriate value by parameter search during the simulations. Thus, the in-phase amplitude of the DOPO pulse can converge within 5ms, as shown in Fig. 4.

VI-D Performance Evaluation of the Data Rate under Static Environment

VI-D1 Data rate vs the number of users

Then, we evaluated the total data rate when the number of users NuN_{u} and that of subchannels NcN_{c} are varied. Note that we consider that up to 2 users are multiplexed in each subchannel. Therefore, as NcN_{c} increases, NcN_{c} also increases as Nc=Nu/2N_{c}=\lceil N_{u}/2\rceil. In other words, NcN_{c} is varied from 12 to 30 in this simulation, while at the same time NcN_{c} is varied from 6 to 15. In addition, the transmission power PTP_{T} is set to 12. Figure 6 shows the simulation results. From Fig. 6, we can see that our proposed method obtained the highest total data rate among the compared methods. This indicates that the most suitable joint channel and power allocation is possible using the proposed method. In addition, we can observe that the NOMA methods achieve a higher total data rate than the existing OMA method. During this simulation, we set up to 30 users in the NOMA system. As NuN_{u}s increased, the number of combinations of user-channel pair increased, making it difficult to obtain an optimal solution using the ES. By contrast, Nc×Nu=450N_{c}\times N_{u}=450 spins are required when solving this RA problem with 30 users and 15 channels using the CIM. Because the actual CIM can currently solve problems with up to 100,000 spins[19], this RA problem can be solved using the CIM. In other words, using the CIM, it is possible to obtain an effective solution to the RA optimization problem in a large-scale NOMA system that cannot be solved using the ES at a speed on the order of milliseconds.

VI-D2 Data rate vs transmit power

Next, we evaluated the total data rate of the NOMA system when varying the total transmission power from the BS, i.e., PTP_{T}. In this simulation, the numbers of users and sub-channels in the NOMA system are set to Nu=12N_{u}=12 and Nc=6N_{c}=6, respectively. Here, PTP_{T} is set to 2, 4, 6, 8, 10, and 12 Watts. Figure 6 shows the simulation results. From Fig. 6, we can see that our proposed CIM-based method can obtain a reachable optimal solution regardless of the transmission power from the BS PTP_{T}.

Refer to caption
(a) α=3.0\alpha=3.0
Refer to caption
(b) α=4.0\alpha=4.0
Figure 7: Total data rate versus number of sub-channels NcN_{c} with number of users Nu=12N_{u}=12 in the NOMA system.
Note that the sub-carrier width changes as the number of sub-channels changes.
Refer to caption
(a) α=3.0\alpha=3.0
Refer to caption
(b) α=4.0\alpha=4.0
Figure 8: Total data rate versus number of users NuN_{u} with number of sub-channels Nc=5N_{c}=5 in the NOMA system.

VI-D3 Data rate vs the number of channels

Next, we evaluated the total data rate for varying number of channel NcN_{c}. In this simulation, the transmission power from the BS is set to PT=12P_{T}=12. The number of users is set to Nu=12N_{u}=12. NcN_{c} is varied from 6 to 10. In this study, the total bandwidth B=5.0B=5.0 MHz is constant, and thus the sub-carrier width is 5.0/Nc5.0/Nc MHz. Note that this means that the sub-carrier width changes as the number of sub-channels changes. Figure 7 shows the simulation results when the path loss coefficient α\alpha is set to 3 and 4. From Fig. 7 , we can see that the proposed method can achieve the same total data rate as the ES even when NcN_{c} increases. Hence, we can say that the performance of the proposed method is comparable to that of the ES. However, when NcN_{c} increases, the number of channel-user combinations highly increases. At that time, the total data rate of the random allocation method decreases rapidly. This indicates that the optimization using the CIM can result in an effective channel allocation. In addition, the total data rate decreases as NcN_{c} increases. One of the reasons for this is that the power allocated per channel decreases as NcN_{c} increases because the power from the BS is constant. Another reason to be considered can be summarized as follows. When NcN_{c} increases, the bandwidth for each sub-channel become small, whereas the OMA is applied in more channels. By contrast, when NcN_{c} decreases, the bandwidth for each sub-channel increases while NOMA is applied in more channels. Because the NOMA technique is more efficient than the OMA technique, the total data rate decreases with the increase in the number of sub-channels.

VI-D4 Data rate vs the number of users under fixed number of channels

Then, we evaluated the total data rate for the varying number of users NuN_{u}. In the simulation, NuN_{u} is varied from 5 to 10. The number of sub-channels is also set to Nc=5N_{c}=5. Under this setting, the OMA and NOMA coexist when NuN_{u} is less than 10. The transmission power from the BS is then set to PT=12P_{T}=12. figure 8 shows the simulation results when the path loss coefficient α\alpha is set to 3 and 4. From Fig. 8, we can see that the proposed method can achieve the same total data rate as the ES even when NuN_{u} increases. This indicates that the CIM can achieve the optimal channel selection even when the NOMA and OMA coexist. In addition, the total data rate of the system increases as NuN_{u} increases. As the reason for this, as NuN_{u} increases, the number of channels to be multiplexed increases. In other words, the advantages of the NOMA system were fully utilized.

Refer to caption
Figure 9: Comparison of computation time.

VI-E Comparison of the Computation Time

Then, we evaluate the computation time of the SA, the ES, the DQN-based, and the CIM-based RA method under the varying number of users. Here, the computation time of the SA and the ES is evaluated using an Intel Xeon Gold 5222 CPU @ 3.80 GHz and C language, and the DQN-based RA is evaluated using the same CPU and Python with PyTorch, respectively. The number of users in the NOMA system NuN_{u} is set as 6, 8, 10, 12, 14, 16, and 18 in the performance evaluation. The numbers of SA iterations under corresponding different NuN_{u} were set as 10, 50, 100, 500, 1000, 5000, and 10000 times, respectively. In addition, the number of training episodes for the DQN-based RA under different NuN_{u} is set as 100, 250, 400, 550, 700, 850, and 1000 times, respectively. Here, each episode consists of 120 steps. The BS (Base Station) collects information and trains the neural network for each step. The information consists of the CSI information, the channel assignment information, and the achieved data rate of the previous step of each user. The reason for the settings of the SA and DQN-based RA is that the larger the scale of the problem is, the larger number of iterations and training episodes required for SA and DQN-based RA methods. Here, owing to the enormous amount of computation time involved for the ES with the growth of the number of users, the method was evaluated for up to 14 users. Fig. 9 shows the evaluation results. The computation time of the real CIM in Fig. 9 is the time cost for RA using the CIM in the real world. In [16], it has been shown that the Ising problem can be solved using the CIM in 5 ms regardless of the problem size when the length of the fiber is 1-km-long. The computation time of DQN (train) and DQN (test) represents that for training the neural network and testing (making decisions) using the trained neural network, respectively. As shown in Fig. 9, the CIM-based and DQN-based RA for testing methods demonstrate almost constant computation time even as the number of users in the system increases. In contrast, other algorithms require a much longer time as the problem size increases. However, the DQN-based RA method requires training to obtain a good solution when the communication environment changes. The training time increases with the number of users in the NOMA system. Thus, as described above, the CIM is superior in speed compared to other methods.

VI-F Performance Evaluation of the Data Rate under Dynamic Environment

Finally, we evaluate the performance of the proposed method when more realistic user activation is considered. In the simulation, the user activity follows the setup for the urban case in 3GPP TR 36.885 [34, 35]. In this scenario, the initial locations of the users are randomly generated according to the spatial Poisson process. Each user drives on the road at a constant speed. The speed of each user is generated randomly between 36km/h to 54km/h. The number of lanes is 2 in each direction and 4 in total in each street. The road grid size by the distance between intersections is 433m*250m. The simulation area size is 1299m*750m. When the user moves to the intersection, the direction is changed with a certain probability that follows the uniform distribution. In the performance evaluation, the number of users is set to 10, 12, 14, and 16. The time interval between each decision making is set to 20ms [14]. In this scenario, the wireless environment changes faster than the resource allocation using the ES, SA, and DQN (train) methods due to their long computation time, as shown in Fig. 9. These methods may no longer apply to mobile NOMA systems. Hence, we only evaluate the performance of the CIM-based and the DQN (test)-based RA methods under these settings. The evaluation results in terms of the ratio of the total data rate and the optimal solution are shown in Fig. 10. From Fig. 10, we can see that the CIM-based RA can achieve a higher data rate than the DQN (test)-based RA regardless of the number of users in the NOMA system. The reason can be summarized as follows. For the DQN (test)-based RA method, the time consumption for resource allocation is longer than the time interval between the decision-making. However, the user location and channel information are changed after the time interval. Hence, the information used by the DQN (test)-based RA method is the previous information that is different from the current information when allocating resources. On the other hand, the time consumption for the CIM-based RA method is shorter than the time interval between the decision-making. The CIM-based RA method can allocate resources using the current information. Hence, the optimal solution of the RA can almost be achieved by the CIM-based RA method. In summary, the proposed CIM-based RA method is applicable to the NOMA system with mobile users.

Refer to caption
Figure 10: The ratio of the data rate to the optimal solution in the NOMA system with mobile users.

VII Discussion and Future Work

VII-A Multiple Users Multiplexing in the NOMA System

In this subsection, we discuss multiplexing more users (more than 2) in the same channel using the proposed CIM-based RA method for the NOMA system. This paper discusses NOMA systems where up to 2 users are multiplexed into the same channel. That is: in Eq. (11.d), we set a constraint term to limit the maximum number of users that are multiplexing in the same channel to up to 2. We believe that fast channel allocation in NOMA systems with more multiplexed users can be achieved by extending the constraint term in Eq. (11.d) as an arbitrary number of users nn. Specifically, given the constraint, the energy function in Eq. (20) can be calculated. Finally, by recomputing the Ising model parameters, JijklJ_{ijkl} and λij\lambda_{ij}, RA in the NOMA system with nn users multiplexed can be realized. Note that the number of dummy users NdN_{d} to must be set as Nu+Nd=nNcN_{u}+N_{d}=nN_{c} to satisfy the above constraint. In addition, the number of channel assignment variables xijx_{ij}, xkjx_{kj} in Eq. (11) needs to be increased to correspond to the number of users multiplexed into the same channel.

VII-B Scalability of the Proposed Method

In this subsection, we discuss the scalability of the proposed CIM-based RA methods in the practical settings with 2 or more users in the NOMA systems. As previously described, the CIM is the machine that can obtain the ground state of the Ising Hamiltonian at high speed by getting the optimal combination of the spin directions. The ground state of a quantum-mechanical system is its stationary state of lowest energy, which corresponds to the optimal solution of combinatorial optimization problems. In this paper, the combinatorial optimization problem is the RA problem in the NOMA system. The RA problem in the NOMA system can be solved by transforming the formulated RA combinatorial optimization problem into the form of the Ising Hamiltonian. The spin direction of the spin in the Ising Hamiltonian corresponds to the channel assignment variable of the RA problem in the NOMA system, i.e., whether the channel is allocated to the user or not. Each channel-user pair corresponds to one spin. The value of each spin direction is +1 or -1. If the spin direction is +1, the channel is allocated to the corresponding user. Otherwise, the channel is not allocated to the corresponding user. For example, channel ii-user jj pair corresponding to the spin direction σij\sigma_{ij}. If σij\sigma_{ij}, the channel ii is allocated to user jj. Otherwise, the channel ii is not allocated to user jj. Hence, to solve the RA optimization problem in the NOMA system, NuNcN_{u}*N_{c} spins are necessary, where NuN_{u} and NcN_{c} are the numbers of the users and channels, respectively. The related work [19] shows that the CIM can realize problems up to 100000 spins. This means that up to 446 users can be realized using the present CIM in our scenario, where up to 2 users are allocated to one channel simultaneously. In general, however, clustering with an arbitrary number of nn users of multiplexing is more attractive. Here, to achieve multiplexing at an arbitrary user nn using CIM, we need to change the definition of the assignment variable as described in the last subsection. Specifically, the number of indexes should be increased by considering which user ii is multiplexed with which user on channel jj in Eq. (12). Therefore, as n increases, the number of spins required to solve the problem also increases. For example, when n=3n=3, the required number of spins is Nu3/3N_{u}^{3}/3, and RA for up to 67 users can be realized using the present CIM with 100000 spins. With the development of the CIM, much larger-scale optimization problems are expected to be solved by increasing the number of spins in the future.

VII-C Unequal Bandwidth Allocation

In this study, the RA in the NOMA system is under the assumption of equal bandwidth allocation. According to [36], the performance of NOMA systems can be improved by performing unequal bandwidth and allocating higher power and wider bandwidth to users with stronger channel conditions. Therefore, we will consider unequal bandwidth allocation in our future work. We believe that our proposed CIM-based RA method can be easily extended to the NOMA system with unequal bandwidth allocation by assigning more than one resource block to one use, specifically, by considering extending the constraint in Eq. (11.c) to an arbitrary number n. Using these constraints, the energy function Eq. (19) can be computed. Finally, by recomputing JijklJ_{ijkl} and λij\lambda_{ij}, more than one bandwidth can be allocated to one user using the CIM. Here, by adjusting the value of n, the upper limit of the bandwidth allocated to users with stronger channel conditions can be set. Thus, better system performance may be achieved with unequal bandwidth allocation compared to that with equal bandwidth allocation.

VII-D Optimal Power Allocation Using the CIM

In this study, the channel allocation is optimized by the CIM, and then the power is assigned based on the optimizing methods in [5] using conventional computers. However, in large-scale NOMA systems, the time consumption for allocating power using conventional computers may reduce the speed of the optimization, which may bring a bottleneck to CIM-based optimization. Therefore, channel and power optimization will be jointly considered using the CIM in our future work.

VII-E Diversity Schemes in NOMA System

In this study, we assumed that both of the BS and users have a single antenna, and the signal to each user is multiplexed in one channel at the same time. To improve the performance of the system, we will consider diversity schemes such as receiver diversity and antenna diversity in our future work. Specifically, the receiver diversity can improve the communication performance, such as a bit error rate, by combining the same data in the different channels to achieve a diversity gain [37]. Antenna diversity can be used to improve the outage performance of downlink NOMA systems combined with signal user multiple-input multiple-output [38].

VII-F Implementation of the CIM

Although quantum technologies as emerging technologies are expected to be deployed in 6G networks around 2030, it is still unclear when a practical quantum computer will be available. Hence, we will work with the communications and quantum computing industries in the future to try to achieve a practical implementation of the CIM in the next generation of wireless communication.

VIII Conclusion

In this study, we proposed a CIM-based RA optimization method for NOMA systems. Initially, we formulated the RA problem as a combinational optimization problem. To apply RA optimization for NOMA systems in the CIM, we derived the interaction and external magnetic field by mapping the Ising Hamiltonian to the formulated objective function. Herein, the interaction and the external magnetic field are the parameters of the Ising model, which are used to search the optimal solutions. For correspondence between the Ising Hamiltonian and the formulated objective function, we transformed our formulated objective function into a similar form with an Ising Hamiltonian by the aid of the mutually connected neural network. The parameters of the coupling weights and firing threshold corresponding to the energy function of the mutually connected neural network were then derived. Subsequently, these parameters were used to derive the Ising model interaction between the spin and external magnetic field used for optimizing the RA in the CIM. To evaluate the performance of the proposed method, we conducted simulations using the simulation model of the CIM. Moreover, we compared our proposed method with other optimization methods and pairing algorithms. The simulation results indicated that the CIM is not only faster but also superior in searching an optimal solution.

References

  • [1] Z. Shi, X. Xie, H. Lu, H. Yang, J. Cai, and Z. Ding, “Deep reinforcement learning based multidimensional resource management for energy harvesting cognitive NOMA communications,” IEEE Trans. Commun., DOI: 10.1109/TCOMM.2021.3126626, Nov. 2021.
  • [2] P. S. Bouzinis, P. D. Diamantoulakis, and K. Karagiannidis, “Wireless federated learning (WFL) for 6G networks-Part II: The compute-then-transmit NOMA paradigm,” IEEE Commun. Lett., vol. 26, no. 1, pp. 8-12, Jan. 2022.
  • [3] M. Nikooroo and Z. Becvar, “Optimal positioning of flying base stations and transmission power allocation in NOMA networks,” IEEE Trans. Wireless Commun., vol. 21, no. 2, pp. 1319-1334, Feb. 2022.
  • [4] C. He, Y. Hu, Y. Chen, and B. Zeng, “Joint power allocation and channel assignment for NOMA with deep reinforcement learning,” IEEE J. Sel. Areas Commun., vol. 37, no. 9, pp. 2200-2210, Oct. 2019.
  • [5] J. Zhu, J. Wang, Y. Huang, S. He, X. You, and L. Yang, “On optimal power allocation for downlink non-orthogonal multiple access systems,” IEEE J. Sel. Areas Commun., vol. 35, no. 12, pp. 2744-2757, Dec. 2017.
  • [6] Z. Ding, Z. Yang, P. Fan and H. V. Poor, ”On the Performance of Non-Orthogonal Multiple Access in 5G Systems with Randomly Deployed Users,” IEEE Signal Process Lett, vol. 21, no. 12, pp. 1501-1505, Dec. 2014.
  • [7] L. Zhu, J. Zhang, X. Cao, and D. O. Wu, “Optimal user pairing for downlink non-orthogonal multiple access (NOMA),” IEEE Wireless Commun. Lett., vol. 8, no. 2, pp. 328-331, Apr. 2019.
  • [8] L. Salaün, C. S. Chen, and M. Coupechoux, “Optimal joint subcarrier and power allocation in NOMA is strongly NP-hard,” in Proc. IEEE ICC., pp. 1–7. May. 2018.
  • [9] A. Abrardo M. Moretti, and F. Saggese, “Power and subcarrier allocation in 5G NOMA-FD systems,” IEEE Trans. Wirel. Commun., vol. 19, no. 12, pp. 8246 - 8260, Dec. 2020.
  • [10] E. C. Cejudo, H. Zhu, and J. Wang, “Resource allocation in downlink multicarrier NOMA under a fairness constraint,” in Proc. IEEE VTC, pp. 1-5, Dec. 2020.
  • [11] C. Kai, Y. Wu, and W. Huang, “Joint uplink and downlink resource allocation for NOMA-enabled D2D communications,” IEEE Wireless Commun. Lett., vol. 10, no. 6, pp. 1247-1251, Jun. 2021.
  • [12] O. Abuajwa, M. B. Roslee, and Z. B. Yusoff, ”Simulated annealing for resource allocation in downlink NOMA systems in 5G networks,” Appl. Sci., vol. 11, no. 10, May. 2021.
  • [13] Y. Cao, G. Zhang, G. Li, and J. Zhang, “A deep Q-network based-resource allocation scheme for massive MIMO-NOMA,” IEEE Commun. Lett., vol. 25, no. 5, pp. 1544-1548, May. 2021.
  • [14] J. Zheng, X. Tang, X. Wei, and L. Zhao, “Channel assignment for hybrid NOMA systems with deep reinforcement learning,” IEEE Wireless Commun. Lett., vol. 10, no. 7, pp. 1370-1374, Jul. 2021.
  • [15] 2021. D-Wave Systems — The Practical Quantum Computing Company. [online] Available at: https://www.dwavesys.com [Accessed 23 Dec. 2021].
  • [16] T. Inagaki, et al., “A coherent Ising machine for 2000-node optimization problems,” Science, vol. 354, no. 6312, pp. 603-606, Nov. 2016.
  • [17] P. L. McMahon, et al., “A fully programmable 100-spin coherent Ising machine with all-to-all connections,” Science, vol. 354, no. 6312, pp. 614-617, Nov. 2016.
  • [18] R. Hamerly, et al., “Experimental investigation of performance differences between coherent Ising machines and a quantum annealer,” Science Advances. vol. 5, no. 5, pp. 1-10, May. 2019.
  • [19] T. Honjo, et al., “100,000-spin coherent Ising machine” Science Advances, vol. 7, no. 40, pp. 1-10, Sep. 2021.
  • [20] W. Jiang, B. Han, M. A. Habibi, and H. D. Schotten, “The road towards 6G: A comprehensive survey” IEEE Open J. Commun. Soc., vol. 2, pp. 334-366, Feb. 2021.
  • [21] A. Lucas, “Ising formulations of many NP problems,” Front. Phys., vol. 2, pp. 5, Feb. 2014.
  • [22] M. Hasegawa, H. Ito, H. Takesue, and K. Aihara, ”Optimization by neural networks in the coherent Ising machine and its application to wireless communication systems,” IEICE Trans. Commun., vol. E104.B, no. 3 pp. 210-216, 2021.
  • [23] H. Ito, Y. Jiang, H. Yasuda, H. Takesue, K. Aihara, and M. Hasegawa, ”High-speed optimization method for resource allocation in wireless communication systems by coherent Ising machine,” in Proc. IEEE ICAIIC, pp. 093-097, Feb. 2020.
  • [24] S. Kobayashi, Y. Murata, K. Aihara, and M. Hasegawa, “Fast optimization of scheduling based on quantum neural network in distributed antenna system,” IEICE Society Conference, vol. N-2-1, Aug. 2019.
  • [25] K. Kurasawa, K. Hashimoto, A. Li, K. Sato, K. Inaba, H. Takesue, K. Aihara, and M. Hasegawa, “A high-speed channel assignment algorithm for dense IEEE 802.11 systems via coherent Ising machine,” IEEE Wireless Commun. Lett., vol. 10, no. 8, pp. 1682-1686, Aug. 2021.
  • [26] Y. Haribara, et al., “A coherent Ising machine for maximum cut problems: Performance evaluation against semidefinite programming relaxation and simulated annealing,” Princ. Meth. Quantum Inf. Technol., vol. 911, pp. 251-262, Feb. 2016.
  • [27] S. Utsunomiya, K. Takata, and Y. Yamamoto, “Mapping of Ising models onto injection-locked laser systems,” Optics Express, vol. 19, no. 19, pp. 18091–18108, 2011.
  • [28] I. F. Akyildiz, A. Kak, S. Nie, and L. Zhao, “6G and beyond: The future of wireless communications systems,” IEEE Access., vol. 8, pp. 133995-134030, Jul. 2020.
  • [29] H. Takesue, T. Inagaki, K. Inaba, and T. Ikuta, T. Honjo, “Large-scale Coherent Ising Machine,” J. Phys. Soc. Jpn., vol. 88, pp. 061014, Mar. 2019.
  • [30] J. J. Hopfield and D. W. Tank, “Neural computation of decisions in optimization problems,” Biol. Cybern., vol. 52, pp. 141-152, Jul. 1985.
  • [31] Z. Wang, A. Marandi, K. Wen, R. L. Byer, Y. Yamamoto, “Coherent Ising machine based on degenerate optical parametric oscillators,” Phys. Rev. A., vol. 88, pp. 063853, Dec. 2013.
  • [32] E. H. L. Aarts and J. H. M. Korst, ”Boltzmann machines for travelling salesman problems,” Eur. J. Oper. Res., vol. 39, pp. 79-95, Mar. 1989
  • [33] B. Hajek, “Cooling schedules for optimal annealing,” Math. Oper. Res., vol. 13, pp. 311-329, May. 1988.
  • [34] H. Ye, G. Y. Li and B. -H. F. Juang, “Deep Reinforcement Learning Based Resource Allocation for V2V Communications,” IEEE Trans. Veh. Technol., vol. 68, no. 4, pp. 3163-3173, Apr. 2019.
  • [35] 3rd Generation Partnership Project: Technical Specification Group Radio Access Network: Study LTE-Based V2X Services: (Release 14), Standard 3GPP TR, Jun. 2016.
  • [36] J. Wang, H. Xu, L. Fan, B. Zhu, and A. Zhou, ”Energy-efficient joint power and bandwidth allocation for NOMA systems,” IEEE Commun. Lett., vol. 22, pp. 780-783, April. 2018.
  • [37] J. Kim, W. Lee, and H. Song, “Performance enhancement using receive diversity with power adaptation in the NOMA system,” IEEE ACCESS, vol. 7, pp. 102867-102875, 2019.
  • [38] M. Gong and Z. Yang, “The application of antenna diversity to NOMA with statistical channel state information,” IEEE Trans. Veh. Technol., vol. 68, no. 4, pp. 3755-3765, Apr. 2019.
  • [39] H. Nakahara, “Convolutional neural network implementation on an FPGA,” The journal of the Institute of Electronics, Information and Communication Engineers, vol. 103, no. 5, pp. 501-506, May 2020.
  • [40] D. Ito, T. Ueta and K. Aihara, ”Bifurcation analysis of eight coupled degenerate optical parametric oscillators,” Physica D Nonlinear Phenomena., vol. 372, pp. 22-30, June. 2018.