Secure Precoding in MIMO-NOMA:
A Deep Learning Approach

Jordan Pauls and Mojtaba Vaezi

The authors are with the Department of Electrical and Computer Engineering, Villanova University, Villanova, PA 19085 USA (e-mail: [email protected]; [email protected]).

Abstract

A novel signaling design for secure transmission over the two-user multiple-input multiple-output (MIMO) non-orthogonal multiple access (NOMA) channel using deep neural networks (DNNs) is proposed. The goal of the DNN is to form the covariance matrix of users’ signals such that the message of each user is transmitted reliably while being confidential from its counterpart. The proposed DNN linearly precodes each user’s signal before superimposing them and achieves near-optimal performance with significantly lower run time. Simulation results show that the proposed models reach about 98% of the secrecy capacity rates. The spectral efficiency of the DNN precoder is much higher than that of existing analytical linear precoders (e.g., generalized singular value decomposition), and its on-the-fly complexity is several times lower than that of existing iterative methods.

Index Terms:

Deep learning, DNN, MIMO, NOMA, physical layer security, wiretap, precoding, covariance, GSVD.

I Introduction

Non-orthogonal multiple access (NOMA) is a promising candidate for connecting the rapidly growing number of devices to fifth-generation and beyond wireless networks [1]. NOMA is the optimal transmission strategy for both the single-input, single-output (SISO) and multiple-input, multiple-output (MIMO) cases in a single-cell network. In the SISO case, superposition coding at the transmitter with successive interference cancellation at the receiver is optimal. In contrast, in the MIMO case, dirty-paper coding (DPC) is the optimal solution. Nonetheless, in both cases, the base station (BS) broadcasts a superimposed signal of multiple users. This makes secure communication challenging in the presence of adversarial users, since such users can eavesdrop on the signals.

Physical layer security enables the exchange of confidential messages over a wireless medium in the presence of internal or unauthorized eavesdroppers [2]. Specifically, in two-user MIMO-NOMA networks, the messages of both users can be transmitted concurrently and confidentially via secret dirty-paper coding (S-DPC) [3]. While S-DPC is the most spectrally efficient precoding for the two-user MIMO-NOMA, it is excessively complex for practical use. Alternatively, the S-DPC region can be achieved by linear precoding and power allocation schemes. In the past years, various linear precoding schemes have been introduced [4, 5, 6, 7]. The generalized singular value decomposition (GSVD)-based precoder [4] is a fast analytical precoder, but it falls short of achieving the capacity region when the users have a single antenna. Also, the weighted sum-rate maximization [5] and power-splitting [6] approaches are still too time-consuming to be used in practice. More precisely, these approaches require far more time than the coherence time of wireless channels, which is about a few milliseconds [8].

This letter exploits deep learning (DL) to design the covariance matrices of the channel input vectors–or equivalently, to design precoding and power allocation matrices–for secure MIMO-NOMA transmission. Embedding DL into the mobile and wireless networks is well justified in various cases, e.g., when closed-form solutions require poor approximation or the complexity of existing techniques is high [9, 10, 11].

We leverage supervised deep neural networks (DNNs) for secure communication over the two-user MIMO-NOMA channel (see Fig. 1), resulting in a significantly faster solution while almost reaching the spectral efficiency of the S-DPC. To fulfill the task, we first decompose the two-user MIMO-NOMA with confidential messages into two wiretap channels [6] and use a wiretap channel solution for generating and labeling the training data set. We then build and train DNN models that learn to approximate the function mapping channel matrices, base station power, and required secrecy rate of the users to the covariance matrix of the channel input vector for each user. Simulation results demonstrate the efficacy of the developed model: the proposed DL-based precoding scheme has near-optimal performance and outperforms GSVD-based precoding by a large margin. It also brings the mapping time below the coherence time of the wireless channel. Therefore, the channel input signal can be designed before the channel coefficients become stale, which is crucial in practice.

Figure 1: Two-user MIMO-NOMA with confidential messages. $M_{1}$ and $M_{2}$ are the messages for user $1$ and user $2$, respectively. Each user is intended to decode its own message but not the other one.

II System Model and Existing Solutions

Consider a single-cell two-user MIMO-NOMA network, as shown in Fig. 1. Assume that the transmitter, user $1$, and user $2$ are equipped with $n_{t}$, $n_{1}$, and $n_{2}$ antennas, respectively. The transmitter wishes to send two messages $M_{1}$ and $M_{2}$ to user $1$ and user $2$, respectively. When PHY security is a concern, each message must be kept confidential from the other user [12]. That is, user $1$ should not be able to decode $M_{2}$ and vice versa. (In information theory, this channel is known as the MIMO broadcast channel (BC) with two confidential messages [12].) As such, the transmitter securely encodes $M_{1}$ and $M_{2}$ to codewords $\mathbf{x}_{1}\in\mathbb{R}^{n_{t}\times 1}$ and $\mathbf{x}_{2}\in\mathbb{R}^{n_{t}\times 1}$, superimposes them as $\mathbf{x}=\mathbf{x}_{1}+\mathbf{x}_{2}$, and broadcasts $\mathbf{x}$ [12, 13]. Let $\mathbf{H}_{1}\in\mathbb{R}^{n_{1}\times n_{t}}$ and $\mathbf{H}_{2}\in\mathbb{R}^{n_{2}\times n_{t}}$ be the channel matrices corresponding to user $1$ and user $2$. Then, the received signals at user $1$ and user $2$, respectively, can be represented as


$\displaystyle\mathbf{y}_{1}$	$\displaystyle=\mathbf{H}_{1}(\mathbf{x}_{1}+\mathbf{x}_{2})+\mathbf{z}_{1},$	(1a)
$\displaystyle\mathbf{y}_{2}$	$\displaystyle=\mathbf{H}_{2}(\mathbf{x}_{1}+\mathbf{x}_{2})+\mathbf{z}_{2},$	(1b)

in which $\mathbf{z}_{1}\in\mathbb{R}^{n_{1}\times 1}$ and $\mathbf{z}_{2}\in\mathbb{R}^{n_{2}\times 1}$ are two independent identically distributed (i.i.d) Gaussian noise vectors with mean zero and identity covariance matrices.

II-A Secrecy Capacity Region

The secrecy capacity region of this channel under a matrix constraint on the covariance matrix of the input $\mathbf{x}$ is proved in [12, Theorem 1]. However, in practical MIMO systems, a total power constraint $P$ at the transmitter is more common. Under this assumption, using Corollary 1 in [3], the secrecy capacity region of this channel can be represented as


$\displaystyle R_{1}$	$\displaystyle\leq\frac{1}{2}\log\|\mathbf{I}_{1}+{\mathbf{H}_{1}\mathbf{Q}_{1}\mathbf{H}_{1}^{T}}\|-\frac{1}{2}\log\|\mathbf{I}_{2}+\mathbf{H}_{2}\mathbf{Q}_{1}\mathbf{H}_{2}^{T}\|,$	(2a)
$\displaystyle R_{2}$	$\displaystyle\leq\frac{1}{2}\log\bigg{\|}\mathbf{I}_{2}+\frac{\mathbf{H}_{2}\mathbf{Q}_{2}\mathbf{H}_{2}^{T}}{\mathbf{I}_{2}+\mathbf{H}_{2}\mathbf{Q}_{1}\mathbf{H}_{2}^{T}}\bigg{\|}$
	$\displaystyle\qquad\qquad-\frac{1}{2}\log\bigg{\|}\mathbf{I}_{1}+\frac{\mathbf{H}_{1}\mathbf{Q}_{2}\mathbf{H}_{1}^{T}}{\mathbf{I}_{1}+\mathbf{H}_{1}\mathbf{Q}_{1}\mathbf{H}_{1}^{T}}\bigg{\|},$	(2b)
	$\displaystyle\textmd{s.t.}\quad{\rm tr}(\mathbf{Q}_{1})+{\rm tr}(\mathbf{Q}_{2})\leq P,\;\mathbf{Q}_{1}\succcurlyeq\mathbf{0},\;\mathbf{Q}_{2}\succcurlyeq\mathbf{0},$	(2c)

in which $R_{i}$, $i\in\{1,2\}$, is the secure achievable rate at user $i$, $\mathbf{I}_{i}$ is an identity matrix of size $n_{i}$, and $\mathbf{Q}_{i}$ is the covariance matrix of $\mathbf{x}_{i}$. By definition, $\mathbf{Q}_{i}=\mathbb{E}(\mathbf{x}_{i}\mathbf{x}_{i}^{T})$ where $\mathbb{E}(\cdot)$ denotes expectation, and thus, $\mathbf{Q}_{i}$ is positive semidefinite, i.e., $\mathbf{Q}_{i}\succcurlyeq\mathbf{0}$. Further, since the total transmit power cannot be higher than $P$, and taking $\mathbf{x}_{1}$ and $\mathbf{x}_{2}$ to be independent with zero mean, we have $P\geq\mathbb{E}(\mathbf{x}^{T}\mathbf{x})={\rm tr}(\mathbf{Q}_{1})+{\rm tr}(\mathbf{Q}_{2}).$
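The bounds in (2) can be evaluated numerically for any candidate pair of covariance matrices. The following is a minimal sketch, not the authors' code. It assumes base-2 logarithms (the source does not state the base) and reads the matrix "fraction" in (2b) through the determinant ratio $|\mathbf{A}+\mathbf{B}|/|\mathbf{A}|$ with $\mathbf{A}=\mathbf{I}+\mathbf{H}\mathbf{Q}_{1}\mathbf{H}^{T}$ and $\mathbf{B}=\mathbf{H}\mathbf{Q}_{2}\mathbf{H}^{T}$. The function names `log2det`, `secrecy_rates`, and `is_feasible` are hypothetical.

```python
import numpy as np

def log2det(A):
    """log2 of the determinant of a positive definite matrix A."""
    sign, logabsdet = np.linalg.slogdet(A)
    return logabsdet / np.log(2.0)

def secrecy_rates(H1, H2, Q1, Q2):
    """Right-hand sides of (2a) and (2b); the log base 2 is an assumption."""
    I1, I2 = np.eye(H1.shape[0]), np.eye(H2.shape[0])
    Qs = Q1 + Q2
    # (2a): user 1 is the intended receiver of M1, user 2 the eavesdropper.
    R1 = 0.5 * (log2det(I1 + H1 @ Q1 @ H1.T) - log2det(I2 + H2 @ Q1 @ H2.T))
    # (2b): each "fraction" term equals log|I + H(Q1+Q2)H^T| - log|I + H Q1 H^T|.
    own = log2det(I2 + H2 @ Qs @ H2.T) - log2det(I2 + H2 @ Q1 @ H2.T)
    leak = log2det(I1 + H1 @ Qs @ H1.T) - log2det(I1 + H1 @ Q1 @ H1.T)
    R2 = 0.5 * (own - leak)
    return R1, R2

def is_feasible(Q1, Q2, P, tol=1e-9):
    """Check (2c): both matrices positive semidefinite, total trace at most P."""
    psd = all(np.linalg.eigvalsh(Q).min() >= -tol for Q in (Q1, Q2))
    return bool(psd and np.trace(Q1) + np.trace(Q2) <= P + tol)
```

The raw expressions can be negative for a poorly chosen covariance matrix; in that case the corresponding secure rate is simply zero.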

II-B Existing Solutions

Although the secrecy capacity region of the two-user MIMO-NOMA channel is given in (2), it is still unknown how to analytically form $\mathbf{Q}_{1}$ and $\mathbf{Q}_{2}$ to achieve the capacity region. This is because the right-hand side expressions in both (2a) and (2b) are non-convex, and thus, the corresponding optimization problems are challenging. Early works like [12] use an exhaustive search over all possible $\mathbf{Q}_{1}$ and $\mathbf{Q}_{2}$ satisfying the constraints in (2c). Such an approach is, however, prohibitively complex for practical systems. Later, Fakoorian et al. [4] proposed a GSVD-based precoder for this problem. The rate region of this method is, however, far from the capacity region when $n_{1}$ and/or $n_{2}$ are small and, in particular, when the users have a single antenna. Notably, such cases are very prevalent and thus important in practice.

More recently, another approach was proposed to solve this problem [6], whose achievable rate region is very close to the optimal solution for any number of antennas at each node. The main observation is that the two-user MIMO-NOMA channel can be decomposed into two MIMO wiretap channels by splitting power between the two users. Then, the associated optimization problems are solved one at a time. Although the rate region obtained from this solution is very close to the secrecy capacity region of the channel, the solution incurs an unacceptable amount of delay, which prevents it from being used in practical systems. To be specific, finding optimal $\mathbf{Q}_{1}$ and $\mathbf{Q}_{2}$ could take several hundred milliseconds [6], whereas the coherence time of the wireless channel can be as small as a few milliseconds [8]. That is, the solution assumes that the channel is constant for several hundred milliseconds while it changes much faster in practice.

In this letter, we propose a DL-based signaling design to approach the secure capacity of the MIMO-NOMA channel within a practically acceptable delay. This is obtained at the expense of a slightly smaller achievable rate region. In the following, we describe the structure of the DNN, the training process, and the test results.

III Deep Learning-based Solution

In this section, we build a supervised deep learning model to determine suitable covariance matrices for (2). Put differently, we describe signaling design (precoding and power allocation) for secure transmission over the MIMO-NOMA channel. Specifically, we describe the data generation, the training methodology, the DNN structure, and the hyper-parameters.

III-A Data Generation and Labeling

In supervised learning, labeled data is used for drawing inferences, i.e., for classification, regression, or approximation. In this paper, labeled training data is used to build predictive models to learn the functions mapping inputs to outputs, i.e., to approximate optimal signaling in the secure MIMO-NOMA by regression. Specifically, for each set of $\mathbf{H}_{1}$ and $\mathbf{H}_{2}$ we find $\mathbf{Q}_{1}$ and $\mathbf{Q}_{2}$ and use them for training a DNN and determining a model.

We decompose the secure MIMO-NOMA channel into two MIMO wiretap channels [6]. For $\alpha\in[0,1]$ , we allot $\alpha P$ and $(1-\alpha)P$ to user $1$ and user $2$ , respectively. Then, we find the covariance matrix $\mathbf{Q}_{1}$ from (2a), i.e.,


$\displaystyle\mathbf{Q}_{1}^{*}$	$\displaystyle=\arg\max\limits_{\mathbf{Q}_{1}}\frac{1}{2}\log\frac{\|\mathbf{I}_{1}+{\mathbf{H}_{1}\mathbf{Q}_{1}\mathbf{H}_{1}^{T}}\|}{\|\mathbf{I}_{2}+\mathbf{H}_{2}\mathbf{Q}_{1}\mathbf{H}_{2}^{T}\|},$	(3a)
	$\displaystyle{\rm s.t.}\quad\mathbf{Q}_{1}\succeq\mathbf{0},\;{\rm tr}(\mathbf{Q}_{1})\leq P_{1}=\alpha P.$	(3b)

Now, this problem can be seen as a wiretap channel in which user $1$ is the legitimate user and user $2$ is an eavesdropper. Thus, we can apply any wiretap channel solution to solve it. The alternating optimization and water filling (AOWF) algorithm [14] and the rotation-based method [15] are two such solutions. Once $\mathbf{Q}_{1}^{*}$ is obtained, we plug it into (2b) and manipulate it to get


$\displaystyle\mathbf{Q}_{2}^{*}$	$\displaystyle=\arg\max\limits_{\mathbf{Q}_{2}}\frac{1}{2}\log\frac{\|\mathbf{I}_{2}+\mathbf{H}^{\prime}_{2}\mathbf{Q}_{2}\mathbf{H}^{\prime T}_{2}\|}{\|\mathbf{I}_{1}+\mathbf{H}^{\prime}_{1}\mathbf{Q}_{2}\mathbf{H}^{\prime T}_{1}\|},$	(4a)
	$\displaystyle{\rm s.t.}\quad\mathbf{Q}_{2}\succeq\mathbf{0},\;{\rm tr}(\mathbf{Q}_{2})\leq P_{2}=(1-\alpha)P,$	(4b)

in which $\mathbf{H}^{\prime}_{i}\triangleq\mathbf{\Lambda}^{-\frac{1}{2}}_{i}\mathbf{V}^{T}_{i}\mathbf{H}_{i}$ , $i\in\{1,2\}$ , where $\mathbf{\Lambda}_{i}$ and $\mathbf{V}_{i}$ are obtained from eigenvalue decomposition of $\mathbf{I}+\mathbf{H}_{i}\mathbf{Q}^{*}_{1}\mathbf{H}_{i}^{T}$ , i.e., $\mathbf{I}+\mathbf{H}_{i}\mathbf{Q}^{*}_{1}\mathbf{H}_{i}^{T}=\mathbf{V}_{i}\mathbf{\Lambda}_{i}\mathbf{V}_{i}^{T}$ . Then again, (4) is the rate for a MIMO wiretap channel where $\mathbf{H}^{\prime}_{2}$ and $\mathbf{H}^{\prime}_{1}$ are the channels corresponding to the legitimate user and eavesdropper, respectively. Thus, we solve it using a wiretap solution. Although this approach is suboptimal, the resulting rate region is close to the optimal solution–obtained by a brute-force search.
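The two-step labeling procedure in (3) and (4) can be sketched as follows. This is an illustration under stated assumptions rather than the authors' implementation: `solve_wiretap` is a hypothetical callable standing in for a wiretap solver such as AOWF [14] or the rotation-based method [15], and `effective_channel` and `label_sample` are names introduced here.

```python
import numpy as np

def effective_channel(H, Q1_star):
    """H' = Lambda^{-1/2} V^T H, where I + H Q1* H^T = V Lambda V^T."""
    A = np.eye(H.shape[0]) + H @ Q1_star @ H.T
    eigvals, V = np.linalg.eigh(A)  # A is symmetric positive definite
    return np.diag(eigvals ** -0.5) @ V.T @ H

def label_sample(H1, H2, P, alpha, solve_wiretap):
    """Return (Q1*, Q2*) for one channel pair via the power split.

    solve_wiretap(H_legit, H_eve, power) is a hypothetical callable that
    returns a covariance matrix with trace at most `power`.
    """
    Q1 = solve_wiretap(H1, H2, alpha * P)            # problem (3)
    H1p = effective_channel(H1, Q1)                  # eavesdropper in (4)
    H2p = effective_channel(H2, Q1)                  # legitimate user in (4)
    Q2 = solve_wiretap(H2p, H1p, (1.0 - alpha) * P)  # problem (4)
    return Q1, Q2
```

The whitening step works because $|\mathbf{I}+\mathbf{H}^{\prime}_{i}\mathbf{Q}_{2}\mathbf{H}^{\prime T}_{i}|=|\mathbf{I}+\mathbf{H}_{i}(\mathbf{Q}^{*}_{1}+\mathbf{Q}_{2})\mathbf{H}_{i}^{T}|/|\mathbf{I}+\mathbf{H}_{i}\mathbf{Q}^{*}_{1}\mathbf{H}_{i}^{T}|$, which is the determinant ratio that (2b) requires.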

We next describe the structure of the DNN used for finding suitable covariance matrices for the MIMO-NOMA networks.

III-B Network Structure

We use a multi-layer perceptron (MLP) DNN in this paper. As feed-forward neural networks, MLPs are less complex, easy to design, and quick to run. The structure of the network is shown in Fig. 2. As we will see in Section III-C, the input is a feature mapping of $\mathbf{H}_{1}$ and $\mathbf{H}_{2}$, the channel matrices of user 1 and user 2. The rectified linear unit (ReLU) [16] serves as the activation function. ReLUs are sparse and have a reduced likelihood of vanishing gradients, which reduces training and inference time for neural networks. The network has nine fully connected (FC) layers, each with a width of 256 nodes. It then funnels through a 128-node layer and a 64-node layer before reaching the output layer. The output layer consists of the upper triangular elements of the covariance matrices ($\mathbf{Q}_{1}$ and $\mathbf{Q}_{2}$) that the network learns to predict. Since $\mathbf{Q}_{1}$ and $\mathbf{Q}_{2}$ are symmetric, the upper triangular elements determine all elements. Each $n_{t}\times n_{t}$ matrix contributes $n_{t}(n_{t}+1)/2$ such elements, so the size of the output layer depends only on $n_{t}$ and is equal to $n_{t}(n_{t}+1)$.
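The output encoding described above can be illustrated with a short sketch. It assumes that the elements of $\mathbf{Q}_{1}$ precede those of $\mathbf{Q}_{2}$ in the output vector and that elements are taken in row-major upper-triangular order; the source specifies neither, and the helper names are hypothetical. `layer_widths` simply lists the layer sizes stated in the text, with the input size passed in as a parameter.

```python
import numpy as np

def layer_widths(n_inputs, nt):
    """Layer sizes as described: input, 9 x 256, then 128, 64, and the output."""
    return [n_inputs] + [256] * 9 + [128, 64] + [nt * (nt + 1)]

def pack_upper(Q1, Q2):
    """Stack the upper triangular elements of Q1 and Q2 (assumed ordering)."""
    iu = np.triu_indices(Q1.shape[0])
    return np.concatenate([Q1[iu], Q2[iu]])

def unpack_upper(y, nt):
    """Rebuild the two symmetric matrices from a length nt*(nt+1) vector."""
    iu = np.triu_indices(nt)
    half = nt * (nt + 1) // 2
    mats = []
    for part in (y[:half], y[half:]):
        U = np.zeros((nt, nt))
        U[iu] = part
        # Mirror the strict upper triangle; keep the diagonal counted once.
        mats.append(U + U.T - np.diag(np.diag(U)))
    return mats[0], mats[1]
```

A network output is not guaranteed to satisfy the constraints in (2c), so in practice the rebuilt matrices may need to be checked or projected before use; the source does not describe such a step.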

We have investigated the effect of various hyper-parameters, including the learning rate, drop factor, optimizer (Adam), validation frequency, mini-batch size, and validation patience, in order to train the network most effectively. The final, tuned hyper-parameters are shown in Table I.

TABLE I: Hyper-parameters.

Hyper-parameter	Value	Hyper-parameter	Value
Initial learning rate	0.001	Mini batch size	256
Learn rate drop factor	0.5	Learn rate drop period	5
Training set size	$5\times 10^{5}$	Validation set size	$10^{5}$
Validation frequency	$10^{3}$ iters	Validation patience	5
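To make the learning-rate entries of Table I concrete, the sketch below assumes a piecewise schedule in which the rate is multiplied by the drop factor once every drop period, with the period counted in epochs. The source lists only the values, so both the schedule type and the unit are assumptions, and `learning_rate` is a hypothetical helper.

```python
def learning_rate(epoch, initial=1e-3, drop_factor=0.5, drop_period=5):
    """Piecewise-constant schedule (assumed): multiply by drop_factor every drop_period epochs."""
    return initial * drop_factor ** (epoch // drop_period)
```

Under these assumptions the rate stays at 0.001 for epochs 0 to 4, halves to 0.0005 for epochs 5 to 9, and so on.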

III-C Pre-processing

III-C1 Input design

A big advantage of DL algorithms is that they reduce or eliminate the need for feature engineering, as they try to learn high-level features from data. Hence, for this problem, the input could simply be the channel matrices $\mathbf{H}_{1}$ and $\mathbf{H}_{2}$. However, we have observed that some nonlinear combinations of the channels improve the network performance. Specifically, observing that $\left|\mathbf{I}_{i}+\mathbf{H}_{i}\mathbf{Q}_{i}\mathbf{H}_{i}^{T}\right|=\left|\mathbf{I}_{i}+\mathbf{H}_{i}^{T}\mathbf{H}_{i}\mathbf{Q}_{i}\right|$, we can rewrite (3) and (4) as functions of $\mathbf{H}_{i}^{T}\mathbf{H}_{i}$. Then, the input is designed based on $\mathbf{H}_{i}^{T}\mathbf{H}_{i}$, not $\mathbf{H}_{i}$ [10]. This makes our design independent of the number of antennas at the users, since the size of $\mathbf{H}_{i}^{T}\mathbf{H}_{i}$ is $n_{t}\times n_{t}$, which does not depend on $n_{1}$ and $n_{2}$, unlike the size of $\mathbf{H}_{i}$. This also simplifies the inputs.
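The determinant identity used above can be checked numerically. The sketch below is only an illustration with hypothetical helper names; it computes the same log-determinant once from the channel $\mathbf{H}$ and once from the Gram matrix $\mathbf{G}=\mathbf{H}^{T}\mathbf{H}$, whose size is $n_{t}\times n_{t}$ regardless of the number of receive antennas. Note that the identity matrices on the two sides have different sizes.

```python
import numpy as np

def logdet_from_channel(H, Q):
    """log|I + H Q H^T|, with an identity of size n_i (rows of H)."""
    return np.linalg.slogdet(np.eye(H.shape[0]) + H @ Q @ H.T)[1]

def logdet_from_gram(G, Q):
    """log|I + G Q| with G = H^T H, using an identity of size n_t."""
    return np.linalg.slogdet(np.eye(G.shape[0]) + G @ Q)[1]
```

For example, with $n_{t}=3$ the Gram matrix is $3\times 3$ whether the user has one antenna or four, which is why the same network input size serves any $n_{1}$ and $n_{2}$.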

III-C2 Scaling

Before feeding the data to the network, we scale it to avoid over-fitting and improve the performance [17]. Scaling makes back-propagation more efficient [18] and allows the network to learn the optimal parameters for each input node more quickly. Normalization and standardization are the two common ways of scaling the inputs. We normalize the input variables. To summarize, the input vector $\mathbf{v}$ is designed as

	$\displaystyle\mathbf{v}=[0.05\mathbf{v}_{1},0.002\mathbf{v}_{2}]^{T},$		(5)

in which $\mathbf{v}_{1}$ and $\mathbf{v}_{2}$ are given by


	$\displaystyle\mathbf{v}_{1}={\rm vec}([\mathbf{H}_{1}^{T}\mathbf{H}_{1}\;\;\mathbf{H}_{2}^{T}\mathbf{H}_{2}]),$		(6a)
	$\displaystyle\mathbf{v}_{2}={\rm vec}([\mathbf{H}_{1}^{T}\mathbf{H}_{1}\;\;\mathbf{H}_{2}^{T}\mathbf{H}_{2}]^{T}[\mathbf{H}_{1}^{T}\mathbf{H}_{1}\;\;\mathbf{H}_{2}^{T}\mathbf{H}_{2}]),$		(6b)

where ${\rm vec}(\mathbf{A})$ converts matrix $\mathbf{A}$ to a vector. The coefficients of $\mathbf{v}_{1}$ and $\mathbf{v}_{2}$ in (5) are chosen based on the histogram of all elements of $\mathbf{v}_{1}$ and $\mathbf{v}_{2}$ for $10^{3}$ input channels. After this normalization, with a high probability, the elements of $\mathbf{v}$ will be in the range $[-1,1]$.
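The feature construction in (5) and (6) can be sketched as below. The column-stacking order used for ${\rm vec}(\cdot)$ is an assumption (the source only says that it converts a matrix to a vector), and `build_input` is a hypothetical name. Since $[\mathbf{H}_{1}^{T}\mathbf{H}_{1}\;\;\mathbf{H}_{2}^{T}\mathbf{H}_{2}]$ is $n_{t}\times 2n_{t}$, $\mathbf{v}_{1}$ has $2n_{t}^{2}$ elements and $\mathbf{v}_{2}$ has $4n_{t}^{2}$, giving an input of length $6n_{t}^{2}$.

```python
import numpy as np

def build_input(H1, H2, c1=0.05, c2=0.002):
    """Form the scaled input vector v of (5) from the two channel matrices."""
    G = np.hstack([H1.T @ H1, H2.T @ H2])   # n_t x 2n_t block matrix in (6a)
    v1 = G.flatten(order="F")                # vec(.) as column stacking (assumed)
    v2 = (G.T @ G).flatten(order="F")        # 2n_t x 2n_t product in (6b)
    return np.concatenate([c1 * v1, c2 * v2])
```

With $n_{t}=2$, for instance, the network input has 24 entries regardless of $n_{1}$ and $n_{2}$.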

IV Numerical Results

We next evaluate the performance of the proposed DNN-based covariance matrix design (precoding and power allocation) in different antenna settings. A large dataset was used to have the network generalize for many different channels, and cross-validation data was used to prevent over-fitting. The test dataset was based on 1,000 channel matrices $\mathbf{H}_{1}$ and $\mathbf{H}_{2}$ whose elements were generated randomly based on $\mathcal{N}(0,1)$ .

The performance of the proposed solution can be evaluated in different ways. We may find the mean square error (MSE) between the elements of $\mathbf{Q}_{1}$ and $\mathbf{Q}_{2}$ provided by the DNN and those obtained from a traditional capacity-approaching method. The lower the MSE, the better the regression. Alternatively, we can substitute $\mathbf{Q}_{1}$ and $\mathbf{Q}_{2}$ into (2) and compare the secure rate region achieved by the DNN with that of analytical methods like GSVD and with the maximum achievable rates (capacity) for this channel. We also compare the performance of the DL and traditional iterative methods in terms of computation time.

Figures 3 and 4 illustrate the accuracy of the proposed DNN model compared to GSVD and the capacity region. The rate region plots were made for $n_{t}=2$ and $n_{t}=3$. The users have a single antenna, and the BS power is 10 W. Our DNN is able to find covariance matrices for any given $\alpha\in[0,1]$. To demonstrate this, the rate region plots show the different rates simultaneously achieved by user 1 and user 2 as the power splitting factor $\alpha$ is changed. In order to be accurate at different power splitting factors, eleven networks were trained, each at a different $\alpha$, starting from $\alpha=0$ with a step size of 0.1. This produces a piecewise linear secure rate region curve and indicates that the approach generalizes across $\alpha$. It is important to note that different values of $\alpha$ correspond to different services and result in very different covariance matrices. For example, $\alpha=0$ implies $\mathbf{Q}_{1}=\mathbf{0}$, which gives $R_{1}=0$, i.e., security is important only for user 2, whereas $\alpha=1$ has the reverse implication. Accordingly, we choose the value of $\alpha$ and the associated DNN based on the users’ quality of service. Averaging over all values of $\alpha$, the DNN achieves 98.9% and 97.7% of the capacity rates for $n_{t}=2$ and $n_{t}=3$, respectively. We have used the same regularization parameters on all networks. Finer tuning would increase this accuracy.

As can be seen in Figs. 3 and 4, the DNN substantially outperforms analytical methods like GSVD in terms of rate region. Further, the proposed method outperforms existing iterative solutions [5, 6] in terms of computation time. Table II shows the signaling design time of the DNN and of the iterative baselines. Based on Table II, the DNN is about 1.8 to 6.1 times faster than the rotation algorithm and AOWF, depending on the baseline and $n_{t}$. All algorithms are tested on the same machine in MATLAB. The time difference becomes larger as $n_{t}$ increases. In short, the DNN is able to get very close to the capacity region of the MIMO-NOMA channel and achieves this much faster, and thus, it is a viable solution.

TABLE II: Solution time in milliseconds

$n_{t}$	Rotation	AOWF	DNN
2	6.6	22.7	3.7
3	26.1	31.4	6.3

V Conclusion

A novel deep learning-assisted covariance matrix design for the two-user MIMO-NOMA with confidential messages has been developed, trained, and tested in this paper. The proposed DNN, which is used to approximate the capacity region of this channel, achieves nearly perfect accuracy for the maximum secure rates of this channel. Remarkably, using the DNN for signaling design significantly reduces the solution time versus existing iterative solutions and brings this time low enough that it can be used in practice. It also significantly outperforms GSVD precoding in achievable secure rates.

References

[1] M. Vaezi, Z. Ding, and H. V. Poor, Multiple Access Techniques for 5G Wireless Networks and Beyond. Cham, Switzerland: Springer, 2019.
[2] A. Mukherjee, S. A. A. Fakoorian, J. Huang, and A. L. Swindlehurst, “Principles of physical layer security in multiuser wireless networks: A survey,” IEEE Commun. Surveys Tuts., vol. 16, no. 3, pp. 1550–1573, 2014.
[3] E. Ekrem and S. Ulukus, “Capacity region of Gaussian MIMO broadcast channels with common and confidential messages,” IEEE Trans. Inf. Theory, vol. 58, no. 9, pp. 5669–5680, 2012.
[4] S. A. A. Fakoorian and A. L. Swindlehurst, “On the optimality of linear precoding for secrecy in the MIMO broadcast channel,” IEEE J. Sel. Areas Commun., vol. 31, no. 9, pp. 1701–1713, 2013.
[5] D. Park, “Weighted sum rate maximization of MIMO broadcast and interference channels with confidential messages,” IEEE Trans. Wireless Commun., vol. 15, no. 3, pp. 1742–1753, 2015.
[6] Y. Qi and M. Vaezi, “Secure transmission in MIMO-NOMA networks,” IEEE Commun. Lett., vol. 24, no. 12, pp. 2696–2700, 2020.
[7] M. F. Hanif and Z. Ding, “Robust power allocation in MIMO-NOMA systems,” IEEE Wireless Commun. Lett., vol. 8, no. 6, pp. 1541–1545, 2019.
[8] D. Tse and P. Viswanath, Fundamentals of Wireless Communication. Cambridge University Press, 2005.
[9] C. Zhang, P. Patras, and H. Haddadi, “Deep learning in mobile and wireless networking: A survey,” IEEE Commun. Surveys Tuts., vol. 21, no. 3, pp. 2224–2287, 2019.
[10] X. Zhang and M. Vaezi, “Deep learning based precoding for the MIMO Gaussian wiretap channel,” in Proc. IEEE Global Commun. Conf. Workshops (GC Workshops), pp. 1–6, 2019.
[11] M. Gümüş and T. M. Duman, “Deep neural network based precoding for wiretap channels with finite alphabet inputs,” IEEE Wireless Commun. Lett., vol. 10, no. 8, pp. 1652–1656, 2021.
[12] R. Liu, T. Liu, H. V. Poor, and S. Shamai, “Multiple-input multiple-output Gaussian broadcast channels with confidential messages,” IEEE Trans. Inf. Theory, vol. 56, no. 9, pp. 4215–4227, 2010.
[13] E. Ekrem and S. Ulukus, “The secrecy capacity region of the Gaussian MIMO multi-receiver wiretap channel,” IEEE Trans. Inf. Theory, vol. 57, no. 4, pp. 2083–2114, 2011.
[14] Q. Li, M. Hong, H.-T. Wai, Y.-F. Liu, W. Ma, and Z.-Q. Luo, “Transmit solutions for MIMO wiretap channels using alternating optimization,” IEEE J. Sel. Areas Commun., vol. 31, no. 9, pp. 1714–1727, 2013.
[15] X. Zhang, Y. Qi, and M. Vaezi, “A rotation-based method for precoding in Gaussian MIMOME channels,” IEEE Trans. Commun., vol. 69, no. 2, pp. 1189–1200, 2021.
[16] B. Xu, N. Wang, T. Chen, and M. Li, “Empirical evaluation of rectified activations in convolutional network,” arXiv:1505.00853, 2015.
[17] C. M. Bishop, Neural Networks for Pattern Recognition. Oxford Univ. Press, 1995.
[18] Y. A. LeCun, L. Bottou, G. B. Orr, and K.-R. Müller, “Efficient backprop,” in Neural Networks: Tricks of the Trade, pp. 9–48, Springer, 2012.
