Learning to Select for MIMO Radar based on Hybrid Analog-Digital Beamforming
Abstract
In this paper, we propose an energy-efficient radar beampattern design framework for Millimeter Wave (mmWave) massive multi-input multi-output (mMIMO) systems, equipped with a hybrid analog-digital (HAD) beamforming structure. Aiming to reduce the power consumption and hardware cost of the mMIMO system, we employ a learning approach to synthesize the probing beampattern based on a small number of RF chains and antennas. By leveraging a combination of softmax neural networks, the proposed solution is able to achieve a desirable beampattern with high accuracy while incurring low cost.
Index Terms— Hybrid beamforming, radar beampattern design, learn to select, softmax selection
1 Introduction
Sensing is viewed as an essential feature in next-generation wireless communication applications [1], such as vehicular networks, WLAN indoor positioning, and unmanned aerial vehicle (UAV) networks [2]. Indeed, in all those scenarios, sensing and communication are a pair of intertwined functionalities, often required to be carried out simultaneously for the purpose of increasing the spectral efficiency and reducing costs.
In order to promote both high-throughput communication and high-accuracy sensing performance, millimeter Wave (mmWave) signaling and massive multi-input multi-output (mMIMO) have emerged as two promising approaches [3, 4, 5]. The large bandwidth available at the mmWave spectrum provides not only Moreover, the large-scale antenna array can compensate for the path-loss of the mmWave signals by formulating “pencil-like” beams towards the communication users. At the same time, a large-scale antenna array offers enhanced performance in terms of the angular resolution for radar sensing. However, fully-digital mMIMO systems require as many RF chains as antenna elements. This requirement translates into high power consumption and hardware cost, which limit the applicability of fully-digital mMIMO in a practical setting, especially when the antennas and RF chains are operated in the mmWave band.
The hybrid analog-digital (HAD) beamforming structure is a low-cost solution for tackling the above issues [6] while reaping the performance gains of both mMIMO and mmWave signalling. A HAD architecture comprises a small number of RF chains, which are connected to a large number of antennas through a network of phase shifters. While the HAD beamforming for communication has already been well-studied[7, 8, 9, 10, 11], its application towards radar sensing remains to be explored. To this end, previous research efforts have focused on the design of phased-MIMO radar, which was proposed as a tradeoff between the phased-array and MIMO radars [12]. However, due to the exponentially increasing complexity and energy consumption in terms of both antennas and RF chains, the state-of-the-art research on phased-MIMO radar is restricted to small-scale antenna arrays [12, 13, 14], and thus cannot take advantage of the mMIMO capabilities. To address this issue, it is necessary to exploit a limited number of RF chains and/or antennas instead of using the full HAD array. Again, to the best of our knowledge, the literature on antenna/RF chain selection for phased-MIMO/HAD radar is rather sparse.
To further reduce the cost and improve energy efficiency of the conventional phased-MIMO radar [12], we propose a novel approach for optimally selecting a small number of RF chains and/or antennas from a dense hybrid analog-digital array, along with optimally designing the phase shifter network matrix and the beamforming matrix, so that the corresponding probing beampattern is close to that of a fully populated HAD structure.
The optimization problem is solved by modifying the softmax learning approach learn to select (L2S) in [15] where the selection of antennas is modeled by softmax neural networks. The proposed L2S is effective in formulating any desirable radar beampattern and can scale to a large number of RF chains and/or antennas to select from which is crucial to massive array.
While machine learning for antenna selection has been investigated in [16, 10, 17], the problem in those works was treated as a classification problem. However, the combinatorial explosion problem renders those methods impractical even in cases with a moderate number of antennas. On the other hand,
L2S in [15] can be efficiently scaled to larger problems as it avoids the combinatorial explosion of the selection problem. It also offers a flexible array design framework as the selection problem can be easily formulated for any metric.
For clarity, we note here that [15] considers a sparse array design problem where only one selection matrix is considered. In contrast, the problem considered in this paper involves two selection matrices, i.e., a phase-shifter network matrix with unit modulus, and a beamforming matrix. Both matrices are parameters in the optimization problem, thus
providing more degrees of freedom for approximating the desired beampattern while incurring lower cost.
2 Problem formulation
Let us consider a massive MIMO system equipped with antennas and RF chains. The antennas formulate a uniform linear array, with spacing between adjacent antennas denoted by . In the fully digital MIMO system, we have , which large huge costs when a large number of antennas are needed, especially in the case of RF chains operating in the mmWave band. To tackle this issue, we consider a HAD structure, which employs a smaller number of RF chains, i.e., , and where each RF chain is connected to all antennas via a phase-shifter network.
The phase-shifter network between antennas and RF chains can be modeled as a matrix , where all the entries in have constant modulus, i.e., . We assume that the antennas transmit narrow-band signals with carrier wavelength . The array output at angle is
(1) |
where is the steering vector at direction , and is the transmit array snapshot at time . Let
(2) |
with a baseband precoding matrix, and a white signal vector with zero-mean and unit identity covariance matrix. The array output vector at different angles is
(3) | |||||
(4) |
with being the steering matrix. In order to achieve a lower power consumption and hardware cost, we want to select RF chains () and array antennas (), so that the HAD system approximates the desired power pattern at angles. To proceed, let us first select out of RF chains by multiplying with a selection matrix , where all the elements in are zero, except for exactly one element per row which is equal to one. The transmitted snapshot from the selected RF chains can be expressed as
(5) |
Each column in contains at most one element that is equal to one so that we do not choose the same RF chain twice.
In general, if we select RF chains of indices then the elements of will be for , and otherwise. Correspondingly, matrix will be an diagonal matrix where the diagonal entries are one if the corresponding RF chains are active and zero otherwise. Similarly, another selection matrix can be introduced to select out of antennas.
The output of the sparse array can be expressed as
(6) | |||||
(7) |
Let be the desirable signal power at direction , so that the desired beampattern vector is . The sparse array output power at is
(9) | |||||
The goal is to find the selection matrices , , the mapping matrix and the precoding matrix that minimize the beam-pattern error, i.e.,
s.t. | |||
where and is an identity matrix.
3 Softmax Co-design
We propose to use the learning approach in [15] for the co-design of , , and . Let the loss function be
(10) |
Each row of selection matrices and can be modeled by a separate softmax neural network [18]. Taking the RF chain selection matrix as an example, the outputs of the -th network will be
(11) |
where , are respectively the weights and biases, and is the input. Note that and
(12) |
Essentially, represents the probability that RF chain will be our -th selected RF chain.
Since the selection matrix does not depend on time , the input should be constant, and thus, the constant value can be merged into the bias term . Without loss of generality, such a model is equivalent to a softmax model with , where the only trainable parameters are the biases.
The approximate selection matrix, , is formed based on the outputs of all the softmax models as its rows. Clearly, will be a soft selection matrix since the values range between 0 and 1. By the end of the training, the matrix should converge very close to hard binary values so the approximation will be successful.
In order to formulate the cost function we individually express in terms of real and imaginary parts, to facilitate the machine learning optimization which is based on real numbers.
The average output power at angle is
(13) |
and the beampattern error with respect to is
(14) |
where is the importance weight assigned to the angle .
In order to achieve a realistic solution, the softmax models must produce hard binary values. The following constraint enforces this requirement:
(15) |
Indeed, (15) holds iff . The ‘if’ part of this statement is obvious. The ‘only if’ part comes readily from (12) since
implying that at most one element of can be equal to 1 and all other elements must be equal to 0. Combined with (12) this means that exactly one element of is equal to 1 and all other elements are equal to 0.
We also need to impose another constraint since the same RF chain or antenna can not be selected more than once, i.e.
If then the above constraint is equivalent to
(16) |
Combining (15) and (16) it follows that must be equal to the identity matrix . Based on the power gain error and the selection matrix structure above, we formulate the following loss function:
(17) |
where denotes the matrix Frobenius norm, and and are cost parameters which reflect the relative importance of the latter constraint with respect to the desired beam-pattern error.
3.1 Learning to select RF chains and antennas
There are four sets of parameters to be trained: (i) the biases to approximate the selection on RF chains , (ii) the biases to approximate the selection on antennas (assuming ), (iii) the covariance shaping matrix and (iv) the phase-shifter network matrix . We propose a four-stage optimization approach, by alternating between optimizing over one set of parameters and fixing others.
The algorithm runs for learning epochs and each alternating stage runs for a small number of steps . The proposed scheme is shown in Algorithm 1. One can improve the speed of convergence by using other optimizers instead of gradient descent. In the simulations shown next we used the Adam optimizer [19].
4 Simulation results
Here, we demonstrate the performance and flexibility of the proposed method. In all experiments, a flat weight is used, i.e., and the antennas are spaced by half of wavelength. We used the Adam stochastic optimization procedure with different learning rates and epochs of training. In each epoch steps are executed. The training data are i.i.d. jointly complex Gaussian with zero mean and variance . The length of input data should always be larger than the maximum number of and to ensure the functionality of the model.
Our first experiment is designed to select a small number of RF chains to reduce the system cost. The desirable beam power profile equals to over the angle ranges degrees and degrees, and is zero otherwise. The number of antennas is . The learning rate is set to , while the parameter , used in (17), changes between learning epochs; it starts from , and linearly increases to at the final epoch. The weights the importance of a proper selection matrix during the learning process.

Fig. 1, shows the designed beampattern, when selecting out of or RF chains, which are typical antenna numbers considered in mMIMO systems. One can see that the matching to the desirable beampattern is pretty good. For this example, classification-based machine learning methods would have to choose the best class out of classes, which is a task that would require a prohibitively long time to compute.
In the second experiment, three different selection choices are tested: (i) select antennas only, (ii) select RF chains only, and (iii) select both RF chains and antennas. There are antennas and RF chains, among which we select antennas or/and RF chains. The desirable beam power profile is equal to at angle ranges degrees and degrees, and is zero otherwise. The parameter used in this experiment ranges from to , while the learning rate is the same . The beampattern of the designed system is shown in Figure.2, where one can see that selecting RF chains only and using all antennas performs best, giving rise to the lowest sidelobes. Selecting both RF chains and antennas, (hybrid selection) has the worst performance. Since antennas are inexpensive, antenna cost savings are rather insignificant. On the other hand, reduction on RF chains in the HAD array can save more while maintaining good performance.
Running on 12GB memory GPU Titan X maxwell, selection on only antennas or RF chains for the above example took and minutes, respectively, while the selection of both took minutes.

5 Conclusion
We have proposed a novel beampattern design framework for MIMO radar by selecting the antennas and RF chains from a mMIMO HAD system. The proposed L2S method leverages softmax neural networks to approximate the selection matrices and optimizes the trainable parameters alternatively. Compared with classification method, the complexity of the softmax selection does not grow exponentially. Numerical results have been provided to validate the performance of the proposed approach, showing that the L2S method is able to achieve the desired beampatterns via selecting a limited number of antennas and RF chains from a dense HAD array. Future work will explore the problem of using L2S to minimize the number of antennas/RF chains subject to an error constraint.
References
- [1] Andre Bourdoux et al., “6G white paper on localization and sensing,” arXiv preprint arXiv:2006.01779, 2020.
- [2] F. Liu, C. Masouros, A. Petropulu, H. Griffiths, and L. Hanzo, “Joint radar and communication design: Applications, state-of-the-art, and the road ahead,” IEEE Trans. Commun., vol. 66, no. 6, pp. 3834–3862, June 2020.
- [3] T. S. Rappaport, S. Sun, R. Mayzus, H. Zhao, Y. Azar, K. Wang, G. N. Wong, J. K. Schulz, M. Samimi, and F. Gutierrez, “Millimeter wave mobile communications for 5G cellular: It will work!,” IEEE Access, vol. 1, pp. 335–349, May 2013.
- [4] R. W. Heath, N. González-Prelcic, S. Rangan, W. Roh, and A. M. Sayeed, “An overview of signal processing techniques for millimeter wave MIMO systems,” IEEE J. Sel. Topics Signal Process., vol. 10, no. 3, pp. 436–453, Apr 2016.
- [5] T. L. Marzetta, “Noncooperative cellular wireless with unlimited numbers of base station antennas,” IEEE Trans. Wireless Commun., vol. 9, no. 11, pp. 3590–3600, Nov 2010.
- [6] A. F. Molisch, V. V. Ratnam, S. Han, Z. Li, S. L. H. Nguyen, L. Li, and K. Haneda, “Hybrid beamforming for massive MIMO: A survey,” IEEE Commun. Mag., vol. 55, no. 9, pp. 134–141, Sept 2017.
- [7] X. Li and A. Alkhateeb, “Deep learning for direct hybrid precoding in millimeter wave massive MIMO systems,” in 2019 53rd Asilomar Conference on Signals, Systems, and Computers, 2019, pp. 800–805.
- [8] Hamed Hojatian, Jeremy Nadal, Jean-Francois Frigon, and Francois Leduc-Primeau, “Unsupervised deep learning for massive MIMO hybrid beamforming,” 2020.
- [9] J. Tao, J. Chen, J. Xing, S. Fu, and J. Xie, “Autoencoder neural network based intelligent hybrid beamforming design for mmWave massive MIMO systems,” IEEE Transactions on Cognitive Communications and Networking, vol. 6, no. 3, pp. 1019–1030, 2020.
- [10] A. M. Elbir and K. V. Mishra, “Joint antenna selection and hybrid beamformer design using unquantized and quantized deep learning networks,” IEEE Transactions on Wireless Communications, vol. 19, no. 3, pp. 1677–1688, 2020.
- [11] A. Kaushik, J. Thompson, E. Vlachos, C. Tsinos, and S. Chatzinotas, “Dynamic RF chain selection for energy efficient and low complexity hybrid beamforming in millimeter wave MIMO systems,” IEEE Transactions on Green Communications and Networking, vol. 3, no. 4, pp. 886–900, 2019.
- [12] A. Hassanien and S. A. Vorobyov, “Phased-MIMO radar: A tradeoff between phased-array and MIMO radars,” IEEE Trans. Signal Process., vol. 58, no. 6, pp. 3137–3151, Jun 2010.
- [13] D. Wilcox and M. Sellathurai, “On MIMO radar subarrayed transmit beamforming,” IEEE Trans. Signal Process., vol. 60, no. 4, pp. 2076–2081, 2012.
- [14] W. Wang, “Phased-MIMO radar with frequency diversity for range-dependent beamforming,” IEEE Sensors J., vol. 13, no. 4, pp. 1320–1328, 2013.
- [15] K. Diamantaras, Z. Xu, and A. Petropulu, “Sparse antenna array design for MIMO radar using softmax selection,” https://arxiv.org/abs/2102.05092, 2021.
- [16] J. Joung, “Machine learning-based antenna selection in wireless communications,” IEEE Communications Letters, vol. 20, no. 11, pp. 2241–2244, 2016.
- [17] T. X. Vu, L. Lei, S. Chatzinotas, and B. Ottersten, “Machine learning based antenna selection and power allocation in multi-user MISO systems,” in 2019 International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOPT), 2019, pp. 1–6.
- [18] Christopher M Bishop, Pattern recognition and machine learning, Springer, 2006.
- [19] Diederik P Kingma and Jimmy Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980, 2014.