This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Learning to Select for MIMO Radar based on Hybrid Analog-Digital Beamforming

Abstract

In this paper, we propose an energy-efficient radar beampattern design framework for Millimeter Wave (mmWave) massive multi-input multi-output (mMIMO) systems, equipped with a hybrid analog-digital (HAD) beamforming structure. Aiming to reduce the power consumption and hardware cost of the mMIMO system, we employ a learning approach to synthesize the probing beampattern based on a small number of RF chains and antennas. By leveraging a combination of softmax neural networks, the proposed solution is able to achieve a desirable beampattern with high accuracy while incurring low cost.

Index Terms—  Hybrid beamforming, radar beampattern design, learn to select, softmax selection

1 Introduction

Sensing is viewed as an essential feature in next-generation wireless communication applications [1], such as vehicular networks, WLAN indoor positioning, and unmanned aerial vehicle (UAV) networks [2]. Indeed, in all those scenarios, sensing and communication are a pair of intertwined functionalities, often required to be carried out simultaneously for the purpose of increasing the spectral efficiency and reducing costs.

In order to promote both high-throughput communication and high-accuracy sensing performance, millimeter Wave (mmWave) signaling and massive multi-input multi-output (mMIMO) have emerged as two promising approaches [3, 4, 5]. The large bandwidth available at the mmWave spectrum provides not only Moreover, the large-scale antenna array can compensate for the path-loss of the mmWave signals by formulating “pencil-like” beams towards the communication users. At the same time, a large-scale antenna array offers enhanced performance in terms of the angular resolution for radar sensing. However, fully-digital mMIMO systems require as many RF chains as antenna elements. This requirement translates into high power consumption and hardware cost, which limit the applicability of fully-digital mMIMO in a practical setting, especially when the antennas and RF chains are operated in the mmWave band.
The hybrid analog-digital (HAD) beamforming structure is a low-cost solution for tackling the above issues [6] while reaping the performance gains of both mMIMO and mmWave signalling. A HAD architecture comprises a small number of RF chains, which are connected to a large number of antennas through a network of phase shifters. While the HAD beamforming for communication has already been well-studied[7, 8, 9, 10, 11], its application towards radar sensing remains to be explored. To this end, previous research efforts have focused on the design of phased-MIMO radar, which was proposed as a tradeoff between the phased-array and MIMO radars [12]. However, due to the exponentially increasing complexity and energy consumption in terms of both antennas and RF chains, the state-of-the-art research on phased-MIMO radar is restricted to small-scale antenna arrays [12, 13, 14], and thus cannot take advantage of the mMIMO capabilities. To address this issue, it is necessary to exploit a limited number of RF chains and/or antennas instead of using the full HAD array. Again, to the best of our knowledge, the literature on antenna/RF chain selection for phased-MIMO/HAD radar is rather sparse.
To further reduce the cost and improve energy efficiency of the conventional phased-MIMO radar [12], we propose a novel approach for optimally selecting a small number of RF chains and/or antennas from a dense hybrid analog-digital array, along with optimally designing the phase shifter network matrix and the beamforming matrix, so that the corresponding probing beampattern is close to that of a fully populated HAD structure. The optimization problem is solved by modifying the softmax learning approach learn to select (L2S) in [15] where the selection of antennas is modeled by softmax neural networks. The proposed L2S is effective in formulating any desirable radar beampattern and can scale to a large number of RF chains and/or antennas to select from which is crucial to massive array.
While machine learning for antenna selection has been investigated in [16, 10, 17], the problem in those works was treated as a classification problem. However, the combinatorial explosion problem renders those methods impractical even in cases with a moderate number of antennas. On the other hand, L2S in [15] can be efficiently scaled to larger problems as it avoids the combinatorial explosion of the selection problem. It also offers a flexible array design framework as the selection problem can be easily formulated for any metric. For clarity, we note here that [15] considers a sparse array design problem where only one selection matrix is considered. In contrast, the problem considered in this paper involves two selection matrices, i.e., a phase-shifter network matrix with unit modulus, and a beamforming matrix. Both matrices are parameters in the optimization problem, thus providing more degrees of freedom for approximating the desired beampattern while incurring lower cost.

2 Problem formulation

Let us consider a massive MIMO system equipped with NtN_{t} antennas and NRFN_{RF} RF chains. The antennas formulate a uniform linear array, with spacing between adjacent antennas denoted by dd. In the fully digital MIMO system, we have NRF=NtN_{RF}=N_{t}, which large huge costs when a large number of antennas are needed, especially in the case of RF chains operating in the mmWave band. To tackle this issue, we consider a HAD structure, which employs a smaller number of RF chains, i.e., NRFNtN_{RF}\leq N_{t}, and where each RF chain is connected to all NtN_{t} antennas via a phase-shifter network.

The phase-shifter network between antennas and RF chains can be modeled as a matrix 𝐅RFNt×NRF{{\mathbf{F}}_{RF}}\in{\mathbb{C}^{N_{t}\times{N_{RF}}}}, where all the entries in 𝐅RF{{\mathbf{F}}_{RF}} have constant modulus, i.e., |𝐅RF(i,j)|=1,i,j\left|{{{\mathbf{F}}_{RF}}\left({i,j}\right)}\right|=1,\forall i,j. We assume that the antennas transmit narrow-band signals with carrier wavelength λ\lambda. The array output at angle θ\theta is

y(t;θ)=𝐚(θ)H𝐯(t),y(t;\theta)=\mathbf{a}(\theta)^{H}\mathbf{v}(t), (1)

where 𝐚(θ)\mathbf{a}(\theta) is the steering vector at direction θ\theta, and 𝐯(t)Nt\mathbf{v}(t)\in\mathbb{C}^{N_{t}} is the transmit array snapshot at time tt. Let

𝐯(t)=𝐅RF𝐐𝐞(t)\mathbf{v}(t)={\mathbf{F}_{RF}}\mathbf{Q}\mathbf{e}(t) (2)

with 𝐐NRF×NRF\mathbf{Q}\in\mathbb{C}^{N_{RF}\times N_{RF}} a baseband precoding matrix, and 𝐞(t)NRF×1\mathbf{e}(t)\in\mathbb{C}^{N_{RF}\times 1} a white signal vector with zero-mean and unit identity covariance matrix. The array output vector at KK different angles is

𝐲\displaystyle\mathbf{y} \displaystyle\triangleq [y(t;θ1),,y(t;θK)]T\displaystyle[y(t;\theta_{1}),\dots,y(t;\theta_{K})]^{T} (3)
=\displaystyle= 𝐀H𝐯(t)=𝐀H𝐅RF𝐐𝐞(t)\displaystyle\mathbf{A}^{H}\mathbf{v}(t)=\mathbf{A}^{H}{\mathbf{F}_{RF}}\mathbf{Q}\mathbf{e}(t) (4)

with 𝐀=[𝐚(θ1),,𝐚(θK)]Nt×K\mathbf{A}=[\mathbf{a}(\theta_{1}),\dots,\mathbf{a}(\theta_{K})]\in\mathbb{C}^{N_{t}\times K} being the steering matrix. In order to achieve a lower power consumption and hardware cost, we want to select MRFM_{RF} RF chains (MRF<NRFM_{RF}<N_{RF}) and MtM_{t} array antennas (Mt<NtM_{t}<N_{t}), so that the HAD system approximates the desired power pattern at KK angles. To proceed, let us first select MRFM_{RF} out of NRFN_{RF} RF chains by multiplying 𝐅RF{{\mathbf{F}}_{RF}} with a selection matrix 𝐒1MRF×NRF{{\mathbf{S}}_{1}}\in{\mathbb{C}^{{M_{RF}}\times N_{RF}}}, where all the elements in 𝐒1{\mathbf{S}_{1}} are zero, except for exactly one element per row which is equal to one. The transmitted snapshot from the selected RF chains can be expressed as

𝐯s(t)=𝐅RF𝐒1T𝐒1𝐐𝐞(t)\mathbf{v}_{s}(t)={\mathbf{F}_{RF}}{\mathbf{S}_{1}^{T}}{\mathbf{S}_{1}}\mathbf{Q}\mathbf{e}(t) (5)

Each column in 𝐒1\mathbf{S}_{1} contains at most one element that is equal to one so that we do not choose the same RF chain twice.

In general, if we select RF chains of indices l1,,lMRFl_{1},\dots,l_{M_{RF}} then the elements of 𝐒1{\mathbf{S}_{1}} will be s1ij=1s_{1_{ij}}=1 for li=jl_{i}=j, and 0 otherwise. Correspondingly, matrix 𝐒1T𝐒1{\mathbf{S}_{1}^{T}}{\mathbf{S}_{1}} will be an NRF×NRFN_{RF}\times N_{RF} diagonal matrix where the diagonal entries are one if the corresponding RF chains are active and zero otherwise. Similarly, another selection matrix 𝐒2Mt×Nt{\mathbf{S}_{2}}\in\mathbb{C}^{M_{t}\times N_{t}} can be introduced to select MtM_{t} out of NtN_{t} antennas.

The output of the sparse array can be expressed as

𝐲s(t)[ys(t;θ1),,ys(t;θK)]T\displaystyle\mathbf{y}_{s}(t)\triangleq[y_{s}(t;\theta_{1}),\dots,y_{s}(t;\theta_{K})]^{T} (6)
=\displaystyle= 𝐀H𝐒2T𝐒2𝐯s(t)=𝐀H𝐒2T𝐒2𝐅RF𝐒1T𝐒1𝐐𝐞(t)\displaystyle\mathbf{A}^{H}{\mathbf{S}_{2}^{T}}{\mathbf{S}_{2}}\mathbf{v}_{s}(t)=\mathbf{A}^{H}{\mathbf{S}_{2}^{T}}{\mathbf{S}_{2}}{\mathbf{F}_{RF}}{\mathbf{S}_{1}^{T}}{\mathbf{S}_{1}}\mathbf{Q}\mathbf{e}(t) (7)

Let pi=p(θi)p_{i}=p(\theta_{i}) be the desirable signal power at direction θi\theta_{i}, so that the desired beampattern vector is 𝐩=[p1,,pK]T\mathbf{p}=[p_{1},\dots,p_{K}]^{T}. The sparse array output power at θi\theta_{i} is

p^i\displaystyle\hat{p}_{i} =\displaystyle= 𝔼{ys(t;θi)ys(t;θi)}\displaystyle\mathbb{E}\{y_{s}(t;\theta_{i})^{*}y_{s}(t;\theta_{i})\} (9)
=\displaystyle= 𝐚H(θi)𝐒2T𝐒2𝐅RF𝐒1T𝐒1𝐐\displaystyle\mathbf{a}^{H}(\theta_{i}){\mathbf{S}_{2}^{T}}{\mathbf{S}_{2}}{\mathbf{F}_{RF}}{\mathbf{S}_{1}^{T}}{\mathbf{S}_{1}}\mathbf{Q}
×𝐐H𝐒1T𝐒1𝐅RFH𝐒2T𝐒2𝐚(θi)\displaystyle\times\mathbf{Q}^{H}{\mathbf{S}_{1}^{T}}{\mathbf{S}_{1}}{\mathbf{F}_{RF}^{H}}{\mathbf{S}_{2}^{T}}{\mathbf{S}_{2}}\mathbf{a}(\theta_{i})

The goal is to find the selection matrices 𝐒1{\mathbf{S}_{1}}, 𝐒2{\mathbf{S}_{2}}, the mapping matrix 𝐅RF{\mathbf{F}_{RF}} and the precoding matrix 𝐐\mathbf{Q} that minimize the beam-pattern error, i.e.,

min𝐒1,𝐒2,𝐅RF,𝐐\displaystyle\min_{{\mathbf{S}_{1}},{\mathbf{S}_{2}},{\mathbf{F}_{RF}},\mathbf{Q}} i=1K(pip^i)2\displaystyle\sum_{i=1}^{K}(p_{i}-\hat{p}_{i})^{2}
s.t. |𝐅RF(i,j)|2=1,i,j;\displaystyle|{\mathbf{F}_{RF}}(i,j)|^{2}=1,\forall i,j;
𝐒1𝐒1T=𝐈MRF;𝐒2𝐒2T=𝐈Mt\displaystyle{\mathbf{S}_{1}}{\mathbf{S}_{1}^{T}}=\mathbf{I}_{M_{RF}};\quad{\mathbf{S}_{2}}{\mathbf{S}_{2}^{T}}=\mathbf{I}_{M_{t}}

where i=1K(pip^i)2=𝐩diag{𝐀H𝐒2T𝐒2𝐅RF𝐒1T𝐒1×𝐐𝐐H𝐒1T𝐒1𝐅RFH𝐒2T𝐒2𝐀}2\sum_{i=1}^{K}(p_{i}-\hat{p}_{i})^{2}=\|\mathbf{p}-\text{diag}\{\mathbf{A}^{H}{\mathbf{S}_{2}^{T}}{\mathbf{S}_{2}}{\mathbf{F}_{RF}}{\mathbf{S}_{1}^{T}}{\mathbf{S}_{1}}\times\mathbf{Q}\mathbf{Q}^{H}{\mathbf{S}_{1}^{T}}{\mathbf{S}_{1}}{\mathbf{F}_{RF}^{H}}{\mathbf{S}_{2}^{T}}{\mathbf{S}_{2}}\mathbf{A}\}\|^{2} and 𝐈M\mathbf{I}_{M} is an M×MM\times M identity matrix.

3 Softmax Co-design

We propose to use the learning approach in [15] for the co-design of 𝐒1{\mathbf{S}_{1}}, 𝐒2{\mathbf{S}_{2}}, 𝐅RF{\mathbf{F}_{RF}} and 𝐐\mathbf{Q}. Let the loss function be

(𝐒1,𝐒2,𝐅RF,𝐐)=\displaystyle\mathcal{L}({\mathbf{S}_{1}},{\mathbf{S}_{2}},{\mathbf{F}_{RF}},\mathbf{Q})= 𝐩diag{𝐀H𝐒2T𝐒2𝐅RF𝐒1T𝐒1\displaystyle\|\mathbf{p}-\text{diag}\{\mathbf{A}^{H}{\mathbf{S}_{2}^{T}}{\mathbf{S}_{2}}{\mathbf{F}_{RF}}{\mathbf{S}_{1}^{T}}{\mathbf{S}_{1}}
×𝐐𝐐H𝐒1T𝐒1𝐅RFH𝐒2T𝐒2𝐀}2.\displaystyle\times\mathbf{Q}\mathbf{Q}^{H}{\mathbf{S}_{1}^{T}}{\mathbf{S}_{1}}{\mathbf{F}_{RF}^{H}}{\mathbf{S}_{2}^{T}}{\mathbf{S}_{2}}\mathbf{A}\}\|^{2}. (10)

Each row of selection matrices 𝐒1{\mathbf{S}_{1}} and 𝐒2{\mathbf{S}_{2}} can be modeled by a separate softmax neural network [18]. Taking the RF chain selection matrix 𝐒1{\mathbf{S}_{1}} as an example, the outputs of the mm-th network will be

sm,i=exp(𝐰iT𝐱+bi)j=1NRFexp(𝐰jT𝐱+bj),i=1,,NRFs_{m,i}=\frac{\exp(\mathbf{w}_{i}^{T}\mathbf{x}+b_{i})}{\sum_{j=1}^{N_{RF}}\exp(\mathbf{w}_{j}^{T}\mathbf{x}+b_{j})},~{}~{}~{}~{}i=1,\dots,N_{RF} (11)

where 𝐰i\mathbf{w}_{i}, bib_{i} are respectively the weights and biases, and 𝐱\mathbf{x} is the input. Note that 0sm,j10\leq s_{m,j}\leq 1 and

j=1NRFsm,j=1.\sum_{j=1}^{N_{RF}}s_{m,j}=1. (12)

Essentially, sm,is_{m,i} represents the probability that RF chain ii will be our mm-th selected RF chain.

Since the selection matrix does not depend on time tt, the input 𝐱\mathbf{x} should be constant, and thus, the constant value bi=𝐰iT𝐱b_{i}^{\prime}=\mathbf{w}_{i}^{T}\mathbf{x} can be merged into the bias term bib_{i}. Without loss of generality, such a model is equivalent to a softmax model with 𝐱=0\mathbf{x}=0, where the only trainable parameters are the biases.

The approximate selection matrix, 𝐒^1\mathbf{\hat{S}}_{1}, is formed based on the outputs 𝐬m=[sm,1,,sm,NRF]\mathbf{s}_{m}=[s_{m,1},\dots,s_{m,N_{RF}}] of all the softmax models as its rows. Clearly, 𝐒^1\mathbf{\hat{S}}_{1} will be a soft selection matrix since the values sm,is_{m,i} range between 0 and 1. By the end of the training, the matrix should converge very close to hard binary values so the approximation will be successful.

In order to formulate the cost function we individually express ys(t;θk)y_{s}(t;\theta_{k}) in terms of real and imaginary parts, to facilitate the machine learning optimization which is based on real numbers.

The average output power at angle θk\theta_{k} is

p~k\displaystyle\tilde{p}_{k} =\displaystyle= 1Tt=1Tys(t;θk)ys(t;θk)\displaystyle\frac{1}{T}\sum_{t=1}^{T}y_{s}^{*}(t;\theta_{k})y_{s}(t;\theta_{k}) (13)

and the beampattern error with respect to pkp_{k} is

~=k=1Kγk(pkp~k)2\mathcal{\tilde{L}}=\sum_{k=1}^{K}\gamma_{k}(p_{k}-\tilde{p}_{k})^{2} (14)

where γk\gamma_{k} is the importance weight assigned to the angle θk\theta_{k}.

In order to achieve a realistic solution, the softmax models must produce hard binary values. The following constraint enforces this requirement:

i=1sm,i2=1,m.\sum_{i=1}s_{m,i}^{2}=1,\forall m. (15)

Indeed, (15) holds iff smi{0,1}s_{mi}\in\{0,1\}. The ‘if’ part of this statement is obvious. The ‘only if’ part comes readily from (12) since

[i=1smi]2i=1smi2=02ijsmismj=0\displaystyle\Bigl{[}\sum_{i=1}s_{mi}\Bigr{]}^{2}-\sum_{i=1}s_{mi}^{2}=0\Rightarrow 2\sum_{i\neq j}s_{mi}s_{mj}=0

implying that at most one element of 𝐬m\mathbf{s}_{m} can be equal to 1 and all other elements must be equal to 0. Combined with (12) this means that exactly one element of 𝐬m\mathbf{s}_{m} is equal to 1 and all other elements are equal to 0.

We also need to impose another constraint since the same RF chain or antenna can not be selected more than once, i.e.

sm,i=1sn,i=0,nms_{m,i}=1\Rightarrow s_{n,i}=0,\forall n\neq m

If sm,i{0,1}s_{m,i}\in\{0,1\} then the above constraint is equivalent to

𝐬mT𝐬n=0.\mathbf{s}_{m}^{T}\mathbf{s}_{n}=0. (16)

Combining (15) and (16) it follows that 𝐒^1𝐒^1T\mathbf{\hat{S}}_{1}\mathbf{\hat{S}}_{1}^{T} must be equal to the identity matrix 𝐈MRF\mathbf{I}_{M_{RF}}. Based on the power gain error and the selection matrix structure above, we formulate the following loss function:

0(𝐛1,𝐛2,𝐅RF,𝐐)=~+α1𝐒^1𝐒^1T𝐈F2+α2𝐒^2𝐒^2T𝐈F2.\mathcal{L}_{0}(\mathbf{b}_{1},\mathbf{b}_{2},\mathbf{F}_{RF},\mathbf{Q})=\mathcal{\tilde{L}}+\alpha_{1}\|\mathbf{\hat{S}}_{1}\mathbf{\hat{S}}_{1}^{T}-\mathbf{I}\|_{F}^{2}+\alpha_{2}\|\mathbf{\hat{S}}_{2}\mathbf{\hat{S}}_{2}^{T}-\mathbf{I}\|_{F}^{2}. (17)

where F\|\cdot\|_{F} denotes the matrix Frobenius norm, and α1\alpha_{1} and α2\alpha_{2} are cost parameters which reflect the relative importance of the latter constraint with respect to the desired beam-pattern error.

3.1 Learning to select RF chains and antennas

There are four sets of parameters to be trained: (i) the biases 𝐛1\mathbf{b}_{1} to approximate the selection on RF chains , (ii) the biases 𝐛2\mathbf{b}_{2} to approximate the selection on antennas (assuming 𝐱=0\mathbf{x}=0), (iii) the covariance shaping matrix 𝐐\mathbf{Q} and (iv) the phase-shifter network matrix 𝐅RF{\mathbf{F}_{RF}}. We propose a four-stage optimization approach, by alternating between optimizing over one set of parameters and fixing others.

The algorithm runs for NepochN_{epoch} learning epochs and each alternating stage runs for a small number of steps NstepN_{step}. The proposed scheme is shown in Algorithm 1. One can improve the speed of convergence by using other optimizers instead of gradient descent. In the simulations shown next we used the Adam optimizer [19].

for epoch=11 to NepochsN_{epochs} do
       Fix 𝐐\mathbf{Q}, 𝐅RF{\mathbf{F}_{RF}}, 𝐛2\mathbf{b}_{2} and optimize 0\mathcal{L}_{0} w.r.t. 𝐛1\mathbf{b}_{1}:
       for step=11 to NstepsN_{steps} do
             Update 𝐛1\mathbf{b}_{1}
      Fix 𝐛1\mathbf{b}_{1}, 𝐛2\mathbf{b}_{2}, 𝐐\mathbf{Q} and optimize 0\mathcal{L}_{0} w.r.t. 𝐅RF{\mathbf{F}_{RF}}:
       for step=11 to NstepsN_{steps} do
             Update 𝐅RF{\mathbf{F}_{RF}}
            
      Fix 𝐅RF{\mathbf{F}_{RF}}, 𝐛1\mathbf{b}_{1}, 𝐐\mathbf{Q} and optimize 0\mathcal{L}_{0} w.r.t. 𝐛2\mathbf{b}_{2}:
       for step=11 to NstepsN_{steps} do
             Update 𝐛2\mathbf{b}_{2}
      Fix 𝐛1\mathbf{b}_{1}, 𝐛2\mathbf{b}_{2}, 𝐅RF{\mathbf{F}_{RF}} and optimize 0\mathcal{L}_{0} w.r.t. 𝐐\mathbf{Q}:
       for step=11 to NstepsN_{steps} do
             Update 𝐐\mathbf{Q}
            
      
Algorithm 1 Learn to select.

4 Simulation results

Here, we demonstrate the performance and flexibility of the proposed method. In all experiments, a flat weight is used, i.e., γk=1,k\gamma_{k}=1,\forall k and the antennas are spaced by half of wavelength. We used the Adam stochastic optimization procedure with different learning rates and Nepoch=400N_{epoch}=400 epochs of training. In each epoch Nstep=10N_{step}=10 steps are executed. The training data are i.i.d. jointly complex Gaussian with zero mean and variance 11. The length of input data TT should always be larger than the maximum number of MRFM_{RF} and MtM_{t} to ensure the functionality of the model.

Our first experiment is designed to select a small number of RF chains to reduce the system cost. The desirable beam power profile equals to 11 over the angle ranges [27,23][-27,-23] degrees and [28,32][28,32] degrees, and is zero otherwise. The number of antennas is Nt=128N_{t}=128. The learning rate is set to β=0.04\beta=0.04, while the parameter α1\alpha_{1}, used in (17), changes between learning epochs; it starts from αinit=3200\alpha_{init}=3200, and linearly increases to αfinal=16000\alpha_{final}=16000 at the final epoch. The α\alpha weights the importance of a proper selection matrix during the learning process.

Refer to caption
Fig. 1: Beampatterm based on selecting MRF=32M_{RF}=32 out of NRF=64N_{RF}=64 (blue), or 128128 (red) RF chains. Nt=128N_{t}=128.

Fig. 1, shows the designed beampattern, when selecting MRF=32M_{RF}=32 out of NRF=64N_{RF}=64 or NRF=128N_{RF}=128 RF chains, which are typical antenna numbers considered in mMIMO systems. One can see that the matching to the desirable beampattern is pretty good. For this example, classification-based machine learning methods would have to choose the best class out of (6432)>1.8×1018{{64}\choose{32}}>1.8\times 10^{18} classes, which is a task that would require a prohibitively long time to compute.

In the second experiment, three different selection choices are tested: (i) select antennas only, (ii) select RF chains only, and (iii) select both RF chains and antennas. There are Nt=64N_{t}=64 antennas and NRF=32N_{RF}=32 RF chains, among which we select Mt=32M_{t}=32 antennas or/and MRF=16M_{RF}=16 RF chains. The desirable beam power profile is equal to 11 at angle ranges [2,2][-2,2] degrees and [19,23][19,23] degrees, and is zero otherwise. The parameter α\alpha used in this experiment ranges from 320320 to 16001600, while the learning rate is the same β=0.02\beta=0.02. The beampattern of the designed system is shown in Figure.2, where one can see that selecting RF chains only and using all antennas performs best, giving rise to the lowest sidelobes. Selecting both RF chains and antennas, (hybrid selection) has the worst performance. Since antennas are inexpensive, antenna cost savings are rather insignificant. On the other hand, reduction on RF chains in the HAD array can save more while maintaining good performance.

Running on 12GB memory GPU Titan X maxwell, selection on only antennas or RF chains for the above example took 1717 and 1515 minutes, respectively, while the selection of both took 4747 minutes.

Refer to caption
Fig. 2: Selection in HAD array compared with selection in the full digital array.

5 Conclusion

We have proposed a novel beampattern design framework for MIMO radar by selecting the antennas and RF chains from a mMIMO HAD system. The proposed L2S method leverages softmax neural networks to approximate the selection matrices and optimizes the trainable parameters alternatively. Compared with classification method, the complexity of the softmax selection does not grow exponentially. Numerical results have been provided to validate the performance of the proposed approach, showing that the L2S method is able to achieve the desired beampatterns via selecting a limited number of antennas and RF chains from a dense HAD array. Future work will explore the problem of using L2S to minimize the number of antennas/RF chains subject to an error constraint.

References

  • [1] Andre Bourdoux et al., “6G white paper on localization and sensing,” arXiv preprint arXiv:2006.01779, 2020.
  • [2] F. Liu, C. Masouros, A. Petropulu, H. Griffiths, and L. Hanzo, “Joint radar and communication design: Applications, state-of-the-art, and the road ahead,” IEEE Trans. Commun., vol. 66, no. 6, pp. 3834–3862, June 2020.
  • [3] T. S. Rappaport, S. Sun, R. Mayzus, H. Zhao, Y. Azar, K. Wang, G. N. Wong, J. K. Schulz, M. Samimi, and F. Gutierrez, “Millimeter wave mobile communications for 5G cellular: It will work!,” IEEE Access, vol. 1, pp. 335–349, May 2013.
  • [4] R. W. Heath, N. González-Prelcic, S. Rangan, W. Roh, and A. M. Sayeed, “An overview of signal processing techniques for millimeter wave MIMO systems,” IEEE J. Sel. Topics Signal Process., vol. 10, no. 3, pp. 436–453, Apr 2016.
  • [5] T. L. Marzetta, “Noncooperative cellular wireless with unlimited numbers of base station antennas,” IEEE Trans. Wireless Commun., vol. 9, no. 11, pp. 3590–3600, Nov 2010.
  • [6] A. F. Molisch, V. V. Ratnam, S. Han, Z. Li, S. L. H. Nguyen, L. Li, and K. Haneda, “Hybrid beamforming for massive MIMO: A survey,” IEEE Commun. Mag., vol. 55, no. 9, pp. 134–141, Sept 2017.
  • [7] X. Li and A. Alkhateeb, “Deep learning for direct hybrid precoding in millimeter wave massive MIMO systems,” in 2019 53rd Asilomar Conference on Signals, Systems, and Computers, 2019, pp. 800–805.
  • [8] Hamed Hojatian, Jeremy Nadal, Jean-Francois Frigon, and Francois Leduc-Primeau, “Unsupervised deep learning for massive MIMO hybrid beamforming,” 2020.
  • [9] J. Tao, J. Chen, J. Xing, S. Fu, and J. Xie, “Autoencoder neural network based intelligent hybrid beamforming design for mmWave massive MIMO systems,” IEEE Transactions on Cognitive Communications and Networking, vol. 6, no. 3, pp. 1019–1030, 2020.
  • [10] A. M. Elbir and K. V. Mishra, “Joint antenna selection and hybrid beamformer design using unquantized and quantized deep learning networks,” IEEE Transactions on Wireless Communications, vol. 19, no. 3, pp. 1677–1688, 2020.
  • [11] A. Kaushik, J. Thompson, E. Vlachos, C. Tsinos, and S. Chatzinotas, “Dynamic RF chain selection for energy efficient and low complexity hybrid beamforming in millimeter wave MIMO systems,” IEEE Transactions on Green Communications and Networking, vol. 3, no. 4, pp. 886–900, 2019.
  • [12] A. Hassanien and S. A. Vorobyov, “Phased-MIMO radar: A tradeoff between phased-array and MIMO radars,” IEEE Trans. Signal Process., vol. 58, no. 6, pp. 3137–3151, Jun 2010.
  • [13] D. Wilcox and M. Sellathurai, “On MIMO radar subarrayed transmit beamforming,” IEEE Trans. Signal Process., vol. 60, no. 4, pp. 2076–2081, 2012.
  • [14] W. Wang, “Phased-MIMO radar with frequency diversity for range-dependent beamforming,” IEEE Sensors J., vol. 13, no. 4, pp. 1320–1328, 2013.
  • [15] K. Diamantaras, Z. Xu, and A. Petropulu, “Sparse antenna array design for MIMO radar using softmax selection,” https://arxiv.org/abs/2102.05092, 2021.
  • [16] J. Joung, “Machine learning-based antenna selection in wireless communications,” IEEE Communications Letters, vol. 20, no. 11, pp. 2241–2244, 2016.
  • [17] T. X. Vu, L. Lei, S. Chatzinotas, and B. Ottersten, “Machine learning based antenna selection and power allocation in multi-user MISO systems,” in 2019 International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOPT), 2019, pp. 1–6.
  • [18] Christopher M Bishop, Pattern recognition and machine learning, Springer, 2006.
  • [19] Diederik P Kingma and Jimmy Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980, 2014.